Back to Compromise

README

plugins/wikipedia/README.md

14.15.01.4 KB
Original Source
<div align="center"> <div>Efficient Named-entity Recognition for <a href="https://github.com/spencermountain/compromise/">compromise</a></div> <!-- npm version --> <a href="https://npmjs.org/package/compromise-wikipedia"> </a> <!-- file size --> <a href="https://unpkg.com/compromise-wikipedia/builds/compromise-wikipedia.min.js"> </a> <div align="center"> <code>npm install compromise-wikipedia</code> </div> </div> <!-- spacer -->

This experimental plugin provides a highly-compressed list of ~38 thousand popular wikipedia articles, and efficiently scans a text for them.

The plugin is approximately 300kb, minified. It does not include wikipedia's redirects (which it should!).

It is intended as a proof-of-concept, for compressing a large lexicon onto the client-side.

Wikipedia

js
import plg from 'compromise-wikipedia'
nlp.extend(plg)

let doc = nlp('You could always go to McGill, the Harvard of Canada!')
let m = doc.wikipedia()
m.json()
// [...]

MIT