Code Resources
From Knowitall
Latest revision as of 19:33, 27 December 2010
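- OpenNLP (http://opennlp.sourceforge.net/). A Java library with many NLP routines.
- Apache Tika (http://tika.apache.org/). A library for extracting text from various file formats (PDF, DOC, ...).
- Stanford NER (http://nlp.stanford.edu/ner/index.shtml). A named-entity extraction library. Preferred by Will and Stephen over the Illinois named-entity tagger.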