Code Resources

From Knowitall
Revision as of 19:33, 27 December 2010 by Schmmd

  • OpenNLP. A Java library with many NLP routines.
  • Apache Tika. A library for extracting text from various file formats (PDF, DOC, ...).
  • Stanford NER. A named-entity recognition library. Will and Stephen prefer it over the Illinois named-entity tagger.