Iarpa
From Knowitall
Contents
Domain Recognizers
Rule Learning
Quality of Extractions
- Too short relations (2 characters)
- Too long relations (7 words)
- some args are just special symbols (e.g., ")
- Pronoun Resolution
Speed
- Compile to native code.
- Compare NLP libraries.
Conversion of Docs to Text
- Detect headlines and handle separately
- Sentences with only single letters
- Detect bullet points