Difference between revisions of "Iarpa"
From Knowitall
(→Conversion of Docs to Text) |
(→TODO) |
||
Line 1: | Line 1: | ||
== TODO == | == TODO == | ||
+ | === June 15 === | ||
+ | * Compare Janara's work with nesty | ||
+ | * Remove WEKA from R2A2 | ||
+ | * Extend relational noun to plurals | ||
+ | |||
=== Engineering Tasks === | === Engineering Tasks === | ||
* Handle capitalized/allcaps/nocap sentences. | * Handle capitalized/allcaps/nocap sentences. |
Revision as of 01:24, 19 May 2011
Contents
TODO
June 15
- Compare Janara's work with nesty
- Remove WEKA from R2A2
- Extend relational noun to plurals
Engineering Tasks
- Handle capitalized/allcaps/nocap sentences.
- Oddity of data where person lastname is in parentheses.
- Handle geotagging and other oddities of the classified data.
- Address efficiency issues.
Research Tasks
- Pronoun Resolution
Tools
Speed
- Compile to native code.
- Compare NLP libraries.
Conversion of Docs to Text
- Detect headlines and handle separately
- Sentences with only single letters
- Detect bullet points