Document-level Open IE
From Knowitall
Goals
- Extend sentence-based Open IE extractors to incorporate document-level reasoning, such as:
  - Coreference
  - Entity Linking
  - NER
  - Rules implemented for TAC 2013 Entity Linking
- Define necessary data structures and interfaces by Oct-9
- End-to-end system evaluation by Nov-11
Work Log
10-17
Completed: Integrated sentence-level Open IE and Freebase Linker; test run OK.
Next Goals:
- Integrate best-mention finding rules.
  - First: Drop in code "as-is"
  - After: Factor out NER tagging and coref components
- Fix issues with tracking character offsets
  - Offsets are not properly computed for Open IE extractions
  - Find a good way to retrieve document metadata by character offset.
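
One possible way to approach the two offset items above, sketched in Python under stated assumptions: each sentence records the character offset at which it starts in the document, a sentence-relative span is shifted by that start to get document-level offsets, and a binary search over sentence starts maps a document offset back to its sentence. All names here (`SentenceSpan`, `OffsetIndex`, `to_document_span`) and the sample text are hypothetical and are not taken from the project code.

```python
from bisect import bisect_right
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class SentenceSpan:
    """A sentence plus the character offset where it starts in the document."""
    text: str
    start: int  # document-level offset of the first character

    @property
    def end(self) -> int:
        # Exclusive end offset in document coordinates.
        return self.start + len(self.text)


class OffsetIndex:
    """Looks up the sentence covering a document-level character offset."""

    def __init__(self, sentences: List[SentenceSpan]):
        # Sentences are assumed non-overlapping; sort by start for bisect.
        self._sentences = sorted(sentences, key=lambda s: s.start)
        self._starts = [s.start for s in self._sentences]

    def sentence_at(self, offset: int) -> Optional[SentenceSpan]:
        # Rightmost sentence whose start is <= offset.
        i = bisect_right(self._starts, offset) - 1
        if i < 0:
            return None
        candidate = self._sentences[i]
        # The offset may fall in whitespace between sentences.
        return candidate if offset < candidate.end else None


def to_document_span(sentence: SentenceSpan, local: Tuple[int, int]) -> Tuple[int, int]:
    """Shift a sentence-relative (start, end) span into document coordinates."""
    local_start, local_end = local
    if not (0 <= local_start <= local_end <= len(sentence.text)):
        raise ValueError("span lies outside the sentence")
    return sentence.start + local_start, sentence.start + local_end


# Made-up two-sentence document for illustration.
doc = "Obama visited Seattle. He gave a speech."
sentences = [
    SentenceSpan("Obama visited Seattle.", 0),
    SentenceSpan("He gave a speech.", 23),
]
index = OffsetIndex(sentences)

# "He" is at (0, 2) inside the second sentence.
start, end = to_document_span(sentences[1], (0, 2))
print(doc[start:end], index.sentence_at(start).text)
```

Keeping only the sentence start and deriving everything else from it means there is a single place where an offset error can enter, which may make the extraction-offset bug easier to isolate.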
10-9
Short-term goal: define necessary interfaces and data structures by 10-11
- Implemented interfaces for:
  - Document
  - Sentence
  - Extraction
  - Argument/Relation
  - Coreference Mention
  - Coreference Cluster
  - Entity Link
- Discussed interfaces at length with John and Michael
- Interfaces to be incorporated into generic NLP tool library (nlptools):
  - Document
  - Sentence
  - CorefResolver
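
To make the list of interfaces more concrete, here is a rough Python sketch of how they might fit together. Only the interface names come from the list above; every field, method, and helper (`resolve`, `best_mention`, `entity_id`, `FixedResolver`, `resolve_argument`) is an assumption for illustration and does not reflect the actual project or nlptools API.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Sentence:
    text: str
    offset: int  # assumed: start offset of the sentence in the document


@dataclass(frozen=True)
class Document:
    text: str
    sentences: Tuple[Sentence, ...]


@dataclass(frozen=True)
class ExtractionPart:
    """Argument or relation phrase; offsets are document-level, end exclusive."""
    text: str
    start: int
    end: int


@dataclass(frozen=True)
class Extraction:
    arg1: ExtractionPart
    rel: ExtractionPart
    arg2: ExtractionPart
    sentence: Sentence


@dataclass(frozen=True)
class Mention:
    text: str
    start: int
    end: int


@dataclass(frozen=True)
class CorefCluster:
    best_mention: Mention  # assumed: the most informative mention
    mentions: Tuple[Mention, ...]


@dataclass(frozen=True)
class EntityLink:
    mention: Mention
    entity_id: str  # assumed: some knowledge-base identifier
    score: float


class CorefResolver(ABC):
    @abstractmethod
    def resolve(self, document: Document) -> List[CorefCluster]:
        """Return the coreference clusters found in the document."""


class FixedResolver(CorefResolver):
    """Stand-in stub that returns preset clusters; not a real resolver."""

    def __init__(self, clusters: List[CorefCluster]):
        self._clusters = clusters

    def resolve(self, document: Document) -> List[CorefCluster]:
        return list(self._clusters)


def resolve_argument(part: ExtractionPart, clusters: List[CorefCluster]) -> str:
    """Replace an argument by its cluster's best mention when spans match exactly."""
    for cluster in clusters:
        for m in cluster.mentions:
            if (m.start, m.end) == (part.start, part.end):
                return cluster.best_mention.text
    return part.text


# Made-up example data.
s1 = Sentence("Obama visited Seattle.", 0)
s2 = Sentence("He gave a speech.", 23)
doc = Document("Obama visited Seattle. He gave a speech.", (s1, s2))
ext = Extraction(
    ExtractionPart("He", 23, 25),
    ExtractionPart("gave", 26, 30),
    ExtractionPart("a speech", 31, 39),
    s2,
)
obama, he = Mention("Obama", 0, 5), Mention("He", 23, 25)
clusters = FixedResolver([CorefCluster(obama, (obama, he))]).resolve(doc)
print(resolve_argument(ext.arg1, clusters), ext.rel.text, resolve_argument(ext.arg2, clusters))
```

Putting `CorefResolver` behind an abstract interface is one way a generic library could swap coreference backends without touching extraction code.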