Rule Learner/Overview

From Knowitall
Revision as of 20:32, 20 April 2011 by Schmmd

Creating annotated sentence files

The input is an XML file with the sentences and an annotations file. For each Annotation in the annotations file:

  1. Find the Sentence that matches the annotation's sentence.
  2. Add that annotation to the corresponding AnnotatedSentence.

Output the AnnotatedSentences to an XML file using toXmlElement.
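
The matching and output steps above can be sketched roughly as follows. This is a minimal Python illustration, not the project's actual code: the field names (sentence, argument), the exact-text matching, the to_xml_element method (a stand-in for toXmlElement), and the XML element names are all assumptions made for the example.

```python
from dataclasses import dataclass, field
import xml.etree.ElementTree as ET


@dataclass
class Annotation:
    sentence: str   # text of the sentence the annotation refers to (assumed field)
    argument: str   # annotated argument text (assumed field)


@dataclass
class AnnotatedSentence:
    text: str
    annotations: list = field(default_factory=list)

    def to_xml_element(self):
        # Hypothetical analogue of toXmlElement; element names are invented.
        elem = ET.Element("sentence")
        ET.SubElement(elem, "text").text = self.text
        for ann in self.annotations:
            ET.SubElement(elem, "annotation").text = ann.argument
        return elem


def annotate(sentences, annotations):
    """Attach each annotation to the sentence whose text it matches exactly."""
    by_text = {s: AnnotatedSentence(s) for s in sentences}
    for ann in annotations:
        match = by_text.get(ann.sentence)
        if match is not None:  # annotations without a matching sentence are skipped
            match.annotations.append(ann)
    return list(by_text.values())


def to_xml(annotated):
    """Serialize the annotated sentences under a single root element."""
    root = ET.Element("sentences")
    for sentence in annotated:
        root.append(sentence.to_xml_element())
    return ET.tostring(root, encoding="unicode")
```

For example, annotate(["Paris is in France."], [Annotation("Paris is in France.", "France")]) returns one AnnotatedSentence carrying one annotation. Keying on the exact sentence text keeps the lookup simple but merges duplicate sentences; a real implementation may match differently.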

Creating base rules

For each AnnotatedSentence:

  1. For each extraction (tuple) in the AnnotatedSentence:
    1. Make constraints that capture the annotation's argument from the extraction. A base rule must have at least one argument constraint found in arg1, pred, or arg2 of the extraction. There may be multiple types that capture the annotation's argument, so there may be multiple possible base rules for the same extraction. If all the text in an extraction part (arg1, pred, or arg2) matches the Annotation's argument, use a PartConstraint.
    2. Add all possible additional constraints.
      1. Add all type (class and NER) constraints from arg1, pred, and arg2 of the tuple. Add a term constraint if the text under the type matches the annotation.
      2. Add term constraints for words in arg1, pred, and arg2 whose part-of-speech tag is one of { IN, TO, POS }.
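
The constraint-gathering steps above might look roughly like the sketch below. It is only an illustration under stated assumptions: an extraction is represented as a dict mapping each part to a list of (word, tag) pairs, types as a dict mapping each part to (type name, covered text) pairs, and the Constraint class with its "part", "type", and "term" labels is hypothetical. "Matches" is read here as exact string equality with the annotation's argument.

```python
from dataclasses import dataclass

FUNCTION_TAGS = {"IN", "TO", "POS"}   # POS tags that yield term constraints
PARTS = ("arg1", "pred", "arg2")


@dataclass(frozen=True)
class Constraint:
    kind: str    # "part", "type", or "term" (labels invented for this sketch)
    part: str    # "arg1", "pred", or "arg2"
    value: str


def argument_constraints(extraction, types, argument):
    """Constraints that capture the annotation's argument (step 1.1)."""
    found = []
    for part in PARTS:
        words = [word for word, _ in extraction[part]]
        if " ".join(words) == argument:
            # The whole part is the argument, so use a part constraint.
            found.append(Constraint("part", part, argument))
        for type_name, text in types.get(part, []):
            if text == argument:
                found.append(Constraint("type", part, type_name))
    return found


def additional_constraints(extraction, types, argument):
    """All further type and term constraints (step 1.2), without duplicates."""
    extra = []

    def add(constraint):
        if constraint not in extra:
            extra.append(constraint)

    for part in PARTS:
        for type_name, text in types.get(part, []):
            add(Constraint("type", part, type_name))
            if text == argument:
                add(Constraint("term", part, text))
        for word, tag in extraction[part]:
            if tag in FUNCTION_TAGS:
                add(Constraint("term", part, word))
    return extra
```

Each constraint returned by argument_constraints could seed one candidate base rule, which reflects the remark that several types capturing the same argument lead to several possible base rules for one extraction.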