IARPA/Pattern

From Knowitall
Jump to: navigation, search

Regex Expressions:

  • alternation: |
  • option: ?
  • Kleene-star: *
  • plus: +
  • start assertion: ^
  • end assertion: $
  • matching group: ()
  • non-matching group: (?:)
  • named group: (<name>:)

Token Expressions:

  • string: takes a case-insensitive regular expression
  • stringcs: take a case-sensitive regular expression
  • lemma: take a case-insensitive regular expression for the lemma
  • pos: takes a case-insensitive regular expression for the pos tag
  • chunk: takes a case-insensitive regular expression for the chunk tag
  • type: takes a case-insensitive string for any type that spans the token

Examples:

 <string="an?|the">? <pos="JJ">* <pos="NNP">+ <pos="NN">+ <pos="NNP>+
 The incredible U.S. president Barack Obama
 famed UW professor Oren Etzioni
 <pos="NNP">+ <stringcs="president">+ <pos="NNP>+
 U.S. president Barack Obama
 not: U.S. President Barack Obama