Pattern Learning

From Knowitall

Revision as of 19:56, 20 October 2011 by Schmmd (talk | contribs) (→‎Reducing the lemma grep results)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Jump to: navigation, search

Contents

1 Building the boostrapping data

Building the boostrapping data

Determining target relations

Restrict high quality set of ClueWeb extractions to have proper noun arguments
Choose the most frequent relations from this set

Determining target extractions

Measure the occurrence of the arguments
Keep extractions from the target relations that have arguments that occur commonly (100)

Reducing the lemma grep results

Remove patterns that occur less than 5 times.
Remove duplicate sentences.
Remove extractions that have an (extraction, pattern) pairs that occurs anomalously frequently.
1. There was a single one: (hotel reservation, be make, online) ocurred 32k times, the next one ocurred 8k times

Retrieved from "https://dada.cs.washington.edu/knowitall/wiki/index.php?title=Pattern_Learning&oldid=142"

Navigation menu