Difference between revisions of "Pattern Learning"
From Knowitall
(Created page with "= Building the boostrapping data = == Determining target relations == # Restrict high quality set of ClueWeb extractions to have proper noun arguments # Choose the most freque...") |
(No difference)
|
Revision as of 00:31, 20 October 2011
Contents
Building the boostrapping data
Determining target relations
- Restrict high quality set of ClueWeb extractions to have proper noun arguments
- Choose the most frequent relations from this set
Determining target extractions
- Measure the occurrence of the arguments
- Keep extractions from the target relations that have arguments that occur commonly (100)
Reducing the lemma grep results
- Remove duplicate sentences.
- Remove extractions that occur anomalously frequently.