101413Notes

From Knowitall
Jump to: navigation, search

Log

October 17 2013
Met with Mitchell and Xiao and decided upon a new preprocessing module that occurs before Distant Supervision in the pipeline.
Preprocessing will store all the relevant information at the sentence level for downstream processing (Distant Supervision and Feature Generation)
If a user wants to use a different corpus then preprocessing will have to be run again otherwise the cached information will speed up the pipeline and researchers can focus on customizing different modules like entity identification and feature generation.