Vulcan/TextualEvidenceFinder

From Knowitall
Revision as of 17:09, 26 August 2013 by Niranjan (talk | contribs) (moved WeakEvidenceFinder to Vulcan/TextualEvidenceFinder: Name change.)

Jump to: navigation, search

I/O

Input: A Proposition [A natural language sentence + Open IE tuples from the sentence.]

Output: A list of query/score pairs representing evidence for the proposition.

Components

Textual Evidence Finder Details
System Architecture: Textual Evidence Finder

Query Generator

The query generator outputs two types of queries for each proposition:

  1. Keyword queries -- Extract keywords from the query sentence [TBD: Stemming? Stopword removal?]
  2. Template queries -- A template query is simply a tuple (or the sentence) where one or more words in the tuple is replaced with a wild-card operator.

The system will be given a set of rules that specify how to convert a tuple into different template queries. Start with two rules:

  1. Keyword queries -- Remove stopwords.
  2. Template queries -- Take each tuple. For each field (arg1, rel and arg2), if it is a multi-word query replace each word with a wild-card.
Examples

Input: 

      Sentence: Iron nail is a good conductor of electricity
        Tuples: (iron nail, is a good conductor of, electricity)

Output: 
      Q1: (iron *, is a good conductor of, electricity) //Template query
      Q2: (*  nail, is a good conductor of, electricity)//Template query
      Q3: (iron nail, is a * conductor of, electricity) //Template query
      Q4: iron * conductor * electricity                //Template query 
      Q5: iron or conductor or electricity              //Keyword query

Solr/Lucence Layer

A general outline of the process to build the solr index for Vulcan.

Vulcan extractions.jpg

To be filled in...


Index
  1. What corpora will be indexed? [Study guide, definitions and sentences covering glossary terms]
  2. What is the index structure? [Each tuple will be a Lucene document?]
  3. What is in a document? [Arg1, Arg1 Norm, Rel, Rel Norm, ...]
  4. ...
Search
  1. How to find tuples that match template queries? Use bag-of-words search and filter tuples that match filter?
  2. How can we use synonyms or other paraphrase resources?

Open IE 4.0

Use Open IE 4.0.