NewMultir Aggregate Comparison

From Knowitall
Jump to: navigation, search
Results Comparison

After implementing the new distant supervision / feature generation interface and generating the data needed for the Multir algorithm. I ran the aggregate scoring algorithm to validate the current performance by comparing it with the old performance. The relations that the model was trained and tested on remained the same, but there were differences in the corpora, feature generation, knowledge base representation. It is hard for be to determine why the results are so different, but it likely has to do with the distribution of relation instances across the two tests.


Aggregate Scorer

The aggregate scorer is different from the sentential scorer. The aggregate scorer takes all of the test data and runs the extraction inference algorithm on each instance and then checks the labels on the instances of the test data to see if any of the labels match the extraction output at test time. This is more forgiving than the sentential scorer since a pair of entities may have several relations. On the other hand a test-time extraction can only be marked correct if it was labelled as such in the test data, which means it has to be part of the knowledge base.


Aggregate Extraction Precision/Recall Table at Highest Recall Level
Algorithm Precision Recall
Mihai's Reimplementation .328 .183
Original Multir .372 .180
New Multir .494 .324


Original Multir Riedel-Training Data Profile
Relation Count Percentage
NA 91373 76%
/location/location/contains 13164 10.9%
/location/neighborhood/neighborhood_of 2966 2.5%
Total 120290 100%


Original Multir Riedel-Test Data Profile
Relation Count Percentage
NA 166004 97%
/location/location/contains 2085 1.2%
/location/neighborhood/neighborhood_of 68 .03%
Total 171360 100%


New Multir Training Data Profile
Relation Count Percentage
/people/person/nationality 258695 40.7%
NA 256294 40.3%
/location/location/contains 74777 11.8%
/people/person/place_of_birth 13384 2%
Total 635999 100%


New Multir Test Data Profile
Relation Count Percentage
NA 44889 56%
/location/location/contains 23018 28.8%
/people/person/nationality 6434 8%
/people/person/place_of_birth 2249 2.8%
Total 80000 100%