This page contains the data and code associated with the paper:
Felix Muzny, Michael Fang, Angel X. Chang and Dan Jurafsky. 2017. A Two-stage Sieve Approach to Quote Attribution. In Proceedings of the European Chapter of the Association for Computational Linguistics (EACL), Valencia, Spain. bib
We release our quote/mention annotations for the texts of Pride and Prejudice, Emma, and The Steppe.
Full texts (xml): Pride and Prejudice Emma The Steppe
Split by chapter (zip of xmls): Pride and Prejudice Emma The Steppe
Test data for Pride & Prejudice (xml): Pride and Prejudice Test Set
Our quote attribution code is released as part of Stanford CoreNLP. The QuoteAttributionAnnotator documentation can be found here.
Our annotator tool is located at https://github.com/muzny/quoteannotator. More information is located on that page about how to setup and run the tool.