Resources
Related readings:
THe following handouts and write-ups are related to the course:
- Language Model (Michael Collins)
- HMM & Tagging (Michael Collins)
- Log linear model (Michael Collins)
- MEMM (Michael Collins)
- CRF (Michael Collins)
- Multi-class classification (Chih-Jen Lin)
- Neural Network (Yoav Goldberg)
Papers:
The ACL anthology has a large collection of NLP papers.
Software packages/ Libraries
The following is an incomplete list of software packages and libraries for developing NLP systems.
Python
There are many off-the-shelf NLP Python libraries are useful
Some machine learning systems may be helpful as well:
Java/Scala
Several state-of-the-art systems are built in Java
- Stanford coreNLP Pipline
- Illinois NLP packages, Demo
- Berkley NLP software
- Twitter NLP toolkit Some declarative learning libraries
- Wolfe
- Saul
- Dyna
C/C++
Large-scale learning libraries:
Suggested Papers (More papers will be added)
The following are suggested papers for presentations and writing critical review reports.
Language Models
- Yee Whye Teh, “A hierarchical Bayesian language model based on Pitman-Yor processes.” ACL 2006 (PDF).
- Yoshua Bengio, Rejean Ducharme, Pascal Vincent, Christian Jauvin, “A Neural Probability Language Model.” JMLR 2003 (PDF).
- Yoom Kim, Yacine Jernite, David Sontag, Alexander M. Rush, “Character-Aware Neural Language Models”. AAAI 2015 (PDF).
- Brian Roark, Murat Saraclar, Michael Collins, Mark Johnson, “Discriminative Language Modeling with Conditional Random Fields and the Perceptron Algorithm”. ACL 2004 (PDF).
Parsing
- Yoav Goldberg, Joakim Nivre, “Training Deterministic Parsers with Non-Deterministic Oracles.” TACL 2013 (PDF)
- Danqi Chen, Christopher D. Manning, “A Fast and Accurate Dependency Parser using Neural Networks.” EMNLP 2014 (PDF)
- Andre F. T. Martins, Noah A. Smith, Eric P. Xing. “Turbo Parsers: Dependency Parsing by Approximate Variational Inference” EMNLP 2010. (PDF)
- Yoav Goldberg, Michael Elhadad, “An Efficient Algorithm for Easy-First Non-Directional Dependency Parsing.” NAACL 2010 (PDF)
- Dan Klein, Christopher D. Manning, “Accurate Unlexicalized Parsing.” ACL 2003 (PDF)
Information Extraction
- T. Mitchell, W. Cohen, E. Hruschka, P. Talukdar, J. Betteridge, A. Carlson, B. Dalvi, M. Gardner, B. Kisiel, J. Krishnamurthy, N. Lao, K. Mazaitis, T. Mohamed, N. Nakashole, E. Platanios, A. Ritter, M. Samadi, B. Settles, R. Wang, D. Wijaya, A. Gupta, X. Chen, A. Saparov, M. Greaves, J. Welling. Never-Ending Language Learning. AAAI 2015.
- S. Riedel, L. Yao, B. M. Marlin and A. McCallum. “Relation Extraction with Matrix Factorization and Universal Schemas”. NAACL 2013. (PDF)
- S. Singh, T. Rocktaschel and S. Riedel. “Towards Combined Matrix and Tensor Factorization for Universal Schema Relation Extraction”. NAACL 2015. (PDF)
- Michele Banko, Michael J Cafarella, Stephen Soderland, Matt Broadhead and Oren Etzioni. “Open Information Extraction from the Web”. IJCAI 2007. (PDF)
- Mike Mintz, Steven Bills, Rion Snow, Dan Jurafsky. “Distant supervision for relation extraction without labeled data.” ACL 2009. (PDF)
- Panupong Pasupat, Percy Liang. “Zero-shot entity extraction from web pages.” ACL 2014. (PDF)
Question Answering / Semantic Parsing
- Jonathan Berant, Vivek Srikumar, Pei-Chun Chen, Abby Vander Linden, Brittany Harding, Brad Huang, Peter Clark and Christopher D. Manning. “Modeling Biological Processes for Reading Comprehension.” EMNLP 14 (PDF)
- Scott Wen-tau Yih, Ming-Wei Chang, Xiaodong He, Jianfeng Gao, “Semantic Parsing via Staged Query Graph Generation: Question Answering with Knowledge Base”. ACL 2015 (PDF)
- Percy Liang, Michael I. Jordan, Dan Klein, “Learning dependency-based compositional semantics.” ACL 2011. (PDF)
- Jonathan Berant, Percy Liang. “Imitation learning of agenda-based semantic parsers.” TACL 2015 (PDF)
Semantic Role Labeling
- Kristina Toutanova, Aria Haghigh, Christopher D. Manning. “Joint Learning Improves Semantic Role Labeling.” ACL 2005 (PDF)
- Vivek Srikumar, Dan Roth “A Joint Model for Extended Semantic Role Labeling”. ACL 2011(PDF)
- Daniel Gildea, Daniel Jurafsky, “Automatic Labeling of Semantic Roles”. ACL 2000(PDF)
Corefernece Resolution
- Karthik Raghunathan, Heeyoung Lee, Sudarshan Rangarajan, Nathanael Chambers, Mihai Surdeanu, Dan Jurafsky, Christopher Manning. “A Multi-Pass Sieve for Coreference Resolution”. EMNLP 2010 (PDF)
- Greg Durrett and Dan Klein. “Easy Victories and Uphill Battles in Coreference Resolution.” EMNLP 2013. (PDF)
- Kai-Wei Chang Rajhan Samdani, Dan Roth. “A Constrained Latent Variable Model for Coreference Resolution” EMNLP 2013. (PDF)
- Sam Wiseman, Alexander M. Rush, and Stuart M. Shieber. “Learning Global Features for Coreference Resolution” NAACL 2016. (PDF)
Name Entity Recognition / Entity Linking
- Alan Ritter, Sam Clark, Mausam and Oren Etzioni. “Named Entity Recognition in Tweets: An Experimental Study”. EMNLP 2011 (PDF)
- Lev Ratinov, Dan Roth. “Design Challenges and Misconceptions in Named Entity Recognition”. CoNLL 2009 (PDF)
- Andrea Moro, Alessandro Raganato, Roberto Navigli. “Entity Linking meets Word Sense Disambiguation: A Unified Approach”. TACL 2014 (PDF)
- Greg Durrett and Dan Klein. “A Joint Model for Entity Analysis: Coreference, Typing, and Linking”. TACL 2014 (PDF)
- Xiao Ling, Sameer Singh, and Daniel S. Weld. “Design Challenges for Entity Linking”. TACL 2015 (PDF)
- Parag Singla and Pedro Domingos. “Entity Resolution with Markov Logic”. ICDM 2006 (PDF)
Word Embeddings
- Jeffrey Pennington, Richard Socher and Christopher D. Manning. “Glove: Global Vectors for Word Representation”. EMNLP 2014. (PDF).
- Manaal Faruqui Jesse Dodge Sujay K. Jauhar, Chris Dyer Eduard Hovy Noah A. Smith. “Retrofitting Word Vectors to Semantic Lexicons”. NAACL 2015. (PDF)
- Omer Levy, Yoav Goldberg. “Neural Word Embedding as Implicit Matrix Factorization”. NIPS 2014. (PDF)
Machine Translation
- Philipp Koehn, Franz Josef Och, Daniel Marcu. “Statistical Phrase-Based Translation.” NAACL 2003. (PDF)
- Ilya Sutskever, Oriol Vinyals, Quoc V. Le. “Sequence to Sequence Learning with Neural Networks”. NIPS 2014.
- Dzmitry Bahdanau, KyungHyun Cho and Yoshua Bengio. “Neural machine translation by jointly learning to align and translate”. ICLR 2015. (PDF)
Word Alignment / Paragraph
- Regina Barzilay and Lillian Lee. “Learning to Paraphrase: An Unsupervised Approach Using Multiple-Sequence Alignment”. NAACL 2003. (PDF)
- Colin Bannard and Chris Callison-Burch. “Paraphrasing with Bilingual Parallel Corpora.”. ACL 2005. (PDF)
- Socher, R. and Huang, E.H., and Pennington, J. and Ng, A.Y., and Manning, C.D. “ Dynamic pooling and unfolding recursive autoencoders for paraphrase detection”. NIPS 2011 (PDF)
Others
- P. Liang, H. Daume, and D. Klein. “Structure Compilation: Trading Structure for Features”. ICML 2008. (PDF)
- Nate Kushman, Yoav Artzi, Luke Zettlemoyer, and Regina Barzilay. “Learning to Automatically Solve Algebra Word Problems”. ACL 2014 (PDF)
- Pedro Domingos, Matthew Richardson. “Markov Logic: A Unifying Framework for Statistical Relational Learning”