THe following handouts and write-ups are related to the course:

Papers:

The ACL anthology has a large collection of NLP papers.

Software packages/ Libraries

The following is an incomplete list of software packages and libraries for developing NLP systems.

Python

There are many off-the-shelf NLP Python libraries are useful

Some machine learning systems may be helpful as well:

Java/Scala

Several state-of-the-art systems are built in Java

Stanford coreNLP Pipline
Illinois NLP packages, Demo
Berkley NLP software
Twitter NLP toolkit Some declarative learning libraries
Wolfe
Saul
Dyna

C/C++

Large-scale learning libraries:

Liblinear
Vowpal Wabbit
CMU cnn library and its NLP applications

Suggested Papers (More papers will be added)

The following are suggested papers for presentations and writing critical review reports.

Language Models

Yee Whye Teh, “A hierarchical Bayesian language model based on Pitman-Yor processes.” ACL 2006 (PDF).
Yoshua Bengio, Rejean Ducharme, Pascal Vincent, Christian Jauvin, “A Neural Probability Language Model.” JMLR 2003 (PDF).
Yoom Kim, Yacine Jernite, David Sontag, Alexander M. Rush, “Character-Aware Neural Language Models”. AAAI 2015 (PDF).
Brian Roark, Murat Saraclar, Michael Collins, Mark Johnson, “Discriminative Language Modeling with Conditional Random Fields and the Perceptron Algorithm”. ACL 2004 (PDF).

Parsing

Yoav Goldberg, Joakim Nivre, “Training Deterministic Parsers with Non-Deterministic Oracles.” TACL 2013 (PDF)
Danqi Chen, Christopher D. Manning, “A Fast and Accurate Dependency Parser using Neural Networks.” EMNLP 2014 (PDF)
Andre F. T. Martins, Noah A. Smith, Eric P. Xing. “Turbo Parsers: Dependency Parsing by Approximate Variational Inference” EMNLP 2010. (PDF)
Yoav Goldberg, Michael Elhadad, “An Efficient Algorithm for Easy-First Non-Directional Dependency Parsing.” NAACL 2010 (PDF)
Dan Klein, Christopher D. Manning, “Accurate Unlexicalized Parsing.” ACL 2003 (PDF)

Information Extraction

T. Mitchell, W. Cohen, E. Hruschka, P. Talukdar, J. Betteridge, A. Carlson, B. Dalvi, M. Gardner, B. Kisiel, J. Krishnamurthy, N. Lao, K. Mazaitis, T. Mohamed, N. Nakashole, E. Platanios, A. Ritter, M. Samadi, B. Settles, R. Wang, D. Wijaya, A. Gupta, X. Chen, A. Saparov, M. Greaves, J. Welling. Never-Ending Language Learning. AAAI 2015.
S. Riedel, L. Yao, B. M. Marlin and A. McCallum. “Relation Extraction with Matrix Factorization and Universal Schemas”. NAACL 2013. (PDF)
S. Singh, T. Rocktaschel and S. Riedel. “Towards Combined Matrix and Tensor Factorization for Universal Schema Relation Extraction”. NAACL 2015. (PDF)
Michele Banko, Michael J Cafarella, Stephen Soderland, Matt Broadhead and Oren Etzioni. “Open Information Extraction from the Web”. IJCAI 2007. (PDF)
Mike Mintz, Steven Bills, Rion Snow, Dan Jurafsky. “Distant supervision for relation extraction without labeled data.” ACL 2009. (PDF)
Panupong Pasupat, Percy Liang. “Zero-shot entity extraction from web pages.” ACL 2014. (PDF)
Question Answering / Semantic Parsing
Jonathan Berant, Vivek Srikumar, Pei-Chun Chen, Abby Vander Linden, Brittany Harding, Brad Huang, Peter Clark and Christopher D. Manning. “Modeling Biological Processes for Reading Comprehension.” EMNLP 14 (PDF)
Scott Wen-tau Yih, Ming-Wei Chang, Xiaodong He, Jianfeng Gao, “Semantic Parsing via Staged Query Graph Generation: Question Answering with Knowledge Base”. ACL 2015 (PDF)
Percy Liang, Michael I. Jordan, Dan Klein, “Learning dependency-based compositional semantics.” ACL 2011. (PDF)
Jonathan Berant, Percy Liang. “Imitation learning of agenda-based semantic parsers.” TACL 2015 (PDF)

Semantic Role Labeling

Kristina Toutanova, Aria Haghigh, Christopher D. Manning. “Joint Learning Improves Semantic Role Labeling.” ACL 2005 (PDF)
Vivek Srikumar, Dan Roth “A Joint Model for Extended Semantic Role Labeling”. ACL 2011(PDF)
Daniel Gildea, Daniel Jurafsky, “Automatic Labeling of Semantic Roles”. ACL 2000(PDF)

Corefernece Resolution

Karthik Raghunathan, Heeyoung Lee, Sudarshan Rangarajan, Nathanael Chambers, Mihai Surdeanu, Dan Jurafsky, Christopher Manning. “A Multi-Pass Sieve for Coreference Resolution”. EMNLP 2010 (PDF)
Greg Durrett and Dan Klein. “Easy Victories and Uphill Battles in Coreference Resolution.” EMNLP 2013. (PDF)
Kai-Wei Chang Rajhan Samdani, Dan Roth. “A Constrained Latent Variable Model for Coreference Resolution” EMNLP 2013. (PDF)
Sam Wiseman, Alexander M. Rush, and Stuart M. Shieber. “Learning Global Features for Coreference Resolution” NAACL 2016. (PDF)

Name Entity Recognition / Entity Linking

Alan Ritter, Sam Clark, Mausam and Oren Etzioni. “Named Entity Recognition in Tweets: An Experimental Study”. EMNLP 2011 (PDF)
Lev Ratinov, Dan Roth. “Design Challenges and Misconceptions in Named Entity Recognition”. CoNLL 2009 (PDF)
Andrea Moro, Alessandro Raganato, Roberto Navigli. “Entity Linking meets Word Sense Disambiguation: A Unified Approach”. TACL 2014 (PDF)
Greg Durrett and Dan Klein. “A Joint Model for Entity Analysis: Coreference, Typing, and Linking”. TACL 2014 (PDF)
Xiao Ling, Sameer Singh, and Daniel S. Weld. “Design Challenges for Entity Linking”. TACL 2015 (PDF)
Parag Singla and Pedro Domingos. “Entity Resolution with Markov Logic”. ICDM 2006 (PDF)

Word Embeddings

Jeffrey Pennington, Richard Socher and Christopher D. Manning. “Glove: Global Vectors for Word Representation”. EMNLP 2014. (PDF).
Manaal Faruqui Jesse Dodge Sujay K. Jauhar, Chris Dyer Eduard Hovy Noah A. Smith. “Retrofitting Word Vectors to Semantic Lexicons”. NAACL 2015. (PDF)
Omer Levy, Yoav Goldberg. “Neural Word Embedding as Implicit Matrix Factorization”. NIPS 2014. (PDF)

Machine Translation

Philipp Koehn, Franz Josef Och, Daniel Marcu. “Statistical Phrase-Based Translation.” NAACL 2003. (PDF)
Ilya Sutskever, Oriol Vinyals, Quoc V. Le. “Sequence to Sequence Learning with Neural Networks”. NIPS 2014.
Dzmitry Bahdanau, KyungHyun Cho and Yoshua Bengio. “Neural machine translation by jointly learning to align and translate”. ICLR 2015. (PDF)

Word Alignment / Paragraph

Regina Barzilay and Lillian Lee. “Learning to Paraphrase: An Unsupervised Approach Using Multiple-Sequence Alignment”. NAACL 2003. (PDF)
Colin Bannard and Chris Callison-Burch. “Paraphrasing with Bilingual Parallel Corpora.”. ACL 2005. (PDF)
Socher, R. and Huang, E.H., and Pennington, J. and Ng, A.Y., and Manning, C.D. “ Dynamic pooling and unfolding recursive autoencoders for paraphrase detection”. NIPS 2011 (PDF)

Others

P. Liang, H. Daume, and D. Klein. “Structure Compilation: Trading Structure for Features”. ICML 2008. (PDF)
Nate Kushman, Yoav Artzi, Luke Zettlemoyer, and Regina Barzilay. “Learning to Automatically Solve Algebra Word Problems”. ACL 2014 (PDF)
Pedro Domingos, Matthew Richardson. “Markov Logic: A Unifying Framework for Statistical Relational Learning”

Related readings: