Ling 3000Q/5000:

Note: This schedule is subject to change, but not without notice. Any changes will be announced in class and reflected on these pages. Be sure to check regularly.

Dates | Topics | Methods | Readings
---|---|---|---
Jan 17, 19 [W,F] | Introduction; Computers in linguistics and Natural Language Processing; The nature and use of text corpora | Some programming basics | Jurafsky & Martin Ch. 1
Jan 22 – 26 [1 week] | Regular expressions and regular languages; Pattern matching; Corpus search, concordances, counting; Finite-state automata | File handling, control structures; Regular expressions; Some set theory | J&M Ch. 2
Jan 29 – Feb 9 [2 weeks] | N-grams; Language modeling; Smoothing; Evaluation | Probability theory; Object-oriented programming | J&M 3 Ch. 3
Feb 12 – 23 [2 weeks] | Sequence classification; Part-of-Speech Tagging; Word classes and tagsets; Rule-based and stochastic POS tagging; Hidden Markov Models; Evaluation | More Python | J&M 3 Ch. 8
Feb 26 – Mar 1 [1 week] | Context-free languages; Syntactic ambiguity; Context-free grammars; Push-down automata; Chomsky Hierarchy | Language theory | J&M 3 Ch. 17
Mar 4 – 8, 18 – 22 [2 weeks] | Parsing; Search and dynamic programming; Chart parsing (CKY, Earley); Chunking; Evaluation | More Python | J&M 2 Ch. 13 [J&M 3 Ch. 18]
Mar 25 – 29 [1 week] | Sentiment classification; Bag of words; Naive Bayes | More probability | J&M 3 Ch. 4
Apr 1 – 19 [3 weeks] | Computational lexical semantics; Measures of collocational strength; (Un)supervised word sense disambiguation; Classifiers; Vector-space semantics | NLTK, infomap | J&M 3 Ch. 6 [J&M Ch. 20]
Apr 22 – 26 | Loose ends, advanced topics | |
Apr 29 – May 3 | Finals Week | |