|
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Ling 3000Q/5000: |
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Note: This schedule is subject to change, but not without notice. Any changes will be announced in class and reflected on these pages. Be sure to check regularly. |
| Dates | Topics | Methods | Readings |
|---|---|---|---|
| Jan 21 [1 mtg] |
Introduction Syllabus, logistics Computers in linguistics and Natural Language Processing The nature and use of text corpora |
Some programming refreshers | Jurafsky & Martin 3rd edition Ch. 1 |
| Jan 23-Feb 2 [5 mtgs] |
Texts, words, strings Encoding Tokenization Pattern matching Corpus search, concordances, counting Minimum Edit Distance |
File handling, control structures Regular expressions Some set theory |
J&M 3 Ch. 2 |
| Feb 4-11 [4 mtgs] |
N-grams Language modeling Smoothing Evaluation |
Probability Dynamic programming Python: classes |
J&M 3 Ch. 3.1-6 |
| Feb 13-20 [4 mtgs] |
Sequence classification Part-of-Speech Tagging Word classes and tagsets Rule-based and stochastic POS tagging Hidden Markov Models Evaluation |
Python: numpy | J&M 3 Ch. 17.1-4 |
| Feb 23-Mar 2 [4 mtgs] |
Classification I Bag of words Naive Bayes |
J&M 3 Ch. B.1-8 | |
| Mar 4-13 [5 mtgs] |
Classification II Logistic regression |
Python: torch | J&M 3 Ch. 4.1-10 |
| Mar 16-20 | Spring break; no class | ||
| Mar 23-Apr 1 [5 mtgs] |
Embeddings Lexical Semantics Collocational strength Vector spaces Latent Semantic Analysis Word2Vec | Python: numpy | J&M 3 Ch. 5 |
| Apr 1-6 | Kaufmann traveling; no class | ||
| Apr 8-15 [4 mtgs] |
Neural Networks Perceptron Hidden layers |
Python: torch | J&M 3 Ch. 6 |
| Apr 17 | Kaufmann traveling; no class | ||
| Apr 20-May 1 [6 mtgs] |
Parsing Context-free grammars Normal forms Chart parsing |
Formal Language Theory | J&M 3 Ch. 18 |
| May 4 – May 10 | Finals Week | ||