|
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
LING 3000Q/5000: |
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Note: This schedule is subject to change, but not without notice. Any changes will be announced in class and reflected on these pages. Be sure to check regularly. |
| Dates | Topics | Methods | Readings |
|---|---|---|---|
| Sep 1 [1 mtg] |
Introduction Syllabus, logistics Computers in linguistics and Natural Language Processing The nature and use of text corpora |
Some programming refreshers | |
| Sep 3-10 [3 mtgs] |
Texts, words, strings Encoding Tokenization Pattern matching Corpus search, concordances, counting Minimum Edit Distance |
File handling, control structures Regular expressions Some set theory |
J&M 3 Ch. 2 |
| Sep 15-22 [3 mtgs] |
N-grams Language modeling Smoothing Evaluation |
Probability Dynamic programming Python: classes |
J&M 3 Ch. 3 |
| Sep 24-Oct 1 [3 mtgs] |
Classification I Bag of words Naive Bayes |
J&M 3 Ch. B | |
| Oct 6-13 [3 mtgs] |
Classification II Logistic regression |
Python: torch | J&M 3 Ch. 4; R App. A |
| Oct 15-22, 27* [5 mtgs] * = online no class 10/29 |
Embeddings Lexical Semantics Collocational strength Vector spaces Latent Semantic Analysis Word2Vec | Python: torch | J&M 3 Ch. 5 |
| Nov 3*, 10 [2 mtgs] * = online no class 11/05 |
Neural Networks Perceptron Hidden layers |
Python: torch | J&M 3 Ch. 6 |
| Nov 12-19 [3 mtgs] |
Large Language Models I Attention |
R Ch. 2,3 | |
| Nov 24-26 | Thanksgiving Break; no class | ||
| Dec 1-10 [4 mtgs] |
Large Language Models II Transformers Training and fine-tuning |
R Ch. 4 | |
| Dec 14-20 | Finals Week | ||