Home  |   Course Info  |  Schedule  |   References  |   HuskyCT 

Ling 3000Q/5000:
Introduction to Computational Linguistics

Schedule

Note: This schedule is subject to change, but not without notice. Any changes will be announced in class and reflected on these pages. Be sure to check regularly.


Dates Topics Methods Readings

Jan 17, 19
[W,F]
Introduction
Computers in linguistics and Natural Language Processing
The nature and use of text corpora
Some programming basics Jurafsky & Martin Ch. 1
Jan 22, – 26
[1 week]
Regular expressions and regular languages
Pattern matching
Corpus search, concordances, counting
Finite-state automata
File handling, control structures
Regular expressions
Some set theory
J&M Ch. 2
Jan 29 – Feb 9
[2 weeks]
N-grams
Language modeling
Smoothing
Evaluation
Probability theory
Object-oriented programming
J&M 3 Ch. 3
Feb 12 – 23
[2 weeks]
Sequence classification
Part-of-Speech Tagging
Word classes and tagsets
Rule-based and stochastic POS tagging
Hidden Markov Models
Evaluation
More Python J&M 3 Ch. 8
Feb 26 – Mar 1
[1 week]
Context-free languages
Syntactic ambiguity
Context-free grammars
Push-down automata
Chomsky Hierarchy
Language theory J&M 3 Ch. 17
Mar 4 – 8, 18 – 22
[2 weeks]
Parsing
Search and dynamic programming
Chart parsing (CKY, Earley)
Chunking
Evaluation
More Python J&M 2 Ch. 13
[J&M 3 Ch. 18]
Mar 25 – 29
[1 week]
Sentiment classification
Bag of words
Naive Bayes
More probability J&M 3 Ch. 4
Apr 1 – 19
[3 weeks]
Computational lexical semantics
Measures of collocational strength
(Un)supervised word sense disambiguation
Classifiers
Vector-space semantics
NLTK, infomap J&M 3 Ch. 6
[J&M Ch. 20]
Apr 22 – 26 Loose ends, advanced topics
Apr 29 – May 3 Finals Week

Last updated: January 17, 2024