This is a seminar-style course devoted to recent research in statistical techniques for the automatic analysis of natural (human) language data. The instructor will give some lectures on the fundamentals and ask students to present topical papers. The topics covered in this course are: syntactic language models and large-scale distributed language models, search algorithms, statistical machine translation and non-parametric Bayesian models/processes (Dirichlet, Pitman-Yor, Indian-Buffet, etc) for natural language processing.
Time: Monday/Wednesday 8:00 pm - 9:15 pm; Location: Joshi Center 193
Shaojun Wang
428, Russ Engineering Center Building
shaojun.wang(at)wright.edu
(937) 775-5140
Office hours: Monday/Wednesday 2:00PM-3:30PM
D. Jurafksy and J. Martin.
Speech and Language Processing, 2nd Edtion
Prentice Hall, 2008.
F. Jelinek.
Statistical Methods for Speech Recognition
MIT Press, 1998.
X. Huang, A. Acero and H. Hon.
Spoken Language Processing: A Guide to Theory, Algorithm and System Development
Prentice Hall, 2001.
J. Pitman.
Combinatorial Stochastic Processes
Springer, 2006
Paper Presentations