Probabilistic Models in the Study of Language

I'm in the process of writing a textbook on the topic of using probabilistic models in scientific work on language ranging from experimental data analysis to corpus work to cognitive modeling. The intended audience is graduate students in linguistics, psychology, cognitive science, and computer science who are interested in using probabilistic models to study language. Feedback (both comments on existing drafts, and expressed desires for additional material to include!) is more than welcome -- send it to rlevy@ucsd.edu.

Note that if you access these chapters repeatedly, you may need to clear the cache of your web browser to ensure that you're getting the latest version.

A current (partial) draft of the complete textis available here.

Here are drafts of those individual chapters that are already available:

Introduction
Univariate Probability, with R code
Multivariate Probability, with R code
Parameter Estimation, with R code
Confidence Intervals and Hypothesis Testing, with R code
Generalized Linear Models, with R code
Interlude chapter (contents TBD)
Hierarchical Models (a.k.a. multi-level, mixed-effects models), with R code
Latent-Variable Models (partial draft), with R code
Nonparametric Models
Probabilistic Grammars

Appendices:

Appendix: Mathematical notation and review
Appendix: More probability Distributions
Appendix: A brief introduction to directed graphical models
A brief introduction to sampling techniques

Last modified: Wed Oct 3 12:03:18 PDT 2012