I'm in the process of writing a textbook on the topic of using probabilistic models in scientific work on language ranging from experimental data analysis to corpus work to cognitive modeling. The intended audience is graduate students in linguistics, psychology, cognitive science, and computer science who are interested in using probabilistic models to study language. Feedback (both comments on existing drafts, and expressed desires for additional material to include!) is more than welcome -- send it to rlevy@ucsd.edu.
Note that if you access these chapters repeatedly, you may need to clear the cache of your web browser to ensure that you're getting the latest version.
A current (partial) draft of the complete textis available here.
Here are drafts of those individual chapters that are already available:
Appendices: