Computational Psycholinguistics, Spring 2019

1 Course information

Lecture TimesMondays & Wednesdays 9:30-11:00am
Lecture Location46-3310
Class websitehttps://learning-modules.mit.edu/class/index.html?uuid=/course/9/sp19/9.19#info
Syllabushttp://www.mit.edu/~rplevy/teaching/2019spring/9.19

2 Instructor information

InstructorRoger Levy (rplevy@mit.edu)
Instructor's office46-3033
Instructor's office hoursThursdays 9-11am (subject to change)
Teaching AssistantsPeng Qian (pqian@mit.edu), Veronica Boyce (vboyce@mit.edu)
TA Office46-3027B (Peng) 46-3027F (Veronica)
TA Office HoursThursdays 1-3pm (Peng); Mondays 11am-1pm (Veronica)

3 Course Description

Over the last two and a half decades, computational linguistics has been revolutionized as a result of three closely related developments: increases in computing power, the advent of large linguistic datasets, and a paradigm shift toward probabilistic modeling. At the same time, similar theoretical developments in cognitive science have led to a view major aspects of human cognition as instances of rational statistical inference. These developments have set the stage for renewed interest in computational approaches to human language use. Correspondingly, this course covers some of the most exciting developments in computational psycholinguistics over the past decade. The course spans human language comprehension, production, and acquisition, and covers key phenomena spanning phonetics, phonology, morphology, syntax, semantics, and pragmatics. Students will learn technical tools including probabilistic models, formal grammars, neural networks, and decision theory, and how theory, computational modeling, and data can be combined to advance our fundamental understanding of human language acquisition and use.

4 Course organization

We'll meet twice a week; the course format will be a combination of lecture, discussion, and in-class exercises as class size, structure, and interests permit.

5 Intended Audience

Undergraduate or graduate students in Brain & Cognitive Sciences, Linguistics, Electrical Engineering & Computer Science, and any of a number of related disciplines. The undergraduate section is 9.19, the graduate section is 9.190. Postdocs and faculty are also welcome to participate!

The course prerequisites are:

  1. One semester of Python programming (fulfillable by 6.00/6.0001+6.0002, for example), plus
  2. Either:
    • one semester of probability/statistics/machine learning (fulfilled by, for example, 6.041B or 9.40), or
    • one semester of introductory linguistics (fulfilled by 24.900).

If you think you have the requisite background but have not taken the specific courses just mentioned, please talk to the instructor to work out whether you should take this course or do other prerequisites first.

We will be doing Python programming in this course, and also using programs that must be run from the Unix/Linux/OS X command line. If you have less than one semester of Python programming expereince and/or would like to strengthen your Python programming background, I have listed some resources recommended by others here.

6 Readings & Textbooks

Readings will frequently be drawn from the following textbooks:

  1. Daniel Jurafsky and James H. Martin. Speech and Language Processing. Third edition (draft). Draft chapters can be found here. (I refer to this book as "SLP" in the syllabus.)

    This textbook is the single most comprehensive and up-to-date introduction available to the field of computational linguistics.

  2. Bird, Steven, Ewan Klein, and Edward Loper. 2009. Natural Language Processing with Python. O'Reilly Media. (I refer to this book as "NLTK" in the syllabus.)

    This is the book for the Natural Language Toolkit (or NLTK), which we will be using extensively to do programming We will also be doing some of our programming in the Python programming language, and will make quite a bit of use of for Python. You can buy this book, or you can freely access it on the Web at http://www.nltk.org/book.

  3. Christopher D. Manning and Hinrich Schütze. (1999). Foundations of statistical natural language processing. Cambridge: MIT press. Book chapter PDFs can be obtained through the MIT library website. (I refer to this book as "M&S" in the syllabus.)

    This is an older but still very useful book on NLP.

  4. Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze. 2008. Introduction to Information Retrieval. Cambridge University Press. Book chapter PDFs can be obtained through the MIT library website. (I refer to this book as "MRS" in the syllabus.)

We'll also occasionally draw upon other sources for readings, including original research papers in computational linguistics, psycholinguistics, and other areas of the cognitive science of language.

7 Syllabus (subject to modification!)

WeekDayTopicReadingsRelated readingsProblem sets
Week 1Wed 6 FebCourse intro; intro to probability theory;Feldman et al., 2009Pset 1 out
Week 2Mon 11 FebSpeech perception and the perceptual magnetMRS Chapter 13
Wed 13 FebWord sequences, language models and \(N\)-gramsSLP 3
Week 3Tue 19 FebPsycholinguistic methods: human eye movements and language understandingNoneMcMurray et al., 2008; Rayner, 1998; Dahan et al., 2001
Wed 20 FebPrediction in human language understanding; surprisalKutas et al. 2011; Piantadosi et al., 2011Smith & Levy, 2013Pset 1 due (Fri 22 Feb)
Week 4Mon 25 FebWord embeddings and neural networksSLP 6; Levy, Goldberg, & Dagan, 2015Mikolov et al., 2013, Pennington et al., 2014, Gutierrez et al., 2016
Wed 27 FebRecurrent neural network models for languageSLP 7; Young et al., 2018, Section IV; Eisenstein, 2018, Section 6.3Goldberg, 2015; Collobert et al., 2011Pset 2 out (Fri 1 Mar)
Week 5Mon 4 MarRegular expressionsSLP 2.1-2.1.6
Wed 6 MarFinite-state machinesSLP 2.2-2.4, 3
Week 6Mon 11 MarFinite-state machines II
Wed 13 MarIs human language finite state?SLP 11; SLP 2nd edition chapter 16 (under Readings on Stellar)Chomsky, 1956
Week 7Mon 18 MarContext-free grammars; Syntactic analysis.SLP 12; NLTK 8.1-8.5; Levy & Andrew, 2006Gazdar, 1981 (esp. Section 2); Müller, 2018 (esp. Section 5.4); Joshi et al., 1991 (on formalisms beyond context-free)Pset 2 due
Wed 20 MarMidterm Exam
Spring Break25-29 MarSpring break, no class
Week 8Mon 1 AprProbabilistic context-free grammars, incremental parsing, human syntactic processingSLP 12; NLTK 8.6; Levy, 2013Jurafsky, 1996; Hale, 2001; Levy, 2008Pset 3 out
Wed 3 AprSearching treebanks, estimating PCFGs, and accounting for syntactic comprehension effectsSLP 10SLP 13
Week 9Mon 8 AprWhat do neural networks learn about language structure – and what don't they learn?Linzen et al., 2016; Wilcox et al., 2018Linzen 2019
Wed 10 AprArtificial neural networks as psycholinguistic subjectsFutrell et al., 2019Pset 3 due (Fri 12 Apr)
Week 10Mon 15 AprPatriots Day, no class (student holiday due to Patriots Day)
Wed 17 AprImplicit associations in word embeddingsCaliskan et al., 2017
Week 11Mon 22 AprBayes Nets and interventionsKraljic et al., 2008; Russell & Norvig, 2010, chapter 14 (on Stellar); Levy in progress, Directed Graphical Models appendix;Bayes Nets lecture notesPset 4 out
Wed 24 AprMulti-factor models: logistic regression; word order preferences in language. Hierarchical models; binomial construction.SLP 7; Graphical models intro ; Morgan & Levy, 2015
Week 12Mon 29 Apr in 56-114Statistical word learning in humans; modeling with nonparametric BayesSaffran et al., 1996; Goldwater, Griffiths, & Johnson, 2009
Wed 1 MayThe emergence of syntactic productivity in language developmentMeylan et al., 2017
Week 13Mon 6 MayCombining grammars and neural networksDyer et al., 2016; Kuncoro et al., 2017Choe & Charniak, 2016; Dyer et al., 2015
Wed 8 MayComparing the grammatical capabilities of sequence-based and hybrid neuro-symbolic modelsFutrell et al., 2019; Wilcox et al., 2019Pset 4 due (Fri 10 May)
Week 14Mon 13 MayNoisy-channel language comprehensionLevy et al., 2009; Gibson et al., 2013; Futrell & Levy, 2017
Wed 15 MayEnd-of-semester reviewUse this class to review readings from the rest of the semester!Final projects due Thursday, May 16
Final ExamTue 21 May 2:15-4:30pm

8 Requirements & grading

You'll be graded on:

WorkGrade percentage (9.19)Grade percentage (9.190)
A number of homework assignments throughout the semester50%37.5%
A midterm exam20%15%
A final exam30%22.5%
If you are enrolled in 9.190, a final project25%

Active participation in the class is also encouraged and taken into account in borderline grade cases!

8.1 Homework late policy

Homework assignments can be turned in up to 7 days late; 10% of your score will be deducted for each 24 hours of lateness (rounded up). For example, if a homework assignment is worth 80 points, you turn it in 3 days late, and earn a 70 before lateness is taken into account, your score will be (1-0.3)*70=49.

8.2 Medical or personal circumstances impacting psets, exams, or projects

If medical or personal circumstances such as illness impact your work on a pset or project, or your ability to take an exam on the scheduled date with adequate preparation, please work with Student Support Services (S3) to verify these circumstances and be in touch with the instructor. We are happy to work with you in whatever way is most appropriate to your individual circumstances to help ensure that you are able to achieve your best performance in class while maintaining your health, happiness, and well-being.

8.3 Mapping of class score to letter grade

We use new homework and exam materials every year, and it can be hard to perfectly predict the difficulty of assignments and exames. Therefore I determine standards for end-of-semester letter grades in light of student performance throughout the class (though I do not grade on a curve). However, I guarantee minimum grades on the basis of the following thresholds:

ThresholdGuaranteed minimum grade
>=90%A-
>=80%B-
>=70%C-
>=60%D

So, for example, an overall score of 90.0001% of points guarantees you an A-, but you could well wind up with a higher grade depending on the curve.

9 Mailing list

There will be a mailing list for this course, which you can access at https://mailman.mit.edu:444/mailman/listinfo/9.19-2019-spring. Please make sure you're signed up for it! This list is both for discussion of ideas in the class and for communications about organizational issues.

Author: Roger Levy (rplevy@mit.edu)

Org version 7.5 with Emacs version 25

Validate XHTML 1.0