ISCL Proseminar (Sommersemester 2010)

Grammar Formalisms in Computational Linguistics

Abstract:

Given that natural languages cannot be characterized by simply listing all possible sentences and their meaning, a range of grammar formalisms have been developed to characterize form and meaning in a general and compact way. The approaches differ in terms of their focus, empirical coverage, formal foundations, expressive power, conceptualization of generalizations, and the processing regimes that have been developed for those formalisms.

After a general overview of grammar types in the Chomsky Hierarchy, we will discuss plain context-free grammars as a baseline on which we will introduce and compare several current grammar formalisms. The plan is to include a discussion of Head-Driven Phrase Structure Grammar (HPSG), Tree Adjoining Grammars (TAG), Lexical Functional Grammar (LFG), Combinatoric Categorial Grammars (CCG), and Dependency Grammar (DG).

The focus will be on obtaining a sound working knowledge of how different formalisms capture some of the fundamental phenomena of natural language syntax: argument and adjunct realization, agreement and government, middle-distance phenomena (e.g., equi, raising), long-distance phenomena (e.g., fronting).

Important: In place of a separate Übung for this Proseminar, all participants are required to enroll in the “Introduction to HPSG” taught by Kordula De Kuthy, Tue, 8:30–10:00.

Instructor: Detmar Meurers

Tutor: Peter Makarov

Credits and Campus:

Course meets:

Moodle: We will be using the university Moodle site for the course, primarily for the discussing forum and to access course materials. Our course is accessible under Moodle at http://moodle01.zdv.uni-tuebingen.de/course/view.php?id=377.

To log into this specific Moodle site, you use your general ZDV university account id and password. The first time you access the course Moodle site, you need a course subscription password, which you get in class. Moodle and privacy: Note that Moodle generally keeps detailed logs of your interaction with the system, e.g., when you log in, etc.

Campus course description:
http://campus.verwaltung.uni-tuebingen.de/lsfpublic/rds?state=verpublish&status=init&vmfile=no&publishid=51333&moduleCall=webInfo&publishConfFile=webInfo&publishSubDir=veranstaltung

Email: In the Moodle system everyone in the course can send messages to other participants in the class, and I will use this to contact you for class related matters. Such email gets sent to your regular ZDV account (@student.uni-tuebingen.de). So

Nature of course and my expectations: This is a core course in ISCL and I expect each participant to take an active role in the class: i) regularly participate in the discussions, ii) carefully read any reading assignments before class and post a question about it in the Moodle forum the day before class.

Grading: The course will be graded based on the final exam (40% of grade), participation and activities (30% of grade), and your grade in the HPSG component (30%). Note: Following the rules of the Neuphilologische Fakultät, missing more than two meetings unexcused, automatically results in failing the class.

Class etiquette: Please come to class on time, do not pack up early, read or work on materials for other classes during our class. When in the computer lab, only use the computers when you are asked to do a specific activity; do not read email or browse the web. All portable electronic devices such as cell phones should be switched off for the entire length of the flight, oops, class. If for some reason, you have to leave early or miss class for an important reason, please let me know before class.

Academic conduct and misconduct: Learning and research are driven by discussion and free exchange of ideas, motivations, and perspectives. So you are encouraged to work in groups, discuss, and exchange ideas. At the same time, the foundation of the free exchange of ideas is that everyone is open about where they obtained which information. Concretely, this means you are expected to always make explicit when you’ve worked on something as a team – and keep in mind that being part of a team means sharing the work! For text you write, you always have to provide explicit references for any ideas or passages you reuse from somewhere else. This includes text “found” on the web, where you should cite the url of the web site in case no more official publication is available. Failure to follow these important guidelines is academic misconduct, which will be sanctioned by failing you on the assignment, exam, or the entire class depending on the severity of the violation.

Session plan:

  1. Wednesday, April 14: Motivation of topic and organization of course
  2. Monday, April 19: The empirical domain: What observations and generalizations do we want to capture? Local phenomena (subcategorization, government, agreement), non-local phenomena, middle distance dependencies (non-finite constructions).
  3. Wednesday, April 21: (continued)
  4. Monday, April 26: (continued)
  5. Wednesday, April 28: Formal Language Theory
  6. Monday, May 3: (continued)
  7. Wednesday, May 5: Context-Free Grammars: Motivation (Constituency) and generalizations over CFG rules (X-bar Schema)
  8. Monday, May 10: (continued)
  9. Wednesday, May 12: From Context-Free Grammars to Definite Clause Grammars
  10. Monday, May 17: (continued)
  11. Wednesday, May 19: CFG and DCG lab session and homework
  12. Monday, May 24: Holiday
  13. Wednesday, May 26: Holiday
  14. Monday, May 31: Lexical-Functional Grammar
  15. Wednesday, June 2: LFG lab session
  16. Monday, June 7: LFG continued
  17. Wednesday, June 9: Dependency Grammar
  18. Monday, June 14: DG lab session
  19. Wednesday, June 16: DG continued
  20. Monday, June 21: CCG
  21. Wednesday, June 23: CCG lab session
  22. Monday, June 28: CCG continued
  23. Wednesday, June 30: Tree-Adjoining Grammar
  24. Monday, July 5: TAG lab session
  25. Wednesday, July 7: TAG continued
  26. Monday, July 12: HPSG
  27. Wednesday, July 14: HPSG lab session