Quantitative Typology: a practical introduction into data exploration and visualization using R
Course description
The programming language R has become the de facto standard for
statistical modeling in academic research. There are several reasons
for this, including the following- R is free and platform independent.
- R is a script language that can be used interactively via the command line.
- R has powerful graphics facilities that are highly useful for exploratory data analysis..
syllabus
date | topic | readings | program
code |
homework
|
---|---|---|---|---|
08/31 | interactive R sessions, data frames. |
Baayen 2008, chapter 1 |
script from
class, verbs.txt, verbs.csv |
|
09/01 | working with data frames, data digging in the world color survey | Baayen 2008, chapter 1 (cont.) Jäger (2010a,b) |
heid.txt,
heid.csv |
Baayen, chapter 1, exercises
(page 20 in the pdf version, page 19 in the printed book) |
09/02 | data
digging
in
the
world
color
survey |
|||
09/03 | power laws in linguistic typology |
readings:
- Baayen, Harald (2008), Analyzing Linguistic Data. A Practical Introduction to Statistics Using R. CUP.
- Jäger, Gerhard (2010a), Natural color categories are convex sets, manuscript, University of Tübingen (an abridged version appeared in the preproceedings of the 17th Amsterdam Colloquium)
- Jäger, Gerhard (2010b), Using statistics for cross-linguistic semantics: a quantitative investigation of the typology of color naming systems, manuscript, University of Tübingen