Suitable for graduate students interested in doing theoretical or applied (computational, ELT, etc.) linguistic research using corpora.
The study of language using corpora. Use of corpora within linguistics and cognitive science. Definition and varieties of corpora. Building a corpus: sampling, representativeness, encoding and annotation. Characteristics of major available corpora. Necessary statistics to interpret corpus data. Using corpora: corpora in psycholinguistics; corpora and syntax, semantics, and discourse; statistical natural language processing. Using (or writing) tools for corpus-based language studies. Conducting a research project on available corpora.
Course content: