The seminar aims at providing theoretical and methodological background to Corpus Linguistics research, in terms of corpus creation, annotation and analysis. A corpus is a collection of naturally-occurring language text, chosen to characterize a state or variety of a language, a collection of texts representative of a given language put together for linguistic analysis. Corpus-based approaches to language analysis are used to expound, test or exemplify theories and descriptions that were formulated before large corpora became available to inform language study. Corpus-driven linguists are strictly committed to the integrity of the data as a whole. Theoretical statements are fully consistent with, and reflect directly, the evidence provided ...