The course will mainly use the following handbook:
McEnery, Tony, and Hardie, Andrew (2012). Corpus Linguistics. Cambridge: Cambridge University Press. (294 pages).
The parts on data analysis and processing will instead be based on selected sections of:
Gries, Stefan Th. (2009). Quantitative Corpus Linguistics with R. New York and London: Routledge. (248 pages).
Other material, such as case-study articles, will be provided during the course.
A good primer on text manipulation is:
Kenneth W. Church’s “Unix for Poets”, which is easy to find online in several formats. This is the original link to the document (download it and save it as a PostScript .ps file, which can then be converted to PDF or printed): http://www.cs.jhu.edu/~kchurch/wwwfiles/tutorials/unix_for_poets.ps