Corpus Linguistics

Corpus Linguistics

4.11 - 1251 ratings - Source

Corpus Linguistics seeks to provide a comprehensive sampling of real-life usage in a given language, and to use these empirical data to test language hypotheses. Modern corpus linguistics began fifty years ago, but the subject has seen explosive growth since the early 1990s. These days corpora are being used to advance virtually every aspect of language study, from computer processing techniques such as machine translation, to literary stylistics, social aspects of language use, and improved language-teaching methods. Because corpus linguistics has grown fast from small beginnings, newcomers to the field often find it hard to get their bearings. Important papers can be difficult to track down. This volume reprints forty-two articles on corpus linguistics by an international selection of authors, which comprehensively illustrate the directions in which the subject is developing. It includes articles that are already recognized as classics, and others which deserve to become so, supplemented with editorial introductions relating the individual contributions to the field as a whole. This collection of readings will be useful to students of corpus linguistics at both undergraduate and postgraduate level, as well as academics researching this fascinating area of linguistics. g2.1.4 Indeterminacy A final difference between the Penn Treebank tagset and all other tagsets we are aware of concerns ... Treebank corpus is produced in two stages, using a combination of automatic PoS assignment and manual correction .

Title:Corpus Linguistics
Author:Geoffrey Sampson, Diana McCarthy
Publisher:A&C Black - 2005-12-07


You Must CONTINUE and create a free account to access unlimited downloads & streaming