SAMPLING AND REPRESENTATIVENESS IN CORPUS CONSTRUCTION
Abstract
Abstract: The creation of linguistic corpora is a vital aspect of linguistic research, allowing scholars to analyze language use in various contexts. This article explores the principles of sampling and representativeness in corpus construction, highlighting their significance in ensuring that linguistic data accurately reflects real-world language use. Through examining different sampling methods and their implications for representativeness, this paper aims to provide a clear understanding of how these factors impact linguistic analysis and findings.
References
- Burnard, L. (2000). Reference Guide for the British National Corpus. Oxford University Press.
- McEnery, T., & Wilson, A. (2001). Corpus Linguistics: An Introduction. Edinburgh University Press.
- Biber, D., Conrad, S., & Reppen, R. (1998). Corpus Linguistics: Investigating Language Structure and Use. Cambridge University Press.

