PHRASEOLOGY STUDIES USING CORPUS DATA: A QUANTITATIVE AND QUALITATIVE APPROACH
Abstract
This paper examines the role of corpus linguistics in contemporary phraseology studies. Traditional phraseology focused largely on highly idiomatic or fixed multi-word units; corpus-based methodologies have broadened its scope to formulaic language more generally, including collocations, lexical bundles, and phrasal patterns. Using a large-scale, general-purpose corpus (e.g., the Corpus of Contemporary American English, COCA), this study quantitatively identifies and qualitatively analyzes the frequency, distribution, and semantic prosody of a selected set of multi-word expressions (MWEs). The findings show that corpus data reveal patterns of actual language use that challenge intuition-based accounts and provide an empirical foundation for improved lexicography, language teaching, and Natural Language Processing (NLP).
References
2. Biber, D., Johansson, S., Leech, G., Conrad, S., & Finegan, E. (1999). Longman grammar of spoken and written English. London: Longman.
3. Sinclair, J. M. (1991). Corpus, concordance, collocation. Oxford: Oxford University Press.
4. Moon, R. (1998). Fixed expressions and idioms in English: A corpus-based approach. Oxford: Clarendon Press.

