Extraction of Semantic Domains Through Corpus Tools
The increased interest in the techniques of corpus linguistics in the first decade of 21st century was based on the most important premises, which are valid even today – investigation of larger datasets in less time. This article compares the results of different corpus techniques employed for exploring the dominant semantic domains in a corpus. These corpus techniques include use of word clouds, frequency lists and KWIC of a text. This study uses fictional discourse by Kamila Shamsie – namely Broken Verses (2005) – to illustrate the corpus methodology. In addition to different corpus techniques, this study also compares the usability of different corpus software for this purpose such as, Antconc (3.2.4), Nvivo 11, and Sketch Engine. This article will prove to be a good beginning point for the researchers exploring a text in any field of corpus linguistics and digital humanities.
-
CADS, Digital Humanities, E-Humanities, KWIC, Lemma, Semantic Fields, Stemmed
-
(1) Azka Khan
PhD Scholar, Department of English, Fatima Jinnah Women University, Rawalpindi, Punjab, Pakistan.
(2) Sarwet Rasul
Associate Professor, Department of English, Fatima Jinnah Women University, Rawalpindi, Punjab, Pakistan.
- Baker, P. (2006). Glossary of corpus linguistics. Edinburgh University Press.
- Brinton, L. J. (2000). The structure of modern English: A linguistic introduction. John Benjamins Publishing.
- Brinton, L. J. (Ed.). (2001). Historical Linguistics 1999: Selected papers from the 14th International Conference on Historical Linguistics, Vancouver, 9 13 August 1999 (Vol. 215). John Benjamins Publishing.
- Burdick, A., Drucker, J., Lunenfeld, P., Presner, T., & Schnapp, J. (2012). Digital_Humanities. Mit Press.
- Edhlund, B., & McDougall, A. (2019). NVivo 12 Essentials. Lulu. com.
- Hu, C. (2015). Using Wmatrix to Explore Discourse of Economic Growth. English Language Teaching, 8(9), 146-156
- Hunt, S. (2015). Representations of gender and agency in the Harry Potter series. In Corpora and Discourse Studies (pp. 266-284). Palgrave Macmillan, London.
- Knowles, G., & Don, Z. M. (2004). The notion of a
- Mahlberg, M., Stockwell, P., Joode, J. D., Smith, C., & O'Donnell, M. B. (2016). CLiC Dickens: novel uses of concordances for the integration of corpus stylistics and cognitive poetics. Corpora, 11(3), 433-463.
- Pollak, Senja, Coesemans, R., Daelemans, W., & Lavrac, N. (2011). Detect ing contrast patterns in newspaper articles by combining discourse analysis and text mining. Pragmatics 21 (4): 647-
- Rayson, P. (2008). From key words to key semantic domains. International Journal of Corpus Linguistics, 13(4), 519-549.
- Rayson, P. (2009). Wmatrix: a web-based corpus processing environment.
- Rayson, P., Archer, D. E., Baron, A., Culpeper, J., & Smith, N. (2007). Tagging the Bard: Evaluating the accuracy of a modern POS tagger on Early Modern English corpora. In Proceedings of the Corpus Linguistics conference: CL2007.
- Rayson, P., Archer, D., Piao, S., & McEnery, A. M. (2004). The UCREL semantic analysis system.
- Sharoff, S. (2004, May). Towards Basic Categories for Describing Properties of Texts in a Corpus. In LREC.
- Stubbs, M. (2004). Conrad, concordance, collocation: heart of darkness or light at the end of the tunnel?' The Third Sinclair Open Lecture.
- Stubbs, M. (2005). Conrad in the computer: examples of quantitative stylistic methods. Language and Literature, 14(1), 5-24.
- Terras, M. (2011). Quantifying digital humanities. UCL Centre for Digital Humanities.
- Wiedemann, G. (2013). Opening up to big data: Computer-assisted analysis of textual data in social sciences. Historical Social Research/Historische Sozialforschung, 332-357.
Cite this article
-
APA : Khan, A., & Rasul, S. (2020). Extraction of Semantic Domains Through Corpus Tools. Global Language Review, V(I), 153-168. https://doi.org/10.31703/glr.2020(V-I).17
-
CHICAGO : Khan, Azka, and Sarwet Rasul. 2020. "Extraction of Semantic Domains Through Corpus Tools." Global Language Review, V (I): 153-168 doi: 10.31703/glr.2020(V-I).17
-
HARVARD : KHAN, A. & RASUL, S. 2020. Extraction of Semantic Domains Through Corpus Tools. Global Language Review, V, 153-168.
-
MHRA : Khan, Azka, and Sarwet Rasul. 2020. "Extraction of Semantic Domains Through Corpus Tools." Global Language Review, V: 153-168
-
MLA : Khan, Azka, and Sarwet Rasul. "Extraction of Semantic Domains Through Corpus Tools." Global Language Review, V.I (2020): 153-168 Print.
-
OXFORD : Khan, Azka and Rasul, Sarwet (2020), "Extraction of Semantic Domains Through Corpus Tools", Global Language Review, V (I), 153-168
-
TURABIAN : Khan, Azka, and Sarwet Rasul. "Extraction of Semantic Domains Through Corpus Tools." Global Language Review V, no. I (2020): 153-168. https://doi.org/10.31703/glr.2020(V-I).17