Corpus-Driven approaches to the representation of Pakistani Culture in Newspapers’ blogs

  • Ayesha Jamal
  • Tehseen Zahra
Keywords: Blogs, Blogging, Collocation, Corpus, Corpus Driven Approach, Hofstede’s Onion Model


Blogs have become a notable part of online communication culture. They have grown into a massive communication tool of various themes prevailing in society and ultimately represent culture. Blogging allows people to be a part of a substantial communication system. Blogs facilitate researchers in bringing out cultural diversity and practices of masses of a region. The present study explored the representation of Pakistani culture through blogs. Pakistani bloggers used different lexical items to portray the culture of Pakistan. The data for this study has been collected from the blogs published in 2016 and 2017 in two different English online newspapers of Pakistan, The Express Tribune, and The Dawn through purposive sampling. The data comprised of 48,513 words. Hofstede’s (1991) Onion Model provided the foundation of this study. This study explored the different layers and themes of Pakistani culture in blogs and provided a platform for readers to know about the culture of Pakistan. This study is both quantitative and qualitative. Quantitative method dealt with numerical data i.e. wordlist (number of words related to themes) and collocates (number of collocates) while the qualitative method dealt with collocation in context.  The corpus was cleaned through an online text fixer and analyzed through two corpus tools named Antconc and Lanxbox. Antconc helped us in compiling the wordlist and exploring its collocates and Lanxbox was used to explore the relationship of node words with different lexical items. The findings of the study reveal various themes related to Pakistani culture like women’s sufferings and killings, child abuse, students’ politics within Pakistani universities, the superiority of English over Urdu, jirgas injustice, vicious spending’s on Eid ul Azha, lifafa culture on eid ul fitar, Pakistani feminocentric dramas, inequality in marriage certificates, dowry and so on. It is hoped that this study would help national as well as international community of scholars in understanding various layers of the culture of Pakistan.


Badey, Paul, (2011), «Culture, personality and society», in International Journal of Research Development, 4, n. 1, pp.1-7. Retrieved From:

Briggs, Mark, Schaffer, Jan (2007), Journalism 2.0, College Park, MD: J-Lab.

Etikan, Ilker, Musa, Sulaiman Abubakar, Alkassim, Rukayya Sunusi (2016), «Comparison of convenience sampling and purposive sampling», in American Journal of Theoretical and Applied Statistics, 5, n. 1, pp. 1-4.

Fang, Tony (2010), «Asian management research needs more self-confidence: Reflection on Hofstede (2007) and beyond», in Asia Pacific Journal of Management, 27, n. 1, pp. 155-170.

Gene Zucker, Harold (1978), «The variable nature of news media influence», in Annals of the International Communication Association, 2, n. 1, pp. 225-240.

Heine, Bernd, Narrog, Heiko (2015), The Oxford handbook of linguistic analysis, Handbooks in Linguistic, Oxford.

Hofstede, Geert (1984), «Cultural dimensions in management and planning», in Asia Pacific Journal of Management, 1, n. 2, pp. 81-99.

Hofstede, Geert (1991), Cultures and organizations: Software of mind., McGraw-Hill, London.

Hofstede, Geert J. (2009), «Research on cultures: how to use it in training?», in European Journal of Cross-Cultural Competence and Management, 1, n. 1, pp. 14-21.

Hofstede, G. H., Hofstede, G. J., Minkov, M. (2010), Cultures and organizations: Software of the mind (3rd ed.), McGraw-Hill Education, New York, NY.

House, R. J., Hanges, P. J., Javidan, M., Dorfman, P. W., & Gupta, V. (2004), Culture, leadership, and organizations: The GLOBE study of 62 societies. Sage Publications, New York.

Jaborooty, Maryam P., Baker, Paul (2017), «Resisting silence: Moments of empowerment in Iranian women's blogs», in Gender & Language, 11, n. 1, pp. 77-99.

Lehecka, Tomas (2015), Collocation and colligation, in Östman, Jan-Ola, Verschueren, Jef, edited by, Handbook of pragmatics online, John Benjamins, Amsterdam.

Lule, Jack (2012), Understanding Media and Culture: An Introduction to Mass Communication. FlatWorld, Boston, MA.

Mackiewicz, Jo (2016), The aboutness of writing center talk: A corpus-driven and discourse analysis, Taylor & Francis Ltd, London.

Matheson, Donald (2005), Media discourses, McGraw-Hill Education, UK.

McEnery, Tony, Xiao, Richard, Tono, Yukio (2006), Corpus-based language studies: An advanced, Routledge, London.

McQuail, Denis (2010), McQuail's mass communication theory, Sage publications, New York.

Miller, Carolyn, R., & Shepherd, Dawn (2004), Blogging as social action: A genre analysis of the weblog, University of Minnesota.

Sharma, Gaganpreet (2017), «Pros and cons of different sampling techniques», in International Journal of Applied Research, 3, n. 7, pp. 749-752.

Stubbs, Michael (1995), «Collocations and semantic profiles: On the cause of the trouble with quantitative studies», in Functions of Language, 2, n. 1, pp. 23-55.

Shehzad,W and Zahra, T (2019, Ongoing research project), Pakistan Gender Text (PakGenText), Funded by Higher Education Commission Pakistan.

How to Cite
Jamal, A. and Zahra, T. (2022) “Corpus-Driven approaches to the representation of Pakistani Culture in Newspapers’ blogs ”, Rivista Italiana di Filosofia del Linguaggio. doi: 10.4396/202204MC.