Construction of the Corpus of Nigeria New Media Discourse in English (CONNMDE)
CONNMDE started in 2014.
The primary aim of the project is to build a large database of different genres of English-based discourse on web-based and social media platforms emanating from Nigeria.
Supported by the Alexander von Humboldt Foundation, Germany, the project is part of the pioneering efforts in Digital Humanities hosted at the Digital Humanities Research Unit (DIHRU), Faculty of Arts, University of Lagos, Nigeria.
The project's database currently holds more than 1 million word tokens of Nigerian political discourse text. This material forms a sub-component of CONNMDE called the Corpus of Nigeria New Media Political Discourse in English (CONNMPDE).
The corpus draws on two major Nigerian online newspaper portals (“www.punch.com” and “www.vanguardngr.com”). It also contains posts and communications from leading Nigerian political parties and figures (APC, PDP, Goodluck Jonathan, Muhammadu Buhari, etc.) on Facebook and Twitter, the two most used micro-blogging sites in Nigeria.
The current text data has been pre-processed and cleaned using the R programming language. The text cleaning process ran from October 2016 to January 2017.
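For readers unfamiliar with what text cleaning involves, the following is a minimal illustrative sketch only. It is written in Python rather than R and is not the project's actual pipeline. The function names (clean_text, tokenize), the sample sentence, and the specific cleaning steps (removing markup and links, normalising whitespace, lower-casing before counting tokens) are all assumptions chosen for illustration.

```python
import re

# Hypothetical patterns; the project's real cleaning rules are not documented here.
MARKUP_RE = re.compile(r"<[^>]+>")               # leftover HTML-style tags
URL_RE = re.compile(r"https?://\S+|www\.\S+")    # web links
TOKEN_RE = re.compile(r"[A-Za-z0-9]+(?:['’-][A-Za-z0-9]+)*")  # simple word tokens


def clean_text(raw):
    """Remove markup and links, then collapse runs of whitespace."""
    text = MARKUP_RE.sub(" ", raw)
    text = URL_RE.sub(" ", text)
    return re.sub(r"\s+", " ", text).strip()


def tokenize(text):
    """Lower-case the text and split it into word tokens."""
    return TOKEN_RE.findall(text.lower())


# Made-up sample, not taken from the corpus.
sample = "<p>Read more at www.example.com:   the   election results are in!</p>"
cleaned = clean_text(sample)
tokens = tokenize(cleaned)
print(cleaned)
print(len(tokens), tokens)
```

Counting the items returned by tokenize over every cleaned document is one simple way a word-token total such as the one quoted for the corpus could be obtained, although the project's own counting method may differ.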
Kindly note that more data will be made available for download on the site.