TY - JOUR
T1 - Text mining and determinants of sentiments
T2 - Twitter social media usage by traditional media houses in Uganda
AU - Namugera, Frank
AU - Wesonga, Ronald
AU - Jehopio, Peter
N1 - Publisher Copyright:
© 2019, The Author(s).
PY - 2019/12/1
Y1 - 2019/12/1
N2 - Unstructured data generated from sources such as the social media and traditional text documents are increasing and form a larger proportion of unanalysed data especially in the developing countries. In this study, we analysed data received from the major print and non-print media houses in Uganda through the Twitter platform to generate non-trivial knowledge by using text mining analytics. We also explored the determinants of derived sentiments in Twitter messaging. The results show that sentiments generated from tweets derived from the main print media houses (Daily Monitor and New Vision) were positively correlated, so were the sentiments from the non-print media (NBS TV and NTV) for the study period. Most of the sentiments on topics of security, politics and economics were found to be negative, while those on sports were positive. Furthermore, the tweet sentiment statistical logistic model revealed that negative sentiments were determined by the retweet status, retweet count and source of the tweets. Moreover, the positive sentiments were determined by the topic of discussion, type of media house and other sources of tweets (p< 0.05). Therefore, we recommend further extensions on the predictive statistical models to classify sentiments from social media based on the concept of big data analytics.
AB - Unstructured data generated from sources such as the social media and traditional text documents are increasing and form a larger proportion of unanalysed data especially in the developing countries. In this study, we analysed data received from the major print and non-print media houses in Uganda through the Twitter platform to generate non-trivial knowledge by using text mining analytics. We also explored the determinants of derived sentiments in Twitter messaging. The results show that sentiments generated from tweets derived from the main print media houses (Daily Monitor and New Vision) were positively correlated, so were the sentiments from the non-print media (NBS TV and NTV) for the study period. Most of the sentiments on topics of security, politics and economics were found to be negative, while those on sports were positive. Furthermore, the tweet sentiment statistical logistic model revealed that negative sentiments were determined by the retweet status, retweet count and source of the tweets. Moreover, the positive sentiments were determined by the topic of discussion, type of media house and other sources of tweets (p< 0.05). Therefore, we recommend further extensions on the predictive statistical models to classify sentiments from social media based on the concept of big data analytics.
KW - Classification
KW - Sentiments
KW - Statistical models
KW - Text mining
KW - Traditional media
KW - Twitter social media
UR - http://www.scopus.com/inward/record.url?scp=85073242931&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85073242931&partnerID=8YFLogxK
U2 - 10.1186/s40649-019-0063-4
DO - 10.1186/s40649-019-0063-4
M3 - Article
AN - SCOPUS:85073242931
SN - 2197-4314
VL - 6
JO - Computational Social Networks
JF - Computational Social Networks
IS - 1
M1 - 3
ER -