The Linguistic Landscape of “Controversial”: Sentiment and Theme Distribution Insights

Elizaveta G. Grishechko

Abstract


The language used to frame controversial topics on social media has profound implications for public discourse and opinion formation, warranting a close examination of their sentiment and thematic distribution. This study investigates the sentiment and themes associated with controversial topics by analyzing Reddit posts containing the token “controversial” in their titles on three news-related subreddits, aiming to bridge a gap in existing literature by focusing on platform-specific sentiment analysis with an emphasis on content typology. A mixed-methods NLP approach instrumented via Python was employed, combining VADER-supported sentiment analysis and a qualitative content analysis using n-grams to identify and categorize themes. The sentiment analysis results indicated that most of the content had neutral sentiment, which testifies to the predominantly fact-based approach to presenting information with lack of strong emotional connotations. However, the overall compound sentiment scores were negative, which suggests a strong negative undertone in the framing of controversial topics. The theme distribution analysis revealed that Politics and Legislation was the most predominant theme, followed by Technology and Surveillance, Social Issues and Controversies, Health and Medicine, and Environment and Energy. This distribution attests to a range of societal issues that generate controversy on social media platforms. Study findings can be used by content creators and social media analysts to track online content sentiment, guide content moderation practices, and improve audience engagement. By demonstrating the potential of NLP techniques, this study also contributes to the fields of media research and language technology, which can encourage a better scholarly evaluation of online discourse.


Keywords


sentiment analysis; natural language processing; social media; online discourse; Python

Full Text:

PDF

References


Agapova, E. A., & Grishechko, E. G. (2016). Censorship as a factor of information warfare. Russian Linguistic Bulletin, 3(7), 43-44. https://doi.org/10.18454/RULB.7.06

Agur, C., & Frisch, N. (2019). Digital disobedience and the limits of persuasion: Social media activism in Hong Kong’s 2014 Umbrella Movement. Social Media and Society, 5(1). https://doi.org/10.1177/2056305119827002

Aldous, K. K., An, J., & Jansen, B. J. (2023). What really matters? Characterising and predicting user engagement of news postings using multiple platforms, sentiments and topics. Behaviour & Information Technology, 42(5), 545-568. https://doi.org/10.1080/0144929X.2022.2030798

Alkhammash, R. (2021). The social media framing of gender pay gap debate in American women’s sport: A linguistic analysis of emotive language. Training, Language and Culture, 5(4), 22-35. https://doi.org/10.22363/2521-442X-2021-5-4-22-35

Ash, E., Xu, Y., Jenkins, A., & Kumanyika, C. (2019). Framing use of force: An analysis of news organizations’ social media posts about police shootings. Electronic News, 13(2), 93-107. https://doi.org/10.1177/1931243119850239

Azhari, A., & Fang, X. (2018). Social media applications framework: A lexical analysis of users online reviews. International Journal of Services and Standards, 12(2), 140-162. https://doi.org/10.1504/IJSS.2018.091850

Aziz, J. B., & Hashim, F. (2021). Rhetoric of food authenticity and national identity in the new media. GEMA Online Journal of Language Studies, 21(2), 253-272. https://dx.doi.org/10.17576/gema-2021-2102-14

Bednarek, M. (2006). Evaluation in media discourse: Analysis of a newspaper corpus. A&C Black.

Belcastro, L., Branda, F., Cantini, R., Marozzo, F., Talia, D., & Trunfio, P. (2022). Analyzing voter behavior on social media during the 2020 US presidential election campaign. Social Network Analysis and Mining, 12(1). https://doi.org/10.1007/s13278-022-00913-9

Benrouba, F., & Boudour, R. (2023). Emotional sentiment analysis of social media content for mental health safety. Social Network Analysis and Mining, 13(1). https://doi.org/10.1007/s13278-022-01000-9

Berger, J., & Milkman, K. L. (2012). What makes online content viral? Journal of Marketing Research, 49(2), 192-205. https://doi.org/10.2139/ssrn.1528077

Boulianne, S. (2015). Social media use and participation: A meta-analysis of current research. Information, Communication & Society, 18(5), 524-538. https://doi.org/10.1080/1369118X.2015.1008542

Castells, M. (2015). Networks of outrage and hope: Social movements in the Internet age. John Wiley & Sons.

Chau, D., & Lee, C. (2021). “See you soon! ADD OIL AR!”: Code-switching for face-work in edu-social Facebook groups. Journal of Pragmatics, 184, 18-28. https://doi.org/10.1016/j.pragma.2021.07.019

Fernández-Luque, L., & Bau, T. (2015). Health and social media: Perfect storm of information. Healthcare Informatics Research, 21(2), 67-73. https://doi.org/10.4258/hir.2015.21.2.67

Fuchs, T. (2017). Ecology of the brain: The phenomenology and biology of the embodied mind. Oxford University Press.

Gil de Zúñiga, H., Jung, N., & Valenzuela, S. (2012). Social media use for news and individuals’ social capital, civic engagement and political participation. Journal of Computer-Mediated Communication, 17(3), 319-336. https://doi.org/10.1111/j.1083-6101.2012.01574.x

Grašič, T. (2022). The writing style in travel blogs. Folia Linguistica et Litteraria, 13(41), 187-210. https://doi.org/10.31902/fll.41.2022.9

Grishechko, E. G. (2023a). Emojis as nonverbal cues in online communication: Perspectives on conflict resolution and misunderstanding prevention. In Proceedings of 10th SWS International Scientific Conference on Arts And Humanities – ISCAH 2023 (Vol. 23, No. 1). SGEM World Science (SWS) Scholarly Society.

https://doi.org/10.35603/sws.iscah.2023/s11.12

Grishechko, E. G. (2023b). Language and cognition behind simile construction: A Python-powered corpus research. Training, Language and Culture, 7(2), 80-92. https://doi.org/10.22363/2521-442X-2023-7-2-80-92

Hutto, C., & Gilbert, E. (2014, May). Vader: A parsimonious rule-based model for sentiment analysis of social media text. Proceedings of the International AAAI Conference on Web and Social Media, 8(1), 216-225. https://doi.org/10.1609/icwsm.v8i1.14550

Huynh, D., Audet, G., Alabi, N., & Tian, Y. (2021, December 15018). Stock price prediction leveraging Reddit: The role of trust filter and sliding window. In Proceedings of 2021 IEEE International Conference on Big Data (pp. 1054-1060). IEEE. https://doi.org/10.1109/BigData52589.2021.9671412

Johannessen, M. R. (2015, August 30 – September 2). Please like and share! A frame analysis of opinion articles in online news. In Proceedings of the 7th IFIP 8.5 International Conference (pp. 15-26). Springer. https://doi.org/10.1007/978-3-319-22500-5_2

Kasperiuniene, J., & Zydziunaite, V. (2019). A systematic literature review on professional identity construction in social media. Sage Open, 9(1). https://doi.org/10.1177/2158244019828847

Kavitha, M., Naib, B. B., Mallikarjuna, B., Kavitha, R., & Srinivasan, R. (2022, April 28-29). Sentiment analysis using NLP and machine learning techniques on social media data. In Proceedings of the 2nd International Conference on Advance Computing and Innovative Technologies in Engineering (pp. 112-115). IEEE.

https://doi.org/10.1109/ICACITE53722.2022.9823708

Kralj Novak, P., Smailović, J., Sluban, B., & Mozetič, I. (2015). Sentiment of emojis. PloS One, 10(12), e0144296. https://doi.org/10.1371/journal.pone.0144296

Kubin, E., & von Sikorski, C. (2021). The role of (social) media in political polarization: A systematic review. Annals of the International Communication Association, 45(3), 188-206. https://doi.org/10.1080/23808985.2021.1976070

Kumar, N., Nagalla, R., Marwah, T., & Singh, M. (2018). Sentiment dynamics in social media news channels. Online Social Networks and Media, 8, 42-54. https://doi.org/10.1016/j.osnem.2018.10.004

Lai, C. H., & Fu, J. S. (2021). Exploring the linkage between offline collaboration networks and online representational network diversity on social media. Communication Monographs, 88(1), 88-110. https://doi.org/10.1080/03637751.2020.1869797

López-Rabadán, P. (2021). Framing studies evolution in the social media era. Digital advancement and reorientation of the research agenda. Social Sciences, 11(1). https://doi.org/10.3390/socsci11010009

Machavarapu, A. (2022). Reddit sentiments effects on stock market prices. In V. Bhateja, S. C. Satapathy, C. M. Travieso-Gonzalez, & T. Adilakshmi, T. (Eds.), Smart intelligent computing and applications (pp. 75-84). Springer. https://doi.org/10.1007/978-981-16-9669-5_7

Malyuga, E. N., & Akopova, A. S. (2021). Precedence-setting tokens: Issues of classification and functional attribution. Training, Language and Culture, 5(4), 65-76. https://doi.org/10.22363/2521-442X-2021-5-4-65-76

Margetts, H. Z., John, P., Hale, S. A., & Reissfelder, S. (2015). Leadership without leaders? Starters and followers in online collective action. Political Studies, 63(2), 278-299. https://doi.org/10.1111/1467-9248.12075

Miyake, K. (2007). How young Japanese express their emotions visually in mobile phone messages: A sociolinguistic analysis. Japanese Studies, 27(1), 53-72. https://doi.org/10.1080/10371390701268646

Mostafa, M. M. (2019). Clustering halal food consumers: A Twitter sentiment analysis. International Journal of Market Research, 61(3), 320-337. https://doi.org/10.1177/1470785318771451

Oh, H., Goh, K. Y., & Phan, T. Q. (2023). Are you what you tweet? The impact of sentiment on digital news consumption and social media sharing. Information Systems Research, 34(1), 111-136. https://doi.org/10.1287/isre.2022.1112

Omar, N., & Al-Tashi, Q. (2018). Arabic nested noun compound extraction based on linguistic features and statistical measures. GEMA Online Journal of Language Studies, 18(2), 93-107. https://doi.org/10.17576/gema-2018-1802-07

Păvăloaia, V. D., Teodor, E. M., Fotache, D., & Danileţ, M. (2019). Opinion mining on social media data: Sentiment analysis of user preferences. Sustainability, 11(16). https://doi.org/10.3390/su11164459

Schoenebeck, S., Lampe, C., & Triệu, P. (2023). Online harassment: Assessing harms and remedies. Social Media and Society, 9(1). https://doi.org/10.1177/20563051231157297

Shapiro, A. H., Sudhof, M., & Wilson, D. J. (2022). Measuring news sentiment. Journal of Econometrics, 228(2), 221-243. https://doi.org/10.24148/erwp2017-01

Shifman, L. (2013). Memes in digital culture. MIT Press.

Sibul, V. V., Vetrinskaya, V. V., & Grishechko, E. G. (2019). Study of precedent text pragmatic function in modern economic discourse. In E. N. Malyuga (Ed.), Functional approach to professional discourse exploration in linguistics (pp. 131-163). Springer. https://doi.org/10.1007/978-981-32-9103-4_5

Splendiani, S., & Capriello, A. (2022). Crisis communication, social media and natural disasters: The use of Twitter by local governments during the 2016 Italian earthquake. Corporate Communications, 27(3), 509-526. https://doi.org/10.1108/CCIJ-03-2021-0036

Taj, S., Shaikh, B. B., & Meghji, A. F. (2019, January 30-31). Sentiment analysis of news articles: A lexicon-based approach. In Proceedings of the 2nd International Conference on Computing, Mathematics and Engineering Technologies (iCoMET) (pp. 1-5). IEEE. https://doi.org/10.1109/ICOMET.2019.8673428

Tandoc, E. C., Lim, Z. W., & Ling, R. (2018). Defining “fake news”: A typology of scholarly definitions. Digital Journalism, 6(2), 137-153. https://doi.org/10.1080/21670811.2017.1360143

Taufek, T. E., Nor, N. F. M., Jaludin, A., Tiun, S., & Choy, L. K. (2021). Public perceptions on climate change: A sentiment analysis approach. GEMA Online Journal of Language Studies, 21(4), 209-233. https://doi.org/10.17576/gema-2021-2104-11

Valenzuela, S., Halpern, D., Katz, J. E., & Miranda, J. P. (2019). The paradox of participation versus misinformation: Social media, political engagement, and the spread of misinformation. Digital Journalism, 7(6), 802-823. https://doi.org/10.1080/21670811.2019.1623701

Vosoughi, S., Roy, D., & Aral, S. (2018). The spread of true and false news online. Science, 359(6380), 1146-1151. https://doi.org/10.1126/science.aap9559

Waddell, T. F. (2020). The authentic (and angry) audience: How comment authenticity and sentiment impact news evaluation. Digital Journalism, 8(2), 249-266. https://doi.org/10.1080/21670811.2018.1490656

Wankhade, M., Rao, A. C. S., & Kulkarni, C. (2022). A survey on sentiment analysis methods, applications, and challenges. Artificial Intelligence Review, 55(7), 5731-5780. https://doi.org/10.1007/s10462-022-10144-1

Wu, G., & Pan, C. (2022). Audience engagement with news on Chinese social media: A discourse analysis of the People’s Daily official account on WeChat. Discourse & Communication, 16(1), 129-145. https://doi.org/10.1177/17504813211026567

Wu, H., Bakar, K. A., Jaludin, A., & Awal, N. M. (2022). Sentiment analysis of China-related news in The Star Online newspaper. GEMA Online Journal of Language Studies, 22(3), 155-175. https://doi.org/10.17576/gema-2022-2203-09

Yan, C., Law, M., Nguyen, S., Cheung, J., & Kong, J. (2021). Comparing public sentiment toward COVID-19 vaccines across Canadian cities: Analysis of comments on Reddit. Journal of Medical Internet Research, 23(9), e32685. https://doi.org/10.2196/32685

Zappavigna, M. (2018). Searchable talk: Hashtags and social media metadiscourse. Bloomsbury Publishing.

Žitnik, S., Blagus, N., & Bajec, M. (2022). Target-level sentiment analysis for news articles. Knowledge-Based Systems, 249, 108939. https://doi.org/10.1016/j.knosys.2022.108939

Zubbir, N., Dass, L. C., & Ahmad, N. (2021). Analysis on adjective Suki and its co-occurrences in Japanese YouTube’s comment. Pertanika Journal of Social Sciences & Humanities, 29(2), 1357-1374. https://doi.org/10.47836/pjssh.29.2.32

Zuboff, S. (2019). Surveillance capitalism and the challenge of collective action. New Labor Forum, 28(1), 10-29. https://doi.org/10.1177/1095796018819461




DOI: http://dx.doi.org/10.17576/gema-2024-2401-05

Refbacks

  • There are currently no refbacks.


 

 

 

eISSN : 2550-2131

ISSN : 1675-8021