Document Type
Article
Publication Date
12-1-2025
Journal / Book Title
Journal of Big Data
Abstract
The “wisdom of the crowd” (WoC) refers to the notion that collective human knowledge is capable of outperforming even individual expert knowledge. This study investigates the application of this phenomenon to lexicon-based sentiment analysis of text data. Lexicons are frequently used to classify the sentiment of text data, particularly in the absence of sentiment class label information. We propose leveraging some of the most popular, publicly-available lexicons created in the last half century to improve sentiment analysis performance. Specifically, this research argues that the collective information provided by the thirteen lexicons included in the crowd constitutes a WoC situation that can more accurately predict the sentiment in the majority of example cases when compared to individual lexicons, lexicon ensembles, and machine learning methods. Thirteen popular sentiment-labeled text datasets, comprised of different types of text data and covering a variety of domains, are used to test this research proposition. We show that the WoC sentiment analysis achieves greater performance than individual lexicons, which are considered to be ‘experts’, and a lexicon ensemble approach. In comparing our novel approach to sentiment analysis against popular machine learning approaches, the proposed WoC method achieves superior results in the majority of examples. By overcoming many of the limitations of other approaches with high accuracy, the WoC method can provide organizations with real-time, reliable, and accurate sentiment analysis.
DOI
10.1186/s40537-025-01186-7
MSU Digital Commons Citation
Hill, Chelsey H.; Fresneda, Jorge E.; and Anandarajan, Murugan, "The wisdom of the lexicon crowds: leveraging on decades of lexicon-based sentiment analysis for improved results" (2025). Department of Information Management and Business Analytics Faculty Scholarship and Creative Works. 190.
https://digitalcommons.montclair.edu/infomgmt-busanalytics-facpubs/190
Rights
© The Author(s) 2025. Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License
Published Citation
Hill, C. H., Fresneda, J. E., & Anandarajan, M. (2025). The wisdom of the lexicon crowds: leveraging on decades of lexicon-based sentiment analysis for improved results. Journal of Big Data, 12(1), 129.