TY - GEN
T1 - Forecasting the Chilean electoral year
T2 - 10th International Conference on Social Computing and Social Media, SCSM 2018 Held as Part of HCI International 2018
AU - Rodríguez, Sebastián
AU - Allende-Cid, Héctor
AU - Palma, Wenceslao
AU - Alfaro, Rodrigo
AU - Gonzalez, Cristian
AU - Elortegui, Claudio
AU - Santander, Pedro
N1 - Publisher Copyright:
© Springer International Publishing AG, part of Springer Nature 2018.
PY - 2018
Y1 - 2018
N2 - Failures of traditional survey methods for measuring political climate and forecasting high impact events such as elections, offers opportunities to seek alternative methods. The analysis of social networks with computational linguistic methods have been proved to be useful as an alternative, but several studies related to these areas were conducted after the event (post hoc). Since 2017 was the election year for the 2018–2022 period for Chile and, moreover, there were three instances of elections in this year. This condition makes a good environment to conduct a case study for forecasting these elections with the use of social media as the main source of Data. This paper describes the implementation of multiple algorithms of supervised machine learning to do political sentiment analysis to predict the outcome of each election with Twitter data. These algorithms are Decision Trees, AdaBoost, Random Forest, Linear Support Vector Machines and ensemble voting classifiers. Manual annotations of a training set are conducted by experts to label pragmatic sentiment over the tweets mentioning an account or the name of a candidate to train the algorithms. Then a predictive set is collected days before the election and an automatic classification is performed. Finally the distribution of votes for each candidate is obtained from this classified set on the positive sentiment of the tweets. Ultimately, an accurate prediction was achieved using an ensemble voting classifier with a Mean Absolute Error of 0.51 % for the second round.
AB - Failures of traditional survey methods for measuring political climate and forecasting high impact events such as elections, offers opportunities to seek alternative methods. The analysis of social networks with computational linguistic methods have been proved to be useful as an alternative, but several studies related to these areas were conducted after the event (post hoc). Since 2017 was the election year for the 2018–2022 period for Chile and, moreover, there were three instances of elections in this year. This condition makes a good environment to conduct a case study for forecasting these elections with the use of social media as the main source of Data. This paper describes the implementation of multiple algorithms of supervised machine learning to do political sentiment analysis to predict the outcome of each election with Twitter data. These algorithms are Decision Trees, AdaBoost, Random Forest, Linear Support Vector Machines and ensemble voting classifiers. Manual annotations of a training set are conducted by experts to label pragmatic sentiment over the tweets mentioning an account or the name of a candidate to train the algorithms. Then a predictive set is collected days before the election and an automatic classification is performed. Finally the distribution of votes for each candidate is obtained from this classified set on the positive sentiment of the tweets. Ultimately, an accurate prediction was achieved using an ensemble voting classifier with a Mean Absolute Error of 0.51 % for the second round.
KW - Election forecasting
KW - Ex ante forecasting
KW - Machine learning
KW - Sentiment analysis
UR - http://www.scopus.com/inward/record.url?scp=85050546489&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-91485-5_23
DO - 10.1007/978-3-319-91485-5_23
M3 - Conference contribution
AN - SCOPUS:85050546489
SN - 9783319914848
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 298
EP - 314
BT - Social Computing and Social Media. Technologies and Analytics - 10th International Conference, SCSM 2018, Held as Part of HCI International 2018, Proceedings
A2 - Meiselwitz, Gabriele
PB - Springer Verlag
Y2 - 15 July 2018 through 20 July 2018
ER -