Sentiment Analysis on Covid-19 Vaccination in Indonesia Using Support Vector Machine and Random Forest

I Made Sumertajaya, Yenni Angraini, Jamaluddin Rabbani Harahap, Anwar Fitrianto

Abstract


World Health Organization (WHO) stated Covid-19 as a global pandemic in March, 2020. This pandemic has influenced people’s life in many sectors such as the economy, health, tourism, and many more. One way to end this pandemic is to make herd immunity obtained through the vaccination program. This program still raises pros and cons at the beginning of its implementation in Indonesia. Many people doubt the safety and side effects of the vaccine. There are also pros and cons to vaccination programs in social media such as Twitter. This platform generates a huge amount of text data containing people's perceptions about vaccines. This research aims to predict sentiment using supervised learning such as support vector machine (SVM) and random forest and capture sentiment about vaccines in Indonesia in the first two weeks of the program. The result shows SVM was a better model than random forest based on the precision and F1-score metrics. The SVM approach produces a precision value of 0.50, a recall of 0.64, and an F1-score of 0.52. In the study, it was also found that tweets with neutral sentiment dominated the twitter user sentiment in the study period. Tweets with negative sentiment decreased after the first week of the COVID-19 vaccination program.


Keywords


Coronavirus, Vaccination, Support Vector Machine, Random Forest, Twitter, Linear Discriminant AnalysisDA

References


[1] F. He, Y. Deng, and W. Li, “Coronavirus disease 2019: What we know?,” Journal of medical virology, vol. 92, no. 7, pp. 719–725, 2020.

[2] S. Law, A. W. Leung, and C. Xu, “Severe acute respiratory syndrome (SARS) and coronavirus disease-2019 (COVID-19): From causes to preventions in Hong Kong,” International Journal of Infectious Diseases, vol. 94, pp. 156–163, 2020.

[3] N. A. Bakar and S. Rosbi, “Effect of Coronavirus disease (COVID-19) to tourism industry,” International Journal of Advanced Engineering Research and Science, vol. 7, no. 4, pp. 189–193, 2020.

[4] H. E. Randolph and L. B. Barreiro, “Herd immunity: understanding COVID-19,” Immunity, vol. 52, no. 5, pp. 737–741, 2020.

[5] C. Rosales Sánchez, M. Craglia, and A. K. Bregt, “New data sources for social indicators: the case study of contacting politicians by Twitter,” International journal of digital earth, vol. 10, no. 8, pp. 829–845, 2017.

[6] J. M. Banda et al., “A large-scale COVID-19 Twitter chatter dataset for open scientific research—an international collaboration,” Epidemiologia, vol. 2, no. 3, pp. 315–324, 2021.

[7] A. Hasan, S. Moin, A. Karim, and S. Shamshirband, “Machine learning-based sentiment analysis for twitter accounts,” Mathematical and Computational Applications, vol. 23, no. 1, p. 11, 2018.

[8] S. A. El Rahman, F. A. AlOtaibi, and W. A. AlShehri, “Sentiment analysis of twitter data,” in 2019 International Conference on Computer and Information Sciences (ICCIS), 2019, pp. 1–4.

[9] K. Sailunaz and R. Alhajj, “Emotion and sentiment analysis from Twitter text,” Journal of Computational Science, vol. 36, p. 101003, 2019.

[10] M. R. Huq, A. Ali, and A. Rahman, “Sentiment analysis on Twitter data using KNN and SVM,” International Journal of Advanced Computer Science and Applications, vol. 8, no. 6, pp. 19–25, 2017.

[11] A. Afiyati, A. Azhari, A. K. Sari, and A. Karim, “Challenges of Sarcasm Detection for Social Network: A Literature Review,” JUITA: Jurnal Informatika, vol. 8, no. 2, pp. 169–178, 2020.

[12] F. Hemmatian and M. K. Sohrabi, “A survey on classification techniques for opinion mining and sentiment analysis,” Artificial intelligence review, vol. 52, no. 3, pp. 1495–1545, 2019.

[13] B. Laurensz and E. Sediyono, “Analisis Sentimen Masyarakat terhadap Tindakan Vaksinasi dalam Upaya Mengatasi Pandemi Covid-19,” Jurnal Nasional Teknik Elektro dan Teknologi Informasi, vol. 10, no. 2, pp. 118–123, 2021.

[14] S. Dey, S. Wasif, D. S. Tonmoy, S. Sultana, J. Sarkar, and M. Dey, “A comparative study of support vector machine and Naive Bayes classifier for sentiment analysis on Amazon product reviews,” in 2020 International Conference on Contemporary Computing and Applications (IC3A), 2020, pp. 217–220.

[15] M. Z. Ansari, M. B. Aziz, M. O. Siddiqui, H. Mehra, and K. P. Singh, “Analysis of political sentiment orientations on twitter,” Procedia Computer Science, vol. 167, pp. 1821–1828, 2020.

[16] P. G. Pratama and N. A. Rakhmawati, “Social Bot Detection on 2019 Indonesia President Candidate’s Supporter’s Tweets,” Procedia Computer Science, vol. 161, pp. 813–820, 2019.

[17] G. Saposnik et al., “Factors associated with the decision-making on endovascular thrombectomy for the management of acute ischemic stroke,” Stroke, vol. 50, no. 9, pp. 2441–2447, 2019.

[18] F. Thabtah, S. Hammoud, F. Kamalov, and A. Gonsalves, “Data imbalance in classification: Experimental evaluation,” Information Sciences, vol. 513, pp. 429–441, 2020.


Full Text: PDF

DOI: 10.30595/juita.v10i1.12394

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.

ISSN: 2579-8901