Monitoring rare categories in sentiment and opinion analysis: a Milan mega event on Twitter platform

被引:0
|
作者
Calissano, Anna [1 ]
Vantini, Simone [1 ]
Arena, Marika [2 ]
机构
[1] Politecn Milan, MOX Dept Math, Piazza Leonardo da Vinci 32, I-20133 Milan, Italy
[2] Politecn Milan, Dept Management Econ & Ind Engn, Via Lambruschini 4-B, I-20156 Milan, Italy
来源
STATISTICAL METHODS AND APPLICATIONS | 2020年 / 29卷 / 04期
关键词
Classification; Sentiment analysis; Twitter; Expo; Web reputation; Mega event; MODEL; POSITIONS; TEXT;
D O I
10.1007/s10260-019-00504-7
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This paper proposes a new aggregated classification scheme aimed to support the implementation of semantic text analysis methods in contexts characterized by the presence of rare text categories. The proposed approach starts from the aggregate supervised text classifier developed by Hopkins and King and moves forward, relying on rare event sampling methods. In detail, it enables the analyst to enlarge the number of estimated sentiment categories, both preserving the estimation accuracy and reducing the working time to unconditionally increase the size of the training set. The approach is applied to study the daily evolution of the web reputation of one of the last mega-event taking place in Europe: Expo Milano. The corpus consists of more than one million tweets in both Italian and English, discussing about the event. The analysis provides an interesting portrayal of the evolution of the Expo stakeholders' opinions over time and allows the identification of the main drivers of the Expo reputation. The algorithm will be implemented as a running option in the next release of the R package ReadMe.
引用
收藏
页码:787 / 812
页数:26
相关论文
共 44 条
  • [41] A novel sentiment analysis framework for monitoring the evolving public opinion in real-time: Case study on climate change
    El Barachi, May
    AlKhatib, Manar
    Mathew, Sujith
    Oroumchian, Farhad
    [J]. JOURNAL OF CLEANER PRODUCTION, 2021, 312
  • [42] Comparison of SVM & Naive Bayes Algorithm for Sentiment Analysis Toward West Java']Java Governor Candidate Period 2018-2023 Based on Public Opinion on Twitter
    Kristiyanti, Dinar Ajeng
    Umam, Akhmad Hairul
    Wahyudi, Mochamad
    Amin, Ruhul
    Marlinda, Linda
    [J]. 2018 6TH INTERNATIONAL CONFERENCE ON CYBER AND IT SERVICE MANAGEMENT (CITSM), 2018, : 667 - 672
  • [43] Analyzing online public opinion on Thailand-China high-speed train and Laos-China railway mega-projects using advanced machine learning for sentiment analysis
    Nokkaew, Manussawee
    Nongpong, Kwankamol
    Yeophantong, Tapanan
    Ploykitikoon, Pattravadee
    Arjharn, Weerachai
    Siritaratiwat, Apirat
    Narkglom, Sorawit
    Wongsinlatam, Wullapa
    Remsungnen, Tawun
    Namvong, Ariya
    Surawanitkun, Chayada
    [J]. SOCIAL NETWORK ANALYSIS AND MINING, 2023, 14 (01)
  • [44] Analyzing online public opinion on Thailand-China high-speed train and Laos-China railway mega-projects using advanced machine learning for sentiment analysis
    Manussawee Nokkaew
    Kwankamol Nongpong
    Tapanan Yeophantong
    Pattravadee Ploykitikoon
    Weerachai Arjharn
    Apirat Siritaratiwat
    Sorawit Narkglom
    Wullapa Wongsinlatam
    Tawun Remsungnen
    Ariya Namvong
    Chayada Surawanitkun
    [J]. Social Network Analysis and Mining, 14