Data selection and collection for constructing investor sentiment from social media

被引:0
|
作者
Liu, Qing [1 ,2 ]
Son, Hosung [1 ]
机构
[1] Pukyong Natl Univ, Grad Sch Management Technol, Busan 48547, South Korea
[2] Huainan Normal Univ, Sch Econ & Management, Dongshan West Rd, Huainan, Anhui, Peoples R China
来源
基金
新加坡国家研究基金会;
关键词
STOCK MESSAGE BOARDS; INFORMATION-CONTENT; MICROBLOGGING DATA; TWITTER; NOISE; PREDICTION; NEWS; PATTERNS; OPINIONS; FACEBOOK;
D O I
10.1057/s41599-024-03316-7
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
Research based on investor sentiment in social media has been a hot topic of research in behavioral finance, and the reliability of investor sentiment mined from social media is a potential condition for the reliability of the results of these studies. In the past, scholars have often focused on using more reliable tools to track investor sentiment in order to get more reliable investor sentiment. However, less attention has been paid to another key factor affecting the reliability of investor sentiment on social media: the selection and collection of data. In this study, we systematically investigate the process of data selection and collection in relation to the construction of investor sentiment on social media. Our findings suggest that the process of creating a dataset from social media is a process that starts and ends with a research question. In this process, we need to overcome various obstacles to end up with an imperfect dataset. The researchers must take a series of steps to get close to the best dataset and acknowledge some of the shortcomings and limitations. We emphasize that the absence of accepted, reliable standards makes it particularly important to follow basic principles. This study is an important reference for social media-based behavioral finance research.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Using APIs for Data Collection on Social Media
    Lomborg, Stine
    Bechmann, Anja
    INFORMATION SOCIETY, 2014, 30 (04): : 256 - 265
  • [32] Issues of social data analytics with a new method for sentiment analysis of social media data
    Wang, Zhaoxia
    Tong, Victor Joo Chuan
    Chan, David
    2014 IEEE 6TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM), 2014, : 899 - 904
  • [33] Feature Selection for Social Media Data
    Tang, Jiliang
    Liu, Huan
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2014, 8 (04)
  • [34] Hybrid Ensemble Learning With Feature Selection for Sentiment Classification in Social Media
    Sharma, Sanur
    Jain, Anurag
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2020, 10 (02) : 40 - 58
  • [35] Social informedness and investor sentiment in the GameStop short squeeze
    Kwansoo Kim
    Sang-Yong Tom Lee
    Robert J. Kauffman
    Electronic Markets, 2023, 33
  • [36] The emergence of social media data and sentiment analysis in election prediction
    Chauhan, Priyavrat
    Sharma, Nonita
    Sikka, Geeta
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (02) : 2601 - 2627
  • [37] Social informedness and investor sentiment in the GameStop short squeeze
    Kim, Kwansoo
    Lee, Sang-Yong Tom
    Kauffman, Robert J.
    ELECTRONIC MARKETS, 2023, 33 (01)
  • [38] The emergence of social media data and sentiment analysis in election prediction
    Priyavrat Chauhan
    Nonita Sharma
    Geeta Sikka
    Journal of Ambient Intelligence and Humanized Computing, 2021, 12 : 2601 - 2627
  • [39] Sentiment Classification of Social Media Data for Telecommunication Companies in Turkey
    Iseri, Ismail
    Atasoy, Omer Faruk
    Alcicek, Harun
    2017 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2017, : 1015 - 1019
  • [40] Facial Emotion Recognition for Sentiment Analysis of Social Media Data
    de Paula, Diandre
    Alexandre, Luis A.
    PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2022), 2022, 13256 : 207 - 217