Does pre-processing affect the correlation indicator between Twitter message volume and stock market trading volume?

被引:1
|
作者
Michalak, Joanna [1 ]
机构
[1] Nicolaus Copernicus Univ Torun, Fac Econ Sci & Managenent, Dept Econ, Ul Gagarina 13a, PL-87100 Torun, Poland
来源
EKONOMIA I PRAWO-ECONOMICS AND LAW | 2020年 / 19卷 / 04期
关键词
twitter sentiment analysis; behavioral economy; data mining;
D O I
10.12775/EiP.2020.048
中图分类号
F [经济];
学科分类号
02 ;
摘要
Motivation: More and more authors empirically verify the relationship between the volume of tweets and the stock market indicators. The patterns explored from Twitter most often take the form of time series that represent user's activity on different level of granularity (moods, emotions, relevant topic or query-related messages). Sentiment analysis is a technique used to transform text data into information on the mood and related behavioral categories. Supervised machine learning is the most commonly used approach to sentiment analysis. Thus, the results of an empirical analysis of the relationship between social media and stock depend on the quality of results of classification task. The quality of the features used to learn the classifier plays a key role. The feature space is modified using various data pre-processing scenarios that aim to increase accuracy of classification. The impact of pre-processing data on the quality of classification is often discussed in studies. Very few authors discuss the impact of pre-processing on the correlation indicator between Twitter and stock market. Aim: Analysis of the impact of tweets pre-processing on the Pearson correlation indicator between the mood of Twitter users and stock market trading volume. Results: The correlation between the volume of stock market trading and the volume of tweets has been empirically confirmed. The effect of pre-processing on the correlation index was noted for the variables 'all_tweets' and 'negative_tweets'. This is because the training set has a significant amount of tweets with negation. However, the results are not conclusive. The differences between the Pearson correlation index calculated for scenario one and scenario four are not significant. However, this indicates that the effect of noise data may reduce the quality and precision of conclusions. Especially in the case of frequent repetition of a certain category of noise.
引用
收藏
页码:738 / 754
页数:17
相关论文
共 17 条
  • [1] Does investor attention affect trading volume in the Brazilian stock market?
    De Souza, Heloisa Elias
    Da Silveira Barbedo, Claudio Henrique
    Araujo, Gustavo Silva
    [J]. RESEARCH IN INTERNATIONAL BUSINESS AND FINANCE, 2018, 44 : 480 - 487
  • [2] The relationship between trading activity and stock market volatility: Does the volume threshold matter?
    Koubaa, Yosra
    Slim, Skander
    [J]. ECONOMIC MODELLING, 2019, 82 : 168 - 184
  • [3] Does Weather Still Affect The Stock Market?: New Insights Into The Effects Of Weather On Returns, Volatility, And Trading Volume
    Muhlack N.
    Soost C.
    Henrich C.J.
    [J]. Schmalenbach Journal of Business Research, 2022, 74 (1): : 1 - 35
  • [4] The Empirical Relationship between Stock Return and Trading Volume based on Stock Market Cycles
    Christiana, Amanda Melissa
    Septiana, Eva
    Mamduch
    [J]. INDONESIAN CAPITAL MARKET REVIEW, 2016, 8 (01) : 46 - 57
  • [5] The empirical relationship between stock returns volatility and trading volume: evidence on the Tunis stock market
    Boubaker, Adel
    Makram, Beljid
    [J]. INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE AND ENGINEERING MANAGEMENT, 2011, 6 (05) : 374 - 381
  • [6] The Relationship Between Stock Return Volatility and Trading Volume: Evidence from the Investors in the Taiwan Stock Market
    Kuo, Shewhuei
    Hsiao, Junglieh
    Chan Huiju
    [J]. ENVIRONMENT, LOW-CARBON AND STRATEGY, 2011, : 956 - 959
  • [7] The Relationship between Volatility and Trading Volume in the Chinese Stock Market: A Volatility Decomposition Perspective
    Wang, Tianyi
    Huang, Zhuo
    [J]. ANNALS OF ECONOMICS AND FINANCE, 2012, 13 (01): : 211 - 236
  • [8] Time-of-day periodicities of trading volume and volatility in Bitcoin exchange: Does the stock market matter?
    Wang, Jying-Nan
    Liu, Hung-Chun
    Hsu, Yuan-Teng
    [J]. FINANCE RESEARCH LETTERS, 2020, 34
  • [9] Does corporate social responsibility affect risk spillovers between the carbon emissions trading market and the stock market?
    Zhang, Junru
    Hassan, Kamrul
    Wu, Zhuochen
    Gasbarro, Dominic
    [J]. JOURNAL OF CLEANER PRODUCTION, 2022, 362
  • [10] Pre-processing and feature/volume correlation in CT radiomics in non-small cell lung cancer
    Volpe, S.
    Isaksson, L. J.
    Pepa, M.
    Zaffaroni, M.
    Raimondi, S.
    Lo Presti, G.
    Garibaldi, C.
    Rampinelli, C.
    Marvaso, G.
    Gandini, S.
    Cremonesi, M.
    Jereczek-Fossa, B. A.
    [J]. RADIOTHERAPY AND ONCOLOGY, 2022, 170 : S1569 - S1570