News-based intelligent prediction of financial markets using text mining and machine learning: A systematic literature review

被引:33
|
作者
Ashtiani, Matin N. [1 ]
Raahemi, Bijan [1 ]
机构
[1] Univ Ottawa, Telfer Sch Management, Knowledge Discovery & Data Min Lab, 55 Laurier Ave East, Ottawa, ON K1N 6N5, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Stock market prediction; Systematic literature review; Natural language processing; Text mining; Machine learning; ARTIFICIAL NEURAL-NETWORKS; SENTIMENT ANALYSIS; STOCK; ALGORITHMS; RESOURCES; FORECAST; ARTICLES; SUPPORT; IMPACT; RETURN;
D O I
10.1016/j.eswa.2023.119509
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Researchers and practitioners have attempted to predict the financial market by analyzing textual (e.g., news articles and social media) and numeric data (e.g., hourly stock prices, and moving averages). Among textual data, while many papers have been published that analyze social media, news content has gained limited attention in predicting the stock market. Acknowledging that news is critical in predicting the stock market, the focus of this systematic review is on papers investigating machine learning and text mining techniques to predict the stock market using news. Using Kitchenham's methodology, we present a systematic review of the literature on intelligent financial market prediction, examining data mining and machine learning approaches and the employed datasets. From five digital libraries, we identified 61 studies from 2015- 2022 for synthesis and interpretation. We present notable gaps and barriers to predicting financial markets, then recommend future research scopes. Various input data, including numerical (stock prices and technical indicators) and textual data (news text and sentiment), have been employed for news-based stock market prediction. News data collection can be costly and time-consuming: most studies have used custom crawlers to gather news articles; however, there are financial news databases available that could significantly facilitate news collection. Furthermore, although most datasets have covered fewer than 100K records, deep learning and more sophisticated artificial neural networks can process enormous datasets faster, improving future model performance. There is a growing trend toward using artificial neural networks, particularly recurrent neural networks and deep learning models, from 2018 to 2021. Furthermore, regression and gradient-boosting models have been developed for stock market prediction during the last four years. Although word embedding approaches for feature representation have been employed recently with good accuracy, emerging language models may be a focus for future research. Advanced natural language processing methods like transformers have undeniably contributed to intelligent stock market prediction. However, stock market prediction has not yet taken full advantage of them.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Intelligent Fraud Detection in Financial Statements Using Machine Learning and Data Mining: A Systematic Literature Review
    Ashtiani, Matin N.
    Raahemi, Bijan
    [J]. IEEE ACCESS, 2022, 10 : 72504 - 72525
  • [2] News-based Machine Learning and Deep Learning Methods for Stock Prediction
    Guo, Junjie
    Tuckfield, Bradford
    [J]. 4TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE APPLICATIONS AND TECHNOLOGIES (AIAAT 2020), 2020, 1642
  • [3] Software fault prediction using data mining, machine learning and deep learning techniques: A systematic literature review
    Batool, Iqra
    Khan, Tamim Ahmed
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2022, 100
  • [4] Financial Fraud Detection Based on Machine Learning: A Systematic Literature Review
    Ali, Abdulalem
    Abd Razak, Shukor
    Othman, Siti Hajar
    Eisa, Taiseer Abdalla Elfadil
    Al-Dhaqm, Arafat
    Nasser, Maged
    Elhassan, Tusneem
    Elshafie, Hashim
    Saif, Abdu
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (19):
  • [5] Machine Learning Techniques in the Energy Consumption of Buildings: A Systematic Literature Review Using Text Mining and Bibliometric Analysis
    Abdelaziz, Ahmed
    Santos, Vitor
    Dias, Miguel Sales
    [J]. ENERGIES, 2021, 14 (22)
  • [6] Crowdsourcing: a systematic review of the literature using text mining
    Pavlidou, Ioanna
    Papagiannidis, Savvas
    Tsui, Eric
    [J]. INDUSTRIAL MANAGEMENT & DATA SYSTEMS, 2020, 120 (11) : 2041 - 2065
  • [7] PREDICTION OF OSTEOARTHRITIS PROGRESSION USING MACHINE LEARNING: A SYSTEMATIC LITERATURE REVIEW
    Castagno, Simone
    Gompels, Benjamin
    Strangmark, Estelle
    Robertson-Waters, Eve
    Birch, Mark
    van der Schaar, Mihaela
    McCaskie, Andrew
    [J]. OSTEOARTHRITIS AND CARTILAGE, 2024, 32 : S68 - S69
  • [8] Crop yield prediction using machine learning: A systematic literature review
    van Klompenburg, Thomas
    Kassahun, Ayalew
    Catal, Cagatay
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2020, 177
  • [9] Factors affecting text mining based stock prediction: Text feature representations, machine learning models, and news platforms
    Lin, Wei -Chao
    Tsai, Chih-Fong
    Chen, Hsuan
    [J]. APPLIED SOFT COMPUTING, 2022, 130
  • [10] A systematic literature review of software effort prediction using machine learning methods
    Ali, Asad
    Gravino, Carmine
    [J]. JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2019, 31 (10)