News-based intelligent prediction of financial markets using text mining and machine learning: A systematic literature review

被引:33
|
作者
Ashtiani, Matin N. [1 ]
Raahemi, Bijan [1 ]
机构
[1] Univ Ottawa, Telfer Sch Management, Knowledge Discovery & Data Min Lab, 55 Laurier Ave East, Ottawa, ON K1N 6N5, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Stock market prediction; Systematic literature review; Natural language processing; Text mining; Machine learning; ARTIFICIAL NEURAL-NETWORKS; SENTIMENT ANALYSIS; STOCK; ALGORITHMS; RESOURCES; FORECAST; ARTICLES; SUPPORT; IMPACT; RETURN;
D O I
10.1016/j.eswa.2023.119509
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Researchers and practitioners have attempted to predict the financial market by analyzing textual (e.g., news articles and social media) and numeric data (e.g., hourly stock prices, and moving averages). Among textual data, while many papers have been published that analyze social media, news content has gained limited attention in predicting the stock market. Acknowledging that news is critical in predicting the stock market, the focus of this systematic review is on papers investigating machine learning and text mining techniques to predict the stock market using news. Using Kitchenham's methodology, we present a systematic review of the literature on intelligent financial market prediction, examining data mining and machine learning approaches and the employed datasets. From five digital libraries, we identified 61 studies from 2015- 2022 for synthesis and interpretation. We present notable gaps and barriers to predicting financial markets, then recommend future research scopes. Various input data, including numerical (stock prices and technical indicators) and textual data (news text and sentiment), have been employed for news-based stock market prediction. News data collection can be costly and time-consuming: most studies have used custom crawlers to gather news articles; however, there are financial news databases available that could significantly facilitate news collection. Furthermore, although most datasets have covered fewer than 100K records, deep learning and more sophisticated artificial neural networks can process enormous datasets faster, improving future model performance. There is a growing trend toward using artificial neural networks, particularly recurrent neural networks and deep learning models, from 2018 to 2021. Furthermore, regression and gradient-boosting models have been developed for stock market prediction during the last four years. Although word embedding approaches for feature representation have been employed recently with good accuracy, emerging language models may be a focus for future research. Advanced natural language processing methods like transformers have undeniably contributed to intelligent stock market prediction. However, stock market prediction has not yet taken full advantage of them.
引用
收藏
页数:23
相关论文
共 50 条
  • [41] A systematic literature review of machine learning based risk prediction models for diabetic retinopathy progression
    Usman, Tiwalade Modupe
    Saheed, Yakub Kayode
    Nsang, Augustine
    Ajibesin, Abel
    Rakshit, Sandip
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2023, 143
  • [42] Evaluation of machine learning-based models for prediction of clinical deterioration: A systematic literature review
    Jahandideh, Sepideh
    Ozavci, Guncag
    Sahle, Berhe W.
    Kouzani, Abbas Z.
    Magrabi, Farah
    Bucknall, Tracey
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2023, 175
  • [43] Rainfall Prediction using Data Mining Techniques: A Systematic Literature Review
    Aftab, Shabib
    Ahmad, Munir
    Hameed, Noureen
    Bashir, Muhammad Salman
    Ali, Iftikhar
    Nawaz, Zahid
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (05) : 143 - 150
  • [44] Fake News Detection Using Deep Learning: A Systematic Literature Review
    Alnabhan, Mohammad Q.
    Branco, Paula
    [J]. IEEE ACCESS, 2024, 12 : 114435 - 114459
  • [45] Flood Prediction Using Machine Learning Models: Literature Review
    Mosavi, Amir
    Ozturk, Pinar
    Chau, Kwok-wing
    [J]. WATER, 2018, 10 (11)
  • [46] A Review of Intelligent Systems for the Prediction of Wind Energy Using Machine Learning
    Kumar Dubey, Ashutosh
    Kumar, Abhishek
    Segovia Ramirez, Isaac
    Garcia Marquez, Fausto Pedro
    [J]. PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE AND ENGINEERING MANAGEMENT - VOL 1, 2022, 144 : 476 - 491
  • [47] Stroke mortality prediction using machine learning: systematic review
    Schwartz, Lihi
    Anteby, Roi
    Klang, Eyal
    Soffer, Shelly
    [J]. JOURNAL OF THE NEUROLOGICAL SCIENCES, 2023, 444
  • [48] Intelligent RFQ Summarization Using Natural Language Processing, Text Mining, and Machine Learning Techniques
    Trappey, Amy J. C.
    Chang, Ai-Che
    Trappey, Charles V.
    Chien, Jack Y. C. Chang
    [J]. JOURNAL OF GLOBAL INFORMATION MANAGEMENT, 2022, 30 (01)
  • [49] ALIGNING TEXT MINING AND MACHINE LEARNING ALGORITHMS WITH BEST PRACTICES FOR STUDY SELECTION IN SYSTEMATIC LITERATURE REVIEWS
    Popoff, E.
    Jansen, J. P.
    Besada, M.
    Cope, S.
    Kanters, S.
    [J]. VALUE IN HEALTH, 2018, 21 : S371 - S371
  • [50] Aligning text mining and machine learning algorithms with best practices for study selection in systematic literature reviews
    E. Popoff
    M. Besada
    J. P. Jansen
    S. Cope
    S. Kanters
    [J]. Systematic Reviews, 9