The Predictability of Tree-based Machine Learning Algorithms in the Big Data Context

Cited by: 5
Authors
Qolipour, F. [1]
Ghasemzadeh, M. [1]
Mohammad-Karimi, N. [1]
Affiliations
[1] Yazd Univ, Dept Comp Engn, Yazd, Iran
Source
INTERNATIONAL JOURNAL OF ENGINEERING | 2021 / Vol. 34 / No. 01
Keywords
Stock Market; Big Data; Prediction; Machine Learning; Tree-based Algorithms; Ensemble Algorithms; PRICE; DIRECTION; RETURNS;
DOI
10.5829/ije.2021.34.01a.10
Chinese Library Classification number
T [Industrial Technology];
Discipline classification code
08;
Abstract
This research examines the predictability of ensemble and singular tree-based machine learning algorithms during the recession and prosperity of two companies listed on the Tehran Stock Exchange, in the context of big data. The main issue is that economic managers and the academic community require prediction models with higher accuracy and shorter execution time; moreover, predicting a company's recession in the stock market is highly significant. Machine learning algorithms must be able to predict the stock return sign appropriately on both market downturn and boom days. Addressing this challenge will improve the quality of stock purchases and, in turn, increase profitability. The proposed solution relies on tree-based machine learning algorithms in the context of big data. It uses the decision tree algorithm, a traditional, singular tree-based learning algorithm. In addition, two modern ensemble tree-based learning algorithms, random forest and gradient boosted tree, have been used to predict the stock return sign during recession and prosperity. These were implemented with machine learning tools in the Python programming language and the PySpark library, which is designed for the big data context. The research data are the share information of two companies on the Tehran Stock Exchange. The results show that the ensemble learning algorithms performed better than the singular learning algorithms. Additionally, adding 23 technical features to the initial data and then applying the PCA feature reduction method gave the best performance among the modes tested. It was also concluded that the initial data do not possess proper resolution or generalizability, either during prosperity or recession.
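The abstract does not define the prediction target or list the 23 technical features, so the following is only a minimal sketch under stated assumptions: the "stock return sign" is taken to be 1 when the closing price rises from one day to the next and 0 otherwise (a flat day counts as 0), and a simple trailing moving average stands in for one possible technical feature. The function names and the sample prices are hypothetical and do not come from the paper.

```python
from typing import List, Optional


def return_signs(closes: List[float]) -> List[int]:
    """Label each day after the first: 1 if the close rose, else 0.

    Assumption: a flat day is labeled 0; the paper may treat it differently.
    """
    return [1 if today > prev else 0 for prev, today in zip(closes, closes[1:])]


def simple_moving_average(closes: List[float], window: int) -> List[Optional[float]]:
    """Trailing mean of the last `window` closes; None until enough history exists.

    Illustrative stand-in for a technical feature, not one confirmed by the paper.
    """
    out: List[Optional[float]] = []
    for i in range(len(closes)):
        if i + 1 < window:
            out.append(None)
        else:
            out.append(sum(closes[i + 1 - window : i + 1]) / window)
    return out


# Hypothetical closing prices, for illustration only.
closes = [100.0, 102.0, 101.0, 101.0, 105.0]
labels = return_signs(closes)
sma3 = simple_moving_average(closes, 3)
```

In a setup like the one the abstract describes, labels of this kind would serve as the classification target, and derived features such as the moving average would be appended to the raw share data before dimensionality reduction and model training.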
Pages: 82-89
Number of pages: 8