The Predictability of Tree-based Machine Learning Algorithms in the Big Data Context

Cited by: 5
Authors
Qolipour, F. [1 ]
Ghasemzadeh, M. [1 ]
Mohammad-Karimi, N. [1 ]
Affiliation
[1] Yazd Univ, Dept Comp Engn, Yazd, Iran
Source
INTERNATIONAL JOURNAL OF ENGINEERING | 2021, Vol. 34, No. 01
Keywords
Stock Market; Big Data; Prediction; Machine Learning; Tree-based Algorithms; Ensemble Algorithms; PRICE; DIRECTION; RETURNS;
DOI
10.5829/ije.2021.34.01a.10
Chinese Library Classification (CLC) number
T [Industrial Technology];
Subject classification code
08 ;
Abstract
This research work is concerned with the predictability of ensemble and singular tree-based machine learning algorithms during the recession and prosperity of two companies listed on the Tehran Stock Exchange, in the context of big data. The main issue is that economic managers and the academic community require prediction models with higher accuracy and shorter execution time; moreover, predicting a company's recession in the stock market is highly significant. Machine learning algorithms must be able to appropriately predict the stock return sign during both market downturn and boom days. Addressing this challenge will improve the quality of stock purchases and, subsequently, increase profitability. In this article, the proposed solution relies on tree-based machine learning algorithms in the context of big data. It exploits the decision tree algorithm, which is a traditional, singular tree-based learning algorithm. Furthermore, two modern ensemble tree-based learning algorithms, random forest and gradient boosted tree, have been utilized for predicting the stock return sign during recession and prosperity. These cases were implemented using machine learning tools in the Python programming language and the PySpark library, which is used specifically for the big data context. The research data used in the current study are the share information of two companies on the Tehran Stock Exchange. The obtained results reveal that the applied ensemble learning algorithms performed better than the singular learning algorithm. Additionally, adding 23 technical features to the initial data and subsequently applying the PCA feature reduction method demonstrated the best performance among the tested modes. It was also concluded that the initial data do not possess proper resolution or generalizability, either during prosperity or recession.
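The abstract does not define the stock return sign or name the 23 technical features, so the following is only a minimal sketch of how such a label and one common technical feature could be derived from closing prices. The function names `return_signs` and `simple_moving_average`, the choice of a moving average as the feature, the rule that a flat day counts as 0, and the sample prices are all assumptions for illustration, not the authors' method or data.

```python
from typing import List, Optional


def return_signs(closes: List[float]) -> List[int]:
    """Hypothetical label: 1 if the close rose versus the previous day, else 0.

    Treating a flat day as 0 is an assumption; the abstract does not say.
    """
    return [1 if today > prev else 0 for prev, today in zip(closes, closes[1:])]


def simple_moving_average(closes: List[float], window: int) -> List[Optional[float]]:
    """Hypothetical technical feature: trailing mean over `window` days.

    Entries are None until enough history is available.
    """
    out: List[Optional[float]] = []
    for i in range(len(closes)):
        if i + 1 < window:
            out.append(None)
        else:
            out.append(sum(closes[i + 1 - window : i + 1]) / window)
    return out


# Made-up closing prices, not data from the study.
closes = [100.0, 102.0, 101.0, 105.0, 104.0]
signs = return_signs(closes)             # one label per day after the first
sma3 = simple_moving_average(closes, 3)  # one value per day, None while warming up
```

In a setup like the one the abstract describes, columns of this kind would be assembled into a feature table, optionally reduced with PCA, and passed to a tree-based classifier; those steps are omitted here because the abstract gives no details about them.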
Pages: 82-89
Number of pages: 8
Related papers
50 in total
  • [21] Tree-based interpretable machine learning of the thermodynamic phases
    Yang, Jintao
    Cao, Junpeng
    PHYSICS LETTERS A, 2021, 412
  • [22] Runtime Optimizations for Tree-based Machine Learning Models
    Asadi, Nima
    Lin, Jimmy
    de Vries, Arjen P.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (09) : 2281 - 2292
  • [23] Improved tree-based machine learning algorithms combining with bagging strategy for landslide susceptibility modeling
    Tingyu Zhang
    Renata Pacheco Quevedo
    Huanyuan Wang
    Quan Fu
    Dan Luo
    Tao Wang
    Guilherme Garcia de Oliveira
    Laurindo Antonio Guasselli
    Camilo Daleles Renno
    Arabian Journal of Geosciences, 2022, 15 (2)
  • [24] A comparative study of patient and staff safety evaluation using tree-based machine learning algorithms
    Simsekler, Mecit Can Emre
    Rodrigues, Clarence
    Qazi, Abroon
    Ellahham, Samer
    Ozonoff, Al
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2021, 208
  • [25] Tree-based machine learning algorithms in the Internet of Things environment for multivariate flood status prediction
    Aswad, Firas Mohammed
    Kareem, Ali Noori
    Khudhur, Ahmed Mahmood
    Khalaf, Bashar Ahmed
    Mostafa, Salama A.
    JOURNAL OF INTELLIGENT SYSTEMS, 2022, 31 (01) : 1 - 14
  • [26] Pixel-wise classification in graphene-detection with tree-based machine learning algorithms
    Cho, Woon Hyung
    Shin, Jiseon
    Kim, Young Duck
    Jung, George J.
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2022, 3 (04)
  • [27] Tree-based Machine Learning Methods for Survey Research
    Kern, Christoph
    Klausch, Thomas
    Kreuter, Frauke
    SURVEY RESEARCH METHODS, 2019, 13 (01) : 73 - 93
  • [28] Cosmic string detection with tree-based machine learning
    Sadr, A. Vafaei
    Farhang, M.
    Movahed, S. M. S.
    Bassett, B.
    Kunz, M.
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2018, 478 (01) : 1132 - 1140
  • [29] Comparison of Tree-Based Machine Learning Algorithms to Predict Reporting Behavior of Electronic Billing Machines
    Murorunkwere, Belle Fille
    Ihirwe, Jean Felicien
    Kayijuka, Idrissa
    Nzabanita, Joseph
    Haughton, Dominique
    INFORMATION, 2023, 14 (03)
  • [30] Use of tree-based machine learning methods to screen affinitive peptides based on docking data
    Feng, Hua
    Wang, Fangyu
    Li, Ning
    Xu, Qian
    Zheng, Guanming
    Sun, Xuefeng
    Hu, Man
    Li, Xuewu
    Xing, Guangxu
    Zhang, Gaiping
    MOLECULAR INFORMATICS, 2023, 42 (12)