The Predictability of Tree-based Machine Learning Algorithms in the Big Data Context

被引:5
|
作者
Qolipour, F. [1 ]
Ghasemzadeh, M. [1 ]
Mohammad-Karimi, N. [1 ]
机构
[1] Yazd Univ, Dept Comp Engn, Yazd, Iran
来源
INTERNATIONAL JOURNAL OF ENGINEERING | 2021年 / 34卷 / 01期
关键词
Stock Market; Big Data; Prediction; Machine Learning; Tree-based Algorithms; Ensemble Algorithms; PRICE; DIRECTION; RETURNS;
D O I
10.5829/ije.2021.34.01a.10
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This research work is concerned with the predictability of ensemble and singular tree-based machine learning algorithms during the recession and prosperity of the two companies listed in the Tehran Stock Exchange in the context of big data. In this regard, the main issue is that economic managers and the academic community require predicting models with more accuracy and reduced execution time; moreover, the prediction of the companies recession in the stock market is highly significant. Machine learning algorithms must be able to appropriately predict the stock return sign during the market downturn and boom days. Addressing the stated challenge will upgrade the quality of stock purchases and, subsequently, will increase profitability. In this article, the proposed solution relies on the utilization of tree-based machine learning algorithms in the context of big data. The proposed solution exploits the decision tree algorithm, which is a traditional and singular tree-based learning algorithm. Furthermore, two modern and ensemble tree-based learning algorithms, random forest and gradient boosted tree, has been utilized for predicting the stock return sign during recession and prosperity. The mentioned cases were implemented by applying the machine learning tools in python programming language and PYSPARK library that is used explicitly for the big data context. The utilized research data of the current study are the shares information of two companies of the Tehran Stock Exchange. The obtained results reveal that the applied ensemble learning algorithms have performed better than the singular learning algorithms. Additionally, adding 23 technical features to the initial data and subsequent applying of the PCA feature reduction method have demonstrated the best performance among other modes. In the meantime, it has been concluded that the initial data do not possess the proper resolution or generalizability, either during prosperity or recession.
引用
收藏
页码:82 / 89
页数:8
相关论文
共 50 条
  • [31] Modelling 5G Data Using Tree-Based Machine Learning Models
    Kumar, P. Mithillesh
    Supriya, M.
    INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, ICICC 2022, VOL 1, 2023, 473 : 81 - 90
  • [32] Differentially Private Tree-Based Contextual Online Learning for Service Big Data Selection in IoT
    Zhao, Weiguang
    Chen, Mingxuan
    Mu, Difan
    Zhou, Pan
    Wang, Kehao
    2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
  • [33] Performance Analysis of Machine Learning Algorithms for Big Data Classification: ML and Al-Based Algorithms for Big Data Analysis
    Punia, Sanjeev Kumar
    Kumar, Manoj
    Stephan, Thompson
    Deverajan, Ganesh Gopal
    Patan, Rizwan
    INTERNATIONAL JOURNAL OF E-HEALTH AND MEDICAL COMMUNICATIONS, 2021, 12 (04) : 60 - 75
  • [34] PGAS Data Structure for Unbalanced Tree-Based Algorithms at Scale
    Helbecque, Guillaume
    Carneiro, Tiago
    Melab, Nouredine
    Gmys, Jan
    Bouvry, Pascal
    COMPUTATIONAL SCIENCE, ICCS 2024, PT III, 2024, 14834 : 103 - 111
  • [35] Machine learning algorithms for oncology big data treatment
    Mohammed, Zouiten
    ICCWCS'17: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTING AND WIRELESS COMMUNICATION SYSTEMS, 2017,
  • [36] Streaming Machine Learning Algorithms with Big Data Systems
    Abeykoon, Vibhatha
    Kamburugamuve, Supun
    Govindrarajan, Kannan
    Wickramasinghe, Pulasthi
    Widanage, Chathura
    Perera, Niranda
    Uyar, Ahmet
    Gunduz, Gurhan
    Akkas, Selahattin
    Von Laszewski, Gregor
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 5661 - 5666
  • [37] A SURVEY OF MACHINE LEARNING ALGORITHMS FOR BIG DATA ANALYTICS
    Athmaja, S.
    Hanumanthappa, M.
    Kavitha, Vasantha
    2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2017,
  • [38] Iceberg-seabed interaction evaluation in clay seabed using tree-based machine learning algorithms
    Azimi, Hamed
    Shiri, Hodjat
    Mahdianpari, Masoud
    JOURNAL OF PIPELINE SCIENCE AND ENGINEERING, 2022, 2 (04):
  • [39] Intrusion Detection and Identification Using Tree-Based Machine Learning Algorithms on DCS Network in the Oil Refinery
    Kim, Kyoung Ho
    Kwak, Byung Il
    Han, Mee Lan
    Kim, Huy Kang
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2022, 37 (06) : 4673 - 4682
  • [40] Modeling interfacial tension of surfactant–hydrocarbon systems using robust tree-based machine learning algorithms
    Ali Rashidi-Khaniabadi
    Elham Rashidi-Khaniabadi
    Behnam Amiri-Ramsheh
    Mohammad-Reza Mohammadi
    Abdolhossein Hemmati-Sarapardeh
    Scientific Reports, 13