Novel modelling strategies for high-frequency stock trading data

被引:7
|
作者
Zhang, Xuekui [1 ]
Huang, Yuying [1 ,2 ]
Xu, Ke [3 ]
Xing, Li [4 ]
机构
[1] Univ Victoria, Math & Stat Dept, Victoria, BC, Canada
[2] Univ Waterloo, Stat & Actuarial Sci, Waterloo, ON, Canada
[3] Univ Victoria, Econ Dept, Victoria, BC, Canada
[4] Univ Saskatchewan, Math & Stat Dept, Saskatoon, SK, Canada
关键词
High-frequency trading; Machine learning; Mid-price prediction strategy; Raw data processing; Multi-class prediction; Ensemble learning; MARKET; REGULARIZATION;
D O I
10.1186/s40854-022-00431-9
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
Full electronic automation in stock exchanges has recently become popular, generating high-frequency intraday data and motivating the development of near real-time price forecasting methods. Machine learning algorithms are widely applied to mid-price stock predictions. Processing raw data as inputs for prediction models (e.g., data thinning and feature engineering) can primarily affect the performance of the prediction methods. However, researchers rarely discuss this topic. This motivated us to propose three novel modelling strategies for processing raw data. We illustrate how our novel modelling strategies improve forecasting performance by analyzing high-frequency data of the Dow Jones 30 component stocks. In these experiments, our strategies often lead to statistically significant improvement in predictions. The three strategies improve the F1 scores of the SVM models by 0.056, 0.087, and 0.016, respectively.
引用
收藏
页数:25
相关论文
共 50 条
  • [31] Graph-based stock correlation and prediction for high-frequency trading systems
    Yin, Tao
    Liu, Chenzhengyi
    Ding, Fangyu
    Feng, Ziming
    Yuan, Bo
    Zhang, Ning
    PATTERN RECOGNITION, 2022, 122
  • [32] Ensemble properties of high-frequency data and intraday trading rules
    Baldovin, F.
    Camana, F.
    Caporin, M.
    Caraglio, M.
    Stella, A. L.
    QUANTITATIVE FINANCE, 2015, 15 (02) : 231 - 245
  • [33] Review of Statistical Approaches for Modeling High-Frequency Trading Data
    Dutta, Chiranjit
    Ravishanker, Nalini
    Karpman, Kara
    Basu, Sumanta
    SANKHYA-SERIES B-APPLIED AND INTERDISCIPLINARY STATISTICS, 2023, 85 (SUPPL 1): : 1 - 48
  • [34] A model for unpacking big data analytics in high-frequency trading
    Seddon, Jonathan J. J. M.
    Currie, Wendy L.
    JOURNAL OF BUSINESS RESEARCH, 2017, 70 : 300 - 307
  • [35] Review of Statistical Approaches for Modeling High-Frequency Trading Data
    Chiranjit Dutta
    Kara Karpman
    Sumanta Basu
    Nalini Ravishanker
    Sankhya B, 2023, 85 : 1 - 48
  • [36] Nonparametric estimation for high-frequency data incorporating trading information
    Cui, Wenhao
    Hu, Jie
    Wang, Jiandong
    JOURNAL OF ECONOMETRICS, 2024, 240 (01)
  • [37] High-frequency trading model for a complex trading hierarchy
    Podobnik, Boris
    Wang, Duan
    Stanley, H. Eugene
    QUANTITATIVE FINANCE, 2012, 12 (04) : 559 - 566
  • [38] Does high-frequency trading cause stock prices to deviate from fundamental values?
    Jung, Michael
    Kwon, Kyung Yoon
    Park, Hyungshin
    ACCOUNTING AND BUSINESS RESEARCH, 2024, 54 (05) : 580 - 613
  • [39] Reinforcement Learning for Stock Prediction and High-Frequency Trading With T+1 Rules
    Zhang, Weipeng
    Yin, Tao
    Zhao, Yunan
    Han, Bing
    Liu, Huanxi
    IEEE ACCESS, 2023, 11 : 14115 - 14127