How Much Can Machines Learn Finance from Chinese Text Data?

被引:4
|
作者
Zhou, Yang [1 ,2 ]
Fan, Jianqing [3 ,4 ,5 ]
Xue, Lirong [4 ]
机构
[1] Fudan Univ, Inst Big Data, Shanghai 200433, Peoples R China
[2] Fudan Univ, MOE Lab Natl Dev & Intelligent Governance, Shanghai 200433, Peoples R China
[3] Capital Univ Econ & Business, Int Sch Econ & Management, Beijing 100070, Peoples R China
[4] Princeton Univ, Dept Operat Res & Financial Engn, Princeton, NJ 08544 USA
[5] Fudan Univ, Sch Data Sci, Shanghai 200433, Peoples R China
基金
中国国家自然科学基金;
关键词
machine learning; FarmPredict; factor model; sparse regression; textual analysis; INVESTOR SENTIMENT; STOCK; RETURNS; NUMBER; RISK; NEWS;
D O I
10.1287/mnsc.2022.01468
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
How much can we learn finance directly from text data? This paper presents a new framework for learning textual data based on the factor augmentation model and sparsity regularization, called the factor -augmented regularized model for prediction (FarmPredict), to let machines learn financial returns directly from news. FarmPredict allows the model itself to extract information directly from articles without predefined information, such as dictionaries or pretrained models as in most studies. Using unsupervised learned factors to augment the predictors would benefit our method with a "doublerobust" feature: that the machine would learn to balance between individual words or text factors/topics. It also avoids the information loss of factor regression in dimensionality reduction. We apply our model to the Chinese stock market with a large proportion of retail investors by using Chinese news data to predict financial returns. We show that positive sentiments scored by our FarmPredict approach from news generate on average 83 basic points (bps) stock daily excess returns, and negative news has an adverse impact of 26 bps on the days of news announcements, where both effects can last for a few days. This asymmetric effect aligns well with the short -sale constraints in the Chinese equity market. The result shows that the machine -learned prediction does provide sizeable predictive power with an annualized return of 54% at most with a simple investment strategy. Compared with other statistical and machine learning methods, FarmPredict significantly outperforms them on model prediction and portfolio performance. Our study demonstrates the of machines to learn text data.
引用
收藏
页码:8962 / 8987
页数:27
相关论文
共 50 条
  • [21] Seeing and Hearing Egocentric Actions: How Much Can We Learn?
    Cartas, Alejandro
    Luque, Jordi
    Radeva, Petia
    Segura, Carlos
    Dimiccoli, Mariella
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 4470 - 4480
  • [22] Quality improvement around the world: how much we can learn from each other
    Moss, F
    Palmberg, M
    Plsek, P
    Schellekens, W
    [J]. QUALITY IN HEALTH CARE, 2000, 9 (01): : 63 - 66
  • [23] Pathology of plasma cell dyscrasia - How much can we learn from autopsy studies?
    Picken, MM
    [J]. ARCHIVES OF PATHOLOGY & LABORATORY MEDICINE, 2004, 128 (10) : 1082 - +
  • [24] How Much Do Employers Learn from Referrals?
    Pinkston, Joshua C.
    [J]. INDUSTRIAL RELATIONS, 2012, 51 (02): : 317 - 341
  • [25] How much did Irenaeus learn from Justin?
    Slusser, Michael
    [J]. STUDIA PATRISTICA, VOL XL, 2006, 40 : 515 - 520
  • [26] Can machines ever learn from their own mistakes?
    Marriott, S
    Harrison, RF
    [J]. MEASUREMENT & CONTROL, 1997, 30 (10): : 300 - 307
  • [27] Fine-needle aspiration of the liver and pancreas: How much can we learn from the cytopathologist?
    Freed, KS
    Thomas, LO
    Dodd, LG
    Mooney, EE
    Layfield, LJ
    Paulson, EK
    [J]. RADIOLOGY, 1996, 201 : 561 - 561
  • [28] How much can we learn from long-term extension trials in multiple sclerosis?
    Noseworthy, John H.
    [J]. NEUROLOGY, 2006, 67 (06) : 930 - 931
  • [29] How Much Data Is Sufficient to Learn High-Performing Algorithms?
    Balcan, Maria-Florina
    Deblasio, Dan
    Dick, Travis
    Kingsford, Carl
    Sandholm, Tuomas
    Vitercik, Ellen
    [J]. Journal of the ACM, 2024, 71 (05)
  • [30] CAN WE LEARN FROM CHINESE PEOPLE
    GALTUNG, J
    NISHIMURA, F
    [J]. WORLD DEVELOPMENT, 1976, 4 (10-1) : 883 - 888