Utilizing Text Mining for Labeling Training Models from Futures Corpus in Generative AI

被引:0
|
作者
Chou, Hsien-Ming [1 ]
Cho, Tsai-Lun [1 ,2 ,3 ]
机构
[1] Chung Yuan Christian Univ, Dept Informat Management, Taoyuan City 320314, Taiwan
[2] Chien Hsin Univ Sci & Technol, Dept Informat Management, Taoyuan City 320678, Taiwan
[3] Natl Tsing Hua Univ, Dept Math, Hsinchu 300044, Taiwan
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 17期
关键词
text mining; semantic analysis; labeling bull-bear words; futures corpus; generative AI;
D O I
10.3390/app13179622
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
For highly time-constrained, very short-term investors, reading and extracting valuable information from financial news poses significant challenges. The wide range of topics covered in these news articles further compounds the difficulties for investors. The diverse content adds complexity and uncertainty to the text, making it arduous for very short-term investors to swiftly and accurately extract valuable insights. Variations between authors, media sources, and cultural backgrounds also introduce additional complexities. Hence, performing a bull-bear semantic analysis of financial news using text mining technologies can alleviate the volume, time, and energy pressures on very short-term investors, while enhancing the efficiency and accuracy of their investment decisions. This study proposes labeling bull-bear words using a futures corpus detection method that extracts valuable information from financial news, allowing investors to quickly understand market trends. Generative AI models are trained to provide real-time bull-bear advice, aiding investors in adapting to market changes and devising effective trading strategies. Experimental results show the effectiveness of various models, with random forest and SVMs achieving an impressive 80% accuracy rate. MLP and deep learning models also perform well. By leveraging these models, the study reduces the time spent reading financial articles, enabling faster decision making and increasing the likelihood of investment success. Future research can explore the application of this method in other domains and enhance model design for improved predictive capabilities and practicality.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Utilizing Vector Space Models for Identifying Legal Factors from Text
    Falakmasir, Mohammad H.
    Ashley, Kevin D.
    LEGAL KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 302 : 183 - 192
  • [32] Review of Perfectionism Research From 1990 to 2019 Utilizing a Text-Mining Approach
    Suh, Hanna
    Kim, Seoyoung
    Lee, Dong-gwi
    REVIEW OF GENERAL PSYCHOLOGY, 2021, 25 (03) : 283 - 303
  • [33] Generative Feature Language Models for Mining Implicit Features from Customer Reviews
    Santu, Shubhra Kanti Karmaker
    Sondhi, Parikshit
    Zhai, ChengXiang
    CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 929 - 938
  • [34] Refining the extraction of relevant documents from biomedical literature to create a corpus for pathway text mining
    Harte, R
    Lu, Y
    Osborn, S
    Dehoney, D
    Chin, D
    PROCEEDINGS OF THE 2003 IEEE BIOINFORMATICS CONFERENCE, 2003, : 644 - 645
  • [35] A Comparison of Two Models for Mining Named Entity Pairs from Web Corpus
    Yao Jianmin
    Guo Lei
    Zhou Meiling
    Zhang Jing
    ICCSE 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION: ADVANCED COMPUTER TECHNOLOGY, NEW EDUCATION, 2008, : 108 - 110
  • [36] DISCo: Distilled Student Models Co-training for Semi-supervised Text Mining
    Jiang, Weifeng
    Mao, Qianren
    Lin, Chenghua
    Li, Jianxin
    Deng, Ting
    Yang, Weiyi
    Wang, Zheng
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 4015 - 4030
  • [37] Interpreting text corpora from androids-related stories using large language models: “Machines like me” by Ian McEwan in generative AI
    Simona-Vasilica Oprea
    Adela Bâra
    Humanities and Social Sciences Communications, 12 (1):
  • [38] Training Generative Models From Privatized Data via Entropic Optimal Transport
    Reshetova, Daria
    Chen, Wei-Ning
    Ozgur, Ayfer
    IEEE JOURNAL ON SELECTED AREAS IN INFORMATION THEORY, 2024, 5 : 221 - 235
  • [39] Data Mining from Free-Text Health Records: State of the Art, New Polish Corpus
    Anetta, Kristof
    RECENT ADVANCES IN SLAVONIC NATURAL LANGUAGE PROCESSING (RASLAN 2020), 2020, : 13 - 22
  • [40] Bridging generational gaps in medication safety: insights from nurses, students, and generative AI models
    Brurya Orkaby
    Erika Kerner
    Mor Saban
    Chedva Levin
    BMC Nursing, 24 (1)