Utilizing Text Mining for Labeling Training Models from Futures Corpus in Generative AI

被引:0
|
作者
Chou, Hsien-Ming [1 ]
Cho, Tsai-Lun [1 ,2 ,3 ]
机构
[1] Chung Yuan Christian Univ, Dept Informat Management, Taoyuan City 320314, Taiwan
[2] Chien Hsin Univ Sci & Technol, Dept Informat Management, Taoyuan City 320678, Taiwan
[3] Natl Tsing Hua Univ, Dept Math, Hsinchu 300044, Taiwan
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 17期
关键词
text mining; semantic analysis; labeling bull-bear words; futures corpus; generative AI;
D O I
10.3390/app13179622
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
For highly time-constrained, very short-term investors, reading and extracting valuable information from financial news poses significant challenges. The wide range of topics covered in these news articles further compounds the difficulties for investors. The diverse content adds complexity and uncertainty to the text, making it arduous for very short-term investors to swiftly and accurately extract valuable insights. Variations between authors, media sources, and cultural backgrounds also introduce additional complexities. Hence, performing a bull-bear semantic analysis of financial news using text mining technologies can alleviate the volume, time, and energy pressures on very short-term investors, while enhancing the efficiency and accuracy of their investment decisions. This study proposes labeling bull-bear words using a futures corpus detection method that extracts valuable information from financial news, allowing investors to quickly understand market trends. Generative AI models are trained to provide real-time bull-bear advice, aiding investors in adapting to market changes and devising effective trading strategies. Experimental results show the effectiveness of various models, with random forest and SVMs achieving an impressive 80% accuracy rate. MLP and deep learning models also perform well. By leveraging these models, the study reduces the time spent reading financial articles, enabling faster decision making and increasing the likelihood of investment success. Future research can explore the application of this method in other domains and enhance model design for improved predictive capabilities and practicality.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images
    Zhu, Junchen
    Yang, Huan
    He, Huiguo
    Wang, Wenjing
    Tuo, Zixi
    Cheng, Wen-Huang
    Gao, Lianli
    Song, Jingkuan
    Fu, Jianlong
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 9313 - 9319
  • [42] From text to treatment: the crucial role of validation for generative large language models in health care
    de Hond, Anne
    Leeuwenberg, Tuur
    Bartels, Richard
    van Buchem, Marieke
    Kant, Ilse
    Moons, Karel G. M.
    van Smeden, Maarten
    LANCET DIGITAL HEALTH, 2024, 6 (07): : e441 - e443
  • [43] Utilizing Hubel Wiesel Models for Semantic Associations and Topics Extraction from Unstructured Text
    Tiwari, Sandeep
    Ramanathan, Kiruthika
    2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 892 - 898
  • [44] Large-scale text analysis using generative language models: A case study in discovering public value expressions in AI patents
    Pelaez, Sergio
    Verma, Gaurav
    Ribeiro, Barbara
    Shapira, Philip
    QUANTITATIVE SCIENCE STUDIES, 2024, 5 (01): : 153 - 169
  • [45] Mining Japanese-Vietnamese multi-level parallel text corpus from Wikipedia data resource
    Thi-Ngoc-Diep Do
    2021 RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES (RIVF 2021), 2021, : 31 - 36
  • [46] Features level sentiment mining in enterprise systems from informal text corpus using machine learning techniques
    Panigrahi, Ritanjali
    Bele, Nishikant
    Panigrahi, Prabin Kumar
    Gupta, Brij B.
    ENTERPRISE INFORMATION SYSTEMS, 2024, 18 (05)
  • [47] Synthetic and privacy-preserving traffic trace generation using generative AI models for training Network Intrusion Detection Systems
    Aceto, Giuseppe
    Giampaolo, Fabio
    Guida, Ciro
    Izzo, Stefano
    Pescape, Antonio
    Piccialli, Francesco
    Prezioso, Edoardo
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2024, 229
  • [48] A Text-Based Predictive Maintenance Approach for Facility Management Requests Utilizing Association Rule Mining and Large Language Models
    Lowin, Maximilian
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2024, 6 (01): : 233 - 258
  • [49] From RGB-D Images to RGB Images: Single Labeling for Mining Visual Models
    Zhang, Quanshi
    Song, Xuan
    Shao, Xiaowei
    Zhao, Huijing
    Shibasaki, Ryosuke
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2015, 6 (02)
  • [50] Integrating AI-powered text mining from PubTator into the manual curation workflow at the Comparative Toxicogenomics Database
    Wiegers, Thomas C.
    Davis, Allan Peter
    Wiegers, Jolene
    Sciaky, Daniela
    Barkalow, Fern
    Wyatt, Brent
    Strong, Melissa
    Mcmorran, Roy
    Abrar, Sakib
    Mattingly, Carolyn J.
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2025, 2025