Utilizing Text Mining for Labeling Training Models from Futures Corpus in Generative AI

Authors
Chou, Hsien-Ming [1]
Cho, Tsai-Lun [1, 2, 3]
Affiliations
[1] Chung Yuan Christian Univ, Dept Informat Management, Taoyuan City 320314, Taiwan
[2] Chien Hsin Univ Sci & Technol, Dept Informat Management, Taoyuan City 320678, Taiwan
[3] Natl Tsing Hua Univ, Dept Math, Hsinchu 300044, Taiwan
Source
APPLIED SCIENCES-BASEL | 2023 / Vol. 13 / Issue 17
Keywords
text mining; semantic analysis; labeling bull-bear words; futures corpus; generative AI
DOI
10.3390/app13179622
Chinese Library Classification
O6 [Chemistry]
Discipline classification code
0703
Abstract
For highly time-constrained, very short-term investors, reading and extracting valuable information from financial news poses significant challenges. The wide range of topics covered in these news articles further compounds the difficulties for investors. The diverse content adds complexity and uncertainty to the text, making it arduous for very short-term investors to swiftly and accurately extract valuable insights. Variations between authors, media sources, and cultural backgrounds also introduce additional complexities. Hence, performing a bull-bear semantic analysis of financial news using text mining technologies can alleviate the volume, time, and energy pressures on very short-term investors, while enhancing the efficiency and accuracy of their investment decisions. This study proposes labeling bull-bear words using a futures corpus detection method that extracts valuable information from financial news, allowing investors to quickly understand market trends. Generative AI models are trained to provide real-time bull-bear advice, aiding investors in adapting to market changes and devising effective trading strategies. Experimental results show the effectiveness of various models, with random forest and SVMs achieving an impressive 80% accuracy rate. MLP and deep learning models also perform well. By leveraging these models, the study reduces the time spent reading financial articles, enabling faster decision making and increasing the likelihood of investment success. Future research can explore the application of this method in other domains and enhance model design for improved predictive capabilities and practicality.
Pages: 13