Neural Data Augmentation for Legal Overruling Task: Small Deep Learning Models vs. Large Language Models

被引:0
|
作者
Sheik, Reshma [1 ]
Sundara, K. P. Siva [2 ]
Nirmala, S. Jaya [1 ]
机构
[1] Natl Inst Technol, Dept Comp Sci & Engn, Tiruchirapalli 620015, India
[2] Coimbatore Inst Technol, Dept Elect & Commun Engn, Coimbatore 641013, India
关键词
Deep learning; Natural language processing; Data augmentation; Legal overruling task; Transformer; Few-shot; GPT-3; Large language models;
D O I
10.1007/s11063-024-11574-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning models produce impressive results in any natural language processing applications when given a better learning strategy and trained with large labeled datasets. However, the annotation of massive training data is far too expensive, especially in the legal domain, due to the need for trained legal professionals. Data augmentation solves the problem of learning without labeled big data. In this paper, we employ pre-trained language models and prompt engineering to generate large-scale pseudo-labeled data for the legal overruling task using 100 data samples. We train small recurrent and convolutional deep-learning models using this data and fine-tune a few other transformer models. We then evaluate the effectiveness of the models, both with and without data augmentation, using the benchmark dataset and analyze the results. We also test the performance of these models with the state-of-the-art GPT-3 model under few-shot setting. Our experimental findings demonstrate that data augmentation results in better model performance in the legal overruling task than models trained without augmentation. Furthermore, our best-performing deep learning model trained on augmented data outperforms the few-shot GPT-3 by 18% in the F1-score. Additionally, our results highlight that the small neural networks trained with augmented data achieve outcomes comparable to those of other large language models.
引用
下载
收藏
页数:21
相关论文
共 50 条
  • [1] Neural Data Augmentation for Legal Overruling Task: Small Deep Learning Models vs. Large Language Models
    Reshma Sheik
    K. P. Siva Sundara
    S. Jaya Nirmala
    Neural Processing Letters, 56
  • [2] Mapping Landslides on EO Data: Performance of Deep Learning Models vs. Traditional Machine Learning Models
    Prakash, Nikhil
    Manconi, Andrea
    Loew, Simon
    REMOTE SENSING, 2020, 12 (03)
  • [3] Detecting Data Races in OpenMP with Deep Learning and Large Language Models
    Alsofyani, May
    Wang, Liqiang
    53RD INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2024, 2024, : 96 - 103
  • [4] Knowledge Augmentation and Task Planning in Large Language Models for Dexterous Grasping
    Li, Hui
    Tran, Dang
    Zhang, Xinyu
    He, Hongsheng
    2023 IEEE-RAS 22ND INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS, HUMANOIDS, 2023,
  • [5] Survey on Videos Data Augmentation for Deep Learning Models
    Cauli, Nino
    Recupero, Diego Reforgiato
    FUTURE INTERNET, 2022, 14 (03)
  • [6] A Bayesian Data Augmentation Approach for Learning Deep Models
    Toan Tran
    Trung Pham
    Carneiro, Gustavo
    Palmer, Lyle
    Reid, Ian
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [7] Deep Learning for Genomics: From Early Neural Nets to Modern Large Language Models
    Yue, Tianwei
    Wang, Yuanxin
    Zhang, Longxiang
    Gu, Chunming
    Xue, Haoru
    Wang, Wenping
    Lyu, Qi
    Dun, Yujie
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2023, 24 (21)
  • [8] Deep Generative Models for Data Synthesis and Augmentation in Machine Learning
    Adavala, Kiran Mayee
    Vhatkar, Sangeeta
    Ruprah, Taranpreet Singh
    Bhatia, Sukhwinder Kaur
    Kumar, Vipin
    Sharma, Dharmendra
    Praveen, B. Shyam
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (03) : 1242 - 1249
  • [9] Sparse Signal Models for Data Augmentation in Deep Learning ATR
    Agarwal, Tushar
    Sugavanam, Nithin
    Ertin, Emre
    REMOTE SENSING, 2023, 15 (16)
  • [10] Data Augmentation for Time Series Classification with Deep Learning Models
    Pialla, Gautier
    Devanne, Maxime
    Weber, Jonathan
    Idoumghar, Lhassane
    Forestier, Germain
    ADVANCED ANALYTICS AND LEARNING ON TEMPORAL DATA, AALTD 2022, 2023, 13812 : 117 - 132