PAL-BERT: An Improved Question Answering Model

被引:51
|
作者
Zheng, Wenfeng [1 ]
Lu, Siyu [1 ]
Cai, Zhuohang [1 ]
Wang, Ruiyang [1 ]
Wang, Lei [2 ]
Yin, Lirong [2 ]
机构
[1] Univ Elect Sci & Technol China, Sch Automat, Chengdu 610054, Peoples R China
[2] Louisiana State Univ, Dept Geog & Anthropol, Baton Rouge, LA 70803 USA
来源
关键词
PAL-BERT; question answering model; pretraining language models; ALBERT; pruning model; network pruning; TextCNN; BiLSTM;
D O I
10.32604/cmes.2023.046692
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In the field of natural language processing (NLP), there have been various pre-training language models in recent years, with question answering systems gaining significant attention. However, as algorithms, data, and computing power advance, the issue of increasingly larger models and a growing number of parameters has surfaced. Consequently, model training has become more costly and less efficient. To enhance the efficiency and accuracy of the training process while reducing the model volume, this paper proposes a first-order pruning model PAL-BERT based on the ALBERT model according to the characteristics of question-answering (QA) system and language model. Firstly, a first-order network pruning method based on the ALBERT model is designed, and the PAL-BERT model is formed. Then, the parameter optimization strategy of the PAL-BERT model is formulated, and the Mish function was used as an activation function instead of ReLU to improve the performance. Finally, after comparison experiments with traditional deep learning models TextCNN and BiLSTM, it is confirmed that PALBERT is a pruning model compression method that can significantly reduce training time and optimize training efficiency. Compared with traditional models, PAL-BERT significantly improves the NLP task's performance.
引用
下载
收藏
页码:2729 / 2745
页数:17
相关论文
共 50 条
  • [31] BB-KBQA: BERT-Based Knowledge Base Question Answering
    Liu, Aiting
    Huang, Ziqi
    Lu, Hengtong
    Wang, Xiaojie
    Yuan, Caixia
    CHINESE COMPUTATIONAL LINGUISTICS, CCL 2019, 2019, 11856 : 81 - 92
  • [32] Chop Chop BERT: Visual Question Answering by Chopping VisualBERT's Heads
    Gao, Chenyu
    Zhu, Qi
    Wang, Peng
    Wu, Qi
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 664 - 670
  • [33] Fine-Tuning BERT for Question and Answering Using PubMed Abstract Dataset
    Cheon, Saeyeon
    Ahn, Insung
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 681 - 684
  • [34] Improved Question Answering System by semantic refomulation
    Umamehaswari, Muthukrishanan
    Ramprasath, Muthukrishnan
    Hariharan, Shanmugasundaram
    2012 FOURTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC), 2012,
  • [35] BEQAIN: An Effective and Efficient Identifier Normalization Approach With BERT and the Question Answering System
    Zhang, Jingxuan
    Liu, Siyuan
    Gong, Lina
    Zhang, Haoxiang
    Huang, Zhiqiu
    Jiang, He
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (04) : 2597 - 2620
  • [36] An open domain question answering system based on improved system similarity model
    Zhao, Yu-Ming
    Xu, Zhi-Ming
    Guan, Yi
    Wang, Xiao-Long
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 4521 - +
  • [37] Image captioning improved visual question answering
    Sharma, Himanshu
    Jalal, Anand Singh
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (24) : 34775 - 34796
  • [38] Question reformulation based question answering environment model
    Ali I.
    Yadav D.
    International Journal of Information Technology, 2021, 13 (1) : 59 - 67
  • [39] A RETRIEVAL MODEL FOR QUESTION IN COMMUNITY QUESTION ANSWERING SYSTEM
    Sun, Yueping
    Wang, Xiaojie
    Liu, Song
    Yuan, Caixia
    Wang, Xuwen
    2012 IEEE 2nd International Conference on Cloud Computing and Intelligent Systems (CCIS) Vols 1-3, 2012, : 1534 - 1539
  • [40] Improved Cross-Lingual Question Retrieval for Community Question Answering
    Ruckle, Andreas
    Swarnkar, Krishnkant
    Gurevych, Iryna
    WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 3179 - 3186