PAL-BERT: An Improved Question Answering Model

被引：51

作者：

Zheng, Wenfeng ^{[1
]}

Lu, Siyu ^{[1
]}

Cai, Zhuohang ^{[1
]}

Wang, Ruiyang ^{[1
]}

Wang, Lei ^{[2
]}

Yin, Lirong ^{[2
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Automat, Chengdu 610054, Peoples R China

[2] Louisiana State Univ, Dept Geog & Anthropol, Baton Rouge, LA 70803 USA

来源：

CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES | 2024年 / 139卷 / 03期

关键词：

PAL-BERT; question answering model; pretraining language models; ALBERT; pruning model; network pruning; TextCNN; BiLSTM;

D O I：

10.32604/cmes.2023.046692

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

In the field of natural language processing (NLP), there have been various pre-training language models in recent years, with question answering systems gaining significant attention. However, as algorithms, data, and computing power advance, the issue of increasingly larger models and a growing number of parameters has surfaced. Consequently, model training has become more costly and less efficient. To enhance the efficiency and accuracy of the training process while reducing the model volume, this paper proposes a first-order pruning model PAL-BERT based on the ALBERT model according to the characteristics of question-answering (QA) system and language model. Firstly, a first-order network pruning method based on the ALBERT model is designed, and the PAL-BERT model is formed. Then, the parameter optimization strategy of the PAL-BERT model is formulated, and the Mish function was used as an activation function instead of ReLU to improve the performance. Finally, after comparison experiments with traditional deep learning models TextCNN and BiLSTM, it is confirmed that PALBERT is a pruning model compression method that can significantly reduce training time and optimize training efficiency. Compared with traditional models, PAL-BERT significantly improves the NLP task's performance.

引用

下载

页码：2729 / 2745

页数：17

共 50 条

[31] BB-KBQA: BERT-Based Knowledge Base Question Answering
Liu, Aiting
Huang, Ziqi
Lu, Hengtong
Wang, Xiaojie
Yuan, Caixia
CHINESE COMPUTATIONAL LINGUISTICS, CCL 2019, 2019, 11856 : 81 - 92
[32] Chop Chop BERT: Visual Question Answering by Chopping VisualBERT's Heads
Gao, Chenyu
Zhu, Qi
Wang, Peng
Wu, Qi
PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 664 - 670
[33] Fine-Tuning BERT for Question and Answering Using PubMed Abstract Dataset
Cheon, Saeyeon
Ahn, Insung
PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 681 - 684
[34] Improved Question Answering System by semantic refomulation
Umamehaswari, Muthukrishanan
Ramprasath, Muthukrishnan
Hariharan, Shanmugasundaram
2012 FOURTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC), 2012,
[35] BEQAIN: An Effective and Efficient Identifier Normalization Approach With BERT and the Question Answering System
Zhang, Jingxuan
Liu, Siyuan
Gong, Lina
Zhang, Haoxiang
Huang, Zhiqiu
Jiang, He
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (04) : 2597 - 2620
[36] An open domain question answering system based on improved system similarity model
Zhao, Yu-Ming
Xu, Zhi-Ming
Guan, Yi
Wang, Xiao-Long
PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 4521 - +
[37] Image captioning improved visual question answering
Sharma, Himanshu
Jalal, Anand Singh
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (24) : 34775 - 34796
[38] Question reformulation based question answering environment model
Ali I.
Yadav D.
International Journal of Information Technology, 2021, 13 (1) : 59 - 67
[39] A RETRIEVAL MODEL FOR QUESTION IN COMMUNITY QUESTION ANSWERING SYSTEM
Sun, Yueping
Wang, Xiaojie
Liu, Song
Yuan, Caixia
Wang, Xuwen
2012 IEEE 2nd International Conference on Cloud Computing and Intelligent Systems (CCIS) Vols 1-3, 2012, : 1534 - 1539
[40] Improved Cross-Lingual Question Retrieval for Community Question Answering
Ruckle, Andreas
Swarnkar, Krishnkant
Gurevych, Iryna
WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 3179 - 3186

← 1 2 3 4 5 →