Multi-model Transfer and Optimization for Cloze Task

被引:0
|
作者
Tang, Jiahao [1 ]
Ling, Long [1 ]
Ma, Chenyu [1 ]
Zhang, Hanwen [1 ]
Huang, Jianqiang [1 ]
机构
[1] Qinghai Univ, Dept Comp Technol & Applicat, Xining, Peoples R China
基金
中国国家自然科学基金;
关键词
NLP; model transfer; adversarial training; cloze task;
D O I
10.1117/12.2579412
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Substantial progress has been made recently in training context-aware language models. CLOTH is a human created cloze dataset, which can better evaluate machine reading comprehension. Although the author of CLOTH has done many experiments on BERT and context2wec, it is still worth studying the performance of other models. We applied the CLOTH dataset to other models and evaluated their performance based on different model mechanisms. The results showed that ALBERT performed well on the cloze task. The accuracy of ALBERT is 92.24%, which is 6.34% higher than the human performance. In addition, we introduce adversarial training into the model. Experiments show that adversarial training has significant effects in improving the robustness and accuracy of the model. On the BERT-large model, the accuracy rate is up to 0.15% after using adversarial training.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Robust stochastic maximum principle for multi-model worst case optimization
    Poznyak, AS
    Duncan, TE
    Pasik-Duncan, B
    Boltyansky, VG
    INTERNATIONAL JOURNAL OF CONTROL, 2002, 75 (13) : 1032 - 1048
  • [32] MULTI-MODEL FEDERATED LEARNING OPTIMIZATION BASED ON MULTI-AGENT REINFORCEMENT LEARNING
    Atapour, S. Kaveh
    Seyedmohammadi, S. Jamal
    Sheikholeslami, S. Mohammad
    Abouei, Jamshid
    Mohammadi, Arash
    Plataniotis, Konstantinos N.
    2023 IEEE 9TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING, CAMSAP, 2023, : 151 - 155
  • [33] Knowledge transfer in evolutionary multi-task optimization: A survey
    Tan, Ziying
    Luo, Linbo
    Zhong, Jinghui
    APPLIED SOFT COMPUTING, 2023, 138
  • [34] A Multi-model Estimation of Distribution Algorithm for Agent Routing Problem in Multi-point Dynamic Task
    Lu, Sai
    Xin, Bin
    Dou, Lihua
    Wang, Ling
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 2468 - 2473
  • [35] Comparison Between Single-Model and Multi-Model Optimization Methods for Multiphysical Design
    Vivier, S.
    Friedrich, G.
    2016 XXII INTERNATIONAL CONFERENCE ON ELECTRICAL MACHINES (ICEM), 2016, : 1399 - 1404
  • [36] Multi-model approach to model selection
    Stoica, P
    Selén, Y
    Jian, L
    DIGITAL SIGNAL PROCESSING, 2004, 14 (05) : 399 - 412
  • [37] Abstract Model for Multi-model Data
    Contos, Pavel
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2021), PT III, 2021, 12683 : 647 - 651
  • [38] The Multi-model Databases - A Review
    Pluciennik, Ewa
    Zgorzalek, Kamil
    BEYOND DATABASES, ARCHITECTURES AND STRUCTURES: TOWARDS EFFICIENT SOLUTIONS FOR DATA ANALYSIS AND KNOWLEDGE REPRESENTATION, 2017, 716 : 141 - 152
  • [39] MULTI-MODEL TRAFFIC MICROSIMULATIONS
    Claes, Rutger
    Holvoet, Tom
    PROCEEDINGS OF THE 2009 WINTER SIMULATION CONFERENCE (WSC 2009 ), VOL 1-4, 2009, : 1093 - 1103
  • [40] Multi-Model Inference in Biogeography
    Millington, James D. A.
    Perry, George L. W.
    GEOGRAPHY COMPASS, 2011, 5 (07): : 448 - 463