Multi-model Transfer and Optimization for Cloze Task

被引:0
|
作者
Tang, Jiahao [1 ]
Ling, Long [1 ]
Ma, Chenyu [1 ]
Zhang, Hanwen [1 ]
Huang, Jianqiang [1 ]
机构
[1] Qinghai Univ, Dept Comp Technol & Applicat, Xining, Peoples R China
基金
中国国家自然科学基金;
关键词
NLP; model transfer; adversarial training; cloze task;
D O I
10.1117/12.2579412
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Substantial progress has been made recently in training context-aware language models. CLOTH is a human created cloze dataset, which can better evaluate machine reading comprehension. Although the author of CLOTH has done many experiments on BERT and context2wec, it is still worth studying the performance of other models. We applied the CLOTH dataset to other models and evaluated their performance based on different model mechanisms. The results showed that ALBERT performed well on the cloze task. The accuracy of ALBERT is 92.24%, which is 6.34% higher than the human performance. In addition, we introduce adversarial training into the model. Experiments show that adversarial training has significant effects in improving the robustness and accuracy of the model. On the BERT-large model, the accuracy rate is up to 0.15% after using adversarial training.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Multi-criteria multi-model design optimization
    Bestle, D
    Eberhard, P
    IUTAM SYMPOSIUM ON OPTIMIZATION OF MECHANICAL SYSTEMS, 1996, 43 : 33 - 40
  • [2] Skill transfer improved with a multi-model approach
    Nakawaki, DE
    Joo, S
    Miyazaki, F
    ADVANCED ROBOTICS, 2000, 14 (05) : 371 - 375
  • [3] Linear multi-model time-optimization
    Boltyanski, V
    Poznyak, A
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2002, 23 (03): : 141 - 161
  • [4] A multi-model approach to intravenous filter optimization
    Vassilevski, Y. V.
    Simakov, S. S.
    Kapranov, S. A.
    INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN BIOMEDICAL ENGINEERING, 2010, 26 (07) : 915 - 925
  • [5] Mass Transport Based Multi-Model Color Transfer
    Yantao LIU
    Shengjing TIAN
    Meng LIU
    Xiuping LIU
    JournalofMathematicalResearchwithApplications, 2017, 37 (01) : 119 - 126
  • [6] A Multi-Model Power Estimation Engine for Accuracy Optimization
    Klein, Felipe
    Araujo, G.
    Azevedo, Rodolfo
    Leao, Roberto
    dos Santos, Luiz C. V.
    ISLPED'07: PROCEEDINGS OF THE 2007 INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, 2007, : 280 - 285
  • [7] A Global Optimization Approach to Robust Multi-Model Fitting
    Yu, Jin
    Chin, Tat-Jun
    Suter, David
    2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011,
  • [8] Multi-model Optimization with Discounted Reward and Budget Constraint
    Shi, Jixuan
    Chen, Mei
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON MATHEMATICS AND ARTIFICIAL INTELLIGENCE (ICMAI 2018), 2018, : 10 - 14
  • [9] Optimization of multi-model ensemble forecasting of typhoon waves
    Pan, Shun-qi
    Fan, Yang-ming
    Chen, Jia-ming
    Kao, Chia-chuen
    WATER SCIENCE AND ENGINEERING, 2016, 9 (01) : 52 - 57
  • [10] Optimization of multi-model ensemble forecasting of typhoon waves
    Shun-qi Pan
    Yang-ming Fan
    Jia-ming Chen
    Chia-chuen Kao
    Water Science and Engineering, 2016, 9 (01) : 52 - 57