Multilevel Image-Enhanced Sentence Representation Net for Natural Language Inference

被引:0
|
作者
Zhang, Kun [1 ]
Lv, Guangyi [1 ]
Wu, Le [2 ]
Chen, Enhong [1 ]
Liu, Qi [1 ]
Wu, Han [1 ]
Xie, Xing [3 ]
Wu, Fangzhao [3 ]
机构
[1] Univ Sci & Technol China, Sch Comp Sci & Technol, Anhui Prov Key Lab Big Data Anal & Applicat, Hefei 230026, Peoples R China
[2] Hefei Univ Technol, Sch Comp & Informat, Hefei 230029, Peoples R China
[3] Microsoft Res Asia, Social Comp Grp, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Image-enhanced representation; multiple level; natural language inference (NLI); sentence semantic;
D O I
10.1109/TSMC.2019.2932410
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Natural language inference (NLI) task requires an agent to determine the semantic relation between a premise sentence (p) and a hypothesis sentence (h), which demands sufficient understanding about sentences semantic. Due to the issues, such as polysemy, ambiguity, as well as fuzziness of sentences, intense sentence understanding is very challenging. To this end, in this article, we introduce the corresponding image of sentences as reference information for enhancing sentence semantic understanding and representing. Specifically, we first propose an image-enhanced multilevel sentence representation net (IEMLRN), that utilizes the image features from pretrained models for enhancing the sentence semantic understanding at different scales, i.e., lexical-level, phrase-level, and sentence-level. The proposed model advances the performance on NLI tasks by leveraging the pretrained global features of images. However, as these pretrained image features are optimized on specific image classification datasets, they may not show the best performance on NLI tasks. Therefore, we further propose to design an adaptive image feature generator that extracts fine-grained image labels from the corresponding sentences. After that, we extend the IEMLRN to multilevel image-enhanced sentence representation net (MIESR) by utilizing not only the coarse-grained pretrained features of images, but also the fine-grained adaptive features of images. Therefore, sentence semantic can be evaluated and enhanced more comprehensively and precisely. Extensive experiments on two benchmark datasets (SNLI, SICK) clearly show our proposed IEMLRN significantly outperform the state-of-the-art baselines, and our proposed MIESR model achieves the best performance by considering not only the text but also images in an adaptive multigranularities way.
引用
收藏
页码:3781 / 3795
页数:15
相关论文
共 35 条
  • [1] Multilevel Image-Enhanced Sentence Representation Net for Natural Language Inference
    Zhang, Kun
    Lv, Guangyi
    Wu, Le
    Chen, Enhong
    Liu, Qi
    Wu, Han
    Xie, Xing
    Wu, Fangzhao
    [J]. Chen, Enhong (cheneh@ustc.edu.cn), 1600, Institute of Electrical and Electronics Engineers Inc. (51): : 3781 - 3795
  • [2] Image-Enhanced Multi-Level Sentence Representation Net for Natural Language Inference
    Zhang, Kun
    Lv, Guangyi
    Wu, Le
    Chen, Enhong
    Liu, Qi
    Wu, Han
    Wu, Fangzhao
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 747 - 756
  • [3] Natural language inference for Malayalam language using language agnostic sentence representation
    Renjit, Sara
    Idicula, Sumam
    [J]. PEERJ COMPUTER SCIENCE, 2021,
  • [4] Natural language inference for Malayalam language using language agnostic sentence representation
    Renjit, Sara
    Idicula, Sumam
    [J]. PeerJ Computer Science, 2021, 7 : 1 - 25
  • [5] Collecting Diverse Natural Language Inference Problems for Sentence Representation Evaluation
    Poliak, Adam
    Haldar, Aparajita
    Rudinger, Rachel
    Hu, J. Edward
    Pavlick, Ellie
    White, Aaron Steven
    Van Durme, Benjamin
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 67 - 81
  • [6] Natural language as the basis for meaning representation and inference
    Dagan, Ido
    Bar-Haim, Roy
    Szpektor, Idan
    Greental, Iddo
    Shnarchl, Eyal
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2008, 4919 : 151 - +
  • [7] Data and Representation for Turkish Natural Language Inference
    Budur, Emrah
    Ozcelik, Riza
    Gungor, Tunga
    Potts, Christopher
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 8253 - 8267
  • [8] Enhanced LSTM for Natural Language Inference
    Chen, Qian
    Zhu, Xiaodan
    Ling, Zhenhua
    Wei, Si
    Jiang, Hui
    Inkpen, Diana
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1657 - 1668
  • [9] Natural Language Inference Using LSTM Model with Sentence Fusion
    Zhang, Senlin
    Liu, Siyang
    Liu, Meiqin
    [J]. PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 11081 - 11085
  • [10] Dependent Multilevel Interaction Network for Natural Language Inference
    Li, Yun
    Yang, Yan
    Deng, Yong
    Hu, Qinmin Vivian
    Chen, Chengcai
    He, Liang
    Yu, Zhou
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: TEXT AND TIME SERIES, PT IV, 2019, 11730 : 9 - 21