A Comparison of Deep Learning Methods for Language Understanding

被引:6
|
作者
Korpusik, Mandy [1 ]
Liu, Zoe [2 ]
Glass, James [2 ]
机构
[1] Loyola Marymount Univ, Los Angeles, CA 90045 USA
[2] MIT, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
来源
关键词
BERT; Semantic Tagging; CNN; RNN; CRF; MODELS;
D O I
10.21437/Interspeech.2019-1262
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
In this paper, we compare a suite of neural networks (recurrent, convolutional, and the recently proposed BERT model) to a CRF with hand-crafted features on three semantic tagging corpora: the Air Travel Information System (ATIS) benchmark, restaurant queries, and written and spoken meal descriptions. Our motivation is to investigate pre-trained BERT's transferability to the domains we are interested in. We demonstrate that neural networks without feature engineering outperform state-of-the-art statistical and deep learning approaches on all three tasks (except written meal descriptions, where the CRF is slightly better) and that deep, attention-based BERT, in particular, surpasses state-of-the-art results on these tasks. Error analysis shows the models are less confident when making errors, enabling the system to follow up with the user when uncertain.
引用
收藏
页码:849 / 853
页数:5
相关论文
共 50 条
  • [31] Development and comparison of deep learning toolkit with other machine learning methods
    Mitrofanov, Artem
    Korotcov, Alexandru
    Tkachenko, Valery
    Ekins, Sean
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 254
  • [32] Improving Spoken Language Understanding with information retrieval and active learning methods
    Jars, Isabelle
    Panaget, Franck
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 5001 - 5004
  • [33] Deep Learning-based Natural Language Processing Methods Comparison for Presumptive Detection of Cyberbullying in Social Networks
    Andrade-Segarra, Diego A.
    Leon-Paredes, Gabriel A.
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (05) : 796 - 803
  • [34] Spam SMS Detection for Turkish Language with Deep Text Analysis and Deep Learning Methods
    Onur Karasoy
    Serkan Ballı
    [J]. Arabian Journal for Science and Engineering, 2022, 47 : 9361 - 9377
  • [35] Spam SMS Detection for Turkish Language with Deep Text Analysis and Deep Learning Methods
    Karasoy, Onur
    Balli, Serkan
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (08) : 9361 - 9377
  • [36] Understanding Deep Learning
    Wang, Ge
    [J]. NATURE MACHINE INTELLIGENCE, 2024, 6 (05) : 502 - 503
  • [37] A short review of deep learning methods for understanding group and crowd activities
    Felipe Borja-Borja, Luis
    Saval-Calvo, Marcelo
    Azorin-Lopez, Jorge
    [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [38] Understanding multilingual four-scene comics with deep learning methods
    Chen, Jiali
    Iwasaki, Ryo
    Mori, Naoki
    Okada, Makoto
    Ueno, Miki
    [J]. 2019 INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION WORKSHOPS (ICDARW) AND 13TH IAPR INTERNATIONAL WORKSHOP ON GRAPHICS RECOGNITION (GREC 2019), VOL 1, 2019, : 32 - 37
  • [39] Understanding Naturalistic Facial Expressions with Deep Learning and Multimodal Large Language Models
    Bian, Yifan
    Kuester, Dennis
    Liu, Hui
    Krumhuber, Eva G.
    [J]. SENSORS, 2024, 24 (01)
  • [40] An Effective Natural Language Understanding Model using Deep Learning and PyDial Toolkit
    Ganesan, Karthik
    Patil, Akhilesh P.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2017, : 810 - 816