A Comparison of Deep Learning Methods for Language Understanding

被引：6

作者：

Korpusik, Mandy ^{[1
]}

Liu, Zoe ^{[2
]}

Glass, James ^{[2
]}

机构：

[1] Loyola Marymount Univ, Los Angeles, CA 90045 USA

[2] MIT, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA

来源：

INTERSPEECH 2019 | 2019年

关键词：

BERT; Semantic Tagging; CNN; RNN; CRF; MODELS;

D O I：

10.21437/Interspeech.2019-1262

中图分类号：

R36 [病理学]; R76 [耳鼻咽喉科学];

学科分类号：

100104 ; 100213 ;

摘要：

In this paper, we compare a suite of neural networks (recurrent, convolutional, and the recently proposed BERT model) to a CRF with hand-crafted features on three semantic tagging corpora: the Air Travel Information System (ATIS) benchmark, restaurant queries, and written and spoken meal descriptions. Our motivation is to investigate pre-trained BERT's transferability to the domains we are interested in. We demonstrate that neural networks without feature engineering outperform state-of-the-art statistical and deep learning approaches on all three tasks (except written meal descriptions, where the CRF is slightly better) and that deep, attention-based BERT, in particular, surpasses state-of-the-art results on these tasks. Error analysis shows the models are less confident when making errors, enabling the system to follow up with the user when uncertain.

引用

页码：849 / 853

页数：5

共 50 条

[31] Development and comparison of deep learning toolkit with other machine learning methods
Mitrofanov, Artem
Korotcov, Alexandru
Tkachenko, Valery
Ekins, Sean
[J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 254
[32] Improving Spoken Language Understanding with information retrieval and active learning methods
Jars, Isabelle
Panaget, Franck
[J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 5001 - 5004
[33] Deep Learning-based Natural Language Processing Methods Comparison for Presumptive Detection of Cyberbullying in Social Networks
Andrade-Segarra, Diego A.
Leon-Paredes, Gabriel A.
[J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (05) : 796 - 803
[34] Spam SMS Detection for Turkish Language with Deep Text Analysis and Deep Learning Methods
Onur Karasoy
Serkan Ballı
[J]. Arabian Journal for Science and Engineering, 2022, 47 : 9361 - 9377
[35] Spam SMS Detection for Turkish Language with Deep Text Analysis and Deep Learning Methods
Karasoy, Onur
Balli, Serkan
[J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (08) : 9361 - 9377
[36] Understanding Deep Learning
Wang, Ge
[J]. NATURE MACHINE INTELLIGENCE, 2024, 6 (05) : 502 - 503
[37] A short review of deep learning methods for understanding group and crowd activities
Felipe Borja-Borja, Luis
Saval-Calvo, Marcelo
Azorin-Lopez, Jorge
[J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
[38] Understanding multilingual four-scene comics with deep learning methods
Chen, Jiali
Iwasaki, Ryo
Mori, Naoki
Okada, Makoto
Ueno, Miki
[J]. 2019 INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION WORKSHOPS (ICDARW) AND 13TH IAPR INTERNATIONAL WORKSHOP ON GRAPHICS RECOGNITION (GREC 2019), VOL 1, 2019, : 32 - 37
[39] Understanding Naturalistic Facial Expressions with Deep Learning and Multimodal Large Language Models
Bian, Yifan
Kuester, Dennis
Liu, Hui
Krumhuber, Eva G.
[J]. SENSORS, 2024, 24 (01)
[40] An Effective Natural Language Understanding Model using Deep Learning and PyDial Toolkit
Ganesan, Karthik
Patil, Akhilesh P.
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2017, : 810 - 816

← 1 2 3 4 5 →