An Empirical Investigation of Word Class-Based Features for Natural Language Understanding

Cited by: 4
Authors
Celikyilmaz, Asli [1]
Sarikaya, Ruhi [1]
Jeong, Minwoo [1]
Deoras, Anoop [2, 3]
Affiliations
[1] Microsoft Corp, Intent Sci Team, Redmond, WA 98052 USA
[2] Microsoft Corp, Conversat Understanding Sci, Redmond, WA 98052 USA
[3] Netflix, Algorithms Res & Engn Grp, Los Gatos, CA 95032 USA
Keywords
Class-based features; conditional random fields; exponential models; natural language understanding; regularization; shrinkage features; NETWORKS;
DOI
10.1109/TASLP.2015.2511925
Chinese Library Classification number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
Many studies show that using class-based features improves the performance of natural language processing (NLP) tasks such as syntactic part-of-speech tagging, dependency parsing, sentiment analysis, and slot filling in natural language understanding (NLU), but not much has been reported on the underlying reasons for the performance improvements. In this paper, we investigate the effects of word class-based features for the exponential family of models, focusing specifically on NLU tasks, and demonstrate that the performance improvements can be attributed to the regularization effect of the class-based features on the underlying model. Our hypothesis is based on the empirical observation that shrinking the sum of parameter magnitudes in an exponential model tends to improve performance. We show on several semantic tagging tasks that there is a positive correlation between the model size reduction obtained by adding the class-based features and the model performance on a held-out dataset. We also demonstrate that class-based features extracted from different data sources using alternate word clustering methods can individually contribute to the performance gain. Since the proposed features are generated in an unsupervised manner without significant computational overhead, the improvements in performance largely come for free, and we show that such features provide gains for a wide range of tasks, from semantic classification and slot tagging in NLU to named entity recognition (NER).
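As a rough illustration of the quantity the abstract refers to, the sketch below writes out a generic exponential (log-linear) model and the sum of its parameter magnitudes. The notation (features f_i, parameters λ_i, and the name "size") is an assumption made here for illustration and is not taken from the paper.

```latex
% Generic exponential (log-linear) model; notation is assumed, not the paper's.
P_\Lambda(y \mid x) =
  \frac{\exp\left(\sum_{i} \lambda_i f_i(x, y)\right)}
       {\sum_{y'} \exp\left(\sum_{i} \lambda_i f_i(x, y')\right)},
\qquad
\mathrm{size}(\Lambda) = \sum_{i} \lvert \lambda_i \rvert
```

Under this reading, a class-based feature fires on the cluster a word belongs to rather than on the word itself, so a single weight can be shared across many words. The abstract's hypothesis is that adding such features tends to reduce the sum of parameter magnitudes, and that this reduction correlates with better held-out performance.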
Pages: 994 - 1005
Page count: 12
Related papers
50 in total
  • [21] Unsupervised class-based language model adaptation for spontaneous speech recognition
    Yokoyama, T
    Shinozaki, T
    Iwano, K
    Furui, S
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 236 - 239
  • [22] Under the Radar: The Role of Invisible Discourse in Understanding Class-Based Privilege
    Sanders, Melissa R.
    Mahalingam, Ramaswami
    [J]. JOURNAL OF SOCIAL ISSUES, 2012, 68 (01) : 112 - 127
  • [23] Statistical language modeling with a class-based n-multigram model
    Deligne, S
    Sagisaka, Y
    [J]. COMPUTER SPEECH AND LANGUAGE, 2000, 14 (03) : 261 - 279
  • [24] Word sense disambiguation using semantic kernels with class-based term values
    Altinel, Berna
    Ganiz, Murat Can
    Sipal, Bilge
    Erkaya, Erencan
    Yucedag, Onur Can
    Dogan, Muhammed Ali
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (04) : 3180 - 3194
  • [25] Constituency Parsing of Bulgarian: Word- vs. Class-based Parsing
    Ghayoomi, Masood
    Simov, Kiril
    Osenova, Petya
    [J]. LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 4056 - 4060
  • [26] Fusion of region based extracted features for instance- and class-based CBIR applications
    Pradhan, Jitesh
    Pal, Arup Kumar
    Banka, Haider
    Dansena, Prabhat
    [J]. APPLIED SOFT COMPUTING, 2021, 102
  • [27] Application of Morphosyntactic and Class-Based Language Models in Automatic Speech Recognition of Polish
    Smywinski-Pohl, Alexsander
    Ziolko, Bartosz
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2016, 25 (02)
  • [28] CLASS-BASED AGGRESSIVE FEATURE SELECTION FOR POLYNOMIAL NETWORKS TEXT CLASSIFIERS - AN EMPIRICAL STUDY
    Al-Tahrawi, Mayy
    [J]. UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2015, 77 (02) : 93 - 110
  • [29] Class-based aggressive feature selection for polynomial networks text classifiers - an empirical study
    AL-Tahrawi, Mayy
    [J]. UPB Scientific Bulletin, Series C: Electrical Engineering and Computer Science, 2015, 77 (2) : 93 - 110
  • [30] Enhancing performance of transformer-based models in natural language understanding through word importance embedding
    Hong, Seung-Kyu
    Jang, Jae-Seok
    Kwon, Hyuk-Yoon
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 304