Learning Spoken Language Representations with Neural Lattice Language Modeling

被引:0
|
作者
Huang, Chao-Wei [1 ]
Chen, Yun-Nung [1 ]
机构
[1] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei, Taiwan
关键词
NETWORKS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pre-trained language models have achieved huge improvement on many NLP tasks. However, these methods are usually designed for written text, so they do not consider the properties of spoken language. Therefore, this paper aims at generalizing the idea of language model pre-training to lattices generated by recognition systems. We propose a framework that trains neural lattice language models to provide contextualized representations for spoken language understanding tasks. The proposed two-stage pre-training approach reduces the demands of speech data and has better efficiency. Experiments on intent detection and dialogue act recognition datasets demonstrate that our proposed method consistently outperforms strong baselines when evaluated on spoken inputs.(1)
引用
收藏
页码:3764 / 3769
页数:6
相关论文
共 50 条
  • [1] CONVERTING WRITTEN LANGUAGE TO SPOKEN LANGUAGE WITH NEURAL MACHINE TRANSLATION FOR LANGUAGE MODELING
    Ando, Shintaro
    Suzuki, Masayuki
    Itoh, Nobuyasu
    Kurata, Gakuto
    Minematsu, Nobuaki
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8124 - 8128
  • [2] Spoken language understanding software for language learning
    Alam, Hassan
    Kumar, Aman
    Rahman, Fuad
    Hartono, Rachmat
    Tarnikova, Yuliya
    [J]. INT CONF ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS AND APPLICATIONS/INT CONF ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL II, 2007, : 107 - +
  • [3] Learning Semantic Hierarchy with Distributed Representations for Unsupervised Spoken Language Understanding
    Chen, Yun-Nung
    Wang, William Yang
    Rudnicky, Alexander I.
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1869 - 1873
  • [4] Efficient Self-Supervised Learning Representations for Spoken Language Identification
    Liu, Hexin
    Perera, Leibny Paola Garcia
    Khong, Andy W. H.
    Chng, Eng Siong
    Styles, Suzy J.
    Khudanpur, Sanjeev
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (06) : 1296 - 1307
  • [5] A NEURAL DOCUMENT LANGUAGE MODELING FRAMEWORK FOR SPOKEN DOCUMENT RETRIEVAL
    Yen, Li-Phen
    Wu, Zhen-Yu
    Chen, Kuan-Yu
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8139 - 8143
  • [6] Predictors of spoken language learning
    Wong, Patrick C. M.
    Ettlinger, Marc
    [J]. JOURNAL OF COMMUNICATION DISORDERS, 2011, 44 (05) : 564 - 567
  • [7] Spoken language, bilingualism, and learning difficulties in written language
    Rodriguez-Parra, Maria J.
    Buiza, Juan J.
    Adrian, Jose A.
    Alegria, Jesus
    [J]. INFANCIA Y APRENDIZAJE, 2012, 35 (03): : 343 - 364
  • [8] Joint Spoken Language Understanding and Domain Adaptive Language Modeling
    Zhang, Huifeng
    Zhu, Su
    Fan, Shuai
    Yu, Kai
    [J]. INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING, 2018, 11266 : 311 - 324
  • [9] Generative Spoken Dialogue Language Modeling
    Nguyen, Tu Anh
    Kharitonov, Eugene
    Copet, Jade
    Adi, Yossi
    Hsu, Wei-Ning
    Elkahky, Ali
    Tomasello, Paden
    Algayres, Robin
    Sagot, Benoit
    Mohamed, Abdelrahman
    Dupoux, Emmanuel
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2023, 11 : 250 - 266
  • [10] Neural Representations of Natural Language
    Feng, Haoda
    [J]. NATURAL LANGUAGE ENGINEERING, 2022, 28 (05) : 669 - 672