Where are we in Named Entity Recognition from Speech?

被引:0
|
作者
Caubriere, Antoine [1 ]
Rosset, Sophie [2 ]
Esteve, Yannick [3 ]
Laurent, Antoine [1 ]
Morin, Emmanuel [4 ]
机构
[1] Le Mans Univ, LIUM, Le Mans, France
[2] Univ Paris Saclay, LIMSI, CNRS, Paris, France
[3] Avignon Univ, LIA, Avignon, France
[4] Univ Nantes, CNRS, LS2N, Nantes, France
关键词
Named Entity Recognition; Automatic Speech Recognition; Tree-structured Named Entity; End-to-End;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Named entity recognition (NER) from speech is usually made through a pipeline process that consists in (i) processing audio using an automatic speech recognition system (ASR) and (ii) applying a NER to the ASR outputs. The latest data available for named entity extraction from speech in French were produced during the ETAPE evaluation campaign in 2012. Since the publication of ETAPE's campaign results, major improvements were done on NER and ASR systems, especially with the development of neural approaches for both of these components. In addition, recent studies have shown the capability of End-to-End (E2E) approach for NER / SLU tasks. In this paper, we propose a study of the improvements made in speech recognition and named entity recognition for pipeline approaches. For this type of systems, we propose an original 3-pass approach. We also explore the capability of an E2E system to do structured NER. Finally, we compare the performances of ETAPE's systems (state-of-the-art systems in 2012) with the performances obtained using current technologies. The results show the interest of the E2E approach, which however remains below an updated pipeline approach.
引用
收藏
页码:4514 / 4520
页数:7
相关论文
共 50 条
  • [1] Speech recognition of a named entity
    Tomita, T
    Okimoto, Y
    Yamamoto, H
    Sagisaka, Y
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1057 - 1060
  • [2] Joint Speech Translation and Named Entity Recognition
    Gaido, Marco
    Papi, Sara
    Negri, Matteo
    Turchi, Marco
    [J]. INTERSPEECH 2023, 2023, : 47 - 51
  • [3] Discriminative Named Entity Recognition of Speech Data using Speech Recognition Confidence
    Sudoh, Katsuhito
    Tsukada, Hajime
    Isozaki, Hideki
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 337 - 340
  • [4] Incorporating Pinyin into Pipeline Named Entity Recognition from Chinese Speech
    Zhang, Min
    Qiao, Xiaosong
    Zhao, Yanqing
    Su, Chang
    Li, Yinglu
    Zhu, Ming
    Zhu, Junhao
    Li, Yuang
    Zhao, Xiaofeng
    Liu, Yilun
    Ma, Wenbing
    Piao, Mengyao
    Yu, Jiawei
    Lv, Xinglin
    Peng, Song
    Tao, Shimin
    Yang, Hao
    Jiang, Yanfei
    [J]. 2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 947 - 953
  • [5] Incorporating speech recognition confidence into discriminative named entity recognition of speech data
    Sudoh, Katsuhito
    Tsukada, Hajime
    Isozaki, Hideki
    [J]. COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 617 - 624
  • [6] AISHELL-NER: NAMED ENTITY RECOGNITION FROM CHINESE SPEECH
    Chen, Boli
    Xu, Guangwei
    Wang, Xiaobin
    Xie, Pengjun
    Zhang, Meishan
    Huang, Fei
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8352 - 8356
  • [7] End-to-end Named Entity Recognition from English Speech
    Yadav, Hemant
    Ghosh, Sreyan
    Yu, Yi
    Shah, Rajiv Ratn
    [J]. INTERSPEECH 2020, 2020, : 4268 - 4272
  • [8] Incorporating Named Entity Recognition into the Speech Transcription Process
    Hatmi, Mohamed
    Jacquin, Christine
    Morin, Emmanuel
    Meignier, Sylvain
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3699 - 3703
  • [9] OOV Sensitive Named-Entity Recognition in Speech
    Parada, Carolina
    Dredze, Mark
    Jelinek, Frederick
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2096 - +
  • [10] End-to-end named entity recognition for Vietnamese speech
    Nguyen, Thu-Hien
    Nguyen, Thai-Binh
    Do, Quoc-Truong
    Nguyen, Tuan-Linh
    [J]. 2022 25TH CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (O-COCOSDA 2022), 2022,