Where are we in Named Entity Recognition from Speech?

被引：0

作者：

Caubriere, Antoine ^{[1
]}

Rosset, Sophie ^{[2
]}

Esteve, Yannick ^{[3
]}

Laurent, Antoine ^{[1
]}

Morin, Emmanuel ^{[4
]}

机构：

[1] Le Mans Univ, LIUM, Le Mans, France

[2] Univ Paris Saclay, LIMSI, CNRS, Paris, France

[3] Avignon Univ, LIA, Avignon, France

[4] Univ Nantes, CNRS, LS2N, Nantes, France

来源：

PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020) | 2020年

关键词：

Named Entity Recognition; Automatic Speech Recognition; Tree-structured Named Entity; End-to-End;

D O I：

暂无

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Named entity recognition (NER) from speech is usually made through a pipeline process that consists in (i) processing audio using an automatic speech recognition system (ASR) and (ii) applying a NER to the ASR outputs. The latest data available for named entity extraction from speech in French were produced during the ETAPE evaluation campaign in 2012. Since the publication of ETAPE's campaign results, major improvements were done on NER and ASR systems, especially with the development of neural approaches for both of these components. In addition, recent studies have shown the capability of End-to-End (E2E) approach for NER / SLU tasks. In this paper, we propose a study of the improvements made in speech recognition and named entity recognition for pipeline approaches. For this type of systems, we propose an original 3-pass approach. We also explore the capability of an E2E system to do structured NER. Finally, we compare the performances of ETAPE's systems (state-of-the-art systems in 2012) with the performances obtained using current technologies. The results show the interest of the E2E approach, which however remains below an updated pipeline approach.

引用

页码：4514 / 4520

页数：7

共 50 条

[1] Speech recognition of a named entity
Tomita, T
Okimoto, Y
Yamamoto, H
Sagisaka, Y
[J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1057 - 1060
[2] Joint Speech Translation and Named Entity Recognition
Gaido, Marco
Papi, Sara
Negri, Matteo
Turchi, Marco
[J]. INTERSPEECH 2023, 2023, : 47 - 51
[3] Discriminative Named Entity Recognition of Speech Data using Speech Recognition Confidence
Sudoh, Katsuhito
Tsukada, Hajime
Isozaki, Hideki
[J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 337 - 340
[4] Incorporating Pinyin into Pipeline Named Entity Recognition from Chinese Speech
Zhang, Min
Qiao, Xiaosong
Zhao, Yanqing
Su, Chang
Li, Yinglu
Zhu, Ming
Zhu, Junhao
Li, Yuang
Zhao, Xiaofeng
Liu, Yilun
Ma, Wenbing
Piao, Mengyao
Yu, Jiawei
Lv, Xinglin
Peng, Song
Tao, Shimin
Yang, Hao
Jiang, Yanfei
[J]. 2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 947 - 953
[5] Incorporating speech recognition confidence into discriminative named entity recognition of speech data
Sudoh, Katsuhito
Tsukada, Hajime
Isozaki, Hideki
[J]. COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 617 - 624
[6] AISHELL-NER: NAMED ENTITY RECOGNITION FROM CHINESE SPEECH
Chen, Boli
Xu, Guangwei
Wang, Xiaobin
Xie, Pengjun
Zhang, Meishan
Huang, Fei
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8352 - 8356
[7] End-to-end Named Entity Recognition from English Speech
Yadav, Hemant
Ghosh, Sreyan
Yu, Yi
Shah, Rajiv Ratn
[J]. INTERSPEECH 2020, 2020, : 4268 - 4272
[8] Incorporating Named Entity Recognition into the Speech Transcription Process
Hatmi, Mohamed
Jacquin, Christine
Morin, Emmanuel
Meignier, Sylvain
[J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3699 - 3703
[9] OOV Sensitive Named-Entity Recognition in Speech
Parada, Carolina
Dredze, Mark
Jelinek, Frederick
[J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2096 - +
[10] End-to-end named entity recognition for Vietnamese speech
Nguyen, Thu-Hien
Nguyen, Thai-Binh
Do, Quoc-Truong
Nguyen, Tuan-Linh
[J]. 2022 25TH CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (O-COCOSDA 2022), 2022,

← 1 2 3 4 5 →