Discriminative Named Entity Recognition of Speech Data using Speech Recognition Confidence

被引:0
|
作者
Sudoh, Katsuhito [1 ]
Tsukada, Hajime [1 ]
Isozaki, Hideki [1 ]
机构
[1] NTT, Commun Sci Labs, Keihanna Sci City, Kyoto 6190237, Japan
关键词
named entity recognition; speech recognition; confidence scoring; discriminative models; information retrieval;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a method for the named entity recognition (NER) of speech data that uses automatic speech recognition (ASR) confidence as a feature that indicates whether each word is correctly recognized. An NER model is trained using ASR results with named entity (NE) labels to include an ASR confidence feature as well as corresponding transcriptions with NE labels. Experiments using support vector machines (SVMs) and speech data from Japanese newspaper articles show that the proposed method achieves higher F-measure in NER than a simple application of text-based NER to ASR results.
引用
收藏
页码:337 / 340
页数:4
相关论文
共 50 条
  • [1] Incorporating speech recognition confidence into discriminative named entity recognition of speech data
    Sudoh, Katsuhito
    Tsukada, Hajime
    Isozaki, Hideki
    [J]. COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 617 - 624
  • [2] Speech recognition of a named entity
    Tomita, T
    Okimoto, Y
    Yamamoto, H
    Sagisaka, Y
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1057 - 1060
  • [3] Joint Speech Translation and Named Entity Recognition
    Gaido, Marco
    Papi, Sara
    Negri, Matteo
    Turchi, Marco
    [J]. INTERSPEECH 2023, 2023, : 47 - 51
  • [4] Where are we in Named Entity Recognition from Speech?
    Caubriere, Antoine
    Rosset, Sophie
    Esteve, Yannick
    Laurent, Antoine
    Morin, Emmanuel
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 4514 - 4520
  • [5] Incorporating Named Entity Recognition into the Speech Transcription Process
    Hatmi, Mohamed
    Jacquin, Christine
    Morin, Emmanuel
    Meignier, Sylvain
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3699 - 3703
  • [6] OOV Sensitive Named-Entity Recognition in Speech
    Parada, Carolina
    Dredze, Mark
    Jelinek, Frederick
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2096 - +
  • [7] Discriminative transform for confidence estimation in Mandarin speech recognition
    Guo, G
    Wang, RH
    [J]. 2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 269 - 272
  • [8] End-to-end named entity recognition for Vietnamese speech
    Nguyen, Thu-Hien
    Nguyen, Thai-Binh
    Do, Quoc-Truong
    Nguyen, Tuan-Linh
    [J]. 2022 25TH CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (O-COCOSDA 2022), 2022,
  • [9] Simultaneous Estimation of Confidence and Error Cause in Speech Recognition Using Discriminative Model
    Ogawa, Atsunori
    Nakamura, Atsushi
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1203 - 1206
  • [10] Using SVMs and discriminative models for speech recognition
    Smith, ND
    Gales, MJF
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 77 - 80