Automatic Speech Recognition for Live TV Subtitling for Hearing-Impaired People

被引:0
|
作者
Obach, Michael [1 ]
Lehr, Maider [1 ]
Arruti, Andoni
机构
[1] VICOMTech Visual Interact & Commun Technol Ctr, E-20009 Donostia San Sebastian, Spain
来源
关键词
Subtitling; Live Subtitling; Closed Captioning; Automatic Speech Recognition; Hearing Impaired; Deaf and Hard of Hearing; Teletext;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most Spanish TV channels offer subtitles (closed captions) only for some of their pre-recorded programmes, and mainly due to the costs of specially trained stenographers and fast typists, subtitles are rarely available for live programmes like news broadcasts, sports events, and others. Progress in automatic speech recognition (ASR) opens a new way for live subtitling, but only works well when trained to recognise a single voice and when trained previously with material related to the contents of the programmes. We developed a prototype based on ASR that could be applied to generate automatically live subtitles as teletext for Spanish news broadcasts without human participation. The main goal was to evaluate the feasibility of using this technology to improve the quality of life of millions of hearing-impaired people, in accordance with applicable and future Spanish legislation. State-of-the-art speech recognition software for dictation as literal transcription of speech and a commercial teletext generator conforming to Spanish standards were integrated with our modules for improved pre-processing of the audio signal, voice normalization for speaker independence, speech/non-speech segmentation, and tools for the generation and update of dictionaries. The prototype was validated in cooperation with a TV broadcaster, which provided audiovisual material for the generation of the language corpus and specific dictionaries. System outputs were evaluated by organizations of the deaf and the hard of hearing. Results indicate that ASR is (still) not suitable for fully automated live subtitling. A delay of several seconds between speech and subtitle was observed. A limited word recognition rate, mainly caused by a huge number of named entities and variability of speakers and acoustic conditions, made understanding of the news sometimes impossible. We identified the lack of automatic punctuation as a major problem that decreased the readability of the contents of subtitles and also affected recognition quality. Many results are valid for other languages and other areas of subtitling than television.
引用
收藏
页码:286 / 291
页数:6
相关论文
共 50 条
  • [21] SOME EFFECTS OF SPECTRAL SHAPING ON RECOGNITION OF SPEECH BY HEARING-IMPAIRED LISTENERS
    KAMM, CA
    DIRKS, DD
    CARTERETTE, EC
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1982, 71 (05): : 1211 - 1224
  • [22] SPEECH-RECOGNITION DIFFICULTIES OF THE HEARING-IMPAIRED ELDERLY - THE CONTRIBUTIONS OF AUDIBILITY
    HUMES, LE
    ROBERTS, L
    [J]. JOURNAL OF SPEECH AND HEARING RESEARCH, 1990, 33 (04): : 726 - 735
  • [23] Cued Speech Recognition for Augmentative Communication in Normal-hearing and Hearing-impaired Subjects
    Heracleous, Panikos
    Beautemps, Denis
    Abboutabit, Noureddine
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1419 - 1422
  • [24] Some effects of spectral shaping on recognition of speech by hearing-impaired listeners
    Kamm, C.A.
    Dirks, D.D.
    Carterette, E.C.
    [J]. 1600, (71):
  • [25] A model of speech recognition for hearing-impaired listeners based on deep learning
    Rossbach, Jana
    Kollmeier, Birger
    Meyer, Bernd T.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2022, 151 (03): : 1417 - 1427
  • [26] TACTILE AIDS FOR SPEECH-PERCEPTION AND PRODUCTION BY HEARING-IMPAIRED PEOPLE
    WEISENBERGER, J
    [J]. VOLTA REVIEW, 1989, 91 (05) : 79 - 100
  • [27] SPEECH PRODUCTION IN HEARING-IMPAIRED CHILDREN
    GOLD, T
    [J]. JOURNAL OF COMMUNICATION DISORDERS, 1980, 13 (06) : 397 - 418
  • [28] THE SPEECH OF HEARING-IMPAIRED CHILDREN - MARKIDES,A
    SILVERMAN, SR
    [J]. JOURNAL OF THE BRITISH ASSOCIATION OF TEACHERS OF THE DEAF, 1985, 9 (02): : 48 - 49
  • [29] TEACHING SPEECH TO HEARING-IMPAIRED CHILDREN
    GATTY, JC
    [J]. VOLTA REVIEW, 1992, 94 (05) : 49 - 61
  • [30] THE SPEECH OF HEARING-IMPAIRED CHILDREN - MARKIDES,A
    BENCH, J
    [J]. BRITISH JOURNAL OF DISORDERS OF COMMUNICATION, 1985, 20 (01): : 92 - 93