A simple but effective approach to speaker tracking in broadcast news

Cited by: 0
Authors
Rodriguez, Luis Javier [1]
Penagarikano, Mikel [1]
Bordel, German [1]
Affiliation
[1] Univ Basque Country, Fac Ciencia & Tecnol, Dept Elect & Elect, Grp Trabajo Tecnol Software, Barrio Sarriena S-N, Leioa 48940, Spain
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The automatic transcription of broadcast news and meetings involves the segmentation, identification and tracking of speaker turns during each session, a task known as speaker diarization. This paper presents a simple but effective approach to a slightly different task, called speaker tracking, which also involves audio segmentation and speaker identification but targets a subset of known speakers. This makes it possible to estimate speaker models and to perform identification on a segment-by-segment basis. The proposed algorithm segments the audio signal in a fully unsupervised way by locating the most likely change points from a purely acoustic point of view. The available speaker data are then used to estimate single-Gaussian acoustic models. Finally, the speaker models are used to classify the audio segments by choosing the most likely speaker or, alternatively, the Other category if none of the speakers is likely enough. Despite its simplicity, the proposed approach yielded the best performance in the speaker tracking challenge organized in November 2006 by the Spanish Network on Speech Technology.
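The classification step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the acoustic features, the covariance type, the scoring rule or the rejection threshold, so the full-covariance Gaussian, the average per-frame log-likelihood score, the `threshold` parameter and all function names below are assumptions made for illustration. Segments are assumed to be already available as arrays of feature frames.

```python
import numpy as np

def fit_gaussian(frames):
    """Estimate one Gaussian (mean, covariance) from (n_frames, n_dims) features."""
    mean = frames.mean(axis=0)
    # Assumption: a small diagonal term keeps the covariance invertible.
    cov = np.cov(frames, rowvar=False) + 1e-6 * np.eye(frames.shape[1])
    return mean, cov

def avg_log_likelihood(frames, model):
    """Average per-frame log-likelihood of a segment under one Gaussian model."""
    mean, cov = model
    dims = frames.shape[1]
    diff = frames - mean
    _, logdet = np.linalg.slogdet(cov)
    # Squared Mahalanobis distance of every frame to the model mean.
    mahal = np.einsum("ij,ij->i", diff @ np.linalg.inv(cov), diff)
    return float(np.mean(-0.5 * (dims * np.log(2 * np.pi) + logdet + mahal)))

def classify_segment(frames, models, threshold):
    """Return the most likely known speaker, or "Other" if the best score is too low."""
    scores = {name: avg_log_likelihood(frames, m) for name, m in models.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] >= threshold else "Other"
```

In this sketch `models` would be built once from the available speaker data, for example `{name: fit_gaussian(x) for name, x in training_data.items()}`, and `classify_segment` would then be called on each segment produced by the unsupervised segmentation step. The threshold value would need to be tuned on held-out data.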
Pages: 48 / +
Page count: 2
Related papers
50 in total
  • [1] A System for Speaker Detection and Tracking in Audio Broadcast News
    Zibert, Janez
    Vesnicer, Bostjan
    Mihelic, France
    [J]. INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2008, 32 (01): : 51 - 61
  • [2] A system for speaker detection and tracking in audio broadcast news
    University of Ljubljana, Faculty of Electrical Engineering, Trzaška 25, SI-1000, Ljubljana, Slovenia
    [J]. Inf, 2008, 1 (51-61):
  • [3] A system for Speaker Detection and Tracking in Audio Broadcast News
    Karim, Dabbabi
    Adnen, Cherif
    Salah, Hajji
    [J]. 2017 INTERNATIONAL CONFERENCE ON ENGINEERING & MIS (ICEMIS), 2017,
  • [4] On the use of linguistic information for broadcast news speaker tracking
    Antoni, William
    Fredouille, Corinne
    Bonastre, Jean-Francois
    [J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 1021 - 1024
  • [5] The impact of audio segmentation to speaker tracking in broadcast news data
    Zibert, Janez
    [J]. Elektrotehniski Vestnik/Electrotechnical Review, 2008, 75 (04): : 205 - 210
  • [6] The Impact of Audio Segmentation to Speaker Tracking in Broadcast News Data
    Zibert, Janez
    [J]. ELEKTROTEHNISKI VESTNIK-ELECTROTECHNICAL REVIEW, 2008, 75 (04): : 205 - 210
  • [7] Speaker diarization of French broadcast news
    Gupta, Vishwa
    Boulianne, Gilles
    Kenny, Patrick
    Ouellet, Pierre
    Dumouchel, Pierre
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4365 - 4368
  • [8] Multistage speaker diarization of broadcast news
    Barras, Claude
    Zhu, Xuan
    Meignier, Sylvain
    Gauvain, Jean-Luc
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (05): : 1505 - 1512
  • [9] Robust Speaker Diarization for News Broadcast
    Karthik, M. L. N. S.
    Ganesh, Mirishkar Sai
    Patnaik, Bijayananda
    [J]. 2018 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2018,
  • [10] A speaker-role based approach for detecting politicians in TV broadcast news
    Charlet, Delphine
    Damnati, Geraldine
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1850 - 1853