The Impact of Audio Segmentation to Speaker Tracking in Broadcast News Data

Cited by: 0
Author
Zibert, Janez [1]
Affiliation
[1] Univ Ljubljani, Fak Electrotehn, Trzaska 25, Ljubljana 1000, Slovenia
Source
Keywords
audio segmentation; speaker clustering; audio indexing; speaker tracking;
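DOI
Not available
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Discipline classification code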
0808 ; 0809 ;
Abstract
A system for speaker tracking in broadcast-news audio data is presented. Speaker tracking in continuous audio streams involves several processing tasks and is therefore treated as a multistage process. The main building blocks of such a system are the components for audio segmentation, speech detection, speaker clustering, and speaker identification. Our system was developed by implementing the most recently published methods in each component; we focused mainly on the audio-segmentation component, which is considered one of the most critical components of such systems. Two alternative approaches to speaker change detection and speaker clustering are explored, and their impact on the overall speaker-tracking performance is evaluated. The evaluation experiments were performed on broadcast-news audio data with a speaker-tracking system capable of detecting 41 target speakers. A comparison of the evaluation results for the different versions of the speaker-tracking system indicates the importance of the tasks in the audio-segmentation module and provides insight into how the system works.
Pages: 205-210
Number of pages: 6
Related papers
50 in total
  • [1] A System for Speaker Detection and Tracking in Audio Broadcast News
    Zibert, Janez
    Vesnicer, Bostjan
    Mihelic, France
    [J]. INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2008, 32 (01): : 51 - 61
  • [2] A system for Speaker Detection and Tracking in Audio Broadcast News
    Karim, Dabbabi
    Adnen, Cherif
    Salah, Hajji
    [J]. 2017 INTERNATIONAL CONFERENCE ON ENGINEERING & MIS (ICEMIS), 2017,
  • [3] Broadcast news segmentation by audio type analysis
    Nwe, TL
    Li, HZ
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1065 - 1068
  • [4] Audio-visual speaker recognition for video broadcast news
    Maison, B
    Neti, C
    Senior, A
    [J]. JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2001, 29 (1-2): : 71 - 79
  • [5] Audio-Visual Speaker Recognition for Video Broadcast News
    Benoît Maison
    Chalapathy Neti
    Andrew Senior
    [J]. Journal of VLSI signal processing systems for signal, image and video technology, 2001, 29 : 71 - 79
  • [6] On the use of linguistic information for broadcast news speaker tracking
    Antoni, William
    Fredouille, Corinne
    Bonastre, Jean-Francois
    [J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 1021 - 1024
  • [7] A simple but effective approach to speaker tracking in broadcast news
    Rodriguez, Luis Javier
    Penagarikano, Mikel
    Bordel, German
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS, PT 2, PROCEEDINGS, 2007, 4478 : 48 - +
  • [8] Multimedia approach for audio segmentation in TV broadcast news
    Perez-Freire, L
    Garcia-Mateo, C
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 369 - 372
  • [9] Audio segmentation, classification and clustering in a broadcast news task
    Meinedo, H
    Neto, J
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 5 - 8
  • [10] Comparison of Segmentation and Clustering Methods for Speaker Diarization of Broadcast Stream Audio
    Prazak, Jan
    Silovsky, Jan
    [J]. ANALYSIS OF VERBAL AND NONVERBAL COMMUNICATION AND ENACTMENT: THE PROCESSING ISSUES, 2011, 6800 : 214 - 222