Multimedia approach for audio segmentation in TV broadcast news

Authors
Perez-Freire, L [1]
Garcia-Mateo, C [1]
Affiliation
[1] Univ Vigo, ETSI Telecomunicac, Vigo, Spain
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
This paper addresses audio segmentation in TV broadcast news and proposes a multimedia approach that combines audio and video processing. The segmentation system consists of two distinct parts: one analyzes the audio stream and is based on the well-known Bayesian Information Criterion (BIC), while the other extracts information from the video stream to improve the performance of BIC. The parameters involved in the BIC formulation are also investigated in order to achieve the best possible results in the experimental framework, the Transcrigal-DB database. The final system provides significant improvements in both overall performance and robustness.
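The abstract does not give the BIC formulation the authors use, so the following is only a minimal sketch of the textbook ΔBIC test for a single candidate change point, assuming full-covariance Gaussian models over frame-level feature vectors. The function name `delta_bic`, the `penalty_weight` parameter (often written λ), and the synthetic data are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def delta_bic(features, split, penalty_weight=1.0):
    """Textbook Delta-BIC for one candidate change point (illustrative sketch).

    features: array of shape (N, d), one feature vector per audio frame.
    split: index of the candidate boundary (frames [0, split) vs [split, N)).
    penalty_weight: hypothetical name for the tunable penalty factor (lambda).
    A positive value favours two separate Gaussians, i.e. a change point.
    """
    n, d = features.shape
    left, right = features[:split], features[split:]

    def logdet_cov(x):
        # Maximum-likelihood covariance with a small ridge for numerical stability.
        cov = np.atleast_2d(np.cov(x, rowvar=False, bias=True)) + 1e-6 * np.eye(d)
        _, logdet = np.linalg.slogdet(cov)
        return logdet

    # Log-likelihood gain from modelling the window with two Gaussians instead of one.
    gain = 0.5 * (n * logdet_cov(features)
                  - len(left) * logdet_cov(left)
                  - len(right) * logdet_cov(right))
    # Extra free parameters of the second Gaussian: mean plus full covariance.
    n_params = d + 0.5 * d * (d + 1)
    penalty = 0.5 * penalty_weight * n_params * np.log(n)
    return gain - penalty

# Synthetic stand-in for acoustic features (assumption: 4-dimensional frames).
rng = np.random.default_rng(0)
changed = np.vstack([rng.normal(0.0, 1.0, (200, 4)),
                     rng.normal(3.0, 1.0, (200, 4))])
homogeneous = rng.normal(0.0, 1.0, (400, 4))

score_changed = delta_bic(changed, 200)
score_homogeneous = delta_bic(homogeneous, 200)
```

In this sketch a larger `penalty_weight` makes the detector more conservative, which is the kind of parameter trade-off the abstract says was tuned experimentally.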
Pages: 369-372
Number of pages: 4