Audio Segmentation via the Similarity Measure of Audio Feature Vectors

被引:0
|
作者
CHEN Gang
机构
关键词
audio segmentation; abrupt change detection; overall error; similarity measure; self-similarity matrix; relevance feedback;
D O I
暂无
中图分类号
TP391.41 [];
学科分类号
080203 ;
摘要
A formula to compute the similarity between two audio feature vectors is proposed, which can map arbitrary pair of vectors with equivalent dimension to \[0,1). To fulfill the task of audio segmentation, a self-similarity matrix is computed to reveal the inner structure of an audio clip to be segmented. As the final result must be consistent with the subjective evaluation and be adaptive to some special applications, a set of weights is adopted, which can be modified through relevance feedback techniques. Experiments show that satisfactory result can be achieved via the algorithm proposed in this paper.
引用
收藏
页码:833 / 837
页数:5
相关论文
共 50 条
  • [1] Dominant feature vectors based audio similarity measure
    Gu, Jing
    Lu, Lie
    Cai, Rui
    Zhang, Hong-Jiang
    Yang, Jian
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2004, 3332 : 890 - 897
  • [2] Dominant feature vectors based audio similarity measure
    Gu, J
    Lu, L
    Cai, R
    Zhang, HJ
    Yang, J
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2004, PT 2, PROCEEDINGS, 2004, 3332 : 890 - 897
  • [3] Automatic audio segmentation using a measure of audio novelty
    Foote, J
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 452 - 455
  • [4] Quantitative Analysis of a Common Audio Similarity Measure
    Jensen, Jesper Hojvang
    Christensen, Mads Graesboll
    Ellis, Daniel P. W.
    Jensen, Soren Holdt
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (04): : 693 - 703
  • [5] Quick audio retrieval using multiple feature vectors
    Kim, KM
    Kim, SY
    Jeon, JK
    Park, KS
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2006, 52 (01) : 200 - 205
  • [6] Audio similarity measure based on distance correlation image
    School of Computer Science and Technology, Beijing University of Aeronautics and Astronautics, Beijing 100083, China
    [J]. Beijing Hangkong Hangtian Daxue Xuebao, 2006, 2 (224-227):
  • [7] A Novel Audio Segmentation for Audio Diarization
    Ma, Xuehan
    [J]. INFORMATION TECHNOLOGY AND INTELLIGENT TRANSPORTATION SYSTEMS, VOL 2, 2017, 455 : 399 - 407
  • [8] Audio feature extraction and analysis for scene segmentation and classification
    Liu, Z
    Wang, Y
    Chen, TH
    [J]. JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 1998, 20 (1-2): : 61 - 79
  • [9] Audio Feature Extraction and Analysis for Scene Segmentation and Classification
    Zhu Liu
    Yao Wang
    Tsuhan Chen
    [J]. Journal of VLSI signal processing systems for signal, image and video technology, 1998, 20 : 61 - 79
  • [10] Audio feature extraction and analysis for scene segmentation and classification
    Polytechnic Univ, Brooklyn, United States
    [J]. J VLSI Signal Process Syst Signal Image Video Technol, 1-2 (61-79):