Multiresolution alignment for multiple unsynchronized audio sequences using sequential Monte Carlo samplers

被引:1
|
作者
Basaran, Dogac [1 ]
Cemgil, Ali Taylan [2 ]
Anarim, Emin [3 ]
机构
[1] Telecom Paristech Univ, Signal & Image Proc Dept, 46 Rue Barrault, Paris, France
[2] Bogazici Univ, Comp Engn Dept, TR-34342 Istanbul, Turkey
[3] Bogazici Univ, Elect & Elect Engn Dept, TR-34342 Istanbul, Turkey
关键词
Multiple audio alignment; Multiresolution alignment; Audio fingerprint; Bayesian inference; Sequential Monte Carlo samplers; Sequential alignment;
D O I
10.1016/j.dsp.2017.10.024
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
It is increasingly more common that an occasion is recorded by multiple individuals with the proliferation of recording devices such as smart phones. When properly aligned, these recordings may provide several audio and visual perspectives to a scene which leads to several applications in restoring, remastering and remixing frameworks in various fields. In this work, we propose a multiresolution alignment algorithm for aligning multiple unsynchronized audio sequences using Sequential Monte Carlo samplers. We employ a model based approach and a score function analogous to similarity based methods. The optimum alignments are obtained in a course to fine structure with multiresolution sampling and a heuristic sequential search method. The proposed method is evaluated with a real-life dataset from Jiku Mobile Video Datasets. The simulation results suggest that our method is competitive with the baseline methods in terms of accuracy with suitable choice of parameters.
引用
收藏
页码:77 / 85
页数:9
相关论文
共 50 条