Joint correlation analysis of audio-visual dance figures

被引：0

作者：

Ofli, F. ^{[1
]}

Demir, Y. ^{[1
]}

Erzin, E. ^{[1
]}

Yemez, Y. ^{[1
]}

Tekalp, A. M. ^{[1
]}

机构：

[1] Koc Univ, Goru Grafik Lab, TR-34450 Istanbul, Turkey

来源：

2007 IEEE 15TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS, VOLS 1-3 | 2007年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper we present a framework for analysis of dance figures from audio-visual data. Our audio-visual data is the multiview video of a dancing actor which is acquired using 8 synchronized cameras. The multi-camera motion capture technique of this framework is based on 3D tracking of the markers attached to the dancer's body, using stereo color information. The extracted 31) points are used to calculate the body motion features as 3D displacement vectors. On the other hand, MFC coefficients serve as the audio features. In the first stage of the two stage analysis task, we perform Hidden Markov Model (HMM) based unsupervised temporal segmentation of the audio and body motion features, separately, to extract the recurrent elementary audio and body motion patterns. In the second stage, the correlation of body motion patterns with audio patterns is investigated to create a correlation model that can be used during the synthesis of an audio-driven body animation.

引用

页码：604 / 607

页数：4

共 50 条

[1] Multicamera audio-visual analysis of dance figures
Ofli, F.
Demir, Y.
Erzin, E.
Yemez, Y.
Tekalp, A. M.
2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 1703 - 1706
[2] Analysis and Synthesis of Multiview Audio-Visual Dance Figures
Ofli, F.
Demir, Y.
Canton-Ferrer, C.
Tilmanne, J.
Balci, K.
Bozkurt, E.
Kizoglu, I.
Yemez, Y.
Erzin, E.
Tekalp, A. M.
Akarun, L.
Erdem, A. T.
2008 IEEE 16TH SIGNAL PROCESSING, COMMUNICATION AND APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2008, : 761 - +
[3] Objectivization of Audio-Visual Correlation Analysis
Kunka, Bartosz
Kostek, Bozena
ARCHIVES OF ACOUSTICS, 2012, 37 (01) : 63 - 72
[4] A JOINT AUDIO-VISUAL APPROACH TO AUDIO LOCALIZATION
Jensen, Jesper Rindom
Christensen, Mads Graesboll
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 454 - 458
[5] Multimodal Dance Generation Networks Based on Audio-Visual Analysis
Duan, Lijuan
Xu, Xiao
En, Qing
INTERNATIONAL JOURNAL OF MULTIMEDIA DATA ENGINEERING & MANAGEMENT, 2021, 12 (01): : 17 - 32
[6] MUSIC, DANCE AND THEATRE IN AUDIO-VISUAL MEDIA
不详
CULTURES, 1973, 1 (01): : 276 - 280
[7] Joint watermarking of audio-visual data
Dittmann, J
Steinebach, M
2001 IEEE FOURTH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2001, : 601 - 606
[8] Joint Audio-Visual Deepfake Detection
Zhou, Yipin
Lim, Ser-Nam
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14780 - 14789
[9] Indexing audio-visual sequences by joint audio and video processing
Saraceno, C
Leonardi, R
VSMM98: FUTUREFUSION - APPLICATION REALITIES FOR THE VIRTUAL AGE, VOLS 1 AND 2, 1998, : 686 - 691
[10] Audio-visual speech recognition based on joint training with audio-visual speech enhancement for robust speech recognition
Hwang, Jung-Wook
Park, Jeongkyun
Park, Rae-Hong
Park, Hyung-Min
APPLIED ACOUSTICS, 2023, 211

← 1 2 3 4 5 →