Joint correlation analysis of audio-visual dance figures

Cited by: 0
Authors
Ofli, F. [1]
Demir, Y. [1]
Erzin, E. [1]
Yemez, Y. [1]
Tekalp, A. M. [1]
Affiliations
[1] Koc Univ, Goru Grafik Lab, TR-34450 Istanbul, Turkey
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper we present a framework for the analysis of dance figures from audio-visual data. Our audio-visual data is the multiview video of a dancing actor, acquired using 8 synchronized cameras. The multi-camera motion capture technique of this framework is based on 3D tracking of markers attached to the dancer's body, using stereo color information. The extracted 3D points are used to calculate the body motion features as 3D displacement vectors, while mel-frequency cepstral (MFC) coefficients serve as the audio features. In the first stage of the two-stage analysis task, we perform Hidden Markov Model (HMM) based unsupervised temporal segmentation of the audio and body motion features, separately, to extract the recurrent elementary audio and body motion patterns. In the second stage, the correlation of body motion patterns with audio patterns is investigated to create a correlation model that can be used during the synthesis of an audio-driven body animation.
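To make the two feature-level ideas in the abstract concrete, here is a minimal sketch in Python. It is not the authors' implementation: the function names `displacement_vectors` and `cooccurrence_model`, the toy data, and the use of a simple conditional co-occurrence table as the "correlation model" are all assumptions made for illustration. The HMM segmentation stage is not reproduced; the pattern labels are assumed to have been produced already.

```python
from collections import Counter, defaultdict


def displacement_vectors(frames):
    """Per-marker 3D displacement between consecutive frames.

    frames: list of frames, each a list of (x, y, z) marker positions.
    Returns a list with len(frames) - 1 entries.
    """
    result = []
    for prev, curr in zip(frames, frames[1:]):
        result.append([
            (cx - px, cy - py, cz - pz)
            for (px, py, pz), (cx, cy, cz) in zip(prev, curr)
        ])
    return result


def cooccurrence_model(audio_labels, motion_labels):
    """Estimate P(motion pattern | audio pattern) from aligned label sequences.

    Assumption: both sequences were segmented beforehand (for example by an
    HMM) and resampled so that index i refers to the same time span in both.
    """
    if len(audio_labels) != len(motion_labels):
        raise ValueError("label sequences must be aligned")
    counts = defaultdict(Counter)
    for a, m in zip(audio_labels, motion_labels):
        counts[a][m] += 1
    return {
        a: {m: n / sum(c.values()) for m, n in c.items()}
        for a, c in counts.items()
    }


# Toy data (hypothetical): one marker tracked over three frames.
frames = [[(0.0, 0.0, 0.0)], [(1.0, 0.5, 0.0)], [(1.0, 1.5, 2.0)]]
disp = displacement_vectors(frames)

# Toy pattern labels (hypothetical) for six aligned time spans.
audio = ["A1", "A1", "A2", "A2", "A1", "A2"]
motion = ["M1", "M1", "M2", "M1", "M1", "M2"]
model = cooccurrence_model(audio, motion)
```

In an audio-driven synthesis setting, a table like `model` could be queried with the current audio pattern to pick the most likely body motion pattern; the paper's actual correlation model may differ.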
Pages: 604 - 607
Page count: 4
Related papers
50 in total
  • [1] Multicamera audio-visual analysis of dance figures
    Ofli, F.
    Demir, Y.
    Erzin, E.
    Yemez, Y.
    Tekalp, A. M.
    2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 1703 - 1706
  • [2] Analysis and Synthesis of Multiview Audio-Visual Dance Figures
    Ofli, F.
    Demir, Y.
    Canton-Ferrer, C.
    Tilmanne, J.
    Balci, K.
    Bozkurt, E.
    Kizoglu, I.
    Yemez, Y.
    Erzin, E.
    Tekalp, A. M.
    Akarun, L.
    Erdem, A. T.
    2008 IEEE 16TH SIGNAL PROCESSING, COMMUNICATION AND APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2008, : 761 - +
  • [3] Objectivization of Audio-Visual Correlation Analysis
    Kunka, Bartosz
    Kostek, Bozena
    ARCHIVES OF ACOUSTICS, 2012, 37 (01) : 63 - 72
  • [4] A JOINT AUDIO-VISUAL APPROACH TO AUDIO LOCALIZATION
    Jensen, Jesper Rindom
    Christensen, Mads Graesboll
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 454 - 458
  • [5] Multimodal Dance Generation Networks Based on Audio-Visual Analysis
    Duan, Lijuan
    Xu, Xiao
    En, Qing
    INTERNATIONAL JOURNAL OF MULTIMEDIA DATA ENGINEERING & MANAGEMENT, 2021, 12 (01) : 17 - 32
  • [6] MUSIC, DANCE AND THEATRE IN AUDIO-VISUAL MEDIA
    Unknown
    CULTURES, 1973, 1 (01) : 276 - 280
  • [7] Joint watermarking of audio-visual data
    Dittmann, J
    Steinebach, M
    2001 IEEE FOURTH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2001, : 601 - 606
  • [8] Joint Audio-Visual Deepfake Detection
    Zhou, Yipin
    Lim, Ser-Nam
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14780 - 14789
  • [9] Indexing audio-visual sequences by joint audio and video processing
    Saraceno, C
    Leonardi, R
    VSMM98: FUTUREFUSION - APPLICATION REALITIES FOR THE VIRTUAL AGE, VOLS 1 AND 2, 1998, : 686 - 691
  • [10] Audio-visual speech recognition based on joint training with audio-visual speech enhancement for robust speech recognition
    Hwang, Jung-Wook
    Park, Jeongkyun
    Park, Rae-Hong
    Park, Hyung-Min
    APPLIED ACOUSTICS, 2023, 211