Informative subspaces for audio-visual processing: High-level function from low-level fusion

Citations: 0
Authors
Fisher, JW [1 ]
Darrell, T [1 ]
Affiliation
[1] MIT, Cambridge, MA 02139 USA
Keywords
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
We propose a new probabilistic model of single-source multi-modal generation, and show algorithms for maximizing mutual information which find correspondences between signal components. We show a nonparametric method for finding informative subspaces that captures complex statistical relationships between different modalities. We extend a previous subspace method to include new priors on the projection weights, yielding more robust results. Applied to human speakers, our model finds a relationship between audio speech and video of facial motion, and partially segments background events in both channels. We present new results on the problem of audio-visual verification, and show how the audio and video of a speaker can be matched without a prior model of the speaker's voice or appearance.
Pages: 4104 - 4107
Number of pages: 2