Informative subspaces for audio-visual processing: High-level function from low-level fusion

被引:0
|
作者
Fisher, JW [1 ]
Darrell, T [1 ]
机构
[1] MIT, Cambridge, MA 02139 USA
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We propose anew probabilistic model of single source multi-modal generation, and show algorithms for maximizing mutual information which find correspondences between signal components. We show a nonparametric method for finding informative subspaces that captures complex statistical relationships between different modalities. We extend a previous subspace method to include new priors on the projection weights, yielding more robust results. Applied to human speakers, our model finds a relationship between audio speech and video of facial motion, and partially segments background events in both channels, We present new results on the problem of audio-visual verification, and show how the audio and video of a speaker can be matched without a prior model of the speaker's voice or appearance.
引用
收藏
页码:4104 / 4107
页数:2
相关论文
共 50 条
  • [41] From low-level features to high-level semantics: Are we bridging the gap?
    Chen, TH
    ISM 2005: SEVENTH IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, PROCEEDINGS, 2005, : 179 - 179
  • [42] Predicting early reading fluency based on preschool measures of low-level visual temporal processing: A possible mediation by high-level visual temporal processing skills
    Liu, Ningyu
    Zhao, Jing
    Huang, Chen
    Xing, Xiaopei
    Lu, Shan
    Wang, Zhengyan
    INFANT AND CHILD DEVELOPMENT, 2021, 30 (02)
  • [43] High-Level Prediction Signals in a Low-Level Area of the Macaque Face-Processing Hierarchy
    Schwiedrzik, Caspar M.
    Freiwald, Winrich A.
    NEURON, 2017, 96 (01) : 89 - +
  • [44] Low-level awareness accompanies "unconscious'' high-level processing during continuous flash suppression
    Gelbard-Sagiv, Hagar
    Faivre, Nathan
    Mudrik, Liad
    Koch, Christof
    JOURNAL OF VISION, 2016, 16 (01): : 1 - 16
  • [46] Reconciling High-Level Optimizations and Low-Level Code in LLVM
    Lee, Juneyoung
    Hur, Chung-Kil
    Jung, Ralf
    Liu, Zhengyang
    Regehr, John
    Lopes, Nuno P.
    PROCEEDINGS OF THE ACM ON PROGRAMMING LANGUAGES-PACMPL, 2018, 2
  • [47] LOW-LEVEL RADIOACTIVE-WASTES, HIGH-LEVEL RISK
    NEWMAN, A
    ENVIRONMENTAL SCIENCE & TECHNOLOGY, 1994, 28 (11) : A488 - A491
  • [48] Drawing the boundary between low-level and high-level mindreading
    de Vignemont, Frederique
    PHILOSOPHICAL STUDIES, 2009, 144 (03) : 457 - 466
  • [49] Music Genre Prediction by Low-Level and High-Level Characteristics
    Vatolkin, Igor
    Roetter, Guenther
    Weihs, Claus
    DATA ANALYSIS, MACHINE LEARNING AND KNOWLEDGE DISCOVERY, 2014, : 427 - 434
  • [50] Drawing the boundary between low-level and high-level mindreading
    Frédérique de Vignemont
    Philosophical Studies, 2009, 144 : 457 - 466