Audio analysis for multimedia retrieval from a ubiquitous home

被引:0
|
作者
de Silva, Gamhewage C. [1 ]
Yamasaki, Toshihiko [1 ]
Aizawa, Kiyoharu [1 ]
机构
[1] Univ Tokyo, Dept Informat & Commun Engn, Bunkyo Ku, Tokyo 1138656, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a system for video retrieval based on analyzing audio data from a large number of microphones in a home-like environment. Silence elimination on individual microphones is followed by noise reduction based on regions consisting of multiple microphones, to identify audio segments. An algorithm based on the energy distribution of sounds in the house is used to localize sound sources, thereby removing sounds heard in regions other than they are generated. A set of time domain features are used to classify these sounds for video retrieval. The algorithms were evaluated with 200 minutes of audio data from each microphone, gathered during an experiment where a family lived in the ubiquitous home. It was possible to achieve an overall accuracy of above 80% from all algorithms.
引用
收藏
页码:466 / 476
页数:11
相关论文
共 50 条
  • [41] Multimedia performance of a ubiquitous processor
    Fukase, Masa-aki
    Noda, Kazunori
    Takeda, Hirolki
    Sato, Tomoaki
    [J]. 2007 INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES, VOLS 1-3, 2007, : 1464 - +
  • [42] Audio keywords discovery for text-like audio content analysis and retrieval
    Lu, Lie
    Hanjalic, Alan
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2008, 10 (01) : 74 - 85
  • [43] From Knowledge Visualization Techniques to Trends in Ubiquitous Multimedia Computing
    Lee, Maria R.
    Chen, Tsung Teng
    [J]. INTERNATIONAL SYMPOSIUM ON UBIQUITOUS MULTIMEDIA COMPUTING, PROCEEDINGS, 2008, : 73 - +
  • [44] Approaching Multimedia Retrieval from a Polyrepresentative Perspective
    Zellhoefer, David
    Schmitt, Ingo
    [J]. ADAPTIVE MULTIMEDIA RETRIEVAL: CONTEXT, EXPLORATION, AND FUSION, 2012, 6817 : 46 - 60
  • [45] Virtual bass for home entertainment, multimedia PC, game station and portable audio systems
    Gan, WS
    Kuo, SM
    Toh, CW
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2001, 47 (04) : 787 - 794
  • [46] C-iUMS:: Context based smart and secure multimedia service in intelligent ubiquitous home
    Park, Jong Hyuk
    Lee, Sangjin
    Hong, Sung Hee
    [J]. EMERGING DIRECTIONS IN EMBEDDED AND UBIQUITOUS COMPUTING, 2006, 4097 : 660 - 670
  • [47] Extended symbolic projection for content-based indexing and retrieval of audio, video, and multimedia documents
    Arndt, T
    Guercio, A
    [J]. STORAGE AND RETRIEVAL FOR IMAGE AND VIDEO DATABASES V, 1997, 3022 : 417 - 426
  • [48] TOWARDS A UNIVERSAL REPRESENTATION FOR AUDIO INFORMATION RETRIEVAL AND ANALYSIS
    Jensen, Bjorn Sand
    Troelsgaard, Rasmus
    Larsen, Jan
    Hansen, Lars Kai
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 3168 - 3172
  • [49] Searching for multimedia: analysis of audio, video and image Web queries
    Jansen B.J.
    Goodrum A.
    Spink A.
    [J]. World Wide Web, 2000, 3 (04) : 249 - 254
  • [50] Multimedia content analysis - Using both audio and visual clues
    Wang, Y
    Liu, Z
    Huang, JC
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2000, 17 (06) : 12 - 36