Audio-visual fused online context analysis toward smart meeting room

被引:0
|
作者
Dai, Peng [1 ]
Tao, Linmi [1 ]
Xu, Guangyou [1 ]
机构
[1] Tsinghua Univ, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Context-aware systems incorporate multimodal information to analyze contextual information in users' environment and provide various proactive services according to dynamic context. In this paper, a novel online context analysis framework is proposed to support context-aware computing in smart meeting room. A novel dynamic context model is presented to model human group interactions. Robust audio and visual modules are integrated for the effective processing of multimodal signals from various sensors, based on which a multi-level dynamic context reasoning mechanism is adopted for the online understanding of group interactions in meeting scenarios. Experimental results show the effectiveness of our framework.
引用
收藏
页码:868 / +
页数:2
相关论文
共 50 条
  • [1] ADVANCES IN ONLINE AUDIO-VISUAL MEETING TRANSCRIPTION
    Yoshioka, Takuya
    Abramovski, Igor
    Aksoylar, Cem
    Chen, Zhuo
    David, Moshe
    Dimitriadis, Dimitrios
    Gong, Yifan
    Gurvich, Ilya
    Huang, Xuedong
    Huang, Yan
    Hurvitz, Aviv
    Jiang, Li
    Koubi, Sharon
    Krupka, Eyal
    Leichter, Ido
    Liu, Changliang
    Parthasarathy, Partha
    Vinnikov, Alon
    Wu, Lingfeng
    Xiao, Xiong
    Xiong, Wayne
    Wang, Huaming
    Wang, Zhenghao
    Zhang, Jun
    Zhao, Yong
    Zhou, Tianyan
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 276 - 283
  • [2] Audio-visual perception of a lecturer in a smart seminar room
    Stiefelhagen, R.
    Bernardin, K.
    Ekenel, H. K.
    McDonough, J.
    Nickel, K.
    Voit, M.
    Woelfel, M.
    SIGNAL PROCESSING, 2006, 86 (12) : 3518 - 3533
  • [3] Audio-Visual Face Detection for Tracking in a Meeting Room Environment
    Barnard, Mark
    Wang, Wenwu
    Kittler, Josef
    Naqvi, Syed Mohsen
    Chambers, Jonathon
    2013 16TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2013, : 1222 - 1227
  • [4] Online Diarization of Streaming Audio-Visual Data for Smart Environments
    Schmalenstroeer, Joerg
    Haeb-Umbach, Reinhold
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2010, 4 (05) : 845 - 856
  • [5] LEARNING CONTEXTUALLY FUSED AUDIO-VISUAL REPRESENTATIONS FOR AUDIO-VISUAL SPEECH RECOGNITION
    Zhang, Zi-Qiang
    Zhang, Jie
    Zhang, Jian-Shu
    Wu, Ming-Hui
    Fang, Xin
    Dai, Li-Rong
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1346 - 1350
  • [6] AUDIO-VISUAL CONVERSATION ANALYSIS BY SMART POSTERBOARD AND HUMANOID ROBOT
    Kawahara, Tatsuya
    Inoue, Koji
    Lala, Divesh
    Takanashi, Katsuya
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6573 - 6577
  • [7] Robust multimodal audio-visual processing for advanced context awareness in smart spaces
    Pnevmatikakis, A.
    Soldatos, J.
    Talantzis, F.
    Polymenakos, L.
    PERSONAL AND UBIQUITOUS COMPUTING, 2009, 13 (01) : 3 - 14
  • [8] Robust multimodal audio-visual processing for advanced context awareness in smart spaces
    Pnevmatikakis, Aristodemos
    Soldatos, John
    Talantzis, Fotios
    Polymenakos, Lazaros
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, 2006, 204 : 290 - 301
  • [9] YouTube Movie Reviews: Sentiment Analysis in an Audio-Visual Context
    Woellmer, Martin
    Weninger, Felix
    Knaup, Tobias
    Schuller, Bjoern
    Sun, Congkai
    Sagae, Kenji
    Morency, Louis-Philippe
    IEEE INTELLIGENT SYSTEMS, 2013, 28 (03) : 46 - 53
  • [10] Management Software Development for Online Music Audio-visual
    Wang, Jian
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCES IN MECHANICAL ENGINEERING AND INDUSTRIAL INFORMATICS, 2015, 15 : 378 - 381