Audio-visual fused online context analysis toward smart meeting room

被引：0

作者：

Dai, Peng ^{[1
]}

Tao, Linmi ^{[1
]}

Xu, Guangyou ^{[1
]}

机构：

[1] Tsinghua Univ, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China

来源：

UBIQUITOUS INTELLIGENCE AND COMPUTING, PROCEEDINGS | 2007年 / 4611卷

关键词：

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Context-aware systems incorporate multimodal information to analyze contextual information in users' environment and provide various proactive services according to dynamic context. In this paper, a novel online context analysis framework is proposed to support context-aware computing in smart meeting room. A novel dynamic context model is presented to model human group interactions. Robust audio and visual modules are integrated for the effective processing of multimodal signals from various sensors, based on which a multi-level dynamic context reasoning mechanism is adopted for the online understanding of group interactions in meeting scenarios. Experimental results show the effectiveness of our framework.

引用

页码：868 / +

页数：2

共 50 条

[1] ADVANCES IN ONLINE AUDIO-VISUAL MEETING TRANSCRIPTION
Yoshioka, Takuya
Abramovski, Igor
Aksoylar, Cem
Chen, Zhuo
David, Moshe
Dimitriadis, Dimitrios
Gong, Yifan
Gurvich, Ilya
Huang, Xuedong
Huang, Yan
Hurvitz, Aviv
Jiang, Li
Koubi, Sharon
Krupka, Eyal
Leichter, Ido
Liu, Changliang
Parthasarathy, Partha
Vinnikov, Alon
Wu, Lingfeng
Xiao, Xiong
Xiong, Wayne
Wang, Huaming
Wang, Zhenghao
Zhang, Jun
Zhao, Yong
Zhou, Tianyan
2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 276 - 283
[2] Audio-visual perception of a lecturer in a smart seminar room
Stiefelhagen, R.
Bernardin, K.
Ekenel, H. K.
McDonough, J.
Nickel, K.
Voit, M.
Woelfel, M.
SIGNAL PROCESSING, 2006, 86 (12) : 3518 - 3533
[3] Audio-Visual Face Detection for Tracking in a Meeting Room Environment
Barnard, Mark
Wang, Wenwu
Kittler, Josef
Naqvi, Syed Mohsen
Chambers, Jonathon
2013 16TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2013, : 1222 - 1227
[4] Online Diarization of Streaming Audio-Visual Data for Smart Environments
Schmalenstroeer, Joerg
Haeb-Umbach, Reinhold
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2010, 4 (05) : 845 - 856
[5] LEARNING CONTEXTUALLY FUSED AUDIO-VISUAL REPRESENTATIONS FOR AUDIO-VISUAL SPEECH RECOGNITION
Zhang, Zi-Qiang
Zhang, Jie
Zhang, Jian-Shu
Wu, Ming-Hui
Fang, Xin
Dai, Li-Rong
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1346 - 1350
[6] AUDIO-VISUAL CONVERSATION ANALYSIS BY SMART POSTERBOARD AND HUMANOID ROBOT
Kawahara, Tatsuya
Inoue, Koji
Lala, Divesh
Takanashi, Katsuya
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6573 - 6577
[7] Robust multimodal audio-visual processing for advanced context awareness in smart spaces
Pnevmatikakis, A.
Soldatos, J.
Talantzis, F.
Polymenakos, L.
PERSONAL AND UBIQUITOUS COMPUTING, 2009, 13 (01) : 3 - 14
[8] Robust multimodal audio-visual processing for advanced context awareness in smart spaces
Pnevmatikakis, Aristodemos
Soldatos, John
Talantzis, Fotios
Polymenakos, Lazaros
ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, 2006, 204 : 290 - 301
[9] YouTube Movie Reviews: Sentiment Analysis in an Audio-Visual Context
Woellmer, Martin
Weninger, Felix
Knaup, Tobias
Schuller, Bjoern
Sun, Congkai
Sagae, Kenji
Morency, Louis-Philippe
IEEE INTELLIGENT SYSTEMS, 2013, 28 (03) : 46 - 53
[10] Management Software Development for Online Music Audio-visual
Wang, Jian
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCES IN MECHANICAL ENGINEERING AND INDUSTRIAL INFORMATICS, 2015, 15 : 378 - 381

← 1 2 3 4 5 →