Audiovisual Information Fusion in Human-Computer Interfaces and Intelligent Environments: A Survey

Cited by: 77

Authors:

Shivappa, Shankar T. [1]

Trivedi, Mohan Manubhai [1]

Rao, Bhaskar D. [1]

Affiliations:

[1] Univ Calif San Diego, Dept Elect & Comp Engn, La Jolla, CA 92093 USA

Source:

PROCEEDINGS OF THE IEEE | 2010, Vol. 98, No. 10

Funding:

U.S. National Science Foundation

Keywords:

Audiovisual fusion; dynamic Bayesian networks (DBNs); hidden Markov models; human activity analysis; human activity modeling; information fusion; machine learning; multimodal systems; SPEECH; RECOGNITION; IDENTIFICATION; COMBINATION; TRACKING; AUTHENTICATION; VISION; MODEL;

DOI:

10.1109/JPROC.2010.2057231

Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronic and communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

Microphones and cameras have been extensively used to observe and detect human activity and to facilitate natural modes of interaction between humans and intelligent systems. The human brain processes the audio and video modalities, extracting complementary and robust information from them. Intelligent systems with audiovisual sensors should be capable of achieving similar goals. The audiovisual information fusion strategy is a key component in designing such systems. In this paper, we exclusively survey the fusion techniques used in various audiovisual information fusion tasks. The fusion strategy used tends to depend mainly on the model, probabilistic or otherwise, used in the particular task to process sensory information to obtain higher level semantic information. The models themselves are task oriented. In this paper, we describe the fusion strategies and the corresponding models used in audiovisual tasks such as speech recognition, tracking, biometrics, affective state recognition, and meeting scene analysis. We also review the challenges, existing solutions, and unresolved or partially resolved issues in these fields. Specifically, we discuss established and upcoming work in hierarchical fusion strategies and cross-modal learning techniques, identifying these as critical areas of research in the future development of intelligent systems.


Pages: 1692-1715

Page count: 24
