Level of interest sensing in spoken dialog using decision-level fusion of acoustic and lexical evidence

被引:5
|
作者
Jeon, Je Hun [1 ]
Xia, Rui [1 ]
Liu, Yang [1 ]
机构
[1] Univ Texas Dallas, Dept Comp Sci, Richardson, TX 75083 USA
来源
COMPUTER SPEECH AND LANGUAGE | 2014年 / 28卷 / 02期
关键词
Level of interest; Decision-level fusion; Human machine interaction; EMOTION RECOGNITION;
D O I
10.1016/j.csl.2013.09.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic detection of a user's interest in spoken dialog plays an important role in many applications, such as tutoring systems and customer service systems. In this study, we propose a decision-level fusion approach using acoustic and lexical information to accurately sense a user's interest at the utterance level. Our system consists of three parts: acoustic/prosodic model, lexical model, and a model that combines their decisions for the final output. We use two different regression algorithms to complement each other for the acoustic model. For lexical information, in addition to the bag-of-words model, we propose new features including a level-of-interest value for each word, length information using the number of words, estimated speaking rate, silence in the utterance, and similarity with other utterances. We also investigate the effectiveness of using more automatic speech recognition (ASR) hypotheses (n-best lists) to extract lexical features. The outputs from the acoustic and lexical models are combined at the decision level. Our experiments show that combining acoustic evidence with lexical information improves level-of-interest detection performance, even when lexical features are extracted from ASR output with high word error rate. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:420 / 433
页数:14
相关论文
共 50 条
  • [1] Level of Interest Sensing in Spoken Dialog Using Multi-level Fusion of Acoustic and Lexical Evidence
    Jeon, Je Hun
    Xia, Rui
    Liu, Yang
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2806 - 2809
  • [2] Remote Sensing Scene Classification Based on Decision-Level Fusion
    Li, Xiaobin
    Jiang, Bitao
    Sun, Tong
    Wang, Shengjin
    PROCEEDINGS OF 2018 IEEE 4TH INFORMATION TECHNOLOGY AND MECHATRONICS ENGINEERING CONFERENCE (ITOEC 2018), 2018, : 393 - 397
  • [3] Gait Recognition System using Decision-Level Fusion
    Lee, Byungyun
    Hong, Sungjun
    Lee, Heesung
    Kim, Euntai
    ICIEA 2010: PROCEEDINGS OF THE 5TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOL 1, 2010, : 336 - 339
  • [4] Decision-level fusion for vehicle detection
    Sun, Zehang
    Bebis, George
    Bourbakis, Nikolaos
    PROCEEDING OF THE 11TH WSEAS INTERNATIONAL CONFERENCE ON COMPUTERS: COMPUTER SCIENCE AND TECHNOLOGY, VOL 4, 2007, : 622 - +
  • [5] Decision-level fusion in fingerprint verification
    Prabhakar, S
    Jain, AK
    PATTERN RECOGNITION, 2002, 35 (04) : 861 - 874
  • [6] Hybrid Fusion for Biometrics: Combining Score-level and Decision-level Fusion
    Tao, Qian
    Veldhuis, Raymond
    2008 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, VOLS 1-3, 2008, : 1144 - 1149
  • [7] Multimodal decision-level fusion for person authentication
    Chatzis, V
    Bors, AG
    Pitas, I
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 1999, 29 (06): : 674 - 680
  • [8] Multimodal Emotion Recognition Framework Using a Decision-Level Fusion and Feature-Level Fusion Approach
    Devi, C. Akalya
    Renuka, D.
    IETE JOURNAL OF RESEARCH, 2023, 69 (12) : 8909 - 8920
  • [9] A Bayesian framework for ATR decision-level fusion experiments
    Morgan, Douglas R.
    Ross, Timothy D.
    MULTISENSOR, MULTISOURCE INFORMATION FUSION: ARCHITECTURES, ALGORITHMS, AND APPLICATIONS 2007, 2007, 6571
  • [10] Comparison between Decision-Level and Feature-Level Fusion of Acoustic and Linguistic Features for Spontaneous Emotion Recognition
    Planet, Santiago
    Iriondo, Ignasi
    SISTEMAS Y TECNOLOGIAS DE INFORMACION, VOLS 1 AND 2, 2012, : 199 - 204