Level of interest sensing in spoken dialog using decision-level fusion of acoustic and lexical evidence

Cited by: 5
Authors
Jeon, Je Hun [1]
Xia, Rui [1]
Liu, Yang [1]
Affiliations
[1] Univ Texas Dallas, Dept Comp Sci, Richardson, TX 75083 USA
Source
COMPUTER SPEECH AND LANGUAGE | 2014 / Vol. 28 / Issue 02
Keywords
Level of interest; Decision-level fusion; Human machine interaction; Emotion recognition
DOI
10.1016/j.csl.2013.09.005
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Automatic detection of a user's interest in spoken dialog plays an important role in many applications, such as tutoring systems and customer service systems. In this study, we propose a decision-level fusion approach using acoustic and lexical information to accurately sense a user's interest at the utterance level. Our system consists of three parts: acoustic/prosodic model, lexical model, and a model that combines their decisions for the final output. We use two different regression algorithms to complement each other for the acoustic model. For lexical information, in addition to the bag-of-words model, we propose new features including a level-of-interest value for each word, length information using the number of words, estimated speaking rate, silence in the utterance, and similarity with other utterances. We also investigate the effectiveness of using more automatic speech recognition (ASR) hypotheses (n-best lists) to extract lexical features. The outputs from the acoustic and lexical models are combined at the decision level. Our experiments show that combining acoustic evidence with lexical information improves level-of-interest detection performance, even when lexical features are extracted from ASR output with high word error rate. © 2013 Elsevier Ltd. All rights reserved.
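To make the idea of decision-level fusion concrete, here is a minimal sketch in Python. It is not the authors' system: the abstract does not say which model combines the acoustic and lexical outputs, so the weighted average below, the grid search over its weight, and all names (`fuse`, `pick_weight`) and toy scores are assumptions chosen for illustration only.

```python
from typing import Sequence


def fuse(acoustic: float, lexical: float, weight: float) -> float:
    """Combine two per-utterance interest scores.

    ASSUMPTION: a simple weighted average stands in for the fusion model;
    the abstract does not specify the actual combination method.
    """
    return weight * acoustic + (1.0 - weight) * lexical


def mean_squared_error(pred: Sequence[float], gold: Sequence[float]) -> float:
    """Average squared difference between predictions and reference scores."""
    return sum((p - g) ** 2 for p, g in zip(pred, gold)) / len(gold)


def pick_weight(
    acoustic: Sequence[float],
    lexical: Sequence[float],
    gold: Sequence[float],
    steps: int = 10,
) -> float:
    """Grid-search the fusion weight in [0, 1] on held-out utterances."""
    best_w, best_err = 0.0, float("inf")
    for i in range(steps + 1):
        w = i / steps
        fused = [fuse(a, l, w) for a, l in zip(acoustic, lexical)]
        err = mean_squared_error(fused, gold)
        if err < best_err:
            best_w, best_err = w, err
    return best_w


# Toy, made-up scores for three held-out utterances (not data from the paper).
acoustic_scores = [0.2, 0.8, 0.5]
lexical_scores = [0.4, 0.6, 0.7]
gold_scores = [0.3, 0.7, 0.6]

w = pick_weight(acoustic_scores, lexical_scores, gold_scores)
fused_scores = [fuse(a, l, w) for a, l in zip(acoustic_scores, lexical_scores)]
```

The point of the sketch is only that each model is trained and scored separately, and the combination operates on their outputs rather than on a merged feature vector; a learned regressor over the two scores would be an equally valid stand-in.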
Pages: 420-433
Page count: 14