An efficient algorithm for clustering short spoken utterances

Author
Liu, Z [1 ]
Affiliation
[1] AT&T Labs Res, Middletown, NJ 07748 USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Spoken dialogue systems that provide automated customer service at call centers are becoming more prevalent. Determining a set of call types for such a system by analyzing a large volume of unstructured spoken utterances is time consuming. The traditional hierarchical agglomerative clustering (HAC) algorithm can bootstrap the call types in an unsupervised way, yet its time and space complexities are high, especially for large data sets. Based on our observation that spoken utterances containing fewer than ten terms are common in spoken dialogue systems, we propose an efficient HAC algorithm for short utterances. By exploiting the particular properties of short utterances, we significantly reduce both the time and the space complexity of the clustering algorithm.
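For orientation, the baseline that the abstract calls traditional HAC can be sketched as below. This is a minimal illustration and not the paper's efficient algorithm, whose details the abstract does not give. The Jaccard distance on term sets, the average-linkage rule, the stopping threshold, and the names `jaccard_distance` and `naive_hac` are all assumptions made for this example.

```python
from itertools import combinations


def jaccard_distance(a, b):
    """Return 1 - |a & b| / |a | b| for two term sets (assumed distance)."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def naive_hac(utterances, threshold):
    """Baseline average-linkage HAC over whitespace-tokenized utterances.

    Hypothetical helper: merges the closest pair of clusters until the
    smallest average distance exceeds `threshold`.
    """
    terms = [frozenset(u.lower().split()) for u in utterances]
    n = len(terms)
    # Full pairwise distance matrix: quadratic space in the number of utterances.
    dist = [[jaccard_distance(terms[i], terms[j]) for j in range(n)]
            for i in range(n)]
    clusters = [[i] for i in range(n)]
    while len(clusters) > 1:
        best = None
        # Rescanning every cluster pair on each merge is what makes the
        # naive version slow on large data sets.
        for x, y in combinations(range(len(clusters)), 2):
            total = sum(dist[i][j] for i in clusters[x] for j in clusters[y])
            avg = total / (len(clusters[x]) * len(clusters[y]))
            if best is None or avg < best[0]:
                best = (avg, x, y)
        avg, x, y = best
        if avg > threshold:
            break
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]
    return [[utterances[i] for i in c] for c in clusters]


if __name__ == "__main__":
    sample = ["pay my bill", "pay bill", "check my balance", "check balance"]
    for group in naive_hac(sample, threshold=0.5):
        print(group)
```

With the four sample utterances and a threshold of 0.5, the billing requests end up in one cluster and the balance requests in another. The quadratic distance matrix and the repeated pair scan show where the cost of the unoptimized approach comes from.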
Pages: 593-596
Number of pages: 4