Automatic dialog act segmentation and classification in multiparty meetings

Authors
Ang, J
Liu, Y
Shriberg, E
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We explore the two related tasks of dialog act (DA) segmentation and DA classification for speech from the ICSI Meeting Corpus. We employ simple lexical and prosodic knowledge sources, and compare results for human-transcribed versus automatically recognized words. Since there is little previous work on DA segmentation and classification in the meeting domain, our study provides baseline performance rates for both tasks. We introduce a range of metrics for use in evaluation, each of which measures different aspects of interest. Results show that both tasks are difficult, particularly for a fully automatic system. We find that a very simple prosodic model aids performance over lexical information alone, especially for segmentation. Both tasks, but particularly word-based segmentation, are degraded by word recognition errors. Finally, while classification results for meeting data show some similarities to previous results for telephone conversations, findings also suggest a potential difference with respect to the effect of modeling DA context.
Pages: 1061-1064
Page count: 4