Automatic dialog act segmentation and classification in multiparty meetings

被引：0

作者：

Ang, J

Liu, Y

Shriberg, E

机构：

来源：

2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING | 2005年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We explore the two related tasks of dialog act (DA) segmentation and DA classification for speech from the ICSI Meeting Corpus. We employ simple lexical and prosodic knowledge sources, and compare results for human-transcribed versus automatically recognized words. Since there is little previous work on DA segmentation and classification in the meeting domain, our study provides baseline performance rates for both tasks. We introduce a range of metrics for use in evaluation, each of which measures different aspects of interest. Results show that both tasks are difficult, particularly for a fully automatic system. We find that a very simple prosodic model aids performance over lexical information alone, especially for segmentation. Both tasks, but particularly word-based segmentation, are degraded by word recognition errors. Finally, while classification results for meeting data show some similarities to previous results for telephone conversations, findings also suggest a potential difference with respect to the effect of modeling DA context.

引用

页码：1061 / 1064

页数：4

共 50 条

[41] OPTIMIZING NEURAL NETWORK HYPERPARAMETERS WITH GAUSSIAN PROCESSES FOR DIALOG ACT CLASSIFICATION
Dernoncourt, Franck
Lee, Ji Young
[J]. 2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 406 - 413
[42] Dialog Act Classification using Acoustic and Discourse Information of MapTask Data
Julia, Fatema N.
Iftekharuddin, Khan M.
[J]. 2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 1472 - 1479
[43] Enriching text-to-speech synthesis using automatic dialog act tags
Sridhar, Vivek Kumar Rangarajan
Syrdal, Ann
Conkie, Alistair
Bangalore, Srinivas
[J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 324 - 327
[44] Interpretation of multiparty meetings the AMI and AMIDA projects
Renals, Steve
Hain, Thomas
Bourlard, Herve
[J]. 2008 HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS, 2008, : 116 - +
[45] Joint Segmentation and Classification of Dialog Acts using Conditional Random Fields
Zimmermann, Matthias
[J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 836 - 839
[46] Automatic Segmentation of Dermoscopic Images by Iterative Classification
Zortea, Maciel
Skrovseth, Stein Olav
Schopf, Thomas R.
Kirchesch, Herbert M.
Godtliebsen, Fred
[J]. INTERNATIONAL JOURNAL OF BIOMEDICAL IMAGING, 2011, 2011
[47] An automatic approach towards audio segmentation and classification
Pan, Wenjuan
Wang, Zongwu
Liu, Zhijing
[J]. PROGRESS IN INTELLIGENCE COMPUTATION AND APPLICATIONS, PROCEEDINGS, 2007, : 405 - 408
[48] Automatic Segmentation and Classification of Resistors in Digital Images
Muminovic, Mia
Sokic, Emir
[J]. 2019 XXVII INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND AUTOMATION TECHNOLOGIES (ICAT 2019), 2019,
[49] SEGMENTATION AS A PREPROCESSING TOOL FOR AUTOMATIC GRAPEVINE CLASSIFICATION
Carneiro, Gabriel Antonio
Padua, Luis
Peres, Emanuel
Morais, Raul
Sousa, Joaquim J.
Cunha, Antonio
[J]. 2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 6053 - 6056
[50] Automatic segmentation and classification of mice ultrasonic vocalizations
Pessoa, Diogo
Petrella, Lorena
Martins, Pedro
Castelo-Branco, Miguel
Teixeira, Cesar
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2022, 152 (01): : 266 - 280

← 1 2 3 4 5 →