UNSUPERVISED BROADCAST CONVERSATION SPEAKER ROLE LABELING

被引：12

作者：

Hutchinson, Brian ^{[1
]}

Zhang, Bin ^{[1
]}

Ostendorf, Mari ^{[1
]}

机构：

[1] Univ Washington, Dept Elect Engn, Seattle, WA 98195 USA

来源：

2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010年

关键词：

Unsupervised learning; meta-clustering; speaker role classification; broadcast conversations;

D O I：

10.1109/ICASSP.2010.5494958

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

We present an approach to unsupervised speaker role labeling in talk show data that makes use of two complementary sets of features: structural features that encode the participation patterns of speakers, and lexical features, which capture characteristic phrases. Techniques for using multiple clusterings are explored, leading to more robust results. Experiments on English and Mandarin talk shows yield performance similar to that reported for broadcast news using supervised learning.

引用

页码：5322 / 5325

页数：4

共 50 条

[41] Speaker diarization: From broadcast news to lectures
Zhu, X.
Barras, C.
Lamel, L.
Gauvain, J-L.
MACHINE LEARNING FOR MULTIMODAL INTERACTION, 2006, 4299 : 396 - +
[42] Study on Integration of Speaker Diarization with Speaker Adaptive Speech Recognition for Broadcast Transcription
Silovsky, Jan
Cerva, Petr
Zdansky, Jindrich
Nouza, Jan
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 478 - 481
[43] Unsupervised speaker adaptation for speaker independent acoustic to articulatory speech inversion
Sivaraman, Ganesh
Mitra, Vikramjit
Nam, Hosung
Tiede, Mark
Espy-Wilson, Carol
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 146 (01): : 316 - 329
[44] CONVERSATION MEMORY - THE EFFECTS OF SPEAKER STATUS ON MEMORY FOR THE ASSERTIVENESS OF CONVERSATION REMARKS
HOLTGRAVES, T
SRULL, TK
SOCALL, D
JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, 1989, 56 (02) : 149 - 160
[45] Unsupervised Labeling of noun clusters
Jickels, Theresa
Kondrak, Grzegorz
ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4013 : 278 - 287
[46] Speech, Speaker and Speaker's Gender Identification in Automatically Processed Broadcast Stream
Silovsky, Jan
Nouza, Jan
RADIOENGINEERING, 2006, 15 (03) : 42 - 48
[47] Native speaker/non-native speaker conversation and the negotiation of comprehensible input
LONG, MH
APPLIED LINGUISTICS, 1983, 4 (02) : 126 - 141
[48] The token "yeah" in nonnative speaker English conversation
Wong, J
RESEARCH ON LANGUAGE AND SOCIAL INTERACTION, 2000, 33 (01) : 39 - 67
[49] Partitioning of Two-Speaker Conversation Datasets
Vaquero, Carlos
Ortega, Alfonso
Lleida, Eduardo
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 392 - 395
[50] Investigating Morphological Decomposition for Transcription of Arabic Broadcast News and Broadcast Conversation Data
Lamel, Lori
Messaoudi, Abdel.
Gauvain, Jean-Luc
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1429 - 1432

← 1 2 3 4 5 →