UNSUPERVISED BROADCAST CONVERSATION SPEAKER ROLE LABELING

被引：12

作者：

Hutchinson, Brian ^{[1
]}

Zhang, Bin ^{[1
]}

Ostendorf, Mari ^{[1
]}

机构：

[1] Univ Washington, Dept Elect Engn, Seattle, WA 98195 USA

来源：

2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010年

关键词：

Unsupervised learning; meta-clustering; speaker role classification; broadcast conversations;

D O I：

10.1109/ICASSP.2010.5494958

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

We present an approach to unsupervised speaker role labeling in talk show data that makes use of two complementary sets of features: structural features that encode the participation patterns of speakers, and lexical features, which capture characteristic phrases. Techniques for using multiple clusterings are explored, leading to more robust results. Experiments on English and Mandarin talk shows yield performance similar to that reported for broadcast news using supervised learning.

引用

页码：5322 / 5325

页数：4

共 50 条

[1] AUTOMATIC IDENTIFICATION OF SPEAKER ROLE AND AGREEMENT/DISAGREEMENT IN BROADCAST CONVERSATION
Wang, Wen
Yaman, Sibel
Precoda, Kristin
Richey, Colleen
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5556 - 5559
[2] ROBUST SPEAKER TURN ROLE LABELING OF TV BROADCAST NEWS SHOWS
Damnati, Geraldine
Charlet, Delphine
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5684 - 5687
[3] Unsupervised training for Mandarin Broadcast News and Conversation transcription
Wang, L.
Gales, M. J. F.
Woodland, P. C.
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 353 - +
[4] Unsupervised Speaker Identification using Overlaid Texts in TV Broadcast
Poignant, Johann
Bredin, Herve
Le, Viet Bac
Besacier, Laurent
Barras, Claude
Quenot, Georges
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2649 - 2652
[5] Unsupervised Speaker Identification in TV Broadcast Based on Written Names
Poignant, Johann
Besacier, Laurent
Quenot, Georges
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (01) : 57 - 68
[6] Multi-view approach for speaker turn role labeling in TV Broadcast News shows
Damnati, Geraldine
Charlet, Delphine
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1292 - 1295
[7] Unsupervised Language Model Adaptation for Mandarin Broadcast Conversation Transcription
Mrva, David
Woodland, Philip C.
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2210 - 2213
[8] Domain Adaptation of PLDA models in Broadcast Diarization by means of Unsupervised Speaker Clustering
Vinals, Ignacio
Ortega, Alfonso
Villalba, Jesus
Miguel, Antonio
Lleida, Eduardo
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2829 - 2833
[9] Automatic speech recognition fusion approach to unsupervised speaker clustering and labeling
Lawson, A. D.
Huggins, M. C.
Grieco, J. J.
Galligan, S. A.
Harris, D. M.
2006 IEEE AEROSPACE CONFERENCE, VOLS 1-9, 2006, : 3280 - 3285
[10] Neural Unsupervised Semantic Role Labeling
Munir, Kashif
Zhao, Hai
Li, Zuchao
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (06)

← 1 2 3 4 5 →