UNSUPERVISED BROADCAST CONVERSATION SPEAKER ROLE LABELING

被引:12
|
作者
Hutchinson, Brian [1 ]
Zhang, Bin [1 ]
Ostendorf, Mari [1 ]
机构
[1] Univ Washington, Dept Elect Engn, Seattle, WA 98195 USA
关键词
Unsupervised learning; meta-clustering; speaker role classification; broadcast conversations;
D O I
10.1109/ICASSP.2010.5494958
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We present an approach to unsupervised speaker role labeling in talk show data that makes use of two complementary sets of features: structural features that encode the participation patterns of speakers, and lexical features, which capture characteristic phrases. Techniques for using multiple clusterings are explored, leading to more robust results. Experiments on English and Mandarin talk shows yield performance similar to that reported for broadcast news using supervised learning.
引用
收藏
页码:5322 / 5325
页数:4
相关论文
共 50 条
  • [41] Speaker diarization: From broadcast news to lectures
    Zhu, X.
    Barras, C.
    Lamel, L.
    Gauvain, J-L.
    MACHINE LEARNING FOR MULTIMODAL INTERACTION, 2006, 4299 : 396 - +
  • [42] Study on Integration of Speaker Diarization with Speaker Adaptive Speech Recognition for Broadcast Transcription
    Silovsky, Jan
    Cerva, Petr
    Zdansky, Jindrich
    Nouza, Jan
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 478 - 481
  • [43] Unsupervised speaker adaptation for speaker independent acoustic to articulatory speech inversion
    Sivaraman, Ganesh
    Mitra, Vikramjit
    Nam, Hosung
    Tiede, Mark
    Espy-Wilson, Carol
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 146 (01): : 316 - 329
  • [44] CONVERSATION MEMORY - THE EFFECTS OF SPEAKER STATUS ON MEMORY FOR THE ASSERTIVENESS OF CONVERSATION REMARKS
    HOLTGRAVES, T
    SRULL, TK
    SOCALL, D
    JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, 1989, 56 (02) : 149 - 160
  • [45] Unsupervised Labeling of noun clusters
    Jickels, Theresa
    Kondrak, Grzegorz
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4013 : 278 - 287
  • [46] Speech, Speaker and Speaker's Gender Identification in Automatically Processed Broadcast Stream
    Silovsky, Jan
    Nouza, Jan
    RADIOENGINEERING, 2006, 15 (03) : 42 - 48
  • [48] The token "yeah" in nonnative speaker English conversation
    Wong, J
    RESEARCH ON LANGUAGE AND SOCIAL INTERACTION, 2000, 33 (01) : 39 - 67
  • [49] Partitioning of Two-Speaker Conversation Datasets
    Vaquero, Carlos
    Ortega, Alfonso
    Lleida, Eduardo
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 392 - 395
  • [50] Investigating Morphological Decomposition for Transcription of Arabic Broadcast News and Broadcast Conversation Data
    Lamel, Lori
    Messaoudi, Abdel.
    Gauvain, Jean-Luc
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1429 - 1432