Incremental Diarization of Telephone Conversations

被引:0
|
作者
Ben-Harush, Oshiy [1 ]
Lapidot, Itshak [2 ]
Guterman, Hugo [1 ]
机构
[1] Ben Gurion Univ Negev, Dept Elect & Comp Engn, POB 653, IL-84105 Beer Sheva, Israel
[2] Sami Shamoon Coll Engn, Dept Elect & Elect Engn, Ashdod, Israel
关键词
SPEAKER DIARIZATION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Speaker diarization systems attempt segmentation and labeling of a conversation between R speakers, while no prior information is given regarding the conversation. Most state of the art diarization systems require the full body of the conversation data prior to the application of some diarization approach. However, for some applications such as forensics, which handles vast amount of data, an on-line or incremental diarization is of high importance. For that purpose, a two-stage incremental diarization of telephone conversations algorithm is suggested. On the first stage, a fully unsupervised diarization algorithm is applied over an initial training segment from the conversation. The second-stage is composed of time-series clustering of increments of the conversation. Applying incremental diarization over 1802 telephone conversations from NIST 2005 SER generated an increase in diarization error of approximately 2% compared to the diarization error of an off-line diarization system.
引用
收藏
页码:2226 / +
页数:2
相关论文
共 50 条
  • [41] A Study of the Cosine Distance-Based Mean Shift for Telephone Speech Diarization
    Senoussaoui, Mohammed
    Kenny, Patrick
    Stafylakis, Themos
    Dumouchel, Pierre
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (01) : 217 - 227
  • [42] A study of the cosine distance-based mean shift for telephone speech diarization
    Senoussaoui, Mohammed
    Kenny, Patrick
    Stafylakis, Themos
    Dumouchel, Pierre
    [J]. IEEE Transactions on Audio, Speech and Language Processing, 2014, 22 (01): : 217 - 227
  • [43] CONVOLUTIONAL NEURAL NETWORK FOR SPEAKER CHANGE DETECTION IN TELEPHONE SPEAKER DIARIZATION SYSTEM
    Hruz, Marek
    Zajic, Zbynek
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 4945 - 4949
  • [44] Investigation of Segmentation in i-Vector Based Speaker Diarization of Telephone Speech
    Zajic, Zbynek
    Kunesova, Marie
    Radova, Vlasta
    [J]. SPEECH AND COMPUTER, 2016, 9811 : 411 - 418
  • [45] Inferring social nature of conversations from words: Experiments on a corpus of everyday telephone conversations
    Stark, Anthony
    Shafran, Izhak
    Kaye, Jeffrey
    [J]. COMPUTER SPEECH AND LANGUAGE, 2014, 28 (01): : 224 - 239
  • [46] On interaction behaviour in telephone conversations under transmission delay
    Schoenenberg, Katrin
    Raake, Alexander
    Egger, Sebastian
    Schatz, Raimund
    [J]. SPEECH COMMUNICATION, 2014, 63-64 : 1 - 14
  • [47] Couples, contentious conversations, mobile telephone use and driving
    Lansdown, Terry C.
    Stephens, Amanda N.
    [J]. ACCIDENT ANALYSIS AND PREVENTION, 2013, 50 : 416 - 422
  • [48] Speaker Diarization using Eye-gaze Information in Multi-party Conversations
    Inoue, Koji
    Wakabayashi, Yukoh
    Yoshimoto, Hiromasa
    Kawahara, Tatsuya
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 562 - 566
  • [49] Intonational settings as markers of discourse units in telephone conversations
    Douglas-Cowie, E
    Cowie, R
    [J]. LANGUAGE AND SPEECH, 1998, 41 : 351 - 374
  • [50] Faces on the phone: Facial expressivity during telephone conversations
    Hess, U
    Murard, N
    Bourgeois, P
    Cheung, N
    [J]. PSYCHOPHYSIOLOGY, 2001, 38 : S50 - S50