Unsupervised Segmentation of Audio Speech Using the Voting Experts Algorithm

Authors
Miller, Matthew [1]
Wong, Peter [1]
Stoytchev, Alexander [1]
Affiliation
[1] Iowa State Univ, Dev Robot Lab, Ames, IA 50011 USA
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Human beings have an apparently innate ability to segment continuous audio speech into words, and that ability is present in infants as young as 8 months old. This propensity towards audio segmentation seems to lay the groundwork for language learning. To artificially reproduce this ability would be both practically useful and theoretically enlightening. In this paper we propose an algorithm for the unsupervised segmentation of audio speech, based on the Voting Experts (VE) algorithm, which was originally designed to segment sequences of discrete tokens into categorical episodes. We demonstrate that our procedure is capable of inducing breaks with an accuracy substantially greater than chance, and suggest possible avenues of exploration to further increase the segmentation quality.
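The abstract does not spell out how VE operates. As a rough illustration of the discrete-token setting it refers to, the following is a simplified sketch of a Voting Experts style segmenter, not the authors' audio procedure: the step that turns audio into discrete tokens is omitted entirely. The function names, the `window` and `threshold` parameters, and the exact voting and tie-breaking rules are assumptions made for this example. Two "experts" vote on a split point inside each sliding window, one favoring frequent chunks and one favoring high entropy of the following token, and breaks are placed at local maxima of the vote count.

```python
from collections import Counter, defaultdict
from math import log2
from statistics import mean, pstdev


def ngram_stats(seq, max_len):
    """Count every n-gram of length 1..max_len and the tokens that follow it."""
    counts = Counter()
    successors = defaultdict(Counter)
    for i in range(len(seq)):
        for n in range(1, max_len + 1):
            if i + n > len(seq):
                break
            gram = seq[i:i + n]
            counts[gram] += 1
            if i + n < len(seq):
                successors[gram][seq[i + n]] += 1
    return counts, successors


def boundary_entropy(successor_counts):
    """Shannon entropy (bits) of the next-token distribution after an n-gram."""
    total = sum(successor_counts.values())
    if total == 0:
        return 0.0
    return -sum((c / total) * log2(c / total) for c in successor_counts.values())


def standardize(values_by_gram):
    """Z-score each value against other n-grams of the same length."""
    by_len = defaultdict(list)
    for gram, value in values_by_gram.items():
        by_len[len(gram)].append(value)
    stats = {n: (mean(vs), pstdev(vs)) for n, vs in by_len.items()}
    result = {}
    for gram, value in values_by_gram.items():
        mu, sd = stats[len(gram)]
        result[gram] = (value - mu) / sd if sd > 0 else 0.0
    return result


def voting_experts(seq, window=3, threshold=1):
    """Segment a string (or tuple) of discrete tokens; simplified sketch."""
    counts, successors = ngram_stats(seq, window)
    freq_z = standardize({g: float(c) for g, c in counts.items()})
    ent_z = standardize({g: boundary_entropy(successors[g]) for g in counts})

    # votes[k] counts votes for a break between seq[k-1] and seq[k].
    votes = [0] * (len(seq) + 1)
    for start in range(len(seq) - window + 1):
        win = seq[start:start + window]
        splits = range(1, window)
        # Frequency expert: prefer the split whose two parts are most frequent.
        best_freq = max(splits, key=lambda k: freq_z[win[:k]] + freq_z[win[k:]])
        # Entropy expert: prefer the split after the least predictable prefix.
        best_ent = max(splits, key=lambda k: ent_z[win[:k]])
        votes[start + best_freq] += 1
        votes[start + best_ent] += 1

    # Break at strict local maxima of the vote count that exceed the threshold.
    cuts = [
        k for k in range(1, len(seq))
        if votes[k] > threshold and votes[k] > votes[k - 1] and votes[k] > votes[k + 1]
    ]
    bounds = [0] + cuts + [len(seq)]
    return [seq[a:b] for a, b in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    text = "thecatthedogthecatthedogthecat"
    print(voting_experts(text, window=4, threshold=2))
```

The segments always concatenate back to the input, so the output can be checked for consistency even when the induced breaks are imperfect. Applying this idea to speech would additionally require mapping the audio signal to a sequence of discrete symbols, which this sketch does not attempt.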
Pages: 138-143
Page count: 6