Unsupervised Segmentation of Audio Speech Using the Voting Experts Algorithm

被引：0

作者：

Miller, Matthew ^{[1
]}

Wong, Peter ^{[1
]}

Stoytchev, Alexander ^{[1
]}

机构：

[1] Iowa State Univ, Dev Robot Lab, Ames, IA 50011 USA

来源：

ARTIFICIAL GENERAL INTELLIGENCE PROCEEDINGS | 2009年 / 8卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Human beings have an apparently innate ability to segment continuous audio speech into words, and that ability is present in infants as young as 8 months old. This propensity towards audio segmentation seems to lay the groundwork for language learning. To artificially reproduce this ability would be both practically useful and theoretically enlightening. In this paper we propose an algorithm for the unsupervised segmentation of audio speech, based on the Voting Experts (VE) algorithm, which was originally designed to segment sequences of discrete tokens into categorical episodes. We demonstrate that our procedure is capable of inducing breaks with an accuracy substantially greater than chance, and suggest possible avenues Of exploration to further increase the segmentation quality.

引用

页码：138 / 143

页数：6

共 50 条

[1] Hierarchical Voting Experts: An Unsupervised Algorithm for Hierarchical Sequence Segmentation
Miller, Matthew
Stoytchev, Alexander
[J]. 2008 IEEE 7TH INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING, 2008, : 186 - 191
[2] Voting experts: An unsupervised algorithm for segmenting sequences
Cohen, Paul
Adams, Niall
Heeringa, Brent
[J]. INTELLIGENT DATA ANALYSIS, 2007, 11 (06) : 607 - 625
[3] A speaker based unsupervised speech segmentation algorithm used in conversational speech
Chen, Yanxiang
Wang, Qiong
[J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, 2007, 4798 : 396 - +
[4] A Hybrid Unsupervised Segmentation Algorithm for Arabic Speech Using Feature Fusion and a Genetic Algorithm (July 2018)
Absa, Ahmed Hamdi Abo
Deriche, Mohamed
Elshafei-Ahmed, Moustafa
Elhadj, Yahya Mohamed
Juang, Biing-Hwang
[J]. IEEE ACCESS, 2018, 6 : 43157 - 43169
[5] AN UNSUPERVISED AUDIO SEGMENTATION METHOD USING BAYESIAN INFORMATION CRITERION
Ozan, Ezgi Can
Tankiz, Seda
Acar, Banu Oskay
Ciloglu, Tolga
[J]. 2014 6TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING (ISCCSP), 2014, : 640 - 643
[6] AUDIO SEGMENTATION FOR SPEECH RECOGNITION USING SEGMENT FEATURES
Rybach, David
Gollan, Christian
Schlueter, Ralf
Ney, Hermann
[J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4197 - 4200
[7] An unsupervised audio segmentation and classification approach
Pan, Wenjuan
Yao, Yong
Liu, Zhijing
[J]. FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 3, PROCEEDINGS, 2007, : 303 - 306
[8] Using spatial audio cues from speech excitation for meeting speech segmentation
Cheng, Eva
Burnett, Ian
Ritz, Christian
[J]. 2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 3067 - +
[9] Unsupervised audio segmentation using extended Baum-Welch transformations
Sainath, Tara N.
Kanevsky, Dimitri
Iyengar, Giridharan
[J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PTS 1-3, PROCEEDINGS, 2007, : 209 - +
[10] Indoor/Outdoor Audio Classification using Foreground Speech Segmentation
Khonglah, Banriskhem K.
Deepak, K. T.
Prasanna, S. R. Mahadeva
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 464 - 468

← 1 2 3 4 5 →