Modification in Sequential Dynamic Time Warping for Fast Computation of Query-by-Example Spoken Term Detection Task

被引：0

作者：

Madhavi, Maulik C. ^{[1
]}

Patil, Hemant A. ^{[1
]}

机构：

[1] DA IICT, Gandhinagar 382007, Gujarat, India

来源：

2016 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM) | 2016年

关键词：

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Query-by-Example Spoken Term Detection (QbE-STD) under low-resource settings, is the task of retrieval which can be done via the example of an audio. The searching phase involves highly computationally intensive Dynamic TimeWarping (DTW)-based matching techniques. Search space reduction is an important need in order to reduce the space of searching and hence, reduce the computational complexity. In this paper, to perform DTW in a faster mode, the average of consecutive features is considered without overlapping. Much of the information is lost during feature reduction process. For instance, the posterior features on either side of phone boundaries exhibit characteristics. Hence, one such loss might be introduced due to the merging of feature vectors in the vicinity of phoneme boundaries. To overcome this, we perform merging of features after considering the phoneme boundaries (detected using spectral transition measure). The QbE-STD task is performed on MediaEval SWS 2013 dataset. The presented approach reduces the computation time by 46.15% to 49.16 % with very low-performance degradation, i.e., 0.017-0.023 in Maximum Term Weight Value (MTWV) with respect to no feature reduction.

引用

页数：5

共 50 条

[1] Segmented Dynamic Time Warping for Spoken Query-by-Example Search
Proenca, Jorge
Perdigao, Fernando
[J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 750 - 754
[2] Query-by-Example Retrieval via Fast Sequential Dynamic Time Warping Algorithm
Vavrek, Jozef
Viszlay, Peter
Kiktova, Eva
Lojka, Martin
Juhar, Jozef
Cizmar, Anton
[J]. 2015 38TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2015,
[3] Query-by-example spoken term detection based on phonetic posteriorgram Query-by-example spoken term detection based on phonetic posteriorgram
Song, Beili
Zhang, Wei-Qiang
Cai, Meng
Liu, Jia
Johnson, Michael T.
[J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON EDUCATION, MANAGEMENT AND COMPUTING TECHNOLOGY, 2015, 30 : 1255 - 1260
[4] A Fast Query-by-Example Spoken Term Detection for Zero Resource Languages
Pandia, Karthik D. S.
Saranya, M. S.
Murthy, Hema A.
[J]. 2016 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2016,
[5] Query-by-Example Spoken Term Detection using Frequency Domain Linear Prediction and Non-Segmental Dynamic Time Warping
Mantena, Gautam
Achanta, Sivanand
Prahallad, Kishore
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (05) : 946 - 955
[6] Analysis of Constraints on Segmental DTW for the Task of Query-by-Example Spoken Term Detection
Dumpala, Harsha
Alluri, K. N. R. K. Raju
Gangashetty, Suryakanth V.
Vuppala, Anil Kumar
[J]. 2015 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2015,
[7] Query-by-Example Spoken Term Detection For OOV Terms
Parada, Carolina
Sethy, Abhinav
Ramabhadran, Bhuvana
[J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 404 - +
[8] A Comparison of Query-by-Example Methods for Spoken Term Detection
Shen, Wade
White, Christopher M.
Hazen, Timothy J.
[J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2107 - 2110
[9] Query-by-Example Spoken Term Detection Using Bessel Features
Vasudev, Drisya
Gangashetty, Suryakanth V.
Babu, Anish K. K.
Riyas, K. S.
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, INFORMATICS, COMMUNICATION AND ENERGY SYSTEMS (SPICES), 2015,
[10] ALBAYZIN Query-by-example Spoken Term Detection 2016 evaluation
Javier Tejedor
Doroteo T. Toledano
Paula Lopez-Otero
Laura Docio-Fernandez
Jorge Proença
Fernando Perdigão
Fernando García-Granada
Emilio Sanchis
Anna Pompili
Alberto Abad
[J]. EURASIP Journal on Audio, Speech, and Music Processing, 2018

← 1 2 3 4 5 →