Modification in Sequential Dynamic Time Warping for Fast Computation of Query-by-Example Spoken Term Detection Task

被引:0
|
作者
Madhavi, Maulik C. [1 ]
Patil, Hemant A. [1 ]
机构
[1] DA IICT, Gandhinagar 382007, Gujarat, India
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Query-by-Example Spoken Term Detection (QbE-STD) under low-resource settings, is the task of retrieval which can be done via the example of an audio. The searching phase involves highly computationally intensive Dynamic TimeWarping (DTW)-based matching techniques. Search space reduction is an important need in order to reduce the space of searching and hence, reduce the computational complexity. In this paper, to perform DTW in a faster mode, the average of consecutive features is considered without overlapping. Much of the information is lost during feature reduction process. For instance, the posterior features on either side of phone boundaries exhibit characteristics. Hence, one such loss might be introduced due to the merging of feature vectors in the vicinity of phoneme boundaries. To overcome this, we perform merging of features after considering the phoneme boundaries (detected using spectral transition measure). The QbE-STD task is performed on MediaEval SWS 2013 dataset. The presented approach reduces the computation time by 46.15% to 49.16 % with very low-performance degradation, i.e., 0.017-0.023 in Maximum Term Weight Value (MTWV) with respect to no feature reduction.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Segmented Dynamic Time Warping for Spoken Query-by-Example Search
    Proenca, Jorge
    Perdigao, Fernando
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 750 - 754
  • [2] Query-by-Example Retrieval via Fast Sequential Dynamic Time Warping Algorithm
    Vavrek, Jozef
    Viszlay, Peter
    Kiktova, Eva
    Lojka, Martin
    Juhar, Jozef
    Cizmar, Anton
    [J]. 2015 38TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2015,
  • [3] Query-by-example spoken term detection based on phonetic posteriorgram Query-by-example spoken term detection based on phonetic posteriorgram
    Song, Beili
    Zhang, Wei-Qiang
    Cai, Meng
    Liu, Jia
    Johnson, Michael T.
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON EDUCATION, MANAGEMENT AND COMPUTING TECHNOLOGY, 2015, 30 : 1255 - 1260
  • [4] A Fast Query-by-Example Spoken Term Detection for Zero Resource Languages
    Pandia, Karthik D. S.
    Saranya, M. S.
    Murthy, Hema A.
    [J]. 2016 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2016,
  • [5] Query-by-Example Spoken Term Detection using Frequency Domain Linear Prediction and Non-Segmental Dynamic Time Warping
    Mantena, Gautam
    Achanta, Sivanand
    Prahallad, Kishore
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (05) : 946 - 955
  • [6] Analysis of Constraints on Segmental DTW for the Task of Query-by-Example Spoken Term Detection
    Dumpala, Harsha
    Alluri, K. N. R. K. Raju
    Gangashetty, Suryakanth V.
    Vuppala, Anil Kumar
    [J]. 2015 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2015,
  • [7] Query-by-Example Spoken Term Detection For OOV Terms
    Parada, Carolina
    Sethy, Abhinav
    Ramabhadran, Bhuvana
    [J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 404 - +
  • [8] A Comparison of Query-by-Example Methods for Spoken Term Detection
    Shen, Wade
    White, Christopher M.
    Hazen, Timothy J.
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2107 - 2110
  • [9] Query-by-Example Spoken Term Detection Using Bessel Features
    Vasudev, Drisya
    Gangashetty, Suryakanth V.
    Babu, Anish K. K.
    Riyas, K. S.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, INFORMATICS, COMMUNICATION AND ENERGY SYSTEMS (SPICES), 2015,
  • [10] ALBAYZIN Query-by-example Spoken Term Detection 2016 evaluation
    Javier Tejedor
    Doroteo T. Toledano
    Paula Lopez-Otero
    Laura Docio-Fernandez
    Jorge Proença
    Fernando Perdigão
    Fernando García-Granada
    Emilio Sanchis
    Anna Pompili
    Alberto Abad
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2018