Segmented Dynamic Time Warping for Spoken Query-by-Example Search

Cited by: 5
Authors
Proenca, Jorge [1]
Perdigao, Fernando
Affiliations
[1] Univ Coimbra, Inst Telecomunicacoes, Coimbra, Portugal
Keywords
Query-by-example; Spoken term detection; Dynamic Time Warping; TERM DETECTION
DOI
10.21437/Interspeech.2016-1276
Chinese Library Classification (CLC) number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
This paper describes a low-resource approach to a Query-by-Example task, in which spoken queries must be matched against a large dataset of spoken documents, sometimes in complex or non-exact ways. Our approach tackles these complex match cases by using Dynamic Time Warping to obtain alternative paths that account for reordering of words, small extra content and small lexical variations. We also report advances in the calibration and fusion of sub-systems that improve overall results, such as manipulating the score distribution per query and using an average posteriorgram distance matrix as an extra sub-system. Results are evaluated on the MediaEval task of Query-by-Example Search on Speech (QUESST). For this task, the language of the audio being searched is almost irrelevant, which brings the use case close to that of a very low-resource language. For that reason, we use as features the posterior probabilities obtained from five phonetic recognizers trained on five different languages.
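The matching step mentioned in the abstract can be illustrated with a minimal sketch. It shows plain subsequence DTW over a posteriorgram distance matrix, plus element-wise averaging of several such matrices (one per recognizer). It is not the paper's segmented variant, which the abstract does not specify in detail. The function names, the negative-log inner-product frame distance, the three-move step pattern and the path-length normalization are all assumptions made for illustration.

```python
import math

def frame_distance(p, q, eps=1e-10):
    """-log of the inner product of two posterior vectors (assumed distance)."""
    dot = sum(a * b for a, b in zip(p, q))
    return -math.log(max(dot, eps))

def distance_matrix(query, doc):
    """Rows index query frames, columns index document frames."""
    return [[frame_distance(q, d) for d in doc] for q in query]

def average_matrices(matrices):
    """Element-wise mean of same-shaped distance matrices."""
    n = len(matrices)
    rows, cols = len(matrices[0]), len(matrices[0][0])
    return [[sum(m[i][j] for m in matrices) / n for j in range(cols)]
            for i in range(rows)]

def subsequence_dtw(dist):
    """Best match of the whole query anywhere in the document.

    Returns (length-normalized cost, start frame, end frame).
    """
    n, m = len(dist), len(dist[0])
    cost = [[math.inf] * m for _ in range(n)]
    length = [[0] * m for _ in range(n)]
    start = [[0] * m for _ in range(n)]
    # A path may begin at any document frame at no extra cost.
    for j in range(m):
        cost[0][j], length[0][j], start[0][j] = dist[0][j], 1, j
    for i in range(1, n):
        for j in range(m):
            candidates = [(i - 1, j)]
            if j > 0:
                candidates += [(i, j - 1), (i - 1, j - 1)]
            # Extend the predecessor with the lowest accumulated cost.
            pi, pj = min(candidates, key=lambda c: cost[c[0]][c[1]])
            cost[i][j] = cost[pi][pj] + dist[i][j]
            length[i][j] = length[pi][pj] + 1
            start[i][j] = start[pi][pj]
    # The path may end at any document frame; pick the best normalized cost.
    end = min(range(m), key=lambda j: cost[n - 1][j] / length[n - 1][j])
    return cost[n - 1][end] / length[n - 1][end], start[n - 1][end], end

# Toy posteriorgrams with two phone classes (made-up values).
query = [[1.0, 0.0], [0.0, 1.0]]
doc = [[0.5, 0.5], [0.9, 0.1], [0.1, 0.9], [0.5, 0.5]]
score, start_frame, end_frame = subsequence_dtw(distance_matrix(query, doc))
```

On this toy input the query aligns with document frames 1 to 2. With several recognizers, one would compute one distance matrix per recognizer and pass the result of `average_matrices` to `subsequence_dtw` as an additional sub-system, in the spirit of what the abstract describes.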
Pages: 750-754
Number of pages: 5
Related papers
50 in total
  • [21] A STAGE MATCH FOR QUERY-BY-EXAMPLE SPOKEN TERM DETECTION BASED ON STRUCTURE INFORMATION OF QUERY
    Zhan, Junyao
    He, Qianhua
    Su, Jianbin
    Li, Yanxiong
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6833 - 6837
  • [22] ACOUSTIC SPAN EMBEDDINGS FOR MULTILINGUAL QUERY-BY-EXAMPLE SEARCH
    Hu, Yushi
    Settle, Shane
    Livescu, Karen
    [J]. 2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 935 - 942
  • [23] PRIVACY-PRESERVING QUERY-BY-EXAMPLE SPEECH SEARCH
    Portelo, Jose
    Abad, Alberto
    Raj, Bhiksha
    Trancoso, Isabel
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 1797 - 1801
  • [24] MEMORY EFFICIENT SUBSEQUENCE DTW FOR QUERY-BY-EXAMPLE SPOKEN TERM DETECTION
    Anguera, Xavier
    Ferrarons, Miquel
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2013), 2013,
  • [25] Unsupervised Query-by-example spoken term detection based on DPHMM tokenizer
    Cao Jiankai
    Zhang Lianhai
    [J]. 2017 IEEE 2ND ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2017, : 1321 - 1325
  • [26] Query-by-Example Spoken Term Detection using Attentive Pooling Networks
    Zhang, Kun
    Wu, Zhiyong
    Jia, Jia
    Meng, Helen
    Song, Binheng
    [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1267 - 1272
  • [27] AN ACOUSTIC SEGMENT MODELING APPROACH TO QUERY-BY-EXAMPLE SPOKEN TERM DETECTION
    Wang, Haipeng
    Leung, Cheung-Chi
    Lee, Tan
    Ma, Bin
    Li, Haizhou
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5157 - 5160
  • [28] A Fast Query-by-Example Spoken Term Detection for Zero Resource Languages
    Pandia, Karthik D. S.
    Saranya, M. S.
    Murthy, Hema A.
    [J]. 2016 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2016,
  • [29] EFFECTIVE UTILIZATION OF MULTIPLE EXAMPLES IN QUERY-BY-EXAMPLE SPOKEN TERM DETECTION
    Xu, Ji
    Zhang, Ge
    Yan, Yonghong
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5440 - 5444
  • [30] MedlineQBE (query-by-example)
    Bernstam, E
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2001, : 47 - 51