ALBAYZIN Query-by-example Spoken Term Detection 2016 evaluation

被引:0
|
作者
Javier Tejedor
Doroteo T. Toledano
Paula Lopez-Otero
Laura Docio-Fernandez
Jorge Proença
Fernando Perdigão
Fernando García-Granada
Emilio Sanchis
Anna Pompili
Alberto Abad
机构
[1] Universidad San Pablo-CEU,Escuela Politécnica Superior
[2] CEU Universities,Multimedia Technologies Group (GTM), AtlantTIC Research Center, E. E. Telecomunicación
[3] Campus de Montepríncipe,Instituto de Telecomunicações, Department of Electrical and Computer Engineering
[4] AuDIaS,ELiRF
[5] Universidad Autónoma de Madrid, Departament de Sistemes Informàtics i Computació
[6] Universidade da Coruña,L2F
[7] IRLab, Spoken Language Systems Lab, INESC
[8] CITIC,ID, IST
[9] Campus Universitario de Vigo, Instituto Superior Técnico
[10] s/n,undefined
[11] University of Coimbra,undefined
[12] Universitat Politècnica de València,undefined
[13] University of Lisbon,undefined
关键词
Query-by-example Spoken Term Detection; International evaluation; Spanish; Search on spontaneous speech;
D O I
暂无
中图分类号
学科分类号
摘要
Query-by-example Spoken Term Detection (QbE STD) aims to retrieve data from a speech repository given an acoustic (spoken) query containing the term of interest as the input. This paper presents the systems submitted to the ALBAYZIN QbE STD 2016 Evaluation held as a part of the ALBAYZIN 2016 Evaluation Campaign at the IberSPEECH 2016 conference. Special attention was given to the evaluation design so that a thorough post-analysis of the main results could be carried out. Two different Spanish speech databases, which cover different acoustic and language domains, were used in the evaluation: the MAVIR database, which consists of a set of talks from workshops, and the EPIC database, which consists of a set of European Parliament sessions in Spanish. We present the evaluation design, both databases, the evaluation metric, the systems submitted to the evaluation, the results, and a thorough analysis and discussion. Four different research groups participated in the evaluation, and a total of eight template matching-based systems were submitted. We compare the systems submitted to the evaluation and make an in-depth analysis based on some properties of the spoken queries, such as query length, single-word/multi-word queries, and in-language/out-of-language queries.
引用
收藏
相关论文
共 50 条
  • [41] Acoustic Word Embedding System for Code-Switching Query-by-example Spoken Term Detection
    Ma, Murong
    Wu, Haiwei
    Wang, Xuyang
    Yang, Lin
    Wang, Junjie
    Li, Ming
    2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2021,
  • [42] CNN-based Bottleneck Feature for Noise Robust Query-by-Example Spoken Term Detection
    Lim, Hyungjun
    Kim, Younggwan
    Kim, Yoonhoe
    Kim, Hoirin
    2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 1237 - 1240
  • [43] Capturing Indian Phonemic Diversity with Multiple Posteriorgrams for Multilingual Query-by-Example Spoken Term Detection
    Popli, Abhimanyu
    Kumar, Arun
    2017 TWENTY-THIRD NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2017,
  • [44] UNSUPERVISED QUERY-BY-EXAMPLE SPOKEN TERM DETECTION USING SEGMENT-BASED BAG OF ACOUSTIC WORDS
    George, Basil
    Yegnanarayana, B.
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [45] QUERY-BY-EXAMPLE SPOKEN TERM DETECTION USING ATTENTION-BASED MULTI-HOP NETWORKS
    Ao, Chia-Wei
    Lee, Hung-yi
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6264 - 6268
  • [46] Modification in Sequential Dynamic Time Warping for Fast Computation of Query-by-Example Spoken Term Detection Task
    Madhavi, Maulik C.
    Patil, Hemant A.
    2016 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2016,
  • [47] Comparison of Methods for Language-Dependent and Language-Independent Query-by-Example Spoken Term Detection
    Tejedor, Javier
    Fapso, Michal
    Szoeke, Igor
    Cernocky, Jan 'Honza'
    Grezl, Frantisek
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2012, 30 (03)
  • [48] USE OF ARTICULATORY BOTTLE-NECK FEATURES FOR QUERY-BY-EXAMPLE SPOKEN TERM DETECTION IN LOW RESOURCE SCENARIOS
    Mantena, Gautam
    Prahallad, Kishore
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [49] Vocal Tract Length Normalization using a Gaussian mixture model framework for query-by-example spoken term detection
    Madhavi, Maulik C.
    Patil, Hemant A.
    COMPUTER SPEECH AND LANGUAGE, 2019, 58 : 175 - 202
  • [50] DOUBLE-LAYER NEIGHBORHOOD GRAPH BASED SIMILARITY SEARCH FOR FAST QUERY-BY-EXAMPLE SPOKEN TERM DETECTION
    Aoyama, Kazuo
    Ogawa, Atsunori
    Hattori, Takashi
    Hori, Takaaki
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5216 - 5220