Search in speech, language identification and speaker recognition in Speech@FIT

被引:0
|
作者
Cernocky, Jan [1 ]
Burget, Lukas [1 ]
Schwarz, Petr [1 ]
Matejka, Pavel [1 ]
Karafiat, Martin [1 ]
Glembek, Ondrej [1 ]
Kopecky, Jiri [1 ]
Szoeke, Igor [1 ]
Fapso, Michal [1 ]
Grezl, Frantisek [1 ]
Hubeika, Valiantsina [1 ]
Oparin, Ilya [1 ]
机构
[1] Brno Univ Technol, Speech FIT, Dept Comp Graph & Multimedia, Fac Informat Technol, Brno, Czech Republic
关键词
speech recognition; speaker recognition; language identification; spoken term detection; Speech@FIT;
D O I
暂无
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
This paper describes "search in speech" techniques developed in the Speech@FIT research group at FIT BUT in the last couple of years. It concentrates on spoken term detection (STD) and presents our system for NIST STD 2006 evaluations in detail. It also briefly mentions our systems for speaker and language recognition.
引用
收藏
页码:132 / +
页数:2
相关论文
共 50 条
  • [1] SPEAKER IDENTIFICATION AND MESSAGE IDENTIFICATION IN SPEECH RECOGNITION
    GARVIN, PL
    LADEFOGED, P
    [J]. PHONETICA, 1963, 9 (04) : 193 - 199
  • [2] Speech fragment decoding techniques for simultaneous speaker identification and speech recognition
    Barker, Jon
    Ma, Ning
    Coy, Andre
    Cooke, Martin
    [J]. COMPUTER SPEECH AND LANGUAGE, 2010, 24 (01): : 94 - 111
  • [3] Continuous Speech Recognition and Identification of the Speaker System
    Guffanti, Diego
    Martinez, Danilo
    Paladines, Jose
    Sarmiento, Andrea
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY & SYSTEMS (ICITS 2018), 2018, 721 : 767 - 776
  • [4] Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers
    Kanda, Naoyuki
    Gaur, Yashesh
    Wang, Xiaofei
    Meng, Zhong
    Chen, Zhuo
    Zhou, Tianyan
    Yoshioka, Takuya
    [J]. INTERSPEECH 2020, 2020, : 36 - 40
  • [5] A Study on the Search of the Most Discriminative Speech Features in the Speaker Dependent Speech Emotion Recognition
    Pao, Tsang-Long
    Wang, Chun-Hsiang
    Li, Yu-Ji
    [J]. 2012 FIFTH INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING (PAAP), 2012, : 157 - 162
  • [6] Bayesian networks in multimodal speech recognition and speaker identification
    Nefian, AV
    Liang, LH
    [J]. CONFERENCE RECORD OF THE THIRTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 2003, : 2004 - 2008
  • [7] Speaker identification and speech recognition using phased arrays
    Xu, Roger
    Mei, Gang
    Ren, ZuBing
    Kwan, Chiman
    Aube, Julien
    Rochet, Cedrick
    Stanford, Vincent
    [J]. AMBIENT INTELLIGENCE IN EVERDAY LIFE, 2006, 3864 : 227 - 238
  • [8] An Integrated Approach to Robust Speaker Identification and Speech Recognition
    Kwan, C.
    Yin, J.
    Ayhan, B.
    Chu, S.
    Liu, X.
    Puckett, K.
    Zhao, Y.
    Ho, K. C.
    Kruger, M.
    Sityar, I.
    [J]. 2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 1635 - +
  • [9] An Automatic Speech Recognition Solution with Speaker Identification Support
    Buzo, Andi
    Cucu, Horia
    Petrica, Lucian
    Burileanu, Dragos
    Burileanu, Corneliu
    [J]. 2014 10TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS (COMM), 2014,
  • [10] Unified System for Visual Speech Recognition and Speaker Identification
    Rekik, Ahmed
    Ben-Hamadou, Achraf
    Mahdi, Walid
    [J]. ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, ACIVS 2015, 2015, 9386 : 381 - 390