Search in speech, language identification and speaker recognition in Speech@FIT

Cited by: 0
Authors
Cernocky, Jan [1]
Burget, Lukas [1]
Schwarz, Petr [1]
Matejka, Pavel [1]
Karafiat, Martin [1]
Glembek, Ondrej [1]
Kopecky, Jiri [1]
Szoeke, Igor [1]
Fapso, Michal [1]
Grezl, Frantisek [1]
Hubeika, Valiantsina [1]
Oparin, Ilya [1]
Affiliations
[1] Brno Univ Technol, Speech FIT, Dept Comp Graph & Multimedia, Fac Informat Technol, Brno, Czech Republic
Keywords
speech recognition; speaker recognition; language identification; spoken term detection; Speech@FIT
DOI
Not available
Chinese Library Classification (CLC) number
TN [Electronic technology, communication technology]
Discipline classification code
0809
Abstract
This paper describes "search in speech" techniques developed in the Speech@FIT research group at FIT BUT in the last couple of years. It concentrates on spoken term detection (STD) and presents our system for NIST STD 2006 evaluations in detail. It also briefly mentions our systems for speaker and language recognition.
Pages: 132 / +
Page count: 2
Related papers
50 in total
  • [41] A unified system for multilingual speech recognition and language identification
    Liu, Danyang
    Xu, Ji
    Zhang, Pengyuan
    Yan, Yonghong
    [J]. SPEECH COMMUNICATION, 2021, 127 : 17 - 28
  • [42] Enhancing multilingual recognition of emotion in speech by language identification
    Sagha, Hesam
    Matejka, Pavel
    Gavryukova, Maryna
    Povolny, Filip
    Marchi, Erik
    Schuller, Bjoern
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2949 - 2953
  • [43] Streaming Multi-talker Speech Recognition with Joint Speaker Identification
    Lu, Liang
    Kanda, Naoyuki
    Li, Jinyu
    Gong, Yifan
    [J]. INTERSPEECH 2021, 2021, : 1782 - 1786
  • [44] Speaker De-identification using Diphone Recognition and Speech Synthesis
    Justin, Tadej
    Struc, Vitomir
    Dobrisek, Simon
    Vesnicer, Bostjan
    Ipsic, Ivo
    Mihelic, France
    [J]. 2015 11TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG): DE-IDENTIFICATION FOR PRIVACY PROTECTION IN MULTIMEDIA (DEID 2015), VOL 4, 2015,
  • [45] SPEAKER RECOGNITION SPEECH CHARACTERISTICS SPEECH EVALUATION AND MODIFICATION OF SPEECH SIGNAL - A SELECTED BIBLIOGRAPHY
    HOLMGREN, GL
    [J]. IEEE TRANSACTIONS ON AUDIO AND ELECTROACOUSTICS, 1966, AU14 (01): : 32 - &
  • [46] Effect of Various Visual Speech Units on Language Identification Using Visual Speech Recognition
    Brahme, Aparna
    Bhadade, Umesh
    [J]. INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2020, 20 (04)
  • [47] Speaker Independent Isolated Speech Recognition System for Tamil Language using HMM
    Vimala, C.
    Radha, V.
    [J]. INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY AND SYSTEM DESIGN 2011, 2012, 30 : 1097 - 1102
  • [48] Multi-cultural speech emotion recognition using language and speaker cues
    Pandey, Sandeep Kumar
    Shekhawat, Hanumant Singh
    Prasanna, S. R. M.
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 83
  • [49] Robust several-speaker speech recognition with highly dependable online speaker adaptation and identification
    Shih, Po-Yi
    Lin, Po-Chuan
    Wang, Jhing-Fa
    Lin, Yuan-Ning
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2011, 34 (05) : 1459 - 1467
  • [50] Speaker identification utilizing noncontemporary speech
    Hollien, H
    Schwartz, R
    [J]. JOURNAL OF FORENSIC SCIENCES, 2001, 46 (01) : 63 - 67