Search in speech, language identification and speaker recognition in Speech@FIT

被引：0

作者：

Cernocky, Jan ^{[1
]}

Burget, Lukas ^{[1
]}

Schwarz, Petr ^{[1
]}

Matejka, Pavel ^{[1
]}

Karafiat, Martin ^{[1
]}

Glembek, Ondrej ^{[1
]}

Kopecky, Jiri ^{[1
]}

Szoeke, Igor ^{[1
]}

Fapso, Michal ^{[1
]}

Grezl, Frantisek ^{[1
]}

Hubeika, Valiantsina ^{[1
]}

Oparin, Ilya ^{[1
]}

机构：

[1] Brno Univ Technol, Speech FIT, Dept Comp Graph & Multimedia, Fac Informat Technol, Brno, Czech Republic

来源：

2007 17TH INTERNATIONAL CONFERENCE RADIOELEKTRONIKA, VOLS 1 AND 2 | 2007年

关键词：

speech recognition; speaker recognition; language identification; spoken term detection; Speech@FIT;

D O I：

暂无

中图分类号：

TN [电子技术、通信技术];

学科分类号：

0809 ;

摘要：

This paper describes "search in speech" techniques developed in the Speech@FIT research group at FIT BUT in the last couple of years. It concentrates on spoken term detection (STD) and presents our system for NIST STD 2006 evaluations in detail. It also briefly mentions our systems for speaker and language recognition.

引用

页码：132 / +

页数：2

共 50 条

[41] A unified system for multilingual speech recognition and language identification
Liu, Danyang
Xu, Ji
Zhang, Pengyuan
Yan, Yonghong
[J]. SPEECH COMMUNICATION, 2021, 127 : 17 - 28
[42] Enhancing multilingual recognition of emotion in speech by language identification
Sagha, Hesam
Matejka, Pavel
Gavryukova, Maryna
Povolny, Filip
Marchi, Erik
Schuller, Bjoern
[J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2949 - 2953
[43] Streaming Multi-talker Speech Recognition with Joint Speaker Identification
Lu, Liang
Kanda, Naoyuki
Li, Jinyu
Gong, Yifan
[J]. INTERSPEECH 2021, 2021, : 1782 - 1786
[44] Speaker De-identification using Diphone Recognition and Speech Synthesis
Justin, Tadej
Struc, Vitomir
Dobrisek, Simon
Vesnicer, Bostjan
Ipsic, Ivo
Mihelic, France
[J]. 2015 11TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG): DE-IDENTIFICATION FOR PRIVACY PROTECTION IN MULTIMEDIA (DEID 2015), VOL 4, 2015,
[45] SPEAKER RECOGNITION SPEECH CHARACTERISTICS SPEECH EVALUATION AND MODIFICATION OF SPEECH SIGNAL - A SELECTED BIBLIOGRAPHY
HOLMGREN, GL
[J]. IEEE TRANSACTIONS ON AUDIO AND ELECTROACOUSTICS, 1966, AU14 (01): : 32 - &
[46] Effect of Various Visual Speech Units on Language Identification Using Visual Speech Recognition
Brahme, Aparna
Bhadade, Umesh
[J]. INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2020, 20 (04)
[47] Speaker Independent Isolated Speech Recognition System for Tamil Language using HMM
Vimala, C.
Radha, V.
[J]. INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY AND SYSTEM DESIGN 2011, 2012, 30 : 1097 - 1102
[48] Multi-cultural speech emotion recognition using language and speaker cues
Pandey, Sandeep Kumar
Shekhawat, Hanumant Singh
Prasanna, S. R. M.
[J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 83
[49] Robust several-speaker speech recognition with highly dependable online speaker adaptation and identification
Shih, Po-Yi
Lin, Po-Chuan
Wang, Jhing-Fa
Lin, Yuan-Ning
[J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2011, 34 (05) : 1459 - 1467
[50] Speaker identification utilizing noncontemporary speech
Hollien, H
Schwartz, R
[J]. JOURNAL OF FORENSIC SCIENCES, 2001, 46 (01) : 63 - 67

← 1 2 3 4 5 →