State-of-the-art in speaker recognition

被引:54
|
作者
Faundez-Zanuy, M
Monte-Moreno, E
机构
[1] Escola Univ Politecn Mataro, Barcelona 08303, Spain
[2] TALP Res Ctr, Barcelona, Spain
关键词
D O I
10.1109/MAES.2005.1432568
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Recent advances in speech technologies have produced new tools that can be used to improve the performance and flexibility of speaker recognition. While there are few degrees of freedom or alternative methods when using fingerprint or iris identification techniques, speech offers much more flexibility and different levels to perform recognition: the system can force the user to speak in a particular manner, different for each attempt to enter. Also, with voice input, the system has other degrees of freedom, such as the use of knowledge/codes that only the user knows, or dialectical/semantical traits that are difficult to forge. This paper offers an overview of the state-of-the-art in speaker recognition, with special emphasis on the pros and cons, and the current research lines. The current research lines include improved classification systems, and the use of high level information by means of probabilistic grammars. In conclusion, speaker recognition is far away from being a technology where all the possibilities have already been explored.
引用
收藏
页码:7 / 12
页数:6
相关论文
共 50 条
  • [21] Driver Intention Recognition: State-of-the-Art Review
    Vellenga, Koen
    Steinhauer, H. Joe
    Karlsson, Alexander
    Falkman, Goran
    Rhodin, Asli
    Koppisetty, Ashok Chaitanya
    IEEE OPEN JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 3 : 602 - 616
  • [22] On the Recognition Performance of BioHashing on state-of-the-art Face Recognition models
    Shahreza, Hatef Otroshi
    Hahn, Vedrana Krivokuca
    Marcel, Sebastien
    2021 IEEE INTERNATIONAL WORKSHOP ON INFORMATION FORENSICS AND SECURITY (WIFS), 2021, : 50 - 55
  • [23] State-of-the-art speaker recognition with neural network embeddings in NIST SRE18 and Speakers in the Wild evaluations
    Villalba, Jesus
    Chen, Nanxin
    Snyder, David
    Garcia-Romero, Daniel
    McCree, Alan
    Sell, Gregory
    Borgstrom, Jonas
    Garcia-Perera, Leibny Paola
    Richardson, Fred
    Dehak, Reda
    Torres-Carrasquillo, Pedro A.
    Dehak, Najim
    COMPUTER SPEECH AND LANGUAGE, 2020, 60
  • [24] State-of-the-art Speaker Recognition for Telephone and Video Speech: the JHU-MIT Submission for NIST SRE18
    Villalba, Jesus
    Chen, Nanxin
    Snyder, David
    Garcia-Romero, Daniel
    McCree, Alan
    Sell, Gregory
    Borgstrom, Jonas
    Richardson, Fred
    Shon, Suwon
    Grondin, Francois
    Dehak, Reda
    Garcia-Perera, Leibny Paola
    Povey, Daniel
    Torres-Carrasquillo, Pedro A.
    Khudanpur, Sanjeev
    Dehak, Najim
    INTERSPEECH 2019, 2019, : 1488 - 1492
  • [25] State-of-the-art of situation recognition systems for intraoperative procedures
    Junger, D.
    Frommer, S. M.
    Burgert, O.
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2022, 60 (04) : 921 - 939
  • [26] Handwritten digit recognition using state-of-the-art techniques
    Liu, CL
    Nakashima, K
    Sako, H
    Fujisawa, H
    EIGHTH INTERNATIONAL WORKSHOP ON FRONTIERS IN HANDWRITING RECOGNITION: PROCEEDINGS, 2002, : 320 - 325
  • [27] State-of-the-art of situation recognition systems for intraoperative procedures
    D. Junger
    S. M. Frommer
    O. Burgert
    Medical & Biological Engineering & Computing, 2022, 60 : 921 - 939
  • [28] Optical music recognition: state-of-the-art and open issues
    Ana Rebelo
    Ichiro Fujinaga
    Filipe Paszkiewicz
    Andre R. S. Marcal
    Carlos Guedes
    Jaime S. Cardoso
    International Journal of Multimedia Information Retrieval, 2012, 1 (3) : 173 - 190
  • [29] Handwritten digit recognition: benchmarking of state-of-the-art techniques
    Liu, CL
    Nakashima, K
    Sako, H
    Fujisawa, H
    PATTERN RECOGNITION, 2003, 36 (10) : 2271 - 2285
  • [30] A survey of state-of-the-art approaches for emotion recognition in text
    Alswaidan, Nourah
    Menai, Mohamed El Bachir
    KNOWLEDGE AND INFORMATION SYSTEMS, 2020, 62 (08) : 2937 - 2987