Text-dependent and text-independent speaker recognition of reverberant speech based on CNN

被引：7

作者：

El-Moneim, Samia Abd ^{[1
]}

Sedik, Ahmed ^{[2
]}

Nassar, M. A. ^{[3
]}

El-Fishawy, Adel S. ^{[3
]}

Sharshar, A. M. ^{[3
]}

Hassan, Shaimaa E. A. ^{[3
]}

Mahmoud, Adel Zaghloul ^{[4
]}

Dessouky, Moawd I. ^{[3
]}

El-Banby, Ghada M. ^{[5
]}

El-Samie, Fathi E. Abd ^{[3
,6
]}

El-Rabaie, El-Sayed M. ^{[3
]}

Neyazi, Badawi ^{[7
]}

Seddeq, H. S. ^{[8
]}

Ismail, Nabil A. ^{[9
]}

Khalaf, Ashraf A. M. ^{[10
]}

Elabyad, G. S. M. ^{[3
]}

机构：

[1] Tanta High Inst Engn & Technol, Commun & Elect Dept, Tanta, Egypt

[2] Kafrelsheikh Univ, Fac Artificial Intelligents, Dept Robot & Intelligent Machines, Kafr Al Sheikh, Egypt

[3] Menoufia Univ, Fac Elect Engn, Dept Elect & Elect Commun & Elect, Menoufia 32952, Egypt

[4] Zagazig Univ, Elect & Commun Dept, Fac Engn, Zagazig, Egypt

[5] Menoufia Univ, Fac Elect Engn, Automat Control Dept, Menoufia, Egypt

[6] Princess Nourah Bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Informat Technol, Riyadh, Saudi Arabia

[7] Minist Ind, Prod & Vocat Training Dept, Cairo, Egypt

[8] Housing & Bldg Natl Res Ctr, Acoust Lab, Giza, Egypt

[9] Menoufia Univ, Fac Elect Engn, Dept Comp Sci & Engn, Menoufia 32952, Egypt

[10] Minia Univ, Fac Engn, Elect Engn Dept, Al Minya, Egypt

来源：

INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY | 2021年 / 24卷 / 04期

关键词：

Speaker recognition; Biometrics; CNN; Reverberation; Spectrogram; Recognition accuracy;

D O I：

10.1007/s10772-021-09805-3

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Speaker recognition is one of several biometric recognition systems owing to its high importance in numerous applications of security and telecommunications. The key aspiration of speaker recognition systems is to know who is speaking depending on voice characteristics. This paper presents an extensive study of speaker recognition in both text-dependent and text-independent cases. Convolutional Neural Network (CNN) based feature extraction is extended to the text-dependent and text-independent speaker recognition tasks. In addition, the effect of reverberation on the speaker recognition system is addressed. All speech signals are converted into images by obtaining their spectrograms. Two proposed CNN models are presented for efficient speaker recognition from clean and reverberant speech signals. They depend on image processing concepts applied on spectrograms of speech signals. One of the proposed models is compared with a conventional Benchmark model in the text-independent scenario. The performance of the recognition system is measured by the recognition rate in the cases of clean and reverberant speech.

引用

页码：993 / 1006

页数：14

共 50 条

[1] Text-dependent and text-independent speaker recognition of reverberant speech based on CNN
Samia Abd El-Moneim
Ahmed Sedik
M. A. Nassar
Adel S. El-Fishawy
A. M. Sharshar
Shaimaa E. A. Hassan
Adel Zaghloul Mahmoud
Moawd I. Dessouky
Ghada M. El-Banby
Fathi E. Abd El-Samie
El-Sayed M. El-Rabaie
Badawi Neyazi
H. S. Seddeq
Nabil A. Ismail
Ashraf A. M. Khalaf
G. S. M. Elabyad
[J]. International Journal of Speech Technology, 2021, 24 : 993 - 1006
[2] Text-Dependent Versus Text-Independent Speech Emotion Recognition
Nayak, Biswajit
Pradhan, Manoj Kumar
[J]. PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION TECHNOLOGIES, IC3T 2015, VOL 1, 2016, 379 : 153 - 161
[3] VQ score normalisation for text-dependent and text-independent speaker recognition
Finan, RA
Sapeluk, AT
Damper, RI
[J]. AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 1997, 1206 : 211 - 218
[4] A Survey on Text-Dependent and Text-Independent Speaker Verification
Tu, Youzhi
Lin, Weiwei
Mak, Man-Wai
[J]. IEEE ACCESS, 2022, 10 : 99038 - 99049
[5] EFFECTS OF GENDER INFORMATION IN TEXT-INDEPENDENT AND TEXT-DEPENDENT SPEAKER VERIFICATION
Kanervisto, Anssi
Vestman, Ville
Sahidullah, Md
Hautamaki, Ville
Kinnunen, Tomi
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5360 - 5364
[6] TEXT-INDEPENDENT SPEAKER RECOGNITION
ATAL, BS
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1972, 52 (01): : 181 - &
[7] Effects of speech coding on text-dependent speaker recognition
Phythian, M
Ingram, J
Sridharan, S
[J]. IEEE TENCON'97 - IEEE REGIONAL 10 ANNUAL CONFERENCE, PROCEEDINGS, VOLS 1 AND 2: SPEECH AND IMAGE TECHNOLOGIES FOR COMPUTING AND TELECOMMUNICATIONS, 1997, : 137 - 140
[8] Improving Robustness of Speaker Verification by Fusion of Prompted Text-Dependent and Text-Independent Operation Modalities
Mporas, Iosif
Safavi, Saeid
Sotudeh, Reza
[J]. SPEECH AND COMPUTER, 2016, 9811 : 378 - 385
[9] Effect of Spoken Text on Text-independent Speaker Recognition
Alsulaiman, Mansour
[J]. PROCEEDINGS FIFTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, MODELLING AND SIMULATION, 2014, : 279 - 284
[10] Text-dependent Speaker Recognition for Vietnamese
Diep Dao Thi Thu
Quang Nguyen Hong
Loan Trinh Van
Hung Pham Ngoc
[J]. 2013 INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2013, : 196 - 200

← 1 2 3 4 5 →