Noisy Phoneme Recognition Using 2D Convolution Neural Network

Authors
Ramonaite, Justina [1]
Korvel, Grazina [1]
Affiliations
[1] Vilnius Univ, Inst Data Sci & Digital Technol, Vilnius, Lithuania
Keywords
speech recognition; convolutional neural network; spectrograms; mel spectrograms
DOI
10.1109/AIEEE58915.2023.10134866
Chinese Library Classification
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Speech is one of the most important parts of everyday life and has been investigated from various standpoints; however, there is still room for exploration of noisy speech signals. This study examines how speech signals are recognized in the presence of noise by running recognition on both clean speech and speech with additive noise. Spectrograms and Mel Spectrograms were extracted and tested using a Convolutional Neural Network. Two training regimes were considered: training on noise-free data, and training on mixed data composed of clean and noisy phoneme signals. The experimental results showed that a model trained on a set that includes noisy samples classifies noisy signals better than a model trained only on noise-free data. The results also indicated that Mel Spectrograms represent noisy signals better than Spectrograms.
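The abstract mentions speech data with additive noise but does not give the noise type or level. As an illustration only, the following minimal sketch shows one common way to mix a clean signal with noise at a chosen signal-to-noise ratio. The function names, the 10 dB level, the Gaussian noise, and the synthetic tone standing in for a phoneme segment are all assumptions, not details taken from the paper.

```python
import math
import random


def signal_power(x):
    """Mean squared amplitude of a sequence of samples."""
    return sum(s * s for s in x) / len(x)


def add_noise_at_snr(clean, noise, snr_db):
    """Return clean + scaled noise so that the mix has the requested SNR in dB.

    Hypothetical helper: the SNR level and noise source are illustrative
    assumptions; the abstract does not state which were used.
    """
    if len(clean) != len(noise):
        raise ValueError("clean and noise must have the same length")
    p_clean = signal_power(clean)
    p_noise = signal_power(noise)
    if p_noise == 0:
        raise ValueError("noise must be non-zero")
    # Choose gain g so that p_clean / (g^2 * p_noise) == 10^(snr_db / 10).
    gain = math.sqrt(p_clean / (p_noise * 10 ** (snr_db / 10)))
    return [c + gain * n for c, n in zip(clean, noise)]


def measured_snr_db(clean, noisy):
    """SNR of a mix, computed from the clean signal and the residual noise."""
    residual = [y - c for c, y in zip(clean, noisy)]
    return 10 * math.log10(signal_power(clean) / signal_power(residual))


# Toy stand-in for a phoneme segment: a 200 Hz tone sampled at 16 kHz.
rng = random.Random(0)
clean = [math.sin(2 * math.pi * 200 * t / 16000) for t in range(1600)]
noise = [rng.gauss(0.0, 1.0) for _ in range(1600)]
noisy = add_noise_at_snr(clean, noise, snr_db=10)
print(round(measured_snr_db(clean, noisy), 2))
```

A mixed training set of the kind the abstract describes could then be assembled by keeping both `clean` and `noisy` versions of each segment before feature extraction.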
Pages: 6