Efficient feature extraction and classification for the development of Pashto speech recognition system

被引:0
|
作者
Irfan Ahmed
Muhammad Abeer Irfan
Abid Iqbal
Amaad Khalil
Salman Ilahi Siddiqui
机构
[1] University of Engineering and Technology Peshawar,Department of Electrical Engineering
[2] Jalozai Campus,Department of Computer Systems Engineering
[3] University of Engineering and Technology Peshawar,undefined
来源
关键词
Automatic speech recognition (ASR); Machine learning (ML); Feature extraction; MFCC; DWT; SVM; -NN;
D O I
暂无
中图分类号
学科分类号
摘要
In this work, a novel framework for the efficient feature extraction and recognition of Pashto speech signals is proposed. The targeted language is one of the low-resource languages and prone to higher Automatic Speech Recognition (ASR) errors due to the availability of its colloquial dialects. We devised a framework which not only employed classical Machine Learning (ML) models for speech recognition tasks, but also achieved a higher level of performance accuracy by using the optimal feature extraction techniques. The designed frameworks for feature extraction are based on two well-know feature extraction techniques: Discrete Wavelet Transform (DWT )coefficients and Mel-Frequency Cepstral Coefficients (MFCC). In our work, we deployed classical ML models i.e., Support Vector Machine (SVM) and K-Nearest Neighbors (k-NN), due to their efficiency in terms of computation complexity, energy efficiency, and higher accuracy as compared to other ML and Deep Learning (DL) model. Hence, our proposed framework exhibited improved performance level when trained on a Pashto isolated words dataset.
引用
下载
收藏
页码:54081 / 54096
页数:15
相关论文
共 50 条
  • [1] Efficient feature extraction and classification for the development of Pashto speech recognition system
    Ahmed, Irfan
    Irfan, Muhammad Abeer
    Iqbal, Abid
    Khalil, Amaad
    Siddiqui, Salman Ilahi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (18) : 54081 - 54096
  • [2] The Development of Isolated Words Pashto Automatic Speech Recognition System
    Ahmed, Irfan
    Ahmad, Nasir
    Ali, Hazrat
    Ahmad, Gulzar
    PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING (ICAC 12), 2012, : 333 - 336
  • [3] Efficient Feature Extraction Algorithms to Develop an Arabic Speech Recognition System
    Alasadi, Abdulmalik A.
    Adhyani, Theyazn H. H.
    Deshmukh, Ratnadeep R.
    Alahmadi, Ahmed H.
    Alshebami, Ali Saleh
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2020, 10 (02) : 5547 - 5553
  • [4] A Review of Feature Extraction and Classification Techniques in Speech Recognition
    Yadav S.
    Kumar A.
    Yaduvanshi A.
    Meena P.
    SN Computer Science, 4 (6)
  • [5] Efficient feature extraction, encoding and classification for action recognition
    Kantorov, Vadim
    Laptev, Ivan
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 2593 - 2600
  • [6] Feature Extraction Analysis on Indonesian Speech Recognition System
    Wisesty, Untari N.
    Adiwijaya
    Astuti, Widi
    2015 3rd International Conference on Information and Communication Technology (ICoICT), 2015, : 54 - 58
  • [7] Efficient Feature Extraction for Emotion Recognition System
    Lynn, May Mon
    Su, Chaw
    Maw, Kyi Kyi
    2018 4TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2018,
  • [8] Visual Speech Recognition: a solution from feature extraction to words classification
    Da Silveira, L
    Facon, J
    Borges, DL
    XVI BRAZILIAN SYMPOSIUM ON COMPUTER GRAPHICS AND IMAGE PROCESSING, PROCEEDINGS, 2003, : 399 - 405
  • [9] Arabic Speech Recognition Using MFCC Feature Extraction and ANN Classification
    Wahyuni, Elvira Sukma
    2017 2ND INTERNATIONAL CONFERENCES ON INFORMATION TECHNOLOGY, INFORMATION SYSTEMS AND ELECTRICAL ENGINEERING (ICITISEE): OPPORTUNITIES AND CHALLENGES ON BIG DATA FUTURE INNOVATION, 2017, : 22 - 25
  • [10] Speech recognition as feature extraction for speaker recognition
    Stolcke, A.
    Shriberg, E.
    Ferrer, L.
    Kajarekar, S.
    Sonmez, K.
    Tur, G.
    2007 IEEE WORKSHOP ON SIGNAL PROCESSING APPLICATIONS FOR PUBLIC SECURITY AND FORENSICS, 2007, : 39 - +