Distinctive Phonetic Features Modeling and Extraction Using Deep Neural Networks

被引:7
|
作者
Seddiq, Yasser [1 ]
Alotaibi, Yousef A. [2 ]
Selouani, Sid-Ahmed [3 ]
Meftah, Ali Hamid [2 ]
机构
[1] KACST, Riyadh 11442, Saudi Arabia
[2] King Saud Univ, Coll Comp & Informat Sci, Riyadh 4545, Saudi Arabia
[3] Univ Moncton, LARIHS Lab, Shippegan, NB E8S 1P6, Canada
关键词
Modern standard Arabic; distinctive phonetic features; speech processing; deep belief networks; restricted Boltzmann machine;
D O I
10.1109/ACCESS.2019.2924014
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature extraction is a critical stage of digital speech processing systems. Quality of features is of great importance to provide a solid foundation upon which the subsequent stages stand. Distinctive phonetic features (DPFs) are one of the most representative features of the speech signals. The significance of DPFs is in their ability to provide abstract description of the places and manners of articulation of the language phonemes. A phoneme's DPF element reflects unique articulatory information about that phoneme. Therefore, there is a need to discover and investigate each DPF element individually in order to achieve a deeper understanding and to come up with a descriptive model for each one. Such fine-grained modeling will satisfy the uniqueness of each DPF element. In this paper, the problem of DPF modeling and extraction of modern standard Arabic is tackled. Due to the remarkable success of deep neural networks (DNNs) that are initialized using deep belief networks (DBNs) in serving DSP applications and its capability of extracting highly representative features from the raw data, we exploit its modeling power to investigate and model the DPF elements. DNN models are compared with the classical multilayer perceptron (MLP) models. The representativeness of several acoustic cues for different DPF elements was also measured. This paper is based on formalizing DPF modeling problem as a binary classification problem. Because the DPF elements are highly imbalanced data, evaluating the quality of models is a very tricky process. This paper addresses the proper evaluation measures satisfying the imbalanced nature of the DPF elements. After modeling each element individually, the two top-level DPF extractors are designed: MLP- and DNN-based extractors. The results show the quality of DNN models and their superiority over MLPs with accuracies of 89.0% and 86.7%, respectively.
引用
收藏
页码:81382 / 81396
页数:15
相关论文
共 50 条
  • [41] Recognition of Arabic phonetic features using neural networks and knowledge-based system: a comparative study
    Selouani, SA
    Caelen, J
    [J]. IEEE INTERNATIONAL JOINT SYMPOSIA ON INTELLIGENCE AND SYSTEMS - PROCEEDINGS, 1998, : 404 - 411
  • [42] Network traffic classification using deep convolutional recurrent autoencoder neural networks for spatial-temporal features extraction
    D'Angelo, Gianni
    Palmieri, Francesco
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2021, 173
  • [43] Modeling the Biocatalytic Method of Lipid Extraction Using Artificial Neural Networks
    Shafrai, Anton V.
    Prosekov, Alexander Yu.
    Vechtomova, Elena A.
    [J]. INFORMATION, 2023, 14 (08)
  • [44] A Canonicalization of Distinctive Phonetic Features to Improve Arabic Speech Recognition
    Alotaibi, Yousef A.
    Selouani, Sidh-Amed
    Yakoub, Mohammed Sidi
    Seddiq, Yasser Mohammed
    Meftah, Ali
    [J]. ACTA ACUSTICA UNITED WITH ACUSTICA, 2019, 105 (06) : 1269 - 1277
  • [45] Extraction of the Sivers function with deep neural networks
    Fernando, I. P.
    Keller, D.
    [J]. PHYSICAL REVIEW D, 2023, 108 (05)
  • [46] Deep neural networks for climate relation extraction
    Zheng, J.
    Wang, J.
    Chen, S.
    Li, J.
    Chen, Y.
    Li, B.
    [J]. GLOBAL NEST JOURNAL, 2021, 23 (04): : 544 - 549
  • [47] Deep neural networks for Arabic information extraction
    Saadi, Abdelhalim
    Belhadef, Hacene
    [J]. SMART AND SUSTAINABLE BUILT ENVIRONMENT, 2020, 9 (04) : 467 - 482
  • [48] Discriminative Feature Extraction with Deep Neural Networks
    Stuhlsatz, Andre
    Lippel, Jens
    Zielke, Thomas
    [J]. 2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [49] On Correlation of Features Extracted by Deep Neural Networks
    Ayinde, Babajide O.
    Inane, Tamer
    Zurada, Jacek M.
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [50] How transferable are features in deep neural networks?
    Yosinski, Jason
    Clune, Jeff
    Bengio, Yoshua
    Lipson, Hod
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27