Distinctive Phonetic Features Modeling and Extraction Using Deep Neural Networks

被引：7

作者：

Seddiq, Yasser ^{[1
]}

Alotaibi, Yousef A. ^{[2
]}

Selouani, Sid-Ahmed ^{[3
]}

Meftah, Ali Hamid ^{[2
]}

机构：

[1] KACST, Riyadh 11442, Saudi Arabia

[2] King Saud Univ, Coll Comp & Informat Sci, Riyadh 4545, Saudi Arabia

[3] Univ Moncton, LARIHS Lab, Shippegan, NB E8S 1P6, Canada

来源：

IEEE ACCESS | 2019年 / 7卷

关键词：

Modern standard Arabic; distinctive phonetic features; speech processing; deep belief networks; restricted Boltzmann machine;

D O I：

10.1109/ACCESS.2019.2924014

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Feature extraction is a critical stage of digital speech processing systems. Quality of features is of great importance to provide a solid foundation upon which the subsequent stages stand. Distinctive phonetic features (DPFs) are one of the most representative features of the speech signals. The significance of DPFs is in their ability to provide abstract description of the places and manners of articulation of the language phonemes. A phoneme's DPF element reflects unique articulatory information about that phoneme. Therefore, there is a need to discover and investigate each DPF element individually in order to achieve a deeper understanding and to come up with a descriptive model for each one. Such fine-grained modeling will satisfy the uniqueness of each DPF element. In this paper, the problem of DPF modeling and extraction of modern standard Arabic is tackled. Due to the remarkable success of deep neural networks (DNNs) that are initialized using deep belief networks (DBNs) in serving DSP applications and its capability of extracting highly representative features from the raw data, we exploit its modeling power to investigate and model the DPF elements. DNN models are compared with the classical multilayer perceptron (MLP) models. The representativeness of several acoustic cues for different DPF elements was also measured. This paper is based on formalizing DPF modeling problem as a binary classification problem. Because the DPF elements are highly imbalanced data, evaluating the quality of models is a very tricky process. This paper addresses the proper evaluation measures satisfying the imbalanced nature of the DPF elements. After modeling each element individually, the two top-level DPF extractors are designed: MLP- and DNN-based extractors. The results show the quality of DNN models and their superiority over MLPs with accuracies of 89.0% and 86.7%, respectively.

引用

页码：81382 / 81396

页数：15

共 50 条

[41] Recognition of Arabic phonetic features using neural networks and knowledge-based system: a comparative study
Selouani, SA
Caelen, J
[J]. IEEE INTERNATIONAL JOINT SYMPOSIA ON INTELLIGENCE AND SYSTEMS - PROCEEDINGS, 1998, : 404 - 411
[42] Network traffic classification using deep convolutional recurrent autoencoder neural networks for spatial-temporal features extraction
D'Angelo, Gianni
Palmieri, Francesco
[J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2021, 173
[43] Modeling the Biocatalytic Method of Lipid Extraction Using Artificial Neural Networks
Shafrai, Anton V.
Prosekov, Alexander Yu.
Vechtomova, Elena A.
[J]. INFORMATION, 2023, 14 (08)
[44] A Canonicalization of Distinctive Phonetic Features to Improve Arabic Speech Recognition
Alotaibi, Yousef A.
Selouani, Sidh-Amed
Yakoub, Mohammed Sidi
Seddiq, Yasser Mohammed
Meftah, Ali
[J]. ACTA ACUSTICA UNITED WITH ACUSTICA, 2019, 105 (06) : 1269 - 1277
[45] Extraction of the Sivers function with deep neural networks
Fernando, I. P.
Keller, D.
[J]. PHYSICAL REVIEW D, 2023, 108 (05)
[46] Deep neural networks for climate relation extraction
Zheng, J.
Wang, J.
Chen, S.
Li, J.
Chen, Y.
Li, B.
[J]. GLOBAL NEST JOURNAL, 2021, 23 (04): : 544 - 549
[47] Deep neural networks for Arabic information extraction
Saadi, Abdelhalim
Belhadef, Hacene
[J]. SMART AND SUSTAINABLE BUILT ENVIRONMENT, 2020, 9 (04) : 467 - 482
[48] Discriminative Feature Extraction with Deep Neural Networks
Stuhlsatz, Andre
Lippel, Jens
Zielke, Thomas
[J]. 2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
[49] On Correlation of Features Extracted by Deep Neural Networks
Ayinde, Babajide O.
Inane, Tamer
Zurada, Jacek M.
[J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
[50] How transferable are features in deep neural networks?
Yosinski, Jason
Clune, Jeff
Bengio, Yoshua
Lipson, Hod
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27

← 1 2 3 4 5 →