Deep Neural Networks for Shimmer Approximation in Synthesized Audio Signal

被引:0
|
作者
Alejandro Garcia, Mario [1 ]
Atilio Destefanis, Eduardo [1 ]
机构
[1] Univ Tecnol Nacl, Cordoba, Argentina
来源
关键词
Shimmer; Voice quality; Deep learning; Deep neural network; Convolutional neural network; SPEECH; JITTER; CLASSIFICATION; GENDER;
D O I
10.1007/978-3-319-75214-3_1
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Shimmer is a classical acoustic measure of the amplitude perturbation in a signal. This kind of variation in the human voice allows to characterize some properties, not only of the voice itself, but of the person who speaks. During the last years deep learning techniques have become the state of the art for recognition tasks on the voice. In this work the relationship between shimmer and deep neural networks is analyzed. A deep learning model is created. It is able to approximate shimmer value of a simple synthesized audio signal (stationary and without formants) taking the spectrogram as input feature. It is concluded firstly, that for this kind of synthesized signal, a neural network like the one we proposed can approximate shimmer, and secondly, that the convolution layers can be designed in order to preserve the information of shimmer and transmit it to the following layers.
引用
收藏
页码:3 / 12
页数:10
相关论文
共 50 条
  • [1] Audio signal processing by neural networks
    Uncini, A
    [J]. NEUROCOMPUTING, 2003, 55 (3-4) : 593 - 625
  • [2] A COMPARISON OF AUDIO SIGNAL PREPROCESSING METHODS FOR DEEP NEURAL NETWORKS ON MUSIC TAGGING
    Choi, Keunwoo
    Fazekas, Gyorgy
    Sandler, Mark
    Cho, Kyunghyun
    [J]. 2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 1870 - 1874
  • [3] Information Redundancy in Constructing Systems for Audio Signal Examination on Deep Learning Neural Networks
    Solovyov, V., I
    Rybalskiy, O., V
    Zhuravel, V. V.
    Shablya, A. N.
    Tymko, E., V
    [J]. CYBERNETICS AND SYSTEMS ANALYSIS, 2022, 58 (01) : 8 - 15
  • [4] Information Redundancy in Constructing Systems for Audio Signal Examination on Deep Learning Neural Networks
    V. I. Solovyov
    O. V. Rybalskiy
    V. V. Zhuravel
    A. N. Shablya
    E. V. Tymko
    [J]. Cybernetics and Systems Analysis, 2022, 58 : 8 - 15
  • [5] DEEP NEURAL NETWORKS FOR AUDIO SCENE RECOGNITION
    Petetin, Yohan
    Laroche, Cyrille
    Mayoue, Aurelien
    [J]. 2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 125 - 129
  • [6] Performance of Deep Neural Networks in Audio Surveillance
    Arslan, Yuksel
    Canbolat, Huseyin
    [J]. 2018 6TH INTERNATIONAL CONFERENCE ON CONTROL ENGINEERING & INFORMATION TECHNOLOGY (CEIT), 2018,
  • [7] Approximation Spaces of Deep Neural Networks
    Gribonval, Remi
    Kutyniok, Gitta
    Nielsen, Morten
    Voigtlaender, Felix
    [J]. CONSTRUCTIVE APPROXIMATION, 2022, 55 (01) : 259 - 367
  • [8] Approximation Spaces of Deep Neural Networks
    Rémi Gribonval
    Gitta Kutyniok
    Morten Nielsen
    Felix Voigtlaender
    [J]. Constructive Approximation, 2022, 55 : 259 - 367
  • [9] Deep Residual Neural Networks for Audio Spoofing Detection
    Alzantot, Mousulfa
    Wang, Ziqi
    Srivastava, Mani B.
    [J]. INTERSPEECH 2019, 2019, : 1078 - 1082
  • [10] AUDIO CONCEPT CLASSIFICATION WITH HIERARCHICAL DEEP NEURAL NETWORKS
    Ravanelli, Mirco
    Elizalde, Benjamin
    Ni, Karl
    Friedland, Gerald
    [J]. 2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 606 - 610