Fine-grained statistical structure of speech

被引:0
|
作者
Deloche, Francois [1 ]
机构
[1] CNRS, EHESS, Ctr Anal & Math Sociales, Paris, France
来源
PLOS ONE | 2020年 / 15卷 / 03期
关键词
AUDITORY-NERVE FIBERS; VOCAL-TRACT; FREQUENCIES; BANDWIDTHS; RESONANCES; RESPONSES; CODE;
D O I
10.1371/journal.pone.0230233
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In spite of its acoustic diversity, the speech signal presents statistical regularities that can be exploited by biological or artificial systems for efficient coding. Independent Component Analysis (ICA) revealed that on small time scales (similar to 10 ms), the overall structure of speech is well captured by a time-frequency representation whose frequency selectivity follows the same power law in the high frequency range 1-8 kHz as cochlear frequency selectivity in mammals. Variations in the power-law exponent, i.e. different time-frequency trade-offs, have been shown to provide additional adaptation to phonetic categories. Here, we adopt a parametric approach to investigate the variations of the exponent at a finer level of speech. The estimation procedure is based on a measure that reflects the sparsity of decompositions in a set of Gabor dictionaries whose atoms are Gaussian-modulated sinusoids. We examine the variations of the exponent associated with the best decomposition, first at the level of phonemes, then at an intra-phonemic level. We show that this analysis offers a rich interpretation of the fine-grained statistical structure of speech, and that the exponent values can be related to key acoustic properties. Two main results are: i) for plosives, the exponent is lowered by the release bursts, concealing higher values during the opening phases; ii) for vowels, the exponent is bound to formant bandwidths and decreases with the degree of acoustic radiation at the lips. This work further suggests that an efficient coding strategy is to reduce frequency selectivity with sound intensity level, congruent with the nonlinear behavior of cochlear filtering.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Fine-Grained Grounding for Multimodal Speech Recognition
    Srinivasan, Tejas
    Sanabria, Ramon
    Metze, Florian
    Elliott, Desmond
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 2667 - 2677
  • [2] Improving Speech Enhancement through Fine-Grained Speech Characteristics
    Yang, Muqiao
    Konan, Joseph
    Bick, David
    Kumar, Anurag
    Watanabe, Shinji
    Raj, Bhiksha
    [J]. INTERSPEECH 2022, 2022, : 2953 - 2957
  • [3] Fine-Grained Crowdsourcing for Fine-Grained Recognition
    Jia Deng
    Krause, Jonathan
    Li Fei-Fei
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 580 - 587
  • [4] Fine-grained Noise Control for Multispeaker Speech Synthesis
    Nikitaras, Karolos
    Vamvoukakis, Georgios
    Ellinas, Nikolaos
    Klapsas, Konstantinos
    Markopoulos, Konstantinos
    Raptis, Spyros
    Sung, June Sig
    Jho, Gunu
    Chalamandaris, Aimilios
    Tsiakoulis, Pirros
    [J]. INTERSPEECH 2022, 2022, : 828 - 832
  • [5] Hierarchical CVAE for Fine-Grained Hate Speech Classification
    Qian, Jing
    ElSherief, Mai
    Belding, Elizabeth
    Wang, William Yang
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3550 - 3559
  • [6] Fine-grained and coarse-grained entropy in problems of statistical mechanics
    Kozlov, V. V.
    Treshchev, D. V.
    [J]. THEORETICAL AND MATHEMATICAL PHYSICS, 2007, 151 (01) : 539 - 555
  • [7] Fine-grained and coarse-grained entropy in problems of statistical mechanics
    V. V. Kozlov
    D. V. Treshchev
    [J]. Theoretical and Mathematical Physics, 2007, 151 : 539 - 555
  • [8] FINE-GRAINED COLOUR DISCRIMINATION WITHOUT FINE-GRAINED COLOUR
    Gert, Joshua
    [J]. AUSTRALASIAN JOURNAL OF PHILOSOPHY, 2015, 93 (03) : 602 - 605
  • [9] Microporous fine-grained copper: structure and properties
    Kumar, KS
    Duesbery, MS
    Louat, NP
    Provenzano, V
    DiPietro, MS
    [J]. PHILOSOPHICAL MAGAZINE A-PHYSICS OF CONDENSED MATTER STRUCTURE DEFECTS AND MECHANICAL PROPERTIES, 2001, 81 (05): : 1023 - 1040
  • [10] FINE-GRAINED STEREOTYPING AND THE STRUCTURE OF SOCIAL COGNITION
    LITMAN, GK
    POWELL, GE
    STEWART, RA
    [J]. JOURNAL OF SOCIAL PSYCHOLOGY, 1983, 120 (01): : 45 - 56