Yet another acoustic representation of speech sounds

被引:0
|
作者
Minematsu, N [1 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Bunkyo Ku, Tokyo 1130033, Japan
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper proposes yet another representation of speech sounds. The proposed speech modeling can remove both multiplicative and linear transformational distortion from speech theoretically. It means that speech sounds are represented without being affected by any static distortion inevitably involved in production, encoding, transmission, decoding, and hearing processes, such as differences in vocal tract length, gender, age, microphone, room, line, auditory characteristics, and so on. The method acoustically models not individual phones but their entire system, where only acoustic interrelation embedded in all the kinds of phones is focused. Since the method provides us with no absolute acoustic properties of phones, it cannot recognize or synthesize even a single phone. On the contrary, the proposed method is shown to be able to be applied to pronunciation assessment effectively and reliably, where the proficiency of pronunciation is estimated without using acoustic models of the individual phones directly in the matching.
引用
收藏
页码:585 / 588
页数:4
相关论文
共 50 条
  • [1] REPRESENTATION OF SPEECH SOUNDS IN PRECATEGORICAL ACOUSTIC STORAGE
    CROWDER, RG
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1973, 98 (01): : 14 - 24
  • [2] PSYCHOLOGICAL REPRESENTATION OF SPEECH SOUNDS
    LACKNER, JR
    GOLDSTEIN, LM
    QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1975, 27 (MAY): : 173 - 185
  • [3] Speech sounds and their representation for diagnosis
    Ptok, M.
    HNO, 2009, 57 (10) : 1057 - 1063
  • [4] REPRESENTATION OF SPEECH SOUNDS BY YOUNG INFANTS
    JUSCZYK, PW
    DERRAH, C
    DEVELOPMENTAL PSYCHOLOGY, 1987, 23 (05) : 648 - 654
  • [5] Neural representation of amplified speech sounds
    Tremblay, KL
    Billings, CJ
    Friesen, LM
    Souza, PE
    EAR AND HEARING, 2006, 27 (02): : 93 - 103
  • [6] THE INTERVALGRAM AS A VISUAL REPRESENTATION OF SPEECH SOUNDS
    CHANG, SH
    PIHL, GE
    WIREN, J
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1951, 23 (06): : 674 - 679
  • [7] ACOUSTIC SPECTROGRAPHIC ANALYSIS OF SPEECH SOUNDS
    ARONSON, AE
    POSTGRADUATE MEDICINE, 1966, 40 (01) : 113 - &
  • [8] YET ANOTHER ROMANIAN READ SPEECH CORPUS
    Pistol, Laura
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE 'LINQUISTIC RESOURCES AND TOOLS FOR PROCESSING THE ROMANIAN LANGUAGE', 2014, 2014, : 95 - 102
  • [9] YET ANOTHER REPRESENTATION OF MOLECULAR-STRUCTURE
    DIETZ, A
    JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1995, 35 (05): : 787 - 802
  • [10] The mental representation of sounds in speech sound disorders
    Soujanya Pathi
    Prakash Mondal
    Humanities and Social Sciences Communications, 8