SPEAKER-INDEPENDENT DETECTION OF CHILD-DIRECTED SPEECH

被引:0
|
作者
Schuster, Sebastian [1 ]
Pancoast, Stephanie [2 ]
Ganjoo, Milind [1 ]
Frank, Michael C. [3 ]
Jurafsky, Dan [1 ,4 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Elect Engn, Stanford, CA 94305 USA
[3] Stanford Univ, Dept Psychol, Stanford, CA USA
[4] Stanford Univ, Dept Linguist, Stanford, CA 94305 USA
关键词
Speech Analysis; Child-directed Speech; Language Development; Prosody; LANGUAGE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Identifying the distinct register that adults use when speaking to children is an important task for child development research. We present a fully automatic, speaker-independent system that detects child-directed speech. The two-stage system uses diarization-style voice activation techniques to extract speech segments followed by a supervised nu-SVM classifier trained on 1582 prosodic and log Mel energy features. The system significantly improves the state of the art, detecting child-directed speech with F1 of .66 (exact boundary) and .83 (within 1 second). A feature analysis confirms the importance of F0 features (especially 3rd quartile and range) as well as new features like the variance, kurtosis, and min of log Mel energy within a frequency band.
引用
收藏
页码:366 / 371
页数:6
相关论文
共 50 条
  • [1] Child-directed Speech
    Miller, Simone
    [J]. SPRACHE-STIMME-GEHOR, 2013, 37 (03): : 117 - 117
  • [2] Child-directed speech
    Meyer, S.
    Jungheim, M.
    Ptok, M.
    [J]. HNO, 2011, 59 (11) : 1129 - 1134
  • [3] The quality of child-directed speech depends on the speaker's language proficiency
    Hoff, Erika
    Core, Cynthia
    Shanks, Katherine F.
    [J]. JOURNAL OF CHILD LANGUAGE, 2020, 47 (01) : 132 - 145
  • [4] Honorifics in child-directed speech
    Hildebrandt, Gwendolyn
    [J]. ASIA-PACIFIC LANGUAGE VARIATION, 2024, 10 (01) : 1 - 39
  • [5] SPEAKER-INDEPENDENT CONTINUOUS SPEECH DICTATION
    GAUVAIN, JL
    LAMEL, LF
    ADDA, G
    ADDADECKER, M
    [J]. SPEECH COMMUNICATION, 1994, 15 (1-2) : 21 - 37
  • [6] The study on continuous speech of speaker-independent
    Ye Hong
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2006, 15 (4A) : 921 - 924
  • [7] SPEAKER-INDEPENDENT CLASSIFICATION OF PHONETIC SEGMENTS FROM RAW ULTRASOUND IN CHILD SPEECH
    Ribeiro, Manuel Sam
    Eshky, Aciel
    Richmond, Korin
    Renals, Steve
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1328 - 1332
  • [8] SPEAKER-INDEPENDENT BRAIN ENHANCED SPEECH DENOISING
    Hosseini, Maryam
    Celotti, Luca
    Plourde, Eric
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 1310 - 1314
  • [9] SPEAKER-INDEPENDENT VOWEL RECOGNITION IN PERSIAN SPEECH
    Nazari, Mohammad
    Sayadiyan, Abolghasem
    Valiollahzadeh, Seyyed Majid
    [J]. 2008 3RD INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES: FROM THEORY TO APPLICATIONS, VOLS 1-5, 2008, : 672 - 676
  • [10] Japanese Speaker-Independent Homonyms Speech Recognition
    Murakami, Jin'ichi
    Hotta, Haseo
    [J]. COMPUTATIONAL LINGUISTICS AND RELATED FIELDS, 2011, 27 : 306 - 313