SPEAKER-INDEPENDENT DETECTION OF CHILD-DIRECTED SPEECH

被引：0

作者：

Schuster, Sebastian ^{[1
]}

Pancoast, Stephanie ^{[2
]}

Ganjoo, Milind ^{[1
]}

Frank, Michael C. ^{[3
]}

Jurafsky, Dan ^{[1
,4
]}

机构：

[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA

[2] Stanford Univ, Dept Elect Engn, Stanford, CA 94305 USA

[3] Stanford Univ, Dept Psychol, Stanford, CA USA

[4] Stanford Univ, Dept Linguist, Stanford, CA 94305 USA

来源：

2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014 | 2014年

关键词：

Speech Analysis; Child-directed Speech; Language Development; Prosody; LANGUAGE;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Identifying the distinct register that adults use when speaking to children is an important task for child development research. We present a fully automatic, speaker-independent system that detects child-directed speech. The two-stage system uses diarization-style voice activation techniques to extract speech segments followed by a supervised nu-SVM classifier trained on 1582 prosodic and log Mel energy features. The system significantly improves the state of the art, detecting child-directed speech with F1 of .66 (exact boundary) and .83 (within 1 second). A feature analysis confirms the importance of F0 features (especially 3rd quartile and range) as well as new features like the variance, kurtosis, and min of log Mel energy within a frequency band.

引用

页码：366 / 371

页数：6

共 50 条

[1] Child-directed Speech
Miller, Simone
[J]. SPRACHE-STIMME-GEHOR, 2013, 37 (03): : 117 - 117
[2] Child-directed speech
Meyer, S.
Jungheim, M.
Ptok, M.
[J]. HNO, 2011, 59 (11) : 1129 - 1134
[3] The quality of child-directed speech depends on the speaker's language proficiency
Hoff, Erika
Core, Cynthia
Shanks, Katherine F.
[J]. JOURNAL OF CHILD LANGUAGE, 2020, 47 (01) : 132 - 145
[4] Honorifics in child-directed speech
Hildebrandt, Gwendolyn
[J]. ASIA-PACIFIC LANGUAGE VARIATION, 2024, 10 (01) : 1 - 39
[5] SPEAKER-INDEPENDENT CONTINUOUS SPEECH DICTATION
GAUVAIN, JL
LAMEL, LF
ADDA, G
ADDADECKER, M
[J]. SPEECH COMMUNICATION, 1994, 15 (1-2) : 21 - 37
[6] The study on continuous speech of speaker-independent
Ye Hong
[J]. CHINESE JOURNAL OF ELECTRONICS, 2006, 15 (4A) : 921 - 924
[7] SPEAKER-INDEPENDENT CLASSIFICATION OF PHONETIC SEGMENTS FROM RAW ULTRASOUND IN CHILD SPEECH
Ribeiro, Manuel Sam
Eshky, Aciel
Richmond, Korin
Renals, Steve
[J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1328 - 1332
[8] SPEAKER-INDEPENDENT BRAIN ENHANCED SPEECH DENOISING
Hosseini, Maryam
Celotti, Luca
Plourde, Eric
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 1310 - 1314
[9] SPEAKER-INDEPENDENT VOWEL RECOGNITION IN PERSIAN SPEECH
Nazari, Mohammad
Sayadiyan, Abolghasem
Valiollahzadeh, Seyyed Majid
[J]. 2008 3RD INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES: FROM THEORY TO APPLICATIONS, VOLS 1-5, 2008, : 672 - 676
[10] Japanese Speaker-Independent Homonyms Speech Recognition
Murakami, Jin'ichi
Hotta, Haseo
[J]. COMPUTATIONAL LINGUISTICS AND RELATED FIELDS, 2011, 27 : 306 - 313

← 1 2 3 4 5 →