Structured Semantic Knowledge Can Emerge Automatically from Predicting Word Sequences in Child-Directed Speech

被引:35
|
作者
Huebner, Philip A. [1 ]
Willits, Jon A. [2 ]
机构
[1] Univ Calif Riverside, Interdept Neurosci Grad Program, Riverside, CA 92521 USA
[2] Univ Calif Riverside, Dept Psychol, Riverside, CA 92521 USA
来源
FRONTIERS IN PSYCHOLOGY | 2018年 / 9卷
关键词
semantic development; language learning; neural networks; statistical learning; REPRESENTATIONS; NETWORKS; MODEL; STATISTICS; LANGUAGE; MEMORY; CONNECTIONISM; COOCCURRENCE; RECOGNITION; ACQUISITION;
D O I
10.3389/fpsyg.2018.00133
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Previous research has suggested that distributional learning mechanisms may contribute to the acquisition of semantic knowledge. However, distributional learning mechanisms, statistical learning, and contemporary "deep learning" approaches have been criticized for being incapable of learning the kind of abstract and structured knowledge that many think is required for acquisition of semantic knowledge. In this paper, we show that recurrent neural networks, trained on noisy naturalistic speech to children, do in fact learn what appears to be abstract and structured knowledge. We trained two types of recurrent neural networks (Simple Recurrent Network, and Long Short-Term Memory) to predict word sequences in a 5-million-word corpus of speech directed to children ages 0-3 years old, and assessed what semantic knowledge they acquired. We found that learned internal representations are encoding various abstract grammatical and semantic features that are useful for predicting word sequences. Assessing the organization of semantic knowledge in terms of the similarity structure, we found evidence of emergent categorical and hierarchical structure in both models. We found that the Long Short-term Memory (LSTM) and SRN are both learning very similar kinds of representations, but the LSTM achieved higher levels of performance on a quantitative evaluation. We also trained a non-recurrent neural network, Skip-gram, on the same input to compare our results to the state-of-the-art in machine learning. We found that Skip-gram achieves relatively similar performance to the LSTM, but is representing words more in terms of thematic compared to taxonomic relations, and we provide reasons why this might be the case. Our findings show that a learning system that derives abstract, distributed representations for the purpose of predicting sequential dependencies in naturalistic language may provide insight into emergence of many properties of the developing semantic system.
引用
收藏
页数:18
相关论文
共 38 条
  • [11] THE AVAILABILITY OF CUES FOR WORD SEGMENTATION AND VOCABULARY ACQUISITION IN CATALAN CHILD-DIRECTED SPEECH
    Feijoo, Sara
    Hilferty, Joseph
    [J]. RLA-REVISTA DE LINGUISTICA TEORICA Y APLICADA, 2013, 51 (02): : 13 - 27
  • [12] Diminutives in child-directed speech supplement metric with distributional word segmentation cues
    Vera Kempe
    Patricia J. Brooks
    Steven Gillis
    [J]. Psychonomic Bulletin & Review, 2005, 12 : 145 - 151
  • [13] Diminutives in child-directed speech supplement metric with distributional word segmentation cues
    Kempe, V
    Brooks, PJ
    Gillis, S
    [J]. PSYCHONOMIC BULLETIN & REVIEW, 2005, 12 (01) : 145 - 151
  • [14] Child-directed speech: relation to socioeconomic status, knowledge of child development and child vocabulary skill
    Rowe, Meredith L.
    [J]. JOURNAL OF CHILD LANGUAGE, 2008, 35 (01) : 185 - 205
  • [15] Additive Effects of Lengthening on the Utterance-Final Word in Child-Directed Speech
    Ko, Eon-Suk
    Soderstrom, Melanie
    [J]. JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2013, 56 (01): : 364 - 371
  • [16] Cross-linguistically consistent semantic and syntactic annotation of child-directed speech
    Szubert, Ida
    Abend, Omri
    Schneider, Nathan
    Gibbon, Samuel
    Mahon, Louis
    Goldwater, Sharon
    Steedman, Mark
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2024,
  • [17] Adult listeners can extract age-related cues from child-directed speech
    Bozkurt, Ceren
    Soley, Gaye
    [J]. QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 2022, 75 (12): : 2244 - 2255
  • [18] Children's Production of Unfamiliar Word Sequences Is Predicted by Positional Variability and Latent Classes in a Large Sample of Child-Directed Speech
    Matthews, Danielle
    Bannard, Colin
    [J]. COGNITIVE SCIENCE, 2010, 34 (03) : 465 - 488
  • [19] Testing the Hyperarticulation and Prosodic Hypotheses of Child-Directed Speech: Insights From the Perceptual and Acoustic Characteristics of Child-Directed Cantonese Tones
    Wong, Puisan
    Ng, Kelly Wing Sum
    [J]. JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2018, 61 (08): : 1907 - 1925
  • [20] A computational model of learning semantic roles from child-directed language
    Alishahi, Afra
    Stevenson, Suzanne
    [J]. LANGUAGE AND COGNITIVE PROCESSES, 2010, 25 (01): : 50 - 93