IS DESCRIBING LANGUAGE MERE BUTTERFLY COLLECTION? ON EPISTEMOLOGY, STATISTICAL LANGUAGE MODELS, AND CORPUS

Cited by: 0
Authors
de Uzeda-Garrao, Milena [1]
Affiliations
[1] Univ Fed Rural Rio de Janeiro, Seropedica, RJ, Brazil
Keywords
Language Philosophy; Statistical Models; Corpus Linguistics
DOI
Not available
Chinese Library Classification number
G40 [Education]
Discipline classification code
040101; 120403
Abstract
This work focuses on the philosophical tension between conventionalists and rationalists, present in the history of science from the pre-Socratic period to contemporary Linguistics. It also takes into account the relevance of statistical models, rather than considering the language phenomenon as an elegant representation of the expression of thought. For that purpose, we take some conventionalists' accounts of language throughout the history of philosophy, and we also analyze computer scientist Peter Norvig's recent essay on whether language science should really take the path that linguist Noam Chomsky considers the only one to a scientific approach. In this work, we therefore attempt to take empirical facts, and hence pure description, to portray the linguistic phenomenon as evidence of conventional use rather than of a rational, creative human impetus. Finally, we also claim that Corpus Linguistics grounds Norvig's arguments on language science.
Pages: 10900-10903
Page count: 4