Evaluating the Irregularity of Natural Languages

被引：5

作者：

Hernandez-Gomez, Candelario ^{[1
]}

Basurto-Flores, Rogelio ^{[2
]}

Obregon-Quintana, Bibiana ^{[3
]}

Guzman-Vargas, Lev ^{[2
]}

机构：

[1] Inst Politecn Nacl, Dept Fis, Escuela Super Fis & Math, Ciudad De Mexico 07738, Mexico

[2] Inst Politecn Nacl, Unidad Interdisciplinaria Ingn & Tecnol Avanzadas, Ciudad De Mexico 07340, Mexico

[3] Univ Nacl Autonoma Mexico, Fac Ciencias, Ciudad Univ, Ciudad De Mexico 04510, Mexico

来源：

ENTROPY | 2017年 / 19卷 / 10期

关键词：

approximate entropy of texts; sample entropy; text irregularity; symbol sequences; LONG-RANGE CORRELATIONS; MULTISCALE ENTROPY ANALYSIS; APPROXIMATE ENTROPY; COMPLEXITY;

D O I：

10.3390/e19100521

中图分类号：

O4 [物理学];

学科分类号：

0702 ;

摘要：

In the present work, we quantify the irregularity of different European languages belonging to four linguistic families (Romance, Germanic, Uralic and Slavic) and an artificial language (Esperanto). We modified a well-known method to calculate the approximate and sample entropy of written texts. We find differences in the degree of irregularity between the families and our method, which is based on the search of regularities in a sequence of symbols, and consistently distinguishes between natural and synthetic randomized texts. Moreover, we extended our study to the case where multiple scales are accounted for, such as the multiscale entropy analysis. Our results revealed that real texts have non-trivial structure compared to the ones obtained from randomization procedures.

引用

页数：10

共 50 条

[1] EVALUATING DATABASE LANGUAGES
STAMEN, J
COSTELLO, W
[J]. DATAMATION, 1981, 27 (05): : 116 - &
[2] PROGRAMMING LANGUAGES, NATURAL LANGUAGES, AND MATHEMATICS
NAUR, P
[J]. COMMUNICATIONS OF THE ACM, 1975, 18 (12) : 676 - 682
[3] Natural languages versus artificial languages
Rujillo Arreno, Ramon
Ortilla Urand, Luisa
[J]. LETRAS, 2013, 84 (119): : 173 - 182
[4] Evaluating the usability of natural language query languages and interfaces to Semantic Web knowledge bases
Kaufmann, Esther
Bernstein, Abraham
[J]. JOURNAL OF WEB SEMANTICS, 2010, 8 (04): : 377 - 393
[5] Natural head position and lower incisor irregularity
Uysal, Tancan
Yagci, Ahmet
Ekizer, Abdullah
Usumez, Serdar
[J]. JOURNAL OF OROFACIAL ORTHOPEDICS-FORTSCHRITTE DER KIEFERORTHOPADIE, 2016, 77 (02): : 112 - 118
[6] ON EVALUATING INTERACTIVE QUERY LANGUAGES
LOCHOVSKY, FH
TSICHRITZIS, DC
[J]. INFORMATION SCIENCES, 1983, 29 (2-3) : 93 - 113
[7] Evaluating nonwoven fabric irregularity on the basis of Linnik functionals
Cherkassky, A
[J]. TEXTILE RESEARCH JOURNAL, 1999, 69 (10) : 701 - 708
[8] NATURAL LANGUAGES AND CONTEXT-FREE LANGUAGES
PULLUM, GK
GAZDAR, G
[J]. LINGUISTICS AND PHILOSOPHY, 1982, 4 (04) : 471 - 504
[9] THE INTERTRANSLATABILITY OF NATURAL LANGUAGES
MALPAS, JE
[J]. SYNTHESE, 1989, 78 (03) : 233 - 264
[10] Comprehension of Natural Languages
姚文明
[J]. 西南民族大学学报(自然科学版), 1999, (01) : 102 - 104+107

← 1 2 3 4 5 →