Estimating Linguistic Diversity on the Internet: A Taxonomy to Avoid Pitfalls and Paradoxes

被引:5
|
作者
Gerrand, Peter [1 ,2 ]
机构
[1] La Trobe Univ, Spanish Catalan Galician & Media Studies, Bundoora, Vic, Australia
[2] Univ Melbourne, Fac Engn, Melbourne, Vic 3010, Australia
来源
关键词
D O I
10.1111/j.1083-6101.2007.00374.x
中图分类号
G2 [信息与知识传播];
学科分类号
05 ; 0503 ;
摘要
Both UNESCO and OECD have recognized the public policy benefit of publicizing information on linguistic diversity on the Internet. However, the published methodologies for estimating "linguistic diversity" or "Internet statistics (by language)" do so with different interpretations of these key terms. This article creates a new taxonomy, defining and contrasting user activity, user profile, web presence, and diversity index to distinguish among the various indicators used to estimate language usage on the Internet. This taxonomy facilitates comparisons of the available methodologies, whose limitations are then critiqued. It also helps to resolve the apparent paradox as to whether the use of English on the Internet has declined rapidly or has remained fairly stable. The study concludes that the best estimates of web presence can be achieved by direct measurement: randomly addressing and analyzing a representative sample of all public websites. However, this approach will only suffice if the language detection software used is progressively extended to recognize all the world's written languages.
引用
收藏
页码:1298 / 1321
页数:24
相关论文
共 8 条
  • [1] Avoid internet pitfalls
    Headley, T
    [J]. CHEMICAL ENGINEERING PROGRESS, 2003, 99 (04) : 59 - 61
  • [2] Entropic Inference: some pitfalls and paradoxes we can avoid
    Caticha, Ariel
    [J]. BAYESIAN INFERENCE AND MAXIMUM ENTROPY METHODS IN SCIENCE AND ENGINEERING, 2013, 1553 : 200 - 211
  • [3] Global linguistic diversity for the Internet
    Anderson, D
    [J]. COMMUNICATIONS OF THE ACM, 2005, 48 (01) : 27 - 28
  • [4] How to avoid the top ten pitfalls in insect conservation and diversity research and minimise your chances of manuscript rejection
    Leather, Simon R.
    Basset, Yves
    Didham, Raphael K.
    [J]. INSECT CONSERVATION AND DIVERSITY, 2014, 7 (01) : 1 - 3
  • [5] Pitfalls of linear regression for estimating slopes over time and how to avoid them by using linear mixed-effects models
    Janmaat, Cynthia J.
    van Diepen, Merel
    Tsonaka, Roula
    Jager, Kitty J.
    Zoccali, Carmine
    Dekker, Friedo W.
    [J]. NEPHROLOGY DIALYSIS TRANSPLANTATION, 2019, 34 (04) : 561 - 566
  • [6] Linguistic diversity on the internet: Arabic, Chinese and Cyrillic script top-level domain names
    Baasanjav, Undrah B.
    [J]. TELECOMMUNICATIONS POLICY, 2014, 38 (11) : 961 - 969
  • [7] Toward multi-lingual information retrieval system based on internet linguistic diversity measurement
    Mohamed, Ebtsam
    Elmougy, Samir
    Aref, Mostafa
    [J]. AIN SHAMS ENGINEERING JOURNAL, 2019, 10 (03) : 489 - 497
  • [8] Estimating Diversity of Black Flies in the Simulium ignescens and Simulium tunja Complexes in Colombia: Chromosomal Rearrangements as the Core of Integrative Taxonomy
    Colorado-Garzon, Fredy A.
    Adler, Peter H.
    Garcia, Luis F.
    de Hoyos, Paulina Muuoz
    Bueno, Marta L.
    Matta, Nubia E.
    [J]. JOURNAL OF HEREDITY, 2017, 108 (01) : 12 - 24