Language Homogeneity in the Japanese Wikipedia

Cited by: 0
Authors
Skevik, Karl-Andre [1]
Affiliations
[1] Inferno Nettverk AS, Forskningspk, Gaustadalleen 21, NO-0349 Oslo, Norway
Source
PROCEEDINGS OF THE 24TH PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION | 2010
Keywords
Wikipedia; Japanese; NLP
DOI
Not available
Chinese Library Classification number
H0 [Linguistics]
Discipline classification codes
030303; 0501; 050102
Abstract
Wikipedia is a potentially very useful source of information, but intuitively it is difficult to have confidence in the quality of an encyclopedia that anyone can modify. One aspect of correctness is writing style, which we examine in a computer-based study of the full Japanese Wikipedia. This is possible because Japanese has clearly distinct writing styles, marked by, for example, different verb forms. We find that the writing style of the Japanese Wikipedia is largely consistent with the project's style guidelines. Exceptions appear to occur primarily in articles with few changes and few editors.
Pages: 527-534
Page count: 8