Research on Automatic Identification of Chinese Minority Language Websites

Authors
Liu, Haifeng [1]
Yang, Yuanyuan [2]
Li, Jing [3]
Han, Zhiqiang [1]
Affiliations
[1] Minzu Univ China, Sch Informat & Engn, Beijing 100081, Peoples R China
[2] Minzu Univ China, Sch Minor Language & Literature, Beijing 100081, Peoples R China
[3] Minzu Univ China, Sch Kazak Language & Literature, Beijing 100081, Peoples R China
Keywords
minority language websites; identification; feature
DOI
Not available
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
This paper presents the features of Chinese minority language text collections on websites, analyses the problems of identifying web pages that contain Chinese minority language text, and proposes three automatic identification methods. Based on these methods, the authors design and implement software that identifies Chinese minority language text, including Mongolian, Tibetan, Uyghur, Kazak, Kirgiz, Yi script, Tai Lue script, Korean, Russian, and Zhuang script, among others.
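The abstract does not say what the three identification methods are. As a rough illustration of one common way such identification can be approached, the sketch below counts characters by Unicode block and reports the dominant script of a page's text. This is an assumption for illustration only, not the authors' method: the names `SCRIPT_RANGES`, `count_scripts`, `dominant_script`, and the `min_share` threshold are all hypothetical. Block-level detection also cannot separate languages that share a script (for example Uyghur, Kazak, and Kirgiz in Arabic script, or Russian among other Cyrillic-script languages), and it does not cover Latin-script Zhuang.

```python
# Hypothetical sketch, not the paper's method: detect the dominant script of a
# text by counting code points that fall inside selected Unicode blocks.
SCRIPT_RANGES = {
    "Tibetan": [(0x0F00, 0x0FFF)],
    "Mongolian": [(0x1800, 0x18AF)],
    "New Tai Lue": [(0x1980, 0x19DF)],
    "Yi": [(0xA000, 0xA48F), (0xA490, 0xA4CF)],      # Yi Syllables, Yi Radicals
    "Hangul": [(0x1100, 0x11FF), (0xAC00, 0xD7AF)],  # Hangul Jamo, Hangul Syllables
    "Cyrillic": [(0x0400, 0x04FF)],
    "Arabic": [(0x0600, 0x06FF)],
}


def count_scripts(text):
    """Return a dict mapping script name to the number of matching characters."""
    counts = {}
    for ch in text:
        cp = ord(ch)
        for name, ranges in SCRIPT_RANGES.items():
            if any(lo <= cp <= hi for lo, hi in ranges):
                counts[name] = counts.get(name, 0) + 1
                break
    return counts


def dominant_script(text, min_share=0.5):
    """Return the most frequent listed script if it covers at least
    min_share of the non-whitespace characters, otherwise None.
    The 0.5 default is an arbitrary illustrative threshold."""
    visible = sum(1 for ch in text if not ch.isspace())
    counts = count_scripts(text)
    if visible == 0 or not counts:
        return None
    name, n = max(counts.items(), key=lambda kv: kv[1])
    return name if n / visible >= min_share else None


if __name__ == "__main__":
    print(dominant_script("\u0f56\u0f7c\u0f51"))  # three Tibetan code points
    print(dominant_script("hello world"))         # no listed script present
```

In practice the text would first have to be extracted from the page's HTML and decoded correctly, and a second step would be needed to tell apart languages written in the same script.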
Pages: 836-840
Page count: 5