Research on Automatic identification of Chinese minority language websites

被引:0
|
作者
Liu, Haifeng [1 ]
Yang, Yuanyuan [2 ]
Li, Jing [3 ]
Han, Zhiqiang [1 ]
机构
[1] Minzu Univ China, Sch Informat & Engn, Beijing 100081, Peoples R China
[2] Minzu Univ China, Sch Minor Language & Literature, Beijing 100081, Peoples R China
[3] Minzu Univ China, Sch Kazak Language & Literature, Beijing 100081, Peoples R China
关键词
minority language websites; identification; feature;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper presents features of Chinese minority language text collection on websites, analyses the problems of webpage identification of Chinese minority language text, and proposes three automatic identification methods. Based on these methods, designs and realizes software to identify Chinese minority language text such as: Mongolian, Tibetan, Uyghur, Kazak, Kirgiz, Yi script Tai Lue script, Korean, Russian, Zhuang script and so on.
引用
收藏
页码:836 / 840
页数:5
相关论文
共 50 条
  • [41] Automatic Identification and Classification of Misogynistic Language on Twitter
    Anzovino, Maria
    Fersini, Elisabetta
    Rosso, Paolo
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2018), 2018, 10859 : 57 - 64
  • [42] Automatic Idiom Identification Model for Amharic Language
    Fenta, Anduamlak Abebe
    Gebeyehu, Seffi
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (08)
  • [43] Automatic gender identification optimised for language independence
    Slomka, S
    Sridharan, S
    [J]. IEEE TENCON'97 - IEEE REGIONAL 10 ANNUAL CONFERENCE, PROCEEDINGS, VOLS 1 AND 2: SPEECH AND IMAGE TECHNOLOGIES FOR COMPUTING AND TELECOMMUNICATIONS, 1997, : 145 - 148
  • [44] Automatic identification of Chinese maximal noun phrases
    Zhou, Qiang
    Sun, Maosong
    Huang, Changning
    [J]. Ruan Jian Xue Bao/Journal of Software, 2000, 11 (02): : 195 - 201
  • [45] Effective method on automatic identification of Chinese name
    [J]. 2000, Inst Sci Tech Infor China (10):
  • [46] Automatic Identification of Chinese Paired Discourse Connectives
    Costa, Nelson Filipe
    Cheng, Yushun
    Muermans, Thomas Chapados
    Hanel, Blaise
    Kosseim, Leila
    [J]. 2023 IEEE 17TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, ICSC, 2023, : 114 - 117
  • [47] A Method of Automatic Recognition of Attributive Clauses in Chinese Language
    Wang, Lei
    Qu, Weiguang
    Wang, Houfeng
    Yu, Shiwen
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2016, : 172 - 175
  • [48] Using MMI Criterion to Realize Language Identification of Minority Languages
    Cheng, Yang
    Yang, Jian
    Kui, Liping
    [J]. PROCEEDINGS OF THE 31ST CHINESE CONTROL CONFERENCE, 2012, : 3651 - 3655
  • [49] Research on the Role of an Automatic Resource Generation System to Promote Chinese as a Second Language Learners' Learning in Colleges
    Wang, Qi
    Yu, Shengquan
    Wang, Xiaofeng
    [J]. JOURNAL OF EDUCATIONAL COMPUTING RESEARCH, 2024, 62 (02) : 501 - 531
  • [50] Chinese minority script identification using wavelets: Transform and SVM
    Guo, Hai
    Zhao, Jing-Ying
    Jiang, Nan
    Shi, Lei
    [J]. ICIC Express Letters, 2010, 4 (03): : 653 - 658