A Method of Building Chinese Field Association Knowledge from Wikipedia

被引:0
|
作者
Wang, Li [1 ]
Yata, Susumu [1 ]
Atlam, El-sayed [1 ]
Fuketa, Masao [1 ]
Morita, Kazuhiro [1 ]
Bando, Hiroaki [1 ]
Aoe, Jun-ichi [1 ]
机构
[1] Univ Tokushima, Fac Engn, Dept Informat Sci & Intelligent Syst, Tokushima 7708506, Japan
关键词
Field association terms; Feature fields; Wikipedia; Chinese documents; Field recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Field Association (FA) terms form a limited set of discriminating terms that give us the knowledge to identify document fields. The primary goal of this research is to make a system that can imitate the process whereby humans recognize the fields by looking at a few Chinese FA terms in a document. This paper proposes a new approach to build a Chinese FA terms dictionary automatically from Wikipedia. 104,532 FA terms are added in the dictionary. The resulting FA terms by using this dictionary are applied to recognize the fields of 5,841 documents. The average accuracy in the experiment is 92.04%. The results show that the presented method is effective in building FA terms from Wikipedia automatically.
引用
收藏
页码:568 / 572
页数:5
相关论文
共 50 条
  • [1] Building Chinese field association knowledge base from Wikipedia
    Wang, Li
    Yao, Min
    Zhang, Yuanpeng
    Qian, Danmin
    Geng, Xinyun
    Jiang, Kui
    Dong, Jiancheng
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2015, 52 (2-3) : 168 - 176
  • [2] Building Terrorist Knowledge Graph from Global Terrorism Database and Wikipedia
    Xia, Tian
    Gu, Yijun
    2019 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS (ISI), 2019, : 194 - 196
  • [3] Building a Text Classifier by a Keyword and Wikipedia Knowledge
    Qiu, Qiang
    Zhang, Yang
    Zhu, Junping
    Qu, Wei
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2009, 5678 : 277 - 287
  • [4] From Callimachus to the Wikipedia: an ancient method for the representation of knowledge in the WWW era
    Boda, Istvan Karoly
    Toth, Erzsebet
    2018 9TH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2018, : 205 - 210
  • [5] Exploiting Wikipedia Priori Knowledge for Chinese Named Entity Recognition
    Li, Jianfeng
    Zhu, Conghui
    Li, Sheng
    Zhao, Tiejun
    Zheng, Dequan
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1548 - 1552
  • [6] Extracting Geographic Knowledge from Wikipedia
    Benhaddouche, Djamila
    Tekkouk, Mohamed
    Youcef, Abdelghani Chernnouf
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND NEW TECHNOLOGIES (ICSENT '18), 2018,
  • [7] Automatically building large-scale named entity recognition corpora from Chinese Wikipedia
    Zhou, Jie
    Li, Bi-cheng
    Chen, Gang
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2015, 16 (11) : 940 - 956
  • [8] Automatically building large-scale named entity recognition corpora from Chinese Wikipedia
    Jie Zhou
    Bi-cheng Li
    Gang Chen
    Frontiers of Information Technology & Electronic Engineering, 2015, 16 : 940 - 956
  • [9] SoftKG: Building A Software Development Knowledge Graph through Wikipedia Taxonomy
    Wang, Jihu
    Shi, Xueliang
    Cheng, Lin
    Zhang, Kun
    Shi, Yuliang
    2020 IEEE WORLD CONGRESS ON SERVICES (SERVICES), 2020, : 151 - 156
  • [10] A domain knowledge graph construction method based on Wikipedia
    Yu, Haoze
    Li, Haisheng
    Mao, Dianhui
    Cai, Qiang
    JOURNAL OF INFORMATION SCIENCE, 2021, 47 (06) : 783 - 793