Building Chinese field association knowledge base from Wikipedia

被引:0
|
作者
Wang, Li [1 ]
Yao, Min [1 ]
Zhang, Yuanpeng [1 ]
Qian, Danmin [1 ]
Geng, Xinyun [1 ]
Jiang, Kui [1 ]
Dong, Jiancheng [1 ]
机构
[1] Nantong Univ, Dept Med Informat, Qi Xiu Rd 19, Nantong 226001, Peoples R China
基金
美国国家科学基金会;
关键词
field association term; Wikipedia; structured knowledge; topical field; text categorisation;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Field association (FA) terms are a limited set of discriminating terms that offer humans the knowledge to identify fields exiting in the document (text). Field association knowledge base is composed of FA terms and their potential hierarchical relationship of the fields they belong to. The main purpose of this research is building Chinese FA knowledge base. After this, the new knowledge base is tested through a system which can imitate the process whereby humans recognise the fields by looking at a few special terms. In doing so, a novel approach makes use of the structured knowledge in Chinese Wikipedia. A totally new Chinese FA knowledge base is built including 115,696 FA terms. The resulting FA knowledge from this knowledge base is applied to text categorisation. The average accuracies, 97.7% and 89%, are both higher than values obtained by SVM.
引用
收藏
页码:168 / 176
页数:9
相关论文
共 50 条
  • [21] Building a knowledge base: an example
    Michael Gelfond
    Alfredo Gabaldon
    Annals of Mathematics and Artificial Intelligence, 1999, 25 : 165 - 199
  • [22] Exploiting Wikipedia Priori Knowledge for Chinese Named Entity Recognition
    Li, Jianfeng
    Zhu, Conghui
    Li, Sheng
    Zhao, Tiejun
    Zheng, Dequan
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1548 - 1552
  • [23] Extracting Geographic Knowledge from Wikipedia
    Benhaddouche, Djamila
    Tekkouk, Mohamed
    Youcef, Abdelghani Chernnouf
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND NEW TECHNOLOGIES (ICSENT '18), 2018,
  • [24] Mining an "Anti-Knowledge Base" from Wikipedia Updates with Applications to Fact Checking and Beyond
    Karagiannis, Georgios
    Trummer, Immanuel
    Jo, Saehan
    Khandelwal, Shubham
    Wang, Xuezhi
    Yu, Cong
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2019, 13 (04): : 561 - 573
  • [25] Automatically building large-scale named entity recognition corpora from Chinese Wikipedia
    Zhou, Jie
    Li, Bi-cheng
    Chen, Gang
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2015, 16 (11) : 940 - 956
  • [26] Automatically building large-scale named entity recognition corpora from Chinese Wikipedia
    Jie Zhou
    Bi-cheng Li
    Gang Chen
    Frontiers of Information Technology & Electronic Engineering, 2015, 16 : 940 - 956
  • [27] Building knowledge base from semi-structured data
    Liu, Xiao-Li
    Wu, Guo-Qing
    Yang, Min
    PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 839 - +
  • [28] Building a Knowledge Base on Genetic Variation and Cancer Risk Through Field Synopses
    Yesupriya, Ajay
    Gwinn, Marta
    Khoury, Muin J.
    JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2009, 101 (01): : 4 - 5
  • [29] SUITABLE DEMON FUNCTION FOR BUILDING KNOWLEDGE-BASE IN MECHANICAL DESIGN FIELD
    NAGASAKA, Y
    WIBISONO, H
    OHTAKI, H
    ISHIKAWA, Y
    JSME INTERNATIONAL JOURNAL SERIES C-DYNAMICS CONTROL ROBOTICS DESIGN AND MANUFACTURING, 1994, 37 (02): : 390 - 398
  • [30] Domain-specific knowledge base enrichment using Wikipedia tables
    Ran, Chenwei
    Shen, Wei
    Wang, Jianyong
    Zhu, Xuan
    2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2015, : 349 - 358