Building Chinese field association knowledge base from Wikipedia

被引:0
|
作者
Wang, Li [1 ]
Yao, Min [1 ]
Zhang, Yuanpeng [1 ]
Qian, Danmin [1 ]
Geng, Xinyun [1 ]
Jiang, Kui [1 ]
Dong, Jiancheng [1 ]
机构
[1] Nantong Univ, Dept Med Informat, Qi Xiu Rd 19, Nantong 226001, Peoples R China
基金
美国国家科学基金会;
关键词
field association term; Wikipedia; structured knowledge; topical field; text categorisation;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Field association (FA) terms are a limited set of discriminating terms that offer humans the knowledge to identify fields exiting in the document (text). Field association knowledge base is composed of FA terms and their potential hierarchical relationship of the fields they belong to. The main purpose of this research is building Chinese FA knowledge base. After this, the new knowledge base is tested through a system which can imitate the process whereby humans recognise the fields by looking at a few special terms. In doing so, a novel approach makes use of the structured knowledge in Chinese Wikipedia. A totally new Chinese FA knowledge base is built including 115,696 FA terms. The resulting FA knowledge from this knowledge base is applied to text categorisation. The average accuracies, 97.7% and 89%, are both higher than values obtained by SVM.
引用
收藏
页码:168 / 176
页数:9
相关论文
共 50 条
  • [1] A Method of Building Chinese Field Association Knowledge from Wikipedia
    Wang, Li
    Yata, Susumu
    Atlam, El-sayed
    Fuketa, Masao
    Morita, Kazuhiro
    Bando, Hiroaki
    Aoe, Jun-ichi
    IEEE NLP-KE 2009: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2009, : 568 - 572
  • [2] Construction of Encyclopedic Knowledge Base from Infobox of Indonesian Wikipedia
    Wahyudi
    Khodra, Masayu Leylia
    Wibisono, Yudi
    2018 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY SYSTEMS AND INNOVATION (ICITSI), 2018, : 542 - 546
  • [3] YAGO: A Multilingual Knowledge Base from Wikipedia, Wordnet, and Geonames
    Rebele, Thomas
    Suchanek, Fabian
    Hoffart, Johannes
    Biega, Joanna
    Kuzey, Erdal
    Weikum, Gerhard
    SEMANTIC WEB - ISWC 2016, PT II, 2016, 9982 : 177 - 185
  • [4] DETECTING SPATIAL PATTERNS OF NATURAL HAZARDS FROM THE WIKIPEDIA KNOWLEDGE BASE
    Fan, J.
    Stewart, K.
    ISPRS INTERNATIONAL WORKSHOP ON SPATIOTEMPORAL COMPUTING, 2015, : 87 - 93
  • [5] Populating ConceptNet knowledge base with Information Acquired from Japanese Wikipedia
    Krawczyk, Marek
    Rzepka, Rafal
    Araki, Kenji
    2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 2985 - 2989
  • [6] Building Terrorist Knowledge Graph from Global Terrorism Database and Wikipedia
    Xia, Tian
    Gu, Yijun
    2019 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS (ISI), 2019, : 194 - 196
  • [7] Building a Text Classifier by a Keyword and Wikipedia Knowledge
    Qiu, Qiang
    Zhang, Yang
    Zhu, Junping
    Qu, Wei
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2009, 5678 : 277 - 287
  • [8] YAGO2: A spatially and temporally enhanced knowledge base from Wikipedia
    Hoffart, Johannes
    Suchanek, Fabian M.
    Berberich, Klaus
    Weikum, Gerhard
    ARTIFICIAL INTELLIGENCE, 2013, 194 : 28 - 61
  • [9] DBpedia - A large-scale, multilingual knowledge base extracted from Wikipedia
    Lehmann, Jens
    Isele, Robert
    Jakob, Max
    Jentzsch, Anja
    Kontokostas, Dimitris
    Mendes, Pablo N.
    Hellmann, Sebastian
    Morsey, Mohamed
    van Kleef, Patrick
    Auer, Soeren
    Bizer, Christian
    SEMANTIC WEB, 2015, 6 (02) : 167 - 195
  • [10] Constructing Semantic Knowledge Base based on Wikipedia automation
    Niu, Wanpeng
    Chen, Junting
    Chen, Meilin
    PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON MATERIALS ENGINEERING AND INFORMATION TECHNOLOGY APPLICATIONS (MEITA 2016), 2017, 107 : 202 - 209