Cross-Lingual Knowledge Validation Based Taxonomy Derivation from Heterogeneous Online Wikis

Citations: 0
Authors
Wang, Zhigang [1]
Li, Juanzi [1]
Li, Shuangjie [1]
Li, Mingyang [1]
Tang, Jie [1]
Zhang, Kuo [2]
Zhang, Kun [2]
Affiliations
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
[2] Sogou Incorp, Beijing, Peoples R China
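Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract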
Creating knowledge bases from crowd-sourced wikis such as Wikipedia has attracted significant research interest in the field of the intelligent Web. However, the derived taxonomies usually contain many mistakenly imported taxonomic relations, because user-generated subsumption relations differ from semantic taxonomic relations. Current approaches to this problem still suffer from the following issues: (i) heuristic-based methods rely strongly on language-specific rules; (ii) corpus-based methods depend on a large-scale, high-quality corpus, which is often unavailable. In this paper, we formulate the cross-lingual taxonomy derivation problem as the problem of cross-lingual taxonomic relation prediction. We investigate different linguistic heuristics and language-independent features, and propose a cross-lingual knowledge validation based dynamic adaptive boosting model to iteratively reinforce the performance of taxonomic relation prediction. The proposed approach successfully overcomes the above issues, and experiments show that it significantly outperforms the designed state-of-the-art comparison methods.
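As a rough illustration of what treating taxonomic relation prediction as a boosted binary classification task can look like, the sketch below runs plain AdaBoost with decision stumps over candidate relations. It is a generic textbook procedure, not the paper's dynamic adaptive boosting model, and it omits the cross-lingual validation step entirely. The two features (`head_word_match`, `cross_lingual_agreement`), the toy data, and all function names are hypothetical and chosen only for this example.

```python
import math

def train_stump(X, y, w):
    """Return the (error, feature, threshold, polarity) stump with the lowest weighted error."""
    best = None
    for j in range(len(X[0])):
        for t in sorted({row[j] for row in X}):
            for polarity in (1, -1):
                preds = [polarity if row[j] >= t else -polarity for row in X]
                err = sum(wi for wi, p, yi in zip(w, preds, y) if p != yi)
                if best is None or err < best[0]:
                    best = (err, j, t, polarity)
    return best

def stump_predict(stump, row):
    _, j, t, polarity = stump
    return polarity if row[j] >= t else -polarity

def adaboost(X, y, rounds=3):
    """Plain AdaBoost: reweight the examples after each weak learner."""
    n = len(X)
    w = [1.0 / n] * n
    ensemble = []
    for _ in range(rounds):
        stump = train_stump(X, y, w)
        # Clamp the error so the log below stays finite for a perfect stump.
        err = min(max(stump[0], 1e-10), 1 - 1e-10)
        alpha = 0.5 * math.log((1 - err) / err)
        ensemble.append((alpha, stump))
        # Misclassified examples gain weight, correct ones lose weight.
        w = [wi * math.exp(-alpha * yi * stump_predict(stump, row))
             for wi, row, yi in zip(w, X, y)]
        total = sum(w)
        w = [wi / total for wi in w]
    return ensemble

def predict(ensemble, row):
    score = sum(alpha * stump_predict(s, row) for alpha, s in ensemble)
    return 1 if score >= 0 else -1

# Hypothetical features per candidate relation:
# [head_word_match, cross_lingual_agreement]; label +1 = taxonomic, -1 = not.
X = [[1.0, 0.9], [0.0, 0.8], [1.0, 0.7], [0.0, 0.2], [1.0, 0.1], [0.0, 0.3]]
y = [1, 1, 1, -1, -1, -1]

model = adaboost(X, y)
print([predict(model, row) for row in X])
```

In this toy data the labels follow the second feature only, so a single stump already separates the classes; real candidate relations would need many more features and rounds.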
Pages: 180-186
Page count: 7