Metaknowledge Extraction Based on Multi-Modal Documents

被引:3
|
作者
Liu, Shu-Kan [1 ,2 ]
Xu, Rui-Lin [2 ]
Geng, Bo-Ying [2 ]
Sun, Qiao [2 ,3 ]
Duan, Li [2 ]
Liu, Yi-Ming [2 ]
机构
[1] Southeast Univ, Sch Comp Sci & Engn, Nanjing 211189, Peoples R China
[2] PLA Naval Univ Engn, Sch Elect Engn, Wuhan 430033, Peoples R China
[3] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310058, Peoples R China
关键词
Task analysis; Optical character recognition software; Layout; Object detection; Semantics; Knowledge based systems; Computational modeling; Metaknowledge; multi-modal; document layout analysis; knowledge graph;
D O I
10.1109/ACCESS.2021.3068728
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The triplet-based knowledge in large-scale knowledge bases is most likely lacking in structural logic and problematic of conducting knowledge hierarchy. In this paper, we introduce the concept of metaknowledge to knowledge engineering research for the purpose of structural knowledge construction. Therefore, the Metaknowledge Extraction Framework and Document Structure Tree model are presented to extract and organize metaknowledge elements (titles, authors, abstracts, sections, paragraphs, etc.), so that it is feasible to extract the structural knowledge from multi-modal documents. Experiment results have proved the effectiveness of metaknowledge elements extraction by our framework. Meanwhile, detailed examples are given to demonstrate what exactly metaknowledge is and how to generate it. At the end of this paper, we propose and analyze the task flow of metaknowledge applications and the associations between knowledge and metaknowledge.
引用
收藏
页码:50050 / 50060
页数:11
相关论文
共 50 条
  • [41] Multi-modal mapping
    Yates, Darran
    [J]. NATURE REVIEWS NEUROSCIENCE, 2016, 17 (09) : 536 - 536
  • [42] Multi-modal perception
    Hollier, MP
    Rimell, AN
    Hands, DS
    Voelcker, RM
    [J]. BT TECHNOLOGY JOURNAL, 1999, 17 (01) : 35 - 46
  • [43] Multi-modal mapping
    Darran Yates
    [J]. Nature Reviews Neuroscience, 2016, 17 : 536 - 536
  • [44] Hadamard matrix-guided multi-modal hashing for multi-modal retrieval
    Yu, Jun
    Huang, Wei
    Li, Zuhe
    Shu, Zhenqiu
    Zhu, Liang
    [J]. DIGITAL SIGNAL PROCESSING, 2022, 130
  • [45] Conversational multi-modal browser: An integrated multi-modal browser and dialog manager
    Tiwari, A
    Hosn, RA
    Maes, SH
    [J]. 2003 SYMPOSIUM ON APPLICATIONS AND THE INTERNET, PROCEEDINGS, 2003, : 348 - 351
  • [46] Collaboration based multi-modal multi-label learning
    Yi Zhang
    Yinlong Zhu
    Zhecheng Zhang
    Chongjung Wang
    [J]. Applied Intelligence, 2022, 52 : 14204 - 14217
  • [47] Hierarchical Multi-Modal Prompting Transformer for Multi-Modal Long Document Classification
    Liu, Tengfei
    Hu, Yongli
    Gao, Junbin
    Sun, Yanfeng
    Yin, Baocai
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 6376 - 6390
  • [48] Robust indoor localization based on multi-modal information fusion and multi-scale sequential feature extraction
    Wang, Qinghu
    Jia, Jie
    Chen, Jian
    Deng, Yansha
    Wang, Xingwei
    Aghvami, Abdol Hamid
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 155 : 164 - 178
  • [49] Collaboration based multi-modal multi-label learning
    Zhang, Yi
    Zhu, Yinlong
    Zhang, Zhecheng
    Wang, Chongjung
    [J]. APPLIED INTELLIGENCE, 2022, 52 (12) : 14204 - 14217
  • [50] Infrared thermal image ROI extraction algorithm based on fusion of multi-modal feature maps
    Zhu Li
    Zhang Jing
    Fu Ying-Kai
    Shen Hui
    Zhang Shou-Feng
    Hong Xiang-Gong
    [J]. JOURNAL OF INFRARED AND MILLIMETER WAVES, 2019, 38 (01) : 125 - 132