Metaknowledge Extraction Based on Multi-Modal Documents

Cited by: 3

Authors:

Liu, Shu-Kan [1,2]

Xu, Rui-Lin [2]

Geng, Bo-Ying [2]

Sun, Qiao [2,3]

Duan, Li [2]

Liu, Yi-Ming [2]

Affiliations:

[1] Southeast Univ, Sch Comp Sci & Engn, Nanjing 211189, Peoples R China

[2] PLA Naval Univ Engn, Sch Elect Engn, Wuhan 430033, Peoples R China

[3] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310058, Peoples R China

Source:

IEEE ACCESS | 2021, Vol. 9

Keywords:

Task analysis; Optical character recognition software; Layout; Object detection; Semantics; Knowledge based systems; Computational modeling; Metaknowledge; multi-modal; document layout analysis; knowledge graph

DOI:

10.1109/ACCESS.2021.3068728

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

The triplet-based knowledge in large-scale knowledge bases often lacks structural logic, which makes it difficult to build a knowledge hierarchy. In this paper, we introduce the concept of metaknowledge to knowledge engineering research for the purpose of structural knowledge construction. To this end, we present the Metaknowledge Extraction Framework and the Document Structure Tree model, which extract and organize metaknowledge elements (titles, authors, abstracts, sections, paragraphs, etc.) so that structural knowledge can be extracted from multi-modal documents. Experimental results demonstrate the effectiveness of our framework at extracting metaknowledge elements. Detailed examples show what exactly metaknowledge is and how to generate it. At the end of the paper, we propose and analyze the task flow of metaknowledge applications and the associations between knowledge and metaknowledge.


Pages: 50050-50060

Page count: 11

