A multi-source heterogeneous spatial big data fusion method based on multiple similarity and voting decision

Cited by: 2
Authors
Chen, Zeqiu [1 ]
Zhou, Jianghui [2 ]
Sun, Ruizhi [1 ,3 ]
Affiliations
[1] China Agr Univ, Coll Informat & Elect Engn, Beijing 100083, Peoples R China
[2] JD Technol, Beijing 100176, Peoples R China
[3] Minist Agr, Sci Res Base Integrated Technol Precis Agr Anim Hu, Beijing 100083, Peoples R China
Keywords
Data fusion; Spatial big data; Multi-source heterogeneity; Multiple similarity; Voting decision; INFORMATION FUSION; ONTOLOGY;
DOI
10.1007/s00500-022-07734-0
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Data fusion is an efficient way to achieve improved accuracy and more specific inferences by fusing and aggregating data from different sensors. However, as spatial data grow more complex, with massive volume and multi-source heterogeneous characteristics, existing methods cannot fully satisfy the requirements for data integrity and for the accuracy of fusion results in some specific situations. Taking the geographical properties of spatial data into account, this paper proposes a multi-source heterogeneous spatial big data fusion method based on multiple similarity and voting decision (SDFSV). The method develops a three-step record linking algorithm to improve the quality of entity recognition for the incremental fusion of massive data. A one-time voting algorithm is then introduced so that data conflicts can be significantly reduced, which in turn improves the accuracy of the data fusion. A relation deduction method based on rules and entity recognition is presented to enhance data integrity. In addition, a data traceability mechanism is needed to promote the traceability and interpretability of the fusion results. Experiments on data about Beijing medical institutions collected from 10 data sources show that SDFSV achieves improved performance.
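The abstract names two ideas, matching records with more than one similarity measure and resolving conflicting attribute values by voting, without giving their details. The following is a minimal Python sketch of that general pattern only, not the SDFSV algorithm itself: the string measure (`difflib`), the haversine distance, the thresholds, the field names, and the helper names `name_similarity`, `distance_m`, `same_entity` and `vote` are all assumptions chosen for illustration, and the paper's three-step record linking is not reproduced here.

```python
from collections import Counter
from difflib import SequenceMatcher
from math import asin, cos, radians, sin, sqrt


def name_similarity(a: str, b: str) -> float:
    """Case-insensitive string similarity in [0, 1] (assumed stand-in measure)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def distance_m(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle (haversine) distance between two points, in metres."""
    earth_radius_m = 6_371_000.0
    dlat = radians(lat2 - lat1)
    dlon = radians(lon2 - lon1)
    h = (sin(dlat / 2) ** 2
         + cos(radians(lat1)) * cos(radians(lat2)) * sin(dlon / 2) ** 2)
    return 2 * earth_radius_m * asin(sqrt(h))


def same_entity(a: dict, b: dict,
                name_threshold: float = 0.8,
                max_distance_m: float = 200.0) -> bool:
    """Treat two records as one entity only if BOTH similarities agree.

    The thresholds are illustrative assumptions, not values from the paper.
    """
    close_names = name_similarity(a["name"], b["name"]) >= name_threshold
    close_places = distance_m(a["lat"], a["lon"],
                              b["lat"], b["lon"]) <= max_distance_m
    return close_names and close_places


def vote(records: list, field: str):
    """Return the most frequent non-missing value of `field`, or None."""
    values = [r[field] for r in records if r.get(field) is not None]
    if not values:
        return None
    return Counter(values).most_common(1)[0][0]
```

In use, records from different sources that `same_entity` links would be grouped, and `vote` would then be called once per attribute on each group; for example, `vote(group, "phone")` keeps the phone number reported by the most sources. Requiring both a textual and a geographical match is one simple way to combine multiple similarities; a weighted score would be another.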
Pages: 2479-2492
Page count: 14