Unsupervised Semantic Mapping for Healthcare Data Storage Schema

被引:3
|
作者
Satti, Fahad Ahmed [1 ,2 ]
Hussain, Musarrat [1 ]
Hussain, Jamil [3 ]
Ali, Syed Imran [1 ]
Ali, Taqdir [1 ]
Bilal, Hafiz Syed Muhammad [1 ,2 ]
Chung, Taechoong [1 ]
Lee, Sungyoung [1 ]
机构
[1] Kyung Hee Univ, Dept Comp Sci & Engn, Yongin 17104, South Korea
[2] Natl Univ Sci & Technol NUST, Sch Elect Engn & Comp Sci SEECS, Islamabad 44000, Pakistan
[3] Sejong Univ, Dept Data Sci, Seoul 05006, South Korea
基金
新加坡国家研究基金会;
关键词
Medical services; Interoperability; Semantics; Medical diagnostic imaging; Natural language processing; Ontologies; Machine learning; Context awareness; decision support systems; expert systems; health information management; medical information systems; ontology engineering; text processing; unsupervised learning;
D O I
10.1109/ACCESS.2021.3100686
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data, information, and knowledge processing systems, in the domain of healthcare, are currently plagued by heterogeneity at various levels. Current solutions have focused on developing a standard-based, manual intervention mechanism, which requires a large number of human resources and necessitates the realignment of existing systems. State-of-the-art methodologies in the field of natural language processing and machine learning can help to partially automate this process, reducing the resource requirements and providing a relatively good multi-class-based classification algorithm. We present a novel methodology for bridging the gap between various healthcare data management solutions by leveraging the strength of transformer-based machine learning models, to create mappings between the data elements. Additionally, the annotated data, collected against five medical schemas and labeled by four annotators is made available for helping future researchers. Our results indicate, that for biased, dependent multi-class text classification, transformer-based models provide better results than linguistic and other classical models. In particular, the Robustly Optimized BERT Pretraining Approach (RoBERTa) provides the best schema matching performance by achieving a Cohen's kappa score of 0.47 and Matthews Correlation Coefficient (MCC) score of 0.48, with human-annotated data.
引用
收藏
页码:107267 / 107278
页数:12
相关论文
共 50 条
  • [1] Schema-Based Mapping Approach for Data Transformation to Enrich Semantic Web
    Natarajan, Senthilselvan
    Vairavasundaram, Subramaniyaswamy
    Teekaraman, Yuvaraja
    Kuppusamy, Ramya
    Radhakrishnan, Arun
    [J]. Wireless Communications and Mobile Computing, 2021, 2021
  • [2] A semantic approach to discovering schema mapping expressions
    An, Yuan
    Borgida, Alex
    Miller, Renre J.
    Mylopoulos, John
    [J]. 2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2007, : 181 - +
  • [3] Mapping from XML DTD to semantic schema
    Rishe, N
    Yang, L
    Chekmasov, M
    Chekmasova, M
    Graham, S
    Roque, A
    [J]. 6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL VII, PROCEEDINGS: INFORMATION SYSTEMS DEVELOPMENT II, 2002, : 450 - 455
  • [4] A RDF-based Semantic Schema Mapping Transformation System for Localized Data Integration
    Cheong, Chi Po
    Chatwin, Chris
    Young, Rupert
    [J]. PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION IN COMMUNICATION, 2009, : 144 - 147
  • [5] Schema-based natural language semantic mapping
    Stratica, N
    Desai, BC
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 2004, 3136 : 103 - 113
  • [6] Schema induction from incomplete semantic data
    Gao, Huan
    Qi, Guilin
    Ji, Qiu
    [J]. INTELLIGENT DATA ANALYSIS, 2018, 22 (06) : 1337 - 1353
  • [7] Semantic Web Information Retrieval in XML By Mapping To RDF Schema
    Phyue, Soe Lai
    Thein, Myint Myint
    Win, Thinn Thinn
    Thwin, Mie Mie Su
    [J]. 2010 INTERNATIONAL CONFERENCE ON NETWORKING AND INFORMATION TECHNOLOGY (ICNIT 2010), 2010, : 500 - 503
  • [8] A Case-Based Approach for Easing Schema Semantic Mapping
    Malherbe, Emmanuel
    Iwaszko, Thomas
    Aufaure, Marie-Aude
    [J]. CASE-BASED REASONING RESEARCH AND DEVELOPMENT, ICCBR 2015, 2015, 9343 : 228 - 243
  • [9] Evaluation of unsupervised semantic mapping of natural language with Leximancer concept mapping
    Smith, Andrew E.
    Humphreys, Michael S.
    [J]. BEHAVIOR RESEARCH METHODS, 2006, 38 (02) : 262 - 279
  • [10] Evaluation of unsupervised semantic mapping of natural language with Leximancer concept mapping
    Andrew E. Smith
    Michael S. Humphreys
    [J]. Behavior Research Methods, 2006, 38 : 262 - 279