An Automatic Semantic Extraction Method for Web Data Interchange

Cited by: 0
Authors
Yao, Yuangang [1]
Liu, Hui [1]
Yi, Jin [1]
Chen, Haiqiang [1]
Zhao, Xianghui [1]
Ma, Xiaoyu [2]
Affiliations
[1] China Informat Technol Secur Evaluat Ctr, Beijing, Peoples R China
[2] Patent Examinat Cooperat Ctr Patent Off, Beijing, Peoples R China
Keywords
semantic extraction; web data; JSON; semantic web; ontology construction
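DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812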
Abstract
Data interchange on the Internet mainly serves specific web applications. The data formats are simple but lack rich semantic descriptions, so they cannot meet current requirements for deep web data analysis with semantic technology. To connect web data with semantics, we propose an automatic semantic extraction method that processes web data sets and generates semantic data for applications. The method comprises four processes: data parsing, semantic mapping, semantic enrichment, and ontology merging. It converts web data into semantic web descriptions and improves data semantics through semantic computation. It also builds semantic models for data instances, which can support further semantic reasoning applications. We use this method to automatically extract concepts, properties, constraints, and values from schemaless JSON data, and build a semantic ontology to describe the metadata and instances. The experimental results show that this method can process web data resources and create semantic data effectively.
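The parsing and mapping steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the triple layout, and the JSON-type-to-XSD table are all assumptions made for illustration, and semantic enrichment and ontology merging are not attempted. It assumes each JSON object becomes a concept with one instance, nested objects become object properties, and scalar values become datatype properties whose range is inferred from the value type.

```python
import json

# Assumed mapping from Python/JSON value types to XSD datatypes (illustrative only).
XSD_TYPES = {bool: "xsd:boolean", int: "xsd:integer",
             float: "xsd:decimal", str: "xsd:string"}


def add(triples, triple):
    """Append a schema-level triple only once."""
    if triple not in triples:
        triples.append(triple)


def extract(name, obj, triples, counts):
    """Map one parsed JSON object to a concept, an instance, and its properties."""
    concept = name.capitalize()
    counts[concept] = counts.get(concept, 0) + 1
    instance = f"{name}_{counts[concept]}"
    add(triples, (concept, "rdf:type", "owl:Class"))
    triples.append((instance, "rdf:type", concept))
    for key, value in obj.items():
        # Arrays are flattened: every item reuses the parent key as its property.
        for item in value if isinstance(value, list) else [value]:
            if isinstance(item, dict):
                # Nested object: a new concept linked by an object property.
                child = extract(key, item, triples, counts)
                add(triples, (key, "rdf:type", "owl:ObjectProperty"))
                triples.append((instance, key, child))
            elif item is not None:
                # Scalar: a datatype property with an inferred range constraint.
                add(triples, (key, "rdf:type", "owl:DatatypeProperty"))
                add(triples, (key, "rdfs:range",
                              XSD_TYPES.get(type(item), "rdfs:Literal")))
                triples.append((instance, key, item))
    return instance


# Hypothetical schemaless input; "book" is a root name chosen by the caller.
doc = json.loads('{"title": "Dune", "year": 1965, '
                 '"author": {"name": "Frank Herbert"}}')
triples = []
root = extract("book", doc, triples, {})
for triple in triples:
    print(triple)
```

Keeping schema-level triples (classes, properties, ranges) deduplicated while appending every instance-level triple mirrors the abstract's distinction between metadata and instances; a real pipeline would more likely emit the result through an RDF library than as plain tuples.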
Pages: 148-152
Page count: 5