Extraction of relational schema from deep web sources: a form driven approach

Cited by: 0
Authors
Saissi, Yasser [1]
Zellou, Ahmed [1]
Idri, Ali [1]
Affiliations
[1] Mohammed V Univ, ENSIAS, Rabat, Morocco
Keywords
Deep web source; Web source integration; Structured data; HTML form; DATABASES;
DOI
Not available
Chinese Library Classification
TP301 [Theory and methods];
Discipline classification code
081202;
Abstract
The deep web is the largest unexplored part of the web, and we need direct access to all of its web data sources without resorting to crawling or surfacing methods. To this end, we choose to use a virtual web integration system. However, existing deep web virtual integration methods focus only on integrating the query interfaces that give access to the deep web. These query interfaces are integrated to build a global query interface able to query all the deep web sources. The objective of our work is to propose another vision of a deep web virtual integration system, one that uses a mediated schema built from relational schemas, each describing one deep web source. This paper presents our approach to extracting a relational schema that describes a deep web source. The key idea underlying our approach is to analyze two pieces of structured information extracted from the deep web source, the HTML form and the HTML table, in order to discover its data structure and build a relational schema describing it. We also use a knowledge table to benefit from the experience gained in extracting relational schemas from deep web sources.
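The idea of combining an HTML form with an HTML result table can be illustrated with a minimal sketch. It is not the authors' method: the class `FormTableParser`, the functions `normalize` and `sketch_schema`, the default table name `source`, and the choice of typing every column as `TEXT` are all assumptions made for illustration, and the knowledge table mentioned in the abstract is not modelled.

```python
from html.parser import HTMLParser


class FormTableParser(HTMLParser):
    """Hypothetical helper: collects form field names and table header cells."""

    def __init__(self):
        super().__init__()
        self.form_fields = []    # name attributes of <input>/<select>/<textarea>
        self.table_headers = []  # text content of <th> cells
        self._in_th = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag in ("input", "select", "textarea") and attrs.get("name"):
            # Assumption: buttons and hidden fields do not map to data columns.
            if attrs.get("type") not in ("submit", "button", "hidden", "reset"):
                self.form_fields.append(attrs["name"])
        elif tag == "th":
            self._in_th = True
            self.table_headers.append("")

    def handle_endtag(self, tag):
        if tag == "th":
            self._in_th = False

    def handle_data(self, data):
        if self._in_th:
            self.table_headers[-1] += data


def normalize(label):
    """Turn a label such as 'Release Year' into 'release_year'."""
    return "_".join(label.strip().lower().split())


def sketch_schema(html, table_name="source"):
    """Merge table headers and form fields into one CREATE TABLE statement."""
    parser = FormTableParser()
    parser.feed(html)
    columns = []
    for label in parser.table_headers + parser.form_fields:
        name = normalize(label)
        if name and name not in columns:  # drop duplicates, keep first-seen order
            columns.append(name)
    cols = ",\n".join(f"    {c} TEXT" for c in columns)
    return f"CREATE TABLE {table_name} (\n{cols}\n);"
```

Calling `sketch_schema` on a page that contains a search form and a result table yields a single statement whose columns are the union of the table headers and the form field names. A real extractor would also need to infer types, keys, and relationships between several tables, which this sketch leaves out.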
Pages: 178 - 182
Page count: 5
Related papers
50 in total
  • [1] Towards XML Schema Extraction from Deep Web
    Saissi, Yasser
    Zellou, Ahmed
    Idri, Ali
    [J]. 2016 4TH IEEE INTERNATIONAL COLLOQUIUM ON INFORMATION SCIENCE AND TECHNOLOGY (CIST), 2016, : 94 - 99
  • [2] An Effective Schema Extraction Algorithm on the Deep Web
    Qiang, Bao-hua
    Xi, Jian-qing
    Qiang, Bao-hua
    Zhang, Long
    [J]. 2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 10976 - +
  • [3] Schema Extraction of Deep Web Query Interface
    Wang, Ying
    Peng, Tao
    Zuo, Wanli
    Zhu, Huifeng
    [J]. WISM: 2009 INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND MINING, PROCEEDINGS, 2009, : 391 - 395
  • [4] Discovering the Deep Web through XML Schema Extraction
    Saissi, Yasser
    Zellou, Ahmed
    Idri, Ali
    [J]. KDIR: PROCEEDINGS OF THE 8TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL. 1, 2016, : 141 - 149
  • [5] Effective Schema Extraction of Query Interfaces on the Deep Web
    Qiang, Bao-hua
    Xi, Jian-qing
    Qiang, Bao-Hua
    Chen, Ling
    [J]. FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2008, : 291 - +
  • [6] Semantic Deep Web: Automatic Attribute Extraction from the Deep Web Data Sources
    An, Yoo Jung
    Geller, James
    Wu, Yi-Ta
    Chun, Soon Ae
    [J]. APPLIED COMPUTING 2007, VOL 1 AND 2, 2007, : 1667 - 1672
  • [7] Extraction of object-oriented schemas from existing relational databases: A form-driven approach
    Malki, M
    Flory, A
    Rahmouni, MK
    [J]. INFORMATICA, 2002, 13 (01) : 47 - 72
  • [8] DWSpyder: A new schema extraction method for a deep web integration system
    Saissi, Yasser
    Zellou, Ahmed
    Idri, Ali
    [J]. International Journal of Web Engineering and Technology, 2019, 14 (02): : 122 - 150
  • [9] Schema Extraction for Deep Web Query Interfaces Using Heuristics Rules
    Chichang Jou
    [J]. Information Systems Frontiers, 2019, 21 : 163 - 174
  • [10] Schema Extraction for Deep Web Query Interfaces Using Heuristics Rules
    Jou, Chichang
    [J]. INFORMATION SYSTEMS FRONTIERS, 2019, 21 (01) : 163 - 174