Towards XML Schema Extraction from Deep Web

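Authors
Saissi, Yasser [1]
Zellou, Ahmed [1]
Idri, Ali [1]
Affiliations
[1] Mohammed V Univ Rabat, ENSIAS, Rabat, Morocco
Keywords
Deep web; XML schema; Web integration
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract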
Web search engines cannot reach the entire web. The hidden, inaccessible part is called the deep web. Many methods in the literature aim to access and integrate the large volume of structured data it contains. In this paper, we propose an approach to extract the XML schema describing a selected deep web source. The approach is based on static and dynamic analysis of the HTML forms that give access to the selected source, and it draws on two knowledge bases: our proprietary identification tables and WordNet. The extracted XML schema will be used to integrate the associated deep web source into a mediation system without extracting all of its information.
Pages: 94-99
Page count: 6
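
To make the idea of static form analysis in the abstract more concrete, here is a minimal Python sketch that reads the fields of an HTML form and emits an XML Schema with one element per field. It is an illustration only, not the authors' method: the class `FormFieldCollector`, the function `form_to_xsd`, the `XSD_TYPES` mapping, and the root element name `record` are all hypothetical, and the dynamic analysis, the proprietary identification tables, and the WordNet lookup mentioned in the abstract are not modeled. It also assumes that form field names are valid XML names.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET

XS = "http://www.w3.org/2001/XMLSchema"

# Hypothetical mapping from HTML input types to XSD types (an assumption,
# not taken from the paper); anything unlisted falls back to xs:string.
XSD_TYPES = {"number": "xs:decimal", "date": "xs:date", "checkbox": "xs:boolean"}


class FormFieldCollector(HTMLParser):
    """Collects (name, kind) pairs for named form controls on a page."""

    def __init__(self):
        super().__init__()
        self.fields = []
        self._seen = set()

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        name = attrs.get("name")
        if tag not in ("input", "select", "textarea") or not name:
            return
        kind = attrs.get("type", "text") if tag == "input" else tag
        # Skip controls that trigger actions rather than carry data,
        # and keep only the first occurrence of a repeated name.
        if kind in ("submit", "button", "reset", "image") or name in self._seen:
            return
        self._seen.add(name)
        self.fields.append((name, kind))


def form_to_xsd(html, root_name="record"):
    """Return an XSD string with one optional element per form field."""
    collector = FormFieldCollector()
    collector.feed(html)

    ET.register_namespace("xs", XS)
    schema = ET.Element(f"{{{XS}}}schema")
    root = ET.SubElement(schema, f"{{{XS}}}element", name=root_name)
    complex_type = ET.SubElement(root, f"{{{XS}}}complexType")
    sequence = ET.SubElement(complex_type, f"{{{XS}}}sequence")
    for name, kind in collector.fields:
        ET.SubElement(
            sequence,
            f"{{{XS}}}element",
            name=name,
            type=XSD_TYPES.get(kind, "xs:string"),
            minOccurs="0",
        )
    return ET.tostring(schema, encoding="unicode")


if __name__ == "__main__":
    sample = (
        '<form action="/search">'
        '<input type="text" name="title">'
        '<input type="number" name="price"/>'
        '<select name="genre"><option>fiction</option></select>'
        '<input type="submit" name="go">'
        "</form>"
    )
    print(form_to_xsd(sample))
```

A sketch like this only sees what is written in the form markup. Inferring the schema of the data behind the form, as opposed to the form itself, would require the dynamic analysis and knowledge bases the abstract refers to.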