Multi-objective optimization integration of query interfaces for the Deep Web based on attribute constraints

被引:8
|
作者
Li, Yanni [1 ,2 ]
Wang, Yuping [1 ]
Jiang, Peng [3 ]
Zhang, Zhensong [4 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, 2 South Taibai Rd, Xian 710071, Shaanxi, Peoples R China
[2] Xidian Univ, Sch Software, Xian 710071, Shaanxi, Peoples R China
[3] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
[4] Chinese Acad Sci, Inst Software, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Domain-Specific Deep Web Data Integration Systems (DWDIS); Algorithm; Integration of query interface; Attribute constraint matrix; SEARCH INTERFACES; SCHEMA; DATABASES;
D O I
10.1016/j.datak.2013.01.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In order to query and retrieve the rich and useful information hidden in the Deep Web efficiently, extensive research on domain-specific Deep Web Data Integration Systems (DWDIS) has been carried out in recent years. In DWDIS, large-scale automatic integration of query interfaces of domain-specific Web Databases (WDBs) remains a serious challenge due to the scale of the problem and the great diversity of the WDBs' query interfaces. To address this challenge, in this paper, we first give a definition of the constraint matrix which can accurately describe three types of constraints (hierarchical constraints, group constraints and precedence constraints) and the strengths of attributes of a query interface, and then prove that the schema tree of the query interface corresponds to only one constraint matrix, and vice versa. Furthermore, we transform the problem of integrating domain-specific query interfaces into a problem of integrating the constraint matrices and set up a multi-objective optimization problem model. To effectively solve the optimization model, some strategies to extend and merge the constraint matrices are designed. A method for automatically detecting and filtering abnormal data (noises) in the query interfaces is also proposed. More importantly, a novel and efficient algorithm applicable to large-scale automatic integration of domain-specific query interfaces is developed. Finally, the proposed algorithm is evaluated by experiments on the real query interface data set. Our theoretical analysis and experimental results show that the proposed algorithm outperforms existing state-of-the-art integration algorithms of domain-specific query interfaces. (c) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:38 / 60
页数:23
相关论文
共 50 条
  • [1] Automatic Integration of Deep Web Query Interfaces based on Ontology
    Wang, Ying
    Peng, Tao
    Zuo, Wanli
    He, Fengling
    [J]. ICCIT: 2009 FOURTH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND CONVERGENCE INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2009, : 1654 - 1659
  • [2] Research on the Integration of Deep Web Query Interfaces
    Liu, Yongjun
    Xie, Conghua
    Chang, Jinyi
    [J]. 2014 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C 2014), 2014, : 332 - 335
  • [3] Integration of Query Interfaces for Deep Web Databases
    Wang, Ying
    Zuo, Wanli
    Peng, Tao
    He, Fengling
    [J]. ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 4, PROCEEDINGS, 2008, : 157 - 161
  • [4] Multi-objective parametric query optimization
    Trummer, Immanuel
    Koch, Christoph
    [J]. VLDB JOURNAL, 2017, 26 (01): : 107 - 124
  • [5] Multi-Objective Parametric Query Optimization
    Trummer, Immanuel
    Koch, Christoph
    [J]. COMMUNICATIONS OF THE ACM, 2017, 60 (10) : 81 - 89
  • [6] Multi-Objective Parametric Query Optimization
    Trummer, Immanuel
    Koch, Christoph
    [J]. SIGMOD RECORD, 2016, 45 (01) : 24 - 31
  • [7] Multi-objective parametric query optimization
    Immanuel Trummer
    Christoph Koch
    [J]. The VLDB Journal, 2017, 26 : 107 - 124
  • [8] Multi-Objective Parametric Query Optimization
    Trummer, Immanuel
    Koch, Christoph
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 8 (03): : 221 - 232
  • [9] Improving web service interfaces modularity using multi-objective optimization
    Boukharata, Sabrine
    Ouni, Ali
    Kessentini, Marouane
    Bouktif, Salah
    Wang, Hanzhang
    [J]. AUTOMATED SOFTWARE ENGINEERING, 2019, 26 (02) : 275 - 312
  • [10] Improving web service interfaces modularity using multi-objective optimization
    Sabrine Boukharata
    Ali Ouni
    Marouane Kessentini
    Salah Bouktif
    Hanzhang Wang
    [J]. Automated Software Engineering, 2019, 26 : 275 - 312