Towards Deeper Understanding of the Search Interfaces of the Deep Web

被引:0
|
作者
Hai He
Weiyi Meng
Yiyao Lu
Clement Yu
Zonghuan Wu
机构
[1] SUNY at Binghamton,Department of Computer Science
[2] University of Illinois at Chicago,Department of Computer Science
[3] University of Louisiana at Lafayette,Center for Advanced Computer Studies
来源
World Wide Web | 2007年 / 10卷
关键词
Web databases; search interfaces extraction; interface schema;
D O I
暂无
中图分类号
学科分类号
摘要
Many databases have become Web-accessible through form-based search interfaces (i.e., HTML forms) that allow users to specify complex and precise queries to access the underlying databases. In general, such a Web search interface can be considered as containing an interface schema with multiple attributes and rich semantic/meta-information; however, the schema is not formally defined in HTML. Many Web applications, such as Web database integration and deep Web crawling, require the construction of the schemas. In this paper, we first propose a schema model for representing complex search interfaces, and then present a layout-expression based approach to automatically extract the logical attributes from search interfaces. We also rephrase the identification of different types of semantic information as a classification problem, and design several Bayesian classifiers to help derive semantic information from extracted attributes. A system, WISE-iExtractor, has been implemented to automatically construct the schema from any Web search interfaces. Our experimental results on real search interfaces indicate that this system is highly effective.
引用
收藏
页码:133 / 155
页数:22
相关论文
共 50 条
  • [21] Deeper: A Data Enrichment System Powered by Deep Web
    Wang, Pei
    He, Yongjun
    Shea, Ryan
    Wang, Jiannan
    Wu, Eugene
    SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2018, : 1801 - 1804
  • [22] Towards deeper understanding of the latent semantic analysis performance
    Nakov, P
    Valchanova, E
    Angelova, G
    Recent Advances in Natural Language Processing III, 2004, 260 : 297 - 306
  • [23] Towards a deeper geometric, analytic and algorithmic understanding of margins
    Ramdas, Aaditya
    Pena, Javier
    OPTIMIZATION METHODS & SOFTWARE, 2016, 31 (02): : 377 - 391
  • [24] Introduction: towards a deeper understanding of the development of early football
    Curry, Graham
    SOCCER & SOCIETY, 2018, 19 (01) : 1 - 4
  • [25] Towards deeper understanding of multifaceted chemistry of magnesium alkylperoxides
    Pietrzak, Tomasz
    Justyniak, Iwona
    Zelga, Karolina
    Nowak, Krzysztof
    Ochal, Zbigniew
    Lewinski, Janusz
    COMMUNICATIONS CHEMISTRY, 2021, 4 (01)
  • [26] Towards a deeper understanding of parenting on farms: A qualitative study
    Elliot, Valerie
    Cammer, Allison
    Pickett, William
    Marlenga, Barbara
    Lawson, Joshua
    Dosman, James
    Hegel, Louise
    Koehncke, Niels
    Trask, Catherine
    PLOS ONE, 2018, 13 (08):
  • [27] Towards deeper understanding of multifaceted chemistry of magnesium alkylperoxides
    Tomasz Pietrzak
    Iwona Justyniak
    Karolina Zelga
    Krzysztof Nowak
    Zbigniew Ochal
    Janusz Lewiński
    Communications Chemistry, 4
  • [28] Towards deeper co-understanding of software quality
    Tervonen, I
    Kerola, P
    INFORMATION AND SOFTWARE TECHNOLOGY, 1998, 39 (14-15) : 995 - 1003
  • [29] Towards a Deeper Understanding of the Einstein–Podolsky–Rosen Problem
    Thomas Krüger
    Foundations of Physics, 2000, 30 : 1869 - 1890