Content Extraction from Deep Web Interfaces

被引:0
|
作者
Bhakare, Unnati N. [1 ]
Chatur, Prashant N. [1 ]
机构
[1] Govt Coll Engn, Dept Comp Sci & Engn, Amravati, MH, India
关键词
Ranking; deep web; data extraction; in-site search;
D O I
暂无
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Locating deep websites and information fetching from large amount of hidden links from deep web sites to get useful information has become recent trend now a days. The efficiency of the information retrieval process and the coverage of targeted data has become a challenging issue due to dynamic and voluminous nature of web resources. Thus finding the information related to specific topic or a keyword from largely available web resources enables promising opportunities for information discovery. In this paper we propose a system that finds the links related to specific keyword and then it performs the in-site searching to get the deeply hidden links related to the topic under consideration and further extracts the data underlying these hidden web-links. Ranking is performed on the links derived at the intermediate level to get the most relevant data to keyword. The proposed system aims to efficiently locate and extract the data underlying deep web interfaces.
引用
收藏
页码:349 / 353
页数:5
相关论文
共 50 条
  • [1] A Review on Extracting Underlying Content from Deep Web Interfaces
    Bhakare, Unnati N.
    Chatur, Prashant N.
    [J]. 2017 INTERNATIONAL CONFERENCE ON INNOVATIVE MECHANISMS FOR INDUSTRY APPLICATIONS (ICIMIA), 2017, : 234 - 237
  • [2] Effective Schema Extraction of Query Interfaces on the Deep Web
    Qiang, Bao-hua
    Xi, Jian-qing
    Qiang, Bao-Hua
    Chen, Ling
    [J]. FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2008, : 291 - +
  • [3] The Discovery and Extraction of Query Interfaces Based on Deep Web
    Yang Daowen
    Liu Quan
    Cui Zhiming
    Fu Yuchen
    [J]. 2009 WRI WORLD CONGRESS ON SOFTWARE ENGINEERING, VOL 1, PROCEEDINGS, 2009, : 507 - 511
  • [4] Vision-based Deep Web query interfaces automatic extraction
    Institute of Intelligent Information Processing and Application, Suzhou University, Suzhou 215006, China
    [J]. J. Comput. Inf. Syst., 2007, 4 (1433-1440):
  • [5] Schema Extraction for Deep Web Query Interfaces Using Heuristics Rules
    Chichang Jou
    [J]. Information Systems Frontiers, 2019, 21 : 163 - 174
  • [7] Heuristics-Based Schema Extraction for Deep Web Query Interfaces
    Jou, Chichang
    Cheng, Yucheng
    [J]. 2017 IEEE 18TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IEEE IRI 2017), 2017, : 389 - 396
  • [8] Correction to: Schema Extraction for Deep Web Query Interfaces Using Heuristics Rules
    Chichang Jou
    [J]. Information Systems Frontiers, 2020, 22 : 273 - 273
  • [9] Data extraction from Deep Web pages
    Yang, Jufeng
    Shi, Guangshun
    Zheng, Yan
    Wang, Qingren
    [J]. CIS: 2007 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PROCEEDINGS, 2007, : 237 - 241
  • [10] Semantic Deep Web: Automatic Attribute Extraction from the Deep Web Data Sources
    An, Yoo Jung
    Geller, James
    Wu, Yi-Ta
    Chun, Soon Ae
    [J]. APPLIED COMPUTING 2007, VOL 1 AND 2, 2007, : 1667 - 1672