Ontology-based automatically hidden web portal index

被引:0
|
作者
Song, H [1 ]
Pan, L
Ma, FY
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200030, Peoples R China
[2] Donghua Univ, Coll Informat Sci & Technol, Shanghai 200051, Peoples R China
关键词
hidden Web; query interfaces; information extraction; interface transformation; ontology;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Many valuable databases on the Web have non-crawlable contents that are "hidden" behind the search forms. Information is available only by filling out HTML forms manually to query the underlying databases. For accessing data behind forms by automated agents, the critical task is having the corresponding query interfaces of the hidden databases that can be understood by machine. This paper presents an automatic approach of hidden Web portal index for various domains. It discovers and scrapes the query forms from Web pages based the tag-tree presentation, and then interpret them into the uniform mediate interfaces with the aid of domain ontology definition. To achieve high transformation accuracy, the domain ontology is also used to filter out the interfaces that are not related to the specific domain. The query interfaces gained finally represented with common concepts can automatically be indexed and retrieved by program. The experiments indicate that the algorithms used are efficient and the system is materially useful for information system or personalized Web search system to retrieval contents from hidden Web.
引用
收藏
页码:609 / 611
页数:3
相关论文
共 50 条
  • [1] Integrating Web services into ontology-based Web portal
    Zhou, J
    Yu, Y
    Zhang, L
    Lin, CX
    Yang, Y
    [J]. WEB TECHNOLOGIES RESEARCH AND DEVELOPMENT - APWEB 2005, 2005, 3399 : 585 - 596
  • [2] Ontology-based Knowledge Extraction from Hidden Web
    宋晖
    马范援
    刘晓强
    [J]. Journal of Donghua University(English Edition), 2004, (05) : 73 - 78
  • [3] Ontology-based web crawler
    Ganesh, S
    Jayaraj, M
    Kalyan, V
    Murthy, S
    Aghila, G
    [J]. ITCC 2004: INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: CODING AND COMPUTING, VOL 2, PROCEEDINGS, 2004, : 337 - 341
  • [4] An ontology-based Web engine
    Lee, MR
    Mizoguchi, R
    [J]. WEB TECHNOLOGIES AND APPLICATIONS, 1998, : 359 - 360
  • [5] Automatically generating assembly sequences with an ontology-based approach
    Zhong, Yanru
    Jiang, Chaohao
    Qin, Yuchu
    Yang, Guoyu
    Huang, Meifa
    Luo, Xiaonan
    [J]. ASSEMBLY AUTOMATION, 2020, 40 (02) : 319 - 334
  • [6] Ontology-based Web navigation assistant
    Jung, H
    Yang, JY
    Choi, J
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING IDEAL 2004, PROCEEDINGS, 2004, 3177 : 443 - 448
  • [7] Ontology-based web knowledge management
    Wang, YM
    Yang, ZH
    Kong, PHH
    Gay, RKL
    [J]. ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 1859 - 1863
  • [8] An Ontology-Based Crawler for the Semantic Web
    Van de Maele, Felix
    Spyns, Peter
    Meersman, Robert
    [J]. ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS: OTM 2008 WORKSHOPS, 2008, 5333 : 1056 - +
  • [9] Ontology-Based Web Information Extraction
    Mo, Qian
    Chen, Yi-hong
    [J]. COMMUNICATIONS AND INFORMATION PROCESSING, PT 1, 2012, 288 : 118 - 126
  • [10] Ontology-Based Administration of Web Directories
    Horvat, Marko
    Gledec, Gordan
    Bogunovic, Nikola
    [J]. TRANSACTIONS ON COMPUTATIONAL COLLECTIVE INTELLIGENCE I, 2010, 6220 : 101 - 120