Enterprise information integration

Cited by: 0
Authors
Hernandez, Inma [1]
Affiliations
[1] Univ Seville, Seville, Spain
Keywords
Web page classification; navigation; crawling
DOI
10.3233/AIC-150670
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Integrating a web application into an automated business process requires designing wrappers that take user queries as input and map them onto the search forms that the application provides. Such wrappers build on automatic navigators, which are responsible for navigating to the pages that provide the information required to answer the original user queries. A navigator relies on a web page classifier that discerns which pages provide that information and which do not. The literature offers many proposals for classifying web pages, but none of them fulfills the requirements for a web page classifier in a navigator context. We address the problem of designing an unsupervised web page classifier that builds solely on the information provided by the URLs and does not require extensive crawling of the site being analysed. Our contribution is CALA, a new automated proposal to generate URL-based web page classifiers. Its salient features are that it does not need to crawl the complete web site beforehand, it is unsupervised, it does not require downloading a page before classifying it, and it is computationally tractable. It has been validated by a number of experiments using real-world, top-visited web sites.
Pages: 397-399
Page count: 3