Scalable information extraction for web queries

被引:0
|
作者
Hsu, Meichun [1 ]
Xiong, Yuhong [2 ]
机构
[1] Hewlett Packard Labs, 1501 Page Mill Rd, Palo Alto, CA 94022 USA
[2] Innovat Works, Beijing 100084, Peoples R China
关键词
web mining; parallel computing; classification; information extraction; focused crawling;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The dominant way to find information on the web nowadays is through search. General search engines are very effective, but search phrases and results are unstructured and that limits a user's ability to further automate the processing of the search results. In recent years, we have seen efforts to build systems that support more precise query on the web for certain content verticals. We describe the general problems for building an extensible web query system and present one of our projects in this area - a vertical search portal for online courses.
引用
收藏
页码:176 / 184
页数:9
相关论文
共 50 条
  • [1] Consistency queries in information extraction
    Grieser, G
    Jantke, KP
    Lange, S
    [J]. ALGORITHMIC LEARNING THEORY, PROCEEDINGS, 2002, 2533 : 173 - 187
  • [2] Juicer: Scalable Extraction for Thread Meta-information of Web Forum
    Guo, Yan
    Wang, Yu
    Ding, Guodong
    Cao, Donglin
    Zhang, Gang
    Lv, Yi
    [J]. INTELLIGENCE AND SECURITY INFORMATICS, PROCEEDINGS, 2009, 5477 : 143 - +
  • [3] Scalable, efficient range queries for grid information services
    Andrzejak, A
    Xu, ZC
    [J]. SECOND INTERNATIONAL CONFERENCE ON PEER-TO-PEER COMPUTING, PROCEEDINGS, 2002, : 33 - 40
  • [4] Scalable Execution of Continuous Aggregation Queries over Web Data
    Gupta, Rajeev
    Ramamritham, Krithi
    [J]. IEEE INTERNET COMPUTING, 2012, 16 (01) : 43 - 51
  • [5] Information Monitoring on the Web: A Scalable Solution
    Liu L.
    Tang W.
    Buttler D.
    Pu C.
    [J]. World Wide Web, 2002, 5 (4) : 263 - 304
  • [6] GenerIE: Information Extraction Using Database Queries
    Tari, Luis
    Phan Huy Tu
    Hakenberg, Joerg
    Chen, Yi
    Tran Cao Son
    Gonzalez, Graciela
    Baral, Chitta
    [J]. 26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING ICDE 2010, 2010, : 1121 - 1124
  • [7] Scalable Knowledge Extraction and Visualization for Web Intelligence
    Scharl, Arno
    Weichselbraun, Albert
    Goebel, Max
    Rafelsberger, Walter
    Kamolov, Ruslan
    [J]. PROCEEDINGS OF THE 49TH ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS 2016), 2016, : 3749 - 3757
  • [8] Geosemantic Web Queries on ChefMoz for Personalized Information Retrieval
    Ponce-Medellin, Rafael
    Gonzalez Serna, Gabriel
    Vargas, Rocio
    Ruiz-Vanoye, J. A.
    Mexicano, A.
    Cervantes, S.
    [J]. 2009 EIGHTH MEXICAN INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, : 185 - 190
  • [9] A method for web information extraction
    Lam, Man I.
    Gong, Zhiguo
    Muyeba, Maybin
    [J]. PROGRESS IN WWW RESEARCH AND DEVELOPMENT, PROCEEDINGS, 2008, 4976 : 383 - +
  • [10] Web Services for information extraction from the Web
    Habegger, B
    Quafafou, M
    [J]. IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES, PROCEEDINGS, 2004, : 279 - 286