Web Information Extraction for content augmentation

被引:0
|
作者
Janevski, A [1 ]
Dimitrova, N [1 ]
机构
[1] Philips Res USA, Briarcliff Manor, NY 10510 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Today users have to cope with an overwhelming amount of TV channels and Web content sources. We introduce automatic content augmentation, as a novel approach to contextual information extraction on behalf of the user where the context is provided by the primary content source (i.e. TV channel) and tailored by user's preferences. A key aspect of this approach is Web Information Extraction (WebIE) which automatically derives structured information from unstructured Web documents. Our system executes WebIE tasks, each an instantiation of WebIE rules - our generic document processors. We present two WebIE approaches: Diffusion WebIE that crawls a wide set of Web pages and extracts information from a subset of the pertinent pages; and Laser WebIE that accesses a select set of Web pages and extracts narrowly defined information. We describe the architecture and the implementation details of the system and provide detailed Laser WebIE examples.
引用
收藏
页码:A389 / A392
页数:4
相关论文
共 50 条
  • [41] Web Information Extraction and Conversion for Mashup
    Zhang, Rui
    Lan, Xiang
    Liu, Yao
    Liu, Qingyang
    [J]. MECHATRONICS ENGINEERING, COMPUTING AND INFORMATION TECHNOLOGY, 2014, 556-562 : 5471 - 5476
  • [42] Extraction and Comparison of Tourism Information on the Web
    Wu, Xiaobin
    Hirokawa, Sachio
    Yin, Chengjiu
    Nakatoh, Tetsuya
    Tabata, Yoshiyuki
    [J]. PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL LIFE AND ROBOTICS (AROB 16TH '11), 2011, : 170 - 173
  • [43] Open Information Extraction from the Web
    Banko, Michele
    Cafarella, Michael J.
    Soderland, Stephen
    Broadhead, Matt
    Etzioni, Oren
    [J]. 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 2670 - 2676
  • [44] On validating web information extraction proposals
    Jimenez, Patricia
    Corchuelo, Rafael
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 199
  • [45] Web Information Extraction Based on IEBIDTech
    Ren, Xiaoyan
    Fu, Yunxia
    [J]. 2012 WORLD AUTOMATION CONGRESS (WAC), 2012,
  • [46] Shallow Information Extraction for the Knowledge Web
    Barbosa, Denilson
    Wang, Haixun
    Yu, Cong
    [J]. 2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2013, : 1264 - 1267
  • [47] Open Information Extraction from the Web
    Etzioni, Oren
    Banko, Michele
    Soderland, Stephen
    Weld, Daniel S.
    [J]. COMMUNICATIONS OF THE ACM, 2008, 51 (12) : 68 - 74
  • [48] Metabrain: Web Information Extraction and Visualization
    Teixeira, Joao
    Barata, Gabriel
    Goncalves, Daniel
    [J]. PROCEEDINGS OF THE INTERNATIONAL WORKING CONFERENCE ON ADVANCED VISUAL INTERFACES, 2012, : 534 - 537
  • [49] Extraction of structural information from the web
    Murata, T
    [J]. FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PT 2, PROCEEDINGS, 2005, 3614 : 1204 - 1207
  • [50] Content Extraction from Deep Web Interfaces
    Bhakare, Unnati N.
    Chatur, Prashant N.
    [J]. 2017 INTERNATIONAL CONFERENCE OF ELECTRONICS, COMMUNICATION AND AEROSPACE TECHNOLOGY (ICECA), VOL 1, 2017, : 349 - 353