Advertising Keywords Extraction from Web Pages

被引:0
|
作者
Liu, Jianyi [1 ]
Wang, Cong [1 ]
Liu, Zhengyang [1 ]
Yao, Wenbin [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Comp Sci, Beijing 100876, Peoples R China
来源
关键词
Keyword extraction; information extraction; advertising; PageRank;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A large and growing number of web pages display contextual advertising based on keywords automatically extracted from the text of the page, and it has been become a rapidly growing business in recent years. We describe a system that learns how to extract keywords from web pages for advertisement targeting. Firstly a text network for a single webpage is build, then Page Rank is applied in the network to decide on the importance of a word, finally top-ranked words are selected as keywords of the webpage. The algorithm is tested on the corpus of blog pages, and the experiment result proves practical and effective.
引用
下载
收藏
页码:336 / 343
页数:8
相关论文
共 50 条
  • [21] TEXT: Automatic Template Extraction from Heterogeneous Web Pages
    Kim, Chulyun
    Shim, Kyuseok
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (04) : 612 - 626
  • [22] Schema Inference and Data Extraction from Templatized Web Pages
    Krishna, Shinde Santaji
    Dattatraya, Joshi Shashank
    2015 INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING (ICPC), 2015,
  • [23] Automatic data extraction from template generated web pages
    Ma, L
    Goharian, N
    Chowdhury, A
    PDPTA'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS 1-4, 2003, : 642 - 648
  • [24] Unsupervised Keyphrase Extraction for Web Pages
    Haarman, Tim
    Zijlema, Bastiaan
    Wiering, Marco
    MULTIMODAL TECHNOLOGIES AND INTERACTION, 2019, 3 (03)
  • [25] Automatic Web Pages Author Extraction
    Changuel, Sahar
    Labroche, Nicolas
    Bouchon-Meunier, Bernadette
    FLEXIBLE QUERY ANSWERING SYSTEMS: 8TH INTERNATIONAL CONFERENCE, FQAS 2009, 2009, 5822 : 300 - 311
  • [26] Web advertising's birth and early childhood as viewed in the pages of advertising age
    Thorson, E
    Wells, WD
    Rogers, S
    ADVERTISING AND THE WORLD WIDE WEB, 1999, : 5 - 25
  • [27] Authoring of Personalized Web Page from Heterogeneous Web Pages by Content Extraction and Integration
    Li, Wei-gang
    Sun, Ke
    Wang, Shuo-chen
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER NETWORKS AND COMMUNICATION TECHNOLOGY (CNCT 2016), 2016, 54 : 734 - 740
  • [28] WEB PAGES CLUSTER BASED ON THE RELATIONS OF MAPPING KEYWORDS TO ONTOLOGY CONCEPT HIERARCHY
    Chen, Rung-Ching
    Bau, Cho-Tsan
    Tsai, Ming-Yung
    Huang, Chung-Yi
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2010, 6 (06): : 2749 - 2760
  • [29] Knowledge Extraction from Web Pages with an Auto-Adaptive System
    Havas, Camille
    Larue, Othalia
    Camus, Mickael
    COMPUTATIONAL ENGINEERING IN SYSTEMS APPLICATIONS, 2008, : 126 - 131
  • [30] Automatic Data Extraction from Lists in Web Pages Based on XML
    Xin, Zhou
    Hao, Wang
    ADVANCED TECHNOLOGY IN TEACHING - PROCEEDINGS OF THE 2009 3RD INTERNATIONAL CONFERENCE ON TEACHING AND COMPUTATIONAL SCIENCE (WTCS 2009), VOL 2: EDUCATION, PSYCHOLOGY AND COMPUTER SCIENCE, 2012, 117 : 915 - 921