An efficient topic-specific web text filtering framework

Cited by: 0
Authors
Li, Q [1]
Li, JH [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Modern Commun Inst, Shanghai 200030, Peoples R China
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, an efficient topic-specific Web text filtering framework is proposed. The framework focuses on blocking Web text content on specific topics. It combines a hybrid feature selection method with a highly efficient filtering engine. In training, we select features based on the CHI statistic and rough set theory, and then construct the filter with the Vector Space Model. We train the framework on large datasets, and the results suggest that it is more effective for topic-specific text filtering. The framework runs on a server such as a gateway, and it is more efficient than a client-based system.
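The abstract gives no implementation details, so the following is only a minimal sketch of the two named building blocks: CHI-statistic feature scoring and a Vector Space Model filter. The rough set reduction step is omitted entirely. Everything here is an assumption made for illustration and not the authors' method: the function names, the toy documents, keeping only terms positively associated with the topic, the centroid profile, and the 0.5 cosine threshold.

```python
import math
from collections import Counter

def chi_square(a, b, c, d):
    """CHI statistic for a 2x2 term/topic contingency table.
    a: topic docs containing the term, b: other docs containing it,
    c: topic docs without it,         d: other docs without it."""
    n = a + b + c + d
    denom = (a + c) * (b + d) * (a + b) * (c + d)
    return 0.0 if denom == 0 else n * (a * d - c * b) ** 2 / denom

def select_features(topic_docs, other_docs, k):
    """Return the k terms with the highest CHI score for the topic.
    Assumption: only terms positively associated with the topic are kept."""
    topic_df = Counter(t for doc in topic_docs for t in set(doc))
    other_df = Counter(t for doc in other_docs for t in set(doc))
    scores = {}
    for term in set(topic_df) | set(other_df):
        a, b = topic_df[term], other_df[term]
        c, d = len(topic_docs) - a, len(other_docs) - b
        if a * d > c * b:  # positive association with the topic
            scores[term] = chi_square(a, b, c, d)
    # Sort by descending score, then alphabetically for a stable order.
    return sorted(scores, key=lambda t: (-scores[t], t))[:k]

def vectorize(doc, features):
    """Term-frequency vector of a tokenized document over the features."""
    counts = Counter(doc)
    return [counts[f] for f in features]

def cosine(u, v):
    """Cosine similarity; 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(u, v))
    nu = math.sqrt(sum(x * x for x in u))
    nv = math.sqrt(sum(y * y for y in v))
    return 0.0 if nu == 0 or nv == 0 else dot / (nu * nv)

def build_profile(topic_docs, features):
    """Topic profile: the centroid of the topic documents' vectors."""
    vecs = [vectorize(doc, features) for doc in topic_docs]
    return [sum(col) / len(vecs) for col in zip(*vecs)]

def should_block(doc, features, profile, threshold=0.5):
    """Block a document whose similarity to the profile meets the threshold."""
    return cosine(vectorize(doc, features), profile) >= threshold

# Hypothetical toy corpus of already-tokenized pages.
topic_docs = [["casino", "bet", "win"], ["bet", "poker", "casino"]]
other_docs = [["weather", "rain", "today"], ["news", "today", "win"]]

features = select_features(topic_docs, other_docs, k=3)
profile = build_profile(topic_docs, features)
print(features)
print(should_block(["casino", "bet", "bonus"], features, profile))
print(should_block(["rain", "today"], features, profile))
```

In a gateway deployment like the one the abstract describes, the feature list and profile would be computed once during training, so filtering a page needs only one vectorization and one cosine computation.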
Pages: 157 - 163
Page count: 7
Related papers
50 in total
  • [1] Topic-specific text filtering based on multiple reducts
    Li, Q
    Li, JH
    [J]. AUTONOMOUS INTELLIGENT SYSTEMS: AGENTS AND DATA MINING, PROCEEDINGS, 2005, 3505 : 175 - 183
  • [2] Real-Text Dictionary for Topic-Specific Web Searching
    Pirkola, Ari
    [J]. WEB INFORMATION SYSTEMS AND TECHNOLOGIES, WEBIST 2012, 2013, 140 : 105 - 119
  • [3] A topic-specific data filtering framework based on rough set theory
    Guo, H
    Cao, Y
    Guo, S
    [J]. CCECE 2003: CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-3, PROCEEDINGS: TOWARD A CARING AND HUMANE TECHNOLOGY, 2003, : 1095 - 1098
  • [4] Learnable topic-specific web crawler
    Rungsawang, A
    Angkawattanawit, N
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2005, 28 (02) : 97 - 114
  • [5] Applying Semantic Similarity Measures to Enhance Topic-Specific Web Crawling: Topic-Specific Web Crawlering through Disambiguating Topic Sense
    Pesaranghader, Ali
    Mustapha, Norwati
    Pesaranghader, Ahmad
    [J]. 2013 13TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2013, : 205 - 212
  • [6] Topic-specific intelligent web crawler system
    Qian, Rong
    Xu, Xinhua
    Zheng, Ying
    Yang, Bingru
    [J]. Jisuanji Gongcheng/Computer Engineering, 2006, 32 (03) : 57 - 59
  • [7] System for analyzing topic-specific Web pages
    Song, Ju-Ping
    Wang, Yong-Cheng
    Yin, Zhong-Hang
    Teng, Wei
    [J]. Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2003, 37 (03) : 401 - 403
  • [8] Improvement of HITS for topic-specific web crawler
    Zong, XJ
    Shen, Y
    Liao, XX
    [J]. ADVANCES IN INTELLIGENT COMPUTING, PT 1, PROCEEDINGS, 2005, 3644 : 524 - 532
  • [9] A rough set-based hybrid feature selection method for topic-specific text filtering
    Li, Q
    Li, JH
    Liu, GS
    Li, SH
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1464 - 1468
  • [10] Augmented Topic-Specific Summarization for Domain Dialogue Text
    Rao, Zhiqiang
    Wei, Daimeng
    Li, Zongyao
    Shang, Hengchao
    Yang, Jinlong
    Yu, Zhengzhe
    Li, Shaojun
    Wu, Zhanglin
    Lei, Lizhi
    Yang, Hao
    Qin, Ying
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT II, 2022, 13552 : 274 - 283