The feature extraction of text mining based on Web

被引:0
|
作者
Liu, LZ [1 ]
Chen, JJ [1 ]
Song, HT [1 ]
机构
[1] Beijing Inst Technol, Dept Comp, Beijing 100081, Peoples R China
关键词
Web; text mining; extract;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A mass of information on WWW usually hides within Web electronic documents. So mining contents of Web pages is a kind of important application. Text mining depends on finding out feature set of documents. The text features are extracted by terms frequency based on Vector Space Model, and feature subset is selected according to evaluating function to reduce high vector dimensions. The selection of feature decides the efficiency of text mining.
引用
收藏
页码:547 / 550
页数:4
相关论文
共 50 条
  • [1] Feature Selection and Feature Weight Estimate in Web Text Mining
    Pei, Zhili
    Qi, Jianhong
    Zhang, Xinhong
    Zhou, Yuxin
    Bai, Mingyu
    Wang, Qinghu
    Liu, Lisha
    Fan, Xiaojing
    Jiang, Mingyang
    [J]. 2ND INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY FOR EDUCATION (ICTE 2015), 2015, : 316 - 320
  • [2] Text Classification using Graph Mining-based Feature Extraction
    Jiang, Chuntao
    Coenen, Frans
    Sanderson, Robert
    Zito, Michele
    [J]. RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XXVI: INCORPORATING APPLICATIONS AND INNOVATIONS IN INTELLIGENT SYSTEMS XVII, 2010, : 21 - 34
  • [3] Text classification using graph mining-based feature extraction
    Jiang, Chuntao
    Coenen, Frans
    Sanderson, Robert
    Zito, Michele
    [J]. KNOWLEDGE-BASED SYSTEMS, 2010, 23 (04) : 302 - 308
  • [4] Research and realization of extraction algorithm on web text mining
    Yin, Shiqun
    Qu, Yuhui
    Ge, Jike
    Lan, Xiaohong
    [J]. IITA 2007: WORKSHOP ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, PROCEEDINGS, 2007, : 278 - +
  • [5] News item extraction for text mining in web newspapers
    Norvåg, K
    Oyri, R
    [J]. International Workshop on Challenges in Web Information Retrieval and Integration, Proceedings, 2005, : 195 - 204
  • [6] Deep Web Data Source Classification Based on Text Feature Extension and Extraction
    Li, Yuancheng
    Wu, Guixian
    Wang, Xiaohan
    [J]. INFOCOMMUNICATIONS JOURNAL, 2019, 11 (03): : 42 - 49
  • [7] Web Text Feature Extraction with Particle Swarm Optimization
    Song Liangtu
    Zhang Xiaoming
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2007, 7 (06): : 132 - 136
  • [8] Bilevel Feature Extraction-Based Text Mining for Fault Diagnosis of Railway Systems
    Wang, Feng
    Xu, Tianhua
    Tang, Tao
    Zhou, MengChu
    Wang, Haifeng
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2017, 18 (01) : 49 - 58
  • [9] Non-negative matrix factorization based text mining: Feature extraction and classification
    Barman, P. C.
    Iqbal, Nadeem
    Lee, Soo-Young
    [J]. NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2006, 4233 : 703 - 712
  • [10] Research on Feature Extraction from Chinese Text for Opinion Mining
    Zhu, Shanzong
    Liu, Yuanchao
    Liu, Ming
    Tian, Peiliang
    [J]. 2009 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, 2009, : 7 - 10