Similarity Webpage Denoising Data Clustering Algorithm Based on Time Series

Cited by: 0
Authors
Hang Chun-mei [1]
Wu Yang-yang [1]
Affiliations
[1] HuaQiao Univ, Dept Comp Sci & Technol, Xiamen 361021, Peoples R China
Keywords
Similarity matching; intrinsic mode function; weighted processing
DOI
10.1109/ICMTMA.2015.240
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
When processing large volumes of non-stationary or nonlinear Webpage sequence data, which typically exhibit a very high noise ratio, empirical mode decomposition (EMD) is often the method of choice. Applying EMD to the sequence data yields a set of intrinsic mode functions (IMFs) and a residual series. The IMFs capture local characteristics of the data over different time ranges, which gives the decomposition a denoising effect. Exploiting the fact that different IMFs cover different characteristics, the method decomposes the initial Webpage information with EMD to extract the relevant information, assigns a different weight to the Webpage information according to the features of each IMF, and then uses the Euclidean distance to measure similarity. The results show that, compared with the previous approach of matching the sequences directly, matching on the IMFs first decomposes the time series to eliminate the influence of noise and then applies weighted matching, which greatly improves matching accuracy. The method is therefore effective.
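The abstract does not give the matching formula, so the following is only a minimal sketch of one plausible reading: similarity is taken as a weighted sum of per-IMF Euclidean distances. It assumes the IMFs of both sequences have already been produced by some EMD routine and are aligned (same count, same length). The function names `euclidean` and `weighted_imf_distance` and the example weights are hypothetical and are not taken from the paper.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def weighted_imf_distance(imfs_a, imfs_b, weights):
    """Weighted sum of per-IMF Euclidean distances (assumed scheme).

    imfs_a, imfs_b: lists of IMFs, each IMF a list of floats. They are
    assumed to come from an EMD step that is not shown here.
    weights: one non-negative weight per IMF (hypothetical values).
    """
    if not (len(imfs_a) == len(imfs_b) == len(weights)):
        raise ValueError("need one weight per IMF pair")
    return sum(w * euclidean(a, b) for w, a, b in zip(weights, imfs_a, imfs_b))


# Toy data: two "pages", each already decomposed into two IMFs.
imfs_page_1 = [[0.0, 1.0, 0.0, -1.0], [1.0, 1.0, 1.0, 1.0]]
imfs_page_2 = [[0.0, 1.0, 0.0, -1.0], [1.0, 1.0, 1.0, 4.0]]

# Assumption: the first (highest-frequency) IMF is treated as noisier,
# so it receives the smaller weight.
example_weights = [0.2, 0.8]

distance = weighted_imf_distance(imfs_page_1, imfs_page_2, example_weights)
print(distance)
```

A smaller distance indicates more similar pages under this assumed scheme. How the weights are actually chosen for each IMF is not stated in the abstract.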
Pages: 984-986
Page count: 3