Mass Log Data Processing and Mining Based on Hadoop and Cloud Computing

被引:0
|
作者
Yu, Hongyong [1 ]
Wang, Deshuai [1 ]
机构
[1] Neusoft Corp, State Key Lab Software Architecture, Neusoft Pk,2 Xinxiu St, Shenyang 110179, Peoples R China
关键词
mass data processing; data mining; real time statistics; business intelligence; Hadoop;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
With the rapid development of the Internet, SaaS applications delivered as services through internet become an important alternative of traditional software. While using the services, users need real time usage information, and they also need to dig out useful knowledge. As a result, data processing and data mining techniques are designed to cope with such problems, and using log data is an effective method to record the SaaS usage information in a standard format. However, as the size of data grows, traditional distributed log data processing systems are not able to processing massive log data from SaaS applications with millions of users. This paper proposes a mass log data processing and data mining methods based on Hadoop to achieve scalability and performance. The model, process, architecture, and implementation of the data processing and mining methods are proposed, and the experimental results is shown and analyzed to prove the effectiveness of the methods.
引用
收藏
页码:197 / 202
页数:6
相关论文
共 50 条
  • [1] Research on data mining of electric power system based on Hadoop cloud computing platform
    Zhu J.
    [J]. International Journal of Computers and Applications, 2019, 41 (04) : 289 - 295
  • [2] A Survey of Mass Data Mining Based on Cloud-computing
    Hu, Tingting
    Chen, Haishan
    Huang, Lu
    Zhu, Xiaodan
    [J]. 2012 INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY AND IDENTIFICATION (ASID), 2012,
  • [3] Design of big data processing system architecture based on Hadoop Under the cloud computing
    Duan, Chunmei
    [J]. MECHATRONICS ENGINEERING, COMPUTING AND INFORMATION TECHNOLOGY, 2014, 556-562 : 6302 - 6306
  • [4] Research on database massive data processing and mining method based on hadoop cloud platform
    Xiaoyong, Zhao
    Chunrong, Yang
    [J]. Open Automation and Control Systems Journal, 2014, 6 (01): : 1463 - 1467
  • [5] LOG ANALYSIS IN CLOUD COMPUTING ENVIRONMENT WITH HADOOP AND SPARK
    Lin, Xiuqin
    Wang, Peng
    Wu, Bin
    [J]. 2013 5TH IEEE INTERNATIONAL CONFERENCE ON BROADBAND NETWORK & MULTIMEDIA TECHNOLOGY (IC-BNMT), 2013, : 273 - 276
  • [6] Research on Database Massive Data Processing and Mining Method based on Hadoop Cloud Platform
    Wu, Dan
    Li, Zhuorong
    Bie, Rongfang
    Zhou, Mingquan
    [J]. 2014 International Conference on Identification, Information and Knowledge in the Internet of Things (IIKI 2014), 2014, : 107 - 110
  • [7] The Research on Hadoop and Cloud computing-based mass data storage model of computation
    Peng Fang
    Huang Qingyun
    Qian Zhaopeng
    [J]. MECHANICAL AND ELECTRONICS ENGINEERING III, PTS 1-5, 2012, 130-134 : 2899 - 2902
  • [8] Exploration of data mining algorithms of an online learning behaviour log based on cloud computing
    Wang, Rongguo
    [J]. INTERNATIONAL JOURNAL OF CONTINUING ENGINEERING EDUCATION AND LIFE-LONG LEARNING, 2021, 31 (03) : 371 - 380
  • [9] Research on the Data Mining Based on Cloud Computing
    Luo, Laixi
    Zhu, Yu
    [J]. PROCEEDINGS OF 2020 CHINA MARKETING INTERNATIONAL CONFERENCE (WEB CONFERENCING): MARKETING AND MANAGEMENT IN THE DIGITAL AGE, 2020, : 494 - 505
  • [10] DATA MINING ALGORITHM BASED ON CLOUD COMPUTING
    Hao, Y. J.
    [J]. LATIN AMERICAN APPLIED RESEARCH, 2018, 48 (04) : 281 - 285