Maximum entropy model for mobile text classification in cloud computing using improved information gain algorithm

Cited by: 9
Authors
Yin, Chunyong [1]
Xi, Jinwen [1]
Affiliations
[1] Nanjing Univ Informat Sci & Technol, Jiangsu Engn Ctr Network Monitoring, Sch Comp & Software, Nanjing 210044, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Text classification; Cloud computing; Information gain; Maximum entropy models; Pretreatment; Features selection;
DOI
10.1007/s11042-016-3545-5
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
With the rapid popularization of the Internet and of multimedia, which is regarded as a new mode of information transmission, people can not only easily obtain the information they want but also publish their own information to the world. At the same time, the introduction of a variety of tablet PCs, smartphones and other network terminals, together with the emergence of a variety of social networks, has greatly accelerated the pace of information on the Internet. People can update a variety of text, pictures, video and other data in a variety of applications every day. Data show that the volume of information on the Internet has reached an exponential level, and a news or media company will typically see hundreds and thousands of submissions every day; people now live in a time of greatly expanded information. In the face of such huge information resources, how to manage them effectively, so that people can get the target information more conveniently and quickly, has become a hot research topic. Text classification technology in text information mining is an effective way to solve this problem. We mainly study mobile text classification technology based on the maximum entropy model and implement an automatic text classification system in cloud computing; through technical improvements, we give a technical solution for the large number of documents on the network in a mobile environment. This paper introduces the text classification methods, the features of the maximum entropy model with an improved information gain selection method, the pretreatment method and the MapReduce programming method. The experimental results show good accuracy and recall in the classification of large amounts of text, meeting the requirements of practical application.
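For orientation, the feature selection step named in the abstract can be illustrated with the textbook information gain score, IG(t) = H(C) - [P(t) H(C|t) + P(not t) H(C|not t)]. The sketch below is a minimal illustration under that assumption only; it is not the improved information gain method of the paper, whose details are not given in this record. The function names `entropy` and `information_gain` and the toy documents are hypothetical.

```python
import math
from collections import Counter


def entropy(counts):
    """Shannon entropy (in bits) of a list of class counts."""
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)


def information_gain(docs, labels, term):
    """Textbook information gain of `term` with respect to the class labels.

    docs   : list of token sets, one per document
    labels : list of class labels, aligned with docs
    NOTE: standard IG only; the paper's improved variant is not reproduced.
    """
    with_term = Counter()
    without_term = Counter()
    for tokens, label in zip(docs, labels):
        if term in tokens:
            with_term[label] += 1
        else:
            without_term[label] += 1

    n = len(docs)
    class_entropy = entropy(list(Counter(labels).values()))

    # Weighted entropy of the classes after splitting on term presence.
    conditional = 0.0
    for part in (with_term, without_term):
        size = sum(part.values())
        if size:
            conditional += (size / n) * entropy(list(part.values()))
    return class_entropy - conditional


# Hypothetical toy corpus: two sports documents, two finance documents.
docs = [{"goal", "match"}, {"goal", "team"}, {"stock", "market"}, {"stock", "price"}]
labels = ["sport", "sport", "finance", "finance"]

# Rank the vocabulary by information gain, highest first.
vocabulary = sorted(set().union(*docs))
ranked = sorted(vocabulary, key=lambda t: information_gain(docs, labels, t), reverse=True)
print(ranked[:2])
```

In a pipeline of the kind the abstract describes, terms with the highest scores would be kept as features for the classifier; how the scores are adjusted in the improved method is left to the paper itself.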
Pages: 16875 - 16891
Page count: 17
Related papers
50 in total
  • [1] Maximum entropy model for mobile text classification in cloud computing using improved information gain algorithm
    Chunyong Yin
    Jinwen Xi
    [J]. Multimedia Tools and Applications, 2017, 76 : 16875 - 16891
  • [2] The Research of Text Classification Technology Based on Improved Maximum Entropy Model
    Yin, Chunyong
    Xi, Jinwen
    Wang, Jin
    [J]. 2015 FIRST INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE THEORY, SYSTEMS AND APPLICATIONS (CCITSA 2015), 2015, : 142 - 145
  • [3] Retraction Note: Multimedia text classification algorithm using potential Dirichlet distribution in mobile cloud computing environment
    Xiaohong Zhang
    Yan Gao
    [J]. Multimedia Tools and Applications, 2022, 81 : 39827 - 39827
  • [4] RETRACTED ARTICLE: Multimedia text classification algorithm using potential Dirichlet distribution in mobile cloud computing environment
    Xiaohong Zhang
    Yan Gao
    [J]. Multimedia Tools and Applications, 2020, 79 : 9615 - 9627
  • [5] Accurate Text Classification via Maximum Entropy Model
    Zou, Baoping
    [J]. COLLABORATE COMPUTING: NETWORKING, APPLICATIONS AND WORKSHARING, COLLABORATECOM 2016, 2017, 201 : 569 - 576
  • [6] Intensive Maximum Entropy Model for Sentiment Classification of Short Text
    Rao, Yanghui
    Li, Jun
    Xiang, Xiyun
    Xie, Haoran
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2015, 2015, 9052 : 42 - 51
  • [7] Using maximum entropy model for Chinese text categorization
    Li, RL
    Tao, XP
    Tang, L
    Hu, YF
    [J]. ADVANCED WEB TECHNOLOGIES AND APPLICATIONS, 2004, 3007 : 578 - 587
  • [8] Resource Scheduling Based on Improved FCM Algorithm for Mobile Cloud Computing
    Wu Hong-Qiang
    Li Xiao-Yong
    Fang Bin-Xing
    Wang Yi-Ping
    [J]. 2016 IEEE 22ND INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2016, : 128 - 132
  • [9] An Improved Information Gain Feature Selection Algorithm for SVM Text Classifier
    Xu, Jiamin
    Jiang, Hong
    [J]. 2015 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY, 2015, : 273 - 276
  • [10] Improved Design of Classification Algorithm in Cloud Computing and Big Data Environment
    Jiang, Yihuo
    [J]. ADVANCED HYBRID INFORMATION PROCESSING, ADHIP 2019, PT I, 2019, 301 : 161 - 170