A Method of SMS Spam Filtering Based on AdaBoost Algorithm

被引:0
|
作者
Zhang, Xipeng [1 ]
Xiong, Gang [2 ]
Hu, Yuexiang [3 ]
Zhu, Fenghua [4 ]
Dong, Xisong [5 ]
Nyberg, Timo R. [6 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, CO, Peoples R China
[2] Chinese Acad Sci, Cloud Comp Ctr, Dongguan 523000, CO, Peoples R China
[3] Hainan Zhongke Flower Ocean Cloud Commerce Techno, Haikom 570311, Hainan, Peoples R China
[4] Chinese Acad Sci, Inst Automat, Beijing Engn Res Ctr Intelligent Syst & Teclmol, Beijing 100190, CO, Peoples R China
[5] Qingdao Acad Intelligent Ind, Qingdao 266061, CO, Peoples R China
[6] Aalto Univ, Dept Ind Engn & Management, FI-00076 Aalto, CO, Finland
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Short message is one of the most common communication media for mobile subscribers, so major mobile operators are devoted to improve their Short Message Service (SMS). However, the annoying and undesired messages, also named message spam or simply spam, not only worsen the users' experience, but also cause their complaints on SMS. In this paper, we present a novel Chinese SMS spam filtering framework based on AdaBoost algorithm to provide accurate and effective short messages classification. Three content-based weak filters are introduced to boost the performance of final classification decision. Results from Receiver Operating Characteristics (ROC) analysis prove the proposed method has such advantages as higher efficiency and fewer parameters over those established SMS spam filtering methods. The application of the proposed method is expected to block the most spam for mobile subscribers and improve the service quality of SMS. With simple data processing and few training parameters, the proposed method can be applied into the practice of short text classification.
引用
收藏
页码:2328 / 2332
页数:5
相关论文
共 50 条
  • [1] SMS Spam Filtering Based on "Cloud Security"
    Wu, Hongli
    Jiang, Yonghui
    [J]. INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY, PTS 1-4, 2013, 263-266 : 2015 - 2019
  • [2] Content-based SMS Spam Filtering based on the Scaled Conjugate Gradient Backpropagation Algorithm
    Waheeb, Waddah
    Ghazali, Rozaida
    Deris, Mustafa Mat
    [J]. 2015 12TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2015, : 675 - 680
  • [3] Word Embedding Method of SMS Messages for Spam Message Filtering
    Lee, Hyun-Young
    Kang, Seung-Shik
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2019, : 652 - 655
  • [4] SMS spam filtering: Methods and data
    Delany, Sarah Jane
    Buckley, Mark
    Greene, Derek
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (10) : 9899 - 9908
  • [5] SMS Spam Filtering based on Text Classification and Expert System
    Bozan, Yavuz Selim
    Coban, Onder
    Ozyer, Gulsah Tumuklu
    Ozyer, Baris
    [J]. 2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 2345 - 2348
  • [6] Content-based Approach for Vietnamese Spam SMS Filtering
    Pham, Thai-Hoang
    Le-Hong, Phuong
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2016, : 41 - 44
  • [7] A Review on Mobile SMS Spam Filtering Techniques
    Abdulhamid, Shafi'I Muhammad
    Abd Latiff, Muhammad Shafie
    Chiroma, Haruna
    Osho, Oluwafemi
    Abdul-Salaam, Gaddafi
    Abubakar, Adamu I.
    Herawan, Tutut
    [J]. IEEE ACCESS, 2017, 5 : 15650 - 15666
  • [8] The Evaluation of Ordered Features for SMS Spam Filtering
    Bande Serrano, Jose M.
    Hernandez Palancar, Jose
    Cumplido, Rene
    [J]. PROGRESS IN PATTERN RECOGNITION IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2014, 2014, 8827 : 383 - 390
  • [9] Thai-English Spam SMS Filtering
    Khemapatapan, Chaiyaporn
    [J]. 2010 16TH ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS (APCC 2010), 2010, : 226 - 230
  • [10] Index-based Online Text Classification for SMS Spam Filtering
    Liu, Wuying
    Wang, Ting
    [J]. JOURNAL OF COMPUTERS, 2010, 5 (06) : 844 - 851