A Message Topic Model for Multi-Grain SMS Spam Filtering

Cited by: 7
Authors
Ma, Jialin [1, 2]
Zhang, Yongjun [1, 2]
Wang, Zhijian [2]
Yu, Kun [1]
Affiliations
[1] Huaiyin Inst Technol, Huaian, Peoples R China
[2] Hohai Univ, Coll Comp & Informat, Nanjing, Jiangsu, Peoples R China
Keywords
LDA; MTM; SMS Spam; SVM; Topic Model;
DOI
10.4018/IJTHI.2016040107
Chinese Library Classification (CLC) number
G25 [Library science, librarianship]; G35 [Information science, information work]
Discipline classification code
1205; 120501
Abstract
At present, content-based methods are regarded as the more effective approach to Short Message Service (SMS) spam filtering. However, they usually rely on traditional text classification techniques, which are better suited to normal long texts; as a result, they often face serious challenges, such as data sparsity and noise in SMS messages. In addition, existing SMS spam filtering methods usually treat the task as a binary classification problem, which cannot provide the different categories needed for multi-grain SMS spam filtering. In this paper, the authors propose a message topic model (MTM) for multi-grain SMS spam filtering. The MTM derives from the well-known probabilistic topic model and is improved in this paper to make it more suitable for SMS spam filtering. Finally, the authors compare the MTM with the SVM and the standard LDA on the public SMS spam corpus. The experimental results show that the MTM is more effective for the task of SMS spam filtering.
Pages: 83 - 95
Page count: 13