Email Classification Research Trends: Review and Open Issues

被引:0
|
作者
Mujtaba, Ghulam [1 ,2 ]
Shuib, Liyana [1 ]
Raj, Ram Gopal [3 ]
Majeed, Nahdia [2 ]
Al-Garadi, Mohammed Ali [1 ]
机构
[1] Univ Malaya, Fac Comp Sci & Informat Technol, Dept Informat Syst, Kuala Lumpur 50603, Malaysia
[2] Sukkur Inst Business Adm, Dept Comp Sci, Sukkur 65200, Pakistan
[3] Univ Malaya, Fac Comp Sci & Informat Technol, Dept Artificial Intelligence, Kuala Lumpur 50603, Malaysia
来源
IEEE ACCESS | 2017年 / 5卷
关键词
Email classification; spam detection; phishing detection; multi-folder categorization; machine learning techniques; E-MAIL CLASSIFICATION; FEATURE-SELECTION; SPAM; ANALYZER; FEATURES; MODEL;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Personal and business users prefer to use e-mail as one of the crucial sources of communication. The usage and importance of e-mails continuously grow despite the prevalence of alternative means, such as electronic messages, mobile applications, and social networks. As the volume of business-critical e-mails continues to grow, the need to automate the management of e-mails increases for several reasons, such as spam e-mail classification, phishing e-mail classification, and multi-folder categorization, among others. This paper comprehensively reviews articles on e-mail classification published in 2006-2016 by exploiting the methodological decision analysis in five aspects, namely, e-mail classification application areas, data sets used in each application area, feature space utilized in each application area, e-mail classification techniques, and the use of performance measures. A total of 98 articles (56 articles from Web of Science core collection databases and 42 articles from Scopus database) are selected. To achieve the objective of the study, a comprehensive review and analysis is conducted to explore the various areas where e-mail classification was applied. Moreover, various public data sets, features sets, classification techniques, and performance measures are examined and used in each identified application area. This review identifies five application areas of e-mail classification. The most widely used data sets, features sets, classification techniques, and performance measures are found in the identified application areas. The extensive use of these popular data sets, features sets, classification techniques, and performance measures is discussed and justified. The research directions, research challenges, and open issues in the field of e-mail classification are also presented for future researchers.
引用
收藏
页码:9044 / 9064
页数:21
相关论文
共 50 条
  • [1] Clinical text classification research trends: Systematic literature review and open issues
    Mujtaba, Ghulam
    Shuib, Liyana
    Idris, Norisma
    Hoo, Wai Lam
    Raj, Ram Gopal
    Khowaja, Kamran
    Shaikh, Khairunisa
    Nweke, Henry Friday
    [J]. Expert Systems with Applications, 2019, 116 : 494 - 520
  • [2] Clinical text classification research trends: Systematic literature review and open issues
    Mujtaba, Ghulam
    Shuib, Liyana
    Idris, Norisma
    Hoo, Wai Lam
    Raj, Ram Gopal
    Khowaja, Kamran
    Shaikh, Khairunisa
    Nweke, Henry Friday
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2019, 116 : 494 - 520
  • [3] Recommendation research trends: Review, approaches and open issues
    Taneja A.
    Arora A.
    [J]. Taneja, Anu (anutaneja16@gmail.com), 2018, Inderscience Publishers, 29, route de Pre-Bois, Case Postale 856, CH-1215 Geneva 15, CH-1215, Switzerland (13) : 123 - 186
  • [4] Phishing Image Spam Classification Research Trends: Survey and Open Issues
    Abari, Ovye John
    Sani, Nor Fazlida Mohd
    Khalid, Fatimah
    Bin Sharum, Mohd Yunus
    Ariffin, Noor Afiza Mohd
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (11) : 794 - 805
  • [5] Satellite Communications: Research Trends and Open Issues
    Vanelli-Coralli, A.
    Corazza, G. E.
    Karagiannidis, G. K.
    Mathiopoulos, P. T.
    Michalopoulos, D. S.
    Mosquera, C.
    Papaharalabos, S.
    Scalise, S.
    [J]. 2007 INTERNATIONAL WORKSHOP ON SATELLITE AND SPACE COMMUNICATIONS, IWSSC '07, CONFERENCE PROCEEDINGS, 2007, : 71 - +
  • [6] A Comprehensive Review of Computer Vision in Sports: Open Issues, Future Trends and Research Directions
    Naik, Banoth Thulasya
    Hashmi, Mohammad Farukh
    Bokde, Neeraj Dhanraj
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (09):
  • [7] Trends and Open Research Issues in Intelligent Internet of Vehicles
    Kamble, Shridevi Jeevan
    Kounte, Manjunath R.
    [J]. TRANSPORT AND TELECOMMUNICATION JOURNAL, 2023, 24 (02) : 143 - 157
  • [8] Emerging Trends, Techniques and Open Issues of Containerization: A Review
    Watada, Junzo
    Roy, Arunava
    Kadikar, Ruturaj
    Pham, Hoang
    Xu, Bing
    [J]. IEEE ACCESS, 2019, 7 : 152443 - 152472
  • [9] Machine learning for email spam filtering: review, approaches and open research problems
    Dada, Emmanuel Gbenga
    Bassi, Joseph Stephen
    Chiroma, Haruna
    Abdulhamid, Shafi'i Muhammad
    Adetunmbi, Adebayo Olusola
    Ajibuwa, Opeyemi Emmanuel
    [J]. HELIYON, 2019, 5 (06)
  • [10] Issues and Trends in Causal Ambiguity Research: A Review and Assessment
    Konlechner, Stefan
    Ambrosini, Veronique
    [J]. JOURNAL OF MANAGEMENT, 2019, 45 (06) : 2352 - 2386