Data Analytics in Text Messages: A Mobile Network Operator Case Study

被引:0
|
作者
Babaee, Mohammadreza [1 ]
Sarabadani, Hamidreza [2 ]
机构
[1] Univ Stirling, Management Sch, Stirling, Scotland
[2] Univ Nottingham, Sch Comp Sci, Nottingham, England
关键词
Classification; Machine Learning Algorithms; Spam; Text messages; MNO; Big Data;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper explores the application of different data mining and machine learning algorithms to propose an effective technique to filter out spam SMSs. Due to high competitive nature of MNO business; filtering spam SMSs will have a great impact on the protection of business and profit making. This is mostly because subscribers refuse to use the services of MNOs that are not vigilant about spam SMSs. Based on the CRISP-DM method which is an open standard process model for data analytics projects, machine learning algorithms and data preparation methods have been conducted on a MNO unstructured dataset to transform characters, delete stop words, extract word stems, roots, N-Grams, and classification. Next, numerical Vector Space Models were created utilizing all four types of word vector creation methods. After producing test and train models with machine learning algorithms; accuracy and error rate, recall, precision and the area under curve for each classification algorithm has been measured. Finally, the Bagging algorithm by implementing Binary Term Occurrence vector space creation method showed the highest efficiency rate which can have the highest application in the big data ecosystem of the industry for spam filtering.
引用
收藏
页码:330 / 336
页数:7
相关论文
共 50 条
  • [1] Using Text Analytics on Operator Logbooks for Performance Benchmarking: A Case Study
    Dutta, Saptak
    Gunay, Burak
    Bucking, Scott
    [J]. ASHRAE TRANSACTIONS 2019, VOL 125, PT 2, 2019, 125 : 408 - 416
  • [2] Metamorphic Relations for Data Validation: A Case Study of Translated Text Messages
    Yan, Boyang
    Yecies, Brian
    Zhou, Zhi Quan
    [J]. 2019 IEEE/ACM 4TH INTERNATIONAL WORKSHOP ON METAMORPHIC TESTING (MET 2019), 2019, : 70 - 75
  • [3] Gender differences in social network development via mobile phone text messages: A longitudinal study
    Igarashi, T
    Takai, J
    Yoshida, T
    [J]. JOURNAL OF SOCIAL AND PERSONAL RELATIONSHIPS, 2005, 22 (05) : 691 - 713
  • [4] Text Big Data Analytics Case Study "Third Wave": Internet of Words
    Kolesnichenko, Olga
    Smorodin, Gennady
    Yakovleva, Dariya
    Mazelis, Lev
    Balandin, Sergey
    Kolesnichenko, Yuriy
    [J]. PROCEEDINGS OF THE 19TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION (FRUCT), 2016, : 88 - 98
  • [5] An Empirical Study on Text Analytics in Big Data
    Packiam, R. Merlin
    Prakash, V. Sinthu Janita
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2015, : 456 - 459
  • [6] Analyzing network availability of a mobile data network: A case study
    Malone, Bemard L., III
    Asthana, Abhaya
    [J]. BELL LABS TECHNICAL JOURNAL, 2006, 11 (03) : 47 - 56
  • [7] A Mobile Network Planning Tool Based on Data Analytics
    Moysen, Jessica
    Giupponi, Lorenza
    Mangues-Bafalluy, Josep
    [J]. MOBILE INFORMATION SYSTEMS, 2017, 2017
  • [8] Text analytics and data access as services - A case study in transforming a legacy client-server text analytics workbench and framework to SOA
    Maximilien, E. Michael
    Chen, Ying
    Lelescu, Ana
    Rhodes, James
    Kreulen, Jeffrey
    Spangler, Scott
    [J]. ICEIS 2007: PROCEEDINGS OF THE NINTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS: DATABASES AND INFORMATION SYSTEMS INTEGRATION, 2007, : 581 - 588
  • [9] Data Analytics: Understanding Human Behavior based on Mobile Network Data
    Franceschina, Luciano
    [J]. CCSW'16: PROCEEDINGS OF THE 2016 ACM CLOUD COMPUTING SECURITY WORKSHOP, 2016, : 1 - 1
  • [10] Agraphia in Mobile Text Messages in a Case of Amyotrophic Lateral Sclerosis with Frontotemporal Dementia
    Maeda, Kengo
    Shiraishi, Tomoyuki
    Idehara, Ryo
    [J]. INTERNAL MEDICINE, 2015, 54 (23) : 3065 - 3068