Multi-class twitter data categorization and geocoding with a novel computing framework

被引:10
|
作者
Khan, Sakib Mahmud [1 ]
Chowdhury, Mashrur [1 ]
Ngo, Linh B. [2 ]
Apon, Amy [3 ]
机构
[1] Clemson Univ, Glenn Dept Civil Engn, Clemson, SC 29634 USA
[2] West Chester Univ, Comp Sci Dept, W Chester, PA 19383 USA
[3] Clemson Univ, Sch Comp, Clemson, SC 29634 USA
关键词
Social media; New York; Traffic operation; Short-term planning; Machine learning; Traffic management policy;
D O I
10.1016/j.cities.2019.102410
中图分类号
TU98 [区域规划、城乡规划];
学科分类号
0814 ; 082803 ; 0833 ;
摘要
This study details the progress in transportation data analysis with a novel computing framework in keeping with the continuous evolution of the computing technology. The computing framework combines the Labeled Latent Dirichlet Allocation (L-LDA)-incorporated Support Vector Machine (SVM) classifier with the supporting computing strategy on publicly available Twitter data in determining transportation-related events to provide reliable information to travelers. The analytical approach includes analyzing tweets using text classification and geocoding locations based on string similarity. A case study conducted for the New York City and its surrounding areas demonstrates the feasibility of the analytical approach. Approximately 700,010 tweets are analyzed to extract relevant transportation-related information for one week. The SVM classifier achieves > 85% accuracy in identifying transportation-related tweets from structured data. To further categorize the transportation-related tweets into sub-classes: incident, congestion, construction, special events, and other events, three supervised classifiers are used: L-LDA, SVM, and L-LDA incorporated SVM. Findings from this study demonstrate that the analytical framework, which uses the L-LDA incorporated SVM, can classify roadway transportation-related data from Twitter with over 98.3% accuracy, which is significantly higher than the accuracies achieved by standalone L-LDA and SVM.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] On Taxonomies for Multi-class Image Categorization
    Binder, Alexander
    Mueller, Klaus-Robert
    Kawanabe, Motoaki
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 99 (03) : 281 - 301
  • [2] On Taxonomies for Multi-class Image Categorization
    Alexander Binder
    Klaus-Robert Müller
    Motoaki Kawanabe
    [J]. International Journal of Computer Vision, 2012, 99 : 281 - 301
  • [3] A Dynamic Sampling Framework for Multi-Class Imbalanced Data
    Debowski, B.
    Areibi, S.
    Grewal, G.
    Tempelman, J.
    [J]. 2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 2, 2012, : 113 - 118
  • [4] Multi-class Twitter sentiment classification with emojis
    Li, Mengdi
    Ch'ng, Eugene
    Chong, Alain Yee Loong
    See, Simon
    [J]. INDUSTRIAL MANAGEMENT & DATA SYSTEMS, 2018, 118 (09) : 1804 - 1820
  • [5] Sentiment Classification from Multi-class Imbalanced Twitter Data Using Binarization
    Krawczyk, Bartosz
    McInnes, Bridget T.
    Cano, Alberto
    [J]. HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, HAIS 2017, 2017, 10334 : 26 - 37
  • [6] A New Multi-Class Rebalancing Framework for Imbalance Medical Data
    Edward, Jafhate
    Rosli, Marshima Mohd
    Seman, Ali
    [J]. IEEE ACCESS, 2023, 11 : 92857 - 92874
  • [7] A Partial Labeling Framework for Multi-Class Imbalanced Streaming Data
    Arabmakki, Elaheh
    Kantardzic, Mehmed
    Sethi, Tegjyot Singh
    [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1018 - 1025
  • [8] A Unified Multi-Class Feature Selection Framework for Microarray Data
    Ding, Xiaojian
    Yang, Fan
    Ma, Fumin
    Chen, Shilin
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (06) : 3725 - 3736
  • [9] Experiments in multi-class e-mail categorization
    Berger, Helmut
    Merkl, Dieter
    Dittenbach, Michael
    [J]. DATABASES AND INFORMATION SYSTEMS: COMMUNICATIONS, MATERIALS OF DOCTORAL CONSORTIUM, 2006, : 79 - 90
  • [10] BOOSTING KERNEL COMBINATION FOR MULTI-CLASS IMAGE CATEGORIZATION
    Lechervy, Alexis
    Gosselin, Philippe-Henri
    Precioso, Frederic
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 1893 - 1896