Visual topic models for healthcare data clustering

被引:19
|
作者
Prasad, K. Rajendra [1 ]
Mohammed, Moulana [2 ]
Noorullah, R. M. [1 ,2 ]
机构
[1] Inst Aeronaut Engn, Dept Comp Sci & Engn, Hyderabad 500043, Telangana, India
[2] Koneru Lakshmaiah Educ Fdn, Dept Comp Sci & Engn, Guntur 522502, Andhra Pradesh, India
关键词
Visual topic model; Social data; Visual clustering; Cosine based metric; Health tendency;
D O I
10.1007/s12065-019-00300-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social media is a great source to search health-related topics for envisages solutions towards healthcare. Topic models originated from Natural Language Processing that is receiving much attention in healthcare areas because of interpretability and its decision making, which motivated us to develop visual topic models. Topic models are used for the extraction of health topics for analyzing discriminative and coherent latent features of tweet documents in healthcare applications. Discovering the number of topics in topic models is an important issue. Sometimes, users enable an incorrect number of topics in traditional topic models, which leads to poor results in health data clustering. In such cases, proper visualizations are essential to extract information for identifying cluster trends. To aid in the visualization of topic clouds and health tendencies in the document collection, we present hybrid topic modeling techniques by integrating traditional topic models with visualization procedures. We believe proposed visual topic models viz., Visual Non-Negative Matrix Factorization (VNMF), Visual Latent Dirichlet Allocation (VLDA), Visual intJNon-negative Matrix Factorization (VintJNMF), and Visual Probabilistic Latent Schematic Indexing (VPLSI) are promising methods for extracting tendency of health topics from various sources in healthcare data clustering. Standard and benchmark social health datasets are used in an experimental study to demonstrate the efficiency of proposed models concerning clustering accuracy (CA), Normalized Mutual Information (NMI), precision (P), recall (R), F-Score (F) measures and computational complexities. VNMF visual model performs significantly at an increased rate of 32.4% under cosine based metric in the display of visual clusters and an increased rate of 35-40% in performance measures compared to other visual methods on different number of health topics.
引用
收藏
页码:545 / 562
页数:18
相关论文
共 50 条
  • [31] Graph-based topic models for trajectory clustering in crowd videos
    Al Ghamdi, Manal
    Gotoh, Yoshihiko
    MACHINE VISION AND APPLICATIONS, 2020, 31 (05)
  • [32] Graph-based topic models for trajectory clustering in crowd videos
    Manal Al Ghamdi
    Yoshihiko Gotoh
    Machine Vision and Applications, 2020, 31
  • [33] ONLINE TIME-DEPENDENT CLUSTERING USING PROBABILISTIC TOPIC MODELS
    Renard, Benjamin
    Kharratzadeh, Milad
    Coates, Mark
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 2036 - 2040
  • [34] Decomposition, discovery and detection of visual categories using topic models
    Fritz, Mario
    Schiele, Bernt
    2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 3583 - 3590
  • [35] Visual Assessment of Clustering Tendency for Incomplete Data
    Park, Laurence A. F.
    Bezdek, James C.
    Leckie, Christopher
    Kotagiri, Ramamohanarao
    Bailey, James
    Palaniswami, Marimuthu
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (12) : 3409 - 3422
  • [36] Simultaneous Modelling and Clustering of Visual Field Data
    Bin Jilani, Mohd Zairul Mazwan
    Tucker, Allan
    Swift, Stephen
    2016 IEEE 29TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2016, : 213 - 218
  • [37] Clustering Models for Data Stream Mining
    Mythily, R.
    Banu, Aisha
    Raghunathan, Shriram
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES, ICICT 2014, 2015, 46 : 619 - 626
  • [38] Analysis of Clustering Algorithms in Machine Learning for Healthcare Data
    Zhang J.
    Zhong H.
    Journal of Commercial Biotechnology, 2022, 27 (05) : 82 - 91
  • [39] Optimised Clustering Based Approach for Healthcare Data Analytics
    Bhopale, Amol P.
    Zanwar, Sanskar
    Balpande, Aarya
    Kazi, Jaweria
    INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2023, 14 (01): : 298 - 305
  • [40] Stochastic topic models for large scale and nonstationary data
    Ihou, Koffi Eddy
    Bouguila, Nizar
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 88 (88)