Topic Discovery and Topic-Driven Clustering for Audit Method Datasets

被引:0
|
作者
Zhao, Ying [1 ]
Fu, Wanyu [1 ]
Huang, Shaobin [2 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Harbin Engn Univ, Coll Comp Sci Technol, Harbin 150001, Peoples R China
来源
ADVANCED DATA MINING AND APPLICATIONS, PT II | 2011年 / 7121卷
基金
美国国家科学基金会;
关键词
topic-driven clustering; audit methods; topic discovery;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the promotion of China's Golden Auditing Project and the fast growth of on-line auditing, there are thousands of new computer audit methods emerged every year to fulfill various needs of audit practices. How to organize these existing computer audit methods and use them intelligently have become a fundamental and challenging problem. In this paper, we propose to use topic-driven clustering methods to organize computer audit methods according to the system of computer audit methods that is issued by the National Audit Office of China. We also apply Latent Dirichlet allocation (LDA) analysis to audit method datasets at different levels of granularity. Our experimental results on social insurance computer audit methods show that the topic-driven clustering scheme with topics created by domain experts is the overall best scheme. It achieved an average purity of 0.862 across the datasets. Topics discovered by LDA were consistent with classes defined in the taxonomy for four out of five datasets, and they were effective when used in the topic-driven clustering scheme.
引用
收藏
页码:346 / +
页数:3
相关论文
共 50 条
  • [41] Clustering Web logs on the Basis of a Topic Detection Method
    Perez-Tellez, Fernando
    Pinto, David
    Cardiff, John
    Rosso, Paolo
    ADVANCES IN PATTERN RECOGNITION, 2010, 6256 : 342 - +
  • [42] Leveraging Formal Concept Analysis with Topic Correlation for Service Clustering and Discovery
    Aznag, Mustapha
    Quafafou, Mohamed
    Jarir, Zahi
    2014 IEEE 21ST INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS 2014), 2014, : 153 - 160
  • [43] An ensemble clustering approach for topic discovery using implicit text segmentation
    Memon, Muhammad Qasim
    Lu, Yu
    Chen, Penghe
    Memon, Aasma
    Pathan, Muhammad Salman
    Zardari, Zulfiqar Ali
    JOURNAL OF INFORMATION SCIENCE, 2021, 47 (04) : 431 - 457
  • [44] Topic discovery by spectral decomposition and clustering with coordinated global and local contexts
    Jian Wang
    Kejing He
    Min Yang
    International Journal of Machine Learning and Cybernetics, 2020, 11 : 2475 - 2487
  • [45] A Short Text Topic Discovery Method for Social Network
    Liu Jia
    Wang Qinglin
    Liu Yu
    Li Yuan
    2014 33RD CHINESE CONTROL CONFERENCE (CCC), 2014, : 512 - 516
  • [46] Topic discovery by spectral decomposition and clustering with coordinated global and local contexts
    Wang, Jian
    He, Kejing
    Yang, Min
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (11) : 2475 - 2487
  • [47] Image paragraph captioning with topic clustering and topic shift prediction
    Tang, Ting
    Chen, Jiansheng
    Huang, Yiqing
    Ma, Huimin
    Zhang, Yudong
    Yu, Hongwei
    KNOWLEDGE-BASED SYSTEMS, 2024, 286
  • [48] A Sparse Topic Model for Bursty Topic Discovery in Social Networks
    Shi, Lei
    Du, Junping
    Kou, Feifei
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2020, 17 (05) : 816 - 824
  • [49] Topic-Driven SocialRank: Personalized search result ranking by identifying similar, credible users in a social network
    Kim, Young An
    Park, Gun Woo
    KNOWLEDGE-BASED SYSTEMS, 2013, 54 : 230 - 242
  • [50] Language Model-Driven Topic Clustering and Summarization for News Articles
    Yang, Peng
    Li, Wenhan
    Zhao, Guangzhen
    IEEE ACCESS, 2019, 7 : 185506 - 185519