Topic Discovery and Topic-Driven Clustering for Audit Method Datasets

被引:0
|
作者
Zhao, Ying [1 ]
Fu, Wanyu [1 ]
Huang, Shaobin [2 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Harbin Engn Univ, Coll Comp Sci Technol, Harbin 150001, Peoples R China
基金
美国国家科学基金会;
关键词
topic-driven clustering; audit methods; topic discovery;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the promotion of China's Golden Auditing Project and the fast growth of on-line auditing, there are thousands of new computer audit methods emerged every year to fulfill various needs of audit practices. How to organize these existing computer audit methods and use them intelligently have become a fundamental and challenging problem. In this paper, we propose to use topic-driven clustering methods to organize computer audit methods according to the system of computer audit methods that is issued by the National Audit Office of China. We also apply Latent Dirichlet allocation (LDA) analysis to audit method datasets at different levels of granularity. Our experimental results on social insurance computer audit methods show that the topic-driven clustering scheme with topics created by domain experts is the overall best scheme. It achieved an average purity of 0.862 across the datasets. Topics discovered by LDA were consistent with classes defined in the taxonomy for four out of five datasets, and they were effective when used in the topic-driven clustering scheme.
引用
收藏
页码:346 / +
页数:3
相关论文
共 50 条
  • [21] Trending Topic Discovery of Twitter Tweets Using Clustering and Topic Modeling Algorithms
    Sapul, Ma. Shiela C.
    Aung, Than Htike
    Jiamthapthaksin, Rachsuda
    PROCEEDINGS OF 2017 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE), 2017,
  • [22] Self-Assesment tool with topic-driven navigation for algorithms learning
    Lopez-Ostenero, Fernando
    Plaza, Laura
    Araujo, Lourdes
    Martinez-Romo, Juan
    PROCEEDINGS OF THE 2022 IEEE GLOBAL ENGINEERING EDUCATION CONFERENCE (EDUCON 2022), 2022, : 356 - 363
  • [23] Domain-Oriented Topic Discovery Based on Features Extraction and Topic Clustering
    Lu, Xiaofeng
    Zhou, Xiao
    Wang, Wenting
    Lio, Pietro
    Hui, Pan
    IEEE ACCESS, 2020, 8 (08): : 93648 - 93662
  • [24] Topic-Constrained Hierarchical Clustering for Document Datasets
    Zhao, Ying
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2010, PT I, 2010, 6440 : 181 - 192
  • [25] TopicKS: Topic-driven Knowledge Selection for Knowledge-grounded Dialogue Generation
    Wang, Shiquan
    Si, Yuke
    Wei, Xiao
    Wang, Longbiao
    Zhuang, Zhiqiang
    Zhang, Xiaowang
    Dang, Jianwu
    INTERSPEECH 2022, 2022, : 1121 - 1125
  • [26] Topic-Driven and Knowledge-Aware Transformer for Dialogue Emotion Detection
    Zhu, Lixing
    Pergola, Gabriele
    Gui, Lin
    Zhou, Deyu
    He, Yulan
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 1571 - 1582
  • [27] Near real-time topic-driven rumor detection in source microblogs
    Xu, Fan
    Sheng, Victor S.
    Wang, Mingwen
    KNOWLEDGE-BASED SYSTEMS, 2020, 207 (207)
  • [28] Enhancing Online Discussion Forums with Topic-Driven Content Search and Assisted Posting
    Distante, Damiano
    Fernandez, Alejandro
    Cerulo, Luigi
    Visaggio, Aaron
    KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT, IC3K 2014, 2015, 553 : 161 - 180
  • [29] The Ambient Tag Cloud: A New Concept for Topic-Driven Mobile Urban Exploration
    Baldauf, Matthias
    Frohlich, Peter
    Reichl, Peter
    AMBIENT INTELLIGENCE, PROCEEDINGS, 2009, 5859 : 44 - 48
  • [30] Location-driven Geographical Topic Discovery
    Zhang, Li
    Sun, Xiaoping
    Zhuge, Hai
    2013 NINTH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRIDS (SKG), 2013, : 210 - 213