TTED-PU: A Transferable Tax Evasion Detection Method based on Positive and Unlabeled Learning

Cited by: 6
Authors
Zhang, Fa [1]
Shi, Bin [1]
Dong, Bo [2]
Zheng, Qinghua [1]
Ji, Xiangting [3]
Affiliations
[1] Xi An Jiao Tong Univ, SPKLSTN Lab, Sch Comp Sci & Technol, Xian, Peoples R China
[2] Xi An Jiao Tong Univ, Natl Engn Lab Big Data Analyt, Sch Distance Educ, Xian, Peoples R China
[3] Baidu Inc, Beijing, Peoples R China
Funding
National Science Foundation (United States);
Keywords
tax evasion; transfer learning; positive and unlabeled learning;
DOI
10.1109/COMPSAC48688.2020.00036
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Tax evasion usually refers to taxpayers making false declarations in order to reduce their tax obligations. One of the most common types of tax evasion is understating the declared taxable amount. This behavior leads to the loss of tax revenue and damages the fairness of taxation. One of the main roles of tax authorities is to detect tax evasion through efficient auditing methods. At present, by using machine learning along with large amounts of labeled data, tax evasion detection models have achieved good results in specific areas. However, labeling large amounts of data is a long and costly process for tax experts. Moreover, because data distribution characteristics vary from region to region, models trained in one region cannot be used directly in another. In this paper, we propose a new method, a transferable tax evasion detection method based on positive and unlabeled learning (TTED-PU), which uses only semi-supervised techniques to detect tax evasion in the source domain. In addition, we apply the idea of transfer learning to adapt across domains and predict tax evasion behavior in the target domain, where labeled tax data are unavailable. We evaluate our method on a real-world tax data set. The experimental results show that our model can detect tax evasion in both the source and target domains.
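The abstract does not specify the algorithm, so the following is only a minimal sketch of the general idea of positive and unlabeled learning followed by transfer to an unlabeled target domain. It is not the TTED-PU method. It assumes a generic two-step PU heuristic (unlabeled points far from the positive centroid are treated as reliable negatives) and a crude stand-in for domain adaptation (z-scoring each domain with its own statistics). All names (`fit_pu`, `predict`, `standardize`, `make_domain`) and the synthetic data are hypothetical.

```python
import math
import random


def centroid(rows):
    """Mean vector of a non-empty list of equal-length feature vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def standardize(rows):
    """Z-score each feature using this domain's own mean and std.

    Assumption: per-domain standardization is used here as a very crude
    substitute for a real domain adaptation step.
    """
    mu = centroid(rows)
    sd = [
        math.sqrt(sum((r[j] - mu[j]) ** 2 for r in rows) / len(rows)) or 1.0
        for j in range(len(mu))
    ]
    return [[(r[j] - mu[j]) / sd[j] for j in range(len(mu))] for r in rows]


def fit_pu(positives, unlabeled, neg_fraction=0.3):
    """Two-step PU heuristic: pick reliable negatives, then fit centroids."""
    pos_c = centroid(positives)
    # Step 1: the unlabeled points farthest from the positive centroid are
    # treated as reliable negatives (a heuristic, not a guarantee).
    ranked = sorted(unlabeled, key=lambda r: math.dist(r, pos_c), reverse=True)
    k = max(1, int(neg_fraction * len(ranked)))
    # Step 2: summarize the reliable negatives by their centroid.
    neg_c = centroid(ranked[:k])
    return pos_c, neg_c


def predict(model, row):
    """Return 1 (suspected evasion) if row is closer to the positive centroid."""
    pos_c, neg_c = model
    return 1 if math.dist(row, pos_c) < math.dist(row, neg_c) else 0


def make_domain(rng, n, scale, shift):
    """Synthetic 2-D domain: alternating classes, domain-specific scale/shift."""
    rows, labels = [], []
    for i in range(n):
        y = i % 2
        center = 2.0 if y else -2.0
        rows.append([scale * rng.gauss(center, 0.5) + shift for _ in range(2)])
        labels.append(y)
    return rows, labels


rng = random.Random(0)
src_x, src_y = make_domain(rng, 200, scale=1.0, shift=0.0)   # source region
tgt_x, tgt_y = make_domain(rng, 200, scale=3.0, shift=10.0)  # shifted target region

src_z, tgt_z = standardize(src_x), standardize(tgt_x)

# Only 30 source positives carry labels; everything else is unlabeled.
pos_idx = [i for i, y in enumerate(src_y) if y == 1][:30]
labeled = set(pos_idx)
positives = [src_z[i] for i in pos_idx]
unlabeled = [src_z[i] for i in range(len(src_z)) if i not in labeled]

model = fit_pu(positives, unlabeled)

# Target labels are used only to score the sketch, never for training.
tgt_pred = [predict(model, r) for r in tgt_z]
accuracy = sum(p == y for p, y in zip(tgt_pred, tgt_y)) / len(tgt_y)
print(f"target-domain accuracy: {accuracy:.2f}")
```

On this toy data the two classes are well separated, so the heuristic works almost by construction. Real tax data would need a proper classifier and a principled adaptation step in place of the centroid rule and per-domain z-scoring.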
Pages: 207 - 216
Number of pages: 10