TTED-PU:A Transferable Tax Evasion Detection Method based on Positive and Unlabeled Learning

被引:6
|
作者
Zhang, Fa [1 ]
Shi, Bin [1 ]
Dong, Bo [2 ]
Zheng, Qinghua [1 ]
Ji, Xiangting [3 ]
机构
[1] Xi An Jiao Tong Univ, SPKLSTN Lab, Sch Comp Sci & Technol, Xian, Peoples R China
[2] Xi An Jiao Tong Univ, Natl Engn Lab Big Data Analyt, Sch Distance Educ, Xian, Peoples R China
[3] Baidu Inc, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
tax evasion; transfer learning; positive and unlabeled learning;
D O I
10.1109/COMPSAC48688.2020.00036
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Tax evasion usually refers to taxpayers making false declarations in order to reduce their tax obligations. One of the most common types of tax evasion is to lower the declared taxable amount. This kind of behavior will lead to the loss of tax revenues and damage the fairness of taxation. One of the main roles of the tax authorities is to conduct tax evasion testing through efficient auditing methods. At present, by using machine learning technology along with large amounts of labeled data, tax evasion detection models have achieved good results in specific areas. However, it is a long and costly process for tax experts to label large amounts of data. Since, the data distribution characteristics vary from region to region, models cannot be used across regions. In this paper, we propose a new method called a transferable tax evasion detection method based on positive and unlabeled learning (TTED-PU), which uses only semi-supervised techniques to detect tax evasion in the source domain. In addition, we use the idea of transfer to adapt to the domain to predict tax evasion behavior on the target domain where labeled tax data are unavailable. We evaluate our method on real-world tax data set. The experimental results show that our model can detect tax evasion in both the source and target domains.
引用
收藏
页码:207 / 216
页数:10
相关论文
共 50 条
  • [41] When Less is Enough: Positive and Unlabeled Learning Model for Vulnerability Detection
    Wen, Xin-Cheng
    Wang, Xinchen
    Gao, Cuiyun
    Wang, Shaohua
    Liu, Yang
    Gu, Zhaoquan
    [J]. 2023 38TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE, 2023, : 345 - 357
  • [42] Boosting Positive and Unlabeled Learning for Anomaly Detection With Multi-Features
    Zhang, Jiaqi
    Wang, Zhenzhen
    Meng, Jingjing
    Tan, Yap-Peng
    Yuan, Junsong
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (05) : 1332 - 1344
  • [43] PU-Shapelets: Towards Pattern-Based Positive Unlabeled Classification of Time Series
    Liang, Shen
    Zhang, Yanchun
    Ma, Jiangang
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2019), PT I, 2019, 11446 : 87 - 103
  • [44] Unsupervised Body Hair Detection by Positive-Unlabeled Learning in Photoacoustic Image
    Kikkawa, Ryo
    Kajita, Hiroki
    Imanishi, Nobuaki
    Aiso, Sadakazu
    Bise, Ryoma
    [J]. 2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 3349 - 3352
  • [45] A Novel Classification Method: Neighborhood-Based Positive Unlabeled Learning Using Decision Tree (NPULUD)
    Ghasemkhani, Bita
    Balbal, Kadriye Filiz
    Birant, Kokten Ulas
    Birant, Derya
    [J]. ENTROPY, 2024, 26 (05)
  • [46] Positive-Unlabeled Learning for Cell Detection in Histopathology Images with Incomplete Annotations
    Zhao, Zipei
    Pang, Fengqian
    Liu, Zhiwen
    Ye, Chuyang
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT VIII, 2021, 12908 : 509 - 518
  • [47] Learning from Positive and Unlabeled Multi-Instance Bags in Anomaly Detection
    Perini, Lorenzo
    Vercruyssen, Vincent
    Davis, Jesse
    [J]. PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 1897 - 1906
  • [48] LEARNING A TRANSFERABLE CHANGE DETECTION METHOD BY RECURRENT NEURAL NETWORK
    Lyu, Haobo
    Lu, Hui
    [J]. 2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 5157 - 5160
  • [49] Tax evasion risk management using a Hybrid Unsupervised Outlier Detection method
    Savic, Milos
    Atanasijevic, Jasna
    Jakovetic, Dusan
    Krejic, Natasa
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 193
  • [50] PLUS: Predicting cancer metastasis potential based on positive and unlabeled learning
    Zhou, Junyi
    Lu, Xiaoyu
    Chang, Wennan
    Wan, Changlin
    Lu, Xiongbin
    Zhang, Chi
    Cao, Sha
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (03)