Using deep Q-learning to understand the tax evasion behavior of risk-averse firms

被引:14
|
作者
Goumagias, Nikolaos D. [1 ]
Hristu-Varsakelis, Dimitrios [2 ]
Assael, Yannis M. [3 ]
机构
[1] Northumbria Univ, Newcastle Business Sch, Cent Campus East 1, Newcastle Upon Tyne NE1 8ST, Tyne & Wear, England
[2] Univ Macedonia, Dept Appl Informat, Egnatia 156, Thessaloniki 54006, Greece
[3] Univ Oxford, Dept Comp Sci, Wolfson Bldg,Parks Rd, Oxford OX1 3QD, England
关键词
Markov decision processes; Tax evasion; Q-learning; Deep learning; NEURAL-NETWORKS; AMNESTIES;
D O I
10.1016/j.eswa.2018.01.039
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Designing tax policies that are effective in curbing tax evasion and maximize state revenues requires a rigorous understanding of taxpayer behavior. This work explores the problem of determining the strategy a self-interested, risk-averse tax entity is expected to follow, as it "navigates" - in the context of a Markov Decision Process - a government-controlled tax environment that includes random audits, penalties and occasional tax amnesties. Although simplified versions of this problem have been previously explored, the mere assumption of risk-aversion (as opposed to risk-neutrality) raises the complexity of finding the optimal policy well beyond the reach of analytical techniques. Here, we obtain approximate solutions via a combination of Q-learning and recent advances in Deep Reinforcement Learning. By doing so, we (i) determine the tax evasion behavior expected of the taxpayer entity, (ii) calculate the degree of risk aversion of the "average" entity given empirical estimates of tax evasion, and (iii) evaluate sample tax policies, in terms of expected revenues. Our model can be useful as a testbed for "in-vitro" testing of tax policies, while our results lead to various policy recommendations. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:258 / 270
页数:13
相关论文
共 50 条
  • [1] Tax evasion by risk-averse firms in Greece: a discrete Markov-based optimization model
    Goumagias, Nikolaos
    Hristu-Varsakelis, Dimitris
    [J]. OPTIMIZATION, 2013, 62 (08) : 1153 - 1167
  • [2] Automating Vehicles by Risk-Averse Preview-based Q-Learning Algorithm
    Mazouchi, Majid
    Nageshrao, Subramanya
    Modares, Hamidreza
    [J]. IFAC PAPERSONLINE, 2022, 55 (15): : 105 - 110
  • [3] A One-shot Convex Optimization Approach to Risk-Averse Q-Learning
    Han, Yuzhen
    Mazouchi, Majid
    Nageshrao, Subramanya
    Modares, Hamidreza
    [J]. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 102 - 107
  • [4] Transmission of Risk-Averse Behavior in Small Firms
    Dennis Patrick Leyden
    Albert N. Link
    [J]. Small Business Economics, 2004, 23 : 255 - 259
  • [5] Transmission of risk-averse behavior in small firms
    Leyden, DP
    Link, AN
    [J]. SMALL BUSINESS ECONOMICS, 2004, 23 (03) : 255 - 259
  • [6] Comparison of the production behavior of regret-averse and purely risk-averse firms
    Guo, Xu
    Wong, Wing-Keung
    [J]. ESTUDIOS DE ECONOMIA, 2019, 46 (02): : 157 - 171
  • [7] Towards Risk-Averse Edge Computing With Deep Reinforcement Learning
    Xu, Dianlei
    Su, Xiang
    Wang, Huandong
    Tarkoma, Sasu
    Hui, Pan
    [J]. IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (06) : 7030 - 7047
  • [8] Dynamic history-dependent tax and environmental compliance monitoring of risk-averse firms
    Noam Goldberg
    Isaac Meilijson
    Yael Perlman
    [J]. Annals of Operations Research, 2024, 334 : 469 - 495
  • [9] Dynamic history-dependent tax and environmental compliance monitoring of risk-averse firms
    Goldberg, Noam
    Meilijson, Isaac
    Perlman, Yael
    [J]. ANNALS OF OPERATIONS RESEARCH, 2024, 334 (1-3) : 469 - 495
  • [10] NONCOMPLIANCE BEHAVIOR OF RISK-AVERSE FIRMS UNDER THE MINIMUM-WAGE LAW
    CHANG, YM
    [J]. PUBLIC FINANCE QUARTERLY, 1992, 20 (03): : 390 - 401