Towards a Foundation Purchasing Model: Pretrained Generative Autoregression on Transaction Sequences

被引:0
|
作者
Skalski, Piotr [1 ]
Sutton, David [1 ]
Burrell, Stuart [1 ]
Perez, Iker [1 ]
Wong, Jason [1 ]
机构
[1] Featurespace, Innovat Lab, Cambridge, England
关键词
transaction embeddings; self-supervised learning; generative modelling; multivariate time series; fraud detection;
D O I
10.1145/3604237.3626850
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
Machine learning models underpin many modern financial systems for use cases such as fraud detection and churn prediction. Most are based on supervised learning with hand-engineered features, which relies heavily on the availability of labelled data. Large self-supervised generative models have shown tremendous success in natural language processing and computer vision, yet so far they haven't been adapted to multivariate time series of financial transactions. In this paper, we present a generative pretraining method that can be used to obtain contextualised embeddings of financial transactions. Benchmarks on public datasets demonstrate that it outperforms state-of-the-art self-supervised methods on a range of downstream tasks. We additionally perform large-scale pretraining of an embedding model using a corpus of data from 180 issuing banks containing 5.1 billion transactions and apply it to the card fraud detection problem on hold-out datasets. The embedding model significantly improves value detection rate at high precision thresholds and transfers well to out-of-domain distributions.
引用
收藏
页码:141 / 149
页数:9
相关论文
共 50 条
  • [31] Accelerating drug target inhibitor discovery with a deep generative foundation model
    Chenthamarakshan, Vijil
    Hoffman, Samuel C.
    Owen, C. David
    Lukacik, Petra
    Strain-Damerell, Claire
    Fearon, Daren
    Malla, Tika R.
    Tumber, Anthony
    Schofield, Christopher J.
    Duyvesteyn, Helen M. E.
    Dejnirattisai, Wanwisa
    Carrique, Loic
    Walter, Thomas S.
    Screaton, Gavin R.
    Matviiuk, Tetiana
    Mojsilovic, Aleksandra
    Crain, Jason
    Walsh, Martin A.
    Stuart, David I.
    Das, Payel
    SCIENCE ADVANCES, 2023, 9 (25):
  • [32] Towards a Neural-Symbolic Generative Policy Model
    Cunnington, Daniel
    Law, Mark
    Russo, Alessandra
    Bertino, Elisa
    Calo, Seraphin
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 4008 - 4016
  • [33] Operationalizing and Implementing Pretrained, Large Artificial Intelligence Linguistic Models in the US Health Care System: Outlook of Generative Pretrained Transformer 3 (GPT-3) as a Service Model
    Sezgin, Emre
    Sirrianni, Joseph
    Linwood, Simon L.
    JMIR MEDICAL INFORMATICS, 2022, 10 (02)
  • [34] Towards Deriving Test Sequences by Model Checking
    Bonifacio, Adilson Luiz
    Moura, Arnaldo Vieira
    Simao, Adenilso da Silva
    Maldonado, Jose Carlos
    ELECTRONIC NOTES IN THEORETICAL COMPUTER SCIENCE, 2008, 195 (21-40) : 21 - 40
  • [35] Towards Context-Aware Ubiquitous Transaction Processing: a Model and Algorithm
    Tang, Feilong
    Guo, Song
    Guo, Minyi
    Li, Minglu
    Wang, Cho-li
    2011 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2011,
  • [36] Joint Foundation Model Caching and Inference of Generative AI Services for Edge Intelligence
    Xu, Minrui
    Niyato, Dusit
    Zhang, Hongliang
    Kang, Jiawen
    Xiong, Zehui
    Mao, Shiwen
    Han, Zhu
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 3548 - 3553
  • [37] Towards an astronomical foundation model for stars with a transformer-based model
    Leung, Henry W.
    Bovy, Jo
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2024, 527 (01) : 1494 - 1520
  • [38] Towards a Cognitive Agent-Based Model for Air Conditioners Purchasing Prediction
    Mogles, Nataliya
    Ramallo-Gonzalez, Alfonso P.
    Gabe-Thomas, Elizabeth
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2015 COMPUTATIONAL SCIENCE AT THE GATES OF NATURE, 2015, 51 : 463 - 472
  • [39] Factors Influencing Consumers' Behaviour towards Purchasing Organic Foods: A Theoretical Model
    Yilmaz, Birsen
    SUSTAINABILITY, 2023, 15 (20)
  • [40] Discovering protein-binding RNA motifs with a generative model of RNA sequences
    Park, Byungkyu
    Han, Kyungsook
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2020, 84