Towards a Foundation Purchasing Model: Pretrained Generative Autoregression on Transaction Sequences

被引:0
|
作者
Skalski, Piotr [1 ]
Sutton, David [1 ]
Burrell, Stuart [1 ]
Perez, Iker [1 ]
Wong, Jason [1 ]
机构
[1] Featurespace, Innovat Lab, Cambridge, England
关键词
transaction embeddings; self-supervised learning; generative modelling; multivariate time series; fraud detection;
D O I
10.1145/3604237.3626850
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
Machine learning models underpin many modern financial systems for use cases such as fraud detection and churn prediction. Most are based on supervised learning with hand-engineered features, which relies heavily on the availability of labelled data. Large self-supervised generative models have shown tremendous success in natural language processing and computer vision, yet so far they haven't been adapted to multivariate time series of financial transactions. In this paper, we present a generative pretraining method that can be used to obtain contextualised embeddings of financial transactions. Benchmarks on public datasets demonstrate that it outperforms state-of-the-art self-supervised methods on a range of downstream tasks. We additionally perform large-scale pretraining of an embedding model using a corpus of data from 180 issuing banks containing 5.1 billion transactions and apply it to the card fraud detection problem on hold-out datasets. The embedding model significantly improves value detection rate at high precision thresholds and transfers well to out-of-domain distributions.
引用
收藏
页码:141 / 149
页数:9
相关论文
共 50 条
  • [1] Banksformer: A Deep Generative Model for Synthetic Transaction Sequences
    Nickerson, Kyle
    Tricco, Terrence
    Kolokolova, Antonina
    Shoeleh, Farzaneh
    Robertson, Charles
    Hawkin, John
    Hu, Ting
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT VI, 2023, 13718 : 121 - 136
  • [2] Learning spatiotemporal dynamics with a pretrained generative model
    Li, Zeyu
    Han, Wang
    Zhang, Yue
    Fu, Qingfei
    Li, Jingxuan
    Qin, Lizi
    Dong, Ruoyu
    Sun, Hao
    Deng, Yue
    Yang, Lijun
    NATURE MACHINE INTELLIGENCE, 2024, 6 (12) : 1566 - 1579
  • [3] Semantic-Electromagnetic Inversion With Pretrained Multimodal Generative Model
    Chen, Yanjin
    Zhang, Hongrui
    Ma, Jie
    Cui, Tie Jun
    del Hougne, Philipp
    Li, Lianlin
    ADVANCED SCIENCE, 2024, 11 (42)
  • [4] A high throughput generative vector autoregression model for stochastic synapses
    Hennen, Tyler
    Elias, Alexander
    Nodin, Jean-Francois
    Molas, Gabriel
    Waser, Rainer
    Wouters, Dirk J. J.
    Bedau, Daniel
    FRONTIERS IN NEUROSCIENCE, 2022, 16
  • [5] Towards Responsible Generative AI: A Reference Architecture for Designing Foundation Model based Agents
    Lu, Qinghua
    Zhu, Liming
    Xu, Xiwei
    Xing, Zhenchang
    Harrer, Stefan
    Whittle, Jon
    IEEE 21ST INTERNATIONAL CONFERENCE ON SOFTWARE ARCHITECTURE COMPANION, ICSA-C 2024, 2024, : 119 - 126
  • [6] Urban Multimodal Transportation Generative Pretrained Transformer Foundation Model: Hierarchical Techniques and Application Scenarios of Spot-corridor-network Decomposition
    Zhou Z.
    Gu Z.-Y.
    Qu X.-B.
    Liu P.
    Liu Z.-Y.
    Zhongguo Gonglu Xuebao/China Journal of Highway and Transport, 2024, 37 (02): : 253 - 274
  • [7] TOWARDS A GENERATIVE MODEL OF NOMADISM
    SPOONER, B
    ANTHROPOLOGICAL QUARTERLY, 1971, 44 (03) : 198 - 210
  • [8] PoET: A generative model of protein families as sequences-of-sequences
    Truong, Timothy F., Jr.
    Bepler, Tristan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [9] Deep Curvilinear Editing: Commutative and Nonlinear Image Manipulation for Pretrained Deep Generative Model
    Aoshima, Takehiro
    Matsubara, Takashi
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 5957 - 5967
  • [10] Towards a transaction model for services in grid environment
    Jiang, Jinlei
    Yang, Guangwen
    Shi, Meilin
    2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, (WI 2006 MAIN CONFERENCE PROCEEDINGS), 2006, : 625 - +