SeqCondenser: Inductive Representation Learning of Sequences by Sampling Characteristic Functions

Cited by: 0
Authors
Chenebaux, Maixent [1]
Cazenave, Tristan [2]
Affiliations
[1] Vectors Grp, Paris, France
[2] Univ Paris Dauphine PSL, CNRS, LAMSADE, Paris, France
Source
Keywords
DOI
10.1007/978-3-031-70563-2_1
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this work, we introduce SeqCondenser, a neural network layer that compresses a variable-length input sequence into a fixed-size vector representation. The SeqCondenser layer samples the empirical characteristic function and its derivatives for each input dimension, and uses an attention mechanism to determine the associated probability distribution. We argue that the features extracted through this process effectively represent the entire sequence and that the SeqCondenser layer is particularly well-suited for inductive sequence classification tasks, such as text and time series classification. Our experiments show that SCoMo, a SeqCondenser-based architecture, outperforms the state-of-the-art inductive methods on nearly all examined text classification datasets and also outperforms the current best transductive method on one dataset.
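The abstract's core idea, sampling an attention-weighted empirical characteristic function and its derivative per input dimension, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `condense`, the fixed frequency grid `freqs`, and the linear scoring vector `w` used for attention are all assumptions introduced here for illustration. With weights p_i over sequence positions, the sketch evaluates phi(t) = sum_i p_i exp(i t x_i) and phi'(t) = sum_i p_i (i x_i) exp(i t x_i), then concatenates real and imaginary parts.

```python
import numpy as np


def softmax(scores):
    """Numerically stable softmax over a 1-D array."""
    shifted = scores - scores.max()
    e = np.exp(shifted)
    return e / e.sum()


def condense(x, freqs, w):
    """Compress a (T, D) sequence into a fixed-size vector of length 4 * K * D.

    x     : (T, D) input sequence, T may vary between calls
    freqs : (K,) sampling points t_k (hypothetical fixed grid)
    w     : (D,) hypothetical linear scoring vector for attention
    """
    # Attention weights define a probability distribution over positions.
    p = softmax(x @ w)                                          # (T,)

    # exp(i * t_k * x[i, d]) for every frequency, position, and dimension.
    phase = np.exp(1j * freqs[:, None, None] * x[None, :, :])   # (K, T, D)

    # Weighted empirical characteristic function per dimension.
    phi = np.einsum("t,ktd->kd", p, phase)                      # (K, D)

    # First derivative with respect to t: sum_i p_i * (i * x_i) * exp(i t x_i).
    dphi = np.einsum("t,ktd->kd", p, 1j * x[None, :, :] * phase)

    feats = np.stack([phi.real, phi.imag, dphi.real, dphi.imag])  # (4, K, D)
    return feats.ravel()
```

Because the sum runs over sequence positions, the output length depends only on the number of sampling points and input dimensions, not on the sequence length. In a trainable layer, `freqs` and `w` would presumably be learned parameters; here they are passed in as plain arrays.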
Pages: 3-16
Number of pages: 14