SymFormer: End-to-End Symbolic Regression Using Transformer-Based Architecture

被引:4
|
作者
Vastl, Martin [1 ,2 ]
Kulhanek, Jonas [1 ,3 ]
Kubalik, Jiri [1 ]
Derner, Erik [1 ]
Babuska, Robert [1 ,4 ]
机构
[1] Czech Tech Univ, Czech Inst Informat Robot & Cybernet, Prague 16000, Czech Republic
[2] Charles Univ Prague, Fac Math & Phys, Prague 12116, Czech Republic
[3] Czech Tech Univ, Fac Elect Engn, Prague 16000, Czech Republic
[4] Delft Univ Technol, Dept Cognit Robot, NL-2628 CD Delft, Netherlands
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Transformers; Mathematical models; Vectors; Symbols; Decoding; Optimization; Predictive models; Neural networks; Genetic programming; Computational complexity; Benchmark testing; Regression analysis; Symbolic regression; neural networks; transformers;
D O I
10.1109/ACCESS.2024.3374649
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many real-world systems can be naturally described by mathematical formulas. The task of automatically constructing formulas to fit observed data is called symbolic regression. Evolutionary methods such as genetic programming have been commonly used to solve symbolic regression tasks, but they have significant drawbacks, such as high computational complexity. Recently, neural networks have been applied to symbolic regression, among which the transformer-based methods seem to be most promising. After training a transformer on a large number of formulas, the actual inference, i.e., finding a formula for new, unseen data, is very fast (in the order of seconds). This is considerably faster than state-of-the-art evolutionary methods. The main drawback of transformers is that they generate formulas without numerical constants, which have to be optimized separately, yielding suboptimal results. We propose a transformer-based approach called SymFormer, which predicts the formula by outputting the symbols and the constants simultaneously. This helps to generate formulas that fit the data more accurately. In addition, the constants provided by SymFormer serve as a good starting point for subsequent tuning via gradient descent to further improve the model accuracy. We show on several benchmarks that SymFormer outperforms state-of-the-art methods while having faster inference.
引用
收藏
页码:37840 / 37849
页数:10
相关论文
共 50 条
  • [21] A study of transformer-based end-to-end speech recognition system for Kazakh language
    Mamyrbayev, Orken
    Oralbekova, Dina
    Alimhan, Keylan
    Turdalykyzy, Tolganay
    Othman, Mohamed
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [22] Transformer-Based End-to-End Classification of Variable-Length Volumetric Data
    Oghbaie, Marzieh
    Araujo, Teresa
    Emre, Taha
    Schmidt-Erfurth, Ursula
    Bogunovic, Hrvoje
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VI, 2023, 14225 : 358 - 367
  • [23] TransOrga: End-To-End Multi-modal Transformer-Based Organoid Segmentation
    Qin, Yiming
    Li, Jiajia
    Chen, Yulong
    Wang, Zikai
    Huang, Yu-An
    You, Zhuhong
    Hu, Lun
    Hu, Pengwei
    Tan, Feng
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT III, 2023, 14088 : 460 - 472
  • [24] TOD-Net: An end-to-end transformer-based object detection network
    Sirisha, Museboyina
    Sudha, S. V.
    COMPUTERS & ELECTRICAL ENGINEERING, 2023, 108
  • [25] Transformer-based Planning for Symbolic Regression
    Shojaee, Parshin
    Meidani, Kazem
    Farimani, Amir Barati
    Reddy, Chandan K.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [26] End-to-End Asbestos Roof Detection on Orthophotos Using Transformer-Based YOLO Deep Neural Network
    Pace, Cesare Davide
    Bria, Alessandro
    Focareta, Mariano
    Lozupone, Gabriele
    Marrocco, Claudio
    Meoli, Giuseppe
    Molinara, Mario
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2023, PT I, 2023, 14233 : 232 - 244
  • [27] OrientedFormer: An End-to-End Transformer-Based Oriented Object Detector in Remote Sensing Images
    Zhao, Jiaqi
    Ding, Zeyu
    Zhou, Yong
    Zhu, Hancheng
    Du, Wen-Liang
    Yao, Rui
    El Saddik, Abdulmotaleb
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [28] HyperSFormer: A Transformer-Based End-to-End Hyperspectral Image Classification Method for Crop Classification
    Xie, Jiaxing
    Hua, Jiajun
    Chen, Shaonan
    Wu, Peiwen
    Gao, Peng
    Sun, Daozong
    Lyu, Zhendong
    Lyu, Shilei
    Xue, Xiuyun
    Lu, Jianqiang
    REMOTE SENSING, 2023, 15 (14)
  • [29] An Empirical Study on Transformer-Based End-to-End Speech Recognition with Novel Decoder Masking
    Weng, Shi-Yan
    Chiu, Hsuan-Sheng
    Chen, Berlin
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 518 - 522
  • [30] Intra-hour solar irradiance forecasting: An end-to-end Transformer-based network
    Song, Kang
    Wang, Kai
    Wang, Shibo
    Wang, Nan
    Zhang, Jingxin
    Zhang, Kanjian
    Wei, Haikun
    39TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION, YAC 2024, 2024, : 526 - 531