A Study on Hierarchical Text Classification as a Seq2seq Task

Authors
Torba, Fatos [1, 2]
Gravier, Christophe [2]
Laclau, Charlotte [3]
Kammoun, Abderrhammen [1]
Subercaze, Julien [1]
Affiliations
[1] AItenders, St Etienne, France
[2] CNRS, Lab Hubert Curien, UMR 5516, St Etienne, France
[3] Inst Polytech Paris, Telecom Paris, Paris, France
Keywords
Hierarchical text classification; generative model; reproducibility
DOI
10.1007/978-3-031-56063-7_20
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
With the progress of generative neural models, Hierarchical Text Classification (HTC) can be cast as a generative task: given an input text, the model generates the sequence of predicted class labels, taken from a label tree of arbitrary width and depth. Treating HTC as a generative task introduces multiple modeling choices. These range from the order in which the class tree is visited, which defines the order in which tokens are generated, through whether to constrain decoding to labels that respect the previous-level predictions, to the choice of the pre-trained language model itself. Each HTC model therefore differs from the others not only in its architecture but also in the modeling choices that were made. Prior contributions lack transparent modeling choices and open implementations, which makes it hard to assess whether model performance stems from architectural or modeling decisions. For these reasons, this paper analyzes the impact of different modeling choices, along with common model errors and successes for this task. The analysis is based on an open framework released alongside this paper, which can facilitate future contributions in the field by providing datasets, metrics, an error analysis toolkit, and the capability to readily test various modeling choices for a given model.
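Two of the modeling choices named in the abstract, the tree visiting order and the decoding constraint, can be illustrated with a minimal Python sketch. It is not the authors' framework: the toy label tree `TREE`, the helper names `linearize` and `allowed_next`, and the particular reading of the constraint (a label may be emitted only once its parent has been emitted) are all assumptions made for illustration.

```python
from collections import deque

# Hypothetical toy label tree (parent -> children); not taken from the paper.
TREE = {
    "root": ["science", "sports"],
    "science": ["physics", "biology"],
    "sports": ["tennis"],
    "physics": [],
    "biology": [],
    "tennis": [],
}


def linearize(gold, order="dfs"):
    """Order the gold labels as a target sequence, visiting the tree depth-first or breadth-first."""
    out = []
    frontier = deque(["root"])
    while frontier:
        # A stack (pop) gives depth-first order, a queue (popleft) gives breadth-first order.
        node = frontier.pop() if order == "dfs" else frontier.popleft()
        if node != "root":
            out.append(node)
        children = [c for c in TREE[node] if c in gold]
        # For DFS, push children reversed so the first child is visited first.
        frontier.extend(reversed(children) if order == "dfs" else children)
    return out


def allowed_next(generated):
    """Assumed constraint: a label is allowed only if its parent is the root or was already generated."""
    parents = ["root"] + list(generated)
    return {c for p in parents for c in TREE[p] if c not in generated}


gold = {"science", "physics", "biology", "sports", "tennis"}
dfs_target = linearize(gold, "dfs")
bfs_target = linearize(gold, "bfs")
```

With this toy tree, the same set of gold labels produces two different target sequences depending on the visiting order, and `allowed_next` shrinks the candidate set at each decoding step. In a real seq2seq model the constraint would act on sub-word tokens rather than whole labels, so this sketch only conveys the idea at the label level.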
Pages: 287-296
Number of pages: 10