Incorporating Scenario Knowledge into A Unified Fine-tuning Architecture for Event Representation

Cited by: 21
Authors
Zheng, Jianming [1]
Cai, Fei [1]
Chen, Honghui [1]
Affiliation
[1] Natl Univ Def Technol, Sci & Technol Informat Syst Engn Lab, Changsha, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
event representation; pre-training; fine-tuning; scenario knowledge;
DOI
10.1145/3397271.3401173
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Given an event that has occurred, humans can easily predict the next event or infer the preceding one, yet such event reasoning remains difficult for machines. Event representation bridges this gap by modeling the process of event reasoning in a machine-readable format, which can then support a wide range of applications in information retrieval, e.g., question answering and information extraction. Existing work mainly resorts to joint training, integrating all levels of training loss in event chains by simple loss summation, which is easily trapped in a local optimum. In addition, the scenario knowledge in event chains has not been well investigated for event representation. In this paper, we propose UniFA-S, a unified fine-tuning architecture that incorporates scenario knowledge for event representation. It mainly consists of a unified fine-tuning architecture (UniFA) and a scenario-level variational auto-encoder (S-VAE). In detail, UniFA employs multi-step fine-tuning to integrate all levels of training, and S-VAE applies a stochastic variable to implicitly represent the scenario-level knowledge. We evaluate our proposal from two aspects, i.e., its representation and inference abilities. For the representation ability, our ensemble model UniFA-S beats state-of-the-art baselines on two similarity tasks. For the inference ability, UniFA-S outperforms the best baseline, achieving 4.1% to 8.2% improvements in accuracy across various inference tasks.
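As a rough illustration of what "applies a stochastic variable" typically means in a variational auto-encoder, the sketch below shows the standard reparameterized sampling step and the closed-form KL term for a diagonal Gaussian latent. This is a generic VAE building block, not the authors' S-VAE: the function names, the two-dimensional latent, and the example values of `mu` and `log_var` are all assumptions made for illustration.

```python
import math
import random


def sample_latent(mu, log_var, rng):
    """Reparameterized sample: z = mu + sigma * eps, with eps ~ N(0, 1)."""
    return [
        m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
        for m, lv in zip(mu, log_var)
    ]


def kl_to_standard_normal(mu, log_var):
    """Closed-form KL( N(mu, diag(exp(log_var))) || N(0, I) )."""
    return 0.5 * sum(
        math.exp(lv) + m * m - 1.0 - lv
        for m, lv in zip(mu, log_var)
    )


# Hypothetical posterior parameters for a 2-dimensional latent variable.
rng = random.Random(0)
mu = [0.5, -1.0]
log_var = [0.0, -2.0]

z = sample_latent(mu, log_var, rng)      # stochastic latent sample
kl = kl_to_standard_normal(mu, log_var)  # regularizer added to the training loss
```

In a typical VAE, the sample `z` would condition a decoder and the KL term would be added to the reconstruction loss; how UniFA-S combines these with its fine-tuning steps is not specified in the abstract.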
Pages: 249-258
Page count: 10