Unifying Structure Reasoning and Language Pre-Training for Complex Reasoning Tasks

Cited by: 2
Authors
Wang, Siyuan [1 ]
Wei, Zhongyu [1 ,2 ]
Xu, Jiarong [3 ]
Li, Taishan [4 ]
Fan, Zhihao [1 ]
Affiliations
[1] Fudan Univ, Sch Data Sci, Shanghai 200433, Peoples R China
[2] Fudan Univ, Res Inst Intelligent & Complex Syst, Shanghai 200433, Peoples R China
[3] Fudan Univ, Sch Management, Shanghai 200433, Peoples R China
[4] ShanghaiTech Univ, Sch Informat Sci & Technol, Shanghai 201210, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Cognition; Task analysis; Semantics; Films; Speech processing; Context modeling; Data models; Structure reasoning skill; language model pre-training; complex reasoning;
DOI
10.1109/TASLP.2023.3325973
Chinese Library Classification (CLC)
O42 [Acoustics];
Subject Classification Codes
070206 ; 082403 ;
Abstract
Recent pre-trained language models (PLMs) equipped with foundation reasoning skills have shown remarkable performance on downstream complex tasks. However, the important skill of structure reasoning has rarely been studied; it involves modeling implicit structure information within the text and performing explicit logical reasoning over that structure to deduce the conclusion. This paper proposes a unified learning framework that combines explicit structure reasoning and language pre-training to endow PLMs with the structure reasoning skill. It first identifies several elementary structures within a context to construct structured queries, then performs step-by-step reasoning along the queries to identify the answer entity. The fusion of textual semantics and structure reasoning is achieved by using contextual representations learned by PLMs to initialize the representation space of structures, and by performing stepwise reasoning in this semantic representation space. Experimental results on four datasets demonstrate that the proposed model achieves significant improvements on complex reasoning tasks involving diverse structures, transfers to downstream tasks with limited training data, and is effective for complex reasoning over the knowledge graph (KG) modality.
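To make the mechanism in the abstract concrete, the following is a minimal sketch, not the authors' implementation: it assumes a Hugging Face transformers / PyTorch stack, a toy context with hand-picked entity and relation spans, and a simple translation-style step operator. It only illustrates the two ideas the abstract names, initializing the structure representation space from PLM contextual representations and reasoning step by step along a structured query.

```python
# Illustrative sketch only: PLM contextual representations initialize the embedding
# space of structures found in the text, and a structured query is answered by
# stepwise reasoning over that space. Context, spans, and the step operator are
# assumptions for demonstration, not the paper's model.
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
plm = AutoModel.from_pretrained("bert-base-uncased")


def span_embedding(text: str, span: tuple[int, int]) -> torch.Tensor:
    """Mean-pool the PLM's contextual token representations over a character span."""
    enc = tokenizer(text, return_tensors="pt", return_offsets_mapping=True)
    offsets = enc.pop("offset_mapping")[0]          # (seq_len, 2) character offsets
    with torch.no_grad():
        hidden = plm(**enc).last_hidden_state[0]    # (seq_len, hidden_size)
    start, end = span
    mask = (offsets[:, 0] >= start) & (offsets[:, 1] <= end) & (offsets[:, 1] > 0)
    return hidden[mask].mean(dim=0)


context = "Kubrick directed the film, and the film won the award."
# Elementary structures identified in the context (spans are hand-picked here;
# the paper extracts such structures from the text).
entity_spans = {"Kubrick": (0, 7), "the film": (17, 25), "the award": (44, 53)}
relation_spans = {"directed": (8, 16), "won": (40, 43)}

# Initialize the structure representation space from the PLM's contextual semantics.
entity_emb = {e: span_embedding(context, s) for e, s in entity_spans.items()}
relation_emb = {r: span_embedding(context, s) for r, s in relation_spans.items()}


def reason(query: list[str], start_entity: str) -> str:
    """Step-by-step reasoning along a structured query: apply each relation as a
    translation-style step, then rank candidate entities against the query state."""
    state = entity_emb[start_entity]
    for relation in query:                          # one reasoning step per relation
        state = state + relation_emb[relation]
    scores = {e: torch.cosine_similarity(state, v, dim=0).item()
              for e, v in entity_emb.items()}
    return max(scores, key=scores.get)


# "What did the thing Kubrick directed win?" -> (Kubrick, directed, ?x), (?x, won, ?ans)
print(reason(["directed", "won"], "Kubrick"))
```

Without the paper's joint pre-training objective, the untrained span embeddings will not produce meaningful answers; the sketch only shows where textual semantics enters the structure space and how a multi-hop query is traversed step by step.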
Pages: 1586-1595
Number of pages: 10