Unifying Structure Reasoning and Language Pre-Training for Complex Reasoning Tasks

Cited by: 2
Authors
Wang, Siyuan [1 ]
Wei, Zhongyu [1 ,2 ]
Xu, Jiarong [3 ]
Li, Taishan [4 ]
Fan, Zhihao [1 ]
Affiliations
[1] Fudan Univ, Sch Data Sci, Shanghai 200433, Peoples R China
[2] Fudan Univ, Res Inst Intelligent & Complex Syst, Shanghai 200433, Peoples R China
[3] Fudan Univ, Sch Management, Shanghai 200433, Peoples R China
[4] ShanghaiTech Univ, Sch Informat Sci & Technol, Shanghai 201210, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Cognition; Task analysis; Semantics; Films; Speech processing; Context modeling; Data models; Structure reasoning skill; language model pre-training; complex reasoning;
DOI
10.1109/TASLP.2023.3325973
Chinese Library Classification (CLC)
O42 [Acoustics];
Subject Classification Codes
070206 ; 082403 ;
Abstract
Recent pre-trained language models (PLMs) equipped with foundational reasoning skills have shown remarkable performance on downstream complex tasks. However, the important skill of structure reasoning, which involves modeling implicit structure information within text and performing explicit logical reasoning over it to deduce a conclusion, has rarely been studied. This paper proposes a unified learning framework that combines explicit structure reasoning with language pre-training to endow PLMs with the structure reasoning skill. It first identifies several elementary structures within contexts to construct structured queries, and then performs step-by-step reasoning along these queries to identify the answer entity. The fusion of textual semantics and structure reasoning is achieved by using contextual representations learned by PLMs to initialize the representation space of structures, and by performing stepwise reasoning in this semantic representation space. Experimental results on four datasets demonstrate that the proposed model achieves significant improvements on complex reasoning tasks involving diverse structures, transfers to downstream tasks with limited training data, and remains effective for complex reasoning over the knowledge graph (KG) modality.
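The stepwise reasoning described in the abstract, following a structured query hop by hop through an embedding space initialized from PLM representations, can be illustrated with a minimal sketch. All names and embeddings below are hypothetical toy values (the paper's actual model, training objective, and API are not reproduced here); relations are modeled as simple translations, TransE-style, purely for illustration.

```python
import numpy as np

# Toy entity embeddings. In the paper's framework these would be initialized
# from PLM contextual representations; here they are fixed illustrative vectors.
entity_emb = {
    "Inception": np.array([1.0, 0.0, 0.0, 0.0]),
    "Nolan":     np.array([1.0, 1.0, 0.0, 0.0]),
    "UK":        np.array([1.0, 1.0, 1.0, 0.0]),
}

# Each relation acts as a translation in the embedding space (TransE-style
# assumption for this sketch).
relation_emb = {
    "directed_by": np.array([0.0, 1.0, 0.0, 0.0]),
    "born_in":     np.array([0.0, 0.0, 1.0, 0.0]),
}

def step(query_vec, relation):
    """One reasoning step: project the current query vector along a relation."""
    return query_vec + relation_emb[relation]

def answer(anchor, path):
    """Follow a multi-hop structured query and return the nearest entity."""
    q = entity_emb[anchor]
    for rel in path:          # step-by-step reasoning along the query
        q = step(q, rel)
    # Nearest-neighbour lookup in the shared space identifies the answer entity.
    return min(entity_emb, key=lambda e: float(np.linalg.norm(entity_emb[e] - q)))

# Two-hop structured query: "Where was the director of Inception born?"
print(answer("Inception", ["directed_by", "born_in"]))  # -> UK
```

The point of the sketch is the fusion the abstract describes: because structure embeddings live in the same space as the PLM's contextual representations, each reasoning step stays grounded in textual semantics while the query structure drives the hop sequence.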
Pages: 1586 - 1595
Page count: 10
Related Papers
(50 total)
  • [21] ASR-Generated Text for Language Model Pre-training Applied to Speech Tasks
    Pelloin, Valentin
    Dary, Franck
    Herve, Nicolas
    Favre, Benoit
    Camelin, Nathalie
    Laurent, Antoine
    Besacier, Laurent
    INTERSPEECH 2022, 2022, : 3453 - 3457
  • [22] Evaluating synthetic pre-Training for handwriting processing tasks
    Pippi, Vittorio
    Cascianelli, Silvia
    Baraldi, Lorenzo
    Cucchiara, Rita
    PATTERN RECOGNITION LETTERS, 2023, 172 : 44 - 50
  • [23] Insights into Pre-training via Simpler Synthetic Tasks
    Wu, Yuhuai
    Li, Felix
    Liang, Percy
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [24] Synthetic Pre-Training Tasks for Neural Machine Translation
    He, Zexue
    Blackwood, Graeme
    Panda, Rameswar
    McAuley, Julian
    Feris, Rogerio
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 8080 - 8098
  • [25] Unsupervised Pre-training for Temporal Action Localization Tasks
    Zhang, Can
    Yang, Tianyu
    Weng, Junwu
    Cao, Meng
    Wang, Jue
    Zou, Yuexian
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 14011 - 14021
  • [26] Survey on Vision-language Pre-training
    Yin J.
    Zhang Z.-D.
    Gao Y.-H.
    Yang Z.-W.
    Li L.
    Xiao M.
    Sun Y.-Q.
    Yan C.-G.
    Ruan Jian Xue Bao/Journal of Software, 2023, 34 (05): : 2000 - 2023
  • [27] Sigmoid Loss for Language Image Pre-Training
    Zhai, Xiaohua
    Mustafa, Basil
    Kolesnikov, Alexander
    Beyer, Lucas
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 11941 - 11952
  • [28] Grounded Language-Image Pre-training
    Li, Liunian Harold
    Zhang, Pengchuan
    Zhang, Haotian
    Yang, Jianwei
    Li, Chunyuan
    Zhong, Yiwu
    Wang, Lijuan
    Yuan, Lu
    Zhang, Lei
    Hwang, Jenq-Neng
    Chang, Kai-Wei
    Gao, Jianfeng
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 10955 - 10965
  • [29] VILA: On Pre-training for Visual Language Models
    Lin, Ji
    Yin, Hongxu
    Ping, Wei
    Molchanov, Pavlo
    Shoeybi, Mohammad
    Han, Song
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 26679 - 26689
  • [30] RELATION ENHANCED VISION LANGUAGE PRE-TRAINING
    Lee, Ju-Hee
    Kang, Je-Won
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2286 - 2290