Semantic Parsing of Interpage Relations

被引:0
|
作者
Demirtas, Mehmet Arif [1 ,2 ]
Oral, Berke [2 ]
Akpinar, Mehmet Yasin [2 ]
Deniz, Onur [2 ]
机构
[1] Istanbul Tech Univ, Dept Comp Engn, Istanbul, Turkey
[2] Yapi Kredi Teknol, Istanbul, Turkey
关键词
semantic parsing; page stream segmentation; dependency parsing; multimodal page representation; document understanding;
D O I
10.1109/ICPR56361.2022.9956546
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Page-level analysis of documents has been a topic of interest in digitization efforts and multimodal approaches have been applied to both classification and page stream segmentation. In this work, we focus on capturing finer semantic relations between pages of a multi-page document. To this end, we formalize the task as semantic parsing of interpage relations and we propose an end-to-end approach for interpage dependency extraction, inspired by the dependency parsing literature. We further design a multi-task training approach to jointly optimize for page embeddings to be used in segmentation, classification, and parsing of the page dependencies using textual and visual features extracted from the pages. Moreover, we also combine the features from two modalities to obtain multimodal page embeddings. To the best of our knowledge, this is the first study to extract rich semantic interpage relations from multi-page documents. Our experimental results show that the proposed method increased LAS by 41 percentage points for semantic parsing, increased accuracy by 33 percentage points for page stream segmentation, and 45 percentage points for page classification over a naive baseline.
引用
收藏
页码:1579 / 1585
页数:7
相关论文
共 50 条
  • [1] Analysis of Sanskrit Text: Parsing and Semantic Relations
    Goyal, Pawan
    Arora, Vipul
    Behera, Laxmidhar
    [J]. SANSKRIT COMPUTATIONAL LINGUISTICS, INVITED PAPERS, 2009, 5402 : 200 - 218
  • [2] From Paraphrasing to Semantic Parsing: Unsupervised Semantic Parsing via Synchronous Semantic Decoding
    Wu, Shan
    Chen, Bo
    Xin, Chunlei
    Han, Xianpei
    Sun, Le
    Zhang, Weipeng
    Chen, Jiansong
    Yang, Fan
    Cai, Xunliang
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 5110 - 5121
  • [3] Conversational Semantic Parsing
    Aghajanyan, Armen
    Maillard, Jean
    Shrivastava, Akshat
    Diedrick, Keith
    Haeger, Mike
    Li, Haoran
    Mehdad, Yashar
    Stoyanov, Ves
    Kumar, Anuj
    Lewis, Mike
    Gupta, Sonal
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 5026 - 5035
  • [4] Learning for semantic parsing
    Mooney, Raymond J.
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2007, 4394 : 311 - 324
  • [5] Parsing, Semantic Networks, and Political Authority Using Syntactic Analysis to Extract Semantic Relations from Dutch Newspaper Articles
    van Atteveldt, Wouter
    Kleinnijenhuis, Jan
    Ruigrok, Nel
    [J]. POLITICAL ANALYSIS, 2008, 16 (04) : 428 - 446
  • [6] Shallow semantic parsing of Chinese
    Sun, H
    Jurafsky, D
    [J]. HLT-NAACL 2004: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, 2004, : 249 - 256
  • [7] Semantic Parsing with Dual Learning
    Cao, Ruisheng
    Zhu, Su
    Liu, Chen
    Li, Jieyu
    Yu, Kai
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 51 - 64
  • [8] Semantic Parsing via Paraphrasing
    Berant, Jonathan
    Liang, Percy
    [J]. PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2014, : 1415 - 1425
  • [9] Domain Adaptation for Semantic Parsing
    Li, Zechang
    Lai, Yuxuan
    Feng, Yansong
    Zhao, Dongyan
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3723 - 3729
  • [10] Semantic Parsing of Disfluent Speech
    Sen, Priyanka
    Groves, Isabel
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 1748 - 1753