Graph Convolutional Neural Networks for Learning Attribute Representations for Word Spotting

被引:1
|
作者
Wolf, Fabian [1 ]
Fischer, Andreas [2 ,3 ]
Fink, Gernot A. [1 ]
机构
[1] TU Dortmund Univ, Dept Comp Sci, D-44227 Dortmund, Germany
[2] Univ Fribourg, Dept Informat, DIVA Grp, Fribourg, Switzerland
[3] Univ Appl Sci & Arts Western Switzerland, Inst Complex Syst, Fribourg, Switzerland
关键词
Graph neural networks; Geometric deep learning; Word spotting;
D O I
10.1007/978-3-030-86549-8_4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Graphs are an intuitive and natural way of representing handwriting. Due to their high representational power, they have shown high performances in different learning-free document analysis tasks. While machine learning is rather unexplored for graph representations, geometric deep learning offers a novel framework that allows for convolutional neural networks similar to the image domain. In this work, we show that the concept of attribute prediction can be adapted to the graph domain. We propose a graph neural network to map handwritten word graphs to a symbolic attribute space. This mapping allows to perform query-by-example word spotting as it was also tackled by other learning-free approaches in the graph domain. Furthermore, our model is capable of query-by-string, which is out of scope for other graph-based methods in the literature. We investigate two variants of graph convolutional layers and show that learning improves performances considerably on two popular graph-based word spotting benchmarks.
引用
收藏
页码:50 / 64
页数:15
相关论文
共 50 条
  • [1] Learning Deep Graph Representations via Convolutional Neural Networks
    Ye, Wei
    Askarisichani, Omid
    Jones, Alex
    Singh, Ambuj
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (05) : 2268 - 2279
  • [2] Learning multimodal word representation with graph convolutional networks
    Zhu, Wenhao
    Liu, Shuang
    Liu, Chaoming
    INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (06)
  • [3] Deep Neural Networks for Learning Graph Representations
    Cao, Shaosheng
    Lu, Wei
    Xu, Qiongkai
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 1145 - 1152
  • [4] Graph Transformer: Learning Better Representations for Graph Neural Networks
    Wang, Boyuan
    Cui, Lixin
    Bai, Lu
    Hancock, Edwin R.
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2020, 2021, 12644 : 139 - 149
  • [5] Learning Word Representations with Deep Neural Networks for Turkish
    Dundar, Enes Burak
    Alpaydin, Ethem
    2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
  • [6] SpottingNet: Learning the Similarity of Word Images with Convolutional Neural Network for Word Spotting in Handwritten Historical Documents
    Zhong, Zhuoyao
    Pan, Weishen
    Jin, Lianwen
    Mouchere, Harold
    Viard-Gaudin, Christian
    PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 295 - 300
  • [7] Learning graph in graph convolutional neural networks for robust seizure prediction
    Lian, Qi
    Qi, Yu
    Pan, Gang
    Wang, Yueming
    JOURNAL OF NEURAL ENGINEERING, 2020, 17 (03)
  • [8] Efficient Relative Attribute Learning Using Graph Neural Networks
    Meng, Zihang
    Adluru, Nagesh
    Kim, Hyunwoo J.
    Fung, Glenn
    Singh, Vikas
    COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 : 575 - 590
  • [9] Multi-Task Learning for Metaphor Detection with Graph Convolutional Neural Networks and Word Sense Disambiguation
    Duong Minh Le
    My Thai
    Thien Huu Nguyen
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 8139 - 8146
  • [10] Learning Shared Representations for Recommendation with Dynamic Heterogeneous Graph Convolutional Networks
    Jing, Mengyuan
    Zhu, Yanmin
    Xu, Yanan
    Liu, Haobing
    Zang, Tianzi
    Wang, Chunyang
    Yu, Jiadi
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2023, 17 (04)