Multi-modal Information Integration for Document Retrieval

被引：3

作者：

Hassan, Ehtesham ^{[1
]}

Chaudhury, Santanu ^{[1
]}

Gopal, M. ^{[2
]}

机构：

[1] Indian Inst Technol Delhi, Dept Elect Engn, Delhi, India

[2] SNU, Sch Engn, Gautam Buddha Nagar, India

来源：

2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR) | 2013年

关键词：

Document Indexing; Multi-modal Retrieval; Multiple Kernel Learning; TEXT; SPACE;

D O I：

10.1109/ICDAR.2013.243

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The paper proposes a novel multi-modal document image retrieval framework by exploiting the information of text and graphics regions. The framework applies multiple kernel learning based hashing formulation for generation of composite document indexes using different modalities. The existing multimedia management methods for imaged text documents have not addressed the requirement of old and degraded documents. In the subsequent contribution, we propose novel multi-modal document indexing framework for retrieval of old and degraded text documents by combining OCRed text and image based representation using learning. The evaluation of proposed concepts is demonstrated on sampled magazine cover pages, and documents of Devanagari script.

引用

页码：1200 / 1204

页数：5

共 50 条

[41] A Multi-Modal Multilingual Benchmark for Document Image Classification
Fujinuma, Yoshinari
Varia, Siddharth
Sankaran, Nishant
Min, Bonan
Appalaraju, Srikar
Vyas, Yogarshi
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 14361 - 14376
[42] Building Multi-Modal Relational Graphs for Multimedia Retrieval
Shieh, Jyh-Ren
Lin, Ching-Yung
Wang, Shun-Xuan
Wu, Ja-Ling
INTERNATIONAL JOURNAL OF MULTIMEDIA DATA ENGINEERING & MANAGEMENT, 2011, 2 (02): : 19 - 41
[43] Multi-modal Language Models for Lecture Video Retrieval
Chen, Huizhong
Cooper, Matthew
Joshi, Dhiraj
Girod, Bernd
PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 1081 - 1084
[44] Multi-modal Solution for Unconstrained News Story Retrieval
Younessian, Ehsan
Rajan, Deepu
ADVANCES IN MULTIMEDIA MODELING, 2012, 7131 : 186 - 195
[45] Multi-Modal Hashing for Efficient Multimedia Retrieval: A Survey
Zhu, Lei
Zheng, Chaoqun
Guan, Weili
Li, Jingjing
Yang, Yang
Shen, Heng Tao
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (01) : 239 - 260
[46] MULTI-MODAL JOINT EMBEDDING FOR FASHION PRODUCT RETRIEVAL
Rubio, A.
Yu, LongLong
Simo-Serra, E.
Moreno-Noguer, F.
2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 400 - 404
[47] Multi-modal fusion for associated news story retrieval
Younessian, Ehsan
Rajan, Deepu
MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (08) : 2563 - 2585
[48] Multi-Modal Knowledge Hypergraph for Diverse Image Retrieval
Zeng, Yawen
Jin, Qin
Bao, Tengfei
Li, Wenfeng
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 3376 - 3383
[49] Flexible Multi-modal Hashing for Scalable Multimedia Retrieval
Zhu, Lei
Lu, Xu
Cheng, Zhiyong
Li, Jingjing
Zhang, Huaxiang
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2020, 11 (02)
[50] A multi-modal system for the retrieval of semantic video events
Amir, A
Basu, S
Iyengar, G
Lin, CY
Naphade, M
Smith, JR
Srinivasan, S
Tseng, B
COMPUTER VISION AND IMAGE UNDERSTANDING, 2004, 96 (02) : 216 - 236

← 1 2 3 4 5 →