Multi-modal Information Integration for Document Retrieval

Cited by: 3
Authors
Hassan, Ehtesham [1]
Chaudhury, Santanu [1]
Gopal, M. [2]
Affiliations
[1] Indian Inst Technol Delhi, Dept Elect Engn, Delhi, India
[2] SNU, Sch Engn, Gautam Buddha Nagar, India
Keywords
Document Indexing; Multi-modal Retrieval; Multiple Kernel Learning; TEXT; SPACE;
DOI
10.1109/ICDAR.2013.243
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The paper proposes a novel multi-modal document image retrieval framework that exploits the information in text and graphics regions. The framework applies a multiple kernel learning based hashing formulation to generate composite document indexes from different modalities. Existing multimedia management methods for imaged text documents have not addressed the requirements of old and degraded documents. As a second contribution, we propose a novel multi-modal document indexing framework for retrieval of old and degraded text documents that combines OCRed text and image-based representations using learning. The proposed concepts are evaluated on sampled magazine cover pages and on documents in Devanagari script.
Pages: 1200-1204
Number of pages: 5