Multi-modal Information Integration for Document Retrieval

Cited by: 3
Authors
Hassan, Ehtesham [1]
Chaudhury, Santanu [1]
Gopal, M. [2]
Affiliations
[1] Indian Inst Technol Delhi, Dept Elect Engn, Delhi, India
[2] SNU, Sch Engn, Gautam Buddha Nagar, India
Keywords
Document Indexing; Multi-modal Retrieval; Multiple Kernel Learning; TEXT; SPACE
DOI
10.1109/ICDAR.2013.243
Chinese Library Classification (CLC)
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The paper proposes a novel multi-modal document image retrieval framework that exploits information from text and graphics regions. The framework applies a multiple kernel learning based hashing formulation to generate composite document indexes from different modalities. Existing multimedia management methods for imaged text documents have not addressed the requirements of old and degraded documents. As a further contribution, we propose a novel multi-modal document indexing framework for retrieval of old and degraded text documents that combines OCRed text and image-based representations through learning. The proposed concepts are evaluated on sampled magazine cover pages and on documents in Devanagari script.
Pages: 1200-1204
Page count: 5