Fabric image retrieval based on multi-modal feature fusion

被引:0
|
作者
Ning Zhang
Yixin Liu
Zhongjian Li
Jun Xiang
Ruru Pan
机构
[1] Jiangnan University,Key Laboratory of Eco
[2] Shaoxing University,Textiles, Ministry of Education
来源
关键词
Separable feature extraction; Multi-modal feature fusion; Visual-semantic joint embedding; Fabric retrieval;
D O I
暂无
中图分类号
学科分类号
摘要
With the increasing of multi-source heterogeneous data, flexible retrieval across different modalities is an urgent demand in industrial applications. To allow users to control the retrieval results, a novel fabric image retrieval method is proposed in this paper based on multi-modal feature fusion. First, the image feature is extracted using the modified pre-trained convolutional neural network to separate macroscopic and fine-grained features, which are then selected and aggregated by the multi-layer perception. The feature of the modification text is extracted by long short-term memory networks. Subsequently, the two features are fused in a visual-semantic joint embedding space by gated and residual structures to control the selective expression of separable image features. To validate the proposed scheme, a fabric image database for multi-modal retrieval is created as the benchmark. Qualitative and quantitative experiments indicate that the proposed method is practicable and effective, which can be extended to other similar industrial fields, like wood and wallpaper.
引用
收藏
页码:2207 / 2217
页数:10
相关论文
共 50 条
  • [41] Test method of laser paint removal based on multi-modal feature fusion
    Huang Hai-peng
    Hao Ben-tian
    Ye De-jun
    Gao Hao
    Li Liang
    [J]. JOURNAL OF CENTRAL SOUTH UNIVERSITY, 2022, 29 (10) : 3385 - 3398
  • [42] Everything at Once - Multi-modal Fusion Transformer for Video Retrieval
    Shvetsova, Nina
    Chen, Brian
    Rouditchenko, Andrew
    Thomas, Samuel
    Kingsbury, Brian
    Feris, Rogerio
    Harwath, David
    Glass, James
    Kuehne, Hilde
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19988 - 19997
  • [43] Multi-Object Tracking Based on a Novel Feature Image With Multi-Modal Information
    An, Yi
    Wu, Jialin
    Cui, Yunhao
    Hu, Huosheng
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (08) : 9909 - 9921
  • [44] Multi-modal Query Expansion Based on Local Analysis for Medical Image Retrieval
    Rahman, Md. Mahmudur
    Antani, Sameer K.
    Long, Rodney L.
    Demner-Fushman, Dina
    Thoma, George R.
    [J]. MEDICAL CONTENT-BASED RETRIEVAL FOR CLINICAL DECISION SUPPORT, 2010, 5853 : 110 - 119
  • [45] A probabilistic semantic model for image annotation and multi-modal image retrieval
    Zhang, Ruofei
    Zhang, Zhongfei
    Li, Mingjing
    Ma, Wei-Ying
    Zhang, Hong-Jiang
    [J]. MULTIMEDIA SYSTEMS, 2006, 12 (01) : 27 - 33
  • [46] A probabilistic semantic model for image annotation and multi-modal image retrieval
    Zhang, RF
    Zhang, ZF
    Li, MJ
    Ma, WY
    Zhang, HJ
    [J]. TENTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 846 - 851
  • [47] Multi-modal Fusion
    Liu, Huaping
    Hussain, Amir
    Wang, Shuliang
    [J]. INFORMATION SCIENCES, 2018, 432 : 462 - 462
  • [48] A probabilistic semantic model for image annotation and multi-modal image retrieval
    Ruofei Zhang
    Zhongfei (Mark) Zhang
    Mingjing Li
    Wei-Ying Ma
    Hong-Jiang Zhang
    [J]. Multimedia Systems, 2006, 12 : 27 - 33
  • [49] Multi-modal multi-task feature fusion for RGBT tracking
    Cai, Yujue
    Sui, Xiubao
    Gu, Guohua
    [J]. INFORMATION FUSION, 2023, 97
  • [50] A novel multi-modal medical image fusion algorithm
    Li, Xinhua
    Zhao, Jing
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (02) : 1995 - 2002