Audiovisual cross-modal material surface retrieval

Cited by: 4
Authors
Liu, Zhuokun [1]
Liu, Huaping [2]
Huang, Wenmei [1]
Wang, Bowen [1]
Sun, Fuchun [2]
Affiliations
[1] State Key Lab Reliabil & Intelligence Elect Equip, Tianjin 300130, Peoples R China
[2] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
Source
NEURAL COMPUTING & APPLICATIONS | 2020 / Vol. 32 / Issue 18
Keywords
Cross-modal retrieval; Local receptive fields-based extreme learning machine; Canonical correlation analysis; Material analysis; EXTREME LEARNING-MACHINE; LOCAL RECEPTIVE-FIELDS; PERCEPTION;
DOI
10.1007/s00521-019-04476-3
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Cross-modal retrieval has developed rapidly because it can process data across different modalities. To address the problem that text and images sometimes cannot support a true and accurate analysis of a material, a system for audiovisual cross-modal retrieval of material surfaces is proposed. First, a local receptive fields-based extreme learning machine is used to extract sound and image features. These features are then mapped to a shared subspace using canonical correlation analysis and retrieved by Euclidean distance. Finally, the system carries out the complete audiovisual cross-modal retrieval process. The experimental results show that the proposed system works well when applied to wood. The designed system offers a new approach for research in the field of material identification.
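The second and third steps of the abstract (projection into a subspace with canonical correlation analysis, then ranking by Euclidean distance) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the feature matrices are random placeholders standing in for the sound and image features that the paper extracts with a local receptive fields-based extreme learning machine, and the function names (`fit_cca`, `project`, `retrieve`), the regularization value, and all dimensions are assumptions made only for illustration.

```python
import numpy as np


def fit_cca(X, Y, n_components, reg=1e-3):
    """Fit linear CCA on paired rows of X and Y (hypothetical helper).

    Returns the two feature means, the two projection matrices, and the
    leading canonical correlations.
    """
    mx, my = X.mean(axis=0), Y.mean(axis=0)
    Xc, Yc = X - mx, Y - my
    n = X.shape[0]
    # Regularized covariance blocks (reg keeps them positive definite).
    Sxx = Xc.T @ Xc / (n - 1) + reg * np.eye(X.shape[1])
    Syy = Yc.T @ Yc / (n - 1) + reg * np.eye(Y.shape[1])
    Sxy = Xc.T @ Yc / (n - 1)
    # Whiten each view with its Cholesky factor: T = Lx^{-1} Sxy Ly^{-T}.
    Lx = np.linalg.cholesky(Sxx)
    Ly = np.linalg.cholesky(Syy)
    T = np.linalg.solve(Lx, Sxy)
    T = np.linalg.solve(Ly, T.T).T
    # Singular vectors of T give the canonical directions.
    U, s, Vt = np.linalg.svd(T, full_matrices=False)
    Wx = np.linalg.solve(Lx.T, U[:, :n_components])
    Wy = np.linalg.solve(Ly.T, Vt.T[:, :n_components])
    return mx, my, Wx, Wy, s[:n_components]


def project(F, mean, W):
    """Center features and map them into the shared subspace."""
    return (F - mean) @ W


def retrieve(query, gallery, top_k=5):
    """Return indices of the top_k gallery items nearest to each query row."""
    dists = np.linalg.norm(query[:, None, :] - gallery[None, :, :], axis=2)
    return np.argsort(dists, axis=1)[:, :top_k]


# Placeholder data (assumption): 200 paired samples driven by a shared latent
# factor, standing in for extracted sound and image features.
rng = np.random.default_rng(0)
latent = rng.normal(size=(200, 4))
sound_feats = latent @ rng.normal(size=(4, 20)) + 0.1 * rng.normal(size=(200, 20))
image_feats = latent @ rng.normal(size=(4, 30)) + 0.1 * rng.normal(size=(200, 30))

mx, my, Wx, Wy, corrs = fit_cca(sound_feats, image_feats, n_components=4)
sound_sub = project(sound_feats, mx, Wx)
image_sub = project(image_feats, my, Wy)

# Sound-to-image retrieval: rank all images for the first five sound queries.
ranks = retrieve(sound_sub[:5], image_sub, top_k=3)
print(ranks)
```

Fitting on held-out pairs and querying in the opposite direction (image to sound) only requires swapping the arguments passed to `retrieve`.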
Pages: 14301 - 14309
Number of pages: 9
Related papers
50 in total
  • [31] Effect of inhibition of return on audiovisual cross-modal correspondence
    Zu Guangyao
    Li Shuqi
    Zhang Tianyang
    Wang Aijun
    Zhang Ming
    [J]. ACTA PSYCHOLOGICA SINICA, 2023, 55 (08) : 1220 - 1233
  • [32] Cross-Modal Retrieval Using Deep Learning
    Malik, Shaily
    Bhardwaj, Nikhil
    Bhardwaj, Rahul
    Kumar, Saurabh
    [J]. PROCEEDINGS OF THIRD DOCTORAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE, DOSCI 2022, 2023, 479 : 725 - 734
  • [33] Multi-Label Cross-modal Retrieval
    Ranjan, Viresh
    Rasiwasia, Nikhil
    Jawahar, C. V.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015 : 4094 - 4102
  • [34] Multi-modal and cross-modal for lecture videos retrieval
    Nhu Van Nguyen
    Coustaty, Mickaël
    Ogier, Jean-Marc
    [J]. 2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014 : 2667 - 2672
  • [35] Deep Semantic Mapping for Cross-Modal Retrieval
    Wang, Cheng
    Yang, Haojin
    Meinel, Christoph
    [J]. 2015 IEEE 27TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2015), 2015 : 234 - 241
  • [36] Cross-modal retrieval based on shared proxies
    Wei, Yuxin
    Zheng, Ligang
    Qiu, Guoping
    Cai, Guocan
    [J]. INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2024, 13 (01)
  • [37] Deep Relation Embedding for Cross-Modal Retrieval
    Zhang, Yifan
    Zhou, Wengang
    Wang, Min
    Tian, Qi
    Li, Houqiang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 617 - 627
  • [38] Cross-Modal Topic Correlations for Multimedia Retrieval
    Yu, Jing
    Cong, Yonghui
    Qin, Zengchang
    Wan, Tao
    [J]. 2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012 : 246 - 249
  • [39] Learning Cross-Modal Retrieval with Noisy Labels
    Hu, Peng
    Peng, Xi
    Zhu, Hongyuan
    Zhen, Liangli
    Lin, Jie
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021 : 5399 - 5409
  • [40] Cross-Modal Retrieval with Correlation Feature Propagation
    Zhang, Lu
    Cao, Feng
    Liang, Xinyan
    Qian, Yuhua
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (09) : 1993 - 2002