50 items in total
- [32] Temporal Cross-Modal Attention for Audio-Visual Event Localization [J]. Seimitsu Kogaku Kaishi/Journal of the Japan Society for Precision Engineering, 2022, 88 (03): 263 - 268
- [33] The Neural Correlates of Visual and Auditory Cross-Modal Selective Attention in Aging [J]. FRONTIERS IN AGING NEUROSCIENCE, 2020, 12
- [34] Visual attention guided image fusion with sparse representation [J]. OPTIK, 2014, 125 (17): 4881 - 4888
- [35] Cross-Modal Attention-Guided Convolutional Network for Multi-modal Cardiac Segmentation [J]. MACHINE LEARNING IN MEDICAL IMAGING (MLMI 2019), 2019, 11861: 601 - 610
- [37] Cross-Modal Self-Attention Network for Referring Image Segmentation [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019: 10494 - 10503
- [39] Stacked cross-modal feature consolidation attention networks for image captioning [J]. Multimedia Tools and Applications, 2024, 83: 12209 - 12233