共 24 条
- [1] Yang Y, Zhan DC, Jiang Y, Xiong H., Reliable multi-modal learning: A survey, Ruan Jian Xue Bao/Journal of Software, 32, 4, pp. 1067-1081, (2021)
- [2] Ge XL., Influence of audiovisual congruency on the auditory intensity change judgment, (2011)
- [3] Lv ZL., Study on generation of spatial audio using audio-visual cues, (2021)
- [4] Cheng HN, Li SJ, Liu SG., Deep cross-modal synthesis of environmental sound, Journal of Computer-aided Design & Computer Graphics, 31, 12, pp. 2047-2055, (2019)
- [5] Wang RQ, Cheng HN, Ye L, Qi QT., Reproduction transformation rule-based sound generation for film soundtrack, Journal of Computer-aided Design & Computer Graphics, 34, 10, pp. 1524-1532, (2022)
- [6] Huang HM, Lin LF, Tong RF, Hu HJ, Zhang QW, Iwamoto Y, Han XH, Chen YW, Wu J., UNet 3+: A full-scale connected UNet for medical image segmentation, Proc. of the 2020 IEEE Int’l Conf. on Acoustics, Speech and Signal Processing, pp. 1055-1059, (2020)
- [7] Gao RH, Grauman K., 2.5D visual sound, Proc. of the 2019 IEEE/CVF Conf. on Computer Vision and Pattern Recognition, pp. 324-333, (2019)
- [8] Zhou H, Xu XD, Lin DH, Wang XG, Liu ZW., Sep-stereo: Visually guided stereophonic audio generation by associating source separation, Proc. of the 16th European Conf. on Computer Vision, pp. 52-69, (2020)
- [9] Li SJ, Liu SG, Manocha D., Binaural audio generation via multi-task learning, ACM Trans. on Graphics, 40, 6, (2021)
- [10] Parida KK, Srivastava S, Sharma G., Beyond mono to binaural: Generating binaural audio from mono audio with depth and cross modal attention, Proc. of the 2022 IEEE/CVF Winter Conf. on Applications of Computer Vision, pp. 2151-2160, (2022)