42 entries in total
- [1] Li X P, Zhang B, Sun F C, et al., Indoor scene understanding by fusing multi-view RGB-D image frames, Journal of Computer Research and Development, 57, 6, pp. 1218-1226, (2020)
- [2] Liu Y G, Yu J Z, Han Y H, et al., Understanding the effective receptive field in semantic image segmentation, Multimedia Tools and Applications, 77, 17, pp. 22159-22171, (2018)
- [3] Yatskar M, Zettlemoyer L, Farhadi A, et al., Situation recognition: Visual semantic role labeling for image understanding, Computer Vision and Pattern Recognition, pp. 5534-5542, (2016)
- [4] Zitnick C L, Parikh D, Vanderwende L, et al., Learning the visual interpretation of sentences, International Conference on Computer Vision, pp. 1681-1688, (2013)
- [5] Desai C, Ramanan D, Fowlkes C C, et al., Discriminative models for static human-object interactions, Computer Vision and Pattern Recognition, pp. 9-16, (2010)
- [6] Yao B D, Li F F., Modeling mutual context of object and human pose in human-object interaction activities, Computer Vision and Pattern Recognition, pp. 17-24, (2010)
- [7] Sadeghi M A, Farhadi A., Recognition using visual phrases, Computer Vision and Pattern Recognition, pp. 1745-1752, (2012)
- [8] Li Y K, Ouyang W L, Zhou B L, et al., Scene graph generation from objects, phrases and region captions, International Conference on Computer Vision, pp. 1270-1279, (2017)
- [9] Shin D, Kim I., Deep image understanding using multilayered contexts, Mathematical Problems in Engineering, 2018, pp. 1-11, (2018)
- [10] Krishna R, Zhu Y K, Groth O, et al., Visual genome: Connecting language and vision using crowdsourced dense image annotations, International Journal of Computer Vision, 123, 1, pp. 32-73, (2017)