共 25 条
- [1] HU R H, ROHRBACH M, DARRELL T., Segmentation from natural language expressions, Proceeding of the European Conference on Computer Vision, pp. 108-124, (2016)
- [2] LIU C X, LIN Z, SHEN X H, Et al., Recurrent multimodal interaction for referring image segmentation, Proceedings of the 2017 IEEE International Conference on Computer Vision, pp. 1280-1289, (2017)
- [3] MARGFFOY-TUAY E, PEREZ J C, BOTERO E, Et al., Dynamic multimodal instance segmentation guided by natural language queries[C], Proceedings of the European Conference on Computer Vision, pp. 656-672, (2018)
- [4] LEI T, ZHANG Y., Training RNNs as fast as CNNs
- [5] LI R Y, LI K C, KUO Y C, Et al., Referring image segmentation via recurrent refinement networks, Proceedings of the 2018 IEEE/ CVF Conference on Computer Vision and Pattern Recognition, pp. 5745-5753, (2018)
- [6] YE L W, ROCHAN M, LIU Z, Et al., Cross-modal self-attention network for referring image segmentation, Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10494-10503, (2019)
- [7] CHEN D J, JIA S H, LO Y C, Et al., See-through-text grouping for referring image segmentation, Proceedings of the 2019 IEEE/ CVF International Conference on Computer Vision, pp. 7453-7462, (2019)
- [8] HUANG S F, HUI T R, LIU S, Et al., Referring image segmentation via cross-modal progressive comprehension, Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10485-10494, (2020)
- [9] HUI T R, LIU S, HUANG S F, Et al., Linguistic structure guided context modeling for referring image segmentation, Proceeding of the European Conference on Computer Vision, pp. 59-75, (2020)
- [10] BELLVER M, VENTURA C, SILBERER C, Et al., A closer look at referring expressions for video object segmentation, Multimedia Tools and Applications, 82, 3, pp. 4419-4438, (2023)