Dynamic Interaction Dilation for Interactive Human Parsing

被引:1
|
作者
Gao, Yutong [1 ]
Lang, Congyan [1 ]
Liu, Fayao [2 ]
Cao, Yuanzhouhan [3 ]
Sun, Lijuan [4 ]
Wei, Yunchao [5 ]
机构
[1] Beijing Jiaotong Univ, Minist Educ, Key Lab Big Data & Artificial Intelligence Transpo, Beijing 100044, Peoples R China
[2] ASTAR, Inst Infocomm Res, Singapore 138632, Singapore
[3] Beijing Jiaotong Univ, Sch Comp Sci & Informat Technol, Beijing 100044, Peoples R China
[4] Beijing Univ Posts & Telecommun, Sch Econ & Management, Minist Educ, Key Lab Trustworthy Distributed Comp & Serv, Beijing 100876, Peoples R China
[5] Beijing Jiaotong Univ, Inst Informat Sci, Beijing 10044, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Image edge detection; Annotations; Location awareness; Feature extraction; Transforms; Task analysis; Human parsing; interactive image segmentation; semantic image segmentation; IMAGE SEGMENTATION;
D O I
10.1109/TMM.2023.3262973
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Interactive segmentation pursues generating high-quality pixel-level predictions with a few user-provided clicks, which is gaining attention for its convenience in segmentation data annotation. Users are allowed to iteratively refine the prediction by adding clicks until the result is satisfactory. Existing interactive methods usually transform the clicks into a set of localization maps by Euclidian distance computation or RGB texture extraction to guide the segmentation, which makes the click transformation a core module in interactive segmentation networks. However, when adopted in human images where large poses, occlusions, and bad illuminations are prevailing, prior transformation methods tend to cause uncorrectable overlapping across localization maps which are difficult to form a good match among human parts. Furthermore, the inappropriately transformed information is hard to be refined with the static transformation manner which is out of tune with the dynamically refined interaction process. Hence, we design a dynamic transformation scheme for interactive human parsing (IHP) named Dynamic Interaction Dilation Net (DID-Net), which serves as an initial attempt to break the limitations of static transformation while capturing long-range dependencies of clicks within each human part. Specifically, we construct a Dynamic Dilation Module (DD-Module) to dilate clicks radially in several directions assisted by human body edge detection to refine the dilation quality in each interaction iteration. Furthermore, we propose an Adaptive Interaction Excitation Block (AIE-Block) to exploit potential semantic clues buried in the dilated clicks. Our DID-Net achieves state-of-the-art performance on 3 public human parsing benchmarks.
引用
收藏
页码:178 / 189
页数:12
相关论文
共 50 条
  • [21] iCGPN: Interaction-centric graph parsing network for human-object interaction detection
    Yang, Wenhao
    Chen, Guanyu
    Zhao, Zhicheng
    Su, Fei
    Meng, Hongying
    NEUROCOMPUTING, 2022, 502 : 98 - 109
  • [22] Incremental parsing for interactive natural language interface
    Mori, D
    Matsubara, S
    Inagaki, Y
    2001 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: E-SYSTEMS AND E-MAN FOR CYBERNETICS IN CYBERSPACE, 2002, : 2880 - 2885
  • [23] On Collocations and Their Interaction with Parsing and Translation
    Seretan, Violeta
    INFORMATICS-BASEL, 2014, 1 (01): : 11 - 31
  • [24] Gesture and musical interaction: Interactive engagement through dynamic morphology
    Paine, Garth (ga.paine@uws.edu.au), (International Conference on New Interfaces for Musical Expression):
  • [25] DYNAMIC DISPLAY SYSTEM FOR BETTER INTERACTION ABILITY IN INTERACTIVE INSTALLATION
    Kim, Kirak
    Wong, Chee-Onn
    Jung, Keechul
    LEONARDO, 2009, 42 (03) : 286 - 287
  • [26] Parsing with Dynamic Rule Selection
    宗成庆
    陈肇雄
    黄河燕
    "Journal of Computer Science and Technology J", 1997, (01) : 90 - 96
  • [27] Parsing with dynamic rule selection
    Chengqing Zong
    Zhaoxiong Chen
    Heyan Huang
    Journal of Computer Science and Technology, 1997, 12 (1) : 90 - 96
  • [28] Dynamic LL(k) parsing
    Russmann, A
    ACTA INFORMATICA, 1997, 34 (04) : 267 - 289
  • [29] Interactive Phrases: Semantic Descriptions for Human Interaction Recognition
    Kong, Yu
    Jia, Yunde
    Fu, Yun
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (09) : 1775 - 1788
  • [30] Leveraging Human Inputs in Interactive Machine Learning for Human Robot Interaction
    Senft, Emmanuel
    Lemaignan, Severin
    Baxter, Paul E.
    Belpaeme, Tony
    COMPANION OF THE 2017 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION (HRI'17), 2017, : 281 - 282