Dynamic Interaction Dilation for Interactive Human Parsing

被引:1
|
作者
Gao, Yutong [1 ]
Lang, Congyan [1 ]
Liu, Fayao [2 ]
Cao, Yuanzhouhan [3 ]
Sun, Lijuan [4 ]
Wei, Yunchao [5 ]
机构
[1] Beijing Jiaotong Univ, Minist Educ, Key Lab Big Data & Artificial Intelligence Transpo, Beijing 100044, Peoples R China
[2] ASTAR, Inst Infocomm Res, Singapore 138632, Singapore
[3] Beijing Jiaotong Univ, Sch Comp Sci & Informat Technol, Beijing 100044, Peoples R China
[4] Beijing Univ Posts & Telecommun, Sch Econ & Management, Minist Educ, Key Lab Trustworthy Distributed Comp & Serv, Beijing 100876, Peoples R China
[5] Beijing Jiaotong Univ, Inst Informat Sci, Beijing 10044, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Image edge detection; Annotations; Location awareness; Feature extraction; Transforms; Task analysis; Human parsing; interactive image segmentation; semantic image segmentation; IMAGE SEGMENTATION;
D O I
10.1109/TMM.2023.3262973
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Interactive segmentation pursues generating high-quality pixel-level predictions with a few user-provided clicks, which is gaining attention for its convenience in segmentation data annotation. Users are allowed to iteratively refine the prediction by adding clicks until the result is satisfactory. Existing interactive methods usually transform the clicks into a set of localization maps by Euclidian distance computation or RGB texture extraction to guide the segmentation, which makes the click transformation a core module in interactive segmentation networks. However, when adopted in human images where large poses, occlusions, and bad illuminations are prevailing, prior transformation methods tend to cause uncorrectable overlapping across localization maps which are difficult to form a good match among human parts. Furthermore, the inappropriately transformed information is hard to be refined with the static transformation manner which is out of tune with the dynamically refined interaction process. Hence, we design a dynamic transformation scheme for interactive human parsing (IHP) named Dynamic Interaction Dilation Net (DID-Net), which serves as an initial attempt to break the limitations of static transformation while capturing long-range dependencies of clicks within each human part. Specifically, we construct a Dynamic Dilation Module (DD-Module) to dilate clicks radially in several directions assisted by human body edge detection to refine the dilation quality in each interaction iteration. Furthermore, we propose an Adaptive Interaction Excitation Block (AIE-Block) to exploit potential semantic clues buried in the dilated clicks. Our DID-Net achieves state-of-the-art performance on 3 public human parsing benchmarks.
引用
收藏
页码:178 / 189
页数:12
相关论文
共 50 条
  • [31] Error-Aware Interactive Semantic Parsing of OpenStreetMap
    Staniek, Michael
    Riezler, Stefan
    SPLU-ROBONLP 2021: THE 2ND INTERNATIONAL COMBINED WORKSHOP ON SPATIAL LANGUAGE UNDERSTANDING AND GROUNDED COMMUNICATION FOR ROBOTICS, 2021, : 53 - 59
  • [32] CIGALE - A TOOL FOR INTERACTIVE GRAMMAR CONSTRUCTION AND EXPRESSION PARSING
    VOISIN, F
    SCIENCE OF COMPUTER PROGRAMMING, 1986, 7 (01) : 61 - 86
  • [33] VAL: Interactive Task Learning with GPT Dialog Parsing
    Lawley, Lane
    MacLellan, Christopher J.
    PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS, CHI 2024, 2024,
  • [34] MOtion Human Parsing: A New Benchmark for 3D Human Parsing
    Tang, Bingyu
    Jin, Chao
    Zhang, Dongliang
    Zheng, Quanshi
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 3203 - 3208
  • [35] PARSING, MENTAL REPRESENTATION, AND DISCOURSE INTERACTION
    KOMLOSI, LI
    COMMUNICATION AND COGNITION, 1985, 18 (1-2): : 95 - 106
  • [36] Dynamic interactive weighted feature selection using fuzzy interaction information
    Ma, Xi-Ao
    Xu, Hao
    Liu, Yi
    APPLIED INTELLIGENCE, 2025, 55 (02)
  • [37] Dynamic interaction network to model the interactive patterns of international stock markets
    Lukmanto, Laura
    Widiputra, Harya
    Lukas
    World Academy of Science, Engineering and Technology, 2009, 35 : 257 - 261
  • [38] Dynamic knowledge interaction in human cognition
    Nobuhiko, Fujihara
    International Conference on Knowledge-Based Intelligent Electronic Systems, Proceedings, KES, 2000, 1 : 345 - 348
  • [39] Dynamic knowledge interaction in human cognition
    Nobuhiko, F
    KES'2000: FOURTH INTERNATIONAL CONFERENCE ON KNOWLEDGE-BASED INTELLIGENT ENGINEERING SYSTEMS & ALLIED TECHNOLOGIES, VOLS 1 AND 2, PROCEEDINGS, 2000, : 345 - 348
  • [40] Robust parsing using dynamic programming
    Vilares, M
    Darriba, VM
    Vilares, J
    Rodríguez, L
    IMPLEMENTATION AND APPLICATION OF AUTOMATA, PROCEEDINGS, 2003, 2759 : 258 - 268