Bidirectional Optimization Coupled Lightweight Networks for Efficient and Robust Multi-Person 2D Pose Estimation

Cited by: 0

Authors:

Shuai Li

Zheng Fang

Wen-Feng Song

Ai-Min Hao

Hong Qin

Affiliations:

[1] Beihang University, State Key Laboratory of Virtual Reality Technology and Systems

[2] Beihang University Qingdao Research Institute, Department of Computer Science

[3] Stony Brook University

Source:

Journal of Computer Science and Technology | 2019, Vol. 34

Keywords:

bidirectional optimization; computer vision; deep learning; probability limb heat map; 2D multi-person pose estimation

DOI:

Not available

Chinese Library Classification number:

Discipline classification number:

Abstract:

For multi-person 2D pose estimation, current deep learning based methods have exhibited impressive performance, but trade-offs among efficiency, robustness, and accuracy remain unavoidable in existing approaches. In principle, bottom-up methods are more efficient than top-down methods, but they are less accurate. To make full use of their respective advantages, in this paper we design a novel bidirectional optimization coupled lightweight network (BOCLN) architecture for efficient, robust, and general-purpose multi-person 2D (two-dimensional) pose estimation from natural images. Within the BOCLN framework, the bottom-up network focuses on global features, while the top-down network places emphasis on detailed features. The entire framework shares global features along the bottom-up data stream, while the top-down data stream aims to accelerate accurate pose estimation. In particular, to exploit priors on the relationships among human joints, we propose a probability limb heat map that represents the spatial context of the joints and guides the overall pose skeleton prediction, so that each person's pose estimation in cluttered scenes (involving crowds) can be as accurate and robust as possible. Benefiting from the novel BOCLN architecture, the time-consuming refinement procedure can therefore be simplified to an efficient lightweight network. Extensive experiments and evaluations on public benchmarks have confirmed that our new method is more efficient and robust, while still attaining competitive accuracy compared with state-of-the-art methods. Our BOCLN shows even greater promise in online applications.
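The abstract does not specify how the probability limb heat map is computed. As a rough illustration of the general idea only (a map that encodes the spatial context between two connected joints), the sketch below assumes a Gaussian falloff with distance from the line segment joining the joints. The function name `limb_heat_map`, the `sigma` parameter, and the Gaussian form are all hypothetical choices made for this example and are not taken from the paper.

```python
import math


def limb_heat_map(joint_a, joint_b, width, height, sigma=1.5):
    """Illustrative limb heat map (an assumption, not the paper's definition).

    Returns a height x width grid (list of rows). Each value decays as a
    Gaussian of the pixel's distance to the segment joint_a -> joint_b,
    so pixels lying on the limb get the value 1.0.
    """
    (ax, ay), (bx, by) = joint_a, joint_b
    dx, dy = bx - ax, by - ay
    seg_len_sq = dx * dx + dy * dy

    heat = []
    for y in range(height):
        row = []
        for x in range(width):
            if seg_len_sq == 0:
                # Degenerate limb: both joints coincide.
                t = 0.0
            else:
                # Project the pixel onto the segment and clamp to its ends.
                t = ((x - ax) * dx + (y - ay) * dy) / seg_len_sq
                t = max(0.0, min(1.0, t))
            px, py = ax + t * dx, ay + t * dy
            dist_sq = (x - px) ** 2 + (y - py) ** 2
            row.append(math.exp(-dist_sq / (2.0 * sigma ** 2)))
        heat.append(row)
    return heat


# Example: a horizontal limb from (2, 2) to (6, 2) on a 9 x 5 grid.
heat = limb_heat_map((2, 2), (6, 2), width=9, height=5)
```

In this sketch the grid is indexed as `heat[y][x]`, and values fall off symmetrically on either side of the limb. A learned network would predict such maps rather than compute them from known joints; the closed form here only shows what a target map of this kind might look like.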

Pages: 522-536

Page count: 14

50 items in total

[31] E2Pose: Fully Convolutional Networks for End-to-End Multi-Person Pose Estimation
Tobeta, Masakazu
Sawada, Yoshihide
Zheng, Ze
Takamuku, Sawa
Natori, Naotake
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 532 - 537
[32] DGCN: Dynamic Graph Convolutional Network for Efficient Multi-Person Pose Estimation
Qiu, Zhongwei
Qiu, Kai
Fu, Jianlong
Fu, Dongmei
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11924 - 11931
[33] Dual Networks Based 3D Multi-Person Pose Estimation From Monocular Video
Cheng, Yu
Wang, Bo
Tan, Robby T. T.
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (02) : 1636 - 1651
[34] Graph and Temporal Convolutional Networks for 3D Multi-person Pose Estimation in Monocular Videos
Cheng, Yu
Wang, Bo
Yang, Bo
Tan, Robby T.
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1157 - 1165
[35] Skeleton Cluster Tracking for robust multi-view multi-person 3D human pose estimation
Niu, Zehai
Lu, Ke
Xue, Jian
Wang, Jinbao
COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 246
[36] Direct Multi-view Multi-person 3D Pose Estimation
Wang, Tao
Zhang, Jianfeng
Cai, Yujun
Yan, Shuicheng
Feng, Jiashi
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[37] Multi-Person Hierarchical 3D Pose Estimation in Natural Videos
Gu, Renshu
Wang, Gaoang
Jiang, Zhongyu
Hwang, Jenq-Neng
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (11) : 4245 - 4257
[38] Multi-person 3D pose estimation from unlabelled data
Daniel Rodriguez-Criado
Pilar Bachiller-Burgos
George Vogiatzis
Luis J. Manso
Machine Vision and Applications, 2024, 35
[39] Dynamic Graph Reasoning for Multi-person 3D Pose Estimation
Qiu, Zhongwei
Yang, Qiansheng
Wang, Jian
Fu, Dongmei
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3521 - 3529
[40] Multi-person 3D pose estimation from unlabelled data
Rodriguez-Criado, Daniel
Bachiller-Burgos, Pilar
Vogiatzis, George
Manso, Luis J.
MACHINE VISION AND APPLICATIONS, 2024, 35 (03)