PANDA: A Gigapixel-level Human-centric Video Dataset

Cited by: 44
Authors
Wang, Xueyang [1]
Zhang, Xiya [1]
Zhu, Yinheng [1]
Guo, Yuchen [1]
Yuan, Xiaoyun [1]
Xiang, Liuyu [1]
Wang, Zerun [1]
Ding, Guiguang [1]
Brady, David [2]
Dai, Qionghai [1]
Fang, Lu [1]
Affiliations
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Duke Univ, Durham, NC 27706 USA
Keywords
DATA SET; ATTENTION; OBJECT; MODEL;
DOI
10.1109/CVPR42600.2020.00333
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We present PANDA, the first gigaPixel-level humAN-centric viDeo dAtaset, for large-scale, long-term, and multi-object visual analysis. The videos in PANDA were captured by a gigapixel camera and cover real-world scenes with both wide field-of-view (~1 km² area) and high-resolution details (~gigapixel-level/frame). The scenes may contain 4k head counts with over 100× scale variation. PANDA provides enriched and hierarchical ground-truth annotations, including 15,974.6k bounding boxes, 111.8k fine-grained attribute labels, 12.7k trajectories, 2.2k groups and 2.9k interactions. We benchmark the human detection and tracking tasks. Due to the vast variance of pedestrian pose, scale, occlusion and trajectory, existing approaches are challenged by both accuracy and efficiency. Given the uniqueness of PANDA with both wide FoV and high resolution, a new task of interaction-aware group detection is introduced. We design a 'global-to-local zoom-in' framework, where global trajectories and local interactions are simultaneously encoded, yielding promising results. We believe PANDA will contribute to the community of artificial intelligence and praxeology by understanding human behaviors and interactions in large-scale real-world scenes. PANDA Website: http://www.panda-dataset.com.
Pages: 3265-3275
Page count: 11
Related papers
50 in total
  • [21] Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes
    Ju, Xuan
    Zeng, Ailing
    Wang, Jianan
    Xu, Qiang
    Zhang, Lei
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 618 - 629
  • [22] HAA500: Human-Centric Atomic Action Dataset with Curated Videos
    Chung, Jihoon
    Wuu, Cheng-Hsin
    Yang, Hsuan-Ru
    Tai, Yu-Wing
    Tang, Chi-Keung
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13445 - 13454
  • [23] Fuzzy Multimodal Graph Reasoning for Human-Centric Instructional Video Grounding
    Li, Yujie
    Jiang, Xun
    Xu, Xing
    Lu, Huimin
    Tao Shen, Heng
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2024, 32 (09) : 5046 - 5059
  • [24] Human-Centric Spatio-Temporal Video Grounding With Visual Transformers
    Tang, Zongheng
    Liao, Yue
    Liu, Si
    Li, Guanbin
    Jin, Xiaojie
    Jiang, Hongxu
    Yu, Qian
    Xu, Dong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (12) : 8238 - 8249
  • [25] Real-time human-centric segmentation for complex video scenes
    Yu, Ran
    Tian, Chenyu
    Xia, Weihao
    Zhao, Xinyuan
    Wang, Liejun
    Yang, Yujiu
    IMAGE AND VISION COMPUTING, 2022, 126
  • [26] Human-Centric Scene Understanding from Single View 360 Video
    Fowler, Sam
    Kim, Hansung
    Hilton, Adrian
    2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, : 334 - 342
  • [27] Representing and Retrieving Video Shots in Human-Centric Brain Imaging Space
    Han, Junwei
    Ji, Xiang
    Hu, Xintao
    Zhu, Dajiang
    Li, Kaiming
    Jiang, Xi
    Cui, Guangbin
    Guo, Lei
    Liu, Tianming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (07) : 2723 - 2736
  • [28] 'Skimming-Perusal' Detection: A Simple Object Detection Baseline in GigaPixel-level Images
    Zhang, Zhibin
    Xue, Wanli
    Zhang, Kaihua
    Chen, Shengyong
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2471 - 2476
  • [29] Human-centric smart manufacturing
    Wang, Baicun
    Peng, Tao
    Wang, Xi Vincent
    Wuest, Thorsten
    Romero, David
    Wang, Lihui
    JOURNAL OF MANUFACTURING SYSTEMS, 2023, 69 : 18 - 19
  • [30] Human-Centric Partitioning of the Environment
    Karaoguz, Hakan
    Bore, Nils
    Folkesson, John
    Jensfelt, Patric
    2017 26TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2017, : 844 - 850