Improved STNNet, A benchmark for detection, tracking, and counting crowds using Drones

被引:1
|
作者
Nazeer, Mohd [1 ]
Sharma, Kanhaiya [2 ]
Sathappan, S. [1 ]
Srilatha, Pulipati [3 ]
Mohammed, Arshad Ahmad Khan [1 ,4 ]
机构
[1] Vidya Jyothi Inst Technol, Hyderabad 500075, India
[2] Symbiosis Int Univ, Symbiosis Inst Technol Pune, Pune 411021, India
[3] CBIT, Dept Artificial Intelligence & Data Sci, Hyderabad, India
[4] GITAM Univ, Hyderabad, India
关键词
Surveillance; Crowd counting; Tracking and localization; Neural network; Density estimation;
D O I
10.1016/j.mex.2024.102820
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In computer vision, navigating multi-object tracking in crowded scenes poses a fundamental challenge with broad applications ranging from surveillance systems to autonomous vehicles. Traditional tracking methods encounter difficulties associating noisy object detections and maintaining consistent labels across frames, particularly in scenarios like video surveillance for crowd control and public safety. This paper introduces 'Improved Space-Time Neighbor-Aware Network (STNNet),' an advanced framework for online Multi-Object Tracking (MOT) designed to address these challenges. Expanding upon the foundational STNNet architecture, our enhanced model incorporates deep reinforcement learning techniques to refine decision-making. By framing the online MOT problem as a Markov Decision Process (MDP), Improved STNNet learns a sophisticated policy for data association, adeptly handling complexities such as object birth/death and appearance/disappearance as state transitions within the MDP. Through extensive experimentation on benchmark datasets, including the MOT Challenge, our proposed Improved STNNet demonstrates superior performance, surpassing existing methods in demanding, crowded scenarios. This study showcases the effectiveness of our approach and lays the groundwork for advancing real-time video analysis applications, particularly in dynamic, crowded environments. Additionally, we utilize the dataset provided by STNNET for density map estimation, forming the basis for our research. center dot Develop an advanced framework for online Multi-Object Tracking (MOT) to address crowded scene challenges, particularly improving object association and label consistency across frames. center dot Explore integrating Deep Reinforcement learning techniques into the MOT framework, framing the problem as an MDP to refine decision-making and handle complexities such as object birth or death and appearance or disappearance transitions.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Automated detection, tracking, and counting of gray whales
    Sullivan, Kevin
    Fennell, Mark
    Perryman, Wayne
    Weller, David
    THERMOSENSE: THERMAL INFRARED APPLICATIONS XLII, 2020, 11409
  • [32] The Unmanned Aerial Vehicle Benchmark: Object Detection and Tracking
    Du, Dawei
    Qi, Yuankai
    Yu, Hongyang
    Yang, Yifan
    Duan, Kaiwen
    Li, Guorong
    Zhang, Weigang
    Huang, Qingming
    Tian, Qi
    COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 375 - 391
  • [33] Detection of Gender in Crowds Using ResNet Model
    Singh, Priyanka
    Vishwakarma, Rajeev G.
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (02) : 389 - 408
  • [34] Scale and density invariant head detection deep model for crowd counting in pedestrian crowds
    Khan, Sultan Daud
    Basalamah, Saleh
    VISUAL COMPUTER, 2021, 37 (08): : 2127 - 2137
  • [35] A Self-Training Approach for Point-Supervised Object Detection and Counting in Crowds
    Wang, Yi
    Hou, Junhui
    Hou, Xinyu
    Chau, Lap-Pui
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 2876 - 2887
  • [36] Using the Improved SSD Algorithm to Motion Target Detection and Tracking
    Yan, Yongjiang
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [37] Tracking in dense crowds using prominence and neighborhood motion concurrence
    Idrees, Haroon
    Warner, Nolan
    Shah, Mubarak
    IMAGE AND VISION COMPUTING, 2014, 32 (01) : 14 - 26
  • [38] Tracking of Multiple People in Crowds Using Laser Range Scanners
    Adiaviakoye, Ladji
    Patrick, Plainchault
    Marc, Bourcerie
    Auberlet, Jean-Michel
    2014 IEEE NINTH INTERNATIONAL CONFERENCE ON INTELLIGENT SENSORS, SENSOR NETWORKS AND INFORMATION PROCESSING (IEEE ISSNIP 2014), 2014,
  • [39] DecideNet: Counting Varying Density Crowds Through Attention Guided Detection and Density Estimation
    Liu, Jiang
    Gao, Chenqiang
    Meng, Deyu
    Hauptmann, Alexander G.
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5197 - 5206
  • [40] Scale and density invariant head detection deep model for crowd counting in pedestrian crowds
    Sultan Daud Khan
    Saleh Basalamah
    The Visual Computer, 2021, 37 : 2127 - 2137