Estimation of Pedestrian Pose Orientation Using Soft Target Training Based on Teacher-Student Framework

被引:11
|
作者
Heo, DuYeong [1 ]
Nam, Jae Yeal [1 ]
Ko, Byoung Chul [1 ]
机构
[1] Keimyung Univ, Dept Comp Engn, Daegu 42601, South Korea
基金
新加坡国家研究基金会;
关键词
soft-target training; teacher-student algorithm; model compression; pedestrian orientation; wavelet transform; BODY ORIENTATION; CLASSIFICATION;
D O I
10.3390/s19051147
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Semi-supervised learning is known to achieve better generalisation than a model learned solely from labelled data. Therefore, we propose a new method for estimating a pedestrian pose orientation using a soft-target method, which is a type of semi-supervised learning method. Because a convolutional neural network (CNN) based pose orientation estimation requires large numbers of parameters and operations, we apply the teacher-student algorithm to generate a compressed student model with high accuracy and compactness resembling that of the teacher model by combining a deep network with a random forest. After the teacher model is generated using hard target data, the softened outputs (soft-target data) of the teacher model are used for training the student model. Moreover, the orientation of the pedestrian has specific shape patterns, and a wavelet transform is applied to the input image as a pre-processing step owing to its good spatial frequency localisation property and the ability to preserve both the spatial information and gradient information of an image. For a benchmark dataset considering real driving situations based on a single camera, we used the TUD and KITTI datasets. We applied the proposed algorithm to various driving images in the datasets, and the results indicate that its classification performance with regard to the pose orientation is better than that of other state-of-the-art methods based on a CNN. In addition, the computational speed of the proposed student model is faster than that of other deep CNNs owing to the shorter model structure with a smaller number of parameters.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Teacher-Student Model Using Grounding DINO and You Only Look Once for Multi-Sensor-Based Object Detection
    Son, Jinhwan
    Jung, Heechul
    APPLIED SCIENCES-BASEL, 2024, 14 (06):
  • [42] There is no mouse: using a virtual mouse to generate training data for video-based pose estimation
    Guido T. Meijer
    Jaime Arlandis
    Anne E. Urai
    Lab Animal, 2021, 50 : 172 - 173
  • [43] There is no mouse: using a virtual mouse to generate training data for video-based pose estimation
    Meijer, Guido T.
    Arlandis, Jaime
    Urai, Anne E.
    LAB ANIMAL, 2021, 50 (07) : 172 - 173
  • [44] Object Pose Estimation Using Soft Tactile Sensor Based on Manifold Particle Filter with Continuous Observation
    Yagi, Hiraku
    Kobayashi, Yuichi
    Kato, Daisuke
    Miyazawa, Noritsugu
    Hara, Kosuke
    Usui, Dotaro
    2023 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION, SII, 2023,
  • [45] Speech Enhancement Based on Teacher-Student Deep Learning Using Improved Speech Presence Probability for Noise-Robust Speech Recognition
    Tu, Yan-Hui
    Du, Jun
    Lee, Chin-Hui
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (12) : 2080 - 2091
  • [46] Student becomes teacher: training faster deep learning lightweight networks for automated identification of optical coherence tomography B-scans of interest using a student-teacher framework
    Owen, Julia P.
    Blazes, Marian
    Manivannan, Niranchana
    Lee, Gary C.
    Yu, Sophia
    Durbin, Mary K.
    Nair, Aditya
    Singh, Rishi P.
    Talcott, Katherine E.
    Melo, Alline G.
    Greenlee, Tyler
    Chen, Eric R.
    Conti, Thais F.
    Lee, Cecilia S.
    Lee, Aaron Y.
    BIOMEDICAL OPTICS EXPRESS, 2021, 12 (09) : 5387 - 5399
  • [47] Multimedia based student-teacher smart interaction framework using multi-agents in eLearning
    Muhammad Munwar Iqbal
    Yasir Saleem
    Kashif Naseer
    Mucheol Kim
    Multimedia Tools and Applications, 2018, 77 : 5003 - 5026
  • [48] Multimedia based student-teacher smart interaction framework using multi-agents in eLearning
    Iqbal, Muhammad Munwar
    Saleem, Yasir
    Naseer, Kashif
    Kim, Mucheol
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (04) : 5003 - 5026
  • [49] Deep Learning-Based Target Pose Estimation Using LiDAR Measurements in Active Debris Removal Operations
    Aldao Pensado, Enrique
    Gonzalez de Santos, Luis Miguel
    Gonzalez Jorge, Higinio
    Sanjurjo-Rivo, Manuel
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2023, 59 (05) : 5658 - 5670
  • [50] Model-Based Disturbance Estimation for a Fiber-Reinforced Soft Manipulator using Orientation Sensing
    Cangan, Barnabas Gavin
    Navarro, Stefan Escaida
    Yang, Bai
    Zhang, Yu
    Duriez, Christian
    Katzschmann, Robert K.
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 9424 - 9430