Sharing visual features for multiclass and multiview object detection

Cited by: 336
Authors
Torralba, Antonio
Murphy, Kevin P.
Freeman, William T.
Affiliations
[1] MIT, Dept Elect Engn & Comp Sci, Cambridge, MA 02139 USA
[2] Univ British Columbia, Dept Comp Sci, Vancouver, BC V6T 1Z4, Canada
[3] Univ British Columbia, Dept Stat, Vancouver, BC V6T 1Z4, Canada
Funding
US National Science Foundation
Keywords
object detection; interclass transfer; sharing features; boosting; multiclass;
DOI
10.1109/TPAMI.2007.1055
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We consider the problem of detecting a large number of different classes of objects in cluttered scenes. Traditional approaches require applying a battery of different classifiers to the image, at multiple locations and scales. This can be slow and can require a lot of training data since each classifier requires the computation of many different image features. In particular, for independently trained detectors, the (runtime) computational complexity and the (training-time) sample complexity scale linearly with the number of classes to be detected. We present a multitask learning procedure, based on boosted decision stumps, that reduces the computational and sample complexity by finding common features that can be shared across the classes (and/or views). The detectors for each class are trained jointly, rather than independently. For a given performance level, the total number of features required and, therefore, the runtime cost of the classifier, is observed to scale approximately logarithmically with the number of classes. The features selected by joint training are generic edge-like features, whereas the features chosen by training each class separately tend to be more object-specific. The generic features generalize better and considerably reduce the computational cost of multiclass object detection.
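As an illustration of why sharing features lowers runtime cost, the following is a minimal sketch of scoring with shared decision stumps. It is not the authors' implementation: the `SharedStump` type, its field names, and the `score` function are hypothetical, and the stump form (one of two outputs depending on a threshold test, added to every class in a sharing subset) is a simplifying assumption. The point is only that each feature is evaluated once per stump, however many classes reuse its output.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SharedStump:
    """Hypothetical shared decision stump (illustrative names only)."""
    feature: int        # index of the image feature this stump tests
    threshold: float    # decision threshold on that feature
    above: float        # output when the feature exceeds the threshold
    below: float        # output otherwise
    classes: frozenset  # classes whose scores receive this stump's output


def score(features, stumps, n_classes):
    """Return per-class scores and the number of feature evaluations."""
    scores = [0.0] * n_classes
    evaluations = 0
    for stump in stumps:
        # The feature is computed and thresholded once per stump...
        evaluations += 1
        value = features[stump.feature]
        out = stump.above if value > stump.threshold else stump.below
        # ...and the result is reused by every class sharing the stump.
        for c in stump.classes:
            scores[c] += out
    return scores, evaluations


# Assumed toy data: two stumps, three classes, two features.
stumps = [
    SharedStump(0, 0.5, 1.0, -1.0, frozenset({0, 1, 2})),  # shared by all
    SharedStump(1, 0.0, 2.0, -0.5, frozenset({2})),        # class-specific
]
scores, evaluations = score([0.9, -1.0], stumps, n_classes=3)
print(scores, evaluations)
```

In this toy case three classes are scored with two feature evaluations; three independently trained detectors would each evaluate their own features, so their cost would grow with the number of classes.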
Pages: 854-869
Page count: 16