Sharing visual features for multiclass and multiview object detection

被引:336
|
作者
Torralba, Antonio
Murphy, Kevin P.
Freeman, William T.
机构
[1] MIT, Dept Elect Engn & Comp Sci, Cambridge, MA 02139 USA
[2] Univ British Columbia, Dept Comp Sci, Vancouver, BC V6T 1Z4, Canada
[3] Univ British Columbia, Dept Stat, Vancouver, BC V6T 1Z4, Canada
基金
美国国家科学基金会;
关键词
object detection; interclass transfer; sharing features; boosting; multiclass;
D O I
10.1109/TPAMI.2007.1055
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider the problem of detecting a large number of different classes of objects in cluttered scenes. Traditional approaches require applying a battery of different classifiers to the image, at multiple locations and scales. This can be slow and can require a lot of training data since each classifier requires the computation of many different image features. In particular, for independently trained detectors, the ( runtime) computational complexity and the ( training-time) sample complexity scale linearly with the number of classes to be detected. We present a multitask learning procedure, based on boosted decision stumps, that reduces the computational and sample complexity by finding common features that can be shared across the classes ( and/or views). The detectors for each class are trained jointly, rather than independently. For a given performance level, the total number of features required and, therefore, the runtime cost of the classifier, is observed to scale approximately logarithmically with the number of classes. The features selected by joint training are generic edge-like features, whereas the features chosen by training each class separately tend to be more object-specific. The generic features generalize better and considerably reduce the computational cost of multiclass object detection.
引用
收藏
页码:854 / 869
页数:16
相关论文
共 50 条
  • [31] The influence of location and visual features on visual object memory
    Sun, Hsin-Mei
    Gordon, Robert D.
    MEMORY & COGNITION, 2010, 38 (08) : 1049 - 1057
  • [32] Joint Multiclass Object Detection and Semantic Segmentation for Autonomous Driving
    Abdigapporov, Shakhboz
    Miraliev, Shokhrukh
    Kakani, Vijay
    Kim, Hakil
    IEEE ACCESS, 2023, 11 : 37637 - 37649
  • [33] Object detection using hausdorff distance and multiclass discriminative field
    Zhang, Xiaofeng
    Ding, Hong
    Cheng, Rengui
    International Journal of Signal Processing, Image Processing and Pattern Recognition, 2013, 6 (01) : 109 - 122
  • [34] The influence of location and visual features on visual object memory
    Hsin-Mei Sun
    Robert D. Gordon
    Memory & Cognition, 2010, 38 : 1049 - 1057
  • [35] HIERARCHY OF VISUAL FEATURES FOR OBJECT RECOGNITION
    Gupta, Nitin
    Das, Sukhendu
    Chakraborti, Sutanu
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 5901 - 5905
  • [36] VISUAL OBJECT RECOGNITION - AXES OR FEATURES
    WARRINGTON, EK
    JOURNAL OF CLINICAL AND EXPERIMENTAL NEUROPSYCHOLOGY, 1990, 12 (01) : 32 - 32
  • [37] Visual Object Detection: A Review
    Wang, Zuyi
    Jiao, Bowen
    Xu, Li
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 7106 - 7112
  • [38] A Multiclass Model Observer for Multislice-Multiview Images
    Gifford, H. C.
    Lehovich, A.
    King, M. A.
    2006 IEEE NUCLEAR SCIENCE SYMPOSIUM CONFERENCE RECORD, VOL 1-6, 2006, : 1687 - 1691
  • [39] Visualizing Object Detection Features
    Vondrick, Carl
    Khosla, Aditya
    Pirsiavash, Hamed
    Malisiewicz, Tomasz
    Torralba, Antonio
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 119 (02) : 145 - 158
  • [40] Visualizing Object Detection Features
    Carl Vondrick
    Aditya Khosla
    Hamed Pirsiavash
    Tomasz Malisiewicz
    Antonio Torralba
    International Journal of Computer Vision, 2016, 119 : 145 - 158