Do We Need More Training Data?

被引:0
|
作者
Xiangxin Zhu
Carl Vondrick
Charless C. Fowlkes
Deva Ramanan
机构
[1] UC Irvine,Department of Computer Science
[2] MIT,CSAIL
来源
关键词
Object detection; Mixture models; Part models;
D O I
暂无
中图分类号
学科分类号
摘要
Datasets for training object recognition systems are steadily increasing in size. This paper investigates the question of whether existing detectors will continue to improve as data grows, or saturate in performance due to limited model complexity and the Bayes risk associated with the feature spaces in which they operate. We focus on the popular paradigm of discriminatively trained templates defined on oriented gradient features. We investigate the performance of mixtures of templates as the number of mixture components and the amount of training data grows. Surprisingly, even with proper treatment of regularization and “outliers”, the performance of classic mixture models appears to saturate quickly (∼10\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\sim }10$$\end{document} templates and ∼100\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\sim }100$$\end{document} positive training examples per template). This is not a limitation of the feature space as compositional mixtures that share template parameters via parts and that can synthesize new templates not encountered during training yield significantly better performance. Based on our analysis, we conjecture that the greatest gains in detection performance will continue to derive from improved representations and learning algorithms that can make efficient use of large datasets.
引用
收藏
页码:76 / 92
页数:16
相关论文
共 50 条
  • [21] We Need More Continuity Training
    Doolittle, Benjamin R.
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2022, 14 (04)
  • [22] DO WE NEED MORE DOCTORS OR NOT
    PARKHOUSE, J
    PROCEEDINGS OF THE ROYAL SOCIETY OF MEDICINE-LONDON, 1976, 69 (11): : 815 - 821
  • [23] DO WE NEED MORE HOSPICES
    REGNARD, CFB
    BRITISH MEDICAL JOURNAL, 1993, 306 (6894): : 1754 - 1754
  • [24] Heinz Center report says we do need more data
    Froelich, A
    BIOSCIENCE, 2002, 52 (11) : 978 - 978
  • [25] Advanced endovascular training for vascular residents: What more do we need?
    Johnson, Colleen M.
    Hodgson, Kim J.
    SEMINARS IN VASCULAR SURGERY, 2006, 19 (04) : 194 - 199
  • [26] DO WE NEED TRAINING IN MANAGEMENT
    MAXIE, G
    CANADIAN VETERINARY JOURNAL-REVUE VETERINAIRE CANADIENNE, 1989, 30 (03): : 205 - 205
  • [27] DO WE NEED SURVIVAL TRAINING
    VICKERY, KN
    COLLEGE AND UNIVERSITY, 1972, 48 (01): : 5 - 9
  • [28] What data do we need for training an AV motion planner?
    Chen, Long
    Platinsky, Lukas
    Speichert, Stefanie
    Osinski, Blazej
    Scheel, Oliver
    Ye, Yawei
    Grimmett, Hugo
    Del Pero, Luca
    Ondruska, Peter
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 1066 - 1072
  • [29] We Don't Need More Data, We Need the Right Data
    Shah, Rashmee U.
    CIRCULATION, 2020, 142 (03) : 197 - 198
  • [30] Safe sedation practices among gastroenterology registrars: do we need more training?
    Mohanaruban, Aruchuna
    Bryce, Kathleen
    Radhakrishnan, Archchana
    Gallaher, Joseph
    Johnson, Gavin
    FRONTLINE GASTROENTEROLOGY, 2015, 6 (03) : 223 - 228