A survey of few shot learning based on intelligent perception

被引:0
|
作者
Song C. [1 ]
Zhao J. [1 ]
Wang K. [2 ]
Liang X. [1 ]
机构
[1] Science and Technology on Complex System Control and Intelligent Agent Cooperation Laboratory, Beijing
[2] School of Computer and Technique, Fudan University, Shanghai
关键词
Few-shot learning; Image classification; Machine learning; Object detection; Zero-shot learning;
D O I
10.7527/S1000-6893.2019.23756
中图分类号
学科分类号
摘要
Few-shot learning refers to using only a small amount of supervision information of the target class to train the machine learning model. Due to its practical values, recent advances in few-shot learning by academia and industry have made significant contributions. However, there were few reviews on this issue in China. This paper systematically summarizes and explores the few-shot learning algorithms and the object detection algorithms based on few-shot learning. Firstly, the problem definition of few-shot learning is given, and its connections with other classic machine learning problems are also enumerated. Meanwhile, the theoretical challenges of the problem of few-shot learning are explained. Then, we summarize the image classification based on few-shot learning, and analyze its representative works. Based on this, we focus on the problem of few-shot object detection, especially the problem of zero-shot object detection, and analyze the existing research works in detail. Finally, we look forward to the future development of few-shot learning in terms of problem setting, theoretical research, implementation technology, and application scenarios based on the advantages and disadvantages of the existing methods. It is expected to provide inspirations for the subsequent research works in this field. © 2020, Beihang University Aerospace Knowledge Press. All right reserved.
引用
收藏
相关论文
共 99 条
  • [21] KINGMA D P, MOHAMED S, REZENDE D J, Et al., Semi-supervised learning with deep generative models, Advances in Neural Information Processing Systems, pp. 3581-3589, (2014)
  • [22] BUCCINO G, VOGT S, RITZL A, Et al., Neural circuits underlying imitation learning of hand actions: An event-related fMRI study, Neuron, 42, 2, pp. 323-334, (2004)
  • [23] SMEULDERS A W M, WORRING M, SANTINI S, Et al., Content-based image retrieval at the end of the early years, IEEE Transactions on Pattern Analysis & Machine Intelligence, 12, pp. 1349-1380, (2000)
  • [24] BLACKMAN S., Multiple-target tracking with radar applications, (1986)
  • [25] FREEMAN W T, ROTH M., Orientation histograms for hand gesture recognition, International Workshop on Automatic Face and Gesture Recognition, pp. 296-301, (1995)
  • [26] XU K, BA J, KIROS R, Et al., Show, attend and tell: Neural image caption generation with visual attention, International Conference on Machine Learning, pp. 2048-2057, (2015)
  • [27] ANTOL S, AGRAWAL A, LU J, Et al., Vqa: Visual question answering, Proceedings of the IEEE International Conference on Computer Vision, pp. 2425-2433, (2015)
  • [28] MEDIONI G, COHEN I, BREMOND F, Et al., Event detection and analysis from video streams, IEEE Transactions on Pattern Analysis and Machine Intelligence, 23, 8, pp. 873-889, (2001)
  • [29] BENGIO Y, DUCHARME R, VINCENT P, Et al., A neural probabilistic language model, Journal of Machine Learning Research, 3, 2, pp. 1137-1155, (2003)
  • [30] ZOPH B, LE Q V., Neural architecture search with reinforcement learning