Tutor-based learning of visual categories using different levels of supervision

被引：2

作者：

Fritz, Mario ^{[1
,2
]}

Kruijff, Geert-Jan M. ^{[3
]}

Schiele, Bernt ^{[4
,5
]}

机构：

[1] Univ Calif Berkeley, Dept EECS, Berkeley, CA 94720 USA

[2] ICSI, Berkeley, CA USA

[3] DFKI GmbH, Language Technol Lab, Saarbrucken, Germany

[4] Tech Univ Darmstadt, CS Dept, Saarbrucken, Germany

[5] MPI Informat, Saarbrucken, Germany

来源：

COMPUTER VISION AND IMAGE UNDERSTANDING | 2010年 / 114卷 / 05期

关键词：

Object categorization; Cross-modal learning; Tutor-based learning; Incremental learning; Interactive learning; Unsupervised learning; Semi-supervised learning;

D O I：

10.1016/j.cviu.2009.12.008

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In recent years we have seen lots of strong work in visual recognition, dialogue interpretation and multi-modal learning that is targeted at provide the building blocks to enable intelligent robots to interact with humans in a meaningful way and even continuously evolve during this process. Building systems that unify those components under a common architecture has turned out to be challenging, as each approach comes with it's own set of assumptions, restrictions, and implications. For example, the impact of recent progress on visual category recognition has been limited from a perspective of interactive systems. Reasons for this are diverse. We identify and address two major challenges in order to integrate modern techniques for visual categorization in an interactive learning system: reducing the number of required labelled training examples and dealing with potentially erroneous input. Today's object categorization methods use either supervised or unsupervised training methods. While supervised methods tend to produce more accurate results, unsupervised methods are highly attractive due to their potential to use far more and unlabeled training data. We proposes a novel method that uses unsupervised training to obtain visual groupings of objects and a cross-modal learning scheme to overcome inherent limitations of purely unsupervised training. The method uses a unified and scale-invariant object representation that allows to handle labeled as well as unlabeled information in a coherent way. First experiments demonstrate the ability of the system to learn object category models from many unlabeled observations and a few dialogue interactions that can be ambiguous or even erroneous. (c) 2010 Elsevier Inc. All rights reserved.

引用

页码：564 / 573

页数：10

共 50 条

[1] Supporting System Development by Novice Software Engineers Using a Tutor-Based Software Visualization (TubVis) Approach
Sulaiman, Shahida
Rashid, NurAini Abdul
Abdullah, Rosni
Sulaiman, Sarina
INTERNATIONAL SYMPOSIUM OF INFORMATION TECHNOLOGY 2008, VOLS 1-4, PROCEEDINGS: COGNITIVE INFORMATICS: BRIDGING NATURAL AND ARTIFICIAL KNOWLEDGE, 2008, : 2818 - +
[2] Reciprocal learning in service-learning? Measuring bidirectional outcomes of college students and service recipients in tutor-based services in Hong Kong
Khiatani, Paul Vinod
Liu, Jacky Ka Kei
INNOVATIONS IN EDUCATION AND TEACHING INTERNATIONAL, 2020, 57 (03) : 364 - 373
[3] Learning categories at different hierarchical levels: A comparison of category learning models
Palmeri, TJ
PSYCHONOMIC BULLETIN & REVIEW, 1999, 6 (03) : 495 - 503
[4] Learning categories at different hierarchical levels: A comparison of category learning models
Thomas J. Palmeri
Psychonomic Bulletin & Review, 1999, 6 : 495 - 503
[5] End-to-end novel visual categories learning via auxiliary self-supervision
Qing, Yuanyuan
Zeng, Yijie
Cao, Qi
Huang, Guang-Bin
NEURAL NETWORKS, 2021, 139 : 24 - 32
[6] Fragment-based learning of visual object categories
Hegde, Jay
Bart, Evgeniy
Kersten, Daniel
CURRENT BIOLOGY, 2008, 18 (08) : 597 - 601
[7] Towards integration of different paradigms in modeling, representation, and learning of visual categories
Darmstadt University of Technology, Germany
Object Categorization: Computerl. and Hum. Vis. Perspect., (324-347):
[8] Problem-based learning tutor expertise: the need for different questions
Gilkison, A
MEDICAL EDUCATION, 2004, 38 (09) : 925 - 926
[9] TEAMWORK BASED LEARNING: USING AN AUTOMATIC TUTOR IN PROGRAMMING COURSES
Gonzalez-Guerra, Luis H.
Leal-Flores, Armandina J.
13TH INTERNATIONAL TECHNOLOGY, EDUCATION AND DEVELOPMENT CONFERENCE (INTED2019), 2019, : 72 - 78
[10] Learning to recognize generic visual categories using a hybrid structural approach
Burger, W
Burge, M
Mayr, W
INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, PROCEEDINGS - VOL II, 1996, : 321 - 324

← 1 2 3 4 5 →