Tutor-based learning of visual categories using different levels of supervision

Cited by: 2

Authors:

Fritz, Mario ^{[1,2]}

Kruijff, Geert-Jan M. ^{[3]}

Schiele, Bernt ^{[4,5]}

Affiliations:

[1] Univ Calif Berkeley, Dept EECS, Berkeley, CA 94720 USA

[2] ICSI, Berkeley, CA USA

[3] DFKI GmbH, Language Technol Lab, Saarbrucken, Germany

[4] Tech Univ Darmstadt, CS Dept, Saarbrucken, Germany

[5] MPI Informat, Saarbrucken, Germany

Source:

COMPUTER VISION AND IMAGE UNDERSTANDING | 2010 / Vol. 114 / Issue 05

Keywords:

Object categorization; Cross-modal learning; Tutor-based learning; Incremental learning; Interactive learning; Unsupervised learning; Semi-supervised learning;

DOI:

10.1016/j.cviu.2009.12.008

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In recent years we have seen a lot of strong work in visual recognition, dialogue interpretation and multi-modal learning that is targeted at providing the building blocks to enable intelligent robots to interact with humans in a meaningful way and even continuously evolve during this process. Building systems that unify those components under a common architecture has turned out to be challenging, as each approach comes with its own set of assumptions, restrictions, and implications. For example, the impact of recent progress in visual category recognition has been limited from the perspective of interactive systems. Reasons for this are diverse. We identify and address two major challenges in order to integrate modern techniques for visual categorization in an interactive learning system: reducing the number of required labeled training examples and dealing with potentially erroneous input. Today's object categorization methods use either supervised or unsupervised training methods. While supervised methods tend to produce more accurate results, unsupervised methods are highly attractive due to their potential to use far larger amounts of unlabeled training data. We propose a novel method that uses unsupervised training to obtain visual groupings of objects and a cross-modal learning scheme to overcome inherent limitations of purely unsupervised training. The method uses a unified and scale-invariant object representation that allows labeled as well as unlabeled information to be handled in a coherent way. First experiments demonstrate the ability of the system to learn object category models from many unlabeled observations and a few dialogue interactions that can be ambiguous or even erroneous. (c) 2010 Elsevier Inc. All rights reserved.

Pages: 564-573

Page count: 10

50 items in total

[21] Deep Learning-Based Recognition of Different Thyroid Cancer Categories Using Whole Frozen-Slide Images
Zhu, Xinyi
Chen, Cancan
Guo, Qiang
Ma, Jianhui
Sun, Fenglong
Lu, Haizhen
FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2022, 10
[22] A Constraint-Based Tutor for Learning Object-Oriented Analysis and Design using UML
Baghaei, Nilufar
Mitrovic, Antonija
Irwin, Warwick
TOWARDS SUSTAINABLE AND SCALABLE EDUCATIONAL INNOVATIONS INFORMED BY LEARNING SCIENCES, 2005, 133 : 11 - 18
[23] Tutor's actions in a rural South-African University using problem based learning
Gari Calzada, Mayra A.
Rivera Michelena, Natacha M.
REDU-REVISTA DE DOCENCIA UNIVERSITARIA, 2013, 11 (02) : 153 - 171
[24] The Effect of Collaborative Supervision Approaches and Collegial Supervision Techniques on Teacher Intensity Using Performance-Based Learning
Wiyono, Bambang Budi
Rasyad, Ach
Maisyaroh
SAGE OPEN, 2021, 11 (02)
[25] I can't believe there's no images! Learning Visual Tasks Using Only Language Supervision
Gu, Sophia
Clark, Christopher
Kembhavi, Aniruddha
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023 : 2672 - 2683
[26] EFFECT OF CORRELATED VISUAL AND TACTUAL FEEDBACK ON AUDITORY PATTERN LEARNING AT DIFFERENT AGE LEVELS
WOHLWILL, JF
JOURNAL OF EXPERIMENTAL CHILD PSYCHOLOGY, 1971, 11 (02) : 213 - &
[27] Forecasting movements of health-care stock prices based on different categories of news articles using multiple kernel learning
Shynkevich, Yauheniya
McGinnity, T. M.
Coleman, Sonya A.
Belatreche, Ammar
DECISION SUPPORT SYSTEMS, 2016, 85 : 74 - 83
[28] Using Visual Learning Analytics to Support Competence-based Learning
Villamane, Mikel
Alvarez, Ainhoa
Larranaga, Mikel
Caballero, Jessica
Hernandez-Rivas, Oscar
SIXTH INTERNATIONAL CONFERENCE ON TECHNOLOGICAL ECOSYSTEMS FOR ENHANCING MULTICULTURALITY (TEEM'18), 2018 : 333 - 338
[29] LITE: Intent-based Task Representation Learning Using Weak Supervision
Otani, Naoki
Gamon, Michael
Jauhar, Sujay Kumar
Yang, Mei
Malireddi, Sri Raghu
Riva, Oriana
NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022 : 2410 - 2424
[30] LEARNING VISUAL CATEGORIES THROUGH A SPARSE REPRESENTATION CLASSIFIER BASED CROSS-CATEGORY KNOWLEDGE TRANSFER
Lu, Ying
Chen, Liming
Saidi, Alexandre
Zhang, Zhaoxiang
Wang, Yunhong
2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014 : 165 - 169
