Tutor-based learning of visual categories using different levels of supervision

Cited by: 2
Authors
Fritz, Mario [1 ,2 ]
Kruijff, Geert-Jan M. [3 ]
Schiele, Bernt [4 ,5 ]
Affiliations
[1] Univ Calif Berkeley, Dept EECS, Berkeley, CA 94720 USA
[2] ICSI, Berkeley, CA USA
[3] DFKI GmbH, Language Technol Lab, Saarbrucken, Germany
[4] Tech Univ Darmstadt, CS Dept, Saarbrucken, Germany
[5] MPI Informat, Saarbrucken, Germany
Keywords
Object categorization; Cross-modal learning; Tutor-based learning; Incremental learning; Interactive learning; Unsupervised learning; Semi-supervised learning;
DOI
10.1016/j.cviu.2009.12.008
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In recent years we have seen a great deal of strong work in visual recognition, dialogue interpretation and multi-modal learning that aims to provide the building blocks to enable intelligent robots to interact with humans in a meaningful way and even continuously evolve during this process. Building systems that unify those components under a common architecture has turned out to be challenging, as each approach comes with its own set of assumptions, restrictions, and implications. For example, the impact of recent progress on visual category recognition has been limited from the perspective of interactive systems. Reasons for this are diverse. We identify and address two major challenges in order to integrate modern techniques for visual categorization in an interactive learning system: reducing the number of required labeled training examples and dealing with potentially erroneous input. Today's object categorization methods use either supervised or unsupervised training methods. While supervised methods tend to produce more accurate results, unsupervised methods are highly attractive due to their potential to use far more training data, since that data can be unlabeled. We propose a novel method that uses unsupervised training to obtain visual groupings of objects and a cross-modal learning scheme to overcome inherent limitations of purely unsupervised training. The method uses a unified and scale-invariant object representation that allows labeled as well as unlabeled information to be handled in a coherent way. First experiments demonstrate the ability of the system to learn object category models from many unlabeled observations and a few dialogue interactions that can be ambiguous or even erroneous. (c) 2010 Elsevier Inc. All rights reserved.
Pages: 564-573
Number of pages: 10