Can Visual Recognition Benefit from Auxiliary Information in Training?

Cited by: 20
Authors
Zhang, Qilin [1]
Hua, Gang [1]
Liu, Wei [2]
Liu, Zicheng [3]
Zhang, Zhengyou [3]
Affiliations
[1] Stevens Inst Technol, Hoboken, NJ 07030 USA
[2] IBM Thomas J Watson Res Ctr, Yorktown Hts, NY USA
[3] Microsoft Res, Redmond, WA USA
DOI
10.1007/978-3-319-16865-4_5
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We examine an under-explored visual recognition problem, where we have a main view along with an auxiliary view of visual information present in the training data, but only the main view is available in the test data. To effectively leverage the auxiliary view to train a stronger classifier, we propose a collaborative auxiliary learning framework based on a new discriminative canonical correlation analysis. This framework reveals a common semantic space shared across both views by enforcing a series of nonlinear projections. Such projections automatically embed the discriminative cues hidden in both views into the common space, and better visual recognition is thus achieved on test data that stem from only the main view. The efficacy of our proposed auxiliary learning approach is demonstrated through three challenging visual recognition tasks with different kinds of auxiliary information.
Pages: 65-80
Page count: 16