Can Visual Recognition Benefit from Auxiliary Information in Training?

Cited by: 20

Authors:

Zhang, Qilin [1]

Hua, Gang [1]

Liu, Wei [2]

Liu, Zicheng [3]

Zhang, Zhengyou [3]

Affiliations:

[1] Stevens Inst Technol, Hoboken, NJ 07030 USA

[2] IBM Thomas J Watson Res Ctr, Yorktown Hts, NY USA

[3] Microsoft Res, Redmond, WA USA

Source:

COMPUTER VISION - ACCV 2014, PT I | 2015 / Vol. 9003

DOI:

10.1007/978-3-319-16865-4_5

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We examine an under-explored visual recognition problem, where we have a main view along with an auxiliary view of visual information present in the training data, but merely the main view is available in the test data. To effectively leverage the auxiliary view to train a stronger classifier, we propose a collaborative auxiliary learning framework based on a new discriminative canonical correlation analysis. This framework reveals a common semantic space shared across both views through enforcing a series of nonlinear projections. Such projections automatically embed the discriminative cues hidden in both views into the common space, and better visual recognition is thus achieved on the test data that stems from only the main view. The efficacy of our proposed auxiliary learning approach is demonstrated through three challenging visual recognition tasks with different kinds of auxiliary information.
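
The setting in the abstract (two views at training time, only the main view at test time) can be illustrated with a small sketch. This is not the paper's discriminative, nonlinear method: it substitutes plain linear CCA and a nearest-centroid classifier, and the synthetic data, function names (`fit_linear_cca`, `fit_centroids`, `predict`, `run_demo`), and parameters are all assumptions made for illustration. The point it shows is that the auxiliary view shapes the learned projection, while prediction needs only the main-view projection.

```python
import numpy as np


def fit_linear_cca(X, Z, dim, reg=1e-3):
    """Plain linear CCA between main view X and auxiliary view Z.

    Returns the main-view mean, the main-view projection, and the top
    canonical correlations. The auxiliary projection is discarded because
    the auxiliary view is assumed to be unavailable at test time.
    """
    mean_x, mean_z = X.mean(axis=0), Z.mean(axis=0)
    Xc, Zc = X - mean_x, Z - mean_z
    n = X.shape[0]
    # Regularized covariance and cross-covariance estimates.
    Cxx = Xc.T @ Xc / n + reg * np.eye(X.shape[1])
    Czz = Zc.T @ Zc / n + reg * np.eye(Z.shape[1])
    Cxz = Xc.T @ Zc / n
    # Whiten both views with Cholesky factors: T = Lx^{-1} Cxz Lz^{-T}.
    Lx = np.linalg.cholesky(Cxx)
    Lz = np.linalg.cholesky(Czz)
    T = np.linalg.solve(Lx, Cxz)
    T = np.linalg.solve(Lz, T.T).T
    U, s, _ = np.linalg.svd(T, full_matrices=False)
    # Map the whitened directions back to the original main-view space.
    Wx = np.linalg.solve(Lx.T, U[:, :dim])
    return mean_x, Wx, s[:dim]


def fit_centroids(P, y):
    """Class centroids of projected training points P."""
    classes = np.unique(y)
    centroids = np.stack([P[y == c].mean(axis=0) for c in classes])
    return classes, centroids


def predict(P, classes, centroids):
    """Assign each projected point to its nearest class centroid."""
    d = ((P[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return classes[d.argmin(axis=1)]


def run_demo(seed=0, n_train=400, n_test=200):
    """Synthetic two-class example; both views share one latent signal."""
    rng = np.random.default_rng(seed)
    a = rng.normal(size=10)  # hypothetical main-view loading
    b = rng.normal(size=5)   # hypothetical auxiliary-view loading

    def sample(n):
        y = rng.integers(0, 2, size=n)
        t = (2.0 * y - 1.0) * 2.0 + 0.5 * rng.normal(size=n)
        X = np.outer(t, a) + rng.normal(size=(n, 10))
        Z = np.outer(t, b) + 0.5 * rng.normal(size=(n, 5))
        return X, Z, y

    X_tr, Z_tr, y_tr = sample(n_train)
    X_te, _, y_te = sample(n_test)  # auxiliary view is dropped at test time

    mean_x, Wx, corrs = fit_linear_cca(X_tr, Z_tr, dim=1)
    classes, centroids = fit_centroids((X_tr - mean_x) @ Wx, y_tr)
    pred = predict((X_te - mean_x) @ Wx, classes, centroids)
    return float((pred == y_te).mean()), corrs


if __name__ == "__main__":
    accuracy, correlations = run_demo()
    print("test accuracy (main view only):", accuracy)
    print("top canonical correlation:", correlations)
```

Only `mean_x` and `Wx` are needed once training ends, which mirrors the constraint that test data comes from the main view alone.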

Pages: 65-80

Number of pages: 16

