Listen to Look: Action Recognition by Previewing Audio

被引：132

作者：

Gao, Ruohan ^{[1
,2
]}

Oh, Tae-Hyun ^{[2
,3
]}

Grauman, Kristen ^{[1
,2
]}

Torresani, Lorenzo ^{[2
]}

机构：

[1] Univ Texas Austin, Austin, TX 78712 USA

[2] Facebook AI Res, Austin, TX 78701 USA

[3] POSTECH, Dept EE, Pohang, South Korea

来源：

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2020年

关键词：

D O I：

10.1109/CVPR42600.2020.01047

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In the face of the video data deluge, today's expensive clip-level classifiers are increasingly impractical. We propose a framework for efficient action recognition in untrimmed video that uses audio as a preview mechanism to eliminate both short-term and long-term visual redun-dancies. First, we devise an IMGAUD2VID framework that hallucinates clip-level features by distilling from lighter modalities-a single frame and its accompanying audio-reducing short-term temporal redundancy for efficient clip-level recognition. Second, building on IMGAUD2VID, we further propose IMGAUD-SKIMMING, an attention-based long short-term memory network that iteratively selects useful moments in untrimmed videos, reducing long-term temporal redundancy for efficient video-level recognition. Extensive experiments on four action recognition datasets demonstrate that our method achieves the state-of-the-art in terms of both recognition accuracy and speed.

引用

页码：10454 / 10464

页数：11

共 50 条

[21] Look, Listen, Move
Hogan, Brian J.
MANUFACTURING ENGINEERING, 2010, 144 (01): : 6 - 6
[22] Learning To Look and Listen
Atwater, Reginald M.
AMERICAN JOURNAL OF PUBLIC HEALTH AND THE NATIONS HEALTH, 1951, 41 (09): : 1140 - 1140
[23] SHOP, LOOK AND LISTEN
不详
MANAGEMENT OF WORLD WASTES, 1985, 28 (07): : 4 - 4
[24] To listen, look and think
Gjersvik, Petter
TIDSSKRIFT FOR DEN NORSKE LAEGEFORENING, 2015, 135 (14) : 1217 - 1217
[25] Stop, look and listen
Dickie, RA
JOURNAL OF COATINGS TECHNOLOGY, 2000, 72 (908): : 7 - 7
[26] LOOK - LISTEN - READ
YABROFF, L
LIBRARY JOURNAL, 1960, 85 (10) : 1866 - 1868
[27] Learning to Look and Listen
不详
VOLTA REVIEW, 1951, 53 (09) : 432 - 432
[28] Stop, look, listen!
Hogan, BJ
MANUFACTURING ENGINEERING, 2005, 135 (03): : 12 - 12
[29] STOP - LOOK - LISTEN
HERSHENSON, BR
AMERICAN JOURNAL OF PHARMACEUTICAL EDUCATION, 1990, 54 (02) : 216 - 217
[30] Look, Listen and Learn
Arandjelovic, Relja
Zisserman, Andrew
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 609 - 617

← 1 2 3 4 5 →