Coarse-to-Fine Localization of Temporal Action Proposals

被引:24
|
作者
Long, Fuchen [1 ]
Yao, Ting [2 ]
Qiu, Zhaofan [1 ]
Tian, Xinmei [1 ]
Mei, Tao [2 ]
Luo, Jiebo [3 ]
机构
[1] Univ Sci & Technol China, Elect Engn & Informat Sci, Hefei 230027, Peoples R China
[2] JD AI Res, Vis & Multimedia Lab, Beijing 100105, Peoples R China
[3] Univ Rochester, Dept Comp Sci, Rochester, NY 14604 USA
关键词
Proposals; Videos; Painting; Brushes; Microsoft Windows; Task analysis; Feature extraction; Action Proposals; Action Recognition; Action Detection; Video Captioning;
D O I
10.1109/TMM.2019.2943204
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Localizing temporal action proposals from long videos is a fundamental challenge in video analysis (e.g., action detection and recognition or dense video captioning). Most existing approaches often overlook the hierarchical granularities of actions and thus fail to discriminate fine-grained action proposals (e.g., hand washing laundry or changing a tire in vehicle repair). In this paper, we propose a novel coarse-to-fine temporal proposal (CFTP) approach to localize temporal action proposals by exploring different action granularities. Our proposed CFTP consists of three stages: a coarse proposal network (CPN) to generate long action proposals, a temporal convolutional anchor network (CAN) to localize finer proposals, and a proposal reranking network (PRN) to further identify proposals from previous stages. Specifically, CPN explores three complementary actionness curves (namely pointwise, pairwise, and recurrent curves) that represent actions at different levels for generating coarse proposals, while CAN refines these proposals by a multiscale cascaded 1D-convolutional anchor network. In contrast to existing works, our coarse-to-fine approach can progressively localize fine-grained action proposals. We conduct extensive experiments on two action benchmarks (THUMOS14 and ActivityNet v1.3) and demonstrate the superior performance of our approach when compared to the state-of-the-art techniques on various video understanding tasks.
引用
收藏
页码:1577 / 1590
页数:14
相关论文
共 50 条
  • [41] A coarse-to-fine approach for pericardial effusion localization and segmentation in chest CT scans
    Liu, Jiamin
    Chellamuthu, Karthik
    Lu, Le
    Bagheri, Mohammadhadi
    Summers, Ronald M.
    [J]. MEDICAL IMAGING 2018: COMPUTER-AIDED DIAGNOSIS, 2018, 10575
  • [42] Tunnel crack detection using coarse-to-fine region localization and edge detection
    Li, Ce
    Xu, Pinjie
    Niu, Lijinliang
    Chen, Yuan
    Sheng, Longshuai
    Liu, Mingcun
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 9 (05)
  • [43] Coarse-to-fine pedestrian localization and silhouette extraction for the gait challenge data sets
    Lu, Haiping
    Plataniotis, K. N.
    Venetsanopoulos, A. N.
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 1009 - +
  • [44] Coarse-to-fine 3D facial landmark localization based on keypoints
    基于关键点的由粗到精三维人脸特征点定位
    [J]. Da, Feipeng (dafp@seu.edu.cn), 2018, Science Press (39):
  • [45] CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding
    Hou, Zhijian
    Zhong, Wanjun
    Ji, Lei
    Gao, Difei
    Yan, Kun
    Chan, Wing-Kwong
    Ngo, Chong-Wah
    Shou, Mike Zheng
    Duan, Nan
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 8013 - 8028
  • [46] Coarse-to-fine multiple testing strategies
    Lahouel, Kamel
    Geman, Donald
    Younes, Laurent
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2019, 13 (01): : 1292 - 1328
  • [47] Coarse-to-Fine Contrastive Learning on Graphs
    Zhao, Peiyao
    Pan, Yuangang
    Li, Xin
    Chen, Xu
    Tsang, Ivor W.
    Liao, Lejian
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 4622 - 4634
  • [48] A coarse-to-fine method for shape recognition
    Tang H.-X.
    Wei H.
    [J]. Journal of Computer Science and Technology, 2007, 22 (02) : 330 - 334
  • [49] Coarse-to-Fine Deep Kernel Networks
    Sahbi, Hichem
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 1131 - 1139
  • [50] A Coarse-to-Fine Network for Craniopharyngioma Segmentation
    Yu, Yijie
    Zhang, Lei
    Shu, Xin
    Wang, Zizhou
    Chen, Chaoyue
    Xu, Jianguo
    [J]. MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2022, 2022, 13583 : 91 - 100