Towards Real-World HDR Video Reconstruction: A Large-Scale Benchmark Dataset and A Two-Stage Alignment Network

被引:0
|
作者
Shu, Yong [1 ]
Shen, Liquan [1 ]
Hu, Xiangyu [1 ]
Li, Mengyao [1 ]
Zhou, Zihao [1 ]
机构
[1] Shanghai Univ, Shanghai, Peoples R China
关键词
D O I
10.1109/CVPR52733.2024.00278
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As an important and practical way to obtain high dynamic range (HDR) video, HDR video reconstruction from sequences with alternating exposures is still less explored, mainly due to the lack of large-scale real-world datasets. Existing methods are mostly trained on synthetic datasets, which perform poorly in real scenes. In this work, to facilitate the development of real-world HDR video reconstruction, we present Real-HDRV, a large-scale real-world benchmark dataset for HDR video reconstruction, featuring various scenes, diverse motion patterns, and high-quality labels. Specifically, our dataset contains 500 LDRs-HDRs video pairs, comprising about 28,000 LDR frames and 4,000 HDR labels, covering daytime, nighttime, indoor, and outdoor scenes. To our best knowledge, our dataset is the largest real-world HDR video reconstruction dataset. Correspondingly, we propose an end-to-end network for HDR video reconstruction, where a novel two-stage strategy is designed to perform alignment sequentially. Specifically, the first stage performs global alignment with the adaptively estimated global offsets, reducing the difficulty of subsequent alignment. The second stage implicitly performs local alignment in a coarse-to-fine manner at the feature level using the adaptive separable convolution. Extensive experiments demonstrate that: (1) models trained on our dataset can achieve better performance on real scenes than those trained on synthetic datasets; (2) our method outperforms previous state-of-the-art methods. Our dataset is available at https://github.com/yungsyu99/Real-HDRV.
引用
收藏
页码:2879 / 2888
页数:10
相关论文
共 50 条
  • [1] HDR Video Reconstruction: A Coarse-to-fine Network and A Real-world Benchmark Dataset
    Chen, Guanying
    Chen, Chaofeng
    Guo, Shi
    Liang, Zhetong
    Wong, Kwan-Yee K.
    Zhang, Lei
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 2482 - 2491
  • [2] UVEB: A Large-scale Benchmark and Baseline Towards Real-World Underwater Video Enhancement
    Xie, Yaofeng
    Kong, Lingwei
    Chen, Kai
    Zheng, Ziqiang
    Yu, Xiao
    Yu, Zhibin
    Zheng, Bing
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 22358 - 22367
  • [3] RVDNET: A TWO-STAGE NETWORK FOR REAL-WORLD VIDEO DESNOWING WITH DOMAIN ADAPTATION
    Xue, Tianhao
    Zhou, Gang
    He, Runlin
    Wang, Zhong
    Chen, Juan
    Jia, Zhenhong
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 3305 - 3309
  • [4] Real-World Video Deblurring: A Benchmark Dataset and an Efficient Recurrent Neural Network
    Zhihang Zhong
    Ye Gao
    Yinqiang Zheng
    Bo Zheng
    Imari Sato
    International Journal of Computer Vision, 2023, 131 : 284 - 301
  • [5] Real-World Video Deblurring: A Benchmark Dataset and an Efficient Recurrent Neural Network
    Zhong, Zhihang
    Gao, Ye
    Zheng, Yinqiang
    Zheng, Bo
    Sato, Imari
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (01) : 284 - 301
  • [6] Towards Real-World Prohibited Item Detection: A Large-Scale X-ray Benchmark
    Wang, Boying
    Zhang, Libo
    Wen, Longyin
    Liu, Xianglong
    Wu, Yanjun
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5392 - 5401
  • [7] Tracking Large-Scale Video Remix in Real-World Events
    Xie, Lexing
    Natsev, Apostol
    He, Xuming
    Kender, John R.
    Hill, Matthew
    Smith, John R.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2013, 15 (06) : 1244 - 1254
  • [8] Measurement of Malware Family Classification on a Large-Scale Real-World Dataset
    Wang, Qinqin
    Yan, Hanbing
    Zhao, Chang
    Mei, Rui
    Han, Zhihui
    Zhou, Yu
    2022 IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, 2022, : 1390 - 1397
  • [9] RCooper: A Real-world Large-scale Dataset for Roadside Cooperative Perception
    Hao, Ruiyang
    Fan, Siqi
    Dai, Yingru
    Zhang, Zhenlin
    Li, Chenxi
    Wang, Yuntian
    Yu, Haibao
    Yang, Wenxian
    Yuan, Jirui
    Nie, Zaiqing
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 22347 - 22357
  • [10] A Large-scale Benchmark Dataset for Event Recognition in Surveillance Video
    Oh, Sangmin
    Hoogs, Anthony
    Perera, Amitha
    Cuntoor, Naresh
    Chen, Chia-Chih
    Lee, Jong Taek
    Mukherjee, Saurajit
    Aggarwal, J. K.
    Lee, Hyungtae
    Davis, Larry
    Swears, Eran
    Wang, Xioyang
    Ji, Qiang
    Reddy, Kishore
    Shah, Mubarak
    Vondrick, Carl
    Pirsiavash, Hamed
    Ramanan, Deva
    Yuen, Jenny
    Torralba, Antonio
    Song, Bi
    Fong, Anesco
    Roy-Chowdhury, Amit
    Desai, Mita
    2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011,