Spatio-temporal feature learning for enhancing video quality based on screen content characteristics

Times cited: 0
Authors
Huang, Ziyin [1]
Chan, Yui-Lam [1]
Tsang, Sik-Ho [1]
Kwong, Ngai-Wing [1]
Lam, Kin-Man [1]
Ling, Wing-Kuen [2]
Affiliations
[1] Hong Kong Polytech Univ, Dept Elect & Elect Engn, Hong Kong, Peoples R China
[2] Guangdong Univ Technol, Sch Informat Engn, Guangzhou, Guangdong, Peoples R China
Keywords
Screen content video; Quality enhancement; Deep learning; HEVC
DOI
10.1016/j.jvcir.2024.104270
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
With the rising demand for remote desktops and online meetings, screen content videos have drawn significant attention. Unlike natural videos, screen content videos often exhibit scene switches, where the content changes abruptly from one frame to the next. These scene switches cause obvious distortions in compressed videos. Frame freezing, where the content remains unchanged for a certain duration, is also very common in screen content videos. Existing alignment-based models struggle to enhance scene-switch frames effectively and are inefficient in frame-freezing situations. Therefore, we propose a novel alignment-free method that effectively handles both scene switches and frame freezing. In our approach, we develop a spatial and temporal feature extraction module that compresses and extracts spatio-temporal information from three groups of frame inputs. This enables efficient handling of scene switches. In addition, an edge-aware block is proposed for extracting edge information, which guides the model to focus on restoring the high-frequency components in frame-freezing situations. A fusion module is then designed to adaptively fuse the features from the three groups, taking into account the different positions of video frames, to enhance frames in both scene-switch and frame-freezing scenarios. Experimental results demonstrate that the proposed edge aware with spatio-temporal information fusion network (EAST) significantly improves the quality of compressed videos, surpassing current state-of-the-art methods.
Pages: 12