HIDDEN MARKOV MODEL FOR EYE GAZE PREDICTION IN NETWORKED VIDEO STREAMING

Cited by: 0
Authors
Feng, Yunlong [1]
Cheung, Gene [2]
Tan, Wai-tian [3]
Ji, Yusheng [2]
Affiliations
[1] Grad Univ Adv Studies, Hayama, Kanagawa, Japan
[2] Natl Inst Informat, Tokyo, Japan
[3] Hewlett Packard Labs, Palo Alto, CA, USA
Keywords
Eye-gaze prediction; network streaming;
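DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract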
With the advent of eye gaze tracking technology, eye gaze is increasingly used as a media interaction trigger in applications such as eye typing, video content customization, and region-of-interest (ROI)-based network video streaming. In practice, however, the reaction time of a gaze-based networked system is lower-bounded by the round trip time (RTT) of today's networks, which can be large. To improve the efficacy of gaze-based networked systems, in this paper we propose a Hidden Markov Model (HMM)-based gaze prediction strategy that predicts future gaze locations in order to lower end-to-end reaction delay. We first design an HMM with three states corresponding to the three major types of intrinsic human eye movement. HMM parameters are obtained offline on a per-video basis during the training phase. During the testing phase, a window of noisy gaze observations is collected in real time as input to a forward algorithm, which computes the most likely HMM state. Given the deduced HMM state, linear prediction is used to predict the gaze location RTT seconds into the future. We demonstrate the applicability of our gaze prediction strategy by focusing on ROI-based bit allocation for network video streaming. To reduce the transmission rate of a video stream without degrading the viewer's perceived visual quality, we allocate more bits to encode the viewer's current spatial ROI while devoting fewer bits to other spatial regions. The challenge lies in overcoming the delay between the time a viewer's ROI is detected by gaze tracking and the time the affected video is encoded, delivered, and displayed at the viewer's terminal. To this end, we use our proposed gaze prediction strategy to predict future eye gaze locations, so that optimized bit allocation can be performed for future frames. Our experiments show that the bit rate can be reduced by 21% without noticeable visual quality degradation when the end-to-end network delay is as high as 200 ms.
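The pipeline described in the abstract (a forward algorithm over a window of gaze observations, followed by linear prediction RTT seconds ahead) can be illustrated with a minimal Python sketch. Everything specific here is an assumption for illustration and not taken from the paper: the state labels (fixation, pursuit, saccade), the use of gaze speed with Gaussian emissions as the observation, all numeric parameters in `PRIOR`, `TRANS`, and `EMIT`, and the per-state prediction rule in the hypothetical `predict_gaze` helper. In the paper, the HMM parameters are trained offline per video.

```python
import math

# Assumed state labels; the abstract only says "three major types of eye movement".
STATES = ("fixation", "pursuit", "saccade")

# Hypothetical parameters (the paper trains these offline per video).
PRIOR = [0.6, 0.3, 0.1]
TRANS = [
    [0.90, 0.05, 0.05],
    [0.10, 0.85, 0.05],
    [0.40, 0.20, 0.40],
]
# Assumed Gaussian emission on gaze speed in pixels/second: (mean, std) per state.
EMIT = [(20.0, 20.0), (150.0, 60.0), (1500.0, 500.0)]


def gaussian_pdf(x, mean, std):
    z = (x - mean) / std
    # Floor the density so normalization never divides by zero on underflow.
    return max(math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi)), 1e-300)


def forward_most_likely_state(speeds):
    """Scaled forward algorithm; returns (state index, filtered posterior)."""
    n = len(STATES)
    alpha = [PRIOR[i] * gaussian_pdf(speeds[0], *EMIT[i]) for i in range(n)]
    total = sum(alpha)
    alpha = [a / total for a in alpha]
    for s in speeds[1:]:
        alpha = [
            gaussian_pdf(s, *EMIT[j]) * sum(alpha[i] * TRANS[i][j] for i in range(n))
            for j in range(n)
        ]
        total = sum(alpha)
        alpha = [a / total for a in alpha]
    return max(range(n), key=lambda i: alpha[i]), alpha


def speeds_from_window(window):
    """window: list of (t_seconds, x, y) with strictly increasing timestamps."""
    return [
        math.hypot(x1 - x0, y1 - y0) / (t1 - t0)
        for (t0, x0, y0), (t1, x1, y1) in zip(window, window[1:])
    ]


def fit_line(ts, vs):
    """Least-squares slope and intercept of vs against ts."""
    n = len(ts)
    t_mean = sum(ts) / n
    v_mean = sum(vs) / n
    var = sum((t - t_mean) ** 2 for t in ts)
    slope = sum((t - t_mean) * (v - v_mean) for t, v in zip(ts, vs)) / var
    return slope, v_mean - slope * t_mean


def predict_gaze(window, rtt):
    """Return (state name, (x, y)) predicted rtt seconds after the last sample."""
    state, _ = forward_most_likely_state(speeds_from_window(window))
    ts = [w[0] for w in window]
    xs = [w[1] for w in window]
    ys = [w[2] for w in window]
    name = STATES[state]
    if name == "fixation":
        # Assumed rule: gaze stays near the window mean.
        return name, (sum(xs) / len(xs), sum(ys) / len(ys))
    if name == "saccade":
        # Assumed rule: hold the last observed location.
        return name, (xs[-1], ys[-1])
    # Pursuit: extrapolate a fitted line to the future time instant.
    t_future = ts[-1] + rtt
    sx, bx = fit_line(ts, xs)
    sy, by = fit_line(ts, ys)
    return name, (sx * t_future + bx, sy * t_future + by)
```

For example, calling `predict_gaze(window, 0.2)` on a window of samples taken every 20 ms would return the inferred state and a location 200 ms ahead, which a streaming server could then use to place the high-quality ROI for future frames.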
Pages: 6