HIDDEN MARKOV MODEL FOR EYE GAZE PREDICTION IN NETWORKED VIDEO STREAMING

被引：0

作者：

Feng, Yunlong ^{[1
]}

Cheung, Gene ^{[2
]}

Tan, Wai-tian ^{[3
]}

Ji, Yusheng ^{[2
]}

机构：

[1] Grad Univ Adv Studies, Hayama, Kanagawa, Japan

[2] Natl Inst Informat, Tokyo, Japan

[3] Hewlett Packard Labs, Palo Alto, CA USA

来源：

2011 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME) | 2011年

关键词：

Eye-gaze prediction; network streaming;

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the advent of eye gaze tracking technology, eye gaze is increasingly being used as a media interaction trigger in a variety of applications, such as eye typing, video content customization, and network video streaming based on region-of-interest (ROI). The reaction time of a gaze-based networked system, however, is in practice lower-bounded by the round trip time (RTT) of today's networks, which can be large. To improve the efficacy of gaze-based networked systems, in the paper we propose a Hidden Markov Model (HMM)-based gaze prediction strategy to predict future gaze locations to lower end-to-end reaction delay. We first design an HMM with three states corresponding to human's three major types of intrinsic eye movements. HMM parameters are obtained offiine on a per-video basis during training phase. During testing phase, a window of noisy gaze observations are collected in real-time as input to a forward algorithm, which computes the most likely HMM state. Given the deduced HMM state, linear prediction is used to predict gaze location RTT seconds into the future. We demonstrate the applicability of our gaze prediction strategy by focusing on ROI-based bit allocation for network video streaming. To reduce transmission rate of a video stream without degrading viewer's perceived visual quality, we allocate more bits to encode the viewer's current spatial ROI, while devoting fewer bits in other spatial regions. The challenge lies in overcoming the delay between the time a viewer's ROI is detected by gaze tracking, to the time the effected video is encoded, delivered and displayed at the viewer's terminal. To this end, we use our proposed gaze-prediction strategy to predict future eye gaze locations, so that optimized bit allocation can be performed for future frames. Our experiments show that bit rate can be reduced by 21% without noticeable visual quality degradation when end-to-end network delay is as high as 200ms.

引用

页数：6

共 50 条

[1] Low-Cost Eye Gaze Prediction System for Interactive Networked Video Streaming
Feng, Yunlong
Cheung, Gene
Tan, Wai-tian
Le Callet, Patrick
Ji, Yusheng
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2013, 15 (08) : 1865 - 1879
[2] Design and Implementation of a Video Streaming Adaptive Algorithm Based on Markov Prediction Model
Liu, Jiangtao
Li, Zeping
Lin, Chuan
Yang, Bingzhao
Ge, Mengyuan
[J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2018), 2018,
[3] A Failure Prediction Approach Based on Cloud Theory and Hidden Markov Model in Networked Computing Systems
Zheng, Weiwei
Wang, Zhili
Huang, Haoqiu
Meng, Luoming
Qiu, Xuesong
[J]. 2015 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATION (ISCC), 2015, : 520 - 525
[4] A hidden Markov model for earthquake prediction
Cheuk Fung Yip
Wai Leong Ng
Chun Yip Yau
[J]. Stochastic Environmental Research and Risk Assessment, 2018, 32 : 1415 - 1434
[5] A hidden Markov model for earthquake prediction
Yip, Cheuk Fung
Ng, Wai Leong
Yau, Chun Yip
[J]. STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2018, 32 (05) : 1415 - 1434
[6] Video summarization using Hidden Markov Model
Huang, CL
Chang, CY
[J]. INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: CODING AND COMPUTING, PROCEEDINGS, 2001, : 473 - 477
[7] HIDDEN MARKOV MODEL FOR DISTRIBUTED VIDEO CODING
Toto-Zarasoa, V.
Roumy, A.
Guillemot, C.
[J]. 2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 3709 - 3712
[8] Hidden Markov model parsing of video programs
Wolf, W
[J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 2609 - 2611
[9] Classifying Eye Gaze Patterns and Inferring Individual Preferences Using Hidden Markov Models
Chan, Antoni B.
Coutrot, Antoine
[J]. I-PERCEPTION, 2017, 8 : 20 - 21
[10] A Hidden Markov Model for Route and Destination Prediction
Lassoued, Yassine
Monteil, Julien
Gu, Yingqi
Russo, Giovanni
Shorten, Robert
Mevissen, Martin
[J]. 2017 IEEE 20TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2017,

← 1 2 3 4 5 →