Camera Motion-Based Analysis of User Generated Video

被引:42
|
作者
Abdollahian, Golnaz [1 ]
Taskiran, Cuneyt M. [2 ]
Pizlo, Zygmunt [3 ]
Delp, Edward J. [1 ]
机构
[1] Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
[2] Motorola Inc, Applicat Res & Technol Ctr, Schaumburg, IL 60196 USA
[3] Purdue Univ, Dept Psychol Sci, W Lafayette, IN 47907 USA
关键词
Content-based video analysis; eye tracking; home video; motion-based analysis; regions of interest; saliency maps; user generated video; video summarization; COMPRESSED VIDEO; VISUAL-ATTENTION; MODEL;
D O I
10.1109/TMM.2009.2036286
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we propose a system for the analysis of user generated video (UGV). UGV often has a rich camera motion structure that is generated at the time the video is recorded by the person taking the video, i.e., the "camera person." We exploit this structure by defining a new concept known as camera view for temporal segmentation of UGV. The segmentation provides a video summary with unique properties that is useful in applications such as video annotation. Camera motion is also a powerful feature for identification of keyframes and regions of interest (ROIs) since it is an indicator of the camera person's interests in the scene and can also attract the viewers' attention. We propose a new location-based saliency map which is generated based on camera motion parameters. This map is combined with other saliency maps generated using features such as color contrast, object motion and face detection to determine the ROIs. In order to evaluate our methods we conducted several user studies. A subjective evaluation indicated that our system produces results that is consistent with viewers' preferences. We also examined the effect of camera motion on human visual attention through an eye tracking experiment. The results showed a high dependency between the distribution of fixation points of the viewers and the direction of camera movement which is consistent with our location-based saliency map.
引用
收藏
页码:28 / 41
页数:14
相关论文
共 50 条
  • [21] Motion-based camera localization system in colonoscopy videos
    Yao, Heming
    Stidham, Ryan W.
    Gao, Zijun
    Gryak, Jonathan
    Najarian, Kayvan
    MEDICAL IMAGE ANALYSIS, 2021, 73
  • [22] Motion-based interaction on the mobile device for user interface
    Baek, Jonghun
    Jang, Ik-Jin
    Kim, Hyun Deok
    Cho, Jae-Soo
    Yun, Byoung-Ju
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2006, 4251 : 1011 - 1020
  • [23] Novel motion-based active video indexing method
    Wei, Jie
    Li, Ze-Nian
    Gertner, Izidor
    International Conference on Multimedia Computing and Systems -Proceedings, 1999, 2 : 460 - 465
  • [24] Motion-Based Multiple Object Detection and Tracking in Video
    Sadura, Piotr
    2021 SIGNAL PROCESSING SYMPOSIUM (SPSYMPO), 2021, : 248 - 251
  • [25] Bayesian approaches to motion-based image and video segmentation
    Cremers, Daniel
    COMPLEX MOTION, 2007, 3417 : 104 - 123
  • [26] Motion-based video object tracking in the compressed domain
    Ritch, Mark
    Canagarajah, Nishan
    2007 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-7, 2007, : 3097 - 3100
  • [27] Two novel motion-based algorithms for surveillance video analysis on embedded platforms
    Vijverberg, Julien A.
    Loomans, Marijn J. H.
    Koeleman, Cornelis J.
    de With, Peter H. N.
    REAL-TIME IMAGE AND VIDEO PROCESSING 2010, 2010, 7724
  • [28] Motion-based video representation for scene change detection
    Ngo, CW
    Pong, TC
    Zhang, HJ
    Chin, RT
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS: COMPUTER VISION AND IMAGE ANALYSIS, 2000, : 827 - 830
  • [29] A novel motion-based active video indexing method
    Wei, J
    Li, ZN
    Gertner, I
    IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS, PROCEEDINGS VOL 2, 1999, : 460 - 465
  • [30] Motion-based video representation for scene change detection
    Ngo, CW
    Pong, TC
    Zhang, HJ
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2002, 50 (02) : 127 - 142