共 50 条
Learning the Depths of Moving People by Watching Frozen People
被引:101
|作者:
Li, Zhengqi
[1
]
Dekel, Tali
[1
]
Cole, Forrester
[1
]
Tucker, Richard
[1
]
Snavely, Noah
[1
]
Liu, Ce
[1
]
Freeman, William T.
[1
]
机构:
[1] Google Res, Mountain View, CA 94043 USA
关键词:
D O I:
10.1109/CVPR.2019.00465
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
We present a method for predicting dense depth in scenarios where both a monocular camera and people in the scene are freely moving. Existing methods for recovering depth for dynamic, non-rigid objects from monocular video impose strong assumptions on the objects' motion and may only recover sparse depth. In this paper, we take a data-driven approach and learn human depth priors from a new source of data: thousands of Internet videos of people imitating mannequins, i.e., freezing in diverse, natural poses, while a hand-held camera tours the scene. Because people are stationary, training data can be generated using multi-view stereo reconstruction. At inference time, our method uses motion parallax cues from the static areas of the scenes to guide the depth prediction. We demonstrate our method on real-world sequences of complex human actions captured by a moving hand-held camera, show improvement over state-of-the-art monocular depth prediction methods, and show various 3D effects produced using our predicted depth.
引用
收藏
页码:4516 / 4525
页数:10
相关论文