Supervision-by-Registration: An Unsupervised Approach to Improve the Precision of Facial Landmark Detectors

被引：123

作者：

Dong, Xuanyi ^{[1
]}

Yu, Shoou-, I ^{[2
]}

Weng, Xinshuo ^{[2
]}

Wei, Shih-En ^{[2
]}

Yang, Yi ^{[1
]}

Sheikh, Yaser ^{[2
]}

机构：

[1] Univ Technol Sydney, CAI, Sydney, NSW, Australia

[2] Facebook Real Labs, Pittsburgh, PA USA

来源：

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年

关键词：

D O I：

10.1109/CVPR.2018.00045

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we present supervision-by-registration, an unsupervised approach to improve the precision of facial landmark detectors on both images and video. Our key observation is that the detections of the same landmark in adjacent frames should be coherent with registration, i.e., optical flow. Interestingly, coherency of optical flow is a source of supervision that does not require manual labeling, and can be leveraged during detector training. For example, we can enforce in the training loss function that a detected landmark at frame(t-1) followed by optical flow tracking from frame(t-1) to frame) should coincide with the location of the detection at frame). Essentially, supervision by -registration augments the training loss function with a registration loss, thus training the detector to have output that is not only close to the annotations in labeled images, but also consistent with registration on large amounts of unlabeled videos. End-to-end training with the registration loss is made possible by a differentiable Lucas-Kanade operation, which computes optical flow registration in the forward pass, and back-propagates gradients that encourage temporal coherency in the detector. The output of our method is a more precise image-based facial landmark detector, which can be applied to single images or video. With supervision-by-registration, we demonstrate (1) improvements in facial landmark detection on both images (300W ALFW) and video (300VW, Youtube-Celebrities), and (2) significant reduction of jittering in video detections.

引用

页码：360 / 368

页数：9

共 2 条

[1] Numerical Approach to Facial Palsy Using a Novel Registration Method with 3D Facial Landmark
Kim, Junsik
Jeong, Hyungwha
Cho, Jeongmok
Pak, Changsik
Oh, Tae Suk
Hong, Joon Pio
Kwon, Soonchul
Yoo, Jisang
[J]. SENSORS, 2022, 22 (17)
[2] Two dimensional assisted liquid chromatography - a chemometric approach to improve accuracy and precision of quantitation in liquid chromatography using 2D separation, dual detectors, and multivariate curve resolution
Cook, Daniel W.
Rutan, Sarah C.
Stoll, Dwight R.
Carr, Peter W.
[J]. ANALYTICA CHIMICA ACTA, 2015, 859 : 87 - 95

← 1 →