
Open access
Date
2019-06-16Type
- Conference Paper
ETH Bibliography
yes
Altmetrics
Abstract
We present a self-supervision method for 3D hand pose estimation from depth maps. We begin with a neural network initialized with synthesized data and fine-tune it on real but unlabelled depth maps by minimizing a set of datafitting terms. By approximating the hand surface with a set of spheres, we design a differentiable hand renderer to align estimates by comparing the rendered and input depth maps. In addition, we place a set of priors including a data-driven term to further regulate the estimate’s kinematic feasibility. Our method makes highly accurate estimates comparable to current supervised methods which require large amounts of labelled training samples, thereby advancing state-of-the-art in unsupervised learning for hand pose estimation. Show more
Permanent link
https://doi.org/10.3929/ethz-b-000391652Publication status
publishedPages / Article No.
Publisher
Computer Vision Foundation (CVF)Event
Organisational unit
03514 - Van Gool, Luc / Van Gool, Luc
Related publications and datasets
Is original form of: http://hdl.handle.net/20.500.11850/397699
More
Show all metadata
ETH Bibliography
yes
Altmetrics