Open access
Date
2022Type
- Conference Paper
ETH Bibliography
yes
Altmetrics
Abstract
We introduce TempCLR, a new time-coherent contrastive learning approach for the structured regression task of 3D hand reconstruction. Unlike previous time-contrastive methods for hand pose estimation, our framework considers temporal consistency in its augmentation scheme, and accounts for the differences of hand poses along the temporal direction. Our data-driven method leverages unlabelled videos and a standard CNN, without relying on synthetic data, pseudo-labels, or specialized architectures. Our approach improves the performance of fully-supervised hand reconstruction methods by 15.9% and 7.6% in PA-V2V on the HO-3D and FreiHAND datasets respectively, thus establishing new state-of-the-art performance. Finally, we demonstrate that our approach produces smoother hand reconstructions through time, and is more robust to heavy occlusions compared to the previous state-of-the-art which we show quantitatively and qualitatively. Show more
Permanent link
https://doi.org/10.3929/ethz-b-000587549Publication status
publishedExternal links
Book title
2022 International Conference on 3D Vision (3DV)Pages / Article No.
Publisher
IEEEEvent
Subject
Three-dimensional displays; Shape; Pose estimation; Reconstruction algorithms; Probabilistic logic; Stability analysis; Task analysisOrganisational unit
03979 - Hilliges, Otmar / Hilliges, Otmar
Notes
Conference lecture held on September 15, 2022 at the poster session.More
Show all metadata
ETH Bibliography
yes
Altmetrics