Bayesian Self-training for Semi-supervised 3D Segmentation
METADATA ONLY
Date
2025
Publication Type
Conference Paper
ETH Bibliography
yes
Abstract
3D segmentation is a core problem in computer vision and, like many other dense prediction tasks, it requires large amounts of annotated data for adequate training. However, densely labeling 3D point clouds for fully supervised training remains too labor-intensive and expensive. Semi-supervised training provides a more practical alternative, in which only a small set of labeled data is given, accompanied by a larger unlabeled set. This area thus studies how to use unlabeled data effectively to reduce the performance gap caused by the lack of annotations. In this work, inspired by Bayesian deep learning, we first propose a Bayesian self-training framework for semi-supervised 3D semantic segmentation. Employing stochastic inference, we generate an initial set of pseudo-labels and then filter these based on estimated point-wise uncertainty. By constructing a heuristic n-partite matching algorithm, we extend the method to semi-supervised 3D instance segmentation and finally, with the same building blocks, to dense 3D visual grounding. We demonstrate state-of-the-art results for our semi-supervised method on SemanticKITTI and ScribbleKITTI for 3D semantic segmentation and on ScanNet and S3DIS for 3D instance segmentation. We further achieve substantial improvements in dense 3D visual grounding over supervised-only baselines on ScanRefer. Our project page is available at ouenal.github.io/bst/.
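The pseudo-labeling step described in the abstract (stochastic inference followed by filtering on point-wise uncertainty) can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the stochastic forward passes have already produced per-point softmax outputs, uses predictive entropy as the uncertainty estimate, and introduces the hypothetical names `IGNORE`, `mean_probs`, `entropy`, `pseudo_labels`, and `max_entropy` purely for illustration.

```python
import math

IGNORE = -1  # hypothetical marker for points that receive no pseudo-label


def mean_probs(samples):
    """Average the softmax outputs of T stochastic passes for one point."""
    t = len(samples)
    return [sum(s[c] for s in samples) / t for c in range(len(samples[0]))]


def entropy(probs):
    """Predictive entropy (in nats) of a categorical distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def pseudo_labels(point_samples, max_entropy):
    """Keep the argmax class where entropy <= max_entropy, else IGNORE.

    point_samples[i][t][c] is the probability of class c for point i
    in stochastic pass t. The threshold is an assumed hyperparameter.
    """
    labels = []
    for samples in point_samples:
        probs = mean_probs(samples)
        if entropy(probs) <= max_entropy:
            labels.append(max(range(len(probs)), key=probs.__getitem__))
        else:
            labels.append(IGNORE)
    return labels


# Toy input: two points, three stochastic passes, two classes.
point_samples = [
    [[0.9, 0.1], [0.95, 0.05], [0.85, 0.15]],  # passes agree
    [[0.6, 0.4], [0.4, 0.6], [0.5, 0.5]],      # passes disagree
]
print(pseudo_labels(point_samples, max_entropy=0.5))  # prints [0, -1]
```

In this toy input the first point is confidently assigned class 0, while the second is left unlabeled because its averaged prediction is close to uniform. Other uncertainty measures or thresholding schemes could be substituted without changing the overall structure.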
Publication status
published
Book title
Computer Vision – ECCV 2024
Journal / series
Volume
15114
Pages / Article No.
89 - 107
Publisher
Springer
Event
18th European Conference on Computer Vision (ECCV 2024)
Subject
Semi-supervised; Self-training; 3D semantic segmentation; 3D instance segmentation; 3D visual grounding
Organisational unit
03886 - Schindler, Konrad / Schindler, Konrad
03514 - Van Gool, Luc (emeritus) / Van Gool, Luc (emeritus)