
Open access
Date
2018Type
- Conference Paper
Citations
Cited 31 times in
Web of Science
Cited 42 times in
Scopus
ETH Bibliography
yes
Altmetrics
Abstract
Research on learning suitable feature descriptors for Computer Vision has recently shifted to deep learning where the biggest challenge lies with the formulation of appropriate loss functions, especially since the descriptors to be learned are not known at training time. While approaches such as Siamese and triplet losses have been applied with success, it is still not well understood what makes a good loss function. In this spirit, this work demonstrates that many commonly used losses suffer from a range of problems. Based on this analysis, we introduce mixed-context losses and scale-aware sampling, two methods that when combined enable networks to learn consistently scaled descriptors for the first time. Show more
Permanent link
https://doi.org/10.3929/ethz-b-000254341Publication status
publishedExternal links
Book title
2018 IEEE/CVF Conference on Computer Vision and Pattern RecognitionPages / Article No.
Publisher
IEEEEvent
Subject
Feature Encoding; Triplet NetworkOrganisational unit
09559 - Chli, Margarita (ehemalig) / Chli, Margarita (former)
Funding
644128 - Collaborative Aerial Robotic Workers (SBFI)
More
Show all metadata
Citations
Cited 31 times in
Web of Science
Cited 42 times in
Scopus
ETH Bibliography
yes
Altmetrics