Unsupervised Deep Single‐Image Intrinsic Decomposition using Illumination‐Varying Image Sequences

Lettry, Louis; Vanhoey, Kenneth; Van Gool, Luc (2018)

Computer Graphics Forum

Machine learning based Single Image Intrinsic Decomposition (SIID) methods decompose a captured scene into its albedo and shading images by using the knowledge of a large set of known and realistic ground truth decompositions. Collecting and annotating such a dataset is an approach that cannot scale to sufficient variety and realism. We free ourselves from this limitation by training on unannotated images. Our method leverages the observation ...

Conference Paper

Iterative Deep Retinal Topology Extraction

Ventura, Carles; Pont-Tuset, Jordi; Caelles Prat, Sergi; et al. (2018)

Lecture Notes in Computer Science, Patch-Based Techniques in Medical Imaging

Conference Paper

Acquiring Common Sense Spatial Knowledge Through Implicit Spatial Templates

Collell, Guillem; Van Gool, Luc; Moens, Marie-Francine (2018)

Proceedings of 32nd AAAI Conference on Artificial Intelligence

Conference Paper

Blazingly Fast Video Object Segmentation with Pixel-Wise Metric Learning

Chen, Yuhua; Pont-Tuset, Jordi; Montes, Alberto; et al. (2018)

2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition

This paper tackles the problem of video object segmentation, given some user annotation which indicates the object of interest. The problem is formulated as pixel-wise retrieval in a learned embedding space: we embed pixels of the same object instance into the vicinity of each other, using a fully convolutional network trained by a modified triplet loss as the embedding model. Then the annotated pixels are set as reference and the rest ...

Conference Paper

ROAD: Reality Oriented Adaptation for Semantic Segmentation of Urban Scenes

Chen, Yuhua; Li, Wen; Van Gool, Luc (2018)

2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition

Exploiting synthetic data to learn deep models has attracted increasing attention in recent years. However, the intrinsic domain difference between synthetic and real images usually causes a significant performance drop when applying the learned model to real world scenarios. This is mainly due to two reasons: 1) the model overfits to synthetic images, making the convolutional filters incompetent to extract informative representation for ...

Conference Paper

Appearance-and-Relation Networks for Video Classification

Wang, Limin; Li, Wei; Li, Wen; et al. (2018)

2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition

Conference Paper

WILDTRACK: A Multi-camera HD Dataset for Dense Unscripted Pedestrian Detection

Chavdarova, Tatjana; Baqué, Pierre; Bouquet, Stéphane; et al. (2018)

2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition

People detection methods are highly sensitive to occlusions between pedestrians, which are extremely frequent in many situations where cameras have to be mounted at a limited height. The reduction of camera prices allows for the generalization of static multi-camera set-ups. Using joint visual information from multiple synchronized cameras gives the opportunity to improve detection performance. In this paper, we present a new large-scale ...

Conference Paper

Deep Extreme Cut: From Extreme Points to Object Segmentation

Maninis, Kevis-Kokitsi; Caelles, Sergi; Pont-Tuset, Jordi; et al. (2018)

2018 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper explores the use of extreme points in an object (left-most, right-most, top, bottom pixels) as input to obtain precise object segmentation for images and videos. We do so by adding an extra channel to the image in the input of a convolutional neural network (CNN), which contains a Gaussian centered in each of the extreme points. The CNN learns to transform this information into a segmentation of an object that matches those ...

Conference Paper
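
The input encoding described in the Deep Extreme Cut abstract above (an extra image channel holding a Gaussian centered on each extreme point) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `extreme_point_heatmap` and `add_heatmap_channel`, the `sigma` value, and the choice to merge the Gaussians with a pixel-wise maximum are all assumptions made for illustration.

```python
import numpy as np


def extreme_point_heatmap(height, width, points, sigma=10.0):
    """Build a (height, width) map with a 2D Gaussian at each (x, y) point.

    `sigma` and the max-combination of Gaussians are illustrative
    assumptions, not values taken from the paper.
    """
    ys, xs = np.mgrid[0:height, 0:width]
    heatmap = np.zeros((height, width), dtype=np.float32)
    for x, y in points:
        # Unnormalized Gaussian that peaks at 1.0 on the point itself.
        g = np.exp(-((xs - x) ** 2 + (ys - y) ** 2) / (2.0 * sigma ** 2))
        heatmap = np.maximum(heatmap, g.astype(np.float32))
    return heatmap


def add_heatmap_channel(image, points, sigma=10.0):
    """Append the heatmap to an (H, W, C) image as channel C + 1."""
    h, w = image.shape[:2]
    heatmap = extreme_point_heatmap(h, w, points, sigma)
    return np.concatenate(
        [image.astype(np.float32), heatmap[..., None]], axis=-1
    )


# Hypothetical usage: an RGB image plus four extreme points given as (x, y),
# in the order left-most, right-most, top, bottom.
image = np.zeros((64, 48, 3), dtype=np.uint8)
extreme_points = [(5, 30), (40, 32), (20, 3), (22, 60)]
network_input = add_heatmap_channel(image, extreme_points)
```

The resulting four-channel array is what a CNN would receive in place of the plain RGB image under these assumptions.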

Classification-Driven Dynamic Image Enhancement

Sharma, Vivek; Diba, Ali; Neven, Davy; et al. (2018)

2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition

Convolutional neural networks rely on image texture and structure to serve as discriminative features to classify the image content. Image enhancement techniques can be used as preprocessing steps to help improve the overall image quality and in turn improve the overall effectiveness of a CNN. Existing image enhancement methods, however, are designed to improve the perceptual quality of an image for a human observer. In this paper, we are ...

Conference Paper

Conditional Probability Models for Deep Image Compression

Mentzer, Fabian; Agustsson, Eirikur; Tschannen, Michael; et al. (2018)

2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition

Deep Neural Networks trained as image auto-encoders have recently emerged as a promising direction for advancing the state-of-the-art in image compression. The key challenge in learning such networks is twofold: To deal with quantization, and to control the trade-off between reconstruction error (distortion) and entropy (rate) of the latent image representation. In this paper, we focus on the latter challenge and propose a new technique ...

Conference Paper
