Augmenting visual place recognition with structural cues


METADATA ONLY
Loading...

Date

2020-10

Publication Type

Journal Article

ETH Bibliography

yes

Citations

Altmetric
METADATA ONLY

Data

Rights / License

Abstract

In this letter, we propose to augment image-based place recognition with structural cues. Specifically, these structural cues are obtained using structure-from-motion, such that no additional sensors are needed for place recognition. This is achieved by augmenting the 2D convolutional neural network (CNN) typically used for image-based place recognition with a 3D CNN that takes as input a voxel grid derived from the structure-from-motion point cloud. We evaluate different methods for fusing the 2D and 3D features and obtain best performance with global average pooling and simple concatenation. On the Oxford RobotCar dataset, the resulting descriptor exhibits superior recognition performance compared to descriptors extracted from only one of the input modalities, including state-of-The-Art image-based descriptors. Especially at low descriptor dimensionalities, we outperform state-of-The-Art descriptors by up to 90%. © 2016 IEEE.

Permanent link

Publication status

published

Editor

Book title

Volume

5 (4)

Pages / Article No.

5534 - 5541

Publisher

IEEE

Event

Edition / version

Methods

Software

Geographic location

Date collected

Date created

Subject

Recognition; localization

Organisational unit

Notes

Funding

Related publications and datasets