Search
Results
-
Vanishing Point Estimation in Uncalibrated Images with Prior Gravity Direction
(2023)2023 IEEE/CVF International Conference on Computer Vision (ICCV)We tackle the problem of estimating a Manhattan frame, i.e. three orthogonal vanishing points, and the unknown focal length of the camera, leveraging a prior vertical direction. The direction can come from an Inertial Measurement Unit that is a standard component of recent consumer devices, e.g., smartphones. We provide an exhaustive analysis of minimal line configurations and derive two new 2-line solvers, one of which does not suffer ...Conference Paper -
SGAligner: 3D Scene Alignment with Scene Graphs
(2023)2023 IEEE/CVF International Conference on Computer Vision (ICCV)Building 3D scene graphs has recently emerged as a topic in scene representation for several embodied AI applications to represent the world in a structured and rich manner. With their increased use in solving downstream tasks (e.g., navigation and room rearrangement), can we leverage and recycle them for creating 3D maps of environments, a pivotal step in agent operation? We focus on the fundamental problem of aligning pairs of 3D scene ...Conference Paper -
Privacy Preserving Localization via Coordinate Permutations
(2023)2023 IEEE/CVF International Conference on Computer Vision (ICCV)Recent methods on privacy-preserving image-based localization use a random line parameterization to protect the privacy of query images and database maps. The lifting of points to lines effectively drops one of the two geometric constraints traditionally used with point-to-point correspondences in structure-based localization. This leads to a significant loss of accuracy for the privacy-preserving methods. In this paper, we overcome this ...Conference Paper -
Guiding Local Feature Matching with Surface Curvature
(2023)2023 IEEE/CVF International Conference on Computer Vision (ICCV)We propose a new method, named curvature similarity extractor (CSE), for improving local feature matching across images. CSE calculates the curvature of the local 3D surface patch for each detected feature point in a viewpoint-invariant manner via fitting quadrics to predicted monocular depth maps. This curvature is then leveraged as an additional signal in feature matching with off-the-shelf matchers like SuperGlue and LoFTR. Additionally, ...Conference Paper -
The Drunkard’s Odometry: Estimating Camera Motion in Deforming Scenes
(2023)Advances in Neural Information Processing Systems 36Conference Paper -
HoloAssist: an Egocentric Human Interaction Dataset for Interactive AI Assistants in the Real World
(2023)2023 IEEE/CVF International Conference on Computer Vision (ICCV)Building an interactive AI assistant that can perceive, reason, and collaborate with humans in the real world has been a long-standing pursuit in the AI community. This work is part of a broader research effort to develop intelligent agents that can interactively guide humans through performing tasks in the physical world. As a first step in this direction, we introduce HoloAssist, a large-scale egocentric human interaction dataset, where ...Conference Paper -
Intrinsicnerf: Learning intrinsic neural radiance fields for editable novel view synthesis
(2023)2023 IEEE/CVF International Conference on Computer Vision (ICCV)Existing inverse rendering combined with neural rendering methods can only perform editable novel view synthesis on object-specific scenes, while we present intrinsic neural radiance fields, dubbed IntrinsicNeRF, which introduce intrinsic decomposition into the NeRF-based neural rendering method and can extend its application to room-scale scenes. Since intrinsic decomposition is a fundamentally under-constrained inverse problem, we propose ...Conference Paper -
Tracking by 3D Model Estimation of Unknown Objects in Videos
(2023)2023 IEEE/CVF International Conference on Computer Vision (ICCV)Most model-free visual object tracking methods formulate the tracking task as object location estimation given by a 2D segmentation or a bounding box in each video frame. We argue that this representation is limited and instead propose to guide and improve 2D tracking with an explicit object representation, namely the textured 3D shape and 6DoF pose in each video frame. Our representation tackles a complex long-term dense correspondence ...Conference Paper -
Gluestick: Robust image matching by sticking points and lines together
(2023)2023 IEEE/CVF International Conference on Computer Vision (ICCV)Line segments are powerful features complementary to points. They offer structural cues, robust to drastic viewpoint and illumination changes, and can be present even in texture-less areas. However, describing and matching them is more challenging compared to points due to partial occlusions, lack of texture, or repetitiveness. This paper introduces a new matching paradigm, where points, lines, and their descriptors are unified into a ...Conference Paper -
Human from Blur: Human Pose Tracking from Blurry Images
(2023)2023 IEEE/CVF International Conference on Computer Vision (ICCV)We propose a method to estimate 3D human poses from substantially blurred images. The key idea is to tackle the inverse problem of image deblurring by modeling the forward problem with a 3D human model, a texture map, and a sequence of poses to describe human motion. The blurring process is then modeled by a temporal image aggregation step. Using a differentiable renderer, we can solve the inverse problem by backpropagating the pixel-wise ...Conference Paper