Search

JavaScript is disabled for your browser. Some features of this site may not work without it.

Now showing items 1-10 of 338

Vanishing Point Estimation in Uncalibrated Images with Prior Gravity Direction

Pautrat, Rémi; Liu, Shaohui; Hruby, Petr; et al. (2024)

2023 IEEE/CVF International Conference on Computer Vision (ICCV)

We tackle the problem of estimating a Manhattan frame, i.e. three orthogonal vanishing points, and the unknown focal length of the camera, leveraging a prior vertical direction. The direction can come from an Inertial Measurement Unit that is a standard component of recent consumer devices, e.g., smartphones. We provide an exhaustive analysis of minimal line configurations and derive two new 2-line solvers, one of which does not suffer ...

Conference Paper

Human from Blur: Human Pose Tracking from Blurry Images

Zhao, Yiming; Rozumnyi, Denys; Song, Jie; et al. (2024)

2023 IEEE/CVF International Conference on Computer Vision (ICCV)

We propose a method to estimate 3D human poses from substantially blurred images. The key idea is to tackle the inverse problem of image deblurring by modeling the forward problem with a 3D human model, a texture map, and a sequence of poses to describe human motion. The blurring process is then modeled by a temporal image aggregation step. Using a differentiable renderer, we can solve the inverse problem by backpropagating the pixel-wise ...

Conference Paper

RegFormer: An Efficient Projection-Aware Transformer Network for Large-Scale Point Cloud Registration

Liu, Jiuming; Wang, Guangming; Liu, Zhe; et al. (2024)

2023 IEEE/CVF International Conference on Computer Vision (ICCV)

Although point cloud registration has achieved remarkable advances in object-level and indoor scenes, large-scale registration methods are rarely explored. Challenges mainly arise from the huge point number, complex distribution, and outliers of outdoor LiDAR scans. In addition, most existing registration works generally adopt a two-stage paradigm: They first find correspondences by extracting discriminative local features and then leverage ...

Conference Paper

RLSAC: Reinforcement Learning enhanced Sample Consensus for End-to-End Robust Estimation

Nie, Chang; Wang, Guangming; Liu, Zhe; et al. (2024)

2023 IEEE/CVF International Conference on Computer Vision (ICCV)

Robust estimation is a crucial and still challenging task, which involves estimating model parameters in noisy environments. Although conventional sampling consensus-based algorithms sample several times to achieve robustness, these algorithms cannot use data features and historical information effectively. In this paper, we propose RLSAC, a novel Reinforcement Learning enhanced SAmple Consensus framework for end-to-end robust estimation. ...

Conference Paper

Tracking by 3D Model Estimation of Unknown Objects in Videos

Rozumnyi, Denys; Matas, Jiří; Pollefeys, Marc; et al. (2024)

2023 IEEE/CVF International Conference on Computer Vision (ICCV)

Most model-free visual object tracking methods formulate the tracking task as object location estimation given by a 2D segmentation or a bounding box in each video frame. We argue that this representation is limited and instead propose to guide and improve 2D tracking with an explicit object representation, namely the textured 3D shape and 6DoF pose in each video frame. Our representation tackles a complex long-term dense correspondence ...

Conference Paper

GlueStick: Robust Image Matching by Sticking Points and Lines Together

Pautrat, Rémi; Suárez, Iago; Yu, Yifan; et al. (2024)

2023 IEEE/CVF International Conference on Computer Vision (ICCV)

Line segments are powerful features complementary to points. They offer structural cues, robust to drastic viewpoint and illumination changes, and can be present even in texture-less areas. However, describing and matching them is more challenging compared to points due to partial occlusions, lack of texture, or repetitiveness. This paper introduces a new matching paradigm, where points, lines, and their descriptors are unified into a ...

Conference Paper

R3D3: Dense 3D Reconstruction of Dynamic Scenes from Multiple Cameras

Schmied, Aron; Fischer, Tobias; Danelljan, Martin; et al. (2024)

2023 IEEE/CVF International Conference on Computer Vision (ICCV)

Dense 3D reconstruction and ego-motion estimation are key challenges in autonomous driving and robotics. Compared to the complex, multi-modal systems deployed today, multi-camera systems provide a simpler, low-cost alternative. However, camera-based 3D reconstruction of complex dynamic scenes has proven extremely difficult, as existing solutions often produce incomplete or incoherent results. We propose R3D3, a multi-camera system for ...

Conference Paper

IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis

Ye, Weicai; Chen, Shuo; Bao, Chong; et al. (2024)

2023 IEEE/CVF International Conference on Computer Vision (ICCV)

Existing inverse rendering combined with neural rendering methods can only perform editable novel view synthesis on object-specific scenes, while we present intrinsic neural radiance fields, dubbed IntrinsicNeRF, which introduce intrinsic decomposition into the NeRF-based neural rendering method and can extend its application to room-scale scenes. Since intrinsic decomposition is a fundamentally under-constrained inverse problem, we propose ...

Conference Paper

SGAligner: 3D Scene Alignment with Scene Graphs

Sarkar, Sayan Deb; Miksik, Ondrej; Pollefeys, Marc; et al. (2024)

2023 IEEE/CVF International Conference on Computer Vision (ICCV)

Building 3D scene graphs has recently emerged as a topic in scene representation for several embodied AI applications to represent the world in a structured and rich manner. With their increased use in solving downstream tasks (e.g., navigation and room rearrangement), can we leverage and recycle them for creating 3D maps of environments, a pivotal step in agent operation? We focus on the fundamental problem of aligning pairs of 3D scene ...

Conference Paper

HoloAssist: an Egocentric Human Interaction Dataset for Interactive AI Assistants in the Real World

Wang, Xin; Kwon, Taein; Rad, Mahdi; et al. (2024)

2023 IEEE/CVF International Conference on Computer Vision (ICCV)

Building an interactive AI assistant that can perceive, reason, and collaborate with humans in the real world has been a long-standing pursuit in the AI community. This work is part of a broader research effort to develop intelligent agents that can interactively guide humans through performing tasks in the physical world. As a first step in this direction, we introduce HoloAssist, a large-scale egocentric human interaction dataset, where ...

Conference Paper

Results

Vanishing Point Estimation in Uncalibrated Images with Prior Gravity Direction

Human from Blur: Human Pose Tracking from Blurry Images

RegFormer: An Efficient Projection-Aware Transformer Network for Large-Scale Point Cloud Registration

RLSAC: Reinforcement Learning enhanced Sample Consensus for End-to-End Robust Estimation

Tracking by 3D Model Estimation of Unknown Objects in Videos

GlueStick: Robust Image Matching by Sticking Points and Lines Together

R3D3: Dense 3D Reconstruction of Dynamic Scenes from Multiple Cameras

IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis

SGAligner: 3D Scene Alignment with Scene Graphs

HoloAssist: an Egocentric Human Interaction Dataset for Interactive AI Assistants in the Real World

Refine by

Research Collection

Search

Search

Results

Vanishing Point Estimation in Uncalibrated Images with Prior Gravity Direction ﻿

Human from Blur: Human Pose Tracking from Blurry Images ﻿

RegFormer: An Efficient Projection-Aware Transformer Network for Large-Scale Point Cloud Registration ﻿

RLSAC: Reinforcement Learning enhanced Sample Consensus for End-to-End Robust Estimation ﻿

Tracking by 3D Model Estimation of Unknown Objects in Videos ﻿

GlueStick: Robust Image Matching by Sticking Points and Lines Together ﻿

R3D3: Dense 3D Reconstruction of Dynamic Scenes from Multiple Cameras ﻿

IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis ﻿

SGAligner: 3D Scene Alignment with Scene Graphs ﻿

HoloAssist: an Egocentric Human Interaction Dataset for Interactive AI Assistants in the Real World ﻿

Refine by

Vanishing Point Estimation in Uncalibrated Images with Prior Gravity Direction

Human from Blur: Human Pose Tracking from Blurry Images

RegFormer: An Efficient Projection-Aware Transformer Network for Large-Scale Point Cloud Registration

RLSAC: Reinforcement Learning enhanced Sample Consensus for End-to-End Robust Estimation

Tracking by 3D Model Estimation of Unknown Objects in Videos

GlueStick: Robust Image Matching by Sticking Points and Lines Together

R3D3: Dense 3D Reconstruction of Dynamic Scenes from Multiple Cameras

IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis

SGAligner: 3D Scene Alignment with Scene Graphs

HoloAssist: an Egocentric Human Interaction Dataset for Interactive AI Assistants in the Real World