T-3DGS: Removing Transient Objects for 3D Scene Reconstruction

Project Page | Paper

Abstract

Transient objects in video sequences can significantly degrade the quality of 3D scene reconstructions. To address this challenge, we propose T-3DGS, a novel framework that robustly filters out transient distractors during 3D reconstruction using Gaussian Splatting. Our framework consists of two steps. First, we employ an unsupervised classification network that distinguishes transient objects from static scene elements by leveraging their distinct training dynamics within the reconstruction process. Second, we refine these initial detections by integrating an off-the-shelf segmentation method with a bidirectional tracking module, which together enhance boundary accuracy and temporal coherence. Evaluations on both sparsely and densely captured video datasets demonstrate that T-3DGS significantly outperforms state-of-the-art approaches, enabling high-fidelity 3D reconstructions in challenging, real-world scenarios.

Overview

This repository implements Reconstruction Uncertainty Predictor (RUP), a solution for handling transient objects in 3D scene reconstruction. For mask refinement functionality (TMR), please refer to our companion repository.

Key Features

Automatic Detection of Transient Objects: Integrate transient object removal seamlessly into the 3D reconstruction pipeline.
Two-Stage Pipeline: Combines RUP and TMR for enhanced mask prediction and refinement.
Docker Support: Simplifies deployment and setup across different environments.

Installation

The installation process aligns with the original Gaussian Splatting project, with additional dependencies specified in environment.yml. We also provide a Dockerfile for containerized setups.

Run Experiments

By default, the following features are enabled:

Reconstruction Uncertainty Predictor (RUP)
Mask Dilation
Depth Regularization

Training the Model

To start training with default settings:

python train.py -s [path to dataset]

Customizing Training Options

To disable specific features, use the following flags:

Disable Reconstruction Uncertainty Predictor (RUP):

python train.py -s [path to dataset] --disable_transient

Disable Mask Dilation:

python train.py -s [path to dataset] --disable_dilate

Disable Depth Regularization:

python train.py -s [path to dataset] --lambda_tv 0

Training With Precomputed Masks

python train.py -s [path to dataset] --masks [path to masks] --disable_transient

Masks should be in .png format.
Masks can have any naming format.
Images and masks are matched based on their positions in the nasorted lists of image filenames and mask filenames.
It is recommended to slightly dilate your masks to account for potential inaccuracies. Use the --mask_dilate flag (default is 5).

Bechmarking

Running TMP Benchmark

To run all experiments without TMR:

bash examples/tmp_benchmark.sh

This script will initiate the training and evaluation processes for the TMP without mask refinement.

Mask Refinement with TMR

To refine transient masks using TMR, follow these steps:

Prepare TMR input

Run the preparation script:

bash examples/prepare_tmr_input.sh

This script performs the following actions:

Reformats Images: Converts images to the format required by SAM2.
Extracts Transient Masks and Differences: Retrieves transient masks and difference images from your T-3DGS checkpoint (default iteration is 7000).

Run TMR

Follow Instructions: Visit the TMR Repository for detailed instructions.
Execute Refinement Script: Use the provided script in the TMR repository to perform mask refinement.

Final Training with Refined Masks After obtaining refined masks from TMR, run the following script to train the model with these masks:

bash examples/tmr_benchmark.sh

Citation

If you find this work useful in your research, please consider citing:

@misc{pryadilshchikov2024t3dgsremovingtransientobjects,
      title={T-3DGS: Removing Transient Objects for 3D Scene Reconstruction}, 
      author={Vadim Pryadilshchikov and Alexander Markin and Artem Komarichev and Ruslan Rakhimov and Peter Wonka and Evgeny Burnaev},
      year={2024},
      eprint={2412.00155},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2412.00155}, 
}

Acknowledgments

Our code is based on the official implementation of 3D Gaussian Splatting (3DGS).

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
SIBR_viewers @ 0103f7f		SIBR_viewers @ 0103f7f
arguments		arguments
examples		examples
gaussian_renderer		gaussian_renderer
lpipsPyTorch		lpipsPyTorch
scene		scene
submodules		submodules
utils		utils
.gitmodules		.gitmodules
Dockerfile		Dockerfile
README.md		README.md
convert.py		convert.py
environment.yml		environment.yml
extract_tmr_prompt.py		extract_tmr_prompt.py
full_eval.py		full_eval.py
metrics.py		metrics.py
prepare_images_for_tmr.py		prepare_images_for_tmr.py
render.py		render.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

T-3DGS: Removing Transient Objects for 3D Scene Reconstruction

Project Page | Paper

Abstract

Overview

Key Features

Installation

Run Experiments

Training the Model

Customizing Training Options

Training With Precomputed Masks

Bechmarking

Running TMP Benchmark

Mask Refinement with TMR

Citation

Acknowledgments

About

Releases

Packages

Languages

Vadim200116/T-3DGS

Folders and files

Latest commit

History

Repository files navigation

T-3DGS: Removing Transient Objects for 3D Scene Reconstruction

Project Page | Paper

Abstract

Overview

Key Features

Installation

Run Experiments

Training the Model

Customizing Training Options

Training With Precomputed Masks

Bechmarking

Running TMP Benchmark

Mask Refinement with TMR

Citation

Acknowledgments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages