News

October 13, 2022: We migrated all the datasets to our new NAS server, links updated below. The simulated dataset with time-resolved raw light transport data (340 GB in total) has now a public link (see Downloads).
February 22, 2018: Dataset and network available (see Downloads).
November 14, 2017: Web launched.
November 8, 2017: Paper available (see Downloads).

Abstract

Time-of-flight (ToF) imaging has become a widespread technique for depth estimation, allowing affordable off-the-shelf cameras to provide depth maps in real time. However, multipath interference (MPI) resulting from indirect illumination significantly degrades the captured depth. Most previous works have tried to solve this problem by means of complex hardware modifications or costly computations. In this work we avoid these approaches, and propose a new technique that corrects errors in depth caused by MPI that requires no camera modifications, and corrects depth in just 10 milliseconds per frame. By observing that most MPI information can be expressed as a function of the captured depth, we pose MPI removal as a convolutional approach, and model it using a convolutional neural network. In particular, given that the input and output data present similar structure, we base our network in an autoencoder, which we train in two stages: first, we use the encoder (convolution filters) to learn a suitable basis to represent corrupted range images; then, we train the decoder (deconvolution filters) to correct depth from the learned basis from synthetically generated scenes. This approach allows us to tackle the lack of reference data, by using a large-scale captured training set with corrupted depth to train the encoder, and a smaller synthetic training set with ground truth depth to train the corrector stage of the network, which we generate by using a physically-based, time-resolved rendering. We demonstrate and validate our method on both synthetic and real complex scenarios, using an off-the-shelf ToF camera, and with only the captured incorrect depth as input.

Downloads

Paper [PDF, 22.7 MB]
Slides [PPTX, 34.9 MB]
Supplemental video [MP4, 110.7 MB] [Stream]
Architecture definition and trained network [ZIP, 1MB]
Zaragoza-DeepToF dataset (train and test) [ZIP, 8.5GB] [README] [Scenes table]
Zaragoza-DeepToF time-resolved raw dataset [HDF5 files, 340 GB]

Dataset Contents

The Zaragoza-DeepToF dataset is provided in HDF5 format, and contains labeled Time-of-Flight depth simulations for a set of 1050 viewpoints and albedo combinations (augmented to a total of 8400) in different diffuse scenes. In particular we provide the ToF depth images and amplitude (with MPI errors), and their respective reference depth images, all at a resolution of 256x256 pixels. The ToF depths and amplitudes were obtained by simulating the ToF imaging model with sinusoidal 20MHz modulation frequency. The time-resolved simulations were obtained with the physically-based transient framework by Jarabo and colleagues [2014].

Additionally, the Zaragoza-DeepToF time-resolved raw dataset contains the 1050 time-resolved physically-based raw simulations generated by the transient renderer. These simulations have a temporal resolution of 16.67 picoseconds, with a maximum time of flight of 68 nanoseconds (i.e. 20 meters in vacuum).

If you plan to use any of these datasets, please cite both works [Marco et al. 2017, Jarabo et al. 2014] using the provided bibtex snippets.

Supplemental Video

Bibtex

@article{MarcoSIGA2017DeepToF, author = {Marco, Julio and Hernandez, Quercus and Mu\~{n}oz, Adolfo and Dong, Yue and Jarabo, Adrian and Kim, Min and Tong, Xin and Gutierrez, Diego}, title = {DeepToF: Off-the-Shelf Real-Time Correction of Multipath Interference in Time-of-Flight Imaging}, journal = {ACM Transactions on Graphics (SIGGRAPH Asia 2017)}, volume = {36}, number = {6}, year = {2017} }

Related Bibtex

2014: A Framework for Transient Rendering

@article{JaraboSIGA14, author = {Jarabo, Adrian and Marco, Julio and Mu\~{n}oz, Adolfo and Buisan, Raul and Jarosz, Wojciech and Gutierrez, Diego}, title = {A Framework for Transient Rendering}, journal = {ACM Transactions on Graphics (SIGGRAPH Asia 2014)}, volume = {33}, number = {6}, articleno = {177}, year = {2014}, }

2017: Recent Advances in Transient Imaging: A Computer Graphics and Vision Perspective

@article{Jarabo2017transient, title={Recent Advances in Transient Imaging: A Computer Graphics and Vision Perspective}, author={Jarabo, Adrian and Masia, Belen and Marco, Julio and Gutierrez, Diego}, journal={Visual Informatics}, Volume= {1}, Number={1}, year={2017} }

Acknowledgements

We want to thank the anonymous reviewers for their insightful comments, Belen Masia for proofreading the manuscript, and the members of the Graphics & Imaging Lab for helpful discussions. We also thank David Jimenez for providing the code for our comparisons. This project has received funding from the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation programme (CHAMELEON project, grant agreement No 682080), DARPA (project REVEAL), and the Spanish Ministerio de Economía y Competitividad (projects TIN2016-78753-P and TIN2014-61696-EXP). Min H. Kim acknowledges Korea NRF grants (2016R1A2B2013031, 2013M3A6A6073718), Giga KOREA Project (GK17P0200) and KOCCA in MCST of Korea. Julio Marco was additionally funded by a grant from the Gobierno de Aragón.