ArchDepth
Depth estimation from 2D images is a fundamental research topic in photogrammetry and computer vision toward 3D reconstruction and scene understanding with a vast field of applications, including mapping, navigation, and augmented reality. A recent trend focuses on learning-based monocular depth estimation algorithms and despite the release of large-scale datasets for training, methods are limited in scenarios and still struggle to yield reliable results in the 3D space without additional scene cues. Moreover the transferability of deep learning depth estimation for real-world photogrammetric scenarios is still a challenging problem. We therefore investigated the potential of integrating learning-based monocular depth estimation in photogrammetric applications.
Our contribution includes:
(1) a novel rendered large-scale dataset (ArchDepth) of photorealistic outdoor scenes of historic buildings, including high-quality, complete, metric depth maps for every image;
(2) a straightforward training pipeline following an encoder-decoder network for metric monocular depth estimation to demonstrate the potential of this dataset;
(3) a 3D reconstruction module based on our predictions for single-view 3D scene recovery;
(4) a generalization performance of our trained model and investigate its applicability in real-world photogrammetric scenarios.
ArchDepth is a large-scale dataset of synthetic RGB images and corresponding high-quality and complete metric depth maps. It contains ca 24.000 images of photorealistic outdoor scenes of historic buildings. The dataset is divided into two main parts:
1. Churches - six photorealistic outdoor scenes of historical churches and
2. Piazza - four scenes of an imaginary plaza composed of several buildings facades, trees, walls and statues fused into a single fictional environment.
All the models were retrieved and are freely downloadable from the Sketchfab.com 3D models sharing platform.
Download ~18.23 GB
DOWNLOADS - Samples of the Churches subset (RGB+depthmaps and 3D models)
The Churches subset of the ArchDepth dataset is comprised of a sequence of RGB images and correspondent depth maps generated in Blender. The 3D models used in the data set were retrieved from Sketchfab, created by the 3D artist Lassi Kaukonen: Kuusisto, Liedon, Mietoinen, Nousiainen, Piikkio, Saint Jacobs
This Churches subset features 15.000 RGB/depth couples subdivided per church and per rendering camera.
Kuusisto
cam1 - 640x480 pinhole camera 36mm sensor size, 24mm focal length: Download ~430 MB
cam2 - 640x480 pinhole camera 36mm sensor size, 18mm focal length: Download ~385 MB
cam3 - 640x480 pinhole camera 36mm sensor size, 14mm focal length: Download ~385 MB
cam4 - 640x480 pinhole camera 36mm sensor size, 24mm focal length: Download ~388 MB
Liedon
cam1 - 640x480 pinhole camera, 36mm sensor size, 24mm focal length: Download ~469 MB
cam2 - 640x480 pinhole camera, 36mm sensor size, 18mm focal length: Download ~417 MB
cam3 - 640x480 pinhole camera, 36mm sensor size, 18mm focal length: Download ~458 MB
cam4 - 640x480 pinhole camera, 36mm sensor size, 20mm focal length: Download ~428 MB
cam5 - 640x480 pinhole camera, 36mm sensor size, 20mm focal length Download ~216 MB
cam5var - 640x480 pinhole camera, 36mm sensor size, 20mm focal length: Download ~215 MB
Mietoinen
cam1 - 640x480 pinhole camera, 36mm sensor size, 24mm focal length: Download ~440 MB
cam2 - 640x480 pinhole camera, 36mm sensor size, 18mm focal length: Download ~453 MB
cam3 - 640x480 pinhole camera, 36mm sensor size, 18mm focal length: Download ~432 MB
cam4 - 640x480 pinhole camera, 36mm sensor size, 20mm focal length: Download ~460 MB
Nousiainen
cam1 - 640x480 pinhole camera, 36mm sensor size, 24mm focal length: Download ~532 MB
cam2 - 640x480 pinhole camera, 36mm sensor size, 18mm focal length: Download ~453 MB
cam3 - 640x480 pinhole camera, 36mm sensor size, 18mm focal length: Download ~500 MB
cam4 - 640x480 pinhole camera, 36mm sensor size, 20mm focal length: Download ~462 MB
Piikkio
cam1 - 640x480 pinhole camera, 36mm sensor size, 24mm focal length: Download ~455 MB
cam2 - 640x480 pinhole camera, 36mm sensor size, 18mm focal length: Download ~421 MB
cam3 - 640x480 pinhole camera, 36mm sensor size, 14mm focal length: Download ~392 MB
cam4 - 640x480 pinhole camera, 36mm sensor size, 20mm focal length: Download ~432 MB
SaintJacobs
cam1 - 640x480 pinhole camera, 36mm sensor size, 24mm focal length: Download ~376 MB
cam2 - 640x480 pinhole camera, 36mm sensor size, 18mm focal length: Download ~413 MB
cam3 - 640x480 pinhole camera, 36mm sensor size, 14mm focal length: Download ~400 MB
cam4 - 640x480 pinhole camera, 36mm sensor size, 20mm focal length: Download ~426 MB
The Churches 3D models can be downloaded here: download ~11 GB or as separate zip files:
DOWNLOAD - Samples of the Piazza subset (RGB+depthmaps and 3D models)
This Piazza subset part of the ArchDepth dataset is comprised of a sequence of RGB images and correspendent depth maps generated in Blender. The 3D models used in the subdata were retrieved from Sketchfab, created by various 3D artists: Cathédrale Saint-Pierre, Poitiers, Entry to Church of Saints Cyril and Methodius, Eglise notre-dame-la-grande, Poitiers, Eglise Sainte Radegonde, Poitiers, Eglise du Puch, Sauveterre de Guyenne, Portada románica de la catedral de Valencia, Portail roman, Saint-André-de-Sorède, Eglise Saint Sauveur, Figeac, Eglise St-Médard, Thouars, Damaged Wall, Stone Wall Nr.2, Old Tree, Tree, oak trees, Löwe, Löwe.
The Piazza subset features 9.000 RGB/depth couples subdivided per cardinal directions and per rendering cam.
CityE
cam3 - 640x480 pinhole camera, 36mm sensor size, 18mm focal length: Download ~723 MB
cam4 - 640x480 pinhole camera, 36mm sensor size, 20mm focal length: Download ~947 MB
CityN
cam1 - 640x480 pinhole camera, 36mm sensor size, 18mm focal length: Download ~718 MB
cam2 - 640x480 pinhole camera, 36mm sensor size, 20mm focal length: Download ~924 MB
CityS
cam7 - 640x480 pinhole camera, 36mm sensor size, 18mm focal length: Download ~839 MB
cam8 - 640x480 pinhole camera, 36mm sensor size, 20mm focal length: Download ~1 GB
cam9 - 640x480 pinhole camera, 36mm sensor size, 18mm focal length: Download ~793 MB
cam10 - 640x480 pinhole camera, 36mm sensor size, 20mm focal length: Download ~960 MB
CityW
cam5 - 640x480 pinhole camera, 36mm sensor size, 18mm focal length: Download ~692 MB
cam6 - 640x480 pinhole camera, 36mm sensor size, 20mm focal length: Download ~907 MB
The Piazza 3D models can be downloaded here: download ~8.5 GB or as separate zip files:
Representative Publications:
Welponer, M., Stathopoulou, E.K., Remondino, F., 2022: MONOCULAR DEPTH PREDICTION IN PHOTOGRAMMETRIC APPLICATIONS. Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., XLIII-B2-2022, 469–476, https://doi.org/10.5194/isprs-archives-XLIII-B2-2022-469-2022 - ISPRS Congress 2022
Funding: Internal