Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Thomas Pöllabauer

One-to-many Reconstruction of 3D Geometry of cultural Artifacts using a synthetically trained Generative Model

Feb 13, 2024
Thomas Pöllabauer, Julius Kühn, Jiayi Li, Arjan Kuijper

Estimating the 3D shape of an object using a single image is a difficult problem. Modern approaches achieve good results for general objects, based on real photographs, but worse results on less expressive representations such as historic sketches. Our automated approach generates a variety of detailed 3D representation from a single sketch, depicting a medieval statue, and can be guided by multi-modal inputs, such as text prompts. It relies solely on synthetic data for training, making it adoptable even in cases of only small numbers of training examples. Our solution allows domain experts such as a curators to interactively reconstruct potential appearances of lost artifacts.

* 21st Eurographics Workshop on Graphics and Cultural Heritage (GCH 2023)

Via

Access Paper or Ask Questions

Extending 6D Object Pose Estimators for Stereo Vision

Feb 08, 2024
Thomas Pöllabauer, Jan Emrich, Volker Knauthe, Arjan Kuijper

Estimating the 6D pose of objects accurately, quickly, and robustly remains a difficult task. However, recent methods for directly regressing poses from RGB images using dense features have achieved state-of-the-art results. Stereo vision, which provides an additional perspective on the object, can help reduce pose ambiguity and occlusion. Moreover, stereo can directly infer the distance of an object, while mono-vision requires internalized knowledge of the object's size. To extend the state-of-the-art in 6D object pose estimation to stereo, we created a BOP compatible stereo version of the YCB-V dataset. Our method outperforms state-of-the-art 6D pose estimation algorithms by utilizing stereo vision and can easily be adopted for other dense feature-based algorithms.

Via

Access Paper or Ask Questions

A Concept for Reconstructing Stucco Statues from historic Sketches using synthetic Data only

Feb 08, 2024
Thomas Pöllabauer, Julius Kühn

In medieval times, stuccoworkers used a red color, called sinopia, to first create a sketch of the to-be-made statue on the wall. Today, many of these statues are destroyed, but using the original drawings, deriving from the red color also called sinopia, we can reconstruct how the final statue might have looked.We propose a fully-automated approach to reconstruct a point cloud and show preliminary results by generating a color-image, a depth-map, as well as surface normals requiring only a single sketch, and without requiring a collection of other, similar samples. Our proposed solution allows real-time reconstruction on-site, for instance, within an exhibition, or to generate a useful starting point for an expert, trying to manually reconstruct the statue, all while using only synthetic data for training.

* Eurographics Workshop on Graphics and Cultural Heritage 2022

Via

Access Paper or Ask Questions

Detection and Pose Estimation of flat, Texture-less Industry Objects on HoloLens using synthetic Training

Feb 07, 2024
Thomas Pöllabauer, Fabian Rücker, Andreas Franek, Felix Gorschlüter

Current state-of-the-art 6d pose estimation is too compute intensive to be deployed on edge devices, such as Microsoft HoloLens (2) or Apple iPad, both used for an increasing number of augmented reality applications. The quality of AR is greatly dependent on its capabilities to detect and overlay geometry within the scene. We propose a synthetically trained client-server-based augmented reality application, demonstrating state-of-the-art object pose estimation of metallic and texture-less industry objects on edge devices. Synthetic data enables training without real photographs, i.e. for yet-to-be-manufactured objects. Our qualitative evaluation on an AR-assisted sorting task, and quantitative evaluation on both renderings, as well as real-world data recorded on HoloLens 2, sheds light on its real-world applicability.

* In Scandinavian Conference on Image Analysis 2023 (pp. 569-585). Cham: Springer Nature Switzerland
* Scandinavian Conference on Image Analysis 2023

Via

Access Paper or Ask Questions

YCB-Ev: Event-vision dataset for 6DoF object pose estimation

Sep 15, 2023
Pavel Rojtberg, Thomas Pöllabauer

Our work introduces the YCB-Ev dataset, which contains synchronized RGB-D frames and event data that enables evaluating 6DoF object pose estimation algorithms using these modalities. This dataset provides ground truth 6DoF object poses for the same 21 YCB objects \cite{calli2017yale} that were used in the YCB-Video (YCB-V) dataset, enabling the evaluation of algorithm performance when transferred across datasets. The dataset consists of 21 synchronized event and RGB-D sequences, amounting to a total of 7:43 minutes of video. Notably, 12 of these sequences feature the same object arrangement as the YCB-V subset used in the BOP challenge. Our dataset is the first to provide ground truth 6DoF pose data for event streams. Furthermore, we evaluate the generalization capabilities of two state-of-the-art algorithms, which were pre-trained for the BOP challenge, using our novel YCB-V sequences. The proposed dataset is available at https://github.com/paroj/ycbev.

Via

Access Paper or Ask Questions

Style-transfer GANs for bridging the domain gap in synthetic pose estimator training

Apr 28, 2020
Pavel Rojtberg, Thomas Pöllabauer, Arjan Kuijper

Figure 1 for Style-transfer GANs for bridging the domain gap in synthetic pose estimator training

Figure 2 for Style-transfer GANs for bridging the domain gap in synthetic pose estimator training

Figure 3 for Style-transfer GANs for bridging the domain gap in synthetic pose estimator training

Figure 4 for Style-transfer GANs for bridging the domain gap in synthetic pose estimator training

Given the dependency of current CNN architectures on a large training set, the possibility of using synthetic data is alluring as it allows generating a virtually infinite amount of labeled training data. However, producing such data is a non-trivial task as current CNN architectures are sensitive to the domain gap between real and synthetic data. We propose to adopt general-purpose GAN models for pixel-level image translation, allowing to formulate the domain gap itself as a learning problem. Here, we focus on training the single-stage YOLO6D object pose estimator on synthetic CAD geometry only, where not even approximate surface information is available. Our evaluation shows a considerable improvement in model performance when compared to a model trained with the same degree of domain randomization, while requiring only very little additional effort.

Via

Access Paper or Ask Questions