Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Akshay Paruchuri

Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos

Mar 26, 2024
Akshay Paruchuri, Samuel Ehrenstein, Shuxian Wang, Inbar Fried, Stephen M. Pizer, Marc Niethammer, Roni Sengupta

Figure 1 for Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos

Figure 2 for Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos

Figure 3 for Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos

Figure 4 for Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos

Monocular depth estimation in endoscopy videos can enable assistive and robotic surgery to obtain better coverage of the organ and detection of various health issues. Despite promising progress on mainstream, natural image depth estimation, techniques perform poorly on endoscopy images due to a lack of strong geometric features and challenging illumination effects. In this paper, we utilize the photometric cues, i.e., the light emitted from an endoscope and reflected by the surface, to improve monocular depth estimation. We first create two novel loss functions with supervised and self-supervised variants that utilize a per-pixel shading representation. We then propose a novel depth refinement network (PPSNet) that leverages the same per-pixel shading representation. Finally, we introduce teacher-student transfer learning to produce better depth maps from both synthetic data with supervision and clinical data with self-supervision. We achieve state-of-the-art results on the C3VD dataset while estimating high-quality depth maps from clinical data. Our code, pre-trained models, and supplementary materials can be found on our project page: https://ppsnet.github.io/

* 26 pages, 7 tables, 7 figures

Via

Access Paper or Ask Questions

Motion Matters: Neural Motion Transfer for Better Camera Physiological Sensing

Apr 02, 2023
Akshay Paruchuri, Xin Liu, Yulu Pan, Shwetak Patel, Daniel McDuff, Soumyadip Sengupta

Figure 1 for Motion Matters: Neural Motion Transfer for Better Camera Physiological Sensing

Figure 2 for Motion Matters: Neural Motion Transfer for Better Camera Physiological Sensing

Figure 3 for Motion Matters: Neural Motion Transfer for Better Camera Physiological Sensing

Figure 4 for Motion Matters: Neural Motion Transfer for Better Camera Physiological Sensing

Machine learning models for camera-based physiological measurement can have weak generalization due to a lack of representative training data. Body motion is one of the most significant sources of noise when attempting to recover the subtle cardiac pulse from a video. We explore motion transfer as a form of data augmentation to introduce motion variation while preserving physiological changes. We adapt a neural video synthesis approach to augment videos for the task of remote photoplethysmography (PPG) and study the effects of motion augmentation with respect to 1) the magnitude and 2) the type of motion. After training on motion-augmented versions of publicly available datasets, the presented inter-dataset results on five benchmark datasets show improvements of up to 75% over existing state-of-the-art results. Our findings illustrate the utility of motion transfer as a data augmentation technique for improving the generalization of models for camera-based physiological sensing. We release our code and pre-trained models for using motion transfer as a data augmentation technique on our project page: https://motion-matters.github.io/

* 16 pages, 6 figures, 14 tables

Via

Access Paper or Ask Questions