UDepth: Fast Monocular Depth Estimation for Visually-guided Underwater Robots

Yu, Boxiao; Wu, Jiayi; Islam, Md Jahidul


arXiv:2209.12358 (cs)

[Submitted on 26 Sep 2022 (v1), last revised 2 Feb 2023 (this version, v2)]

Abstract: In this paper, we present a fast monocular depth estimation method for enabling 3D perception capabilities of low-cost underwater robots. We formulate a novel end-to-end deep visual learning pipeline named UDepth, which incorporates domain knowledge of the image formation characteristics of natural underwater scenes. First, we adapt a new input space from the raw RGB image space by exploiting an underwater light attenuation prior, and then devise a least-squared formulation for coarse pixel-wise depth prediction. Subsequently, we extend this into a domain projection loss that guides the end-to-end learning of UDepth on over 9K RGB-D training samples. UDepth is designed with a computationally light MobileNetV2 backbone and a Transformer-based optimizer to ensure fast inference rates on embedded systems. Through domain-aware design choices and comprehensive experimental analyses, we demonstrate that it is possible to achieve state-of-the-art depth estimation performance while ensuring a small computational footprint. Specifically, with 70%-80% fewer network parameters than existing benchmarks, UDepth achieves comparable and often better depth estimation performance. While the full model offers inference rates of over 66 FPS on a single GPU (13 FPS on a single CPU core), our domain projection for coarse depth prediction runs at 51.5 FPS on single-board NVIDIA Jetson TX2s. The inference pipelines are available at this https URL.
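
The abstract does not spell out the least-squared formulation, so the following is only a minimal sketch of how a coarse pixel-wise depth fit of this kind might look. It assumes, as an illustration and not as the authors' confirmed method, that depth is modeled as a linear function of two attenuation-motivated features: the red channel, which fades fastest underwater, and the per-pixel maximum of green and blue. The function names, the feature choice, and the synthetic data are all hypothetical.

```python
import numpy as np

def coarse_depth_features(rgb):
    """Build an N x 3 design matrix [1, M, R] from an H x W x 3 float image.

    R is the red channel and M = max(G, B); both are assumed features
    chosen for illustration, not taken from the abstract.
    """
    r = rgb[..., 0].ravel()
    m = np.maximum(rgb[..., 1], rgb[..., 2]).ravel()
    return np.column_stack([np.ones_like(r), m, r])

def fit_coarse_depth(rgb, depth):
    """Solve min_mu ||A mu - d||^2 for the three linear coefficients."""
    A = coarse_depth_features(rgb)
    mu, *_ = np.linalg.lstsq(A, depth.ravel(), rcond=None)
    return mu

def predict_coarse_depth(rgb, mu):
    """Apply the fitted coefficients to every pixel; returns an H x W map."""
    return (coarse_depth_features(rgb) @ mu).reshape(rgb.shape[:2])

# Synthetic check: depth generated from known (hypothetical) coefficients
# should be recovered by the least-squares fit.
rng = np.random.default_rng(0)
rgb = rng.random((32, 32, 3))
true_mu = np.array([0.5, 0.4, -0.3])
depth = predict_coarse_depth(rgb, true_mu)
mu = fit_coarse_depth(rgb, depth)
```

Because a fit like this reduces to a few multiply-adds per pixel at inference time, it is the kind of computation that could plausibly reach high frame rates on an embedded board, though the sketch makes no claim about matching the reported numbers.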

Comments:	10 pages, 6 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
Cite as:	arXiv:2209.12358 [cs.CV]
	(or arXiv:2209.12358v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2209.12358

Submission history

From: Boxiao Yu
[v1] Mon, 26 Sep 2022 01:08:36 UTC (2,107 KB)
[v2] Thu, 2 Feb 2023 16:31:39 UTC (2,304 KB)
