Neural Contourlet Network for Monocular 360° Depth Estimation
نویسندگان
چکیده
For a monocular 360° image, depth estimation is challenging because the distortion increases along latitude. To perceive distortion, existing methods devote to designing deep and complex network architecture. In this paper, we provide new perspective that constructs an interpretable sparse representation for image. Considering importance of geometric structure in estimation, utilize contourlet transform capture explicit cue spectral domain integrate it with implicit spatial domain. Specifically, propose neural consisting convolutional branch. encoder stage, design spatial–spectral fusion module effectively fuse two types cues. Contrary encoder, employ inverse learned low-pass subbands band-pass directional compose decoder. Experiments on three popular panoramic image datasets demonstrate proposed approach outperforms state-of-the-art schemes faster convergence. Code available at https://github.com/zhijieshen-bjtu/Neural-Contourlet-Network-for-MODE .
منابع مشابه
Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation
Recent works have shown the benefit of integrating Conditional Random Fields (CRFs) models into deep architectures for improving pixel-level prediction tasks. Following this line of research, in this paper we introduce a novel approach for monocular depth estimation. Similarly to previous works, our method employs a continuous CRF to fuse multi-scale information derived from different layers of...
متن کاملAperture Supervision for Monocular Depth Estimation
We present a novel method to train machine learning algorithms to estimate scene depths from a single image, by using the information provided by a camera’s aperture as supervision. Prior works use a depth sensor’s outputs or images of the same scene from alternate viewpoints as supervision, while our method instead uses images from the same viewpoint taken with a varying camera aperture. To en...
متن کاملDepth Estimation Using Monocular and Stereo Cues
Depth estimation in computer vision and robotics is most commonly done via stereo vision (stereopsis), in which images from two cameras are used to triangulate and estimate distances. However, there are also numerous monocular visual cues— such as texture variations and gradients, defocus, color/haze, etc.—that have heretofore been little exploited in such systems. Some of these cues apply even...
متن کاملBayesian depth estimation from monocular natural images.
Estimating an accurate and naturalistic dense depth map from a single monocular photographic image is a difficult problem. Nevertheless, human observers have little difficulty understanding the depth structure implied by photographs. Two-dimensional (2D) images of the real-world environment contain significant statistical information regarding the three-dimensional (3D) structure of the world t...
متن کاملQualitative Estimation of Depth in Monocular Vision
In this paper we propose two techniques to qualitatively estimate distance in monocular vision. Two kinds of approaches are described, the former based on texture analysis and the latter on histogram inspection. Although both the methods allow only to determine whether a point within an image is nearer or farther than another with respect to the observer, they can be usefully exploited in all t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Circuits and Systems for Video Technology
سال: 2022
ISSN: ['1051-8215', '1558-2205']
DOI: https://doi.org/10.1109/tcsvt.2022.3192283