DepthCut: Improved Depth Edge Estimation Using Multiple Unreliable Channels
Abstract
In the context of scene understanding, a variety of methods exist to estimate different information channels from mono or stereo images, including disparity, depth, and normals. Although several advances have been reported in recent years for these tasks, the estimated information is often imprecise, particularly near depth contours or creases. Studies have shown, however, that precisely such depth edges carry critical cues for the perception of shape and play important roles in tasks like depth-based segmentation or foreground selection. Unfortunately, the currently extracted channels often carry conflicting signals, making it difficult for subsequent applications to use them effectively. In this paper, we focus on the problem of obtaining high-precision depth edges by jointly analyzing such unreliable information channels. We propose DEPTHCUT, a data-driven fusion of the channels using a convolutional neural network trained on a large dataset with known depth. The resulting depth edges can be used for segmentation, decomposing a scene into segments with relatively smooth depth, or for improving the accuracy of the depth estimate near depth edges by constraining its gradients to agree with these edges. Quantitative experiments show that our depth edges result in improved segmentation performance compared to a more naive channel fusion. Qualitatively, we demonstrate that the depth edges can be used for superior segmentation and an improved depth estimate near depth edges.
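To make the idea of a "naive channel fusion" concrete, the sketch below shows one way such a fusion could look: each channel's gradient magnitude is normalized, the results are averaged, and the average is thresholded. This is an illustrative assumption only, not the baseline or the network described in the abstract; the function names, the threshold, and the toy channels are all hypothetical.

```python
import numpy as np

def gradient_magnitude(channel):
    """Finite-difference gradient magnitude, scaled so the peak is 1."""
    gy, gx = np.gradient(channel.astype(float))
    mag = np.hypot(gx, gy)
    peak = mag.max()
    return mag / peak if peak > 0 else mag

def naive_edge_fusion(channels, threshold=0.5):
    """Average per-channel edge strength and threshold it.

    Hypothetical hand-crafted fusion, used here only for illustration.
    """
    strength = np.mean([gradient_magnitude(c) for c in channels], axis=0)
    return strength >= threshold

# Toy scene: a depth step between columns 3 and 4.
depth = np.zeros((8, 8))
depth[:, 4:] = 1.0

# Two imperfect channels (both made up): one with a small deterministic
# perturbation, one with the step smeared over an extra pixel.
disparity = depth + 0.01 * np.sin(np.arange(64.0)).reshape(8, 8)
smeared = depth.copy()
smeared[:, 4] = 0.5

edges = naive_edge_fusion([disparity, smeared])
```

In this toy case both channels agree on the step, so the averaged edge strength passes the threshold only at the two columns adjacent to it. When channels disagree, a fixed average like this has no way to decide which one to trust, which is the kind of situation a learned fusion is meant to address.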
Journal title: CoRR
Volume: abs/1705.07844
Pages: -
Publication date: 2017