Generative Models for 2-D images of 3-D scenes
نویسنده
چکیده
Generative Models for 2-D images of 3-D scenes Anitha Kannan Doctor of Philosophy Graduate Department of Computer Science University of Toronto 2007 This thesis introduces generative models of appearance for analyzing 2D images of 3D visual scenes. Many 2D images contain multiple objects, where image components corresponding to each object undergo deformations, eg., due to changes in the shape and appearance of the object, non-uniform scaling caused by camera zoom and global transformations such as changes in the location of the object. In this thesis, we use Bayesian networks to model these sources of variability and their potentially noisy interactions while describing an image. Given an observed image, the model is used to infer the distribution over the possible explanations of the unobserved sources. But, there is an exponentially large number of combinations, so an exact search over these combinations is practically infeasible. This thesis introduces approximate inference and learning algorithms based on variational methods for inferring the underlying explanations of the observed data. In the first part of this thesis, a fast unsupervised algorithm for learning an object’s appearance in a linear subspace, while being invariant to global transformations such as translations, rotations, and scalings is introduced. For modelling multiple objects in the scene, the second part of the thesis presents a generative model for explaining an image using a layered composition of “card-board cutouts”. With this, each object is accounted for by a model of 2D image, including a transparency map that specifies pixels belonging to the object. Each such “layer” includes hidden variables that account for sources of variability such as changes in appearance, shape, deformation and global transformation. The model is learned using a
منابع مشابه
Statistics of Deep Generated Images
Here, we explore the low-level statistics of images generated by state-of-the-art deep generative models. First, Variational auto-encoder (VAE [19]), Wasserstein generative adversarial network (WGAN [2]) and deep convolutional generative adversarial network (DCGAN [23]) are trained on the ImageNet dataset and a large set of cartoon frames from animations. Then, for images generated by these mod...
متن کاملImprovement of generative adversarial networks for automatic text-to-image generation
This research is related to the use of deep learning tools and image processing technology in the automatic generation of images from text. Previous researches have used one sentence to produce images. In this research, a memory-based hierarchical model is presented that uses three different descriptions that are presented in the form of sentences to produce and improve the image. The proposed ...
متن کاملInverse Graphics with Probabilistic CAD Models
Recently, multiple formulations of vision problems as probabilistic inversions of generative models based on computer graphics have been proposed. However, applications to 3D perception from natural images have focused on low-dimensional latent scenes, due to challenges in both modeling and inference. Accounting for the enormous variability in 3D object shape and 2D appearance via realistic gen...
متن کاملPlanelet Transform: A New Geometrical Wavelet for Compression of Kinect-like Depth Images
With the advent of cheap indoor RGB-D sensors, proper representation of piecewise planar depth images is crucial toward an effective compression method. Although there exist geometrical wavelets for optimal representation of piecewise constant and piecewise linear images (i.e. wedgelets and platelets), an adaptation to piecewise linear fractional functions which correspond to depth variation ov...
متن کاملParsing Images into Region and Curve Processes
Natural scenes consist of a wide variety of stochastic pat terns While many patterns are represented well by statistical models in two dimensional regions as most image segmentation work assume some other patterns are fundamentally one dimensional and thus cause major problems in segmentation We call the former region processes and the latter curve processes In this paper we propose a stochasti...
متن کامل