Deformations and Discriminative Models for Image Recognition
نویسندگان
چکیده
In this work we present approaches to incorporate domain-knowledge into discriminative classifiers. In particular, we investigate the incorporation of the image distortion model into log-linear models and support vector machines. Discriminative models are a well-known technique in many fields of machine learning and pattern recognition and the image distortion model has recently been shown to be a very effective means of modelling variability in images of handwritten characters, however, so far it was impossible to fuse the advantages of these two different approaches. In order to incorporate the IDM into the log-linear model, we re-investigate the probabilistic formulation of the IDM, and the relationships between Gaussian models and log-linear models, which allows for a direct integration of the IDM into the log-linear framework, where an alignment of an image to a prototype is considered a hidden variable. Alternating optimisation techniques allow for effective training of log-linear models with hidden variables. Furthermore,we have investigated how to combine SVMs with the IDM. The performance of the different models is examined on two standard tasks, the USPS tasks, which, due to its small size, allows for extensive testing and tuning of models and the MNIST database, which is considered a standard benchmark in OCR applications. The results obtained compare favourably well to other, comparable models. In particular, we report the best error rate using a single density model on the USPS task.
منابع مشابه
Deformation-Aware Log-Linear Models
In this paper, we present a novel deformation-aware discriminative model for handwritten digit recognition. Unlike previous approaches our model directly considers image deformations and allows discriminative training of all parameters, including those accounting for non-linear transformations of the image. This is achieved by extending a log-linear framework to incorporate a latent deformation...
متن کاملDeformations, patches, and discriminative models for automatic annotation of medical radiographs
In this paper, we describe three different methods for the classification and annotation of medical radiographs. The methods were applied in the medical image annotation tasks of ImageCLEF in 2005, 2006, and 2007. Image annotation can be used to access and find images in a database using textual queries when no textual image description is available. One of the methods is a non-linear model tak...
متن کاملImage Retrieval, Object Recognition, and Discriminative Models
In this thesis, we present approaches to image retrieval, object recognition, and discriminative models. For image retrieval, we evaluate a large variety of different descriptors and answer the questions how descriptors can be combined and which descriptor should be chosen according to which criterion. We suggest a set of local descriptors that have been used successfully for object recognition...
متن کاملVinod Nair A thesis submitted in conformity with the requirements for the degree of Doctor of Philosophy
Visual Object Recognition Using Generative Models of Images Vinod Nair Doctor of Philosophy Graduate Department of Computer Science University of Toronto 2010 Visual object recognition is one of the key human capabilities that we would like machines to have. The problem is the following: given an image of an object (e.g. someone’s face), predict its label (e.g. that person’s name) from a set of...
متن کاملA Latent Space Mapping for Link Prediction
Network modeling can be approached using either discriminative or probabilistic models. In the task of link prediction a probabilistic model will give a probability for the existence of a link; while in some scenarios this may be beneficial, in others a hard discriminative boundary needs to be set. Hence the use of a discriminative classifier is preferable. In domains such as image analysis and...
متن کامل