Statistical learning approaches to information filtering
نویسنده
چکیده
Enabling computer systems to understand human thinking or behaviors has ever been an exciting challenge to computer scientists. In recent years one such a topic, information filtering, emerges to help users find desired information items (e.g. movies, books, news) from large amount of available data, and has become crucial in many applications, like product recommendation, image retrieval, spam email filtering, news filtering, and web navigation etc.. An information filtering system must be able to understand users’ information needs. Existing approaches either infer a user’s profile by exploring his/her connections to other users, i.e. collaborative filtering (CF), or analyzing the content descriptions of liked or disliked examples annotated by the user, i.e. content-based filtering (CBF). Those methods work well to some extent, but are facing difficulties due to lack of insights into the problem. This thesis intensively studies a wide scope of information filtering technologies. Novel and principled machine learning methods are proposed to model users’ information needs. The work demonstrates that the uncertainty of user profiles and the connections between them can be effectively modelled by using probability theory and Bayes rule. As one major contribution of this thesis, the work clarifies the “structure” of information filtering and gives rise to principled solutions. In summary, the work of this thesis mainly covers the following three aspects: • Collaborative filtering : We develop a probabilistic model for memorybased collaborative filtering (PMCF), which has clear links with classical memory-based CF. Various heuristics to improve memory-based CF have been proposed in the literature. In contrast, extensions based on PMCF can be made in a principled probabilistic way. With PMCF, we
منابع مشابه
A New Similarity Measure Based on Item Proximity and Closeness for Collaborative Filtering Recommendation
Recommender systems utilize information retrieval and machine learning techniques for filtering information and can predict whether a user would like an unseen item. User similarity measurement plays an important role in collaborative filtering based recommender systems. In order to improve accuracy of traditional user based collaborative filtering techniques under new user cold-start problem a...
متن کاملImproving Web Service Clustering through Post Filtering to Bootstrap the Service Discovery
Web service clustering is one of a very efficient approach to discover Web services efficiently. Current approaches use similarity-distance measurement methods such as string-based, corpus-based, knowledge-based and hybrid methods. These approaches have problems that include discovering semantic characteristics, loss of semantic information, shortage of high-quality ontologies and encoding fine...
متن کاملUsing Probabilistic Relational Models for Collaborative Filtering
Recent projects in collaborative filtering and information filtering address the task of inferring user preference relationships for products or information. The data on which these inferences are based typically consists of pairs of people and items. The items may be information sources (such as web pages or newspaper articles) or products (such as books, software, movies or CDs). We are inter...
متن کاملThe Application of Deep Learning in Collaborative Filtering
As the fast development of Internet, users are facing severe “Information overload” problems. Recommender systems which aim to solve the information problem have been developed very in fast in the past 20 years. Recommender systems aim to provide users with a list of items which can satisfy their preferences. Among the approaches in Recommender systems, Collaborative Filtering [1] mainly utiliz...
متن کاملA Unified Recommendation Framework Based on Probabilistic Relational Models
Zan Huang, Daniel D. Zeng, Hsinchun Chen Department of Management Information Systems, The University of Arizona {zhuang, zeng, hchen}@eller.arizona.edu Abstract Recommender systems are being increasingly adopted in various e-commerce applications. A wide range of recommendation approaches have been developed to analyze past consumer-product interactions, consumer attributes, and product attrib...
متن کامل