Gap Filling in the Plant Kingdom - Trait Prediction Using Hierarchical Probabilistic Matrix Factorization
Authors
Abstract
Plant traits are key to understanding and predicting how ecosystems adapt to environmental change. This motivates the TRY project, which aims to construct a global database of plant traits and to become a standard resource for the ecological community. Despite its unprecedented coverage, a large percentage of missing data substantially constrains joint trait analysis. Meanwhile, the trait data are characterized by the hierarchical phylogenetic structure of the plant kingdom. While factorization-based matrix completion techniques have been widely used to address the missing-data problem, traditional matrix factorization methods are unable to leverage the phylogenetic structure. We propose hierarchical probabilistic matrix factorization (HPMF), which effectively uses hierarchical phylogenetic information for trait prediction. Through experiments, we demonstrate HPMF's high accuracy, the effectiveness of incorporating hierarchical structure, and its ability to capture trait correlation.
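The idea in the abstract can be illustrated with a minimal sketch. This is not the authors' HPMF implementation. It assumes a two-level hierarchy (for example, species above individual plants), plain gradient descent, and Gaussian priors expressed as L2 penalties. The function names `pmf_fill` and `hierarchical_fill`, the `parent` array, and all hyperparameter values are hypothetical choices made for illustration.

```python
import numpy as np

def pmf_fill(X, mask, rank=2, lam=0.1, lr=0.01, epochs=500,
             U_prior=None, V_prior=None, seed=0):
    """Factorize X ~ U @ V.T using only the entries where mask is True.

    U_prior and V_prior are prior means for the latent factors
    (zero if omitted); lam sets how strongly factors are pulled to them.
    """
    rng = np.random.default_rng(seed)
    n, m = X.shape
    U_prior = np.zeros((n, rank)) if U_prior is None else U_prior
    V_prior = np.zeros((m, rank)) if V_prior is None else V_prior
    # Start near the prior means, with a little noise to break symmetry.
    U = U_prior + 0.1 * rng.standard_normal((n, rank))
    V = V_prior + 0.1 * rng.standard_normal((m, rank))
    Xz = np.where(mask, X, 0.0)  # missing cells never enter the loss
    for _ in range(epochs):
        R = mask * (U @ V.T - Xz)              # residual on observed cells
        grad_U = R @ V + lam * (U - U_prior)
        grad_V = R.T @ U + lam * (V - V_prior)
        U -= lr * grad_U
        V -= lr * grad_V
    return U, V

def hierarchical_fill(X, mask, parent, rank=2, **kw):
    """Two-level sketch: fit group-level factors, then use them as priors.

    parent[i] is the integer index of the group (e.g. species) of row i.
    """
    n_groups, m = int(parent.max()) + 1, X.shape[1]
    sums = np.zeros((n_groups, m))
    counts = np.zeros((n_groups, m))
    np.add.at(sums, parent, np.where(mask, X, 0.0))
    np.add.at(counts, parent, mask.astype(float))
    G_mask = counts > 0
    G = np.where(G_mask, sums / np.maximum(counts, 1.0), 0.0)  # group means
    U_top, V_top = pmf_fill(G, G_mask, rank=rank, **kw)
    # Each row inherits its group's factors as the prior mean.
    U, V = pmf_fill(X, mask, rank=rank,
                    U_prior=U_top[parent], V_prior=V_top, **kw)
    return U @ V.T  # dense prediction, including the former gaps
```

Calling `hierarchical_fill(X, mask, parent)` returns a dense matrix whose entries at `~mask` are the gap-filled predictions. A row with few observed traits stays close to its group's factors, which is the sense in which the hierarchy informs the prediction in this sketch.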
Similar resources
Hierarchical Probabilistic Matrix Factorization with Network Topology for Multi-relational Social Network
Link prediction in multi-relational social networks has attracted much attention. For instance, we may care about the chance of two users being friends based on their other patterns of contact, e.g., SMS and phone calls. In previous work, matrix factorization models have typically been applied to single-relational networks; however, two challenges arise in extending them to multi-relational networks. First, ...
Leveraging Decomposed Trust in Probabilistic Matrix Factorization for Effective Recommendation
Trust has been used to replace or complement rating-based similarity in recommender systems, to improve the accuracy of rating prediction. However, people who trust each other may not always share similar preferences. In this paper, we try to fill this gap by decomposing the original single-aspect trust information into four general trust aspects, i.e. benevolence, integrity, competence, and p...
Multi-way clustering of microarray data using probabilistic sparse matrix factorization
Motivation: We address the problem of multi-way clustering of microarray data using a generative model. Our algorithm, probabilistic sparse matrix factorization (PSMF), is a probabilistic extension of a previous hard-decision algorithm for this problem. PSMF allows for varying levels of sensor noise in the data, uncertainty in the hidden prototypes used to explain the data and uncertainty as to ...
A new approach for building recommender system using non negative matrix factorization method
Nonnegative matrix factorization is a new approach to reducing data dimensionality. By exploiting the nonnegativity of the data, this method decomposes the matrix into components that are more closely interrelated and divides the data into sections whose entries share a specific relationship. In this paper, we use nonnegative matrix factorization to decompose the user ratin...
Development of simulation model for performance evaluation of feed water system in a typical thermal power plant
The present paper deals with the development of a simulation model for evaluating the performance of the feed water system of a thermal power plant, using a Markov birth-death process and a probabilistic approach. In the present paper, the feed water system consists of four subsystems. After drawing the transition diagram for the feed water system, differential equations are developed and then solved recursively using ...