Statistical Tests for Large Tree-Structured Data
نویسندگان
چکیده
منابع مشابه
Statistical Modeling of Branching Probabilities for Tree-structured Data Objects
Study of factors affecting the functioning of the human brain has been of considerable interest for more than a century. In this paper, we focus on the structure of brain artery systems in humans and study how this might be related to factors such as age, gender or handedness (left or right handed). To facilitate this study we first represent brain artery systems using tree-structured objects a...
متن کاملTree - Decision Trees for Tree Structured Data∗
We present Tree, a new approach to structural classification. This integrated approach induces decision trees that test for pattern occurrence in the inner nodes. It combines state-of-the-art tree mining with sophisticated pruning techniques to find the most discriminative pattern in each node. In contrast to existing methods, Tree uses no heuristics and only a single, statistically well founde...
متن کاملTree Structured Data Analysis
Classification and regression trees are becoming increasingly popular for partitioning data and identifying local structure in small and large datasets. Classification trees include those models in which the dependent variable (the predicted variable) is categorical. Regression trees include those in which it is continuous. This paper discusses pitfalls in the use of these methods and highlight...
متن کاملTree-Structured Stick Breaking for Hierarchical Data
Many data are naturally modeled by an unobserved hierarchical structure. In this paper we propose a flexible nonparametric prior over unknown data hierarchies. The approach uses nested stick-breaking processes to allow for trees of unbounded width and depth, where data can live at any node and are infinitely exchangeable. One can view our model as providing infinite mixtures where the component...
متن کاملEfficient Similarity Search for Tree-Structured Data
Tree-structured data are becoming ubiquitous nowadays and manipulating them based on similarity is essential for many applications. Although similarity search on textual data has been extensively studied, searching for similar trees is still an open problem due to the high complexity of computing the similarity between trees, especially for large numbers of tress. In this paper, we propose to t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of the American Statistical Association
سال: 2017
ISSN: 0162-1459,1537-274X
DOI: 10.1080/01621459.2016.1240081