Search results for: cross validation error
Number of results: 878094
As a simple and compelling approach for estimating out-of-sample prediction error, cross-validation naturally lends itself to the task of model comparison. However, even with a moderate sample size, it can be surprisingly difficult to compare multilevel models based on predictive accuracy. Using a hierarchical model fit to large survey data with a battery of questions, we demonstrate that even tho...
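To make the comparison concrete, here is a minimal sketch (not the paper's method; the data and models are assumptions) that estimates out-of-sample accuracy for two candidate classifiers by 10-fold cross-validation. The overlapping standard errors it typically reports are one symptom of the comparison difficulty described above.

```python
# Minimal sketch: model comparison via cross-validated error.
# Data and model choices are illustrative assumptions.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=500, n_features=20, random_state=0)

for name, model in [("logistic", LogisticRegression(max_iter=1000)),
                    ("forest", RandomForestClassifier(random_state=0))]:
    scores = cross_val_score(model, X, y, cv=10)
    # Mean CV accuracy with its standard error across folds.
    print(f"{name}: {scores.mean():.3f} +/- {scores.std() / np.sqrt(len(scores)):.3f}")
```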
MOTIVATION Ranking gene feature sets is a key issue for both phenotype classification, for instance, tumor classification in a DNA microarray experiment, and prediction in the context of genetic regulatory networks. Two broad methods are available to estimate the error (misclassification rate) of a classifier. Resubstitution fits a single classifier to the data, and applies this classifier in t...
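The two error-estimation methods contrasted here are easy to demonstrate. The following hedged sketch (the data and classifier are assumptions, not from the abstract) shows how resubstitution is optimistically biased relative to cross-validation, using 1-nearest-neighbor as an extreme case.

```python
# Sketch: resubstitution error vs. cross-validated error for one classifier.
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

X, y = make_classification(n_samples=200, n_features=30, random_state=1)
clf = KNeighborsClassifier(n_neighbors=1)

# Resubstitution: fit once, score on the same data -- optimistically biased
# (1-NN attains essentially zero resubstitution error by construction).
resub_error = 1.0 - clf.fit(X, y).score(X, y)

# Cross-validation: score on held-out folds instead.
cv_error = 1.0 - cross_val_score(clf, X, y, cv=5).mean()

print(f"resubstitution error: {resub_error:.3f}, CV error: {cv_error:.3f}")
```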
BACKGROUND We address the problem of selecting and assessing classification and regression models using cross-validation. Current state-of-the-art methods can yield models with high variance, rendering them unsuitable for a number of practical applications including QSAR. In this paper we describe and evaluate best practices which improve reliability and increase confidence in selected models. ...
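One widely recommended practice in this setting is nested cross-validation, which separates hyperparameter selection (inner loop) from error assessment (outer loop). The sketch below illustrates the idea under an assumed dataset and Ridge model; it is not taken from the paper itself.

```python
# Sketch: nested cross-validation -- tune inside, assess outside.
from sklearn.datasets import make_regression
from sklearn.linear_model import Ridge
from sklearn.model_selection import GridSearchCV, cross_val_score

X, y = make_regression(n_samples=300, n_features=50, noise=10.0, random_state=2)

inner = GridSearchCV(Ridge(), {"alpha": [0.1, 1.0, 10.0, 100.0]}, cv=5)
# The outer loop re-runs the entire selection procedure on each training
# split, so the reported score is not biased by the tuning itself.
outer_scores = cross_val_score(inner, X, y, cv=5)
print(f"nested-CV R^2: {outer_scores.mean():.3f} +/- {outer_scores.std():.3f}")
```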
In order to discover the relevant weights of a neural network, this paper proposes a novel method that learns a distinct squared penalty factor for each weight by casting the choice as a minimization problem over the cross-validation error. Experiments showed that the proposed method works well in discovering a polynomial-type law even from data containing irrelevant variables and a small amount of noise.
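Learning a separate penalty per weight is beyond a short snippet, but a much coarser, hypothetical analogue conveys the idea: tune a single shared weight-decay factor by minimizing the cross-validation error of a small network. Every setting here is an assumption.

```python
# Coarse analogue: one shared weight-decay factor chosen by minimizing CV error
# (the paper learns one penalty per weight; this sketch does not).
import numpy as np
from sklearn.datasets import make_regression
from sklearn.model_selection import cross_val_score
from sklearn.neural_network import MLPRegressor

X, y = make_regression(n_samples=200, n_features=5, noise=5.0, random_state=3)

def cv_error(alpha):
    net = MLPRegressor(hidden_layer_sizes=(10,), alpha=alpha,
                       max_iter=2000, random_state=3)
    return -cross_val_score(net, X, y, cv=3,
                            scoring="neg_mean_squared_error").mean()

alphas = np.logspace(-4, 1, 6)
best = min(alphas, key=cv_error)
print(f"weight-decay factor minimizing CV error: {best:g}")
```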
This paper presents a decision tree pruning method for the model clustering of HMM-based parametric speech synthesis by cross-validation (CV) under the minimum generation error (MGE) criterion. Decision-tree-based model clustering is an important component in the training process of an HMM-based speech synthesis system. Conventionally, the maximum likelihood (ML) criterion is employed to choose...
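The MGE criterion is specific to HMM speech synthesis, but the underlying pattern of choosing a pruning level by cross-validation is generic. As a stand-in, this sketch selects a decision tree's cost-complexity pruning strength by CV; the data and model are assumptions.

```python
# Generic stand-in: pick a decision tree's pruning level by cross-validation.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=400, random_state=4)

# Candidate pruning strengths from the full tree's cost-complexity path.
path = DecisionTreeClassifier(random_state=4).cost_complexity_pruning_path(X, y)
alphas = np.unique(path.ccp_alphas)

scores = [cross_val_score(DecisionTreeClassifier(ccp_alpha=a, random_state=4),
                          X, y, cv=5).mean() for a in alphas]
best_alpha = alphas[int(np.argmax(scores))]
print(f"CV-selected pruning alpha: {best_alpha:.5f}")
```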
Having constructed a data-based estimation rule, perhaps a logistic regression or a classification tree, the statistician would like to know its performance as a predictor of future cases. There are two main theories concerning prediction error: (1) penalty methods such as Cp, Akaike’s information criterion, and Stein’s unbiased risk estimate that depend on the covariance between data points an...
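The two theories can be put side by side numerically. This hedged sketch compares a covariance-penalty estimate (Mallows' Cp) with a 10-fold CV estimate of prediction error for ordinary least squares; the data and the noise-variance estimate are assumptions.

```python
# Sketch: covariance-penalty (Mallows' Cp) vs. cross-validation estimates
# of prediction error for ordinary least squares.
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import cross_val_score

X, y = make_regression(n_samples=200, n_features=10, noise=5.0, random_state=5)
n, d = X.shape

model = LinearRegression().fit(X, y)
rss = np.sum((y - model.predict(X)) ** 2)
sigma2 = rss / (n - d - 1)                 # unbiased noise-variance estimate
cp = (rss + 2 * (d + 1) * sigma2) / n      # Cp-style prediction-error estimate

cv_mse = -cross_val_score(LinearRegression(), X, y, cv=10,
                          scoring="neg_mean_squared_error").mean()
print(f"Cp estimate: {cp:.2f}, 10-fold CV estimate: {cv_mse:.2f}")
```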
This paper illustrates a novel approach to the estimation of the generalization error of decision tree classifiers. We situate the study of decision tree errors within consistency analysis theory, which proved that the Bayes error can be achieved only when the number of data samples falling into each leaf node goes to infinity. For the more challenging and practical case where the samp...
This module emphasizes what might be termed “the practice of safe statistics.” The discussion is split into three parts: (1) the importance of cross-validation for any statistical method that relies on an optimization process based on a given data set (or sample); (2) the need to exert control on overall error rates when carrying out multiple testing, even when that testing is done only implici...
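Point (2) is straightforward to illustrate. The sketch below (simulated null p-values, an assumption) shows how uncorrected testing at the 5% level produces false positives that a family-wise correction such as Holm's method suppresses.

```python
# Sketch: controlling the family-wise error rate across many tests.
import numpy as np
from statsmodels.stats.multitest import multipletests

rng = np.random.default_rng(6)
pvals = rng.uniform(size=50)            # 50 tests, all null by construction

raw_hits = (pvals < 0.05).sum()         # ~2-3 false positives expected
reject, _, _, _ = multipletests(pvals, alpha=0.05, method="holm")
print(f"uncorrected rejections: {raw_hits}, Holm-corrected: {reject.sum()}")
```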
Careful tuning of a regularization parameter is indispensable in many machine learning tasks because it has a significant impact on generalization performance. Nevertheless, the current practice of regularization parameter tuning is more of an art than a science; e.g., it is hard to tell how many grid points would be needed in cross-validation (CV) for obtaining a solution with sufficiently small ...
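The grid-resolution question can be seen directly. The hedged sketch below (Lasso on synthetic data; every setting is an assumption) re-runs CV model selection at several grid densities and reports how the chosen regularization parameter shifts.

```python
# Sketch: how grid resolution affects the CV-selected regularization parameter.
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import Lasso
from sklearn.model_selection import GridSearchCV

X, y = make_regression(n_samples=150, n_features=40, noise=10.0, random_state=7)

for n_grid in (5, 20, 80):
    grid = {"alpha": np.logspace(-3, 2, n_grid)}
    search = GridSearchCV(Lasso(max_iter=10000), grid, cv=5).fit(X, y)
    print(f"{n_grid:3d} grid points -> best alpha {search.best_params_['alpha']:.4f}")
```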