نتایج جستجو برای: stratified sampling

تعداد نتایج: 247726  

Journal: :CoRR 1998
Judith Hochberg Clint Scovel Timothy Thomas Sam Hall

This paper describes a method for asking statistical questions about a large text corpus. We exemplify the method by addressing the question, "What percentage of Federal Register documents are real documents, of possible interest to a text researcher or analyst?" We estimate an answer to this question by evaluating 200 documents selected from a corpus of 45,820 Federal Register documents. Strat...

2010
Martin Haugh

In these lecture notes we cover the more advanced variance reduction techniques of importance sampling and stratified sampling. Importance sampling is particularly suited for estimating the probabilities of rare events and, in conjunction with stratified sampling, often results in a sample variance that is several orders of magnitude smaller than the variance of the naive Monte-Carlo estimator....

Journal: :Computational Statistics & Data Analysis 2014
Caren Hasler Yves Tillé

Balanced sampling is a very efficient sampling design when the variable of interest is correlated to the auxiliary variables on which the sample is balanced. A procedure to select balanced samples in a stratified population has previously been proposed. Unfortunately, this procedure becomes very slow as the number of strata increases and it even fails to select samples for some large numbers of...

2016
Janne Pylkkönen Thomas Drugman Max Bisani

Producing large enough quantities of high-quality transcriptions for accurate and reliable evaluation of an automatic speech recognition (ASR) system can be costly. It is therefore desirable to minimize the manual transcription work for producing metrics with an agreed precision. In this paper we demonstrate how to improve ASR evaluation precision using stratified sampling. We show that by alte...

2010
Rajesh Singh Mukesh Kumar Manoj K. Chaudhary Cem Kadilar

In this article we have considered the problem of estimating the population mean   Y in the stratified random sampling using the information of an auxiliary variable x which is correlated with y and suggested improved exponential ratio estimators in the stratified random sampling. The mean square error (MSE) equations for the proposed estimators have been derived and it is shown that the prop...

2005
G. Leobacher

We provide a method for the generation of paths of Lévy processes which has many of the benefits that the Brownian bridge construction has for Brownian motion. We show how, using our method, one can apply stratified sampling and quasi-Monte Carlo methods to obtain better numerical schemes analog to the Brownian case. As a numerical example we consider the problem of pricing an asian option in t...

2004
Diego F. Nehab Philip Shilane

Point sampling is an important intermediate step for a variety of computer graphics applications, and specialized sampling strategies have been developed to satisfy the requirements of each problem. In this article, we present a technique to generate a stratified sampling of 3D models that is applicable across many domains. The algorithm voxelizes the model and selects one sample per voxel, res...

2010
Mohammed Al-Kateb Byung Suk Lee

Reservoir sampling is a well-known technique for random sampling over data streams. In many streaming applications, however, an input stream may be naturally heterogeneous, i.e., composed of substreams whose statistical properties may also vary considerably. For this class of applications, the conventional reservoir sampling technique does not guarantee a statistically sufficient number of tupl...

2013
Yeonkook J. Kim Yoonhwan Oh Sunghoon Park Sungzoon Cho Hayoung Park

OBJECTIVES To explore classification rules based on data mining methodologies which are to be used in defining strata in stratified sampling of healthcare providers with improved sampling efficiency. METHODS We performed k-means clustering to group providers with similar characteristics, then, constructed decision trees on cluster labels to generate stratification rules. We assessed the varia...

2005
Yanrong Li Raj P. Gopalan

It is well recognized that mining association rules in a very large database is usually time consuming due to the I/O overhead in scanning the disk resident database. As one of the techniques for reducing the I/O overhead, sampling for mining association rules has been actively investigated during the last few years. Each sampling method and algorithm proposed in the literature has its own meri...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید