On sampling error in genetic programming
نویسندگان
چکیده
Abstract The initial population in genetic programming (GP) should form a representative sample of all possible solutions (the search space). While large populations accurately approximate the distribution solutions, small tend to incorporate sampling error. This paper analyzes how size GP affects error and contributes answering question populations. First, we present probabilistic model expected number subtrees for initialized with full, grow, or ramped half-and-half. Second, based on our frequency model, that estimates given size. We validate models empirically show that, compared smaller sizes, recommended sizes largely reduce measured fitness values. Increasing even more, however, does not considerably Last, recommend some widely used benchmark problem instances result low A at initialization is necessary (but sufficient) reliable since lowering means overall random variations are reduced. Our results indicate severe GP, making obtain allows practitioners determine minimum so lower than threshold, confidence level.
منابع مشابه
automatic verification of authentication protocols using genetic programming
implicit and unobserved errors and vulnerabilities issues usually arise in cryptographic protocols and especially in authentication protocols. this may enable an attacker to make serious damages to the desired system, such as having the access to or changing secret documents, interfering in bank transactions, having access to users’ accounts, or may be having the control all over the syste...
15 صفحه اولSampling Methods in Genetic Programming for Classification with Unbalanced Data
This work investigates the use of sampling methods in Genetic Programming (GP) to improve the classification accuracy in binary classification problems in which the datasets have a class imbalance. Class imbalance occurs when there are more data instances in one class than the other. As a consequence of this imbalance, when overall classification rate is used as the fitness function, as in stan...
متن کاملSampling Issues of Tournament Selection in Genetic Programming
Tournament selection is one of the most commonly used parent selection schemes in Genetic Programming (GP). While it has a number of advantages over other selection schemes, it still has some issues that need to be thoroughly investigated. Two of the issues are assocated with the sampling process from the population into the tournament. The first one is the socalled “multi-sampled” issue, where...
متن کاملProgressive Sampling for Association Rules Based on Sampling Error Estimation
We explore in this paper a progressive sampling algorithm, called Sampling Error Estimation (SEE), which aims to identify an appropriate sample size for mining association rules. SEE has two advantages over previous works in the literature. First, SEE is highly efficient because an appropriate sample size can be determined without the need of executing association rules. Second, the identified ...
متن کاملTruncation Error on Wavelet Sampling Expansions
(see [9] and [10]). Throughout this work, we assume that the function satis®es the following conditions: (i) j xj cons: jB xj=jxj1"; where " 0 and jB xj is bounded and 1-periodic function on R. (ii) P n2Z n eÿin converges absolutely to a function that has no zeros on ÿ ; . It is known that conditions (i) and (ii) imply that fK x; n; n 2 Zg is a Riesz basis on V , with a unique biort...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Natural Computing
سال: 2021
ISSN: ['1572-9796', '1567-7818']
DOI: https://doi.org/10.1007/s11047-020-09828-w