Evolving simple and accurate symbolic regression models via asynchronous parallel computing
Abstract
In machine learning, reducing the complexity of a model can help to improve its computational efficiency and avoid overfitting. In genetic programming (GP), complexity reduction is often achieved by reducing the size of evolved expressions. However, previous studies have demonstrated that expression size reduction does not necessarily prevent overfitting. Therefore, this paper uses evaluation time (the computational time required to evaluate a GP model on data) as the estimate of model complexity. The evaluation time depends not only on the size of evolved expressions but also on their composition, thus acting as a more nuanced measure of model complexity than expression size alone. To discourage complexity, this study employs a novel method called asynchronous parallel GP (APGP) that introduces a race condition in the evolutionary process of GP; the race offers an advantage to simple solutions when their accuracy is competitive. To evaluate the proposed method, it is compared to standard GP (GP) and GP with bloat control (GP+BC) methods on six challenging symbolic regression problems. APGP produced models that are significantly more accurate (on 6/6 problems) than those produced by both GP and GP+BC. In terms of complexity control, APGP prevailed over GP but not over GP+BC; however, GP+BC produced simpler solutions at the cost of test-set accuracy. Moreover, APGP took a lower number of evaluations to meet a target training fitness in all tests. Our analysis also involved: (1) an ablation study that separated the proposed measure of complexity from the race condition in APGP, and (2) the study of an initialisation scheme that encourages functional diversity in the initial population, which improved the results for all the GP methods. These results question the overall benefits of bloat control and endorse the employment of evaluation time for controlling it.
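The race described in the abstract can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the authors' implementation: the function name `completion_order`, the virtual clock, and the fixed per-candidate evaluation times are all hypothetical, and the abstract does not specify how APGP schedules or selects individuals. The sketch shows only the ordering effect, namely that candidates that evaluate faster finish, and so can rejoin the population, sooner.

```python
import heapq


def completion_order(candidates, n_workers):
    """Simulate asynchronous evaluation on n_workers using a virtual clock.

    candidates: list of (name, eval_time) pairs, dispatched in list order.
    Returns the names in the order their evaluations finish.
    All names and timings are illustrative assumptions.
    """
    pending = list(candidates)[::-1]  # pop() yields the next candidate to dispatch
    running = []                      # min-heap of (finish_time, dispatch_index, name)
    finished = []
    clock = 0.0
    dispatched = 0
    while pending or running:
        # Hand work to every idle worker at the current virtual time.
        while pending and len(running) < n_workers:
            name, eval_time = pending.pop()
            heapq.heappush(running, (clock + eval_time, dispatched, name))
            dispatched += 1
        # Advance the clock to the earliest finishing evaluation.
        clock, _, name = heapq.heappop(running)
        finished.append(name)
    return finished


if __name__ == "__main__":
    models = [("big", 5.0), ("small", 1.0), ("mid", 2.0)]
    print(completion_order(models, n_workers=2))
```

With a single worker the candidates finish in dispatch order, so no race occurs; with two or more workers, cheaper candidates can overtake expensive ones that were dispatched earlier.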
Similar resources
Symbolic and Asynchronous Semantics via Normalized Coalgebras
The operational semantics of interactive systems is usually described by labeled transition systems. Abstract semantics (that is defined in terms of bisimilarity) is characterized by the final morphism in some category of coalgebras. Since the behaviour of interactive systems is for many reasons infinite, symbolic semantics were introduced as a means to define smaller, possibly finite, transitio...
Prioritization in Parallel Symbolic Computing
It is argued that scheduling is an important determinant of performance for many parallel symbolic computations, over and above the issues of dynamic load balancing and grainsize control. We propose associating unbounded levels of priorities with tasks and messages as the mechanism of choice for specifying scheduling strategies. We demonstrate how priorities can be used in parallelizing comput...
Evolving Software Effort Estimation Models Using Multigene Symbolic Regression Genetic Programming
Software has played an essential role in engineering, economic development, stock market growth and military applications. A mature software industry counts on highly predictive software effort estimation models. Correct estimation of software effort leads to correct estimation of budget and development time. It also allows companies to develop an appropriate time plan for marketing campaigns. Now a da...
Asynchronous and implicitly parallel evolutionary computation models
This paper presents the design and the application of asynchronous models of parallel evolutionary algorithms. An overview of the existing parallel evolutionary algorithm (PEA) models and available implementations is given. We present new PEA models in the form of asynchronous algorithms and implicit parallelization, as well as experimental data on their efficiency. The paper also d...
Revealing Complex Ecological Dynamics via Symbolic Regression
Complex ecosystems, from food webs to our gut microbiota, are essential to human life. Understanding the dynamics of those ecosystems can help us better maintain or control them. Yet, reverse-engineering complex ecosystems (i.e., extracting their dynamic models) directly from measured temporal data has not been very successful so far. Here we propose to close this gap via symbolic regression. W...
Journal
Journal title: Applied Soft Computing
Year: 2021
ISSN: 1568-4946, 1872-9681
DOI: https://doi.org/10.1016/j.asoc.2021.107198