Search results for: critic
Number of results: 2831
This section concerns neural networks which are hybrid either in terms of structure or in terms of training algorithms. The counterpropagation network is one that incorporates structural characteristics of the Kohonen and Grossberg networks and it is trained by composite supervised–unsupervised methods. The adaptive critic concept concerns neural network implementations of reinforcement learnin...
We present an empirical investigation of a recent class of Generative Adversarial Networks (GANs) using Integral Probability Metrics (IPM) and their performance for semi-supervised learning. IPM-based GANs like Wasserstein GAN, Fisher GAN and Sobolev GAN have desirable properties in terms of theoretical understanding, training stability, and a meaningful loss. In this work we investigate how th...
Catastrophic forgetting has a serious impact in reinforcement learning, as the data distribution is generally sparse and non-stationary over time. The purpose of this study is to investigate whether pseudorehearsal can increase performance of an actor-critic agent with neural-network based policy selection and function approximation in a pole balancing task and compare different pseudorehearsa...
This paper presents a new approach to modifying the CRiteria Importance Through Intercriteria Correlation (CRITIC) method using fuzzy rough numbers. In the modified CRITIC method (CRITIC-M), the normalization procedure for the home matrix elements and the aggregation function for information processing in the normalized matrix were improved. By introducing a new way of normalization, smaller deviations between the normalized elements are obtained, which...
Prediction-error signals consistent with formal models of "reinforcement learning" (RL) have repeatedly been found within dopaminergic nuclei of the midbrain and dopaminoceptive areas of the striatum. However, the precise form of the RL algorithms implemented in the human brain is not yet well determined. Here, we created a novel paradigm optimized to dissociate the subtypes of reward-predictio...