نتایج جستجو برای: critic

تعداد نتایج: 2831  

1996
Cris Koutsougeras

This section concerns neural networks which are hybrid either in terms of structure or in terms of training algorithms. The counterpropagation network is one that incorporates structural characteristics of the Kohonen and Grossberg networks and it is trained by composite supervised–unsupervised methods. The adaptive critic concept concerns neural network implementations of reinforcement learnin...

Journal: :CoRR 2017
Tom Sercu Youssef Mroueh

We present an empirical investigation of a recent class of Generative Adversarial Networks (GANs) using Integral Probability Metrics (IPM) and their performance for semi-supervised learning. IPM-based GANs like Wasserstein GAN, Fisher GAN and Sobolev GAN have desirable properties in terms of theoretical understanding, training stability, and a meaningful loss. In this work we investigate how th...

Journal: :Journal of the African Literature Association 2021

Journal: :CoRR 2017
Vladimir Marochko Leonard Johard Manuel Mazzara

—Catastrophic forgetting has a serious impact in reinforcement learning, as the data distribution is generally sparse and non-stationary over time. The purpose of this study is to investigate whether pseudorehearsal can increase performance of an actor-critic agent with neural-network based policy selection and function approximation in a pole balancing task and compare different pseudorehearsa...

Journal: :Decision Making 2022

This paper presents a new approach in the modification of CRiteria Importance Through Intercriteria Correlation (CRITIC) method using fuzzy rough numbers. In modified CRITIC (CRITIC-M), normalization procedure home matrix elements was improved and aggregation function for information processing normalized improved. By introducing way normalization, smaller deviations between are obtained, which...

2017
Jaron T. Colas Wolfgang M. Pauli Tobias Larsen J. Michael Tyszka John P. O'Doherty

Prediction-error signals consistent with formal models of "reinforcement learning" (RL) have repeatedly been found within dopaminergic nuclei of the midbrain and dopaminoceptive areas of the striatum. However, the precise form of the RL algorithms implemented in the human brain is not yet well determined. Here, we created a novel paradigm optimized to dissociate the subtypes of reward-predictio...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید