نتایج جستجو برای: approximately innercsigma dynamics

تعداد نتایج: 666102  

Journal: :Abstract and Applied Analysis 2012

Journal: :Proceedings of the American Mathematical Society 1957

Journal: :Journal of Mathematical Analysis and Applications 2006

Journal: :Demonstratio Mathematica 2016

Journal: :Tohoku Mathematical Journal 1974

Journal: :CoRR 2015
Bradly C. Stadie Sergey Levine Pieter Abbeel

Achieving efficient and scalable exploration in complex domains poses a major challenge in reinforcement learning. While Bayesian and PAC-MDP approaches to the exploration problem offer strong formal guarantees, they are often impractical in higher dimensions due to their reliance on enumerating the state-action space. Hence, exploration in complex domains is often performed with simple epsilon...

Journal: :Journal of Mathematical Analysis and Applications 2011

Journal: :Annual Review of Economics 2019

Journal: :Colloquium Mathematicum 1987

Journal: :Pacific Journal of Mathematics 1977

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید