q model kjartansson

نتایج جستجو برای: q model kjartansson

تعداد نتایج: 2202283 فیلتر نتایج به سال:

Learning Partial Models for Hierarchical Planning

2009

Neville Mehta Prasad Tadepalli

AI planning research typically assumes that complete action models are given. On the other hand, popular approaches in reinforcement learning such as Q-learning completely eschew models and planning. Neither of these approaches is satisfactory to achieve robust human-level AI that includes planning and learning in rich structured domains. In this paper, we introduce the idea of planning with pa...

متن کامل

Designing an Incentive Contract Menu for Sustaining the Electricity Market

2015

Ying Yu Tongdan Jin Chunjie Zhong Ying-Yi Hong

Abstract: This paper designs an incentive contract menu to achieve long-term stability for electricity prices in a day-ahead electricity market. A bi-level Stackelberg game model is proposed to search for the optimal incentive mechanism under a one-leader and multi-followers gaming framework. A multi-agent simulation platform was developed to investigate the effectiveness of the incentive mecha...

متن کامل

Q-learning in Two-Player Two-Action Games

2009

Monica Babes Michael Wunder Michael Littman

Q-learning is a simple, powerful algorithm for behavior learning. It was derived in the context of single agent decision making in Markov decision process environments, but its applicability is much broader— in experiments in multiagent environments, Q-learning has also performed well. Our preliminary analysis finds that Q-learning’s indirect control of behavior via estimates of value contribut...

متن کامل

Modular Q-learning based multi-agent cooperation for robot soccer

Journal: :Robotics and Autonomous Systems 2001

Kui-Hong Park Yong-Jae Kim Jong-Hwan Kim

In a multi-agent system, action selection is important for the cooperation and coordination among agents. As the environment is dynamic and complex, modular Q-learning, which is one of the reinforcement learning schemes, is employed in assigning a proper action to an agent in the multi-agent system. The architecture of modular Q-learning consists of learning modules and a mediator module. The m...

متن کامل

Comments on Switched Reluctance Machine Mathematical Model

2010

Liviu SOMEŞAN Emil PĂDURARIU Loránd SZABÓ Mircea RUBA Ioan-Adrian VIOREL

In the paper the main analytical models of the switched reluctance (SR) machine are presented, based on geometry data, magnetic equivalent circuits and finite element (FEM) analysis results. In each case a representative example is given and finally the advantages and the weak points of each type of model are evinced.

متن کامل

Predicting and Preventing Coordination Problems in Cooperative Q-learning Systems

2007

Nancy Fulda Dan Ventura

We present a conceptual framework for creating Qlearning-based algorithms that converge to optimal equilibria in cooperative multiagent settings. This framework includes a set of conditions that are sufficient to guarantee optimal system performance. We demonstrate the efficacy of the framework by using it to analyze several well-known multi-agent learning algorithms and conclude by employing i...

متن کامل

Learning to Explore with Meta-Policy Gradient

2018

Tianbing Xu Qiang Liu Liang Zhao Jian Peng

The performance of off-policy learning, including deep Q-learning and deep deterministic policy gradient (DDPG), critically depends on the choice of the exploration policy. Existing exploration methods are mostly based on adding noise to the on-going actor policy and can only explore local regions close to what the actor policy dictates. In this work, we develop a simple meta-policy gradient al...

متن کامل

Peramalan Produk Domestik Regional Bruto (PDRB) Provinsi Bali Triwulanan (Q-to-Q) Tahun Dasar 2010 dengan Model Arima

Journal: :Jurnal Ekonomi Kuantitatif Terapan 2019

متن کامل

Anisotropic Strange Star Model Beyond Standard Maximum Mass Limit by Gravitational Decoupling in f(Q)$f(Q)$ Gravity

Journal: :Fortschritte der Physik 2022

The current theoretical development identified as the gravitational decoupling via Complete Geometric Deformation (CGD) method that has been introduced to explore nonmetricity $Q$ effects in relativistic astrophysics. In present work, we have investigated gravitationally decoupled anisotropic solutions for strange star framework of $f(Q)$ gravity by utilizing CGD technique. To do this, started ...

متن کامل

Integrable structure of melting crystal model with two q-parameters

Journal: :Journal of Geometry and Physics 2009

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید