نتایج جستجو برای: weighting agent

تعداد نتایج: 273112  

Journal: :IOP Conference Series: Materials Science and Engineering 2018

2005
Roberto S. Legaspi Raymund Sison Ken-ichi Fukui Masayuki Numao

This paper discusses a cluster knowledge-based predictive modeling framework actualized in a learning agent that leverages on the capability of a clustering algorithm to discover in logged tutorial interactions unknown structures that may exhibit predictive characteristics. The learned cluster models are described along learner-system interaction attributes, i.e., in terms of the learner’s know...

2016
Vadim Bulitko Alexander Sampley

Real-time heuristic search models an autonomous agent solving a search task. The agent operates in a real-time setting by interleaving local planning, learning and move execution. In this paper we propose a simple parametric algorithm that combines weighting with learning from multiple neighbors. Doing so breaks heuristic admissibility but allows the agent to escape heuristic depressions more q...

Journal: :J. Artif. Intell. Res. 2011
Joel Veness Kee Siong Ng Marcus Hutter David Silver

This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. Our approach is based on a direct approximation of AIXI, a Bayesian optimality notion for general reinforcement learning agents. Previously, it has been unclear whether the theory of AIXI could motivate the design of practical algorithms. We answer this hitherto open question in the af...

Journal: :Proceedings of the ... AAAI Conference on Artificial Intelligence 2022

Off-policy sampling and experience replay are key for improving sample efficiency scaling model-free temporal difference learning methods. When combined with function approximation, such as neural networks, this combination is known the deadly triad potentially unstable. Recently, it has been shown that stability good performance at scale can be achieved by combining emphatic weightings multi-s...

Journal: :Journal of Health Politics, Policy and Law 2005

Journal: :J. UCS 2009
Duy Hoang Pham Guido Governatori Subhasis Thakur

Argumentation games have been proved to be a robust and flexible tool to resolve conflicts among agents. An agent can propose its explanation and its goal known as a claim, which can be refuted by other agents. The situation is more complicated when there are more than two agents playing the game. We propose a weighting mechanism for competing premises to tackle with conflicts from multiple age...

2008
Duy Hoang Pham Subhasis Thakur Guido Governatori

Argumentation games have been proved to be a robust and flexible tool to resolve conflicts among agents. An agent can propose its explanation and its goal known as a claim, which can be refuted by other agents. The situation is more complicated when there are more than two agents playing the game. We propose a weighting mechanism for competing premises to tackle with conflicts from multiple age...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید