Some Models for Autonomous Agents' Action Selection in Complex Partially Observable Environments

نویسنده

  • Predrag T. Tosic
چکیده

We study the resource-bounded autonomous agents acting in complex multi-agent and multi-task environments. In particular, we investigate various models of the agent action selection in such environments. Designing such autonomous decision making agents for non-episodic and partially observable dynamic environments is particularly challenging, due to a high-level demand such environments pose in terms of agent’s necessary capabilities. We make an early attempt in modeling and designing agents for largescale multi-agent systems and complex environments, where individual agent’s behaviors are sufficiently simple to be scalable and practical, and where agents’ coordination and self-organization capabilities can still make agents (both individually and as ensembles) effective with respect to accomplishing their goals. The emphasis in this paper is on models of an agent’s local knowledge based individual behaviors. We propose several simple mathematical models for an agent’s local knowledge-based action selection. We illustrate the general ideas about bounded-resource autonomous agents acting in multi-agent, multi-task dynamic environments, and the proposed generic models of agents’ autonomous local-knowledge based action selection in such environments, with a concrete application example: modeling and simulation of a collection of autonomous unmanned aerial vehicles (UAVs) on a multi-task mission.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Self-Made Agent Based on Action-Selection

Some agents have to face multiple objectives simultaneously. In such cases, and considering partially observable environments, classical Reinforcement Learning (RL) is prone to fall in pretty low local optima, only learning straightforward behaviors. We present here a method that tries to identify and learn independent “basic” behaviors solving separate tasks the agent has to face. Using a comb...

متن کامل

Un Mécanisme Constructiviste d'Apprentissage Automatique d'Anticipations pour des Agents Artificiels Situés

(17) This research is characterized, first, by a theoretical discussion on the concept of autonomous agent, based on elements taken from the Situated AI and the Affective AI paradigms. Secondly, this thesis presents the problem of learning world models, providing a bibliographic review regarding some related works. From these discussions, the CAES architecture and the CALM mechanism are present...

متن کامل

Negotiated Learning for Smart Grid Agents: Entity Selection based on Dynamic Partially Observable Features

An attractive approach to managing electricity demand in the Smart Grid relies on real-time pricing (RTP) tariffs, where customers are incentivized to quickly adapt to changes in the cost of supply. However, choosing amongst competitive RTP tariffs is difficult when tariff prices change rapidly. The problem is further complicated when we assume that the price changes for a tariff are published ...

متن کامل

Coordination in large scale multi-agent systems for complex environments

Multi-agent systems allow for robust and flexible solutions, but require complex coordination to operate efficiently. Efficient task allocation is difficult even when agents and tasks are known and unchanging, yet in many circumstances agents might fail or the environment is only partially observable. Our work explores models for large multi-agent systems in partially observable environments to...

متن کامل

Goal-Driven Autonomy for Responding to Unexpected Events in Strategy Simulations

To operate autonomously in complex environments, an agent must monitor its environment and determine how to respond to new situations. To be considered intelligent, an agent should select actions in pursuit of its goals, and adapt accordingly when its goals need revision. However, most agents assume that their goals are given to them; they cannot recognize when their goals should change. Thus, ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010