Multi-Agent Architectures that Facilitate Apprenticeship Learning for Real-Time Decision Making: Minerva and Gerona
نویسندگان
چکیده
This paper describes the Minerva and Gerona agent architectures, which have been designed to facilitate apprenticeship learning in real-time decision making domains. Apprenticeship is a form of learning by watching, which is particularly useful in multi-agent knowledge-intensive domains. In this form of situated learning, human and synthetic agents refine their knowledge in the process of critiquing the observed actions of each other, and resolving underlying knowledge differences. A major design feature of Minerva and Gerona is their method of knowledge representation of domain and control knowledge, both static and dynamic. Their representations facilitates reasoning over domain and control knowledge for the purpose of apprenticeship learning. This ability to reason over domain and control knowledge plays a central role in solving the global and local credit assignment problems that confront an apprenticeship learner.
منابع مشابه
An Unsupervised Learning Method for an Attacker Agent in Robot Soccer Competitions Based on the Kohonen Neural Network
RoboCup competition as a great test-bed, has turned to a worldwide popular domains in recent years. The main object of such competitions is to deal with complex behavior of systems whichconsist of multiple autonomous agents. The rich experience of human soccer player can be used as a valuable reference for a robot soccer player. However, because of the differences between real and simulated soc...
متن کاملLearning to Envision: An Intelligent Agent for Ship Damage Control
This paper describes an Intelligent Agent for real-time crisis decision making, called Minerva-DCA, which improves its performance by compiling the results of a first-principles simulator. The agent is blackboard based and uses envisionment to schedule its actions. This is necessary because the complexity and chaos associated with ship damage control don’t allow the range of necessary behaviors...
متن کاملUtilizing Generalized Learning Automata for Finding Optimal Policies in MMDPs
Multi agent Markov decision processes (MMDPs), as the generalization of Markov decision processes to the multi agent case, have long been used for modeling multi agent system and are used as a suitable framework for Multi agent Reinforcement Learning. In this paper, a generalized learning automata based algorithm for finding optimal policies in MMDP is proposed. In the proposed algorithm, MMDP ...
متن کاملResearch Summary: Communication-Sensitive Decision Making in Multi-Agent, Real-Time Environments
In a recently started project, we are developing techniques for intelligent agent control and coordination in a dynamic, real-time, multi-agent setting. The application domain, consisting of teams of autonomous air vehicles (AAVs), is characterized by dynamic environments, real-time response requirements, limited information, and unreliable, low-bandwidth communications. We have developed an in...
متن کاملA DSS-Based Dynamic Programming for Finding Optimal Markets Using Neural Networks and Pricing
One of the substantial challenges in marketing efforts is determining optimal markets, specifically in market segmentation. The problem is more controversial in electronic commerce and electronic marketing. Consumer behaviour is influenced by different factors and thus varies in different time periods. These dynamic impacts lead to the uncertain behaviour of consumers and therefore harden the t...
متن کامل