نتایج جستجو برای: distributed reinforcement learning

تعداد نتایج: 868955  

Journal: :IEEE/CAA Journal of Automatica Sinica 2023

Dear Editor, Underwater distributed antenna systems (DAS) are stationary infrastructures consisting of multiple geographically elements (DAEs) which interconnected through high-rate backbone networks [1]. Compared to centralized systems, the DAS could provide a larger coverage area and higher throughput for underwater acoustic (UWA) transmissions. In this work, exploiting low sound speed in wat...

Journal: :Knowledge Based Systems 2021

Energy and thermal management is a crucial element in Formula-E race strategy development. In this study, the race-level development formulated into Markov decision process (MDP) problem featuring hybrid-type action space. Deep Deterministic Policy Gradient (DDPG) reinforcement learning implemented under distributed architecture Ape-X integrated with prioritized experience replay reward shaping...

Journal: :Machines 2022

A common situation arising in flow shops is that the job processing order must be same on each machine; this referred to as a permutation shop scheduling problem (PFSSP). Although many algorithms have been designed solve PFSSPs, machine availability typically ignored. Healthy conditions are essential for production process, which can ensure productivity and quality; thus, deteriorating effects ...

2002
Min Chee Choy

This paper presents the application of fuzzy-neuroevolutionary hybrid system with online reinforcement learning for intelligent road traffic management and control. Taking a step away from the conventional traffic control system, the hybrid system presents different methodologies in knowledge acquisition, decisionmaking, learning and goal formulation with the use of a three-layered hierarchical...

Journal: :Transactions of the Institute of Systems, Control and Information Engineers 2013

Journal: :The Astronomical Journal 2023

Abstract Time-domain astronomy is an active research area now, which requires frequent observations of the whole sky to capture celestial objects with temporal variations. In optical band, several telescopes in different locations could form a distributed telescope array images continuously. However, there are millions observe each night, and only limited be used for observation. Besides, obser...

Journal: :Intelligent Automation and Soft Computing 2022

In meteorological and electric power Internet of Things scenarios, in order to extend the service life relevant facilities reduce cost emergency repair, intelligent inspection swarm is introduced cooperate with monitoring tasks, which collect process current scene data through a variety sensors cameras, complete tasks such as handling fault inspection. Due limitation computing resources battery...

Journal: :CoRR 2015
Brian Swenson Soummya Kar João M. F. Xavier

The paper studies the highly prototypical Fictitious Play (FP) algorithm, as well as a broad class of learning processes based on best-response dynamics, that we refer to as FP-type algorithms. A well-known shortcoming of FP is that, while players may learn an equilibrium strategy in some abstract sense, there are no guarantees that the period-by-period strategies generated by the algorithm act...

1996
Thomas Haynes Sandip Sen

Groups of agents following fixed behavioral rules can be limited in performance and efficiency. Adaptability and flexibility are key components of intelligent behavior which allow agent groups to improve performance in a given domain using prior problem solving experience. We motivate the usefulness of individual learning by group members in the context of overall group behavior. In particular,...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید