Improving quasi-optimal inventory and transportation policies using adaptive critic based approximate dynamic programming
نویسندگان
چکیده
We demonstrate the possibility of optimal control of physical inventory systems in a nonstationary fitness terrain, based on the combined application of evolutionary search and adaptive critic terrain following. We show that adaptive critic based approximate dynamic programming techniques based on plant-controller Jacobeans can be used with systems characterized by discrete valued states and controls. Improvements upon a quasi-optimal policy found using a genetic algorithm in a high-penalty environment, average 66% under conditions both of stationary and non-stationary demand.
منابع مشابه
Adaptive critic based approximate dynamic programming: A new tool for smart manufacturing
This work supported in part by the National Science Foundation under grant ECS-9904378. Abstract Adaptive critic based approximate dynamic programming techniques are gradient based methods for finding optimal policies for multi-stage decision processes. We believe adaptive critic methods are now developed to the point that they can be applied to the full spectrum of decision and control problem...
متن کاملA New Hybrid Critic-training Method for Approximate Dynamic Programming
A variety of methods for developing quasi-optimal intelligent control systems using reinforcement learning techniques based on adaptive critics have appeared in recent years. This paper reviews the family of approximate dynamic programming techniques based on adaptive critic methods and introduces a new hybrid critic training method.
متن کاملIntelligent supply chain management using adaptive critic learning
A set of neural networks is employed to develop control policies that are better than fixed, theoretically optimal policies, when applied to a combined physical inventory and distribution system in a nonstationary demand environment. Specifically, we show that model-based adaptive critic approximate dynamic programming techniques can be used with systems characterized by discrete valued states ...
متن کاملDedicated to the Marys in My Life Ellen
An abstract of the dissertation of Stephen Shervais for the Doctor of Philosophy in Systems Science presented October 6, 2000. Title: Adaptive Critic Design of Control Policies For A Multi-Echelon Inventory System A common problem in business is the determination of inventory and transportation policies for a physical distribution system within a changing business environment. This dissertation...
متن کاملAn Introduction to Adaptive Critic Control: A Paradigm Based on Approximate Dynamic Programming
Adaptive critic control is an advanced control technology developed for nonlinear dynamical systems in recent years. It is based on the idea of approximate dynamic programming. Dynamic programming was introduced by Bellman in the 1950’s for solving optimal control problems of nonlinear dynamical systems. Due to its high computational complexity, applications of dynamic programming have been lim...
متن کامل