Reinforcement learning for multi-item retrieval in the puzzle-based storage system
نویسندگان
چکیده
• A deep reinforcement learning algorithm is proposed for multi-item retrieval in the PBS system. compact integer programming model built to evaluate solution quality. conversion handle simultaneous movement. decomposition framework designed large-scale instances. The effect of several factors investigated deduce managerial insights. Nowadays, fast delivery services have created need high-density warehouses. puzzle-based storage system a practical way enhance density, however, facing difficulties process. In this work, algorithm, specifically Double&Dueling Deep Q Network, developed solve problem with general settings, where multiple desired items, escorts, and I/O points are placed randomly. Additionally, we propose Extensive numerical experiments demonstrate that approach can yield high-quality solutions outperforms three related state-of-the-art heuristic algorithms. Furthermore, movement instances respectively, thus improving applicability
منابع مشابه
Reducing Retrieval Time in Automated Storage and Retrieval System with a Gravitational Conveyor Based on Multi-Agent Systems
The main objective of this study is to reduce the retrieval time of a list of products by choosing the best combination of storage and retrieval rules at any time. This is why we start by implementing some storage rules in an Automated Storage/Retrieval System (Automated Storage and Retrieval System: AS/RS) fitted with a gravity conveyor while some of these rules are dedicated to storage and ot...
متن کاملAn Integrated Model for Storage Location Assignment and Storage/Retrieval Scheduling in AS/RS system
An integrated optimization framework, including location assignment under grouping class-based storage policy and schedule of dual shuttle cranes, is offered by presenting a new optimization programming model. The objective functions, which are considered at this level, are the minimization of total costs and energy consumption. Scheduling of dual shuttle cranes among specified locations, which...
متن کاملRetrieval–travel-time model for free-fall-flow-rack automated storage and retrieval system
Automated storage and retrieval systems (AS/RSs) are material handling systems that are frequently used in manufacturing and distribution centers. The modelling of the retrieval–travel time of an AS/RS (expected product delivery time) is practically important, because it allows us to evaluate and improve the system throughput. The free-fall-flow-rack AS/RS has emerged as a new technology for dr...
متن کاملHypervolume-Based Multi-Objective Reinforcement Learning
Indicator-based evolutionary algorithms are amongst the best performing methods for solving multi-objective optimization (MOO) problems. In reinforcement learning (RL), introducing a quality indicator in an algorithm’s decision logic was not attempted before. In this paper, we propose a novel on-line multi-objective reinforcement learning (MORL) algorithm that uses the hypervolume indicator as ...
متن کاملAn Online Q-learning Based Multi-Agent LFC for a Multi-Area Multi-Source Power System Including Distributed Energy Resources
This paper presents an online two-stage Q-learning based multi-agent (MA) controller for load frequency control (LFC) in an interconnected multi-area multi-source power system integrated with distributed energy resources (DERs). The proposed control strategy consists of two stages. The first stage is employed a PID controller which its parameters are designed using sine cosine optimization (SCO...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: European Journal of Operational Research
سال: 2023
ISSN: ['1872-6860', '0377-2217']
DOI: https://doi.org/10.1016/j.ejor.2022.03.042