Scheduling is the mathematical problem of allocating tasks to resources subject to certain constraints. The goal is to achieve the best possible schedule quality with respect to a quality metric such as the makespan. Typical scheduling problems, including the classic Job Shop Scheduling Problem (JSP or JSSP), are NP-hard, meaning that optimal solvers become infeasible for large problem instances. Instead, heuristics are frequently used to find suboptimal solutions in polynomial time, especially in real-world applications. Recently, Deep Reinforcement Learning (DRL) has also been applied to planning problems like the JSP. In DRL, agents learn solution strategies for specific problem classes through trial and error. In this paper, we explore the connection between known heuristics and DRL: heuristics always rely on features that can be extracted from the problem at hand with low computational effort. We show that DRL agents whose observation is limited to the underlying features of well-known heuristics learn the behaviour of the higher-quality heuristics from scratch, while they do not learn the behaviour of lower-quality heuristics that would also be possible learning outcomes given the same features as observation. Additionally, we motivate the use of DRL as a metaheuristic generator by training with the features of multiple basic heuristics. We present promising results indicating that this learned metaheuristic finds better schedules in terms of makespan than any single simple heuristic, while requiring only simple computations in the time-critical solution phase and thus remaining faster than optimal solvers.
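As a minimal sketch of such a feature-based observation, the snippet below builds an observation vector from the low-cost features underlying two classic priority rules, SPT (shortest processing time) and MWKR (most work remaining); the job representation and function names are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def heuristic_features(jobs):
    """Build an observation vector from cheap heuristic features.

    `jobs` is an assumed list of dicts describing the dispatchable jobs,
    e.g. {"next_proc_time": 4.0, "remaining_work": 17.0}.
    """
    obs = []
    for job in jobs:
        obs.append(job["next_proc_time"])   # SPT feature: duration of the next operation
        obs.append(job["remaining_work"])   # MWKR feature: total remaining processing time
    return np.asarray(obs, dtype=np.float32)

# Example: two jobs waiting to be dispatched on the next free machine.
jobs = [
    {"next_proc_time": 4.0, "remaining_work": 17.0},
    {"next_proc_time": 2.0, "remaining_work": 25.0},
]
print(heuristic_features(jobs))  # -> [ 4. 17.  2. 25.]
```

Restricting the agent's observation to such vectors (rather than the full problem state) is what allows a direct comparison between the learned policy and the corresponding dispatching rules.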