Kaven, Lea; Huke, Philipp; Göppert, Amon; Schmitt, … - In: Journal of Intelligent Manufacturing 35 (2024) 8, pp. 3917-3936
policy optimization and consists of a decoder and encoder, allowing for various-sized system state descriptions. A simulation … scheduling in line-less mobile assembly systems. The proposed multi agent deep reinforcement learning algorithm uses proximal …