Reinforcement Learning for Container Terminal

 

Overview

Company Name / Department ICT Netherlands
Contact Person Jiska Gunthardt
Location Barendrecht
Study programme(s) Operations Management & Logistics
Community ESCF
Start Date February 2023
Housing arranged by company No

Compensation 1000 EUR per month

Company Description

ICT Group is a leading European provider of industrial technology solutions. Our 1,900 dedicated technical professionals are ready to provide our clients with high-quality services such as recruitment and staffing, training, project-based solutions, consultancy and managed services, as well as in-house developed cloud-based products.

With a track record of over 40 years, ICT Group has both extensive multi-domain expertise and in-depth industry knowledge. We deliver proven technology solutions and are uniquely positioned to help our clients make their business processes more efficient, flexible, simple, secure and – as a result – more sustainable.

Project Description

In this project, you will implement a policy gradient method, for instance Proximal Policy Optimization (PPO) or another variant. Policy gradient methods, and reinforcement learning methods in general, are notoriously sensitive to preprocessing choices and parameter tuning, which makes these algorithms difficult to implement.

Goals of the Project

Container terminals are a vital part of the supply chain. Many optimization problems arise on a container terminal; this project will focus on the stacking problem.

Deliverables

An implementation of a policy gradient method, for instance Proximal Policy Optimization (PPO) or another variant. Next to that, you will perform research on enriching these methods with our extensive domain knowledge, thereby improving the performance of these reinforcement learning algorithms. The project can be implemented in, for instance, Python, Java or C++.
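
To give an impression of what the core of such a method looks like, below is a minimal sketch of the clipped surrogate objective used by PPO, together with one common preprocessing choice (advantage normalization). It is an illustration only and not a prescribed design: the function names, the `clip_eps` default of 0.2 and the use of plain Python lists are assumptions made for this example, and a full implementation would also need a policy network, an environment model of the terminal and an optimizer.

```python
import math


def normalize_advantages(advantages, eps=1e-8):
    """Rescale advantages to zero mean and unit variance.

    Hypothetical helper: one example of a preprocessing choice that
    policy gradient methods tend to be sensitive to.
    """
    mean = sum(advantages) / len(advantages)
    var = sum((a - mean) ** 2 for a in advantages) / len(advantages)
    return [(a - mean) / (math.sqrt(var) + eps) for a in advantages]


def ppo_clipped_objective(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Mean PPO clipped surrogate objective (a quantity to maximize).

    new_log_probs: log-probabilities of the taken actions under the
                   policy being updated.
    old_log_probs: log-probabilities of the same actions under the
                   policy that collected the data.
    advantages:    advantage estimates for those actions.
    clip_eps:      clipping range (assumed default, a tunable parameter).
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_log_probs, old_log_probs, advantages):
        # Probability ratio between the updated and the old policy.
        ratio = math.exp(new_lp - old_lp)
        # Keep the ratio within [1 - clip_eps, 1 + clip_eps].
        clipped_ratio = min(max(ratio, 1.0 - clip_eps), 1.0 + clip_eps)
        # Taking the minimum gives a pessimistic estimate of the improvement,
        # which discourages overly large policy updates.
        total += min(ratio * adv, clipped_ratio * adv)
    return total / len(advantages)
```

In a stacking setting, an action could for example be the choice of stack for an incoming container, with the advantage reflecting how many reshuffles that choice avoids later on. This mapping is an assumption for illustration; defining it well is part of the research.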

Essential Student Knowledge

AI, Reinforcement learning, PGM

More information: escf@tue.nl