Publicatie

Learning to communicate with reinforcement learning for an adaptive traffic control system

Boekbijdrage - Boekhoofdstuk Conferentiebijdrage

Korte inhoud:Recent work in multi-agent reinforcement learning has investigated inter agent communication which is learned simultaneously with the action policy in order to improve the team reward. In this paper, we investigate independent Q-learning (IQL) without communication and differentiable inter-agent learning (DIAL) with learned communication on an adaptive traffic control system (ATCS). In real world ATCS, it is impossible to present the full state of the environment to every agent so in our simulation, the individual agents will only have a limited observation of the full state of the environment. The ATCS will be simulated using the Simulation of Urban MObility (SUMO) traffic simulator in which two connected intersections are simulated. Every intersection is controlled by an agent which has the ability to change the direction of the traffic flow. Our results show that a DIAL agent outperforms an independent Q-learner on both training time and on maximum achieved reward as it is able to share relevant information with the other agents.

Boek: Advances on P2P, Parallel, Grid, Cloud and Internet Computing : proceedings of the 16th International Conference on P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC-2021)

Pagina's: 207 - 216

ISBN:978-3-030-89899-1

Jaar van publicatie:2022

Trefwoorden:Mass communications

WoS Id: 000722277600021
DOI: https://doi.org/10.1007/978-3-030-89899-1_21
Handle: https://hdl.handle.net/10067/1846900151162165141

Authors from:Higher Education

Toegankelijkheid:Closed

Zie ook: Learning to Communicate with Reinforcement Learning for an Adaptive Traffic Control System

Publicatie

Learning to communicate with reinforcement learning for an adaptive traffic control system

Boekbijdrage - Boekhoofdstuk Conferentiebijdrage

SDG-label van Aurora

Auteurs/uitgever

Onderzoekseenheden

Evenementen