Publication

Synthesising Reinforcement Learning Policies Through Set-Valued Inductive Rule Learning

Book Contribution - Book Chapter Conference Contribution

Today's advanced Reinforcement Learning algorithms produce black-box policies, that are often difficult to interpret and trust for a person. We introduce a policy distilling algorithm, building on the CN2 rule mining algorithm, that distills the policy into a rule-based decision system. At the core of our approach is the fact that an RL process does not just learn a policy, a mapping from states to actions, but also produces extra meta-information, such as action values indicating the quality of alternative actions. This meta-information can indicate whether more than one action is near-optimal for a certain state. We extend CN2 to make it able to leverage knowledge about equally-good actions to distill the policy into fewer rules, increasing its interpretability by a person. Then, to ensure that the rules explain a valid, non-degenerate policy, we introduce a refinement algorithm that fine-tunes the rules to obtain good performance when executed in the environment. We demonstrate the applicability of our algorithm on the Mario AI benchmark, a complex task that requires modern reinforcement learning algorithms including neural networks. The explanations we produce capture the learned policy in only a few rules, that allow a person to understand what the black-box agent learned. Source code: https://gitlab.ai.vub.ac.be/yocoppen/svcn2.

Book: Trustworthy AI - Integrating Learning, Optimization and Reasoning

Edition: 1

Series: Lecture Notes in Computer Science

Pages: 163-179

Number of pages: 17

ISBN:978-3-030-73958-4

Publication year:2021

Scopus Id: 85105923166
DOI: https://doi.org/10.1007/978-3-030-73959-1_15
Institutional Repository URL: https://arxiv.org/abs/2106.06009
ORCID: /0000-0001-6346-4564/work/92111289
ORCID: /0000-0003-1124-0731/work/92112144
ORCID: /0000-0003-1521-8494/work/105290029

Accessibility:Open

Publication

Synthesising Reinforcement Learning Policies Through Set-Valued Inductive Rule Learning

Book Contribution - Book Chapter Conference Contribution

Authors/publisher

Projects