Skip to content

Commit 23b530b

Browse files
authored
Update publication FGRL
1 parent 7da860d commit 23b530b

1 file changed

Lines changed: 14 additions & 3 deletions

File tree

_data/publications.yaml

Lines changed: 14 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -211,19 +211,30 @@
211211
- title: Feudal Graph Reinforcement Learning
212212
links:
213213
paper: https://arxiv.org/abs/2304.05099
214-
venue: Preprint
215-
year: 2023
214+
venue: Transactions on Machine Learning Research
215+
year: 2024
216216
authors:
217217
- id:tmarzi
218218
- A. Khehra
219219
- id:acini
220220
- id:calippi
221-
first_authors: 2
221+
first_authors: 1
222222
keywords:
223223
- reinforcement learning
224224
- relational inductive biases
225225
- graph neural networks
226226
abstract: We focus on learning composable policies to control a variety of physical agents with possibly different structures. Among state-of-the-art methods, prominent approaches exploit graph-based representations and weight-sharing modular policies based on the message-passing framework. However, as shown by recent literature, message passing can create bottlenecks in information propagation and hinder global coordination. This drawback can become even more problematic in tasks where high-level planning is crucial. In fact, in similar scenarios, each modular policy - e.g., controlling a joint of a robot - would request to coordinate not only for basic locomotion but also achieve high-level goals, such as navigating a maze. A classical solution to avoid similar pitfalls is to resort to hierarchical decision-making. In this work, we adopt the Feudal Reinforcement Learning paradigm to develop agents where control actions are the outcome of a hierarchical (pyramidal) message-passing process. In the proposed Feudal Graph Reinforcement Learning (FGRL) framework, high-level decisions at the top level of the hierarchy are propagated through a layered graph representing a hierarchy of policies. Lower layers mimic the morphology of the physical system and upper layers can capture more abstract sub-modules. The purpose of this preliminary work is to formalize the framework and provide proof-of-concept experiments on benchmark environments (MuJoCo locomotion tasks). Empirical evaluation shows promising results on both standard benchmarks and zero-shot transfer learning settings.
227+
bibtex: >
228+
@article{
229+
marzi2024feudal,
230+
title={Feudal Graph Reinforcement Learning},
231+
author={Tommaso Marzi and Arshjot Singh Khehra and Andrea Cini and Cesare Alippi},
232+
journal={Transactions on Machine Learning Research},
233+
issn={2835-8856},
234+
year={2024},
235+
url={https://openreview.net/forum?id=wFcyJTik90},
236+
note={}
237+
}
227238
- title: Object-Centric Relational Representations for Image Generation
228239
links:
229240
paper: https://arxiv.org/abs/2303.14681

0 commit comments

Comments
 (0)