Feature-Attending Recurrent Modules for Generalization in Reinforcement Learning

Carvalho, Wilka; Lampinen, Andrew; Nikiforou, Kyriacos; Hill, Felix; Shanahan, Murray

arXiv:2112.08369v1 (cs)

[Submitted on 15 Dec 2021 (this version), latest version 3 Nov 2023 (v3)]

Abstract: Deep reinforcement learning (Deep RL) has recently seen significant progress in developing algorithms for generalization. However, most algorithms target a single type of generalization setting. In this work, we study generalization across three disparate task structures: (a) tasks composed of spatial and temporal compositions of regularly occurring object motions; (b) tasks composed of active perception of and navigation towards regularly occurring 3D objects; and (c) tasks composed of remembering goal-information over sequences of regularly occurring object-configurations. These diverse task structures all share an underlying idea of compositionality: task completion always involves combining recurring segments of task-oriented perception and behavior. We hypothesize that an agent can generalize within a task structure if it can discover representations that capture these recurring task-segments. For our tasks, this corresponds to representations for recognizing individual object motions, for navigating towards 3D objects, and for navigating through object-configurations. Taking inspiration from cognitive science, we term representations for recurring segments of an agent's experience "perceptual schemas". We propose Feature-Attending Recurrent Modules (FARM), an architecture that learns a state representation in which perceptual schemas are distributed across multiple, relatively small recurrent modules. We compare FARM to recurrent architectures that leverage spatial attention, which reduces observation features to a weighted average over spatial positions. Our experiments indicate that our feature-attention mechanism better enables FARM to generalize across the diverse object-centric domains we study.
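
The contrast drawn in the abstract can be illustrated with a minimal sketch in plain Python. The `spatial_attention` function follows the abstract's description (a weighted average over spatial positions). The `feature_attention` function is an assumption made only for illustration: the abstract does not define the mechanism, so a per-channel sigmoid gate that keeps the spatial layout is used as a stand-in. All function names, shapes, and values here are hypothetical and are not taken from the paper.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def spatial_attention(features, position_scores):
    """Weighted average over spatial positions.

    features: list of P positions, each a list of C channel values.
    Returns one C-vector; the spatial layout is collapsed.
    """
    weights = softmax(position_scores)
    channels = len(features[0])
    return [
        sum(w * pos[c] for w, pos in zip(weights, features))
        for c in range(channels)
    ]


def feature_attention(features, channel_logits):
    """ASSUMED form of feature attention: one sigmoid gate per channel,
    applied at every position, so the spatial layout is preserved."""
    gates = [1.0 / (1.0 + math.exp(-g)) for g in channel_logits]
    return [[g * v for g, v in zip(gates, pos)] for pos in features]


# Toy observation: 3 spatial positions, 2 feature channels (made-up values).
features = [[1.0, 0.0], [0.0, 2.0], [3.0, 1.0]]

pooled = spatial_attention(features, [0.0, 0.0, 0.0])  # uniform weights
gated = feature_attention(features, [0.0, 0.0])        # every gate is 0.5
```

With uniform scores, `pooled` is the per-channel mean over positions, a single vector of length 2. `gated` still has 3 positions, each scaled channel by channel, which is the property this sketch is meant to highlight.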

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2112.08369 [cs.LG]
	(or arXiv:2112.08369v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2112.08369

Submission history

From: Wilka Carvalho
[v1] Wed, 15 Dec 2021 12:48:12 UTC (12,517 KB)
[v2] Fri, 28 Jan 2022 23:18:07 UTC (12,916 KB)
[v3] Fri, 3 Nov 2023 15:12:28 UTC (26,790 KB)
