Reset-free Trial-and-Error Learning for Robot Damage Recovery

Chatzilygeroudis, Konstantinos; Vassiliades, Vassilis; Mouret, Jean-Baptiste

doi:10.1016/j.robot.2017.11.010

arXiv:1610.04213 (cs)

[Submitted on 13 Oct 2016 (v1), last revised 12 Dec 2017 (this version, v4)]

Abstract: The high probability of hardware failures prevents many advanced robots (e.g., legged robots) from being confidently deployed in real-world situations (e.g., post-disaster rescue). Instead of attempting to diagnose the failures, robots could adapt by trial-and-error in order to be able to complete their tasks. In this situation, damage recovery can be seen as a Reinforcement Learning (RL) problem. However, the best RL algorithms for robotics require the robot and the environment to be reset to an initial state after each episode, that is, the robot is not learning autonomously. In addition, most of the RL methods for robotics do not scale well with complex robots (e.g., walking robots) and either cannot be used at all or take too long to converge to a solution (e.g., hours of learning). In this paper, we introduce a novel learning algorithm called "Reset-free Trial-and-Error" (RTE) that (1) breaks the complexity by pre-generating hundreds of possible behaviors with a dynamics simulator of the intact robot, and (2) allows complex robots to quickly recover from damage while completing their tasks and taking the environment into account. We evaluate our algorithm on a simulated wheeled robot, a simulated six-legged robot, and a real six-legged walking robot that are damaged in several ways (e.g., a missing leg, a shortened leg, or a faulty motor) and whose objective is to reach a sequence of targets in an arena. Our experiments show that the robots can recover most of their locomotion abilities in an environment with obstacles, and without any human intervention.

Comments:	18 pages, 16 figures, 3 tables, 6 pseudocodes/algorithms, video at this https URL, code at this https URL
Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI)
Cite as:	arXiv:1610.04213 [cs.RO]
	(or arXiv:1610.04213v4 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.1610.04213
Related DOI:	https://doi.org/10.1016/j.robot.2017.11.010

Submission history

From: Konstantinos Chatzilygeroudis
[v1] Thu, 13 Oct 2016 19:39:58 UTC (2,119 KB)
[v2] Wed, 12 Apr 2017 23:08:17 UTC (8,279 KB)
[v3] Thu, 23 Nov 2017 10:55:03 UTC (8,141 KB)
[v4] Tue, 12 Dec 2017 08:02:31 UTC (8,141 KB)
