Speeding-Up Action Learning in a Social Robot With Dyna-Q+: A Bioinspired Probabilistic Model Approach

Robotic systems that are developed for social and dynamic environments require adaptive
mechanisms to successfully operate. Consequently, learning from rewards has provided meaningful results in
applications involving human-robot interaction. In those cases where the robot’s state space and the number
of actions are large, the dimensionality makes the problem intractable and drastically slows down the learning
process. This effect is especially noticeable in one-step temporal difference methods because just one update
is performed per robot-environment interaction. In this paper, we show how the action-based learning of a
social robot can be improved by combining classical temporal difference reinforcement learning methods,
such as Q-learning or Q(λ), with a probabilistic model of the environment. This architecture, which we
have called Dyna, allows the robot to simultaneously act and plan using the experience obtained during real
human-robot interactions. In particular, Dyna improves on classical algorithms in terms of convergence speed and
stability, which strengthens the learning process. Hence, in this work we have embedded a Dyna architecture
in our social robot, Mini, to endow it with the ability to autonomously maintain an optimal internal state while
living in a dynamic environment.
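
The idea described above, one real update per interaction followed by several simulated updates drawn from a learned probabilistic model, can be sketched roughly as follows. This is a minimal tabular illustration and not the implementation used on Mini: the class name DynaQPlus, the parameters alpha, gamma, epsilon, kappa and planning_steps, the count-based transition model, and the five-state chain environment are all assumptions made for the example. As a simplification, planning only replays state-action pairs that have actually been tried.

```python
import math
import random
from collections import defaultdict


class DynaQPlus:
    """Tabular Dyna-Q+ sketch with a count-based (empirical) transition model."""

    def __init__(self, actions, alpha=0.1, gamma=0.95, epsilon=0.1,
                 kappa=1e-3, planning_steps=10, seed=0):
        self.actions = list(actions)
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.kappa, self.planning_steps = kappa, planning_steps
        self.rng = random.Random(seed)
        self.q = defaultdict(float)  # (state, action) -> value estimate
        self.counts = defaultdict(lambda: defaultdict(int))  # (s, a) -> {s': n}
        self.reward_sum = defaultdict(float)  # (s, a, s') -> summed reward
        self.last_tried = {}  # (s, a) -> time step of the last real visit
        self.t = 0  # number of real interactions so far

    def act(self, state):
        """Epsilon-greedy action selection with random tie-breaking."""
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.actions)
        best = max(self.q[(state, a)] for a in self.actions)
        return self.rng.choice(
            [a for a in self.actions if self.q[(state, a)] == best])

    def _update(self, s, a, r, s_next):
        """One-step Q-learning backup."""
        target = r + self.gamma * max(self.q[(s_next, b)] for b in self.actions)
        self.q[(s, a)] += self.alpha * (target - self.q[(s, a)])

    def learn(self, s, a, r, s_next):
        """Learn from one real transition, then plan with the model."""
        self.t += 1
        self._update(s, a, r, s_next)  # direct update from real experience
        self.counts[(s, a)][s_next] += 1  # update the empirical model
        self.reward_sum[(s, a, s_next)] += r
        self.last_tried[(s, a)] = self.t
        seen = list(self.counts)
        for _ in range(self.planning_steps):  # simulated updates
            ps, pa = self.rng.choice(seen)
            outcomes = self.counts[(ps, pa)]
            # Sample a next state in proportion to how often it was observed.
            nxt = self.rng.choices(list(outcomes),
                                   weights=list(outcomes.values()))[0]
            mean_r = self.reward_sum[(ps, pa, nxt)] / outcomes[nxt]
            # Dyna-Q+ bonus: grows with the time since the pair was last tried.
            bonus = self.kappa * math.sqrt(self.t - self.last_tried[(ps, pa)])
            self._update(ps, pa, mean_r + bonus, nxt)


def run_chain(episodes=50, n_states=5):
    """Hypothetical toy task: walk right along a chain to reach the last state."""
    agent = DynaQPlus(actions=["left", "right"])
    for _ in range(episodes):
        s = 0
        while s != n_states - 1:
            a = agent.act(s)
            s_next = min(s + 1, n_states - 1) if a == "right" else max(s - 1, 0)
            r = 1.0 if s_next == n_states - 1 else 0.0
            agent.learn(s, a, r, s_next)
            s = s_next
    return agent
```

Setting planning_steps to 0 reduces the sketch to plain one-step Q-learning, which gives a simple way to compare convergence with and without the model on the toy task.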

IEEE Access

People

M. Maroto-Gomez

A. Castro-Gonzalez

M. Malfaz

M.A. Salichs

Rodrigo González
