Temporal-difference reinforcement learning with distributed representations.

Temporal-difference (TD) algorithms have been proposed as models of reinforcement learning (RL). We examine two issues of distributed representation in these TD algorithms: distributed representations of belief and distributed discounting factors. Distributed representation of belief allows the beli...

Description complète

Enregistré dans:
Détails bibliographiques
Auteurs principaux: Zeb Kurth-Nelson, A David Redish
Format: article
Langue:EN
Publié: Public Library of Science (PLoS) 2009
Sujets:
R
Q
Accès en ligne:https://doaj.org/article/10b71edf81334d619f75d3ba97df1661
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!