Meta-Optimization of Bias-Variance Trade-Off in Stochastic Model Learning

Model-based reinforcement learning is expected to be a method that can safely acquire the optimal policy under real-world conditions by using a stochastic dynamics model for planning. Since the stochastic dynamics model of the real world is generally unknown, a method for learning from state transit...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Takumi Aotani, Taisuke Kobayashi, Kenji Sugimoto
Formato: article
Lenguaje:EN
Publicado: IEEE 2021
Materias:
Acceso en línea:https://doaj.org/article/af14030d799748c8865f08da2aa6ba56
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!