Transfer learning via inter-task mappings for temporal difference learning

Share Embed


Descripción

Temporal difference (TD) learning (Sutton and Barto, 1998) has become a popular reinforcement learning technique in recent years. TD methods, relying on function approximators to generalize learning to novel situations, have had some experimental successes and have been shown to exhibit some desirable properties in theory, but the most basic algorithms have often been found slow in practice. This empirical result has motivated the development of many methods that speed up reinforcement learning by modifying a ...
Lihat lebih banyak...

Comentarios

Copyright © 2017 DATOSPDF Inc.