[科技报告]AD  Fernandez, F. , Veloso, M.15

摘要: Policy Reuse (PR) provides Reinforcement Learning algorithms with a mechanism to bias an exploration process by reusing a set of past policies. Policy Reuse offers the challenge of balancing the exploitation of the ongoing learned... 展开

翻译摘要
相关作者
相关关键词