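This repository does not declare an open-source license file (LICENSE); before using it, check the project description and the upstream dependencies of its code.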

Option Critic

This repository is a PyTorch implementation of the paper "The Option-Critic Architecture" by Pierre-Luc Bacon, Jean Harb and Doina Precup (arXiv). It is mostly a rewrite of the original Theano code in PyTorch.

Feature-based deep option-critic

Currently, the feature-based model can learn CartPole-v0 with a learning rate of 0.005. This has only been tested with two options, however. (I don't see any reason to use more than two in the CartPole environment.) The current runs directory holds the training results for this environment with learning rates of 0.005 and 0.006.

I suspect that getting it to work on Pong and similar environments will only take a grid search over the learning rate. Just supply the right --env argument and the model should switch between features and convolutions.

Four Room experiment

There are plenty of resources for finding a NumPy version of the four rooms experiment. This one is a little different: it represents the state as a one-hot encoded vector and learns to solve the grid world using a deep net. To run this experiment, use python main.py --switch-goal True --env fourrooms
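As a rough illustration of the one-hot state representation mentioned above, here is a minimal sketch. The helper name one_hot_state and the toy grid size are assumptions made for this example; they are not taken from this repository's code.

```python
def one_hot_state(cell_index, num_cells):
    """Encode a grid cell as a one-hot vector (hypothetical helper, not from the repo)."""
    if not 0 <= cell_index < num_cells:
        raise ValueError(f"cell_index {cell_index} is outside [0, {num_cells})")
    state = [0.0] * num_cells   # one entry per reachable cell
    state[cell_index] = 1.0     # mark the cell the agent currently occupies
    return state


# Toy example: a world with 5 reachable cells, agent standing in cell 2.
example = one_hot_state(2, 5)
```

A vector like this can be fed directly to a fully connected network, which is why the grid world can be treated like any other feature-based environment.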

Requirements

pytorch 1.3.0
tensorboard 2.0.2
gym 0.15.3

Changes with respect to the original implementation

  • Using only one optimizer (RMSProp) for both actor and critic.
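To make the single-optimizer change concrete, here is a minimal sketch of one RMSprop optimizer updating both an actor head and a critic head through a summed loss. The TinyActorCritic module, its layer sizes, and the placeholder losses are assumptions for illustration only; they do not reproduce this repository's model or its actual loss functions.

```python
import torch
import torch.nn as nn


class TinyActorCritic(nn.Module):
    """Hypothetical stand-in with a shared body, an actor head and a critic head."""

    def __init__(self, in_features=4, num_actions=2, num_options=2):
        super().__init__()
        self.features = nn.Sequential(nn.Linear(in_features, 32), nn.ReLU())
        self.actor = nn.Linear(32, num_options * num_actions)  # policy logits
        self.critic = nn.Linear(32, num_options)                # option values


model = TinyActorCritic()
# One optimizer over all parameters, instead of separate actor/critic optimizers.
optimizer = torch.optim.RMSprop(model.parameters(), lr=0.005)

obs = torch.randn(8, 4)                              # fake batch of CartPole-sized observations
hidden = model.features(obs)
actor_loss = model.actor(hidden).pow(2).mean()       # placeholder, not the real actor loss
critic_loss = model.critic(hidden).pow(2).mean()     # placeholder, not the real critic loss

loss = actor_loss + critic_loss                      # summed so one backward pass covers both
optimizer.zero_grad()
loss.backward()
optimizer.step()
```

Summing the two losses keeps the update to a single backward pass and a single optimizer step, at the cost of sharing one learning rate between actor and critic.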
