# rl-notebook **Repository Path**: etoileyue/rl-notebook ## Basic Information - **Project Name**: rl-notebook - **Description**: 源仓库:https://github.com/NoneJou072/rl-notebook - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2025-07-03 - **Last Updated**: 2025-07-03 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # 一个简洁的强化学习算法库 介绍每一种强化学习算法的 * 算法原理 ✔️ * Pytorch实现 ✔️ --- | 单智能体强化学习算法 | Policy | Based | |-------------------------------------------------|:------------:|:--------------:| | 👉 [Sarsa](Sarsa/Sarsa.md) | on-policy | value-based | | 👉 [Q-Learning](Q-learning/Q_learning.md) | off-policy | value-based | | 👉 [DQN](DQN/DQN.md) | off-policy | value-based | | ❌ [Rainbow-DQN]() | off-policy | value-based | | 👉 [REINFORCE](REINFORCE/REINFORCE.md) | on-policy | policy-based | | 👉 [actor-critic](actor-critic/actor-critic.md) | on-policy | policy-based | | 👉 [A2C](A2C/A2C.md) | on-policy | Actor-Critic | | 👉 [DDPG](DDPG/DDPG.md) | off-policy | Actor-Critic | | 👉 [HER-DDPG](HER/HER.md) | off-policy | Actor-Critic | | 👉 [TD3](TD3/TD3.md) | off-policy | Actor-Critic | | ❌ [TRPO]() | on-policy | Actor-Critic | | 👉 [PPO-Continuous](PPO/PPO.md) | on-policy | Actor-Critic | | 👉 [SAC](SAC/SAC.md) | off-policy | Actor-Critic | | 👉 [Relay HER (RHER)](https://github.com/kaixindelele/RHER) | off-policy | Actor-Critic | | 👉 [Behavior Cloning (BC)]() | off-policy | Imitation Learning | | 👉 [Generative Adversarial Imitation Learning (GAIL)]() | on-policy | Imitation Learning | --- 运行环境: > python(in Pycharm)- 3.10 gymnasium-0.28.1 numpy-1.24.3 torch-2.1.0 建议使用 Pycharm,vscode或终端启动会有路径问题。建议更新 `gymnasium` 和 `pytorch` 到最新版本。算法原理请参考每个算法文件夹内的 markdown 文件,内部实现参考以算法名称命名的 `.py` 脚本,运行 `train.py` 脚本以进行训练。 在部分算法中添加了 tensorboard 模块,启动训练后会在对应算法文件夹内生成 log 文件夹,通过下面的终端命令可以打开网页查看训练日志: ```shell tensorboard --logdir . ``` References --- * [OpenAI-Spinningup](https://spinningup.openai.com/en/latest/algorithms/sac.html) * [Easy-RL (蘑菇书)]() * 深度强化学习-王树森 * [Github:Lizhi-sjtu/DRL-code-pytorch](https://github.com/Lizhi-sjtu/DRL-code-pytorch/blob/main/2.Actor-Critic/README.md) * [动手学强化学习](https://hrl.boyuai.com/chapter/2/dqn%E7%AE%97%E6%B3%95)