朱宗鑫 / rlcard

forked from Daochen Zha / rlcard 
run_dmc.py 2.11 KB
Daochen Zha committed on 2021-06-16 13:56: Update example
''' An example of training a Deep Monte-Carlo (DMC) Agent on the environments in RLCard
'''
import os
import argparse

import torch

import rlcard
from rlcard.agents.dmc_agent import DMCTrainer


def train(args):
    # Make the environment
    env = rlcard.make(args.env)

    # Initialize the DMC trainer
    trainer = DMCTrainer(env,
                         load_model=args.load_model,
                         xpid=args.xpid,
                         savedir=args.savedir,
                         save_interval=args.save_interval,
                         num_actor_devices=args.num_actor_devices,
                         num_actors=args.num_actors,
                         training_device=args.training_device)

    # Train DMC Agents
    trainer.start()


if __name__ == '__main__':
    parser = argparse.ArgumentParser("DMC example in RLCard")
    parser.add_argument('--env', type=str, default='leduc-holdem',
                        choices=['blackjack', 'leduc-holdem', 'limit-holdem', 'doudizhu',
                                 'mahjong', 'no-limit-holdem', 'uno', 'gin-rummy'])
    parser.add_argument('--cuda', type=str, default='1')
    parser.add_argument('--load_model', action='store_true',
                        help='Load an existing model')
    parser.add_argument('--xpid', default='doudizhu',
                        help='Experiment id (default: doudizhu)')
    parser.add_argument('--savedir', default='experiments/dmc_result',
                        help='Root dir where experiment data will be saved')
    parser.add_argument('--save_interval', default=30, type=int,
                        help='Time interval (in minutes) at which to save the model')
    parser.add_argument('--num_actor_devices', default=1, type=int,
                        help='The number of devices used for simulation')
    parser.add_argument('--num_actors', default=5, type=int,
                        help='The number of actors for each simulation device')
    parser.add_argument('--training_device', default=0, type=int,
                        help='The index of the GPU used for training models')

    args = parser.parse_args()

    os.environ["CUDA_VISIBLE_DEVICES"] = args.cuda
    train(args)
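
The script's flags interact in two ways that are easy to miss. `--num_actors` is counted per simulation device, and `--training_device` is presumably an index into the GPUs left visible by `--cuda` rather than a physical GPU id. The sketch below rebuilds a subset of the parser using only `argparse`, so it runs without RLCard or a GPU. `build_parser` and `physical_gpu` are hypothetical helpers introduced for illustration and are not part of RLCard. The index mapping assumes the trainer addresses devices after `CUDA_VISIBLE_DEVICES` has been applied.

```python
import argparse

ENVS = ['blackjack', 'leduc-holdem', 'limit-holdem', 'doudizhu',
        'mahjong', 'no-limit-holdem', 'uno', 'gin-rummy']


def build_parser():
    """Hypothetical helper: rebuild only the flags used in this illustration."""
    parser = argparse.ArgumentParser("DMC example in RLCard")
    parser.add_argument('--env', type=str, default='leduc-holdem', choices=ENVS)
    parser.add_argument('--cuda', type=str, default='1')
    parser.add_argument('--num_actor_devices', default=1, type=int)
    parser.add_argument('--num_actors', default=5, type=int)
    parser.add_argument('--training_device', default=0, type=int)
    return parser


def physical_gpu(cuda, training_device):
    """Hypothetical helper: map an index within CUDA_VISIBLE_DEVICES
    back to the physical GPU id it refers to (assumed behaviour)."""
    visible = [d.strip() for d in cuda.split(',') if d.strip()]
    return visible[training_device]


# Simulate a command line instead of reading sys.argv
args = build_parser().parse_args([
    '--env', 'doudizhu',
    '--cuda', '0,1',
    '--num_actor_devices', '2',
    '--num_actors', '8',
    '--training_device', '1',
])

# Actors are configured per simulation device, so the total is the product
total_actors = args.num_actor_devices * args.num_actors

print('environment:', args.env)
print('total actor processes:', total_actors)
print('training on physical GPU:', physical_gpu(args.cuda, args.training_device))
```

With the script's own defaults (`--cuda 1`, `--training_device 0`), the same mapping points training at physical GPU 1, which fails on a single-GPU machine unless `--cuda 0` is passed.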