Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"
Last updated: 5 days ago
Code for the paper "Exploration by Random Network Distillation"
Last updated: 5 days ago
Code for reproducing key results in the paper "Improving Variational Inference with Inverse Autoregressive Flow"
Last updated: 5 days ago
Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"
Last updated: 5 days ago
Code for the paper "DeepType: Multilingual Entity Linking by Neural Type System Evolution"
Last updated: 5 days ago
Code for the paper "Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents"
Last updated: 5 days ago
Code for reproducing results in "Glow: Generative Flow with Invertible 1x1 Convolutions"
Last updated: 5 days ago
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Last updated: 5 days ago
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Last updated: 5 days ago
DEPRECATED: Open-source software for robot simulation, integrated with OpenAI Gym.
Last updated: 5 days ago
Repository for the paper "Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images"
Last updated: 5 days ago
Example code for Weight Normalization, from "Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks"
Last updated: 5 days ago