Improving language understanding with unsupervised learning
GamePad: A learning environment for theorem proving
OpenAI Fellows Fall 2018
Gym Retro
AI and compute
AI safety via debate
Evolved Policy Gradients
Gotta Learn Fast: A new benchmark for generalization in RL
Retro Contest
Variance reduction for policy gradient with action-dependent factorized baselines