Deep double descent
Procgen Benchmark
Safety Gym
Benchmarking safe exploration in deep reinforcement learning
GPT-2: 1.5B release
Solving Rubik’s Cube with a robot hand
OpenAI Scholars 2020: Applications open
Fine-tuning GPT-2 from human preferences
Emergent tool use from multi-agent interaction
Testing robustness against unforeseen adversaries