Domain randomization and generative models for robotic grasping
Competitive self-play
Meta-learning for wrestling
Nonlinear computation in deep linear networks
Learning to model other minds
Learning with opponent-learning awareness
OpenAI Baselines: ACKTR & A2C
More on Dota 2
Dota 2
Gathering human feedback