Learning to communicate
Emergence of grounded compositional language in multi-agent populations
Prediction and control with temporal segment models
Third-person imitation learning
Attacking machine learning with adversarial examples
Adversarial attacks on neural network policies
Team update
PixelCNN++: Improving the PixelCNN with discretized logistic mixture likelihood and other modifications
Faulty reward functions in the wild
Universe