Prediction and control with temporal segment models
Third-person imitation learning
Attacking machine learning with adversarial examples
Adversarial attacks on neural network policies
Team update
PixelCNN++: Improving the PixelCNN with discretized logistic mixture likelihood and other modifications
Faulty reward functions in the wild
Universe
OpenAI and Microsoft
#Exploration: A study of count-based exploration for deep reinforcement learning