Inxa.inSearch.cc | search engine, content portal, news aggretator, circle, nexth

Equivalence between policy gradients and soft Q-learning

Equivalence between policy gradients and soft Q-learning

Stochastic Neural Networks for hierarchical reinforcement learning

Stochastic Neural Networks for hierarchical reinforcement learning

Unsupervised sentiment neuron

Unsupervised sentiment neuron

Spam detection in the physical world

Spam detection in the physical world

Evolution strategies as a scalable alternative to reinforcement learning

Evolution strategies as a scalable alternative to reinforcement learning

One-shot imitation learning

One-shot imitation learning

Distill

Learning to communicate

Learning to communicate

Emergence of grounded compositional language in multi-agent populations

Emergence of grounded compositional language in multi-agent populations

Prediction and control with temporal segment models

Prediction and control with temporal segment models

Latest

Equivalence between policy gradients and soft Q-learning

Equivalence between policy gradients and soft Q-le...

9 years ago 1 Add to circle

Stochastic Neural Networks for hierarchical reinforcement learning

Stochastic Neural Networks for hierarchical reinfo...

9 years ago 1 Add to circle

Unsupervised sentiment neuron

Unsupervised sentiment neuron

9 years ago 1 Add to circle

Spam detection in the physical world

Spam detection in the physical world

9 years ago 1 Add to circle

Evolution strategies as a scalable alternative to reinforcement learning

Evolution strategies as a scalable alternative to ...

9 years ago 1 Add to circle

One-shot imitation learning

One-shot imitation learning

9 years ago 1 Add to circle

Distill

Distill

9 years ago 1 Add to circle

Learning to communicate

Learning to communicate

9 years ago 1 Add to circle

Emergence of grounded compositional language in multi-agent populations

Emergence of grounded compositional language in mu...

9 years ago 1 Add to circle

Prediction and control with temporal segment models

Prediction and control with temporal segment model...

9 years ago 1 Add to circle

Third-person imitation learning

Third-person imitation learning

9 years ago 1 Add to circle

Attacking machine learning with adversarial examples

Attacking machine learning with adversarial exampl...

9 years ago 1 Add to circle

Adversarial attacks on neural network policies

Adversarial attacks on neural network policies

9 years ago 1 Add to circle

Team update

Team update

9 years ago 1 Add to circle

PixelCNN++: Improving the PixelCNN with discretized logistic mixture likelihood and other modifications

PixelCNN++: Improving the PixelCNN with discretize...

9 years ago 1 Add to circle

Faulty reward functions in the wild

Faulty reward functions in the wild

9 years ago 1 Add to circle

Universe

Universe

9 years ago 1 Add to circle

OpenAI and Microsoft

OpenAI and Microsoft

9 years ago 1 Add to circle