Technology - Inxa.inSearch.cc

Technology

Learning a hierarchy

Learning a hierarchy

8 years ago 5 Add to circle

Generalizing from simulation

Generalizing from simulation

8 years ago 6 Add to circle

Asymmetric actor critic for image-based robot learning

Asymmetric actor critic for image-based robot lear...

8 years ago 7 Add to circle

Sim-to-real transfer of robotic control with dynamics randomization

Sim-to-real transfer of robotic control with dynam...

8 years ago 5 Add to circle

Domain randomization and generative models for robotic grasping

Domain randomization and generative models for rob...

8 years ago 7 Add to circle

Competitive self-play

Competitive self-play

8 years ago 8 Add to circle

Meta-learning for wrestling

Meta-learning for wrestling

8 years ago 7 Add to circle

Nonlinear computation in deep linear networks

Nonlinear computation in deep linear networks

8 years ago 8 Add to circle

Learning to model other minds

Learning to model other minds

8 years ago 7 Add to circle

Learning with opponent-learning awareness

Learning with opponent-learning awareness

8 years ago 9 Add to circle

OpenAI Baselines: ACKTR & A2C

OpenAI Baselines: ACKTR & A2C

8 years ago 8 Add to circle

More on Dota 2

More on Dota 2

8 years ago 8 Add to circle

Dota 2

Dota 2

8 years ago 8 Add to circle

Gathering human feedback

Gathering human feedback

8 years ago 9 Add to circle

Better exploration with parameter noise

Better exploration with parameter noise

8 years ago 8 Add to circle

Proximal Policy Optimization

Proximal Policy Optimization

8 years ago 9 Add to circle

Robust adversarial inputs

Robust adversarial inputs

8 years ago 7 Add to circle

Hindsight Experience Replay

Hindsight Experience Replay

9 years ago 8 Add to circle

Teacher–student curriculum learning

Teacher–student curriculum learning

9 years ago 7 Add to circle

Faster physics in Python

Faster physics in Python

9 years ago 8 Add to circle

Learning from human preferences

Learning from human preferences

9 years ago 7 Add to circle

Learning to cooperate, compete, and communicate

Learning to cooperate, compete, and communicate

9 years ago 8 Add to circle

UCB exploration via Q-ensembles

UCB exploration via Q-ensembles

9 years ago 6 Add to circle

OpenAI Baselines: DQN

OpenAI Baselines: DQN

9 years ago 6 Add to circle

Robots that learn

Robots that learn

9 years ago 5 Add to circle

Roboschool

Roboschool

9 years ago 5 Add to circle

Equivalence between policy gradients and soft Q-learning

Equivalence between policy gradients and soft Q-le...

9 years ago 7 Add to circle

Stochastic Neural Networks for hierarchical reinforcement learning

Stochastic Neural Networks for hierarchical reinfo...

9 years ago 5 Add to circle

Unsupervised sentiment neuron

Unsupervised sentiment neuron

9 years ago 9 Add to circle

Spam detection in the physical world

Spam detection in the physical world

9 years ago 6 Add to circle

Evolution strategies as a scalable alternative to reinforcement learning

Evolution strategies as a scalable alternative to ...

9 years ago 7 Add to circle

One-shot imitation learning

One-shot imitation learning

9 years ago 6 Add to circle

Distill

Distill

9 years ago 6 Add to circle

Learning to communicate

Learning to communicate

9 years ago 5 Add to circle

Emergence of grounded compositional language in multi-agent populations

Emergence of grounded compositional language in mu...

9 years ago 9 Add to circle

Prediction and control with temporal segment models

Prediction and control with temporal segment model...

9 years ago 6 Add to circle

Third-person imitation learning

Third-person imitation learning

9 years ago 6 Add to circle

Attacking machine learning with adversarial examples

Attacking machine learning with adversarial exampl...

9 years ago 9 Add to circle

Adversarial attacks on neural network policies

Adversarial attacks on neural network policies

9 years ago 7 Add to circle

Team update

Team update

9 years ago 6 Add to circle

PixelCNN++: Improving the PixelCNN with discretized logistic mixture likelihood and other modifications

PixelCNN++: Improving the PixelCNN with discretize...

9 years ago 6 Add to circle

Faulty reward functions in the wild

Faulty reward functions in the wild

9 years ago 8 Add to circle

Universe

Universe

9 years ago 7 Add to circle

OpenAI and Microsoft

OpenAI and Microsoft

9 years ago 6 Add to circle

#Exploration: A study of count-based exploration for deep reinforcement learning

#Exploration: A study of count-based exploration f...

9 years ago 8 Add to circle

On the quantitative analysis of decoder-based generative models

On the quantitative analysis of decoder-based gene...

9 years ago 11 Add to circle

A connection between generative adversarial networks, inverse reinforcement learning, and energy-based models

A connection between generative adversarial networ...

9 years ago 8 Add to circle

RL²: Fast reinforcement learning via slow reinforcement learning

RL²: Fast reinforcement learning via slow reinforc...

9 years ago 11 Add to circle

Variational lossy autoencoder

Variational lossy autoencoder

9 years ago 11 Add to circle

Extensions and limitations of the neural GPU

Extensions and limitations of the neural GPU

9 years ago 15 Add to circle