Technology - Inxa.inSearch.cc

Retro Contest

8 years ago

Variance reduction for policy gradient with action-dependent factorized baselines

8 years ago

Improving GANs using optimal transport

8 years ago

Report from the OpenAI hackathon

8 years ago

On first-order meta-learning algorithms

8 years ago

Reptile: A scalable meta-learning algorithm

8 years ago

OpenAI Scholars

8 years ago

Some considerations on learning to explore via meta-reinforcement learning

8 years ago

Multi-Goal Reinforcement Learning: Challenging robotics environments and request for research

8 years ago

Ingredients for robotics research

8 years ago

OpenAI hackathon

8 years ago

OpenAI supporters

8 years ago

Preparing for malicious uses of AI

8 years ago

Interpretable machine learning through teaching

8 years ago

Discovering types for entity disambiguation

8 years ago

Requests for Research 2.0

8 years ago

Scaling Kubernetes to 2,500 nodes

8 years ago

Block-sparse GPU kernels

8 years ago

Learning sparse neural networks through L₀ regularization

8 years ago

Interpretable and pedagogical examples

8 years ago

Learning a hierarchy

8 years ago

Generalizing from simulation

8 years ago

Asymmetric actor critic for image-based robot learning

8 years ago

Sim-to-real transfer of robotic control with dynamics randomization

8 years ago

Domain randomization and generative models for robotic grasping

8 years ago

Competitive self-play

8 years ago

Meta-learning for wrestling

8 years ago

Nonlinear computation in deep linear networks

8 years ago

Learning to model other minds

8 years ago

Learning with opponent-learning awareness

8 years ago

OpenAI Baselines: ACKTR & A2C

8 years ago

More on Dota 2

8 years ago

Dota 2

8 years ago

Gathering human feedback

8 years ago

Better exploration with parameter noise

8 years ago

Proximal Policy Optimization

8 years ago

Robust adversarial inputs

8 years ago

Hindsight Experience Replay

8 years ago

Teacher–student curriculum learning

8 years ago

Faster physics in Python

8 years ago

Learning from human preferences

9 years ago

Learning to cooperate, compete, and communicate

9 years ago

UCB exploration via Q-ensembles

9 years ago

OpenAI Baselines: DQN

9 years ago

Robots that learn

9 years ago

Roboschool

9 years ago

Equivalence between policy gradients and soft Q-learning

9 years ago

Stochastic Neural Networks for hierarchical reinforcement learning

9 years ago

Unsupervised sentiment neuron

9 years ago

Spam detection in the physical world

9 years ago