Technology - Inxa.inSearch.cc

Technology

Learning Montezuma’s Revenge from a single demonstration

7 years ago

OpenAI Five

7 years ago

Retro Contest: Results

7 years ago

Learning policy representations in multiagent systems

7 years ago

Improving language understanding with unsupervised learning

7 years ago

GamePad: A learning environment for theorem proving

8 years ago

OpenAI Fellows Fall 2018

8 years ago

Gym Retro

8 years ago

AI and compute

8 years ago

AI safety via debate

8 years ago

Evolved Policy Gradients

8 years ago

Gotta Learn Fast: A new benchmark for generalization in RL

8 years ago

Retro Contest

8 years ago

Variance reduction for policy gradient with action-dependent factorized baselines

8 years ago

Improving GANs using optimal transport

8 years ago

Report from the OpenAI hackathon

8 years ago

On first-order meta-learning algorithms

8 years ago

Reptile: A scalable meta-learning algorithm

8 years ago

OpenAI Scholars

8 years ago

Some considerations on learning to explore via meta-reinforcement learning

8 years ago

Multi-Goal Reinforcement Learning: Challenging robotics environments and request for research

8 years ago

Ingredients for robotics research

8 years ago

OpenAI hackathon

8 years ago

OpenAI supporters

8 years ago

Preparing for malicious uses of AI

8 years ago

Interpretable machine learning through teaching

8 years ago

Discovering types for entity disambiguation

8 years ago

Requests for Research 2.0

8 years ago

Scaling Kubernetes to 2,500 nodes

8 years ago

Block-sparse GPU kernels

8 years ago

Learning sparse neural networks through L₀ regularization

8 years ago

Interpretable and pedagogical examples

8 years ago

Learning a hierarchy

8 years ago

Generalizing from simulation

8 years ago

Asymmetric actor critic for image-based robot learning

8 years ago

Sim-to-real transfer of robotic control with dynamics randomization

8 years ago

Domain randomization and generative models for robotic grasping

8 years ago

Competitive self-play

8 years ago

Meta-learning for wrestling

8 years ago

Nonlinear computation in deep linear networks

8 years ago

Learning to model other minds

8 years ago

Learning with opponent-learning awareness

8 years ago

OpenAI Baselines: ACKTR & A2C

8 years ago

More on Dota 2

8 years ago

Dota 2

8 years ago

Gathering human feedback

8 years ago

Better exploration with parameter noise

8 years ago

Proximal Policy Optimization

8 years ago

Robust adversarial inputs

8 years ago

Hindsight Experience Replay

8 years ago