Inxa.inSearch.cc | search engine, content portal, news aggretator, circle, nexth

Asymmetric actor critic for image-based robot learning

Asymmetric actor critic for image-based robot learning

Sim-to-real transfer of robotic control with dynamics randomization

Sim-to-real transfer of robotic control with dynamics randomization

Domain randomization and generative models for robotic grasping

Domain randomization and generative models for robotic grasping

Competitive self-play

Competitive self-play

Meta-learning for wrestling

Meta-learning for wrestling

Nonlinear computation in deep linear networks

Nonlinear computation in deep linear networks

Learning to model other minds

Learning to model other minds

Learning with opponent-learning awareness

Learning with opponent-learning awareness

OpenAI Baselines: ACKTR & A2C

OpenAI Baselines: ACKTR & A2C

More on Dota 2

Latest

Asymmetric actor critic for image-based robot learning

Asymmetric actor critic for image-based robot lear...

8 years ago 5 Add to circle

Sim-to-real transfer of robotic control with dynamics randomization

Sim-to-real transfer of robotic control with dynam...

8 years ago 4 Add to circle

Domain randomization and generative models for robotic grasping

Domain randomization and generative models for rob...

8 years ago 4 Add to circle

Competitive self-play

Competitive self-play

8 years ago 5 Add to circle

Meta-learning for wrestling

Meta-learning for wrestling

8 years ago 5 Add to circle

Nonlinear computation in deep linear networks

Nonlinear computation in deep linear networks

8 years ago 6 Add to circle

Learning to model other minds

Learning to model other minds

8 years ago 5 Add to circle

Learning with opponent-learning awareness

Learning with opponent-learning awareness

8 years ago 7 Add to circle

OpenAI Baselines: ACKTR & A2C

OpenAI Baselines: ACKTR & A2C

8 years ago 6 Add to circle

More on Dota 2

More on Dota 2

8 years ago 6 Add to circle

Dota 2

Dota 2

8 years ago 5 Add to circle

Gathering human feedback

Gathering human feedback

8 years ago 6 Add to circle

Better exploration with parameter noise

Better exploration with parameter noise

8 years ago 5 Add to circle

Proximal Policy Optimization

Proximal Policy Optimization

8 years ago 7 Add to circle

Robust adversarial inputs

Robust adversarial inputs

8 years ago 5 Add to circle

Hindsight Experience Replay

Hindsight Experience Replay

8 years ago 5 Add to circle

Teacher–student curriculum learning

Teacher–student curriculum learning

8 years ago 5 Add to circle

Faster physics in Python

Faster physics in Python

8 years ago 5 Add to circle