Inxa.inSearch.cc | search engine, content portal, news aggretator, circle, nexth

Meta-learning for wrestling

Meta-learning for wrestling

Nonlinear computation in deep linear networks

Nonlinear computation in deep linear networks

Learning to model other minds

Learning to model other minds

Learning with opponent-learning awareness

Learning with opponent-learning awareness

OpenAI Baselines: ACKTR & A2C

OpenAI Baselines: ACKTR & A2C

More on Dota 2

Dota 2

Gathering human feedback

Gathering human feedback

Better exploration with parameter noise

Better exploration with parameter noise

Proximal Policy Optimization

Proximal Policy Optimization

Latest

Meta-learning for wrestling

Meta-learning for wrestling

8 years ago 2 Add to circle

Nonlinear computation in deep linear networks

Nonlinear computation in deep linear networks

8 years ago 2 Add to circle

Learning to model other minds

Learning to model other minds

8 years ago 2 Add to circle

Learning with opponent-learning awareness

Learning with opponent-learning awareness

8 years ago 1 Add to circle

OpenAI Baselines: ACKTR & A2C

OpenAI Baselines: ACKTR & A2C

8 years ago 2 Add to circle

More on Dota 2

More on Dota 2

8 years ago 2 Add to circle

Dota 2

Dota 2

8 years ago 1 Add to circle

Gathering human feedback

Gathering human feedback

8 years ago 1 Add to circle

Better exploration with parameter noise

Better exploration with parameter noise

8 years ago 1 Add to circle

Proximal Policy Optimization

Proximal Policy Optimization

8 years ago 1 Add to circle

Robust adversarial inputs

Robust adversarial inputs

8 years ago 1 Add to circle

Hindsight Experience Replay

Hindsight Experience Replay

8 years ago 1 Add to circle

Teacher–student curriculum learning

Teacher–student curriculum learning

8 years ago 1 Add to circle

Faster physics in Python

Faster physics in Python

8 years ago 1 Add to circle

Learning from human preferences

Learning from human preferences

8 years ago 1 Add to circle

Learning to cooperate, compete, and communicate

Learning to cooperate, compete, and communicate

8 years ago 1 Add to circle

UCB exploration via Q-ensembles

UCB exploration via Q-ensembles

8 years ago 1 Add to circle

OpenAI Baselines: DQN

OpenAI Baselines: DQN

9 years ago 1 Add to circle