×
Site Menu
Everything
International
Politics
Business
Finance
Sports
Entertainment
Lifestyle
Literature
Travel
Technology
Startups
Innovation
iBazaar deals
Art & Culture
Wine & Spirits
Science
Health
Local
Faulty reward functions in the wild
9 years ago
4
Add to circle
Reinforcement learning algorithms can break in surprising, counterintuitive ways. In this post we’ll explore one failure mode, which is where you misspecify your reward function.
Read Entire Article
Homepage
Technology
Faulty reward functions in the wild
Related
US-Iran Peace Agreement Prompts Stock Rally, Leaves Some Inv...
27 minutes ago
0
Rylo, which develops real-time speech and sign-language tran...
28 minutes ago
0
Hypha, which extracts data from private-market documents to ...
1 hour ago
0
Everything
International
Politics
Business
Finance
Sports
Entertainment
Lifestyle
Literature
Travel
Technology
Startups
Innovation
iBazaar deals
Art & Culture
Wine & Spirits
Science
Health
Local