Models
Compare
News
Skills
Tools
Guides
Search...
Back to News
Policy
2018-03-20
Variance reduction for policy gradient with action-dependent factorized baselines
Source:
OpenAI
Read Original Article on OpenAI
openai
gpt