Models
Compare
News
Skills
Tools
Guides
Search...
Back to News
Policy
2017-04-21
Equivalence between policy gradients and soft Q-learning
Source:
OpenAI
Read Original Article on OpenAI
openai
gpt