Open WIKI
Home
Sources
About
Contacts
⯈
☰
Trust Region Policy Optimization
Trust Region Policy Optimization
is a
policy-gradient method
.