Policy Gradient¶
- class PolicyGradient(model, lr)[源代码]¶
基类:
Algorithm
- __init__(model, lr)[源代码]¶
Policy gradient algorithm
- 参数:
model (parl.Model) – model defining forward network of policy.
lr (float) – learning rate.
基类:Algorithm
Policy gradient algorithm
model (parl.Model) – model defining forward network of policy.
lr (float) – learning rate.