update SAC
johnjim0816 committed Jul 30, 2023
1 parent 9d97dd5 commit 171f981
Showing 1 changed file with 1 addition and 2 deletions.
3 changes: 1 addition & 2 deletions docs/ch13/main.md
@@ -314,6 +314,5 @@ $$
J(\alpha)=\mathbb{E}_{a_t \sim \pi_t}\left[-\alpha \log \pi_t\left(a_t \mid s_t\right)-\alpha \mathcal{H}_0\right]
$$

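-This makes automatic adjustment of the temperature factor possible.
-
+This makes automatic adjustment of the temperature factor possible. This version, which introduces automatic temperature adjustment, also needs no separate `V` network: two `Q` networks (with both current and target networks) serve directly as the `Critic` for value estimation.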
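As a rough illustration of the objective $J(\alpha)$ above, the following is a minimal sketch in plain Python of one gradient-descent step on the temperature. The function names, the learning rate, and the sample log-probabilities are hypothetical and chosen only for illustration; practical implementations commonly optimize $\log \alpha$ with an adaptive optimizer instead of stepping $\alpha$ directly as done here.

```python
def temperature_loss(alpha, log_probs, target_entropy):
    """Monte Carlo estimate of J(alpha) = E[-alpha * log pi(a|s) - alpha * H_0]."""
    terms = [-alpha * lp - alpha * target_entropy for lp in log_probs]
    return sum(terms) / len(terms)


def temperature_grad(log_probs, target_entropy):
    """dJ/dalpha = -(E[log pi(a|s)] + H_0); it does not depend on alpha."""
    mean_log_prob = sum(log_probs) / len(log_probs)
    return -(mean_log_prob + target_entropy)


def update_alpha(alpha, log_probs, target_entropy, lr=0.01):
    """One plain gradient-descent step on alpha (illustrative assumption),
    clipped so that the temperature stays positive."""
    new_alpha = alpha - lr * temperature_grad(log_probs, target_entropy)
    return max(new_alpha, 1e-8)


# Hypothetical batch: the estimated policy entropy is -mean(log pi) = 0.5,
# which is below the target entropy H_0 = 1.0, so alpha should increase.
sample_log_probs = [-0.5, -0.5]
alpha_before = 0.2
alpha_after = update_alpha(alpha_before, sample_log_probs, target_entropy=1.0)
```

When the estimated entropy falls below the target $\mathcal{H}_0$, the gradient is negative and the step raises $\alpha$, which weights the entropy term more heavily; when the entropy exceeds the target, $\alpha$ shrinks.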
## Hands-on: the SAC algorithm
