1 parent 1d34ba6 commit 4c1e778
README.md
@@ -51,12 +51,15 @@ WORK IN PROGRESS!!
  * Trust Region Policy Optimization (TRPO)
  * Proximal Policy Optimization (PPO)
  * Actor-Critic with Experience Replay (ACER)
+ * Actor-Critic using Kronecker-Factored Trust Region (ACKTR)
  * Deep Deterministic Policy Gradient (DDPG)
    - DDPG with Hindsight Experience Replay (HER)
  * Twin Delayed Deep Deterministic (TD3)
  * Soft Actor Critic (SAC)
    - SAC Discrete
-
-Model-Based Algorithms
+8. Multi-Agent Algorithms
+ * Multi-Agent DDPG (MADDPG)
+ * Multi-Agent TD3
+ * Multi-Agent SAC
+9. Model-Based Algorithms
  * TODO