Releases: thu-ml/tianshou
0.2.2
Algorithm Implementation
- Generalized Advantage Estimation (GAE);
- Update the PPO algorithm following arXiv:1811.02553 and arXiv:1912.09729;
- Vanilla Imitation Learning (BC & DA, with continuous/discrete action space);
- Prioritized DQN;
- RNN-style policy network;
- Fix SAC with torch==1.5.0
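As a rough illustration of the Generalized Advantage Estimation item above, the following is a minimal pure-Python sketch of the standard GAE recursion. It is not the code shipped in this release; the function name `gae_advantages` and all of its parameters are hypothetical and chosen only for this example.

```python
from typing import List, Sequence


def gae_advantages(
    rewards: Sequence[float],
    values: Sequence[float],
    last_value: float,
    dones: Sequence[bool],
    gamma: float = 0.99,
    gae_lambda: float = 0.95,
) -> List[float]:
    """Compute GAE advantages for one rollout (hypothetical helper).

    values[t] is the critic's estimate for step t, and last_value is the
    estimate for the state that follows the final step of the rollout.
    """
    advantages = [0.0] * len(rewards)
    next_value = last_value
    running = 0.0
    # Walk backwards so each step can reuse the advantage of the next step.
    for t in reversed(range(len(rewards))):
        # A terminal step must not bootstrap from the following state.
        mask = 0.0 if dones[t] else 1.0
        # One-step TD error: r_t + gamma * V(s_{t+1}) - V(s_t).
        delta = rewards[t] + gamma * next_value * mask - values[t]
        # Exponentially weighted sum of TD errors with decay gamma * lambda.
        running = delta + gamma * gae_lambda * mask * running
        advantages[t] = running
        next_value = values[t]
    return advantages
```

Setting `gae_lambda=0` reduces the estimate to the one-step TD error, while `gae_lambda=1` recovers the discounted Monte Carlo return minus the value baseline.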
API change
- Change __call__ to forward in policy;
- Add save_fn in trainer;
- Add __repr__ in tianshou.data, e.g. print(buffer)
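The three API changes can be pictured with a toy sketch. Everything here (`ToyPolicy`, `ToyBuffer`, `toy_trainer`, and the exact way `save_fn` is invoked) is hypothetical and only mirrors the ideas named in the list; it assumes the policy follows the `torch.nn.Module` convention of dispatching calls to `forward`, and it is not tianshou's actual implementation.

```python
from typing import Callable, List, Optional, Sequence


class ToyPolicy:
    """Hypothetical policy: the decision logic lives in forward()."""

    def forward(self, obs: float) -> int:
        # Toy decision rule: act 1 for positive observations, else 0.
        return 1 if obs > 0 else 0

    def __call__(self, obs: float) -> int:
        # Assumption: mimic torch.nn.Module, where calling dispatches to forward().
        return self.forward(obs)


class ToyBuffer:
    """Hypothetical buffer with a readable __repr__ so print(buffer) is useful."""

    def __init__(self) -> None:
        self.obs: List[float] = []

    def add(self, obs: float) -> None:
        self.obs.append(obs)

    def __repr__(self) -> str:
        return f"ToyBuffer(size={len(self.obs)}, obs={self.obs})"


def toy_trainer(
    epoch_rewards: Sequence[float],
    policy: ToyPolicy,
    save_fn: Optional[Callable[[ToyPolicy], None]] = None,
) -> float:
    """Assumed behaviour: call save_fn(policy) whenever a new best reward appears."""
    best = float("-inf")
    for reward in epoch_rewards:
        if reward > best:
            best = reward
            if save_fn is not None:
                save_fn(policy)
    return best
```

In this sketch, `print(buffer)` shows the stored contents instead of a default object address, and passing a `save_fn` lets the caller decide how and where a checkpoint is written.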
0.2.1
First version with full documentation.
Supported algorithms: DQN/VPG/A2C/DDPG/PPO/TD3/SAC