
v0.0.5

Released by @github-actions on 16 Apr 10:41

Environment

  1. MemoryEnv (#197)
  2. MountainCar (#181)

Algorithm

  1. Gumbel AlphaZero in ctree (#212)

Enhancement

  1. add eval_offline option (#188)
  2. save the updated searched policy and value to the buffer during reanalyze (#190)
  3. add MuZero visualization (#181)
  4. add EfficientZero TicTacToe configs (#204)
  5. add 2 MCTS-related ICLR 2024 papers
  6. add option to load a pretrained model in test_game_segment (#194)
  7. polish _forward_learn() and some data processing operations (#191)

Fix

  1. fix sync_gradients and log in DDP settings (#200)
  2. fix channel_last bug
  3. fix total_episode_count bug in collector
  4. fix memory_lightzero_env return bug
  5. fix obs_max_scale bug in memory_env

Style

  1. add ZeroPal and Discord link (#209)
  2. add unit test for game_buffer_muzero (#186)
  3. add customization documentation section to the README

Full Changelog: v0.0.4...v0.0.5

Contributors: @karroyan @HarryXuancy @nighood @puyuan1996