v0.0.3

puyuan1996 released this 07 Dec 08:27

· 88 commits to main since this release

3cb7fff

Env

MiniGrid env (#110)
Bsuite env (#110)
GoBigger env (#39)

Algorithm

Sampled AlphaZero (#141)
MuZero+RND (#110)
Multi-Agent MuZero/EfficientZero (#39)

Enhancement

add ctree version of mcts in alphazero (#142)
upgrade the dependency on gym with gymnasium (#150)
add agent class to support LightZero's HuggingFace Model Zoo (#163)
add recent MCTS-related papers in readme (#159)
add muzero config for connect4 (#107)
add CONTRIBUTING.md (#119)
add .gitpod.yml and .gitpod.Dockerfile (#123)
add contributors subsection in README (#132)
add CODE_OF_CONDUCT.md (#127)
polish comments and render_eval configs for various common envs (#154) (#161)
polish action_type and env_type, fix test.yml, fix unittest (#160)
update env and algo tutorial doc (#106)
polish gomoku env (#141)
add random_policy support for continuous env (#118)
polish simulation method of ptree_az (#120)
polish comments of game_segment_to_array

Fix

fix render method for various common envs (#154) (#161)
fix gumbel muzero collector bug, fix gumbel typo (#144)
fix assert bug in game_segment.py (#138)
fix visit_count_distributions name in muzero_evaluator
fix mcts and alphabeta bot unittest (#120)
fix typos in ptree_mz.py (#113)
fix root_sampled_actions_tmp shape bug in sez ptree
fix policy utils unittest
fix typo in readme and add a 'back to top' button in readme (#104) (#109) (#111)

Style

add NeurIPS 2023 paper link

News

NeurIPS 2023 Spotlight: LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios

Full Changelog: v0.0.2...v0.0.3

Contributors: @PaParaZz1 @karroyan @nighood @jayyoung0802 @timothijoe @TuTuHuss @HarryXuancy @puyuan1996 @HansBug @mohitd404 @@PentesterPriyanshu @0Armaan025 @prajjwalyd @suravshresth @sohamtembhurne @eltociear

Contributors

HansBug, eltociear, and 14 other contributors

Assets 2