Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sampled Muzero with the EfficientZero and the stochastic muzero to introduce random events into the continuous action space #308

Open
LiuDongyang39 opened this issue Dec 16, 2024 · 1 comment
Labels
discussion Discussion of a typical issue or concept

Comments

@LiuDongyang39
Copy link

Based on how you combined Sampled Muzero with EfficientZero, my latest idea is to combine Sampled Muzero with the EfficientZero and the stochastic muzero to introduce random events into the continuous action space. My question is is it feasible? Looking forward to your answer. Thank you very much!

@puyuan1996 puyuan1996 added the discussion Discussion of a typical issue or concept label Dec 18, 2024
@puyuan1996
Copy link
Collaborator

Based on my analysis, this approach is highly feasible. Expanding the action space (Sampled MuZero), addressing stochasticity (Stochastic MuZero), and enhancing sample efficiency (EfficientZero) represent orthogonal improvements that can be seamlessly integrated. By combining these advancements, the approach is expected to support the modeling of continuous actions and stochastic events while maintaining strong learning efficiency. We are eagerly looking forward to seeing the corresponding implementation and would like to express our sincere gratitude for your valuable contributions. If you have any questions, please feel free to continue the discussion here at any time!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
discussion Discussion of a typical issue or concept
Projects
None yet
Development

No branches or pull requests

2 participants