Sampled MuZero with EfficientZero and Stochastic MuZero to introduce random events into the continuous action space #308

LiuDongyang39 · 2024-12-16T07:30:37Z

Based on how you combined Sampled MuZero with EfficientZero, my latest idea is to combine Sampled MuZero with both EfficientZero and Stochastic MuZero, in order to introduce random events into a continuous action space. Is this feasible? Looking forward to your answer. Thank you very much!

puyuan1996 · 2024-12-18T14:53:17Z

Based on my analysis, this approach is feasible. Handling continuous action spaces by sampling actions (Sampled MuZero), modeling stochastic transitions (Stochastic MuZero), and improving sample efficiency (EfficientZero) are largely orthogonal improvements, so they can be integrated into one algorithm. The combined method is expected to model continuous actions and stochastic events while maintaining strong learning efficiency. We look forward to seeing the corresponding implementation, and thank you for your contribution. If you have any questions, please feel free to continue the discussion here at any time!
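To make the combination more concrete, here is a minimal sketch of how the search tree could be laid out, assuming decision nodes that hold K actions sampled from a Gaussian policy (the Sampled MuZero idea) alternating with chance nodes over a small discrete set of outcomes (the Stochastic MuZero idea). It is an illustration only and is not taken from any existing implementation: `DecisionNode`, `ChanceNode`, `sample_actions`, and `expand` are hypothetical names, and the fixed Gaussian parameters and outcome probabilities stand in for what learned networks would predict. EfficientZero's additions mainly affect the training losses rather than the tree shape, so they are not shown.

```python
import random
from dataclasses import dataclass, field


@dataclass
class DecisionNode:
    """Node where the agent acts; holds K actions sampled from the policy."""
    sampled_actions: list
    children: dict = field(default_factory=dict)  # action index -> ChanceNode


@dataclass
class ChanceNode:
    """Afterstate node; the environment picks one discrete chance outcome."""
    outcome_probs: list
    children: dict = field(default_factory=dict)  # outcome index -> DecisionNode


def sample_actions(mean, std, k, rng):
    """Draw K continuous actions from a Gaussian policy, clipped to [-1, 1]."""
    return [max(-1.0, min(1.0, rng.gauss(mean, std))) for _ in range(k)]


def expand(node, action_idx, outcome_probs, k, rng):
    """Follow one sampled action, then sample one chance outcome below it."""
    chance = node.children.setdefault(action_idx, ChanceNode(list(outcome_probs)))
    outcome = rng.choices(range(len(outcome_probs)), weights=outcome_probs, k=1)[0]
    if outcome not in chance.children:
        # Assumption: a real model would predict mean/std from the latent state;
        # fixed values are used here purely as placeholders.
        chance.children[outcome] = DecisionNode(sample_actions(0.0, 1.0, k, rng))
    return chance, outcome


rng = random.Random(0)
root = DecisionNode(sample_actions(mean=0.0, std=0.5, k=4, rng=rng))
chance, outcome = expand(root, action_idx=2, outcome_probs=[0.7, 0.2, 0.1], k=4, rng=rng)
print(len(root.sampled_actions), outcome in chance.children)
```

In this layout the branching factor at decision nodes is bounded by K regardless of the action dimension, and the randomness of the environment is confined to the chance nodes.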

puyuan1996 added the "discussion" label (Discussion of a typical issue or concept) on Dec 18, 2024
