feature(yzj): add ptz ctde pipeline #149

jayyoung0802 · 2023-11-23T15:29:48Z

No description provided.

puyuan1996 · 2023-11-26T09:30:25Z

lzero/model/muzero_model_mlp.py

-
-        next_latent_state, reward = self.dynamics_network(state_action_encoding)
+        agent_state_action_encoding = torch.cat((agent_latent_state, action_encoding), dim=1)
+        global_state_action_encoding = torch.cat((agent_latent_state, global_latent_state, action_encoding), dim=1)


global_state_action_encoding 有必要把agent_latent_state也拼接进去吗？

拼接进去后，action_encoding只占了5/(256*2+5),信息密度是否过低呢

这个需要测试一下

s‘，s1‘， s2’， s3'， r =(s,s1,s2,s3,a1,a2,a3) 用一个网络建模联合dynamic function，需要同时考虑team中每个agent的信息。

collect按照team存储data。

foward_learn中需要更改数据处理流程。unroll 5步，是整个team同时roll 5步。

foward_learn 中reward的处理。

global_state_dynamic的输入只有一个agent action, 没有joint action，是不合理的

puyuan1996 · 2023-11-26T10:03:13Z

lzero/policy/muzero.py

            policy_logits = policy_logits.detach().cpu().numpy().tolist()
-
-            legal_actions = [[i for i, x in enumerate(action_mask[j]) if x == 1] for j in range(active_collect_env_num)]
+            reward_roots = [[reward_root]*self.cfg.model.agent_num for reward_root in reward_roots]


这里reward_roots就是一个长度为24的list，为什么要按照这里的方式变换呢？24=8*3,按理讲，应该每3个对应的reward都是同一个team_reward才对？

这个地方就是变成每3个智能体用同一个reward去搜索

puyuan1996 · 2023-11-26T10:13:21Z

zoo/petting_zoo/config/ptz_simple_spread_mz_config.py

@@ -0,0 +1,116 @@
+from easydict import EasyDict


mz simple_spread有这个报错，你们那边是正常运行的吗？pettingzoo是1.22.3版吗

ez 改的位置https://github.com/jayyoung0802/LightZero/blob/dev-ctde2-835/lzero/mcts/buffer/game_buffer_efficientzero.py#L110
mz改的位置 https://github.com/jayyoung0802/LightZero/blob/dev-ctde2-835/lzero/mcts/buffer/game_buffer_muzero.py#L208
这两个地方要改一下，zero_obs的维度

jayyoung0802 and others added 9 commits October 14, 2023 19:47

feature(yzj): add ptz with lightzero main

d333aaf

fix(yzj): fix data device on mz policy

1bd05f0

feature(yzj): adapt ptz simple env

1b4ff2b

fix(yzj): fix visualization

60f0832

fix(yzj): fix combinations

6f80173

feature(yzj): add ptz simple env

08ef2ba

feature(yzj): ptz simple mz cfg is ready and add ptz simple ez cfg

0e6dfd3

fix(yzj): fix ptz simple ez eval muzero

c323a44

feature(yzj): polish ctde2-(8,3,5)

6ea3f9b

jayyoung0802 mentioned this pull request Nov 23, 2023

feature(yzj): add ptz ctde pipeline #108

Closed

puyuan1996 reviewed Nov 26, 2023

View reviewed changes

puyuan1996 added environment New or improved environment algorithm New algorithm discussion Discussion of a typical issue or concept labels Nov 26, 2023

puyuan1996 reviewed Nov 26, 2023

View reviewed changes

chosenone added 2 commits November 26, 2023 21:59

polish(yzj): polish reward_roots in mz

59c7c56

fix(yzj): fix device bug

829d86d

jayyoung0802 force-pushed the dev-ctde2-835 branch from 8d71f96 to 829d86d Compare November 26, 2023 14:03

feature(yzj): polish buffer in ctde

0368c55

puyuan1996 mentioned this pull request Mar 15, 2024

Question: How can I set up a custom environment? #198

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feature(yzj): add ptz ctde pipeline #149

feature(yzj): add ptz ctde pipeline #149

jayyoung0802 commented Nov 23, 2023

puyuan1996 Nov 26, 2023

jayyoung0802 Nov 26, 2023

puyuan1996 Nov 26, 2023 •

edited

Loading

puyuan1996 Nov 29, 2023

puyuan1996 Nov 26, 2023 •

edited

Loading

jayyoung0802 Nov 26, 2023

puyuan1996 Nov 26, 2023

jayyoung0802 Nov 26, 2023

feature(yzj): add ptz ctde pipeline #149

Are you sure you want to change the base?

feature(yzj): add ptz ctde pipeline #149

Conversation

jayyoung0802 commented Nov 23, 2023

puyuan1996 Nov 26, 2023

Choose a reason for hiding this comment

jayyoung0802 Nov 26, 2023

Choose a reason for hiding this comment

puyuan1996 Nov 26, 2023 • edited Loading

Choose a reason for hiding this comment

puyuan1996 Nov 29, 2023

Choose a reason for hiding this comment

puyuan1996 Nov 26, 2023 • edited Loading

Choose a reason for hiding this comment

jayyoung0802 Nov 26, 2023

Choose a reason for hiding this comment

puyuan1996 Nov 26, 2023

Choose a reason for hiding this comment

jayyoung0802 Nov 26, 2023

Choose a reason for hiding this comment

puyuan1996 Nov 26, 2023 •

edited

Loading

puyuan1996 Nov 26, 2023 •

edited

Loading