Skip to content

FulChou/RL_IoT_distillation

Repository files navigation

RL_IoT_distillation:

实验记录:

12.06: 复现初步部分 policy distillation思想的论文,剩下三个todo,后续完善。正在跑自己的distillation,然后第二台服务器跑vanilla DQN 今天对比了一下 test 100 比 30 大概与环境 collect 1M次,慢了50min,接近一个h。

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published