Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Assistance] OS task retrun infos #173

Open
xiaxiaxiatengxi opened this issue Nov 19, 2024 · 1 comment
Open

[Assistance] OS task retrun infos #173

xiaxiaxiatengxi opened this issue Nov 19, 2024 · 1 comment
Labels
bug Something isn't working help wanted Extra attention is needed

Comments

@xiaxiaxiatengxi
Copy link

Describe the bug
我看我们的OS环境目前是以用docker的方式把环境进行打包,之后利用Python的Docker.py和os环境交互。
想问一下,我们的OS环境,有相关的操作说明吗?
就类似RL的环境
每一步都有哪些可执行操作、环境会有的反馈以及每一次的reward,我想单独把OS交互的代码拿出来,之后适配到我们自己的任务里……

@xiaxiaxiatengxi xiaxiaxiatengxi added bug Something isn't working help wanted Extra attention is needed labels Nov 19, 2024
@zhc7
Copy link
Collaborator

zhc7 commented Dec 6, 2024

我们没有额外规定动作空间,模型相当于就是对着一个bash想输入什么输入什么,bash返回了什么我们就给模型看什么,因此也没有每一个步骤的reward,只有最后checking script返回对错与否可以当做最终的reward。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

2 participants