Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

奖励模型能否不是一个model,而是一个自己定义的函数 #6423

Open
1 task done
cdhx opened this issue Dec 23, 2024 · 0 comments
Open
1 task done

奖励模型能否不是一个model,而是一个自己定义的函数 #6423

cdhx opened this issue Dec 23, 2024 · 0 comments
Labels
pending This problem is yet to be addressed

Comments

@cdhx
Copy link

cdhx commented Dec 23, 2024

Reminder

  • I have read the README and searched the existing issues.

System Info

Reproduction

Expected behavior

Others

请问奖励模型能否不是一个model,而是一个自己定义的函数,比如规则,或者直接调用gpt这样,如果可以的话要修改哪些部分

感谢~

@github-actions github-actions bot added the pending This problem is yet to be addressed label Dec 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
pending This problem is yet to be addressed
Projects
None yet
Development

No branches or pull requests

1 participant