Have you thought about generating the SFT data for Deepseek-Coder-V2? #13

Open
tangzhy opened this issue Jul 17, 2024 · 2 comments

tangzhy commented Jul 17, 2024

Deepseek-Coder-V2 has exceptional capabilities in coding and mathematics, and in both domains the accuracy of its outputs can be verified automatically from the final results. It would be quite persuasive if your method could match their reported performance.
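To illustrate what I mean by verifying from the final result, here is a minimal sketch for the math case. It assumes each sample is a dict with hypothetical `response` and `reference` keys, and that the final answer is the last number in the response; none of these names come from this repo, and real answer extraction would need to be more careful.

```python
import re
from fractions import Fraction

# Matches integers and decimals such as "4", "-3", "0.50" (illustrative only).
NUMBER_PATTERN = re.compile(r"-?\d+(?:\.\d+)?")


def extract_final_answer(response):
    """Return the last number found in the response as a Fraction, or None."""
    matches = NUMBER_PATTERN.findall(response)
    if not matches:
        return None
    return Fraction(matches[-1])


def is_correct(response, reference):
    """Compare the extracted final answer against the reference exactly."""
    answer = extract_final_answer(response)
    return answer is not None and answer == Fraction(reference)


def filter_verified(samples):
    """Keep only samples whose final answer matches the reference."""
    return [s for s in samples if is_correct(s["response"], s["reference"])]


samples = [
    {"response": "2 + 2 = 4, so the answer is 4.", "reference": "4"},
    {"response": "The answer is 5.", "reference": "4"},
    {"response": "I am not sure.", "reference": "4"},
    {"response": "Half of 1 is 0.50", "reference": "1/2"},
]
verified = filter_verified(samples)
```

For code, the analogous check would be running the generated solution against unit tests instead of comparing numbers.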

Member

zhangchen-xu commented Jul 17, 2024

Hi,
Thanks for the suggestion! I have already put the run scripts for Deepseek-Coder-V2 here, and they work. However, most of our computing resources are currently devoted to the Gemma 2 series, which has no copyright claim, so we are not extracting Deepseek data for now.

@zhangchen-xu
Member

We also discussed plans for extracting Deepseek-Coder-V2 data in our internal meeting. Once Gemma 2 is done, we may come back to it!
