Does standard Llama 3 work as-is, or does a hypernetwork need to be trained for it?
I am in the process of training a hypernetwork for Llama3!
Nice ❤️ Could you also share some info about the training, such as how long it takes? I tried to train one myself, but most notebooks don't support Python 3.11.
Here is the first version of a Llama3 hypernet: benjamin/zett-hypernetwork-Meta-Llama-3-8B-experimental.
It seems to underperform on code, though. I haven't found the reason yet, but I will look into it later, so I'm keeping this issue open.
Training took ~4 days on a TPUv4-32 pod.