Llama3 #1

zf0x00 · 2024-05-14T12:31:15Z

Normal Llama 3 can work or need to train hypernetwork

bminixhofer · 2024-05-15T09:16:19Z

I am in the process of training a hypernetwork for Llama3!

zf0x00 · 2024-05-15T18:31:52Z

nice ❤️
also can share info about training how much time it takes and i tried to train but most notebook doesn't support python 3.11

bminixhofer · 2024-05-20T16:04:49Z

It seems to underperform on Code though. I haven't yet found the reason why but will look into this later, so keeping this open.

Training took ~4 days on a TPUv4-32 pod.

Provide feedback