4080 GPU can barely handle any data; training errors out once the dataset exceeds 10,000 samples #54
Comments
Traceback (most recent call last):
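The docs mention 16 GB of RAM and 4 GB of VRAM. Has anyone actually run it successfully with that? Looking for answers.
With 3,000 samples, 24 GB of VRAM is maxed out right away.
After reading the docs, I trained with train.py and set batch_size_per_gpu to 1. Memory usage did indeed drop a lot.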
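To illustrate why lowering batch_size_per_gpu helps, here is a rough back-of-the-envelope sketch. It is not taken from this project: the function name, the model dimensions, and the `activations_per_token_per_layer` factor are all hypothetical, and it assumes fp32 training with an Adam-style optimizer (weights, gradients, and two moment buffers). It also ignores attention-score memory, which grows with the square of the sequence length. The only point it makes is that the static part does not depend on batch size, while the activation part grows linearly with it.

```python
def estimate_training_memory_gib(
    n_params,
    batch_size,
    seq_len,
    d_model,
    n_layers,
    bytes_per_value=4,                   # assumption: fp32 everywhere
    activations_per_token_per_layer=16,  # hypothetical fudge factor, not measured
):
    """Return (static_gib, activation_gib) as a very rough estimate."""
    # Static cost: weights + gradients + two Adam moment buffers,
    # i.e. four values stored per parameter. Independent of batch size.
    static_bytes = 4 * n_params * bytes_per_value

    # Activations saved for the backward pass: grows linearly with
    # batch size and sequence length (attention scores are ignored here).
    activation_bytes = (
        batch_size
        * seq_len
        * d_model
        * n_layers
        * activations_per_token_per_layer
        * bytes_per_value
    )

    gib = 2 ** 30
    return static_bytes / gib, activation_bytes / gib


# Hypothetical model: 200M parameters, 12 layers, hidden size 768, 512 tokens.
for bs in (1, 4, 16):
    static_gib, act_gib = estimate_training_memory_gib(
        n_params=200_000_000, batch_size=bs, seq_len=512, d_model=768, n_layers=12
    )
    print(f"batch_size={bs}: static ~{static_gib:.2f} GiB, activations ~{act_gib:.2f} GiB")
```

If the numbers printed for a given setup are far below what the GPU actually reports, the extra usage is probably coming from somewhere else, such as padding every sample to a very long maximum length.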
It works on my side: the entire SFT run was done on a GPU with 16 GB of VRAM. If VRAM usage looks abnormal, check the following:
I have already shrunk the config file:
`class T5ModelConfig:
`
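The vocabulary is also only 10,000 tokens. The Baidu Baike dataset has millions of entries, but I can only train on a few thousand of them; any more and it errors out.
Machine specs:
GPU: 4080 (12 GB VRAM)
RAM: 32 GB
CPU: i9-14900HX (24 cores, 32 threads)
Is this setup not good enough to train a large model?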