Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Only save necessary parameters #32

Open
wants to merge 1 commit into
base: main
Choose a base branch
from
Open

Conversation

Line290
Copy link

@Line290 Line290 commented Dec 21, 2020

@Line290
Copy link
Author

Line290 commented Dec 21, 2020

感觉qkv那边应该分的不对,有些feedforward的bias也没有分,
我之前用的脚本:https://github.com/Line290/CPM-ckpts-conversion

@zzy14
Copy link
Contributor

zzy14 commented Dec 25, 2020

感谢!我才看到,昨天我自己修了🤣

@Line290
Copy link
Author

Line290 commented Dec 28, 2020

没事哈,对了,还有一点就是最上面说的,torch tensor slice不会创建新的tensor,所以会把slice之前的整个tensor存下,会在磁盘上多存一些冗余的参数,不过不影响程序运行。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants