Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Deep Compression Vector Quantize AutoEncoder? #163

Open
markson14 opened this issue Dec 11, 2024 · 2 comments
Open

Deep Compression Vector Quantize AutoEncoder? #163

markson14 opened this issue Dec 11, 2024 · 2 comments

Comments

@markson14
Copy link

It's a very impressive job! Well done.

I am wondering if you have conducted any further experiments on vector quantization. The DCAE-f128 can compress a 256x256 image into a 2x2 feature map, resulting in 4 tokens with VQ. This could lead to significant acceleration in LLM training and inference, paving the way for real-time video generation. Feel free to ask if you need any more adjustments!

@han-cai
Copy link
Contributor

han-cai commented Dec 11, 2024

Thanks for your interest in our work! VQ is one direction we are working on. We will push our updates to this repo.

@markson14
Copy link
Author

Thanks for your interest in our work! VQ is one direction we are working on. We will push our updates to this repo.

That's wonderful! We are trying to train the DCAE with our own dataset. I wonder how many epochs we should use and what learning rate is recommended, as these details were not mentioned in the 4.1 implementation details.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants