Hi,
I was looking at the current implementation and noticed that before every generation you pass all reference images through the VAE as one batch. Beyond a certain number of reference images, I believe that would require a huge amount of VRAM.
Wouldn't it be better to compute the latents for each selected image beforehand, store them either in RAM or temporarily on disk, and then load them at generation time?
That way you avoid a large batch in the VAE, and you compute the latents only once per reference image instead of once per generation.
What do you think?
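A minimal sketch of the idea, assuming the VAE encode step is wrapped in some callable (the names `LatentCache` and `encode_fn` are hypothetical, not part of fabric; the real code would pass each image through the pipeline's VAE one at a time instead of as a batch):

```python
class LatentCache:
    """Compute each reference image's latent once and reuse it.

    encode_fn is whatever produces a latent from a single image,
    e.g. a wrapper around vae.encode(...) in the real pipeline
    (hypothetical here).
    """

    def __init__(self, encode_fn):
        self.encode_fn = encode_fn
        self._cache = {}  # image_id -> latent; could also be spilled to disk

    def get(self, image_id, image):
        # Encode only on first sight of this image; subsequent
        # generations reuse the stored latent, so the VAE never
        # sees the whole reference set as one batch.
        if image_id not in self._cache:
            self._cache[image_id] = self.encode_fn(image)
        return self._cache[image_id]
```

Usage: call `cache.get(path, image)` for each selected reference before a generation; only newly added references hit the VAE.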
https://github.com/sd-fabric/fabric/blob/caaa5831bacefb060d46168372b45e3bac84a3ae/fabric/generator.py#L357C1-L373C14