-
Notifications
You must be signed in to change notification settings - Fork 2.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
proposed cleanups/refactoring #206
Comments
Nice ideas all. While you're in there (so to speak), it'd be great to have something in the standard PNG "Description" metadata field (possibly at least part of the prompt, but yeah prompts can get pretty long). Given that it's a standard field, it would allow other software (e.g. automated gallery constructors) to have something to generate captions from. |
Good idea. Seems like the prompt is an obvious thing to put there (possibly with something like "generated by Stable Diffusion"). |
TIL; basujindal/stable-diffusion#65 suggests that PNG metadata is EXIF-compliant. If so, then that actually expands the use cases - there are LOTS of scenarios involving automation and EXIF data. |
Why are these things never straightforward... https://stackoverflow.com/a/62203081/5074550 indicates that
To my eye, the most portable/compatible scheme would be storing XMP docs in text chunks. At least then some software (like presumably Lightroom) would be able to read them. |
My fork of the old compviz repo writes Exif data to the PNG output. I do this because, as a mac user, I can view the exif data within Finder. And since this Exif data is written to the device model field, it can be displayed in Finder: I'm mentioning this here in case the code to do this is useful for this repo.
|
Since you guys are talking about refactoring and such, I thought I'd mention this here. Macs initially couldn't get consistent images even with the same seed. Some people figured it out (in this issues) but I am pretty sure a side effect of the "fix" is to change all cuda user seed results (I haven't tested it). Because of this I can't imagine that people will want to make this change. However, there's a 1.5 checkpoint model that's currently being beta tested on the Stable-Diffusion Discord. If I understand correctly, when this is released all of the seed results from the 1.4 model will change. It seems like this is the best time to make a change like this. |
I've been messing around with the code for a while now, and there's a few pain points I'd like to address. Before I do a bunch of work on this I want to get confirmation that these changes would be merged:
These need to be done in order - it will be a lot easier to refactor pngwriter after removing batch_size, for example.
I'd really like to work on getting interpolation working, but for that code to be of acceptable quality I really need to make the above changes first. (I actually started doing it and then kept getting distracted by all of the above stuff.)
@lstein if I send a PR with all of these changes, would you be up for merging them? I can split it into seperate PRs if necessary, but if each one needs to get through review in order it's going to take a lot longer.
The text was updated successfully, but these errors were encountered: