[core] ConsisID #10140

SHYuanBest · 2024-12-06T08:55:10Z

What does this PR do?

Add support for ConsisID (#10100)

Paper: https://arxiv.org/abs/2411.17440
Project: https://pku-yuangroup.github.io/ConsisID
Code: https://github.com/PKU-YuanGroup/ConsisID
Demo: https://huggingface.co/spaces/BestWishYsh/ConsisID-preview-Space

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

SHYuanBest · 2024-12-06T09:00:07Z

@a-r-r-o-w Do we need to create a ConsisID branch on huggingface, or can I just use SHYuanBest:main?

a-r-r-o-w · 2024-12-06T09:02:01Z

SHYuanBest:main works. This is just a branch from your diffusers fork to HF diffusers library, so you are free to make any changes you'd like here. Looking forward to the ConsisID changes!

SHYuanBest · 2024-12-10T07:38:04Z

@a-r-r-o-w @HuggingFaceDocBuilderDev Hi, I have added ConsisID to this branch. Could you help us review the code? Is there anything else I missed?

import torch
from diffusers import ConsisIDPipeline
from diffusers.pipelines.consisid.consisid_utils import prepare_face_models, process_face_embeddings_infer
from diffusers.utils import export_to_video
from huggingface_hub import snapshot_download

# Download the ConsisID-preview checkpoint to a local directory.
snapshot_download(repo_id="BestWishYsh/ConsisID-preview", local_dir="BestWishYsh/ConsisID-preview")

# Load the face models used to extract identity features from the reference image.
face_helper_1, face_helper_2, face_clip_model, face_main_model, eva_transform_mean, eva_transform_std = prepare_face_models("BestWishYsh/ConsisID-preview", device="cuda", dtype=torch.bfloat16)

# Load the video generation pipeline in bfloat16 and move it to the GPU.
pipe = ConsisIDPipeline.from_pretrained("BestWishYsh/ConsisID-preview", torch_dtype=torch.bfloat16)
pipe.to("cuda")

prompt = "The video captures a boy walking along a city street, filmed in black and white on a classic 35mm camera. His expression is thoughtful, his brow slightly furrowed as if he's lost in contemplation. The film grain adds a textured, timeless quality to the image, evoking a sense of nostalgia. Around him, the cityscape is filled with vintage buildings, cobblestone sidewalks, and softly blurred figures passing by, their outlines faint and indistinct. Streetlights cast a gentle glow, while shadows play across the boy's path, adding depth to the scene. The lighting highlights the boy's subtle smile, hinting at a fleeting moment of curiosity. The overall cinematic atmosphere, complete with classic film still aesthetics and dramatic contrasts, gives the scene an evocative and introspective feel."
image = "https://github.com/PKU-YuanGroup/ConsisID/blob/main/asserts/example_images/2.png?raw=true"

# Extract the identity embeddings and face keypoints; `image` is replaced by the processed reference image.
id_cond, id_vit_hidden, image, face_kps = process_face_embeddings_infer(face_helper_1, face_clip_model, face_helper_2, eva_transform_mean, eva_transform_std, face_main_model, "cuda", torch.bfloat16, image, is_align_face=True)

# Generate the video with a fixed seed and save it at 8 frames per second.
video = pipe(image=image, prompt=prompt, use_dynamic_cfg=False, id_vit_hidden=id_vit_hidden, id_cond=id_cond, kps_cond=face_kps, generator=torch.Generator("cuda").manual_seed(42))
export_to_video(video.frames[0], "output.mp4", fps=8)

HuggingFaceDocBuilderDev · 2024-12-10T08:08:07Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

src/diffusers/pipelines/consisid/pipeline_consisid.py

Co-authored-by: hlky <[email protected]>

SHYuanBest · 2024-12-11T02:27:00Z

@a-r-r-o-w @hlky hi, what should I do next?

hlky · 2024-12-18T14:05:15Z

Thanks @SHYuanBest. Please wait for further review from @yiyixuxu

a-r-r-o-w · 2024-12-18T20:31:53Z

Thanks for working on this @SHYuanBest! The PR looks mostly good to me. There are some things I would like to test and maybe change. Looking into it now. I hope it is okay if I push to this branch directly.

docs/source/en/api/pipelines/consisid.md

a-r-r-o-w · 2024-12-18T21:56:24Z

docs/source/en/using-diffusers/consisid.md

+video = pipe(image=image, prompt=prompt, num_inference_steps=50, guidance_scale=6.0, use_dynamic_cfg=False, id_vit_hidden=id_vit_hidden, id_cond=id_cond, kps_cond=face_kps, generator=torch.Generator("cuda").manual_seed(42))
+export_to_video(video.frames[0], "output.mp4", fps=8)
+```
+<table>


Any results being demonstrated should be linked from the huggingface documentation-images repository on HF Hub: https://huggingface.co/datasets/huggingface/documentation-images/tree/main/diffusers

If you could open a PR there, I can merge it and then that could be linked here.

SHYuanBest · 2024-12-19

Sure, the PR is here: https://huggingface.co/datasets/huggingface/documentation-images/discussions/406

SHYuanBest · 2024-12-20

@a-r-r-o-w @sayakpaul I have revised the PR, could you help merge it?
https://huggingface.co/datasets/huggingface/documentation-images/discussions/406

a-r-r-o-w · 2024-12-18T22:01:08Z

src/diffusers/models/transformers/consisid_transformer_3d.py

@@ -0,0 +1,845 @@
+# Copyright 2024 ConsisID Authors and The HuggingFace Team. All rights reserved.


@yiyixuxu So this model is mostly similar to CogVideoX, except that it performs cross attention with the face embeddings. I see some uses of nn.Sequential, which we prefer not to have in Diffusers. Should we convert them and provide a conversion script?
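
For illustration, a conversion script of the kind raised in this comment usually comes down to renaming checkpoint keys, so that weights saved under nn.Sequential index prefixes load into explicitly named submodules. The following is only a minimal sketch: the prefixes in `RENAME_PREFIXES` (`face_proj.0.`, `face_proj.norm.`, and so on) are hypothetical and are not taken from the ConsisID checkpoint, and plain Python values stand in for tensors so that it runs without torch.

```python
# Hypothetical mapping from nn.Sequential index prefixes to named submodules.
# These names are illustrative only; they are NOT taken from the ConsisID checkpoint.
RENAME_PREFIXES = {
    "face_proj.0.": "face_proj.norm.",
    "face_proj.1.": "face_proj.linear.",
}


def convert_state_dict(state_dict):
    """Return a new dict in which Sequential-style key prefixes are renamed."""
    converted = {}
    for key, value in state_dict.items():
        new_key = key
        for old_prefix, new_prefix in RENAME_PREFIXES.items():
            if key.startswith(old_prefix):
                new_key = new_prefix + key[len(old_prefix):]
                break
        # Guard against two old keys collapsing onto the same new key.
        if new_key in converted:
            raise KeyError(f"duplicate key after renaming: {new_key}")
        converted[new_key] = value
    return converted


# Plain integers stand in for weight tensors in this sketch.
old = {
    "face_proj.0.weight": 1,
    "face_proj.1.weight": 2,
    "face_proj.1.bias": 3,
    "other.weight": 4,
}
new = convert_state_dict(old)
print(sorted(new))
```

Keys that match no prefix pass through unchanged, so the same function could be applied to a whole checkpoint rather than to a single submodule.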

tests/models/transformers/test_models_transformer_consisid.py

tests/pipelines/consisid/test_consisid.py

SHYuanBest · 2024-12-22T03:52:06Z

To do:

Make the test scripts very small and make them all pass (model, pipeline, LoRA).
Check whether test_vae_tiling requires expected_max_diff==0.35.
Write a conversion script for the nn.Sequential modules.
Merge https://huggingface.co/datasets/huggingface/documentation-images/discussions/406 and update the doc links.
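
As a rough illustration of the tolerance item above, a check of this kind can be read as comparing the largest element-wise difference between two outputs against a threshold. The sketch below assumes that interpretation; the helper names `max_abs_diff` and `within_tolerance` and all numbers are made up for the example and do not come from the diffusers test suite.

```python
def max_abs_diff(a, b):
    """Largest element-wise absolute difference between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("outputs must have the same length")
    return max(abs(x - y) for x, y in zip(a, b))


def within_tolerance(reference, candidate, expected_max_diff):
    """True if the two outputs differ by at most expected_max_diff everywhere."""
    return max_abs_diff(reference, candidate) <= expected_max_diff


# Made-up numbers standing in for flattened outputs without and with VAE tiling.
without_tiling = [0.10, 0.50, 0.90]
with_tiling = [0.12, 0.48, 0.52]

diff = max_abs_diff(without_tiling, with_tiling)
print(within_tolerance(without_tiling, with_tiling, 0.35))
print(within_tolerance(without_tiling, with_tiling, 0.4))
```

With these illustrative values the largest difference is about 0.38, so the check fails at a threshold of 0.35 and passes at 0.4.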

a-r-r-o-w · 2024-12-23T01:49:23Z

@SHYuanBest Great work on the changes! We will try to integrate this soon and target it for the next diffusers release (we have one this week, which is why we've been very busy). On your end, I think we are mostly good with the changes, and we just need to address some minor concerns for the diffusers-side integration. I will let YiYi comment and do her review first, and then we can tackle the remaining things.

SHYuanBest · 2024-12-23T02:52:52Z

@a-r-r-o-w @yiyixuxu That's great, many thanks for your support! Looking forward to the merge.

Update __init__.py

0036376

SHYuanBest mentioned this pull request Dec 6, 2024

[training] CogVideoX-I2V LoRA #9482

Merged

SHYuanBest and others added 6 commits December 9, 2024 16:20

Merge branch 'huggingface:main' into main

940ec92

add consisid

c78cf01

update consisid

61c85f7

update consisid

12855b2

make style

787a69c

make_style

33d4291

hlky reviewed Dec 10, 2024

SHYuanBest and others added 7 commits December 10, 2024 16:32

Update src/diffusers/pipelines/consisid/pipeline_consisid.py

455d68d

Co-authored-by: hlky <[email protected]>

Update src/diffusers/pipelines/consisid/pipeline_consisid.py

8f310c5

Co-authored-by: hlky <[email protected]>

Update src/diffusers/pipelines/consisid/pipeline_consisid.py

0f447a4

Co-authored-by: hlky <[email protected]>

Update src/diffusers/pipelines/consisid/pipeline_consisid.py

d348901

Co-authored-by: hlky <[email protected]>

Update src/diffusers/pipelines/consisid/pipeline_consisid.py

a35f92a

Co-authored-by: hlky <[email protected]>

Update src/diffusers/pipelines/consisid/pipeline_consisid.py

33f3acb

Co-authored-by: hlky <[email protected]>

add doc

6503a17

SHYuanBest requested a review from hlky December 10, 2024 09:04

SHYuanBest and others added 4 commits December 10, 2024 18:36

Merge branch 'main' into main

a24a4ee

Merge branch 'huggingface:main' into main

19d1fa3

make style

c13fb17

Rename consisid .md to consisid.md

61ad37b

hlky added 4 commits December 11, 2024 08:19

Update geodiff_molecule_conformation.ipynb

3a274ca

Update geodiff_molecule_conformation.ipynb

02c16ba

Update geodiff_molecule_conformation.ipynb

e76338e

Update demo.ipynb

a597713

a-r-r-o-w added 6 commits December 18, 2024 22:41

remove some changes from docs

141038b

refactor

d0fe503

fix

60856c7

undo changes to examples

313c2e3

remove save/load and fuse methods

935319a

update

0f5d677

a-r-r-o-w reviewed Dec 18, 2024

SHYuanBest and others added 8 commits December 19, 2024 21:25

link hf-doc-img & make test extremely small

aa7b0eb

update

aa98858

Merge branch 'huggingface:main' into main

03ebc66

Merge branch 'huggingface:main' into main

c8ba3c0

Merge branch 'huggingface:main' into main

2e15509

add lora

b174d9f

fix test

fbb09aa

Merge branch 'huggingface:main' into main

3b05257

SHYuanBest requested review from stevhliu and a-r-r-o-w December 22, 2024 04:26

SHYuanBest added 2 commits December 22, 2024 19:28

update

5813825

update

7734a29

SHYuanBest and others added 6 commits December 23, 2024 10:55

change expected_diff_max to 0.4

5fd9a81

Merge branch 'huggingface:main' into main

0937753

fix typo

cdc04bf

fix link

0af2f83

fix typo

e17aa82

Merge branch 'main' into main

3b17e2e
