comfyanonymous
fb3b728203
Fix issue where autocast fp32 CLIP gave different results from regular.
1 year ago
comfyanonymous
7d401ed1d0
Add ldm format support to UNETLoader.
1 year ago
comfyanonymous
e85be36bd2
Add a penultimate_hidden_states to the clip vision output.
1 year ago
comfyanonymous
1e6b67101c
Support diffusers format t2i adapters.
1 year ago
comfyanonymous
326577d04c
Allow cancelling of everything with a progress bar.
1 year ago
comfyanonymous
f88f7f413a
Add a ConditioningSetAreaPercentage node.
1 year ago
comfyanonymous
1938f5c5fe
Add a force argument to soft_empty_cache to force a cache empty.
1 year ago
comfyanonymous
7746bdf7b0
Merge branch 'generalize_fixes' of https://github.com/simonlui/ComfyUI
1 year ago
Simon Lui
2da73b7073
Revert changes in comfy/ldm/modules/diffusionmodules/util.py, which is unused.
1 year ago
comfyanonymous
a74c5dbf37
Move some functions to utils.py
1 year ago
Simon Lui
4a0c4ce4ef
Some fixes to generalize CUDA specific functionality to Intel or other GPUs.
1 year ago
comfyanonymous
77a176f9e0
Use common function to reshape batch to.
2 years ago
comfyanonymous
7931ff0fd9
Support SDXL inpaint models.
2 years ago
comfyanonymous
0e3b641172
Remove xformers related print.
2 years ago
comfyanonymous
5c363a9d86
Fix controlnet bug.
2 years ago
comfyanonymous
cfe1c54de8
Fix controlnet issue.
2 years ago
comfyanonymous
1c012d69af
It doesn't make sense for c_crossattn and c_concat to be lists.
2 years ago
comfyanonymous
7e941f9f24
Clean up DiffusersLoader node.
2 years ago
Simon Lui
18617967e5
Fix error message in model_patcher.py
...
Found while tinkering.
2 years ago
comfyanonymous
fe4c07400c
Fix "Load Checkpoint with config" node.
2 years ago
comfyanonymous
f2f5e5dcbb
Support SDXL t2i adapters with 3 channel input.
2 years ago
comfyanonymous
15adc3699f
Move beta_schedule to model_config and allow disabling unet creation.
2 years ago
comfyanonymous
bed116a1f9
Remove optimization that caused border.
2 years ago
comfyanonymous
65cae62c71
No need to check filename extensions to detect shuffle controlnet.
2 years ago
comfyanonymous
4e89b2c25a
Put clip vision outputs on the CPU.
2 years ago
comfyanonymous
a094b45c93
Load clipvision model to GPU for faster performance.
2 years ago
comfyanonymous
1300a1bb4c
Text encoder should initially load on the offload_device not the regular.
2 years ago
comfyanonymous
f92074b84f
Move ModelPatcher to model_patcher.py
2 years ago
comfyanonymous
4798cf5a62
Implement loras with norm keys.
2 years ago
comfyanonymous
b8c7c770d3
Enable bf16-vae by default on ampere and up.
2 years ago
comfyanonymous
1c794a2161
Fallback to slice attention if xformers doesn't support the operation.
2 years ago
comfyanonymous
d935ba50c4
Make --bf16-vae work on torch 2.0
2 years ago
comfyanonymous
a57b0c797b
Fix lowvram model merging.
2 years ago
comfyanonymous
f72780a7e3
The new smart memory management makes this unnecessary.
2 years ago
comfyanonymous
c77f02e1c6
Move controlnet code to comfy/controlnet.py
2 years ago
comfyanonymous
15a7716fa6
Move lora code to comfy/lora.py
2 years ago
comfyanonymous
ec96f6d03a
Move text_projection to base clip model.
2 years ago
comfyanonymous
30eb92c3cb
Code cleanups.
2 years ago
comfyanonymous
51dde87e97
Try to free enough vram for control lora inference.
2 years ago
comfyanonymous
e3d0a9a490
Fix potential issue with text projection matrix multiplication.
2 years ago
comfyanonymous
cc44ade79e
Always shift text encoder to GPU when the device supports fp16.
2 years ago
comfyanonymous
a6ef08a46a
Even with forced fp16 the cpu device should never use it.
2 years ago
comfyanonymous
00c0b2c507
Initialize text encoder to target dtype.
2 years ago
comfyanonymous
f081017c1a
Save memory by storing text encoder weights in fp16 in most situations.
...
Do inference in fp32 to make sure quality stays the exact same.
2 years ago
comfyanonymous
afcb9cb1df
All resolutions now work with t2i adapter for SDXL.
2 years ago
comfyanonymous
85fde89d7f
T2I adapter SDXL.
2 years ago
comfyanonymous
cf5ae46928
Controlnet/t2iadapter cleanup.
2 years ago
comfyanonymous
763b0cf024
Fix control lora not working in fp32.
2 years ago
comfyanonymous
199d73364a
Fix ControlLora on lowvram.
2 years ago
comfyanonymous
d08e53de2e
Remove autocast from controlnet code.
2 years ago
comfyanonymous
0d7b0a4dc7
Small cleanups.
2 years ago
Simon Lui
9225465975
Further tuning and fix mem_free_total.
2 years ago
Simon Lui
2c096e4260
Add ipex optimize and other enhancements for Intel GPUs based on recent memory changes.
2 years ago
comfyanonymous
e9469e732d
--disable-smart-memory now disables loading model directly to vram.
2 years ago
comfyanonymous
c9b562aed1
Free more memory before VAE encode/decode.
2 years ago
comfyanonymous
b80c3276dc
Fix issue with gligen.
2 years ago
comfyanonymous
d6e4b342e6
Support for Control Loras.
...
Control loras are controlnets where some of the weights are stored in
"lora" format: an up and a down low rank matrice that when multiplied
together and added to the unet weight give the controlnet weight.
This allows a much smaller memory footprint depending on the rank of the
matrices.
These controlnets are used just like regular ones.
2 years ago
comfyanonymous
39ac856a33
ReVision support: unclip nodes can now be used with SDXL.
2 years ago
comfyanonymous
76d53c4622
Add support for clip g vision model to CLIPVisionLoader.
2 years ago
Alexopus
e59fe0537a
Fix referenced before assignment
...
For https://github.com/BlenderNeko/ComfyUI_TiledKSampler/issues/13
2 years ago
comfyanonymous
be9c5e25bc
Fix issue with not freeing enough memory when sampling.
2 years ago
comfyanonymous
ac0758a1a4
Fix bug with lowvram and controlnet advanced node.
2 years ago
comfyanonymous
c28db1f315
Fix potential issues with patching models when saving checkpoints.
2 years ago
comfyanonymous
3aee33b54e
Add --disable-smart-memory for those that want the old behaviour.
2 years ago
comfyanonymous
2be2742711
Fix issue with regular torch version.
2 years ago
comfyanonymous
89a0767abf
Smarter memory management.
...
Try to keep models on the vram when possible.
Better lowvram mode for controlnets.
2 years ago
comfyanonymous
2c97c30256
Support small diffusers controlnet so both types are now supported.
2 years ago
comfyanonymous
53f326a3d8
Support diffusers mini controlnets.
2 years ago
comfyanonymous
58f0c616ed
Fix clip vision issue with old transformers versions.
2 years ago
comfyanonymous
ae270f79bc
Fix potential issue with batch size and clip vision.
2 years ago
comfyanonymous
a2ce9655ca
Refactor unclip code.
2 years ago
comfyanonymous
9cc12c833d
CLIPVisionEncode can now encode multiple images.
2 years ago
comfyanonymous
0cb6dac943
Remove 3m from PR #1213 because of some small issues.
2 years ago
comfyanonymous
e244b2df83
Add sgm_uniform scheduler that acts like the default one in sgm.
2 years ago
comfyanonymous
58c7da3665
Gpu variant of dpmpp_3m_sde. Note: use 3m with exponential or karras.
2 years ago
comfyanonymous
ba319a34e4
Merge branch 'dpmpp3m' of https://github.com/FizzleDorf/ComfyUI
2 years ago
FizzleDorf
3cfad03a68
dpmpp 3m + dpmpp 3m sde added
2 years ago
comfyanonymous
585a062910
Print unet config when model isn't detected.
2 years ago
comfyanonymous
c8a23ce9e8
Support for yet another lora type based on diffusers.
2 years ago
comfyanonymous
2bc12d3d22
Add --temp-directory argument to set temp directory.
2 years ago
comfyanonymous
c20583286f
Support diffuser text encoder loras.
2 years ago
comfyanonymous
cf10c5592c
Disable calculating uncond when CFG is 1.0
2 years ago
comfyanonymous
1f0f4cc0bd
Add argument to disable auto launching the browser.
2 years ago
comfyanonymous
d8e58f0a7e
Detect hint_channels from controlnet.
2 years ago
comfyanonymous
c5d7593ccf
Support loras in diffusers format.
2 years ago
comfyanonymous
1ce0d8ad68
Add CMP 30HX card to the nvidia_16_series list.
2 years ago
comfyanonymous
c99d8002f8
Make sure the pooled output stays at the EOS token with added embeddings.
2 years ago
comfyanonymous
4a77fcd6ab
Only shift text encoder to vram when CPU cores are under 8.
2 years ago
comfyanonymous
3cd31d0e24
Lower CPU thread check for running the text encoder on the CPU vs GPU.
2 years ago
comfyanonymous
2b13939044
Remove some useless code.
2 years ago
comfyanonymous
95d796fc85
Faster VAE loading.
2 years ago
comfyanonymous
4b957a0010
Initialize the unet directly on the target device.
2 years ago
comfyanonymous
c910b4a01c
Remove unused code and torchdiffeq dependency.
2 years ago
comfyanonymous
1141029a4a
Add --disable-metadata argument to disable saving metadata in files.
2 years ago
comfyanonymous
fbf5c51c1c
Merge branch 'fix_batch_timesteps' of https://github.com/asagi4/ComfyUI
2 years ago
comfyanonymous
68be24eead
Remove some prints.
2 years ago
asagi4
1ea4d84691
Fix timestep ranges when batch_size > 1
2 years ago
comfyanonymous
5379051d16
Fix diffusers VAE loading.
2 years ago
comfyanonymous
727588d076
Fix some new loras.
2 years ago
comfyanonymous
4f9b6f39d1
Fix potential issue with Save Checkpoint.
2 years ago
comfyanonymous
5f75d784a1
Start is now 0.0 and end is now 1.0 for the timestep ranges.
2 years ago
comfyanonymous
7ff14b62f8
ControlNetApplyAdvanced can now define when controlnet gets applied.
2 years ago
comfyanonymous
d191c4f9ed
Add a ControlNetApplyAdvanced node.
...
The controlnet can be applied to the positive or negative prompt only by
connecting it correctly.
2 years ago
comfyanonymous
0240946ecf
Add a way to set which range of timesteps the cond gets applied to.
2 years ago
comfyanonymous
22f29d66ca
Try to fix memory issue with lora.
2 years ago
comfyanonymous
67be7eb81d
Nodes can now patch the unet function.
2 years ago
comfyanonymous
12a6e93171
Del the right object when applying lora.
2 years ago
comfyanonymous
78e7958d17
Support controlnet in diffusers format.
2 years ago
comfyanonymous
09386a3697
Fix issue with lora in some cases when combined with model merging.
2 years ago
comfyanonymous
58b2364f58
Properly support SDXL diffusers unet with UNETLoader node.
2 years ago
comfyanonymous
0115018695
Print errors and continue when lora weights are not compatible.
2 years ago
comfyanonymous
4760c29380
Merge branch 'fix-AttributeError-module-'torch'-has-no-attribute-'mps'' of https://github.com/KarryCharon/ComfyUI
2 years ago
comfyanonymous
0b284f650b
Fix typo.
2 years ago
comfyanonymous
e032ca6138
Fix ddim issue with older torch versions.
2 years ago
comfyanonymous
18885f803a
Add MX450 and MX550 to list of cards with broken fp16.
2 years ago
comfyanonymous
9ba440995a
It's actually possible to torch.compile the unet now.
2 years ago
comfyanonymous
51d5477579
Add key to indicate checkpoint is v_prediction when saving.
2 years ago
comfyanonymous
ff6b047a74
Fix device print on old torch version.
2 years ago
comfyanonymous
9871a15cf9
Enable --cuda-malloc by default on torch 2.0 and up.
...
Add --disable-cuda-malloc to disable it.
2 years ago
comfyanonymous
55d0fca9fa
--windows-standalone-build now enables --cuda-malloc
2 years ago
comfyanonymous
1679abd86d
Add a command line argument to enable backend:cudaMallocAsync
2 years ago
comfyanonymous
3a150bad15
Only calculate randn in some samplers when it's actually being used.
2 years ago
comfyanonymous
ee8f8ee07f
Fix regression with ddim and uni_pc when batch size > 1.
2 years ago
comfyanonymous
3ded1a3a04
Refactor of sampler code to deal more easily with different model types.
2 years ago
comfyanonymous
5f57362613
Lower lora ram usage when in normal vram mode.
2 years ago
comfyanonymous
490771b7f4
Speed up lora loading a bit.
2 years ago
comfyanonymous
50b1180dde
Fix CLIPSetLastLayer not reverting when removed.
2 years ago
comfyanonymous
6fb084f39d
Reduce floating point rounding errors in loras.
2 years ago
comfyanonymous
91ed2815d5
Add a node to merge CLIP models.
2 years ago
comfyanonymous
b2f03164c7
Prevent the clip_g position_ids key from being saved in the checkpoint.
...
This is to make it match the official checkpoint.
2 years ago
comfyanonymous
46dc050c9f
Fix potential tensors being on different devices issues.
2 years ago
KarryCharon
3e2309f149
fix mps miss import
2 years ago
comfyanonymous
606a537090
Support SDXL embedding format with 2 CLIP.
2 years ago
comfyanonymous
6ad0a6d7e2
Don't patch weights when multiplier is zero.
2 years ago
comfyanonymous
d5323d16e0
latent2rgb matrix for SDXL.
2 years ago
comfyanonymous
0ae81c03bb
Empty cache after model unloading for normal vram and lower.
2 years ago
comfyanonymous
d3f5998218
Support loading clip_g from diffusers in CLIP Loader nodes.
2 years ago
comfyanonymous
a9a4ba7574
Fix merging not working when model2 of model merge node was a merge.
2 years ago
comfyanonymous
bb5fbd29e9
Merge branch 'condmask-fix' of https://github.com/vmedea/ComfyUI
2 years ago
comfyanonymous
e7bee85df8
Add arguments to run the VAE in fp16 or bf16 for testing.
2 years ago
comfyanonymous
608fcc2591
Fix bug with weights when prompt is long.
2 years ago
comfyanonymous
ddc6f12ad5
Disable autocast in unet for increased speed.
2 years ago
comfyanonymous
603f02d613
Fix loras not working when loading checkpoint with config.
2 years ago
comfyanonymous
af7a49916b
Support loading unet files in diffusers format.
2 years ago
comfyanonymous
e57cba4c61
Add gpu variations of the sde samplers that are less deterministic
...
but faster.
2 years ago
comfyanonymous
f81b192944
Add logit scale parameter so it's present when saving the checkpoint.
2 years ago
comfyanonymous
acf95191ff
Properly support SDXL diffusers loras for unet.
2 years ago
mara
c61a95f9f7
Fix size check for conditioning mask
...
The wrong dimensions were being checked, [1] and [2] are the image size.
not [2] and [3]. This results in an out-of-bounds error if one of them
actually matches.
2 years ago
comfyanonymous
8d694cc450
Fix issue with OSX.
2 years ago
comfyanonymous
c3e96e637d
Pass device to CLIP model.
2 years ago
comfyanonymous
5e6bc824aa
Allow passing custom path to clip-g and clip-h.
2 years ago
comfyanonymous
dc9d1f31c8
Improvements for OSX.
2 years ago
comfyanonymous
103c487a89
Cleanup.
2 years ago
comfyanonymous
2c4e0b49b7
Switch to fp16 on some cards when the model is too big.
2 years ago
comfyanonymous
6f3d9f52db
Add a --force-fp16 argument to force fp16 for testing.
2 years ago
comfyanonymous
1c1b0e7299
--gpu-only now keeps the VAE on the device.
2 years ago
comfyanonymous
ce35d8c659
Lower latency by batching some text encoder inputs.
2 years ago
comfyanonymous
3b6fe51c1d
Leave text_encoder on the CPU when it can handle it.
2 years ago
comfyanonymous
b6a60fa696
Try to keep text encoders loaded and patched to increase speed.
...
load_model_gpu() is now used with the text encoder models instead of just
the unet.
2 years ago
comfyanonymous
97ee230682
Make highvram and normalvram shift the text encoders to vram and back.
...
This is faster on big text encoder models than running it on the CPU.
2 years ago
comfyanonymous
5a9ddf94eb
LoraLoader node now caches the lora file between executions.
2 years ago
comfyanonymous
9920367d3c
Fix embeddings not working with --gpu-only
2 years ago
comfyanonymous
62db11683b
Move unet to device right after loading on highvram mode.
2 years ago
comfyanonymous
4376b125eb
Remove useless code.
2 years ago
comfyanonymous
89120f1fbe
This is unused but it should be 1280.
2 years ago
comfyanonymous
2c7c14de56
Support for SDXL text encoder lora.
2 years ago
comfyanonymous
fcef47f06e
Fix bug.
2 years ago
comfyanonymous
8248babd44
Use pytorch attention by default on nvidia when xformers isn't present.
...
Add a new argument --use-quad-cross-attention
2 years ago
comfyanonymous
9b93b920be
Add CheckpointSave node to save checkpoints.
...
The created checkpoints contain workflow metadata that can be loaded by
dragging them on top of the UI or loading them with the "Load" button.
Checkpoints will be saved in fp16 or fp32 depending on the format ComfyUI
is using for inference on your hardware. To force fp32 use: --force-fp32
Anything that patches the model weights like merging or loras will be
saved.
The output directory is currently set to: output/checkpoints but that might
change in the future.
2 years ago
comfyanonymous
b72a7a835a
Support loras based on the stability unet implementation.
2 years ago
comfyanonymous
c71a7e6b20
Fix ddim + inpainting not working.
2 years ago
comfyanonymous
4eab00e14b
Set the seed in the SDE samplers to make them more reproducible.
2 years ago
comfyanonymous
cef6aa62b2
Add support for TAESD decoder for SDXL.
2 years ago
comfyanonymous
20f579d91d
Add DualClipLoader to load clip models for SDXL.
...
Update LoadClip to load clip models for SDXL refiner.
2 years ago
comfyanonymous
b7933960bb
Fix CLIPLoader node.
2 years ago
comfyanonymous
78d8035f73
Fix bug with controlnet.
2 years ago
comfyanonymous
05676942b7
Add some more transformer hooks and move tomesd to comfy_extras.
...
Tomesd now uses q instead of x to decide which tokens to merge because
it seems to give better results.
2 years ago
comfyanonymous
fa28d7334b
Remove useless code.
2 years ago
comfyanonymous
8607c2d42d
Move latent scale factor from VAE to model.
2 years ago
comfyanonymous
30a3861946
Fix bug when yaml config has no clip params.
2 years ago
comfyanonymous
9e37f4c7d5
Fix error with ClipVision loader node.
2 years ago
comfyanonymous
9f83b098c9
Don't merge weights when shapes don't match and print a warning.
2 years ago
comfyanonymous
f87ec10a97
Support base SDXL and SDXL refiner models.
...
Large refactor of the model detection and loading code.
2 years ago
comfyanonymous
9fccf4aa03
Add original_shape parameter to transformer patch extra_options.
2 years ago
comfyanonymous
51581dbfa9
Fix last commits causing an issue with the text encoder lora.
2 years ago
comfyanonymous
8125b51a62
Keep a set of model_keys for faster add_patches.
2 years ago
comfyanonymous
45beebd33c
Add a type of model patch useful for model merging.
2 years ago
comfyanonymous
036a22077c
Fix k_diffusion math being off by a tiny bit during txt2img.
2 years ago
comfyanonymous
8883cb0f67
Add a way to set patches that modify the attn2 output.
...
Change the transformer patches function format to be more future proof.
2 years ago
comfyanonymous
cd930d4e7f
pop clip vision keys after loading them.
2 years ago
comfyanonymous
c9e4a8c9e5
Not needed anymore.
2 years ago
comfyanonymous
fb4bf7f591
This is not needed anymore and causes issues with alphas_cumprod.
2 years ago
comfyanonymous
45be2e92c1
Fix DDIM v-prediction.
2 years ago
comfyanonymous
e6e50ab2dd
Fix an issue when alphas_comprod are half floats.
2 years ago
comfyanonymous
ae43f09ef7
All the unet weights should now be initialized with the right dtype.
2 years ago
comfyanonymous
f7edcfd927
Add a --gpu-only argument to keep and run everything on the GPU.
...
Make the CLIP model work on the GPU.
2 years ago
comfyanonymous
7bf89ba923
Initialize more unet weights as the right dtype.
2 years ago
comfyanonymous
e21d9ad445
Initialize transformer unet block weights in right dtype at the start.
2 years ago
comfyanonymous
bb1f45d6e8
Properly disable weight initialization in clip models.
2 years ago
comfyanonymous
21f04fe632
Disable default weight values in unet conv2d for faster loading.
2 years ago