Phr00t's picture
Update README.md
8ed451d verified
metadata
base_model:
  - Wan-AI/Wan2.2-I2V-A14B
  - Wan-AI/Wan2.2-T2V-A14B
tags:
  - wan
  - wan2.2
  - accelerator

These are mixtures of WAN 2.2 and other WAN-like models and accelerators (with CLIP and VAE also included) to provide a fast, "all in one" solution for making videos as easily and quickly as possible. FP8 precision. Generally the latest version available for each type of model (image to video or text to video) is recommended.

base: This is the first attempt and very "stable", but mostly WAN 2.1 with few WAN 2.2 features. sa_solver recommended.

V2: This is a more dynamic mixture with more WAN 2.2 features. sa_solver OR euler_a sampler recommended. Suffers from minor color shifts and noise in I2V, typically just at the start.

V3: This is a mixture of SkyReels and WAN 2.2, which should improve prompt adherence and quality. euler_a sampler recommended, beta scheduler. Suffers from minor color shifts and noise in I2V, typically just at the start.

V4: WAN 2.2 Lightning in the mix! euler_a/beta recommended. I2V noise and color shifting generally fixed, but motion is a bit overexaggerated.

V5: Improved overexaggeration of I2V model. euler_a/beta recommended.

You just need to use the basic ComfyUI "Load Checkpoint" node with these, as you can take the VAE, CLIP and Model all from one AIO safetensors. All models are intended to use 1 CFG and 4 steps. See sampler recommendations for each version.

WAN 2.1 LORA compatibility is generally still good, along with "low noise" WAN 2.2 LORA compatibility. You might need to adjust LORA strengths (up or down) to get results you want, though.

image/png

image/png

image/png

image/png

Seems to work even on 8GB VRAM:

image/png

Looking for FP16 precision? TekeshiX has been helping me build variants in FP16 format. These should be the V5 I2V model:

https://huggingface.co/TekeshiX/RAPID-AIO-FP16/tree/main