模型/Wan Video 2.2 - ComfyUI Repack - 5B Text-Image-to-Video

Wan Video 2.2 - ComfyUI Repack - 5B Text-Image-to-Video

|
9/3/2025
|
10:00:35 AM
| Discussion|
0
A sleek chrome robot serving a cup of coffee to a man sitting at a cafe booth, with warm overhead lighting in a casual cafe setting.
A college student typing code on a 1990s vintage computer in a green wallpapered dorm room with carpeted floors and a wooden desk, captured with nostalgic grainy texture.
A mysterious figure wearing a full black cloak walking through a narrow back alley between tall urban buildings with steam leaking and dim, night-time lighting.
View from inside a car driving through a rainy tropical highway with palm trees and lush jungle greenery on both sides during daytime.
View through a car windshield during a rainy drive in a suburban neighborhood with American style houses and wet roads, taken with a slight motion blur.
Tall man wearing a white pinstriped suit and sunglasses stands confidently in front of palm trees and a modern Miami skyscraper, shot from a low camera angle.
African American man with afro wearing a pink suit and sunglasses standing on a Miami street at night with palm trees and a modern skyscraper illuminated by neon lights behind him.
Group of young women wearing crop tops and denim shorts dancing outdoors at night in Miami with palm trees and modern skyscrapers illuminated by neon lights in the background.

推薦參數

resolution

720x480, 720x720

vae

Wan2.2-VAE

提示

Wan2.2 supports both text-to-video and image-to-video generation.

The model runs efficiently on consumer-grade GPUs such as the Nvidia 4090.

Wan2.2 features fine-grained control over cinematic aesthetics including lighting, composition, and color.

版本亮點

Wan 2.2 5B for on-site Generation

創作者贊助

Wan Video

Note: There are other Wan Video files hosted on Civitai - these may be duplicates, but this model card is primarily to host the files used by Wan Video in the Civitai Generator.

Wan2.2, a major upgrade to our visual generative models, which is now open-sourced, offering more powerful capabilities, better performance, and superior visual quality. With Wan2.2, we have focused on incorporating the following technical innovations:

👍 MoE Architecture: Wan2.2 introduces a Mixture-of-Experts (MoE) architecture into video diffusion models. By separating the denoising process cross timesteps with specialized powerful expert models, this enlarges the overall model capacity while maintaining the same computational cost.

💪🏻 Data Scaling: Compared to Wan2.1, Wan2.2 is trained on a significantly larger data, with +65.6% more images and +83.2% more videos. This expansion notably enhances the model's generalization across multiple dimensions such as motions, semantics, and aesthetics, achieving TOP performance among all open-sourced and closed-sourced models.

🎬 Cinematic Aesthetics: Wan2.2 incorporates specially curated aesthetic data with fine-grained labels for lighting, composition, and color. This allows for more precise and controllable cinematic style generation, facilitating the creation of videos with customizable aesthetic preferences.

🚀 Efficient High-Definition Hybrid TI2V: Wan2.2 open-sources a 5B model built with our advanced Wan2.2-VAE that achieves a compression ratio of 16×16×4. This model supports both text-to-video and image-to-video generation at 720P resolution with 24fps and can also run on consumer-grade graphics cards like 4090. It one of the fastest 720P@24fps models currently available, capable of serving both the industrial and academic sectors simultaneously.

Wan2.2-T2V-A14B

The T2V-A14B model, supports generating 5s videos at both 480P and 720P resolutions. Built with a Mixture-of-Experts (MoE) architecture, it delivers outstanding video generation quality. On our new benchmark Wan-Bench 2.0, the model surpasses leading commercial models across most key evaluation dimensions.

Wan2.2-I2V-A14B

The I2V-A14B model, designed for image-to-video generation, supports both 480P and 720P resolutions. Built with a Mixture-of-Experts (MoE) architecture, it achieves more stable video synthesis with reduced unrealistic camera movements and offers enhanced support for diverse stylized scenes.

Wan2.2-TI2V-5B

The TI2V-5B model is built with the advanced Wan2.2-VAE that achieves a compression ratio of 16×16×4. This model supports both text-to-video and image-to-video generation at 720P resolution with 24fps and can runs on single consumer-grade GPU such as the 4090. It is one of the fastest 720P@24fps models available, meeting the needs of both industrial applications and academic research.

GitHub: https://github.com/Wan-Video/Wan2.2

Originally HuggingFace Repo: https://huggingface.co/Comfy-Org/Wan_2.2_ComfyUI_Repackaged/tree/main/split_files/diffusion_models

上一個
Flux Detail & Style - aiai Glimra LoRA - v1.0
下一個
lyh_anime_Flux - v4 niji

模型詳情

模型類型

Checkpoint

基礎模型

Wan Video 2.2 TI2V-5B

模型版本

5B Text-Image-to-Video

模型雜湊值

33fc2f5384

創作者

討論

log in以發表評論。