Official repository for LTX-Video
Updated May 25, 2025 (Python)
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
LTX-Video Support for ComfyUI
Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persistence! Official ComfyUI workflow release! Only 4GB VRAM is enough to run!
Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025)
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.
⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral)
OpenMusic: SOTA Text-to-music (TTM) Generation
MoH: Multi-Head Attention as Mixture-of-Head Attention
Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers.
CogVideoX-5B 4-bit quantization model
Official implementation of "JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization"
An open source community implementation of the model from the paper: "Movie Gen: A Cast of Media Foundation Models". Join our community to help implement this model!
🤗CacheDiT: A Training-free and Easy-to-use Cache Acceleration Toolbox for DiTs (DBCache, DBPrune, FBCache)🔥
Official implementation of the paper "Inverse Virtual Try-On: Generating Multi-Category Product-Style Images from Clothed Individuals"
A from-scratch, hand-written reproduction of the diffusion transformer model, trained on the MNIST handwritten digit dataset
Serve Text-to-Video Models in Production