Kandinsky AI
Open-source multimodal tool: text/image/video generation & inpainting.

What is Kandinsky AI?
Kandinsky AI is an open-source multimodal tool supporting text-to-image, image-to-image and video generation, with inpainting and blending capabilities, multilingual compatibility, high-resolution output and efficient inference.
Kandinsky AI's Core Features
Fast Text-to-Video Generation
Generate smooth, high-resolution videos (up to ~10 seconds) directly from text prompts. Model variants are optimized for quality (Pro) or speed (Flash/Lite), covering both rapid prototyping and production-ready clips.
High-Quality Text-to-Image
Produce sharp, high-resolution images from text prompts with strong detail, style consistency and multilingual prompt support suitable for campaigns, concept art, and product visuals.
Image-to-Video / Animation
Animate existing images or concept art into short video clips while preserving subject identity, composition and visual style. This is useful for storyboards, previews and dynamic product shots.
Inpainting & Blending
Edit and refine generated or uploaded images/videos using inpainting, outpainting and blending tools to remove or replace elements, extend scenes, and maintain visual continuity across frames.
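As a rough illustration of the compositing step that inpainting relies on, the sketch below blends a generated image into an original one under a binary mask. It is a generic NumPy example, not Kandinsky's implementation: the function name `composite_inpaint` and the toy arrays are hypothetical, and a real inpainting model also generates the new content conditioned on the mask rather than only pasting it in.

```python
import numpy as np

def composite_inpaint(original, generated, mask):
    """Blend two images: mask == 1 takes `generated`, mask == 0 keeps `original`.

    Hypothetical helper for illustration only.
    """
    mask = mask.astype(np.float32)
    # Broadcast a (H, W) mask over the channel axis of a (H, W, C) image.
    if mask.ndim == original.ndim - 1:
        mask = mask[..., None]
    return mask * generated + (1.0 - mask) * original

# Toy data: a black "original", a white "generated" image, and a 2x2 edit region.
original = np.zeros((4, 4, 3), dtype=np.float32)
generated = np.ones((4, 4, 3), dtype=np.float32)
mask = np.zeros((4, 4), dtype=np.float32)
mask[1:3, 1:3] = 1.0

result = composite_inpaint(original, generated, mask)
```

Using a soft mask with values between 0 and 1 in place of the binary one would give a feathered blend at the region boundary.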
Open-source Models & Fine-tuning
Built on an open-source diffusion transformer backbone with pretrained checkpoints and trained with the Flow Matching paradigm, enabling further fine-tuning, experimentation and integration into custom workflows.
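For readers unfamiliar with Flow Matching, the sketch below shows how a training target is commonly constructed. It assumes the widely used linear-interpolation path between noise and data; the listing does not say which variant Kandinsky uses, and the function names, shapes and random stand-in data here are hypothetical.

```python
import numpy as np

def flow_matching_pair(x0, x1, t):
    """Return the interpolated sample x_t and the target velocity.

    Assumes a linear path x_t = (1 - t) * x0 + t * x1, whose velocity is x1 - x0.
    x0 is noise, x1 is data, t holds one time in [0, 1] per batch item.
    """
    t = np.asarray(t, dtype=x0.dtype).reshape(-1, *([1] * (x0.ndim - 1)))
    x_t = (1.0 - t) * x0 + t * x1
    v_target = x1 - x0
    return x_t, v_target

def flow_matching_loss(v_pred, v_target):
    """Mean squared error between predicted and target velocity."""
    return float(np.mean((v_pred - v_target) ** 2))

rng = np.random.default_rng(0)
x1 = rng.normal(size=(4, 8))   # stand-in for data latents
x0 = rng.normal(size=(4, 8))   # Gaussian noise
t = rng.uniform(size=4)        # one random time per sample

x_t, v_target = flow_matching_pair(x0, x1, t)
# A real model would predict the velocity from (x_t, t, prompt); here a perfect
# prediction is used to show that the loss is zero in that case.
loss = flow_matching_loss(v_target, v_target)
```

During fine-tuning, the network's velocity prediction would replace the first argument of `flow_matching_loss`, and the loss would be minimized over batches of data and randomly drawn times.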