Stable Diffusion web UI
- Updated Dec 18, 2025
- Python
Stable Diffusion web UI
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Generate audiobooks from e-books, voice cloning & 1158+ languages!
Stable Diffusion WebUI Colab
Easy Docker setup for Stable Diffusion with user-friendly UI
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
Open-source, accurate, and easy-to-use video speech recognition and clipping tool with integrated LLM-based AI clipping.
[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
InternGPT (iGPT) is an open-source demo platform where you can easily showcase your AI models. It now supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)
A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!
[EMNLP-2024] Build multimodal language agents for fast prototyping and production
A minimal Agentic RAG built with LangGraph — learn Retrieval-Augmented Generation Agents in minutes.
A sketch extractor for anime/illustration.
Fast stable diffusion on CPU and AI PC
🌊 Images to 3D Parallax effect video
Auto1111 extension implementing text2video diffusion models (like ModelScope or VideoCrafter) using only Auto1111 webui dependencies