gradio - Technical Topic

Related tags

web ai deep-learning torch pytorch unstable image-generation gradio diffusion upscaling

Here are 3,989 public repositories matching this topic...

AUTOMATIC1111 / stable-diffusion-webui

Stable Diffusion web UI

web ai deep-learning torch pytorch unstable image-generation gradio diffusion upscaling text2image image2image img2img ai-art txt2img stable-diffusion

Updated Mar 2, 2026
Python

gradio-app / gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

python data-science machine-learning ui deep-learning interface deploy models data-visualization data-analysis ui-components gradio python-notebook gradio-interface

Updated Jul 28, 2026
Python

Zeyi-Lin / HivisionIDPhotos

⚡️HivisionIDPhotos: a lightweight and efficient AI tool and algorithm for making ID photos.

docker demo machine-learning tools cnn face-recognition unet gradio matting mtcnn fastapi idphoto

Updated Jul 3, 2026
Python

DrewThomasson / ebook2audiobook

Generate audiobooks from e-books, with voice cloning and support for 1158+ languages!

multilingual windows linux docker mac kaggle audiobook tts english epub chinese gradio audiobooks colab-notebook voice-cloning xtts

Updated Jul 26, 2026
Python

camenduru / stable-diffusion-webui-colab

Stable Diffusion web UI for Colab

ai deep-learning pytorch colab image-generation lora gradio colaboratory colab-notebook texttovideo img2img ai-art text2video t2v txt2img stable-diffusion dreambooth stable-diffusion-webui stable-diffusion-web-ui

Updated Dec 16, 2025
Jupyter Notebook

abus-aikorea / voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

text-to-speech translator audiobook podcasts tts speech-synthesis subtitles speech-recognition webui speech-to-text karaoke transcription gradio whisper voice-conversion voice-cloning yt-dlp faster-whisper whisperx

Updated Jul 13, 2026
Python

AbdBarho / stable-diffusion-webui-docker

Easy Docker setup for Stable Diffusion with user-friendly UI

docker pytorch gradio docker-compse stable-diffusion

Updated Aug 18, 2024
Shell

modelscope / FunClip

FunASR-powered video transcription, subtitle generation, and LLM-assisted clipping tool with a local Gradio UI.

Updated Jul 26, 2026
Python

GiovanniPasq / agentic-rag-for-dummies

A modular Agentic RAG built with LangGraph — learn Retrieval-Augmented Generation Agents in minutes.

agent gradio agents bm25 ai-agents rag qdrant llm generative-ai langchain retrieval-augmented-generation ollama langgraph rag-pipeline rag-chatbot rag-agents agentic-rag agentic-ai retrieval-augmented-generation-rag

Updated Jul 25, 2026
Jupyter Notebook

ant-research / MagicQuill

[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System

image-editing gradio aigc mllm

Updated Dec 3, 2025
Python

OpenGVLab / Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

chat video gradio big-model video-understanding captioning-videos video-question-answering foundation-models large-model large-language-models chatgpt langchain stablelm

Updated Jul 17, 2026
Python

rsxdalv / TTS-WebUI

A single Gradio + React WebUI with extensions for ACE-Step, OmniVoice, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!

music text-to-speech ai generator tts rvc gradio openvoice openai-api audio-generation generative-ai tortoise-tts musicgen vocos styletts2 cosyvoice ace-step

Updated Jul 27, 2026
TypeScript

OpenGVLab / InternGPT

InternGPT (iGPT) is an open-source demo platform where you can easily showcase your AI models. It now supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)

sam click vqa image-captioning llama gpt gradio husky multimodal video-generation vicuna gpt-4 llm chatgpt langchain foundation-model segment-anything internimage imagebind draggan

Updated Aug 20, 2024
Python

jhj0517 / Whisper-WebUI

A web UI for easy subtitle generation using the Whisper model.

python open-source ai web-ui pytorch gradio whisper

Updated Dec 29, 2025
Python

om-ai-lab / OmAgent

[EMNLP-2024] Build multimodal language agents for fast prototyping and production

python agent workflow chatbot gemini openai llama gpt gradio vlm multimodal vision-and-language rag gpt4 large-language-models llm llava smart-hardware language-agent multimodal-agent

Updated Mar 19, 2025
Python

Mukosame / Anime2Sketch

A sketch extractor for anime/illustration.

computer-vision deep-learning anime sketch comic manga pytorch generative-adversarial-network gan image-generation gans gradio wacv

Updated Aug 16, 2023
Python

rupeshs / fastsdcpu

Fast Stable Diffusion on CPU and AI PC

api cli flux qt cpu torch webui gradio diffusion openvino aipc diffusers stablediffusion lcmdiffusion latentconsistencymodels fastsdcpu desktopgui sdxlturbo sdupcale sdxs

Updated Jul 24, 2026
Python

camenduru / text-generation-webui-colab

A Colab Gradio web UI for running large language models

colab llama gradio koala llamas alpaca lama colaboratory colab-notebook vicuna llm

Updated Dec 22, 2023
Jupyter Notebook

kabachuha / sd-webui-text2video

Auto1111 extension implementing text2video diffusion models (like ModelScope or VideoCrafter) using only Auto1111 webui dependencies

extension webui gradio text2video stable-diffusion automatic1111 modelscope videocrafter

Updated Jul 14, 2024
Python

Vincentqyw / image-matching-webui

🤗 image matching webui

deep-learning sift gradio pose-estimation image-matching feature-matching visual-localization superpoint superglue kornia keypoint-matching topicfm loftr lightglue aspanformer

Updated Jul 14, 2026
Python
