Stars: 39.0k
| Created at: 2023-08-09
| Last updated: 2024-08-01
Focus on prompting and generating
https://github.com/open-webui/open-webui
Stars: 34.0k
| Created at: 2023-10-06
| Last updated: 2024-08-01
User-friendly WebUI for LLMs (Formerly Ollama WebUI)
https://github.com/PKU-YuanGroup/Open-Sora-Plan
Stars: 11.1k
| Created at: 2024-02-20
| Last updated: 2024-08-01
This project aims to reproduce Sora (OpenAI's T2V model). We hope the open-source community will contribute to this project.
https://github.com/hua1995116/awesome-ai-painting
Stars: 11.0k
| Created at: 2022-10-08
| Last updated: 2024-08-01
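A collection of AI painting resources (including platforms available in China and abroad, usage tutorials, parameter tutorials, deployment tutorials, industry news, and more): Stable Diffusion, AnimateDiff, Stable Cascade, Stable SDXL Turbo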
https://github.com/InstantID/InstantID
Stars: 10.6k
| Created at: 2023-12-11
| Last updated: 2024-08-01
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
https://github.com/guoyww/AnimateDiff
Stars: 10.0k
| Created at: 2023-06-17
| Last updated: 2024-08-01
Official implementation of AnimateDiff.
https://github.com/KwaiVGI/LivePortrait
Stars: 9.1k
| Created at: 2024-07-03
| Last updated: 2024-08-02
Bring portraits to life!
https://github.com/TencentARC/PhotoMaker
Stars: 9.0k
| Created at: 2023-12-06
| Last updated: 2024-08-02
PhotoMaker [CVPR 2024]
https://github.com/NVIDIA/TensorRT-LLM
Stars: 7.8k
| Created at: 2023-08-16
| Last updated: 2024-08-01
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
https://github.com/fudan-generative-vision/hallo
Stars: 7.6k
| Created at: 2024-06-12
| Last updated: 2024-08-01
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
https://github.com/TheLastBen/fast-stable-diffusion
Stars: 7.4k
| Created at: 2022-09-21
| Last updated: 2024-08-01
fast-stable-diffusion + DreamBooth
https://github.com/LiheYoung/Depth-Anything
Stars: 6.6k
| Created at: 2024-01-22
| Last updated: 2024-08-01
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
https://github.com/AbdBarho/stable-diffusion-webui-docker
Stars: 6.4k
| Created at: 2022-08-27
| Last updated: 2024-08-01
Easy Docker setup for Stable Diffusion with user-friendly UI
https://github.com/jagenjo/litegraph.js
Stars: 6.1k
| Created at: 2013-09-26
| Last updated: 2024-08-01
A graph node engine and editor written in JavaScript, similar to PD or UDK Blueprints. It comes with its own editor in HTML5 Canvas2D. The engine can run client side or server side using Node. Graphs can be exported as JSON to be included in applications independently.
https://github.com/modelscope/DiffSynth-Studio
Stars: 6.0k
| Created at: 2023-12-07
| Last updated: 2024-08-02
Enjoy the magic of Diffusion models!
https://github.com/Acly/krita-ai-diffusion
Stars: 5.9k
| Created at: 2023-09-01
| Last updated: 2024-08-01
Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.
https://github.com/aigc-apps/sd-webui-EasyPhoto
Stars: 4.8k
| Created at: 2023-08-28
| Last updated: 2024-08-01
📷 EasyPhoto | Your Smart AI Photo Generator.
https://github.com/tencent-ailab/IP-Adapter
Stars: 4.7k
| Created at: 2023-08-16
| Last updated: 2024-08-01
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images from an image prompt.
https://github.com/Stability-AI/StableSwarmUI
Stars: 4.3k
| Created at: 2023-05-12
| Last updated: 2024-08-01
StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
https://github.com/luosiallen/latent-consistency-model
Stars: 4.2k
| Created at: 2023-10-06
| Last updated: 2024-07-31
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
https://github.com/ParisNeo/lollms-webui
Stars: 4.1k
| Created at: 2023-04-06
| Last updated: 2024-08-01
Lord of Large Language Models Web User Interface
https://github.com/AILab-CVC/YOLO-World
Stars: 4.1k
| Created at: 2024-01-29
| Last updated: 2024-08-01
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
https://github.com/fudan-generative-vision/champ
Stars: 3.5k
| Created at: 2024-03-17
| Last updated: 2024-08-01
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
https://github.com/philz1337x/clarity-upscaler
Stars: 3.3k
| Created at: 2024-03-15
| Last updated: 2024-08-01
Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative
https://github.com/leejet/stable-diffusion.cpp
Stars: 3.0k
| Created at: 2023-08-13
| Last updated: 2024-08-01
Stable Diffusion in pure C/C++
https://github.com/Tencent/HunyuanDiT
Stars: 3.0k
| Created at: 2024-05-10
| Last updated: 2024-08-01
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
https://github.com/VinsonLaro/stable-diffusion-webui-chinese
Stars: 2.9k
| Created at: 2022-10-10
| Last updated: 2024-08-01
Chinese localization extension for stable-diffusion-webui
https://github.com/Kwai-Kolors/Kolors
Stars: 2.9k
| Created at: 2024-07-05
| Last updated: 2024-08-01
Kolors Team
https://github.com/TencentARC/InstantMesh
Stars: 2.8k
| Created at: 2024-04-10
| Last updated: 2024-08-01
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
https://github.com/DepthAnything/Depth-Anything-V2
Stars: 2.8k
| Created at: 2024-06-13
| Last updated: 2024-08-01
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
https://github.com/AiuniAI/Unique3D
Stars: 2.6k
| Created at: 2024-05-30
| Last updated: 2024-08-01
Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
https://github.com/PixArt-alpha/PixArt-alpha
Stars: 2.6k
| Created at: 2023-10-12
| Last updated: 2024-08-01
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
https://github.com/butaixianran/Stable-Diffusion-Webui-Civitai-Helper
Stars: 2.4k
| Created at: 2023-03-07
| Last updated: 2024-08-01
Stable Diffusion Webui Extension for Civitai, to manage your model much more easily.
https://github.com/Gourieff/sd-webui-reactor
Stars: 2.4k
| Created at: 2023-06-18
| Last updated: 2024-08-01
Fast and Simple Face Swap Extension for StableDiffusion WebUI (A1111 SD WebUI, SD WebUI Forge, SD.Next, Cagliostro)
https://github.com/Doubiiu/DynamiCrafter
Stars: 2.2k
| Created at: 2023-11-27
| Last updated: 2024-08-01
[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
https://github.com/TMElyralab/MuseV
Stars: 2.2k
| Created at: 2024-03-25
| Last updated: 2024-08-01
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
https://github.com/xyflow/awesome-node-based-uis
Stars: 2.2k
| Created at: 2022-11-14
| Last updated: 2024-08-01
A curated list with resources about node-based UIs
https://github.com/tencent-ailab/V-Express
Stars: 2.1k
| Created at: 2024-05-21
| Last updated: 2024-08-01
V-Express aims to generate a talking head video under the control of a reference image, an audio clip, and a sequence of V-Kps images.
https://github.com/KohakuBlueleaf/LyCORIS
Stars: 2.1k
| Created at: 2023-02-27
| Last updated: 2024-08-01
Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.
https://github.com/TMElyralab/MuseTalk
Stars: 2.1k
| Created at: 2024-03-26
| Last updated: 2024-08-01
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
https://github.com/IceClear/StableSR
Stars: 2.0k
| Created at: 2023-04-02
| Last updated: 2024-08-01
[IJCV2024] Exploiting Diffusion Prior for Real-World Image Super-Resolution
https://github.com/PeterH0323/Streamer-Sales
Stars: 2.0k
| Created at: 2024-04-05
| Last updated: 2024-08-02
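Streamer-Sales (Sales Champion): a livestream sales-host LLM 🛒🎁 that presents products based on their given features, with a focus on stimulating viewers' desire to buy. 🚀⭐ Includes a detailed data generation pipeline❗ 📦 Also integrates LMDeploy accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an Agent that queries the web for real-time information 🌐, and ASR (speech-to-text) 🎙️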
https://github.com/TMElyralab/MusePose
Stars: 2.0k
| Created at: 2024-05-24
| Last updated: 2024-08-01
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
https://github.com/adieyal/sd-dynamic-prompts
Stars: 2.0k
| Created at: 2022-10-08
| Last updated: 2024-08-01
A custom script for AUTOMATIC1111/stable-diffusion-webui to implement a tiny template language for random prompt generation
https://github.com/Alpha-VLLM/Lumina-T2X
Stars: 1.9k
| Created at: 2024-03-28
| Last updated: 2024-08-01
Lumina-T2X is a unified framework for Text to Any Modality Generation
https://github.com/PRIS-CV/DemoFusion
Stars: 1.9k
| Created at: 2023-10-29
| Last updated: 2024-08-01
Let us democratise high-resolution generation! (CVPR 2024)
https://github.com/taishi-i/awesome-ChatGPT-repositories
Stars: 1.9k
| Created at: 2023-04-02
| Last updated: 2024-08-01
A curated list of resources dedicated to open source GitHub repositories related to ChatGPT
https://github.com/lllyasviel/LayerDiffuse
Stars: 1.9k
| Created at: 2024-02-27
| Last updated: 2024-08-01
Transparent Image Layer Diffusion using Latent Transparency
https://github.com/uhub/awesome-c
Stars: 1.9k
| Created at: 2015-08-12
| Last updated: 2024-08-01
A curated list of awesome C frameworks, libraries and software.
https://github.com/thisjam/sd-webui-oldsix-prompt
Stars: 1.7k
| Created at: 2023-07-27
| Last updated: 2024-07-31
Chinese prompt plugin for sd-webui, a must-have for both experienced and new model trainers
https://github.com/BadToBest/EchoMimic
Stars: 1.6k
| Created at: 2024-07-03
| Last updated: 2024-08-01
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
https://github.com/ChenyangSi/FreeU
Stars: 1.6k
| Created at: 2023-09-14
| Last updated: 2024-08-01
FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)
https://github.com/Coyote-A/ultimate-upscale-for-automatic1111
Stars: 1.6k
| Created at: 2023-01-02
| Last updated: 2024-08-01
(No description provided.)
https://github.com/PixArt-alpha/PixArt-sigma
Stars: 1.5k
| Created at: 2024-02-29
| Last updated: 2024-08-01
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
https://github.com/InstantStyle/InstantStyle
Stars: 1.5k
| Created at: 2023-12-22
| Last updated: 2024-08-01
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
https://github.com/xinsir6/ControlNetPlus
Stars: 1.5k
| Created at: 2024-07-02
| Last updated: 2024-08-01
ControlNet++: All-in-one ControlNet for image generation and editing!
https://github.com/dtlnor/stable-diffusion-webui-localization-zh_CN
Stars: 1.4k
| Created at: 2022-11-06
| Last updated: 2024-08-01
Simplified Chinese translation extension for AUTOMATIC1111's stable diffusion webui
https://github.com/amrzv/awesome-colab-notebooks
Stars: 1.3k
| Created at: 2020-12-27
| Last updated: 2024-08-01
Collection of Google Colaboratory notebooks for fast and easy experiments
https://github.com/sergree/matchering
Stars: 1.3k
| Created at: 2018-09-28
| Last updated: 2024-07-31
🎚️ Open Source Audio Matching and Mastering
https://github.com/TencentARC/BrushNet
Stars: 1.3k
| Created at: 2024-03-10
| Last updated: 2024-08-01
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
https://github.com/PKU-YuanGroup/MagicTime
Stars: 1.2k
| Created at: 2024-04-07
| Last updated: 2024-08-01
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
https://github.com/bianchenglequ/NetCodeTop
Stars: 1.2k
| Created at: 2023-01-01
| Last updated: 2024-08-01
A collection of interesting, useful, and popular open-source projects on GitHub related to .NET and .NET Core.
https://github.com/numz/sd-wav2lip-uhq
Stars: 1.2k
| Created at: 2023-08-03
| Last updated: 2024-08-01
Wav2Lip UHQ extension for Automatic1111
https://github.com/homebrewltd/awesome-local-ai
Stars: 1.1k
| Created at: 2023-09-06
| Last updated: 2024-08-01
An awesome repository of local AI tools
https://github.com/chengzeyi/stable-fast
Stars: 1.1k
| Created at: 2023-10-17
| Last updated: 2024-07-31
Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
https://github.com/panyanyany/Awesome-ChatTTS
Stars: 1.1k
| Created at: 2024-06-08
| Last updated: 2024-08-02
A comprehensive collection of ChatTTS resources: free demo links, voice libraries, and more
https://github.com/ToTheBeginning/PuLID
Stars: 1.0k
| Created at: 2024-04-17
| Last updated: 2024-08-01
Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
https://github.com/TencentQQGYLab/ELLA
Stars: 1.0k
| Created at: 2024-03-07
| Last updated: 2024-08-01
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
https://github.com/receyuki/stable-diffusion-prompt-reader
Stars: 1.0k
| Created at: 2023-03-24
| Last updated: 2024-08-01
A simple standalone viewer for reading prompts from Stable Diffusion generated images outside the webui.
https://github.com/JonathanFly/bark
Stars: 977
| Created at: 2023-04-21
| Last updated: 2024-07-28
🚀 BARK INFINITY GUI CMD 🎶 Powered Up Bark Text-prompted Generative Audio Model