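Data updated: 2024-07-08 / Note: "Chinese projects" broadly means projects whose documentation is natively written in Chinese or includes a Chinese translation, which can usually be found in the project's README, wiki, or official website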
# | Repository | Description | Stars | Average daily growth | Updated |
---|---|---|---|---|---|
1 | 2noise/ChatTTS | A generative speech model for daily dialogue. | 27197 | 648 | 2024-07-07 |
2 | KwaiVGI/LivePortrait | Make one portrait alive! | 1428 | 286 | 2024-07-06 |
3 | OpenDevin/OpenDevin | 🐚 OpenDevin: Code Less, Make More | 28468 | 243 | 2024-07-07 |
4 | fudan-generative-vision/hallo | Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation | 6208 | 239 | 2024-07-04 |
5 | Kwai-Kolors/Kolors | Kolors Team | 614 | 205 | 2024-07-06 |
6 | RVC-Boss/GPT-SoVITS | 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) | 28729 | 163 | 2024-07-07 |
7 | hpcaitech/Open-Sora | Open-Sora: Democratizing Efficient Video Production for All | 20311 | 146 | 2024-07-06 |
8 | FunAudioLLM/CosyVoice | Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability. | 703 | 141 | 2024-07-07 |
9 | jianchang512/ChatTTS-ui | A simple local web interface that uses ChatTTS to synthesize text into speech and also exposes an API for external use. | 5120 | 131 | 2024-06-30 |
10 | binary-husky/gpt_academic | Provides a practical interactive interface for LLMs such as GPT and GLM, specially optimized for reading, polishing, and writing papers. Modular design with custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other codebases; PDF/LaTeX paper translation and summarization; parallel queries to multiple LLMs; support for local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss ... | 61410 | 129 | 2024-07-07 |
11 | harry0703/MoneyPrinterTurbo | Generate high-definition short videos with one click using AI large models. | 14644 | 123 | 2024-07-03 |
12 | myshell-ai/OpenVoice | Instant voice cloning by MyShell. | 27142 | 122 | 2024-07-06 |
13 | onuratakan/gpt-computer-assistant | gpt-4o for windows, macos and linux | 4757 | 111 | 2024-07-02 |
14 | FunAudioLLM/SenseVoice | Multilingual Voice Understanding Model | 550 | 110 | 2024-07-05 |
15 | adithya-s-k/omniparse | Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks | 3249 | 96 | 2024-07-05 |
16 | THUDM/ChatGLM-6B | ChatGLM-6B: An Open Bilingual Dialogue Language Model | 39961 | 83 | 2024-06-27 |
17 | ScrapeGraphAI/Scrapegraph-ai | Python scraper based on AI | 12996 | 80 | 2024-07-05 |
18 | PKU-YuanGroup/Open-Sora-Plan | This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it. | 10885 | 78 | 2024-07-05 |
19 | lm-sys/FastChat | An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. | 35617 | 75 | 2024-07-07 |
20 | buaacyw/MeshAnything | From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers" | 1543 | 70 | 2024-07-02 |
21 | THUDM/GLM-4 | GLM-4 series: Open Multilingual Multimodal Chat LMs | 3466 | 64 | 2024-07-06 |
22 | hiyouga/LLaMA-Factory | A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024) | 25757 | 63 | 2024-07-07 |
23 | huggingface/transformers | 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. | 128693 | 62 | 2024-07-07 |
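24 | LC044/WeChatMsg | Extract WeChat chat history and export it to HTML, Word, and Excel documents for permanent storage; analyze the chat history to generate an annual chat report; use the chat data to train a personal AI chat assistant | 31172 | 57 | 2024-07-06 |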
25 | VikParuchuri/marker | Convert PDF to markdown quickly with high accuracy | 14052 | 56 | 2024-06-30 |
26 | netease-youdao/QAnything | Question and Answer based on Anything. | 10550 | 56 | 2024-06-28 |
27 | AiuniAI/Unique3D | Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image | 2190 | 56 | 2024-07-03 |
28 | infiniflow/ragflow | RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding. | 11102 | 53 | 2024-07-07 |
29 | THUDM/ChatGLM3 | ChatGLM3 series: Open Bilingual Chat LLMs | 13012 | 51 | 2024-07-04 |
30 | VikParuchuri/surya | OCR, layout analysis, reading order, line detection in 90+ languages | 9054 | 50 | 2024-07-04 |
31 | OpenBMB/MiniCPM-V | MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone | 7859 | 49 | 2024-07-03 |
32 | Tencent/HunyuanDiT | Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding | 2693 | 46 | 2024-07-04 |
33 | RVC-Project/Retrieval-based-Voice-Conversion-WebUI | Easily train a good VC model with voice data <= 10 mins! | 20838 | 44 | 2024-07-07 |
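34 | chatanywhere/GPT_API_free | Free ChatGPT API key. A free ChatGPT API with GPT-4 API support (free); a free forwarding API for ChatGPT that is usable in mainland China over a direct connection with no proxy. Works with software and plugins such as ChatBox, greatly reducing API costs, and allows unrestricted chatting from within China. | 18705 | 42 | 2024-07-01 |
35 | THUDM/ChatGLM2-6B | ChatGLM2-6B: An Open Bilingual Chat LLM | 15612 | 41 | 2024-06-27 |
36 | zhayujie/chatgpt-on-wechat | A chatbot built on large models that can be connected to WeChat Official Accounts, WeCom apps, Feishu, DingTalk, and more. Selectable models include GPT3.5/GPT-4o/GPT4.0/Claude/ERNIE Bot/iFlytek Spark/Tongyi Qianwen/Gemini/GLM-4/Kimi/LinkAI. It handles text, voice, and images, can access the operating system and the internet, and supports building a customized enterprise customer-service assistant on your own knowledge base. | 28042 | 40 | 2024-07-05 |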
37 | hpcaitech/ColossalAI | Making large AI models cheaper, faster and more accessible | 38312 | 39 | 2024-07-06 |
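38 | 6drf21e/ChatTTS_colab | 🚀 One-click deployment (offline bundle included)! Based on ChatTTS, with streaming output, random voice drawing, long-audio generation, and multi-role narration. Easy to use, with no complicated installation. | 1538 | 39 | 2024-07-02 |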
39 | ultralytics/ultralytics | NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite | 25830 | 39 | 2024-07-07 |
40 | QwenLM/Qwen | The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud. | 12467 | 37 | 2024-06-27 |
41 | ymcui/Chinese-LLaMA-Alpaca | Chinese LLaMA & Alpaca large language models, plus local CPU/GPU training and deployment | 17883 | 37 | 2024-04-30 |
42 | microsoft/UFO | A UI-Focused Agent for Windows OS Interaction. | 6470 | 36 | 2024-07-07 |
43 | LinYuanovo/pikpak_auto_invite | PikPak auto-invite program with image recognition for passing CAPTCHAs; runs locally or in the cloud via GitHub Actions | 1058 | 35 | 2024-07-04 |
44 | ultralytics/yolov5 | YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite | 48401 | 32 | 2024-07-05 |
45 | assafelovic/gpt-researcher | GPT based autonomous agent that does online comprehensive research on any given topic | 13071 | 31 | 2024-07-07 |
46 | GaiZhenbiao/ChuanhuChatGPT | GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI. | 15005 | 30 | 2024-06-28 |
47 | jianchang512/clone-voice | A sound cloning tool with a web interface, using your voice or any sound to record audio | 6745 | 29 | 2024-03-08 |
48 | reflex-dev/reflex | 🕸️ Web apps in pure Python 🐍 | 17976 | 29 | 2024-07-07 |
49 | myshell-ai/MeloTTS | High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean. | 4018 | 29 | 2024-07-06 |
50 | OpenBMB/XAgent | An Autonomous LLM Agent for Complex Task Solving | 7834 | 29 | 2024-05-02 |
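51 | hiroi-sora/Umi-OCR | Free, open-source, offline OCR software. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and QR code scanning and generation. Ships with built-in multilingual libraries. | 22990 | 28 | 2024-07-05 |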
52 | Textualize/rich | Rich is a Python library for rich text and beautiful formatting in the terminal. | 48146 | 28 | 2024-07-06 |
53 | Sinaptik-AI/pandas-ai | Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG. | 11830 | 27 | 2024-07-03 |
54 | OpenBMB/MiniCPM | MiniCPM-2B: An end-side LLM outperforming Llama2-13B. | 4397 | 27 | 2024-07-04 |
55 | modelscope/DiffSynth-Studio | Enjoy the magic of Diffusion models! | 5677 | 27 | 2024-07-05 |
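56 | 1Panel-dev/MaxKB | 🚀 A knowledge-base question-answering system based on LLMs. Works out of the box, is model-neutral, supports flexible orchestration, and can be quickly embedded into third-party business systems. Officially produced by 1Panel. | 7972 | 27 | 2024-07-06 |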
57 | netease-youdao/EmotiVoice | EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine | 6631 | 27 | 2024-06-20 |
58 | eosphoros-ai/DB-GPT | AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents | 12396 | 27 | 2024-07-07 |
59 | dataelement/bisheng | Bisheng is an open LLM devops platform for next generation AI applications. | 8122 | 26 | 2024-07-06 |
60 | PaddlePaddle/PaddleOCR | Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and de ... | 40285 | 26 | 2024-07-07 |
61 | vvbbnn00/WARP-Clash-API | This project enables you to use Cloudflare WARP+ through subscription, automatically acquiring traffic. | 8193 | 26 | 2024-06-25 |
62 | OpenMOSS/MOSS | An open-source tool-augmented conversational language model from Fudan University | 11881 | 26 | 2024-05-19 |
63 | bilibili/Index-1.9B | A SOTA lightweight multilingual LLM | 686 | 25 | 2024-06-27 |
64 | THUDM/CogVLM2 | GPT4V-level open-source multi-modal model based on Llama3-8B | 1457 | 25 | 2024-07-07 |
65 | guoyww/AnimateDiff | Official implementation of AnimateDiff. | 9748 | 25 | 2024-06-01 |
66 | xinntao/Real-ESRGAN | Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration. | 26953 | 25 | 2024-07-07 |
67 | jzhang38/TinyLlama | The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. | 7301 | 24 | 2024-05-03 |
68 | deepseek-ai/DeepSeek-Coder | DeepSeek Coder: Let the Code Write Itself | 6039 | 23 | 2024-05-21 |
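69 | lss233/chatgpt-mirai-qq-bot | 🚀 One-click deployment! A true AI chatbot! Supports ChatGPT, ERNIE Bot, iFlytek Spark, Bing, Bard, ChatGLM, and POE, with multiple accounts, persona tuning, a virtual maid, image rendering, and voice messages. Works on QQ, Telegram, Discord, WeChat, and other platforms | 12527 | 22 | 2024-03-23 |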
70 | Plachtaa/VALL-E-X | An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io | 7420 | 22 | 2024-02-11 |
71 | THUDM/CodeGeeX2 | CodeGeeX2: A More Powerful Multilingual Code Generation Model | 7594 | 22 | 2024-07-07 |
72 | aixcoder-plugin/aiXcoder-7B | official repository of aiXcoder-7B Code Large Language Model | 2151 | 22 | 2024-04-22 |
73 | fishaudio/Bert-VITS2 | vits2 backbone with multilingual-bert | 7425 | 22 | 2024-07-01 |
74 | facebookresearch/nougat | Implementation of Nougat Neural Optical Understanding for Academic Documents | 8437 | 21 | 2024-04-16 |
75 | Kanaries/pygwalker | PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis | 10664 | 21 | 2024-07-07 |
76 | microsoft/DeepSpeed | DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. | 33743 | 21 | 2024-07-06 |
77 | RUCAIBox/LLMSurvey | The official GitHub page for the survey paper "A Survey of Large Language Models". | 9544 | 20 | 2024-05-19 |
78 | ymcui/Chinese-LLaMA-Alpaca-2 | Chinese LLaMA-2 & Alpaca-2 LLMs (phase two of the project) with 64K long-context models | 6994 | 20 | 2024-04-30 |
79 | TMElyralab/MuseV | MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising | 2089 | 20 | 2024-06-28 |
80 | ageitgey/face_recognition | The world's simplest facial recognition api for Python and the command line | 52359 | 20 | 2024-06-18 |
81 | THUDM/CogVLM | A state-of-the-art open visual language model (multimodal pretrained model) | 5604 | 19 | 2024-05-29 |
82 | voicepaw/so-vits-svc-fork | so-vits-svc fork with realtime support, improved interface and more features. | 8526 | 18 | 2024-07-06 |
83 | Alpha-VLLM/Lumina-T2X | Lumina-T2X is a unified framework for Text to Any Modality Generation | 1858 | 18 | 2024-07-06 |
84 | 3b1b/manim | Animation engine for explanatory math videos | 60272 | 18 | 2024-06-24 |
85 | TMElyralab/MuseTalk | MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting | 1848 | 18 | 2024-06-24 |
86 | FlagOpen/FlagEmbedding | Retrieval and Retrieval-augmented LLMs | 5934 | 17 | 2024-07-05 |
87 | xxlong0/Wonder3D | Single Image to 3D using Cross-Domain Diffusion for 3D Generation | 4513 | 17 | 2024-06-01 |
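88 | wenge-research/YAYI2 | YAYI 2 is a new-generation open-source large language model developed by Wenge Research, pretrained on more than 2 trillion tokens of high-quality multilingual data. (Repo for YaYi 2 Chinese LLMs) | 3597 | 17 | 2024-04-07 |
89 | OpenGVLab/InternVL | [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source multimodal chat model approaching GPT-4V performance | 3939 | 17 | 2024-07-05 |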
90 | OptimalScale/LMFlow | An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All. | 8120 | 17 | 2024-07-07 |
91 | lanqian528/chat2api | A service that can convert ChatGPT on the web to OpenAI API format. | 1616 | 17 | 2024-06-30 |
92 | fishaudio/fish-speech | Brand new TTS solution | 4559 | 17 | 2024-07-06 |
93 | BlinkDL/ChatRWKV | ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source. | 9334 | 17 | 2024-07-03 |
94 | google-deepmind/penzai | A JAX research toolkit for building, editing, and visualizing neural networks. | 1541 | 16 | 2024-07-07 |
95 | DachunKai/EvTexture | [ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution | 813 | 16 | 2024-07-02 |
96 | modelscope/agentscope | Start building LLM-empowered multi-agent applications in an easier way. | 2873 | 16 | 2024-07-05 |
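97 | PeterH0323/Streamer-Sales | Streamer-Sales ("Sales Champion"): a livestream-seller LLM 🛒🎁 that, given a product's features, presents the product in a way intended to spark viewers' desire to buy. 🚀⭐ Includes a detailed data-generation pipeline ❗ 📦 Also integrates LMDeploy accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital-human generation 🦸, an Agent that queries the web for real-time information 🌐, and ASR (speech-to-text) 🎙️ | 1463 | 16 | 2024-07-03 |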
98 | llmware-ai/llmware | Unified framework for building enterprise RAG pipelines with small, specialized models | 4224 | 15 | 2024-07-06 |
99 | InternLM/InternLM | Official release of InternLM2.5 7B base and chat models. 1M context support | 5698 | 15 | 2024-07-04 |
100 | FujiwaraChoki/MoneyPrinterV2 | Automate the process of making money online. | 2246 | 15 | 2024-04-17 |
101 | gradio-app/gradio | Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work! | 30758 | 15 | 2024-07-07 |
102 | aigc-apps/sd-webui-EasyPhoto | 📷 EasyPhoto Your Smart AI Photo Generator. | 4714 | 15 | 2024-06-06 |
103 | luosiallen/latent-consistency-model | Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference | 4197 | 15 | 2024-06-14 |
104 | tyxsspa/AnyText | Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing> | 4018 | 14 | 2024-06-21 |
105 | OpenEthan/SMSBoom | SMSBoom - Deprecated: for legal reasons, the repository has been suspended! | 15373 | 14 | 2024-03-20 |
106 | RayVentura/ShortGPT | 🚀🎬 ShortGPT - Experimental AI framework for youtube shorts / tiktok channel automation | 5268 | 14 | 2024-02-17 |
107 | X-PLUG/MobileAgent | Mobile-Agent: The Powerful Mobile Device Operation Assistant Family | 2308 | 14 | 2024-07-01 |
108 | dyang886/Game-Cheats-Manager | Easily download and manage game cheats for your convenience | 2582 | 14 | 2024-06-13 |
109 | QwenLM/Qwen-VL | The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud. | 4274 | 13 | 2024-05-28 |
110 | linyiLYi/street-fighter-ai | This is an AI agent for Street Fighter II Champion Edition. | 6253 | 13 | 2024-05-14 |
111 | open-mmlab/mmdetection | OpenMMLab Detection Toolbox and Benchmark | 28522 | 13 | 2024-07-07 |
112 | barry-far/V2ray-Configs | 🛰️✨ Free V2ray Configs , Updating Every 10 minutes. | 3730 | 13 | 2024-07-07 |
113 | SunoAI-API/Suno-API | This is an unofficial Suno AI API based on Python and FastAPI. It currently supports generating songs, lyrics, etc.👇 | 1314 | 13 | 2024-06-14 |
114 | MustardChef/WSABuilds | Run Windows Subsystem For Android on your Windows 10 and Windows 11 PC using prebuilt binaries with Google Play Store (MindTheGapps) and/or Magisk or KernelSU (root solutions) built in. | 6974 | 13 | 2024-07-05 |
115 | taosdata/TDengine | TDengine is an open source, high-performance, cloud native time-series database optimized for Internet of Things (IoT), Connected Cars, Industrial IoT and DevOps. | 23013 | 13 | 2024-07-07 |
116 | moesnow/March7thAssistant | March 7th Assistant: full automation for Honkai: Star Rail | 4102 | 13 | 2024-07-06 |
117 | jxxghp/MoviePilot | Automated management tool for NAS media libraries | 5432 | 13 | 2024-07-07 |
118 | WZMIAOMIAO/deep-learning-for-image-processing | deep learning for image processing including classification and object-detection etc. | 21649 | 13 | 2024-07-07 |
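119 | xaoyaoo/PyWxDump | Retrieve WeChat account information (nickname/account/phone/email/database key/wxid); scripts for reading and decrypting the PC WeChat database; a chat history viewer; export of chat history to HTML (including voice messages and images). Supports retrieving information for multiple accounts and supports all WeChat versions. | 4295 | 13 | 2024-07-07 |
120 | 521xueweihan/GitHub520 | 😘 Makes you "love" GitHub by fixing broken images and slow loading when you access it. (No installation required) | 20421 | 13 | 2024-07-07 |
121 | Aabyss-Team/ARL | Backup of the official ARL repository: ARL (Asset Reconnaissance Lighthouse) aims to quickly discover internet assets associated with a target and build a baseline asset information database. It helps in-house security teams and penetration testers reconnoiter and search assets effectively and find weak points and attack surfaces. | 703 | 13 | 2024-05-29 |
122 | YaoFANGUK/video-subtitle-remover | AI-based tool for removing hard-coded subtitles and text-like watermarks from images and videos, producing output files at the original resolution. Runs locally, with no third-party API required. | 3155 | 12 | 2024-07-06 |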
123 | yihong0618/xiaogpt | Play ChatGPT and other LLM with Xiaomi AI Speaker | 5859 | 12 | 2024-05-22 |
124 | THUDM/CodeGeeX | CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023) | 7950 | 12 | 2024-07-07 |
125 | OpenGVLab/LLaMA-Adapter | [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters | 5615 | 12 | 2024-03-14 |
126 | reorx/awesome-chatgpt-api | Curated list of apps and tools that not only use the new ChatGPT API, but also allow users to configure their own API keys, enabling free and on-demand usage of their own quota. | 5787 | 12 | 2024-05-18 |
127 | JadyXuan/NTTS | NO TIME TO SLEEP | 627 | 11 | 2024-05-26 |
128 | yangjianxin1/Firefly | Firefly: a large-model training tool that supports training Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models | 5232 | 11 | 2024-06-07 |
129 | BlinkDL/RWKV-LM | RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, sa ... | 11972 | 11 | 2024-07-04 |
130 | yl4579/StyleTTS2 | StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models | 4409 | 11 | 2024-06-13 |
131 | xlang-ai/OpenAgents | OpenAgents: An Open Platform for Language Agents in the Wild | 3741 | 11 | 2024-05-28 |
132 | qnguyen3/chat-with-mlx | An all-in-one LLMs Chat UI for Apple Silicon Mac using MLX Framework. | 1397 | 10 | 2024-07-03 |
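133 | MzeroMiko/VMamba | VMamba: Visual State Space Models; code is based on mamba | 1839 | 10 | 2024-06-28 |
134 | ihmily/DouyinLiveRecorder | Livestream recording software that supports looped monitoring and recording of multiple streamers, covering Douyin, TikTok, Kuaishou, Huya, Douyu, Bilibili, Xiaohongshu, pandatv, afreecatv, flextv, popkontv, twitcasting, winktv, Baidu, Weibo, Kugou, Huajiao, Liuxing, Twitch, and other platforms | 3716 | 10 | 2024-07-07 |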
135 | thuml/Time-Series-Library | A Library for Advanced Deep Time Series Models. | 5086 | 10 | 2024-07-07 |
136 | XPixelGroup/DiffBIR | Official codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior | 3134 | 10 | 2024-07-03 |
137 | mli/autocut | Cut videos with a text editor | 6407 | 10 | 2024-04-16 |
138 | eeeeeeeeee-code/e0e1-wx | Automated assistance for penetration testing of WeChat mini programs | 589 | 10 | 2024-06-07 |
139 | TencentARC/BrushNet | [ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion" | 1161 | 10 | 2024-07-01 |
140 | cubiq/ComfyUI_IPAdapter_plus | - | 3192 | 10 | 2024-06-28 |
141 | modelscope/modelscope | ModelScope: bring the notion of Model-as-a-Service to life. | 6463 | 9 | 2024-07-04 |
142 | PKU-YuanGroup/MoE-LLaVA | Mixture-of-Experts for Large Vision-Language Models | 1831 | 9 | 2024-05-15 |
143 | AutoGPTQ/AutoGPTQ | An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm. | 4075 | 9 | 2024-07-07 |
144 | THUDM/VisualGLM-6B | Chinese and English multimodal conversational language model | 4035 | 9 | 2024-06-28 |
145 | pkuliyi2015/multidiffusion-upscaler-for-automatic1111 | Tiled Diffusion and VAE optimize, licensed under CC BY-NC-SA 4.0 | 4585 | 9 | 2024-07-06 |
146 | Plachtaa/VITS-fast-fine-tuning | This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion | 4638 | 9 | 2024-07-03 |
147 | friuns2/Leaked-GPTs | Leaked GPTs Prompts Bypass the 25 message limit or to try out GPTs without a Plus subscription. | 1935 | 9 | 2024-01-17 |
148 | hankcs/HanLP | Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification | 32987 | 9 | 2024-07-07 |
149 | recommenders-team/recommenders | Best Practices on Recommendation Systems | 18414 | 9 | 2024-07-07 |
150 | hitsz-ids/synthetic-data-generator | SDG is a specialized framework designed to generate high-quality structured tabular data. | 3005 | 9 | 2024-07-05 |
151 | Tele-AI/Telechat | - | 1678 | 9 | 2024-07-01 |
152 | madawei2699/myGPTReader | A community-driven way to read and chat with AI bots - powered by chatGPT. | 4420 | 9 | 2024-04-25 |
153 | PaddlePaddle/PaddleNLP | 👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ... | 11728 | 9 | 2024-07-07 |
154 | z1069614715/objectdetection_script | Code and improvement ideas for object-detection scripts; see readme.md for details | 4746 | 9 | 2024-07-07 |
155 | InternLM/xtuner | An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...) | 3203 | 9 | 2024-07-04 |
156 | xorbitsai/inference | Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any ... | 3555 | 9 | 2024-07-06 |
157 | QwenLM/Qwen-Agent | Agent framework and applications built upon Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension. | 2562 | 9 | 2024-07-05 |
158 | continue-revolution/sd-webui-animatediff | AnimateDiff for AUTOMATIC1111 Stable Diffusion WebUI | 2929 | 8 | 2024-06-23 |
159 | kohya-ss/sd-scripts | - | 4491 | 8 | 2024-07-05 |
160 | malinkang/weread2notion-pro | - | 1483 | 8 | 2024-07-07 |
161 | tgbot-collection/YYeTsBot | 🎬 Bot and website for YYeTs (人人影视), containing all YYeTs resources as well as many cloud-drive shares from users | 14025 | 8 | 2024-05-22 |
162 | ali-vilab/dreamtalk | Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models | 1480 | 8 | 2024-01-15 |
163 | JaveleyQAQ/WeChatOpenDevTools-Python | WeChatOpenDevTool: force-enable the developer tools in WeChat mini programs | 1381 | 8 | 2024-06-07 |
164 | ParthJadhav/Tkinter-Designer | An easy and fast way to create a Python GUI 🐍 | 8627 | 8 | 2024-07-01 |
165 | open-compass/opencompass | OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc) over 100+ datasets. | 3191 | 8 | 2024-07-05 |
166 | tech-shrimp/WechatMoments | WeChat Moments export tool, by Tech Shrimp (技术爬爬虾) | 790 | 8 | 2024-06-18 |
167 | CHNZYX/Auto_Simulated_Universe | Automation for the Simulated Universe in Honkai: Star Rail (Honkai Star Rail - Auto Simulated Universe) | 3432 | 8 | 2024-07-06 |
168 | EstrellaXD/Auto_Bangumi | AutoBangumi - a fully automated tool for following anime series | 6203 | 8 | 2024-07-04 |
169 | RUC-NLPIR/FlashRAG | ⚡FlashRAG: A Python Toolkit for Efficient RAG Research | 894 | 8 | 2024-07-06 |
170 | InternLM/lmdeploy | LMDeploy is a toolkit for compressing, deploying, and serving LLMs. | 3202 | 8 | 2024-07-07 |
171 | jhao104/proxy_pool | Python ProxyPool for web spider | 20870 | 8 | 2024-06-17 |
172 | yerfor/GeneFacePlusPlus | GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code | 1309 | 8 | 2024-06-05 |
173 | sml2h3/ddddocr | ddddocr (带带弟弟): general-purpose CAPTCHA recognition OCR, PyPI version | 8978 | 8 | 2024-07-05 |
174 | clovaai/donut | Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022 | 5515 | 8 | 2024-06-10 |
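175 | rev1si0n/lamda | ⚡️ Android reverse engineering & automation framework. Billed as the most powerful Android framework for packet capture, reverse engineering, hooking, cloud phones, remote desktop, and automated forensics; your work has never been this simple and fast. | 5673 | 8 | 2024-05-05 |
176 | mini-sora/minisora | MiniSora: A community that aims to explore the implementation path and future development direction of Sora. | 1103 | 8 | 2024-06-01 |
177 | PaddlePaddle/Paddle | PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (core framework of PaddlePaddle "飞桨": high-performance single-machine and distributed training and cross-platform deployment for deep learning & machine learning) | 21793 | 8 | 2024-07-07 |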
178 | TencentQQGYLab/ELLA | ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment | 981 | 8 | 2024-06-14 |
179 | Ucas-HaoranWei/Vary | [ECCV2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models. | 1646 | 8 | 2024-07-02 |
180 | modelscope/FunASR | A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc. | 4580 | 8 | 2024-07-05 |
181 | PeterL1n/RobustVideoMatting | Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML! | 8345 | 8 | 2024-04-02 |
182 | EvolvingLMMs-Lab/lmms-eval | Accelerating the development of large multimodal models (LMMs) with lmms-eval | 1039 | 8 | 2024-07-07 |
183 | Akegarasu/lora-scripts | LoRA & Dreambooth training scripts & GUI use kohya-ss's trainer, for diffusion model. | 4064 | 8 | 2024-06-30 |
184 | emcf/thepipe | Extract markdown and images from URLs, PDFs, docs, slides, and more, ready for multimodal LLMs. ⚡ | 814 | 8 | 2024-07-06 |
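185 | Evil0ctal/Douyin_TikTok_Download_API | 🚀 "Douyin_TikTok_Download_API" is an out-of-the-box, high-performance asynchronous data scraping tool for Douyin, Kuaishou, TikTok, and Bilibili, supporting API calls and online batch parsing and downloading. | 7737 | 8 | 2024-07-07 |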
186 | kwuking/TimeMixer | [ICLR 2024] Official implementation of "TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting" | 1049 | 8 | 2024-07-04 |
187 | yuka-friends/Windrecorder | Windrecorder is a memory search app that records everything on your screen at a small file size, letting you rewind what you have seen, query through OCR text or image descriptions, and get activity statistics. | 2699 | 8 | 2024-07-07 |
188 | wenge-research/YAYI | YAYI large model: builds secure and reliable dedicated large models for customers. A series of LlaMA 2 & BLOOM models trained on large-scale Chinese and English multi-domain instruction data, developed by the Wenge Research algorithm team. (Repo for YaYi Chinese LLMs based on LlaMA2 & BLOOM) | 3241 | 8 | 2024-01-17 |
189 | fxsjy/jieba | Jieba (结巴) Chinese word segmentation | 32789 | 8 | 2024-03-18 |
190 | CVHub520/X-AnyLabeling | Effortless data labeling with AI support from Segment Anything and other awesome models. | 3068 | 7 | 2024-07-07 |
191 | aigc-apps/EasyAnimate | 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion | 659 | 7 | 2024-07-06 |
192 | Langboat/Mengzi3 | - | 704 | 7 | 2024-06-28 |
193 | modelscope/FunClip | Open-source, accurate and easy-to-use video speech recognition & clipping tool, with LLM-based AI clipping integrated. | 2717 | 7 | 2024-07-04 |
194 | InternLM/InternLM-XComposer | InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output | 2094 | 7 | 2024-07-05 |
195 | sqlmapproject/sqlmap | Automatic SQL injection and database takeover tool | 31271 | 7 | 2024-06-28 |
196 | DeepInsight-AI/DeepBI | LLM based data scientist, AI native data application. AI-driven infinite thinking redefines BI. | 1592 | 7 | 2024-07-04 |
197 | modelscope/swift | ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4, Internlm2.5, Yi, Llama3, Llava, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...) | 2246 | 7 | 2024-07-07 |
198 | project-baize/baize-chatbot | Let ChatGPT teach your own chatbot in hours with a single GPU! | 3148 | 7 | 2024-03-17 |
199 | OpenGVLab/InternGPT | InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, ... | 3166 | 7 | 2024-06-09 |
200 | continue-revolution/sd-webui-segment-anything | Segment Anything for Stable Diffusion WebUI | 3298 | 7 | 2024-04-30 |
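↓ -- Thanks to our readers -- ↓
This list is updated continuously. If you find it helpful, please star it so you can easily come back to it later. Thank you for your support!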