πΒ A ranked list of awesome projects. Updated weekly.
This curated list contains 24 awesome open-source projects with a total of 280K stars grouped into 9 categories. All projects are ranked by a project-quality score, which is calculated based on various metrics automatically collected from GitHub and different package managers. If you like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome!
π§ββοΈΒ Discover other best-of lists or create your own.
- Data Preprocessing Tools 0 projects
- Data visualization tools 4 projects
- Data labelling tools 1 projects
- Data pipelining tools for machine learning 7 projects
- Data versioning tools 1 projects
- MLOps end-to-end platforms 1 projects
- MLOps experimentation tracking tools 3 projects
- Model Serving 6 projects
- Model Training Tools 1 projects
- π₯π₯π₯Β Combined project-quality score
- βοΈΒ Star count from GitHub
- π£Β New project (less than 6 months old)
- π€Β Inactive project (6 months no activity)
- πΒ Dead project (12 months no activity)
- ππΒ Project is trending up or down
- βΒ Project was recently added
- βοΈΒ Warning (e.g. missing/risky license)
- π¨βπ»Β Contributors count from GitHub
- πΒ Fork count from GitHub
- πΒ Issue count from GitHub
- β±οΈΒ Last update timestamp on package manager
- π₯Β Download count from package manager
- π¦Β Number of dependent projects
- Β Python projects
- Β C/C++ projects
- Β Uses/supports Tensorflow
- Β Uses/supports PyTorch
- Β Uses/supports ONNX (Open Neural Network Exchange)
Tools for data cleaning, transformation, and enrichment.
Tools for creating high-level visualizations
Gradio (π₯36 Β· β 25K) - Build and share delightful machine learning apps, all in Python... Apache-2
-
GitHub (π¨βπ» 240 Β· π 1.8K Β· π¦ 21K Β· π 3.5K - 12% open Β· β±οΈ 11.01.2024):
git clone https://github.com/gradio-app/gradio
Dash (π₯32 Β· β 20K Β· π) - Data Apps & Dashboards for Python. No JavaScript Required. MIT
-
GitHub (π¨βπ» 140 Β· π 1.9K Β· π₯ 53 Β· π¦ 57K Β· π 1.6K - 47% open Β· β±οΈ 09.01.2024):
git clone https://github.com/plotly/dash
Streamlit (π₯29 Β· β 30K) - Streamlit A faster way to build and share data apps. Apache-2
-
GitHub (π¨βπ» 220 Β· π 2.6K Β· π 3.8K - 17% open Β· β±οΈ 11.01.2024):
git clone https://github.com/streamlit/streamlit
Show 1 hidden projects...
- voila (π₯28 Β· β 5K) - Voil turns Jupyter notebooks into standalone web applications.
βUnlicensed
Tools for data labelling
Show 1 hidden projects...
Frameworks that allow the creation, management, and optimization of complex data pipelines in a machine learning environment.
Airflow (π₯43 Β· β 33K) - Apache Airflow - A platform to programmatically author, schedule, and.. Apache-2
-
GitHub (π¨βπ» 3.1K Β· π 13K Β· π₯ 580K Β· π¦ 9.1K Β· π 8.4K - 8% open Β· β±οΈ 11.01.2024):
git clone https://github.com/apache/airflow
Argo (π₯39 Β· β 14K) - Workflow Engine for Kubernetes. Apache-2
-
GitHub (π¨βπ» 830 Β· π 2.9K Β· π₯ 3.4M Β· π¦ 230 Β· π 5.5K - 16% open Β· β±οΈ 11.01.2024):
git clone https://github.com/argoproj/argo-workflows
dagster (π₯35 Β· β 9.4K) - An orchestration platform for the development, production, and.. Apache-2
-
GitHub (π¨βπ» 380 Β· π 1.1K Β· π¦ 1.8K Β· π 6.4K - 29% open Β· β±οΈ 11.01.2024):
git clone https://github.com/dagster-io/dagster
Kubeflow (Kubeflow Pipelines) (π₯33 Β· β 13K) - Machine Learning Toolkit for Kubernetes. Apache-2
-
GitHub (π¨βπ» 300 Β· π 2.1K Β· π₯ 79K Β· π¦ 41 Β· π 3.8K - 5% open Β· β±οΈ 19.12.2023):
git clone https://github.com/kubeflow/kubeflow
Luigi (π₯32 Β· β 17K Β· π) - Luigi is a Python module that helps you build complex pipelines of.. Apache-2
-
GitHub (π¨βπ» 610 Β· π 2.3K Β· π¦ 2.3K Β· π 980 - 9% open Β· β±οΈ 08.01.2024):
git clone https://github.com/spotify/luigi
Pachyderm (π₯31 Β· β 6K) - Data-Centric Pipelines and Data Versioning. Apache-2
-
GitHub (π¨βπ» 190 Β· π 560 Β· π₯ 26K Β· π 3.1K - 22% open Β· β±οΈ 11.01.2024):
git clone https://github.com/pachyderm/pachyderm
Metaflow (π₯27 Β· β 7.3K Β· π) - Build and manage real-life data science projects with ease!. Apache-2
-
GitHub (π¨βπ» 77 Β· π 650 Β· π¦ 580 Β· π 590 - 41% open Β· β±οΈ 11.01.2024):
git clone https://github.com/Netflix/metaflow
Tools for data (and model) versioning
dvc (Data Version Control) (π₯38 Β· β 13K) - ML Experiments Management with Git. Apache-2
-
GitHub (π¨βπ» 290 Β· π 1.1K Β· π₯ 45K Β· π¦ 12K Β· π 4.6K - 11% open Β· β±οΈ 11.01.2024):
git clone https://github.com/iterative/dvc
MLOps platforms that covers all the machine learning lifecycle.
Kubeflow (π₯33 Β· β 13K) - Machine Learning Toolkit for Kubernetes. Apache-2
-
GitHub (π¨βπ» 300 Β· π 2.1K Β· π₯ 79K Β· π¦ 41 Β· π 3.8K - 5% open Β· β±οΈ 19.12.2023):
git clone https://github.com/kubeflow/kubeflow
Tools for managing machine learning experimentation/workflows
MLFlow (Tracking) (π₯37 Β· β 16K) - Open source platform for the machine learning lifecycle. Apache-2
-
GitHub (π¨βπ» 680 Β· π 3.6K Β· π¦ 29K Β· π 3.4K - 31% open Β· β±οΈ 11.01.2024):
git clone https://github.com/mlflow/mlflow
ClearML (π₯30 Β· β 5K) - ClearML - Auto-Magical CI/CD to streamline your ML workflow... Apache-2
-
GitHub (π¨βπ» 88 Β· π 610 Β· π₯ 1.9K Β· π¦ 890 Β· π 930 - 44% open Β· β±οΈ 10.01.2024):
git clone https://github.com/allegroai/clearml
sacred (π₯26 Β· β 4.1K) - Sacred is a tool to help you configure, organize, log and reproduce.. MIT
-
GitHub (π¨βπ» 100 Β· π 360 Β· π¦ 2.8K Β· π 560 - 17% open Β· β±οΈ 13.11.2023):
git clone https://github.com/IDSIA/sacred
Tools for model serving
KServe (π₯34 Β· β 2.7K) - Standardized Serverless ML Inference Platform on.. Apache-2
-
GitHub (π¨βπ» 220 Β· π 830 Β· π₯ 870K Β· π¦ 300 Β· π 1.6K - 19% open Β· β±οΈ 10.01.2024):
git clone https://github.com/kserve/kserve
TensorFlow Serving (π₯32 Β· β 6K) - A flexible, high-performance serving system for.. Apache-2
-
GitHub (π¨βπ» 220 Β· π 2.1K Β· π¦ 22 Β· π 1.4K - 3% open Β· β±οΈ 11.01.2024):
git clone https://github.com/tensorflow/serving
BentoML (π₯31 Β· β 6.1K) - Build Production-Grade AI Applications. Apache-2
-
GitHub (π¨βπ» 190 Β· π 680 Β· π₯ 2.1K Β· π¦ 1.5K Β· π 970 - 18% open Β· β±οΈ 11.01.2024):
git clone https://github.com/bentoml/BentoML
TorchServe (π₯31 Β· β 3.8K) - Serve, optimize and scale PyTorch models in production. Apache-2
-
GitHub (π¨βπ» 180 Β· π 780 Β· π₯ 5.5K Β· π¦ 580 Β· π 1.5K - 20% open Β· β±οΈ 09.01.2024):
git clone https://github.com/pytorch/serve
Triton Inference Server (π₯29 Β· β 6.7K) - The Triton Inference Server provides an optimized.. BSD-3
-
GitHub (π¨βπ» 110 Β· π 1.3K Β· π₯ 210K Β· π 3.3K - 10% open Β· β±οΈ 11.01.2024):
git clone https://github.com/triton-inference-server/server
Seldon (π₯28 Β· β 4.1K Β· π) - An MLOps framework to package, deploy, monitor and.. Apache-2
-
GitHub (π¨βπ» 200 Β· π 790 Β· π₯ 810 Β· π¦ 530 Β· π 2.3K - 4% open Β· β±οΈ 09.10.2023):
git clone https://github.com/SeldonIO/seldon-core
Tools for training machine learning models.
TFX (π₯25 Β· β 2K) - TFX is an end-to-end platform for deploying production ML pipelines. Apache-2
-
GitHub (π¨βπ» 180 Β· π 670 Β· π 880 - 3% open Β· β±οΈ 11.01.2024):
git clone https://github.com/tensorflow/tfx
- Best-of lists: Discover other best-of lists with awesome open-source projects on all kinds of topics.
- Awesome MLOps: An awesome list of references for MLOps - Machine Learning Operations: ml-ops.org
- Best-of Machine Learning with Python: A ranked list of awesome machine learning Python libraries. Updated weekly.
Contributions are encouraged and always welcome! If you like to add or update projects, choose one of the following ways:
- Open an issue by selecting one of the provided categories from the issue page and fill in the requested information.
- Modify the projects.yaml with your additions or changes, and submit a pull request. This can also be done directly via the Github UI.
If you like to contribute to or share suggestions regarding the project metadata collection or markdown generation, please refer to the best-of-generator repository. If you like to create your own best-of list, we recommend to follow this guide.
For more information on how to add or update projects, please read the contribution guidelines. By participating in this project, you agree to abide by its Code of Conduct.