Documentation | Stackable Data Platform | Platform Docs | Discussions | Discord
This is a Kubernetes operator to manage Apache Hadoop.
It is part of the Stackable Data Platform, a curated selection of the best open source data apps like Apache Kafka, Apache Druid, Trino or Apache Spark, all working together seamlessly. Based on Kubernetes, it runs everywhere – on prem or in the cloud.
You can install the operator using stackablectl or helm.
Read on to get started with it, or see it in action in one of our demos.
You can follow this tutorial.
The stable documentation for this operator can be found here. If you are interested in the most recent state of this repository, check out the nightly docs instead.
The documentation for all Stackable products can be found at docs.stackable.tech.
If you have a question about the Stackable Data Platform contact us via our homepage or ask a public questions in our Discussions forum.
This operator is written and maintained by Stackable and it is part of a larger data platform.
Stackable makes it easy to operate data applications in any Kubernetes cluster.
The data platform offers many operators, new ones being added continuously. All our operators are designed and built to be easily interconnected and to be consistent to work with.
The Stackable GmbH is the company behind the Stackable Data Platform. Offering professional services, paid support plans and custom development.
We love open-source!
We develop and test our operators on the following cloud platforms:
- AKS on Microsoft Azure
- EKS on Amazon Web Services (AWS)
- GKE on Google Cloud Platform (GCP)
- IONOS Cloud Managed Kubernetes
- K3s
- Kubernetes (for an up to date list of supported versions please check the release notes in our docs)
- Red Hat OpenShift
These are the operators that are currently part of the Stackable Data Platform:
- Stackable Operator for Apache Airflow
- Stackable Operator for Apache Druid
- Stackable Operator for Apache HBase
- Stackable Operator for Apache Hadoop HDFS
- Stackable Operator for Apache Hive
- Stackable Operator for Apache Kafka
- Stackable Operator for Apache NiFi
- Stackable Operator for Apache Spark
- Stackable Operator for Apache Superset
- Stackable Operator for Trino
- Stackable Operator for Apache ZooKeeper
And our internal operators:
Contributions are welcome. Follow our Contributors Guide to learn how you can contribute. All contributors will have to sign a Contributor License Agreement. This is enforced automatically when you submit a Pull Request where a bot will guide you through the process.
Open Software License version 3.0.
Get started with the community edition! If you want professional support, we offer subscription plans and custom licensing.