pangeo-forge is an open-source tool designed to aid the extraction, transformation, and loading of datasets. The goal of pangeo-forge is to make it easy to extract datasets from traditional data repositories and deposit them into cloud object storage in analysis-ready, cloud-optimized format.
pangeo-forge is inspired by conda-forge, a community-led collection of recipes for building Conda packages. We hope that pangeo-forge can play the same role for datasets.
More can be learned about pangeo-forge, its progress, and related subprojects in its official documentation.
pangeo-forge is still early in development - there are several ways to contribute:
- Create a recipe for a dataset you are interested in
- Open an issue or pull request here or in any of the related subprojects (pangeo-smithy, staged-recipes)
- Check out the project roadmap
Discussions on Pangeo Forge are generally hosted biweekly on Mondays at 2pm ET. Calendar link here. We aim to announce cancellations on this discourse thread.
This project is licensed under the Apache License, Version 2.0.