
Automate and document the workflow for reproducibility. Large datasets have to move around a lot, and a shared workflow can facilitate collaboration because you don’t have to reinvent the wheel for every project #10

Open
rndsrc opened this issue May 15, 2019 · 0 comments


rndsrc commented May 15, 2019

SOP:

  • Develop a prototype workflow using the test dataset.
  • Refine and adjust the workflow as the “real” data is generated.
  • Define what can and cannot change in the data format; changes to the format can be iterative but need to be communicated.
  • Empower the data scientists to suggest changes to the data format where needed.
  • Allow for scaling and built-in flexibility when building data science workflows in transdisciplinary science.
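
One way to make "what can change and what can't" concrete is to write the format down as a small, versioned spec and check records against it. The following is only a sketch under assumptions: `FormatSpec`, `check_record`, and the field names (`timestamp`, `station`, `notes`) are hypothetical and not part of this project.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FormatSpec:
    """Hypothetical description of a data format at one version."""
    version: str
    fixed_fields: frozenset                   # fields that must always be present
    flexible_fields: frozenset = frozenset()  # fields allowed to come and go


def check_record(record: dict, spec: FormatSpec) -> list:
    """Return a list of problems; an empty list means the record conforms."""
    keys = set(record)
    problems = []
    # Anything declared fixed must be present in every record.
    for name in sorted(spec.fixed_fields - keys):
        problems.append(f"missing fixed field: {name}")
    # Fields outside the spec are flagged so the change gets communicated.
    for name in sorted(keys - spec.fixed_fields - spec.flexible_fields):
        problems.append(f"unknown field (propose a format change): {name}")
    return problems


# Example spec: field names are made up for illustration.
spec = FormatSpec(
    version="0.1",
    fixed_fields=frozenset({"timestamp", "station"}),
    flexible_fields=frozenset({"notes"}),
)
```

Running `check_record` on both the test dataset and the "real" data would surface format drift early, and bumping `version` whenever the spec changes gives collaborators a single place to see what was agreed.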