Moving data between Data Product versions #1690

murdo-moj · 2023-09-27T13:42:02Z

murdo-moj
Sep 27, 2023
Collaborator

User story

As a data consumer
I want as much historical data as possible in my data product as it gets versioned
So I can produce reports or analyse trends

Scenario

A data product maintainer has deleted a column from an existing table schema in a data product. This has created a new major version of the data product. The new version will initially have no associated data.

Major versions of a data product have different tables so that breaking changes do not silently impact the consumers of that data product. Data consumers are expected to update their pipelines to consume the latest data product and update their processes accordingly.
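To make the scenario concrete, here is a minimal sketch of what carrying historical data forward could look like when the only breaking change is a deleted column. It is an illustration under stated assumptions, not the platform's implementation: the function name `carry_forward`, the column names, and the in-memory list-of-dicts representation are all hypothetical, and a real transfer would run against the underlying tables.

```python
def carry_forward(rows, new_columns):
    """Project rows from the previous major version onto the new schema.

    Assumes (hypothetically) that the new schema is a subset of the old
    one, i.e. the only breaking change is one or more deleted columns.
    """
    return [{col: row[col] for col in new_columns} for row in rows]


# Hypothetical v1 data: "legacy_flag" is the column the maintainer deleted.
v1_rows = [
    {"id": 1, "name": "a", "legacy_flag": True},
    {"id": 2, "name": "b", "legacy_flag": False},
]

# Hypothetical v2 schema after the deletion.
v2_columns = ["id", "name"]

# The new major version starts with the historical rows, minus the column.
v2_rows = carry_forward(v1_rows, v2_columns)
```

This sketch only covers column deletion. Other breaking changes, such as a type change or a newly added required column, would need a decision about how to fill or convert the historical values.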

Talking points

Would you expect this behaviour when a new data product version is created?
Should the data transfer be optional?
There's potentially some overlap with using Iceberg, which would change some of the assumptions here, namely whether we need separate tables to version data product tables.

murdo-moj · 2023-10-25T08:41:22Z

murdo-moj
Oct 25, 2023
Collaborator Author

On 24 Oct we agreed to go ahead with the suggested solution.
