HDFS/Downloads Table Incremental Build #1078

Open
fmendezh opened this issue Jun 27, 2024 · 0 comments
Labels: research (research, evaluation or proof of concept)

Comments

@fmendezh
Contributor

TableBackfill supports creating tables partitioned by datasetKey, and Apache Iceberg supports atomically updating or replacing table partitions: INSERT OVERWRITE TABLE occurrence PARTITION (dataset_key).
This feature can be used to keep the download table up to date incrementally, while the current table build process is kept for disaster recovery and data synchronisation.
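
For illustration, a minimal sketch of what a per-dataset partition replacement could look like in Spark SQL on an Iceberg table. This is not the project's actual implementation: the staging table occurrence_staging and the dataset key value are hypothetical placeholders, and the sketch assumes the staging table has the same columns, in the same order, as occurrence.

```sql
-- Sketch only: assumes Spark SQL with an Iceberg catalog configured.
-- occurrence_staging and the key below are hypothetical placeholders.

-- Dynamic mode replaces only the partitions present in the SELECT result;
-- Spark's default (static) mode would overwrite more than intended.
SET spark.sql.sources.partitionOverwriteMode = dynamic;

-- Atomically replace the partition of a single dataset.
INSERT OVERWRITE TABLE occurrence PARTITION (dataset_key)
SELECT *
FROM occurrence_staging
WHERE dataset_key = '00000000-0000-0000-0000-000000000000';
```

Because Iceberg commits the overwrite as a single snapshot, readers of the download table would see either the old partition or the new one, never a partial write.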

@fmendezh fmendezh added the research (research, evaluation or proof of concept) label Jun 27, 2024
fmendezh added a commit to gbif/occurrence that referenced this issue Aug 16, 2024
fmendezh added a commit that referenced this issue Sep 12, 2024
fmendezh added a commit that referenced this issue Sep 13, 2024
fmendezh added a commit that referenced this issue Sep 13, 2024
…rpretation for that table is not requested
fmendezh added a commit that referenced this issue Sep 13, 2024
fmendezh added a commit that referenced this issue Sep 20, 2024
adding CLI components for incremental table build using iceberg
fmendezh added a commit that referenced this issue Sep 23, 2024
routing was using the wrong message
fmendezh added a commit that referenced this issue Sep 23, 2024
fmendezh added a commit to gbif/gbif-api that referenced this issue Sep 24, 2024
fmendezh added a commit that referenced this issue Sep 24, 2024
changing step type for the data warehouse cli
fmendezh added a commit that referenced this issue Sep 24, 2024
changing step type for the data warehouse cli
fmendezh added a commit that referenced this issue Sep 24, 2024
fmendezh added a commit that referenced this issue Sep 24, 2024
fmendezh added a commit that referenced this issue Sep 24, 2024
fmendezh added a commit to gbif/registry that referenced this issue Sep 25, 2024
fmendezh added a commit to gbif/registry that referenced this issue Sep 25, 2024
fmendezh added a commit to gbif/postal-service that referenced this issue Sep 26, 2024
fmendezh added a commit that referenced this issue Sep 26, 2024
moving data warehouse message to postal service
fmendezh added a commit to gbif/registry that referenced this issue Sep 26, 2024
adding data warehouse message processing
fmendezh added a commit to gbif/postal-service that referenced this issue Sep 26, 2024
fmendezh added a commit to gbif/postal-service that referenced this issue Sep 26, 2024
fmendezh added a commit that referenced this issue Oct 11, 2024
fmendezh added a commit that referenced this issue Oct 11, 2024

2 participants