feat: Added support to write iceberg tables #5989

Open
wants to merge 48 commits into main
Conversation

@malhotrashivam (Contributor) commented Aug 26, 2024

Closes: #6125
Should be merged after #6156

Also moves existing Iceberg tests from JUnit 4 to JUnit 5.

@malhotrashivam added the parquet (Related to the Parquet integration), DocumentationNeeded, ReleaseNotesNeeded (Release notes are needed), s3, and iceberg labels Aug 26, 2024
@malhotrashivam added this to the 0.37.0 milestone Aug 26, 2024
@malhotrashivam self-assigned this Aug 26, 2024
@malhotrashivam marked this pull request as draft September 6, 2024
@malhotrashivam changed the title from "feat: [DO NOT MERGE] Added support to write iceberg tables" to "feat: Added support to write iceberg tables" Sep 6, 2024
@malhotrashivam marked this pull request as ready for review October 1, 2024
@@ -33,95 +36,18 @@
*/
public abstract class ParquetInstructions implements ColumnToCodecMappings {

private static volatile String defaultCompressionCodecName = CompressionCodecName.SNAPPY.toString();
Contributor Author:
Removing unnecessary configuration parameters.

Member:
In general this seems like an improvement; the old code was adding little value for the complexity. What about enterprise usages?

@@ -433,6 +382,14 @@ public boolean useDictionary() {
public void useDictionary(final boolean useDictionary) {
this.useDictionary = useDictionary;
}

public OptionalInt getFieldId() {
Contributor Author:
The field-ID-related logic may change when #6156 gets merged.

maximum_dictionary_size: Optional[int] = None,
target_page_size: Optional[int] = None,
verify_schema: Optional[bool] = None,
dh_to_iceberg_column_renames: Optional[Dict[str, str]] = None,
Member:
name is very long, especially if a user is specifying it. Any reason it can't just be column_renames?

Member:
you should also look through the rest of the API to see if column_renames or col_renames would be most consistent. I would guess col_renames.

Comment on lines 197 to 220
if compression_codec_name is not None:
builder.compressionCodecName(compression_codec_name)

if maximum_dictionary_keys is not None:
builder.maximumDictionaryKeys(maximum_dictionary_keys)

if maximum_dictionary_size is not None:
builder.maximumDictionarySize(maximum_dictionary_size)

if target_page_size is not None:
builder.targetPageSize(target_page_size)

if verify_schema is not None:
builder.verifySchema(verify_schema)

if dh_to_iceberg_column_renames is not None:
for dh_name, iceberg_name in dh_to_iceberg_column_renames.items():
builder.putDhToIcebergColumnRenames(dh_name, iceberg_name)

if table_definition is not None:
builder.tableDefinition(TableDefinition(table_definition).j_table_definition)

if data_instructions is not None:
builder.dataInstructions(data_instructions.j_object)
Member:
I suspect all of these cases can have is not None removed. Confirm with @jmao-denver on what he wants to see.

tables: List[Table],
partition_paths: Optional[List[str]] = None,
instructions: Optional[IcebergParquetWriteInstructions] = None):
# TODO Review javadoc in this file once again
Member:
todo

table_identifier: str,
tables: List[Table],
partition_paths: Optional[List[str]] = None,
instructions: Optional[IcebergParquetWriteInstructions] = None):
Member:
missing a return type hint

instructions: Optional[IcebergParquetWriteInstructions] = None):
# TODO Review javadoc in this file once again
"""
Append the provided Deephaven table as a new partition to the existing Iceberg table in a single snapshot. This
Member:
this says "table" and "partition", but the input is a list of tables. Does that mean multiple tables go to one partition or multiple partitions? etc.

Comment on lines 405 to 406
tables: List[Table],
partition_paths: Optional[List[str]] = None,
Member:
see other comments

table_identifier: str,
tables: List[Table],
partition_paths: Optional[List[str]] = None,
instructions: Optional[IcebergParquetWriteInstructions] = None):
Member:
missing a return type hint

of data files that were written. Users can use this list to create a transaction/snapshot if needed.

Args:
table_identifier (str): the identifier string for iceberg table to write to.
Member:
grammar

Comment on lines 414 to 416
tables (List[Table]): the tables to write.
partition_paths (Optional[List[str]]): the partitioning path at which data would be written, for example,
"year=2021/month=01". If omitted, we will try to write data to the table without partitioning.
Member:
see other comments

partition_paths (Optional[List[str]]): the partitioning path at which data would be written, for example,
"year=2021/month=01". If omitted, we will try to write data to the table without partitioning.
instructions (Optional[IcebergParquetWriteInstructions]): the instructions for customizations while writing.
"""
Member:
All above cases that are missing the return type hint are also missing docs on the return value

@rcaudy (Member) left a comment:

I was a little less than thorough in the parquet writing and table adapter code, but I think we got the salient bits reviewed.
We should gather a consensus around our schema evolution support, as it influences this PR quite a lot.

* The Deephaven tables to be written. All tables should have the same definition, else a {@link #tableDefinition()
* table definition} should be provided.
*/
public abstract List<Table> dhTables();
Member:
Not a fan generally of prefixing based on namespaces like "dh"; it's okay sometimes as a variable, but I would just let the return type (io.deephaven.engine.table.Table) speak for itself and call this tables().

Member:
Do all of these tables need to match #tableDefinition exactly if it is present? If so, we should add a check.

Contributor Author:
The tables do not need to have exactly the same definition if #tableDefinition is provided; otherwise they all need to have the same definition. I have this check in IcebergTableAdapter::ensureDefinition, and I can move some of that logic to the Instructions class.

I have also updated the docs for #tableDefinition to make this clearer.
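For reference, a minimal sketch of what such a check could look like if moved into the Instructions class; the helper and its names are assumptions for illustration, not the PR's actual code:

import java.util.List;
import io.deephaven.engine.table.Table;
import io.deephaven.engine.table.TableDefinition;

// Hypothetical helper, not the PR's implementation: resolve the definition to write with,
// requiring identical per-table definitions only when no explicit definition is supplied.
final class WriteDefinitionCheck {
    static TableDefinition resolve(final List<Table> tables, final TableDefinition explicit) {
        if (explicit != null) {
            return explicit; // an explicit definition wins; individual tables may differ from it
        }
        final TableDefinition first = tables.get(0).getDefinition();
        for (final Table table : tables) {
            // assumes TableDefinition implements value equality
            if (!first.equals(table.getDefinition())) {
                throw new IllegalArgumentException(
                        "All tables must have the same definition when no explicit table definition is provided");
            }
        }
        return first;
    }
}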

* <p>
* If not provided, we use the latest schema from the table.
*/
public abstract Optional<Schema> schema();
Member:
I was debating arguing for the point

We may want to enforce that schema is present if the map is present, because it's Iceberg's responsibility to set the field ids when the schema is created or updated, and thus a user should only be getting the field IDs from an existing schema

but upon further reflection, it's possible that the users are either hard-coding just field IDs after the fact (ie, after the table has been created), or getting it from some other system that interacts with Iceberg.

Of course, in that scenario, you might argue that they should be hard-coding the Schema as opposed to just the field IDs... this also brings up a bit of a bootstrapping chicken-and-egg problem: how can a piece of writing logic be written that is both responsible for creating the initial table if it doesn't exist, and for appending to it successfully if the table does exist? Are we allowed to assume that Iceberg will create the Schema with the field IDs incrementing starting from 1? We should discuss this bootstrapping assumption problem, and how we can best solve it...

I think this method should move above the map; it feels more structurally important.
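For the "field IDs from an existing schema" case, a minimal sketch against the stock Iceberg API; the table identifier and column name are placeholders:

import org.apache.iceberg.Schema;
import org.apache.iceberg.Table;
import org.apache.iceberg.catalog.Catalog;
import org.apache.iceberg.catalog.TableIdentifier;

// Illustrative only: look up a field ID from an already-created Iceberg table
// instead of hard-coding it; "myNamespace.myTable" and the column name are placeholders.
final class FieldIdLookup {
    static int fieldIdFor(final Catalog catalog, final String columnName) {
        final Table icebergTable = catalog.loadTable(TableIdentifier.parse("myNamespace.myTable"));
        final Schema schema = icebergTable.schema();
        return schema.findField(columnName).fieldId();
    }
}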

Member:
It looks like we might be able to make assumptions about the ordering. Digging into the code I see org.apache.iceberg.types.TypeUtil#assignFreshIds(int, org.apache.iceberg.Schema, org.apache.iceberg.types.TypeUtil.NextID); of course, this is only the de facto implementation, and I'm not sure the field IDs are guaranteed to be in this order based on the spec...

Member:
Another argument, in the case where DH is the only writer, is that the user should be able to completely ignore schema and the map...

Comment on lines 722 to 727
// Sleep for 0.5 second
try {
Thread.sleep(500);
} catch (InterruptedException e) {
Thread.currentThread().interrupt();
}
Member:
Ideally, we should have some sort of testing hooks so we don't need to do any sleeps; or maybe we already have a way to wait for new data against the table?

Regardless, if a thread is interrupted in this way in a test, it's probably better to just declare the exception on the test method.
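A sketch of that suggestion under JUnit 5 (the test name and body are made up):

import org.junit.jupiter.api.Test;

class RefreshingIcebergTableTest {
    // Hypothetical test shape: declare InterruptedException on the method rather than
    // catching it, until a proper hook exists to wait for the new snapshot without sleeping.
    @Test
    void refreshPicksUpNewSnapshot() throws InterruptedException {
        // ... trigger the write that produces a new snapshot ...
        Thread.sleep(500); // ideally replaced by a testing hook
        // ... assert that the refreshed table sees the new data ...
    }
}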

Contributor Author:
I checked with Larry; he said he couldn't find a better way to test this part, so I left it like this for now.
Will check with Ryan.

Comment on lines +197 to +201
/**
* @return A callback to be executed when on completing each parquet data file write (excluding the index and
* metadata files).
*/
public abstract Optional<OnWriteCompleted> onWriteCompleted();
Member:
Based on the current implementation, I see this always gets invoked on-thread in a linear fashion; we may want to document either that the consumer is responsible for thread safety, or that the writing code will invoke this in a thread-safe way. Both approaches have their merit; not sure which I prefer. @rcaudy?
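One possible wording if the current serial, on-thread invocation is kept (a suggestion only, not the PR's text):

/**
 * @return A callback to be executed on completing each parquet data file write (excluding the index and
 *         metadata files). The callback is invoked serially on the writing thread, so implementations do
 *         not need to be thread-safe.
 */
public abstract Optional<OnWriteCompleted> onWriteCompleted();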

if (numTables == 0) {
return writeInstructions.withTableDefinition(TableDefinition.of());
}
final List<Table> dhTables = writeInstructions.tables();
Member:
So, I get the spirit of this method, but I'm not a fan of how we now have 2 classes of IcebergParquetWriteInstructions. It's the same sort of situation I'm sad about wrt #6149.

I wonder if instead we should add a method

public abstract class IcebergWriteInstructions implements IcebergBaseInstructions {
    ...
    public final TableDefinition tableDefinitionOrFirst() {
        return tableDefinition().orElse(tables().get(0).getDefinition());
    }
    ...
}

and then, where applicable, have callers use tableDefinitionOrFirst instead of tableDefinition. This saves us from having to create a new object and allows us to preserve the original instructions further down through the call stack.

writeInstructions.onWriteCompleted()
.ifPresent(callback -> callback.onWriteCompleted(CompletedParquetWrite.builder()
.destination(tableDestination)
.numRows(source.size())
Member:
I wonder if it's more appropriate to plumb the callback down through ParquetTableWriter.write? That way, write doesn't need to return the number of bytes anymore, and it would be responsible internally for calculating the number of rows (instead of making the caller do source.size(); I do see the safety check is at the inner layer calling checkInitiateSerialTableOperation).

public IcebergTableAdapter createTable(
@NotNull final TableIdentifier tableIdentifier,
@NotNull final TableDefinition definition) {
// TODO Add these APIs to python code once finalized
Contributor Author:
TODO


INSTRUCTIONS_BUILDER addAllDhTables(Iterable<? extends Table> elements);

// TODO Discuss about the API for partition paths, and add tests
Contributor Author:
TODO Check with Devin if this is okay.

Comment on lines 220 to 226
// Overwrite with an empty table
final Table emptyTable = TableTools.emptyTable(0)
.update("intCol = (int) 4 * i + 30",
"doubleCol = (double) 4.5 * i + 30");
tableAdapter.overwrite(instructionsBuilder()
.addDhTables(emptyTable)
.build());
Contributor Author:
Yeah, it now seems like an unnecessary complication.
If a user wants to delete the content, they can do it with just the Iceberg API and don't need Deephaven's help.
So I can delete it for now, and we can add it later if needed.
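For illustration of that point, a sketch of clearing an Iceberg table's contents directly through the Iceberg API (not part of this PR; the already-loaded org.apache.iceberg.Table is assumed):

import org.apache.iceberg.Table;
import org.apache.iceberg.expressions.Expressions;

// Illustrative only: delete all data files matching an always-true row filter via the stock Iceberg API.
final class IcebergTruncate {
    static void truncate(final Table icebergTable) {
        icebergTable.newDelete()
                .deleteFromRowFilter(Expressions.alwaysTrue())
                .commit();
    }
}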

* The {@link TableDefinition} to use when writing Iceberg data files. All tables written by this writer should have
* the same definition.
*/
public abstract TableDefinition tableDefinition();
Contributor Author:
So this definition can be a subset or a superset as well.
This is similar to how we treat the table definition on the Parquet reading/writing side, where adding an additional column here will lead to null values in the table.
I have updated the javadocs to make this clearer.
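For example, a superset definition along these lines (column names mirror the test table above; illustrative only) would write the extra column as nulls, mirroring the Parquet behavior:

import io.deephaven.engine.table.ColumnDefinition;
import io.deephaven.engine.table.TableDefinition;

// Illustrative only: a definition with one extra column relative to the source tables.
final class SupersetDefinitionExample {
    static final TableDefinition SUPERSET = TableDefinition.of(
            ColumnDefinition.ofInt("intCol"),
            ColumnDefinition.ofDouble("doubleCol"),
            ColumnDefinition.ofString("extraCol"));
}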
