[Kernel] Assign base row ID and default row commit version to AddFile #3894

qiyuandong-db · 2024-11-21T14:53:39Z

Which Delta project/connector is this regarding?

Description

This PR implements the first part of row tracking support in Delta Kernel, based on the Delta Protocol. Specifically, it includes the following changes:

add baseRowId, defaultRowCommitVersion fields to AddFile and RemoveFile actions
implement functionality to assign baseRowId and defaultRowCommitVersion to AddFile actions prior to committing them
maintain the rowIdHighWaterMark of the delta.rowTracking metadata domain during the base row ID assignment, which is the highest assigned fresh row id for the table

It doesn't include:

resolving conflicts with transactions that assigned overlapping Row IDs and Commit Versions
assignment of the physical row ID column names at table creation

How was this patch tested?

Added tests in RowTrackingSuite.scala. This includes unit tests and integration tests with Delta-Spark.

Does this PR introduce any user-facing changes?

No.

johanl-db

There are a few things missing from the row tracking spec, see my comment

kernel/kernel-api/src/main/java/io/delta/kernel/internal/rowtracking/RowTracking.java

johanl-db · 2024-11-21T17:42:29Z

kernel/kernel-api/src/main/java/io/delta/kernel/internal/DeltaErrors.java

+    return new KernelException(
+        "Cannot assign baseRowId to add action. "
+            + "The number of records in this data file is missing.");
+  }


As we discussed, this can be an issue: if connectors don't populate numRecords stats in the addFile action that are committed, the commit will fail if row tracking is supported (note that this is still better than today where we always fail in that case since we don't support row tracking.

Question more for kernel folks: do we some guarantee or requirement that connectors populate numRecords? Are connectors that implement writes today (if any) populating numRecords?

In any case, I would word the exception so that it puts the burden more on the connector, for example:
"All add actions must have statistics that include the number of records when writing to a Delta table with the RowTracking table feature enabled."

Question more for kernel folks: do we some guarantee or requirement that connectors populate numRecords? Are connectors that implement writes today (if any) populating numRecords?

cc @vkorukanti thoughts?

We expect the connector to populate the numRecords and other stats as it is a heavy operation and we don't want to do that in Kernel. If some protocol feature (e.g., icebergCompatV2) requires stats and they are missing in the DataFileStatus, we throw errors.

So it seems like we should update the Row Tracking protocol to indicate that the AddFile statistics must include numRecords ?

+1 on making the error message more explicit about (1) what is unsupported (cannot write to row tracking table)(2) why it is unsupported (requires numRecords to be populated in stats) (3) who is responsible/what needs to be updated for support (not supported by this kernel integration/engine)

Thanks for the suggestion! I’ve updated the error message to better explain the situation and our expectations.

So it seems like we should update the Row Tracking protocol to indicate that the AddFile statistics must include numRecords ?

I'm not sure if this should go into protocol. The numRecords is used for assigning baseRowId, but we might have alternative ways to obtain this information (e.g. passing it directly from the connector, or potentially computing it within the Kernel ourselves). It seems more like an expectation for connectors to provide this data.

…tions

qiyuandong-db · 2024-12-09T08:40:04Z

kernel/kernel-api/src/main/java/io/delta/kernel/internal/rowtracking/RowTracking.java

+   * @return an {@link CloseableIterable} of data actions with base row IDs and default row commit
+   *     versions assigned
+   */
+  public static CloseableIterable<Row> assignBaseRowIdAndDefaultRowCommitVersion(


Another possible approach is to assign baseRowId and defaultRowCommitVersion when creating AddFile actions.

However, this would duplicate logic across all places where AddFile actions are created (currently just Transaction.generateAppendActions, but potentially more in the future I guess). Plus, baseRowId assignment is stateful, making it tricky if AddFile actions are created in multiple steps/places.

I chose to handle it here for now to keep things centralized, even though it requires converting AddFile rows back to actions, updating them, and converting back to rows.

kernel/kernel-api/src/main/java/io/delta/kernel/internal/SnapshotImpl.java

kernel/kernel-api/src/main/java/io/delta/kernel/internal/TableFeatures.java

kernel/kernel-api/src/main/java/io/delta/kernel/internal/rowtracking/RowTracking.java

johanl-db · 2024-12-09T12:50:37Z

kernel/kernel-api/src/main/java/io/delta/kernel/internal/actions/AddFile.java

+  private final String path;
+  private final MapValue partitionValues;
+  private final long size;
+  private final long modificationTime;
+  private final boolean dataChange;
+  private final Optional<DeletionVectorDescriptor> deletionVector;
+  private final Optional<MapValue> tags;
+  private final Optional<Long> baseRowId;
+  private final Optional<Long> defaultRowCommitVersion;
+  private final Optional<DataFileStatistics> stats;


Moves this first in the class

Apologies -- what does this comment mean? Having the static methods first, then the static variables, then the member variables and constructor LGTM

I don't know the ordering convention used in kernel, I would typically put all variables (static/member) before any method definition but that's personal and we should follow whatever convention kernel uses.

What tripped me initially is that AddFile is essentially a record and these member fields are arguably the most important part of the class to make sense of the rest but they are lost in the middle of the file

kernel/kernel-api/src/main/java/io/delta/kernel/internal/actions/AddFile.java

kernel/kernel-defaults/src/test/scala/io/delta/kernel/defaults/RowTrackingSuite.scala

scottsand-db

Looks great! Thanks for making this. Left some comments.

kernel/kernel-api/src/main/java/io/delta/kernel/internal/DeltaErrors.java

kernel/kernel-api/src/main/java/io/delta/kernel/internal/actions/AddFile.java

scottsand-db · 2024-12-09T19:43:08Z

kernel/kernel-api/src/main/java/io/delta/kernel/internal/actions/AddFile.java

+  private final String path;
+  private final MapValue partitionValues;
+  private final long size;
+  private final long modificationTime;
+  private final boolean dataChange;
+  private final Optional<DeletionVectorDescriptor> deletionVector;
+  private final Optional<MapValue> tags;
+  private final Optional<Long> baseRowId;
+  private final Optional<Long> defaultRowCommitVersion;
+  private final Optional<DataFileStatistics> stats;


Apologies -- what does this comment mean? Having the static methods first, then the static variables, then the member variables and constructor LGTM

kernel/kernel-api/src/main/java/io/delta/kernel/internal/actions/AddFile.java

kernel/kernel-api/src/main/java/io/delta/kernel/internal/rowtracking/RowTracking.java

scottsand-db · 2024-12-11T18:45:45Z

kernel/kernel-api/src/main/java/io/delta/kernel/internal/rowtracking/RowTracking.java

+   * @param addFile the AddFile action
+   * @return the number of records
+   */
+  private static long getNumRecords(AddFile addFile) {


why not put this onto the AddFIle itself? This feels a lot like c and struct like code. Putting this into AddFile would make the code more cohesive

My idea was to make it throw rowIDAssignmentWithoutStats in case of missing stats to simply some row tracking code, which is very specific to row tracking and would be inappropriate for a getter method of AddFile itself.

Now I've put the getter public Optional<Long> getNumRecords() to the AddFile itself, and have a helper method in RowTracking.java:

private static long getNumRecordsOrThrow(AddFile addFile) { return addFile.getNumRecords().orElseThrow(DeltaErrors::rowIDAssignmentWithoutStats); }

Does this look good to you?

kernel/kernel-api/src/main/java/io/delta/kernel/internal/rowtracking/RowTracking.java

kernel/kernel-api/src/main/java/io/delta/kernel/internal/TableFeatures.java

kernel/kernel-api/src/main/java/io/delta/kernel/internal/rowtracking/RowTracking.java

scottsand-db · 2024-12-16T17:40:19Z

kernel/kernel-api/src/main/java/io/delta/kernel/internal/TableFeatures.java

+   * @param protocol the protocol to check
+   * @return true if the protocol supports row tracking, false otherwise
+   */
+  public static boolean isRowTrackingSupported(Protocol protocol) {


Question: where (probably in future PRs?) will you check that RowTracking is enabled (which is strictly stronger than supported?

https://github.com/delta-io/delta/blob/master/PROTOCOL.md#row-tracking

We will not be checking whether row tracking is enabled in this PR or in planned future ones. The current goal is to add a minimal implementation to ensure row tracking is supported in Delta Kernel. Addressing the enabled requirement is outside the current scope.

I imagine that in the future, we may need to check it is enabled when 1) reconstructing stable row ID / row commit version during reads, and 2) preserving stable row ID / row commit version during writes (e.g., for handling UPDATE and DELETE operations).

scottsand-db · 2024-12-16T17:47:22Z

kernel/kernel-api/src/main/java/io/delta/kernel/internal/actions/AddFile.java

+                    .orElse(null)));
+  }
+
+  private final String path;


@vkorukanti -- were we trying to avoid materializing all of the AddFile fields in memory into a POJO?

the addfile is materialized into a pojo inside of RowTracking.java below like so:

AddFile addFile = AddFile.fromRow(row.getStruct(ADD_FILE_ORDINAL)); // Assign base row ID if missing if (!addFile.getBaseRowId().isPresent()) { final long numRecords = getNumRecordsOrThrow(addFile); addFile = addFile.withNewBaseRowId(currRowIdHighWatermark.get() + 1L); currRowIdHighWatermark.addAndGet(numRecords); } // Assign default row commit version if missing if (!addFile.getDefaultRowCommitVersion().isPresent()) { addFile = addFile.withNewDefaultRowCommitVersion(commitVersion); } // Return a new AddFile row with assigned baseRowId/defaultRowCommitVersion return SingleAction.createAddFileSingleAction(addFile.toRow());

I think there is a better way here, to implement AddFile without materializing all of the values, and just pointing to the underlying row

I think we can solve this problem like so:

Step 1: Create a DelegateRow class

public abstract class DelegateRow implements Row { private final Row delegate; public DelegateRow(Row delegate) { this.delegate = delegate; } @Override public StructType getSchema() { return baseRow.getSchema(); } // implement all Row interfaces using the delegate }

Step 2: Implement AddFile class without materializing everything

class AddFile { private final Row row; public AddFile(Row row) { this.row = row; this.parsedPartitionValues = ... } public String getPath() { return row.getString(COL_NAME_TO_ORDINAL.get("path")); } // ... implement getters by referencing the row ... public AddFile withNewBaseRowId(long baseRowId) { Row updatedRow = new DelegateRow(row) { @Override public long getLong(int ordinal) { if (ordinal == COL_NAME_TO_ORDINAL.get("baseRowId")) { return baseRowId; } return super.getLong(ordinal); } @Override public boolean isNullAt(int ordinal) { if (ordinal == COL_NAME_TO_ORDINAL.get("baseRowId")) { return false; // baseRowId is now defined } return super.isNullAt(ordinal); } }; return new AddFile(updatedRow); } }

Another way to implement DelegateRow is to pass in a map of overrides ... I'd have to think more of the tradeoffs between the two designs

Is the issue that we now need to modify an add file? Instead of just previously accessing the values/writing them as is?

I think this is important to think about from generic rust/java standpoint as well. Like if we have some generic chunk of engine data, and we need to change/update/add to it, how do we do so?

Can we unblock this change and go forward with the current approach while we're figuring things out?

I agree it's not ideal to materialize AddFiles, but we only do it when row tracking is used so this PR is still a strict improvement over the current situation since tables with row tracking are not readable by kernel today.

Avoiding materializing actions to mutate them is then an optimization. It may also be useful for other features in the future

Hi @johanl-db and @qiyuandong-db -- I think we need to pause work on this PR and take a step back to really plan this out. I think that the Kernel folks need to decide how we want (or further, if we want) Rows to be created and updated.

For example, another way to accomplish this would be to create an expression that transforms the row to create a new one with whatever values we want.

We can chat more on Slack to figure out next steps here. Great work on this PR and I'm glad that this PR has made us think quite hard about our data model and integration with engine connectors.

scottsand-db · 2024-12-16T17:47:58Z

kernel/kernel-api/src/main/java/io/delta/kernel/internal/actions/AddFile.java

+    StringBuilder sb = new StringBuilder();
+    sb.append("AddFile{");
+    sb.append("path='").append(path).append('\'');
+    sb.append(", partitionValues=").append(partitionValuesJavaMap);


I think that partitionValues would be better printed out as the in-order part1=value1, part2=value2 string

this map has no order guarantees, it could be confusing if we printed out { part2=value2, part1=value1 }

The updated to AddFile alone, btw, would warrant their own PR. You could add an AddFileSuite to test these methods

I think that partitionValues would be better printed out as the in-order part1=value1, part2=value2 string

Yes, I’ll use a TreeMap to ensure the order.

The updated to AddFile alone, btw, would warrant their own PR. You could add an AddFileSuite to test these methods

I agree the updates to AddFile alone do need their own PR. My initial intention was to combine them to better showcase how AddFile is used in row tracking and to justify the changes. I can prepare a separate PR for it.

qiyuandong-db force-pushed the delta-kernel-row-tracking branch from 54b77cc to 75c0005 Compare November 21, 2024 15:09

johanl-db reviewed Nov 21, 2024

View reviewed changes

qiyuandong-db force-pushed the delta-kernel-row-tracking branch from 75c0005 to c584707 Compare December 7, 2024 21:46

qiyuandong-db changed the title ~~[Kernel] Assign base row ID to AddFile actions~~ [Kernel] Assign base row ID and default row commit version to AddFile Dec 7, 2024

Assign baseRowId/defaultRowCommitVersion to AddFile and RemoveFile ac…

e94da89

…tions

qiyuandong-db force-pushed the delta-kernel-row-tracking branch from c584707 to e94da89 Compare December 9, 2024 08:28

qiyuandong-db commented Dec 9, 2024

View reviewed changes

johanl-db reviewed Dec 9, 2024

View reviewed changes

Address pr comments

2816342

qiyuandong-db force-pushed the delta-kernel-row-tracking branch from 954641b to 2816342 Compare December 9, 2024 15:14

scottsand-db requested changes Dec 11, 2024

View reviewed changes

Address PR comments

895e0ae

qiyuandong-db force-pushed the delta-kernel-row-tracking branch from 700a46f to 895e0ae Compare December 12, 2024 15:06

scottsand-db requested review from vkorukanti and allisonport-db December 16, 2024 17:30

scottsand-db reviewed Dec 16, 2024

View reviewed changes

qiyuandong-db added 2 commits December 17, 2024 16:39

Address PR comments - 3

86d83ce

Rename the error to missingNumRecordsStatsForRowTracking for consistency

0fd9d1b

zachschuermann requested review from zachschuermann and nicklan December 17, 2024 17:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Kernel] Assign base row ID and default row commit version to AddFile #3894

[Kernel] Assign base row ID and default row commit version to AddFile #3894

qiyuandong-db commented Nov 21, 2024 •

edited

Loading

johanl-db left a comment

johanl-db Nov 21, 2024

scottsand-db Dec 16, 2024

vkorukanti Dec 16, 2024

scottsand-db Dec 16, 2024

allisonport-db Dec 16, 2024

qiyuandong-db Dec 17, 2024

qiyuandong-db Dec 9, 2024

johanl-db Dec 9, 2024

scottsand-db Dec 9, 2024

johanl-db Dec 17, 2024

scottsand-db left a comment

scottsand-db Dec 9, 2024

scottsand-db Dec 11, 2024

qiyuandong-db Dec 12, 2024 •

edited

Loading

scottsand-db Dec 16, 2024

qiyuandong-db Dec 17, 2024

scottsand-db Dec 16, 2024

scottsand-db Dec 16, 2024

scottsand-db Dec 16, 2024

scottsand-db Dec 16, 2024

allisonport-db Dec 16, 2024

johanl-db Dec 17, 2024

scottsand-db Dec 17, 2024 •

edited

Loading

scottsand-db Dec 16, 2024

qiyuandong-db Dec 17, 2024

[Kernel] Assign base row ID and default row commit version to AddFile #3894

Are you sure you want to change the base?

[Kernel] Assign base row ID and default row commit version to AddFile #3894

Conversation

qiyuandong-db commented Nov 21, 2024 • edited Loading

Which Delta project/connector is this regarding?

Description

How was this patch tested?

Does this PR introduce any user-facing changes?

johanl-db left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

scottsand-db left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

qiyuandong-db Dec 12, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Step 1: Create a DelegateRow class

Step 2: Implement AddFile class without materializing everything

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

scottsand-db Dec 17, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

qiyuandong-db commented Nov 21, 2024 •

edited

Loading

qiyuandong-db Dec 12, 2024 •

edited

Loading

scottsand-db Dec 17, 2024 •

edited

Loading