DDD-4 Manipulate Connector Offset via Debezium Signals #12

ani-sha · 2024-08-08T08:49:51Z

This is the first draft of offset manipulation design document (simplified version of #6), starting with implementation for Debezium Postgres connector.

cc : @jpechane

DDD-4.md

jpechane · 2024-08-21T03:43:34Z

DDD-4.md

+
+* `name` - The name of the action. This will be `change-offset`.
+* `offset-position` - The new offset position to set (which corresponds to `lsn_commit` in the PostgreSQL connector).
+* `last-offset-position` - The last offset position to set (which corresponds to `lsn_proc` in the PostgreSQL connector).


I think we need to come up with better naming and descrption/explanation, current one is a bit confusing.
Also I think we need an example of the signal message in the DDD too.

I don't think we should tie the change-offset payload to any specific connector implementation.

While there could be some common attributes between them, I believe it's best to take this as an iterative approach where we consider the payload being processed by the signal handler and delegated to each connector with a specific impl that validates & returns the writable payload. Then we can look to refactor that as a second pass, if needed.

So I would propose the signal payload may look:

{ /* This acts as an envelope where we can add some common attributes in pass 2 */ "connector-offsets": { /* This is the connector-offset payload, to be processed by each connector individually */ /* Here we would have unique key/value tuples, this being an example of PostgreSQL */ "last_commit_lsn": "....", "last_procsesed_lsn": "...", /* Here's another example for something like Oracle */ "last_processed_scn": "123456789", "committed_scns": [ { "redo_thread": 1, "commit_scn": "123456799", "transactions_committed_at_commit_scn": ["trx1", "trx2"] }, { "redo_thread": 2, "commit_scn": "123456650", "transactions_committed_at_commit_scn": ["trx3"] } ] } }

The envelope could be something handled directly by the main signal handler if needed, but we'd have a connector-specific object that is provided the contents of the "connector-offsets" and it would return the necessary Struct payload based on that data to be written to the Kafka offsets.

In this case, you may not need the OffsetValidator contract to be separate, too. wdyt?

jpechane · 2024-08-21T03:45:25Z

DDD-4.md

+The following refers to the offsets regardless of streaming/snapshot:
+
+```json
+{


We should also make a distintion between the fields that are an the offsets and those that are actually necessary for repositioning.

DDD-4 Manipulate Connector Offset via Debezium Signals

f9a6a56

mfvitale reviewed Aug 12, 2024

View reviewed changes

DDD-4.md Outdated Show resolved Hide resolved

jpechane reviewed Aug 21, 2024

View reviewed changes

DDD-4 Add suggestions from review

ed357ac

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DDD-4 Manipulate Connector Offset via Debezium Signals #12

DDD-4 Manipulate Connector Offset via Debezium Signals #12

ani-sha commented Aug 8, 2024

jpechane Aug 21, 2024

Naros Aug 28, 2024 •

edited

Loading

jpechane Aug 21, 2024

DDD-4 Manipulate Connector Offset via Debezium Signals #12

Are you sure you want to change the base?

DDD-4 Manipulate Connector Offset via Debezium Signals #12

Conversation

ani-sha commented Aug 8, 2024

jpechane Aug 21, 2024

Choose a reason for hiding this comment

Naros Aug 28, 2024 • edited Loading

Choose a reason for hiding this comment

jpechane Aug 21, 2024

Choose a reason for hiding this comment

Naros Aug 28, 2024 •

edited

Loading