Comment by luuio

Comment by luuio 6 months ago

9 replies

The KV store had etag support for conditional writes. Etags are only useful to make sure the data didn't change underneath between your read and your write.

Storing the checkpoints along with the mutation was for idempotency. If the checkpoint was in the document, that meant the mutation had succeeded and a retry should be no-op

jrochkind1 6 months ago

Hm, I misunderstood, I thought the checkpoint system was also concurrency control to make sure nobody else had changed it from underneath you between read and write, since you had to read and write the whole (mega) document even though you only wanted to change a part (sub-document).

Doesn't the KV provide idempotency on it's own -- so long as you're checking that no changes have happened between read and write, why wouldn't doing the same write twice produce an idempotent result? A change happenening between read and write seems the only reason that would be a problem.

But clearly it's complicated and we don't have the whole picture as to business needs. Definitely sounds awful.

  • luuio 6 months ago

    > why wouldn't doing the same write twice produce an idempotent result

    you can imagine this:

    ```

    var thing = KVStore.Get<MyThing>(...);

    if (things.Checkpoints.Contains(myUuid) == false) {

      thing.Counter += 1;
    
    }

    KVStore.Update(thing); ```

    having an etag doesn't help you with retries, where we expect that `thing` could be mutated by another flow between your retries.

    • jrochkind1 6 months ago

      If the thing was mutated between your retries, then wasn't the etag changed by that mutation? So if you know the etag you started with, your conditional update on etag fails due to changed etag. So you fetch it again and start over. Which is the general optimistic locking algorithm.

      i may be missing something though?

      • luuio 6 months ago

        let's say you have an object like this when you started: {count: 10, etag: 1}. then for some reason, something failed.

        when you retry and load the object, you get {count: 12, etag: 3}. how do you know if your previous attempt had successfully persisted or not, or if the updates to the object came from other processes/requests?

        you're mixing up conflict handling vs. idempotency