I think you missed the part in the middle: ----------------- So the protocol end...

loeg · 2025-07-20T14:14:01 1753020841

No, we saw this scheme, it just doesn't work. Either of the async writes can fail after ack'ing the logical write to the client as successful (e.g., kernel crash or power failure) and then you have lost data.

cyanydeez · 2025-07-20T14:32:44 1753021964

You can always have data loss. The intent is that when the client is told the data is saved, it doesnt happen before the garuntee.

I dont know if OP achieved this, but the client isnt told "we have your data" until both of the WALs are agreeing. If the system goes down those WALs are used to rebuild data in flight.

The speed up allows for decoupling synchronous disk writes that are now parallel.

You are not conceptualizing what data loss means in the ACID contract between DB and Client.

But you

loeg · 2025-07-20T20:21:36 1753042896

> I dont know if OP achieved this,

They did not.

> but the client isnt told "we have your data" until both of the WALs are agreeing.

Wrong. In the proposed scheme, the client writes are ack'd before the WAL writes are flushed. Their contents may or may not agree after subsequent power loss or kernel crash.

(It is generally considered unacceptable for network databases/filers to be lossier than the underlying media. Sometimes stronger guarantees are required/provided, but that is usually the minimum.)

jcgrillo · 2025-07-20T14:09:45 1753020585

There's no fsync in the async version, though, unless I missed it? The problem with the two WAL approach is that now none of the WAL writes are durable--you could encounter a situation where a client reads an entry on the completion WAL which upon recovery does not exist on disk. Before with the single fsynced WAL, writes were durably persisted.