Quack: The DuckDB Client-Server Protocol

Posted by aduffy 17 hours ago

Quack: The DuckDB Client-Server Protocol(duckdb.org)

293 points | 60 commentspage 2

ashkankiani 12 hours ago|

My first thought: setting up a self replicating duckdb wrapper over ssh so that I can execute queries on any computer. Can’t wait to play with this!

timsuchanek 8 hours ago||

This is very exciting. Now we just need this for Postgres as well.

ozgrakkurt 15 hours ago||

> It would be rather misguided not to build a database protocol on top of HTTP in 2026

This is wrong, HTTP is bad for transferring large amount of data and it is also bad for doing streaming.

It is bad for large amount of data because you have timeout issues on some clients, you hit request/response size limits etc.

It is obviously bad for streaming as there is no concept of streaming in it.

It is comical to go the path of least resistance so lazy people can put a reverse proxy on top of it. And then say HTTP is the only relevant way to do it in 2026.

The benchmark doesn't seem to mean much as TCP can max out 50GB/s on a single thread. Pretty sure it can do more than that even. So you could be using anything that isn't terrible and you should get max performance out of this.

Also the protocol is something else from the format. For example if you are transferring mp4 over ftp and http you can compare that.

If you are transferring different things over different protocols then the comparison means nothing.

The benchmark graph for bulk transfer should show more granularity so it is possible to understand how much of the % of the hardware limit it is reaching. Similar to how BLAS GEMM routines are benchmarked based on the % of theoretical max flops of the hardware.

> 60 million rows (76 GB in CSV format!)

This reads a bit disingenuous.

It is dissappointing to see this instead of something like PostgreSQL protocol with support for a columnar format.

arpinum 14 hours ago||

It uses http/2, it has streaming.

geysersam 12 hours ago|||

They mention in the benchmarks section that the network they're on is a "up to" 15 Gbps connection. So to max out 50GB/s is not realistic.

I agree they should have also listed the compressed size of the table instead of only mentioning the CSV size. But the compressed dataset is probably not smaller than 1/10 of the CSV size. If that's the case they're transferring ~8GB in 4.6 s on a 2GB/s (15Gbps) connection. Seems pretty close to max.

ozgrakkurt 10 hours ago||

That makes sense. I meant to write 50gbps, I don’t mean they should reach that, I mean you could use any protocol that is fairly efficient and it would reach that.

The size of the dataset should be under 3GB in parquet from what I understand. [0]

So it did 3*8/4.94 = 4.85 Gbps which is underwhelming in terms of network performance.

It is still not possible to make any conclusions since we don’t know how specifically they encode it or how they are running the query.

I just mean this writing is useless in terms of engineering perspective, also what it says about http doesn’t make sense

[0] - https://clickhouse.com/docs/getting-started/example-datasets...

geysersam 5 hours ago||

Agreed, that does seem a bit underwhelming. Hopefully there are some performance gains to be made before the production release in september.

jpdenford 3 hours ago|||

They also wanted the protocol to work with duckdb wasm in the browser. I can’t comment on the performance side but that consistency piece is pretty key to duckdbs value proposition I think.

duzer65657 14 hours ago||

really like duckdb and sorry to pile on, but the parent makes some strong points. I wonder if MotherDuck builds on http as well?

jdnier 8 hours ago|||

The parent reads more like "it works in practice but does it work in theory?" The innovations that have come out of the DuckDB team seem to always focus on "in practice" instead of focusing on how things are supposed to (or are expected to) be done.

matsonj 7 hours ago|||

no we don't (source: work at motherduck)

znite 14 hours ago||

Does this work with duckdb-wasm?

PhilippGille 14 hours ago||

It's in the article:

> HTTP also allows the DuckDB-Wasm distribution to speak Quack natively! So DuckDB running in a browser can e.g., directly connect to a DuckDB instance running in an EC2 server using Quack.

philipallstar 1 hour ago|||

That is a pretty amazing feature.

znite 9 hours ago|||

Thanks, thought I searched for it & didn't come up. Great stuff

hfmuehleisen 14 hours ago||

Maintainer here. Yes!

znite 9 hours ago||

Thanks, thought I searched for it & didn't come up. Great stuff

Arcaveli 5 hours ago||

cool

andrew_kwak 8 hours ago||

[flagged]

analyticsfs 14 hours ago|

[dead]