So how would that compare to DynamoDB or BigQuery? (I have zero interest in payi...

mulmen · 2025-10-24T17:22:29 1761326549

Are you asking how Dynamo compares at the storage level? Like in comparison to S3? As a key-value database it doesn’t even have a native aggregation capability. It’s a very poor choose for OLAP.

BigQuery is comparable to DuckDB. I’m curious how the various Redshift flavors (provisioned, serverless, spectrum) and Spark compare.

I don’t have a lot of experience with DuckDB but it seems like Spark is the most comparable.

fifilura · 2025-10-24T21:51:15 1761342675

BigQuery is built for the distributed case while DuckDB is single CPU and requires the workarounds described in the article to act like a distributed engine.

tishj · 2025-10-25T14:18:13 1761401893

DuckDB is not single CPU, it's single machine - big difference

fifilura · 2025-10-26T06:42:36 1761460956

Fair enough i slipped. And single RAM.

And yeah these days you can boost a single machine to enormous specifications. I guess the main difference will be the cost. A distributed engine can "lease" a little bit of time here and there, while a single RAM engine needs to keep all that capacity ready for when it is actually needed.

mulmen · 2025-10-24T22:36:59 1761345419

Ah ok. Maybe that does make sense as a comparison to ask if you need an analytics stack or can just grind through your prod Dynamo.