in DuckDB, if you have a 500m row table and you want to estimate the size of it as a CSV, how would you do it?

Originally was using bit_length with VARCHAR casting, but this is REALLY expensive. Maybe just adding a limit and inflating would give an estimate, but error-prone b/c of drift

Any ideas?

Comments