If you're interested, I've adapted the TPC-H dbgen tool to generate Parquet files, in case you still need to generate the data.
