Implement Sampling > Random as Engine:In-Database(SQL) for Snowflake

Registered Posts: 47 ✭✭✭✭✭
edited July 2024 in Product Ideas
2
2 votes

New · Last Updated

Currently if I select Sampling method: Random (approx. ratio) or Random (approx. nb. records) the only allowed engine is DSS which will require downloading the input dataset to dss.

It's possible to do sampling at the Snowflake side, with https://docs.snowflake.com/en/sql-reference/constructs/sample

For Random(approx. nb.records) I believe it would be as easy as generating the following SQL

select * from input_table sample row (10 rows) seed (99);

Comments

Welcome!

It looks like you're new here. Sign in or register to get started.

Welcome!

It looks like you're new here. Sign in or register to get started.