The Dataiku Frontrunner Awards are now accepting submissions until July 15 to recognize your achievements! ENTER YOUR SUBMISSION

Why do I need to visualize only a sample?

Solved!
UserBird
Dataiker
Dataiker
Why do I need to visualize only a sample?
 
1 Solution
jrouquie
Dataiker Alumni
DSS tries hard to provide an interactive user experience and fast responses. One of the main things to do so is to work on a sample when exploring / visualizing a dataset. Also, when datasets get big, they could make your browser run out of RAM, so sampling is pretty useful.

That being said, DSS does not enforces a strict limit, it just chooses a reasonable default. So for not so big datasets, you can parameter the sample to be the whole dataset.

Tip: when the dataset is stored on a powerful SQL server, for visualization, you can choose to use the whole dataset as sample. DSS will then offer to run the aggregations on the SQL server instead of on the DSS server.

View solution in original post

1 Reply
jrouquie
Dataiker Alumni
DSS tries hard to provide an interactive user experience and fast responses. One of the main things to do so is to work on a sample when exploring / visualizing a dataset. Also, when datasets get big, they could make your browser run out of RAM, so sampling is pretty useful.

That being said, DSS does not enforces a strict limit, it just chooses a reasonable default. So for not so big datasets, you can parameter the sample to be the whole dataset.

Tip: when the dataset is stored on a powerful SQL server, for visualization, you can choose to use the whole dataset as sample. DSS will then offer to run the aggregations on the SQL server instead of on the DSS server.

View solution in original post

Labels (2)
A banner prompting to get Dataiku DSS
Public