is there a way to split output file into smaller chunks?

Witw
Level 1
is there a way to split output file into smaller chunks?

Hi, I have dataiku flow to format the data and generate csv files to azure blob storage.

However it seems like the output files is over 100MB each. We would like to have a control over the output file - spliting data into smaller files - not more than 50MB each file. 

Wonder if this is possible to do on dataiku flow?

 

Thanks


Operating system used: window

0 Kudos
1 Reply
Turribeach

Certainly possible, calculate your average row size, then calculate how many rows that amounts to 50 MB and then split the data in that number of rows / files. It would certainly need Python but it's not that complicated to do.

0 Kudos