Survey banner
The Dataiku Community is moving to a new home! New posts are now disabled and the community will shortly be in temporary read only mode: LEARN MORE

Shaker-samples in cache section

manuelberbig
Level 2
Shaker-samples in cache section

Hello in the data/dss/cache directory there is a folder called "shaker-recipes". In this folder there is a directory for each uploaded dataset. In this directory there are 2-3 files (see picture below). Exspecially the ...dssl file is large with about 1 GB for some datasets. It seems like the contents of the dataset are also in this file. If I delete one of these folders i gets created again when I am accessing the dataset.

Can someone explain the purpose of these "shaker-recipe" folder and its subfolders? Can I delete the contents of it?

Thanks in Advance!


Operating system used: Ubuntu

0 Kudos
1 Reply
Turribeach

This folder's pupose is documented here:

https://doc.dataiku.com/dss/latest/operations/disk-usage.html#caches

As you found out it is there to store data samples which are shown in the Explore tab of each dataset. You can from time to time delete old folders but I wouldn't delete everything since it will slow down users when they Explore datasets in their flow, until the samples get regenerated.

And in my system the cache folder wasn't that big and well below other directories that grow much bigger (run, uploads, exports, scenarios and jobs) so quite low in the priority list of  the things that need to cleaned up manually by a Dataiku Admin.

0 Kudos