Shaker-samples in cache section
Hello in the data/dss/cache directory there is a folder called "shaker-recipes". In this folder there is a directory for each uploaded dataset. In this directory there are 2-3 files (see picture below). Exspecially the ...dssl file is large with about 1 GB for some datasets. It seems like the contents of the dataset are also in this file. If I delete one of these folders i gets created again when I am accessing the dataset.
Can someone explain the purpose of these "shaker-recipe" folder and its subfolders? Can I delete the contents of it?
Thanks in Advance!
Operating system used: Ubuntu
Answers
-
Turribeach Dataiku DSS Core Designer, Neuron, Dataiku DSS Adv Designer, Registered, Neuron 2023 Posts: 2,165 Neuron
This folder's pupose is documented here:
https://doc.dataiku.com/dss/latest/operations/disk-usage.html#caches
As you found out it is there to store data samples which are shown in the Explore tab of each dataset. You can from time to time delete old folders but I wouldn't delete everything since it will slow down users when they Explore datasets in their flow, until the samples get regenerated.
And in my system the cache folder wasn't that big and well below other directories that grow much bigger (run, uploads, exports, scenarios and jobs) so quite low in the priority list of the things that need to cleaned up manually by a Dataiku Admin.