git - clear notebooks before commit
Tanguy
Dataiku DSS Core Designer, Dataiku DSS & SQL, Dataiku DSS ML Practitioner, Dataiku DSS Core Concepts, Neuron, Dataiku DSS Adv Designer, Registered, Dataiku DSS Developer, Neuron 2023 Posts: 119 Neuron
Several times we've encountered projects that could not be exported due to "git saturation." From memory, I believe the export limit for a project occurs when version control exceeds 2 GB (but I think this limit has recently been raised).
After investigating, we found that this issue was caused by committing notebooks, particularly those with output related to computer vision tasks, where image data was significantly increasing the project's git size.
One solution could be to clear the notebook outputs before committing them to version control, as demonstrated in this short tutorial:
Is it possible to configure this behavior as the default for a project's notebooks?