
What is the difference between uploaded datasets and managed datasets?

Dataiker
2 Replies
Dataiker
Author

There are three kinds of datasets in Dataiku:




  • Managed datasets: datasets that are created by a recipe. Dataiku assumes it “owns” the data and schema of those datasets.

  • Uploaded datasets: raw files that are uploaded through the user interface. They are stored locally in raw form, in a specific folder (that you can find in DATA_DIR), with one subfolder per dataset, named PROJECTNAME.DATASETNAME.

    You can “modify” an uploaded dataset by re-uploading it. An uploaded dataset is actually a folder, meaning that it can contain several files.

  • Non-managed, non-uploaded datasets: usually a dataset that points to existing data (a table in a SQL database, for instance) that you can use as an input for a recipe.
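
To make the uploaded-dataset layout concrete, here is a minimal Python sketch that lists the raw files behind one uploaded dataset. It only assumes the PROJECTNAME.DATASETNAME subfolder naming described above. The `uploads_root` argument is a hypothetical placeholder for the folder inside DATA_DIR that holds uploads (its exact name is not given here), and both function names are made up for illustration; they are not part of any Dataiku API.

```python
from pathlib import Path


def uploaded_dataset_dir(uploads_root, project_name, dataset_name):
    """Return the folder expected to hold an uploaded dataset's raw files.

    uploads_root is an assumption: pass the folder inside DATA_DIR that
    contains the per-dataset subfolders on your own installation.
    """
    # Subfolder naming follows the PROJECTNAME.DATASETNAME pattern.
    return Path(uploads_root) / f"{project_name}.{dataset_name}"


def list_uploaded_files(uploads_root, project_name, dataset_name):
    """List the raw file names in an uploaded dataset folder, sorted by name."""
    folder = uploaded_dataset_dir(uploads_root, project_name, dataset_name)
    if not folder.is_dir():
        # No such folder: the dataset is probably not an uploaded one.
        return []
    # An uploaded dataset is a folder, so it may contain several files.
    return sorted(p.name for p in folder.iterdir() if p.is_file())
```

Calling `list_uploaded_files(some_root, "MYPROJECT", "customers")` returns an empty list when the folder does not exist, which is what you would expect for a managed dataset or one that points to an external SQL table.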



 

Level 1
Thank you!