
What is the difference between uploaded datasets and managed datasets?

Solved!
FlorianD
Dataiker
What is the difference between uploaded datasets and managed datasets?
 
1 Solution
FlorianD
Dataiker
Author

There are three kinds of datasets in Dataiku:

  • Managed datasets: datasets that are created by a recipe. Dataiku assumes it “owns” the data and schema of those datasets.

  • Uploaded datasets: raw files that are uploaded through the user interface. They are stored locally in raw form, in a specific folder (that you can find in DATA_DIR) with one subfolder per dataset, named PROJECTNAME.DATASETNAME.

    You can “modify” an uploaded dataset by re-uploading it. An uploaded dataset is actually a folder, meaning that it can contain several files.

  • Non-managed, non-uploaded datasets: usually a dataset that points to existing data (a table in a SQL database, for instance) that you can use as an input for a recipe.
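To make the three kinds concrete, here is a minimal Python sketch that sorts a dataset definition into one of them and builds the per-dataset subfolder name used for uploads. It is an illustration only: the `type` value `"UploadedFiles"` and the boolean `managed` field are assumptions about what a dataset definition looks like, so check them against your own DSS instance. The function names, the label `"external"` (shorthand for "non-managed, non-uploaded"), and the `uploads_root` parameter are hypothetical, and the answer above does not name the exact folder inside DATA_DIR, so it is passed in rather than guessed.

```python
from pathlib import Path


def dataset_kind(definition: dict) -> str:
    """Classify a dataset definition into one of the three kinds described above."""
    # Assumption: uploaded datasets are identified by the type "UploadedFiles".
    if definition.get("type") == "UploadedFiles":
        return "uploaded"
    # Assumption: a boolean "managed" flag marks datasets whose data and
    # schema are owned by Dataiku (typically recipe outputs).
    if definition.get("managed", False):
        return "managed"
    # Everything else points to existing data, e.g. a table in a SQL database.
    return "external"


def uploaded_dataset_folder(uploads_root, project_name: str, dataset_name: str) -> Path:
    """Build the PROJECTNAME.DATASETNAME subfolder path for an uploaded dataset.

    `uploads_root` is the folder inside DATA_DIR that holds uploads; its
    exact name is not given in the thread, so the caller supplies it.
    """
    return Path(uploads_root) / f"{project_name}.{dataset_name}"


# Hypothetical definitions, written by hand for illustration.
examples = [
    {"name": "customers_csv", "type": "UploadedFiles", "managed": False},
    {"name": "customers_prepared", "type": "Filesystem", "managed": True},
    {"name": "orders_table", "type": "PostgreSQL", "managed": False},
]
kinds = {d["name"]: dataset_kind(d) for d in examples}
```

Checking the uploaded case first keeps the result independent of whatever the `managed` flag happens to be for uploads.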



 


2 Replies

Alex
Level 1
Thank you!