Delete the external dataset in Dataiku project

Gourab_Chanda
Gourab_Chanda Registered Posts: 3 ✭✭✭

Hi Everyone

I am looking to delete a DSS project and the project has external datasets , I am assuming that while deleting a project (from Admin end) the managed datasets are deleted but the external datasets are not deleted . Is there any way by which we can delete the external datasets while deleting the projects .

Regards

Gourab

Answers

  • Ignacio_Toledo
    Ignacio_Toledo Dataiku DSS Core Designer, Dataiku DSS Core Concepts, Neuron 2020, Neuron, Registered, Dataiku Frontrunner Awards 2021 Finalist, Neuron 2021, Neuron 2022, Frontrunner 2022 Finalist, Frontrunner 2022 Winner, Dataiku Frontrunner Awards 2021 Participant, Frontrunner 2022 Participant, Neuron 2023 Posts: 415 Neuron

    Hi @Gourab_Chanda
    .

    The difference between a managed dataset and an external dataset can be found here: https://doc.dataiku.com/dss/latest/concepts/index.html#managed-and-external-datasets

    So, if you create a dataset by connecting to an already existing data source, that is called "a external dataset", and since it was not created by Datiku, it would be dangerous to delete those when deleting the project, as those data sources do not belong in principle to DSS.

    On the other hand, if the dataset is created by Dataiku, and the data is stored in a database or file system, independently if it the storing is local to your machine or in a different server, then this is a "managed dataset", and those are deleted provided that when Deleting the project you check that option.

    External vs managed in this context doesn't mean if the data source or data storage is local to your machine or if it lives in a different "external" server, but it has to do with whom created the dataset.

    Hope this helps!

Setup Info
    Tags
      Help me…