Copying a sub flow to another project

Options
ben_p
ben_p Neuron 2020, Registered, Neuron 2021, Neuron 2022, Dataiku Frontrunner Awards 2021 Participant Posts: 143 ✭✭✭✭✭✭✭

Hi everyone,

I have, for the first time, copied a sub flow into a new project. I have a couple of questions around what this actually means, and I can't see them mentioned in the docs.

In my new project, where I see a copy of my flow, the source dataset is now black - what does this mean?

Capture.PNG

Do I need to take any action here? Also, with regards to the names of all the other datasets, i assume I don't need to rename them as they are now in a project.

When I look back at my old flow, my original dataset now has a little tick/arrow on it:

ben_p_0-1581331759393.png

What does this mean? Is some relationship being maintained between these projects now, because I copied this flow over?

Thanks for your help,

Ben

Best Answer

  • DamienJ
    DamienJ Dataiker, Registered Posts: 10 Dataiker
    Answer ✓
    Options

    Hi Ben,

    The black icon and the arrow mean the dataset is shared across project.

    You can see the dependencies of projects using the graph mode (DSS homepage --> See all projects --> switch mosaic to graph).

    If you want to get rid of the dependencies, sync the shared dataset in your new project and unshare it.

    Best,

Answers

  • EstelleB
    EstelleB Dataiker, Registered Posts: 6 Dataiker
    Options

    Hello,

    Your observation is accurate!

    1/ When the source dataset is black in "project B", it means that it is a dataset that is shared from another project in project B. However, this dataset is actually computed in this other project - let's call it "project A".

    2/ Yes, you don't need to rename them if you have copied your sub-flow in another project because now your datasets complete names will be PROJECT_KEY.dataset_name. Here the project key is different for PROJECT_A and PROJECT_B. If you had copied them inside the same project, then you would have needed to rename them to avoid conflicts in your database.

    3/ The little arrow on your dataset in your project A means that this dataset is shared with one or several other projects. If you click on "Share" in the panel "Action" on the right, then you will have the list of projects were your dataset has been shared. You only need to make sure this source dataset is updated in project A if you need to use it in project B. Otherwise, directly recreate your dataset in project B.

    Little tips: You can unshare it either going in project A: Action > Share > Delete the project in the list > Save, or going in project B: Action > Other Actions > Stop sharing to this project.

    Good luck with your project,

    Estelle

  • ben_p
    ben_p Neuron 2020, Registered, Neuron 2021, Neuron 2022, Dataiku Frontrunner Awards 2021 Participant Posts: 143 ✭✭✭✭✭✭✭
    Options

    Thanks @DamienJ
    - I don't want the dependency, what are the steps required to sync and unshare the dataset?

  • DamienJ
    DamienJ Dataiker, Registered Posts: 10 Dataiker
    Options

    1. Create a sync recipe from the shared dataset and run it. Let's assume the output dataset name is "dataset_copy"

    2. Go the the recipe (double click) just after the the shared dataset that was here before the sync
    --> go to input/output
    --> change the input to be the dataset dataset_copy
    --> save

    4. Delete the sync recipe

    5. Click on the dataset you want to unshare --> actions --> unshare

    You also might want to rebuild the subflow you copied: "Build All" from "Flow action" at the bottom left.

    Best,

  • ben_p
    ben_p Neuron 2020, Registered, Neuron 2021, Neuron 2022, Dataiku Frontrunner Awards 2021 Participant Posts: 143 ✭✭✭✭✭✭✭
    Options
    Thank you for the explanation, that works nicely!
  • DamienJ
    DamienJ Dataiker, Registered Posts: 10 Dataiker
    Options
  • Sajid
    Sajid Partner, Dataiku DSS Core Designer, Dataiku DSS Adv Designer, Registered Posts: 16 Partner
    Options

    Hey @EstelleB
    ,

    Can you please help me on where can I find the options,

    Shared From: Action > Share > Delete the project in the list > Save, OR

    Shared To: Action > Other Actions > Stop sharing to this project.

    I am unable to find that option, I am using version 9.0.4

    Thanks,

    Sajid

  • Sajid
    Sajid Partner, Dataiku DSS Core Designer, Dataiku DSS Adv Designer, Registered Posts: 16 Partner
    Options

    Thanks @DamienJ
    ,

    It helped me. But now I am facing issues in reverting the share step in version 9.0.4

  • Sajid
    Sajid Partner, Dataiku DSS Core Designer, Dataiku DSS Adv Designer, Registered Posts: 16 Partner
    Options

    I have found the option.

    Thanks

  • Sajid
    Sajid Partner, Dataiku DSS Core Designer, Dataiku DSS Adv Designer, Registered Posts: 16 Partner
    Options

    I have found the option.

    Thank You

  • CoreyS
    CoreyS Dataiker Alumni, Dataiku DSS Core Designer, Dataiku DSS Core Concepts, Registered Posts: 1,150 ✭✭✭✭✭✭✭✭✭
    Options

    Hi @Sajid_Khan
    if you wouldn't mind, could share where you found it (like a link to it) or what that option is?

  • Sajid
    Sajid Partner, Dataiku DSS Core Designer, Dataiku DSS Adv Designer, Registered Posts: 16 Partner
    Options

    Sure @CoreyS
    ,

    There are 2 methods as @EstelleB
    has explained

    1) Project shared from: You can select the shared dataset (the one with black color or the arrow mark). Once you click, on top right Actions > Share > Scroll to the bottom > Delete the project in the list. Here you get a list of projects where you have shared the selected dataset, you can delete the one you want to and then save the changes. This step worked for me as there were some access limitations on the other (shared to) project. You can try both and see which works for you.

    2) Project shared to: You can select the shared dataset (the one with black color or the arrow mark). Once you click, on top right Actions > Share > Scroll to the bottom > Stop sharing to this project

    Please let me know if any more details required.

    ------------------------------------------------------------------------------

    Thanks @EstelleB
    for this. It really helped.

Setup Info
    Tags
      Help me…