Announcing the winners & finalists of the Dataiku Frontrunner Awards 2021! Read their inspiring stories

Update/replace dataset

MI_POLAND
Level 1
Update/replace dataset

Hello,

I'm a beginner in Dataiku, I built few flows, did tranings.

 

Now I'm stucked with a problem: I built a flow (prepare, filter, group etc) and now I've got some updated data. I would like to replace very first dataset with my new data. Number of columns is the same, some column names are changed.

Is it possible?

I know recipes uses column names and it will not be possible for some of them to recognize new column names. But still - can I do it? Will it show me points where I need to update column names/positions?

Sorry for my English, I'm trying my best 🙂

 

Thank you in advance for any comment! 

0 Kudos
2 Replies
AlexT
Dataiker
Dataiker

Hi @MI_POLAND ,

Welcome to the Dataiku Community!

If the input dataset changes and column names have changed you will need to Propagate the new schema in your flow.  The course  here goes into more detail. 

But in summary, if you right-click on your input dataset  once you have updated it and saved the new schema you can then select "Propagate Schema across Flow from here" 

Screenshot 2021-10-13 at 15.35.18.png

 

Hope this helps!

0 Kudos
MI_POLAND
Level 1
Author

Hi @AlexT ,

thanks for the welcome.

I went for a simple solution, which is replacing files in setting section of a dataset - worked for test files and the propagation did its job.

I'll work on target large files next week and we'll see how it goes:)

Thank you!

0 Kudos
A banner prompting to get Dataiku DSS