I'm a beginner in Dataiku, I built few flows, did tranings.
Now I'm stucked with a problem: I built a flow (prepare, filter, group etc) and now I've got some updated data. I would like to replace very first dataset with my new data. Number of columns is the same, some column names are changed.
Is it possible?
I know recipes uses column names and it will not be possible for some of them to recognize new column names. But still - can I do it? Will it show me points where I need to update column names/positions?
Sorry for my English, I'm trying my best 🙂
Thank you in advance for any comment!
Hi @MI_POLAND ,
Welcome to the Dataiku Community!
If the input dataset changes and column names have changed you will need to Propagate the new schema in your flow. The course here goes into more detail.
But in summary, if you right-click on your input dataset once you have updated it and saved the new schema you can then select "Propagate Schema across Flow from here"
Hope this helps!
Hi @AlexT ,
thanks for the welcome.
I went for a simple solution, which is replacing files in setting section of a dataset - worked for test files and the propagation did its job.
I'll work on target large files next week and we'll see how it goes:)