Added on March 11, 2025 2:06PM
I would like to know which data is used in a flow.
For example, my input dataset contains 10 data items but only 4 are needed for calculations; I want to be able to identify the 6 "useless" ones.
Do you have any idea how to do this?
Thanks
I think by data you really mean rows. Do your rows of data have any unique identifier? If so, you can use it to identify which ones are being used and which are not. If not, you could use the Window recipe to give every row a random ID and use that to track down where they are being used. Where is your dataset stored?
Thanks for your answer.
I'm really talking about columns:
Example:
I have a dataset with 5 columns, of which 3 are used in the recipes. I would like to be able to easily identify them and exclude the 2 unused columns.
In v12 there isn't much you can do. But v13 has a new Column-level Data Lineage feature, so that's another good reason to upgrade.
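
If upgrading isn't an option right away, one manual workaround is to write down which columns each recipe reads and compare that with the dataset's schema. Below is a minimal sketch in plain Python. It does not use any Dataiku API; the column names, the recipe names, and the `find_unused_columns` helper are all made up for illustration, and the per-recipe column lists are assumed to have been collected by hand.

```python
def find_unused_columns(dataset_columns, columns_used_by_recipe):
    """Return the dataset columns that no recipe reads, in schema order."""
    used = set()
    for columns in columns_used_by_recipe.values():
        used.update(columns)
    return [column for column in dataset_columns if column not in used]


# Hypothetical schema: 5 columns, as in the example above.
dataset_columns = ["id", "name", "price", "comment", "legacy_code"]

# Hypothetical, hand-collected mapping of recipe name -> columns it reads.
columns_used_by_recipe = {
    "prepare_sales": ["id", "price"],
    "join_customers": ["id", "name"],
}

unused = find_unused_columns(dataset_columns, columns_used_by_recipe)
print(unused)  # ['comment', 'legacy_code']
```

The result is only as reliable as the hand-built mapping, so treat it as a checklist rather than a replacement for real lineage tracking.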