Convert file names in a folder to a column
Hi community,
I have a specific use case where my data is stored in a Dataiku folder on S3, in separate CSV files added every day.
The extract date is stored only in the file names, not inside the files themselves.
I want to use the 'Create a dataset' function to create a new dataset from this folder and I need the extract date as a column.
Is there any way during this process to create a column that stores the name of each source file, from which I could then extract the date (with a Right 10, for example)? I have not found such an option in DSS.
Thanks a lot for your help,
Operating system used: Linux
Best Answer
Turribeach Dataiku DSS Core Designer, Neuron, Dataiku DSS Adv Designer, Registered, Neuron 2023 Posts: 2,090 Neuron
Indeed, this is possible with a "hidden gem" feature of the Files in Folder dataset that you are already using. See the post below:
Answers
Hi @Turribeach , thanks a lot for your help.
I just tested it and it worked like a charm.
Best regards,
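For anyone landing here later: once the file name is available as a column, the "Right 10" step from the question can be sketched in plain Python as below. This is only an illustration under assumptions not stated in the thread: the file names (such as `/extract_2024-01-15.csv`) are hypothetical, the date is assumed to be the last 10 characters before the extension, and it is assumed to be in `YYYY-MM-DD` format. The helper name `extract_date` is made up for this example.

```python
from datetime import date, datetime
from pathlib import PurePosixPath


def extract_date(file_name: str, fmt: str = "%Y-%m-%d") -> date:
    """Parse the date held in the last 10 characters of the file name,
    ignoring any leading directories and the file extension.

    Assumption: the name ends with a date such as 2024-01-15 right
    before the extension. Raises ValueError if it does not match fmt.
    """
    stem = PurePosixPath(file_name).stem  # "/a/extract_2024-01-15.csv" -> "extract_2024-01-15"
    return datetime.strptime(stem[-10:], fmt).date()


# Hypothetical file names, for illustration only.
names = ["/extract_2024-01-15.csv", "/extract_2024-01-16.csv"]
dates = [extract_date(n) for n in names]
```

Dropping the extension first matters: a plain "Right 10" on the full name would pick up `.csv` instead of the whole date. If the files use another date layout, the `fmt` argument and the slice length would need to change accordingly.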