Survey banner
The Dataiku Community is moving to a new home! We are temporary in read only mode: LEARN MORE

Is it possible to retrieve the selected files list in an input S3 dataset ?

Solved!
Victorine
Level 1
Is it possible to retrieve the selected files list in an input S3 dataset ?

Hi,

I created an input S3 dataset based on REGEXP mode to select the files inside the S3 connection.

Capture17.PNG

I clicked on "List files" i got the list of selected files.

Capture18.PNG

Is it possible to retrieve the selected file names with Python API ? Could you please give us the snippet code to do that ?

Thank you for your help,

Best regards,

Victorine

0 Kudos
1 Solution
Clément_Stenac

Hi,

Yes it is possible, using this code:

import dataiku
dataiku.Dataset("yourdataset").get_files_info()["globalPaths"]

View solution in original post

0 Kudos
2 Replies
Turribeach

No, what are you really trying to achieve? If you want the file name in the records you load follow this post I created:

https://community.dataiku.com/t5/Using-Dataiku/Using-the-quot-Files-in-folder-quot-dataset/m-p/33214

 

0 Kudos
Clément_Stenac

Hi,

Yes it is possible, using this code:

import dataiku
dataiku.Dataset("yourdataset").get_files_info()["globalPaths"]
0 Kudos