Import Dataset from SFTP - Regex Based Rules for file selection not working
Hi, I am trying to import a set of xls files from a functioning SFTP connection from my DataIKU DSS 11. The inclusion and exclusion rules based on the Regex 'xls' is not filtering for the xls files. Instead it is listing all the files that are available for viewing on the SFTP site.
Operating system used: Windows 10 Professional
Answers
-
Miguel Angel Dataiker, Dataiku DSS Core Designer, Dataiku DSS ML Practitioner, Dataiku DSS Adv Designer, Registered Posts: 118 Dataiker
Hi,
From the screenshot we can see no files were selected with the regex provided. This makes sense because 'xls' is not really a regular expresion.
If you just want to select all the files with that extension, you can just set '.*xls' as the inclusion rule (without the quotes). If you want to further fine tune which xls files to select, a more refined regex would be necessary. Fortunately, there is ample documentation online on them, plus there are also helpers that can aid in the trial and error necessary while building the expresion. For example: https://regex101.com/