-
Create dataset based on criteria
I have a folder of 40k+ images, and I have created a separate dataset from multiple sources that resulted in a list of image names that I want to keep, plus some key attributes. How can I create a new image folder from my big image folder that only contains the images with names that match a column in one of my prepared…
-
Test Amazon S3 connection
I set up an Amazon S3 connection to a MinIO server using an HTTP endpoint and the path-style option. When I click the TEST button, the following error appears: Connection failed: Bad Request (Service: Amazon S3; Status Code: 400; Error Code: 400 Bad Request; Request ID: 17EC370F655CA6AA; S3 Extended Request ID:…
-
dataiku internal client not compatible with Python 3.12
Hi Dataiku Guru, We used the dataiku internal client outside of Dataiku in a Python script, but in Python 3.12, importing the dataiku module raises an error due to the deprecation and removal of the imp module in Python 3.12: https://github.com/python/cpython/issues/98040 The file dataiku/base/utils.py still uses imp. When will the…
-
Creating a data extract with SQL and passing a filter parameter
I am trying to create a data extract from Snowflake. I have selected the connection and written my SQL. I want this SQL to be available to multiple users so that they can apply filters to the dataset to retrieve the information. For example, in my dataset I have a "COUNTRY" column. I want a way for users to use the…
-
How can I replace a dataset created from a csv?
I have uploaded a CSV and stored it in the filesystem_folders. I have built several recipes from this dataset. I have now received an updated version of the CSV, but cannot figure out how to upload it and overwrite the original dataset. It seems to require that I create a new dataset. If I do create a new dataset, there doesn't…
-
Data Masking
Hi, I have a general question about data masking in Dataiku. How well can data masking be implemented, and are there out-of-the-box tools for it? I am especially interested in static masking and deterministic masking. Thank you. Best regards, Mic
-
How to automatically monitor the status of the API services deployed on one node?
Today, we have several API services deployed in a production environment and used by different IT applications. We would like to monitor the state of these services: are they active and OK? Do you know a way to automate this check and be notified in case of an issue? If nothing has been designed for this purpose, I was…
-
Any way for Streamlit Code Studios to share code for multi-person development?
Hi, We have a project with multiple developers who want to write a Streamlit web app using Code Studios. However, it seems that only the person who created the Code Studio can view the code. Is there any way around this behavior? If this is how Code Studios work, I fail to see how they are a good solution, for frankly…
-
Schedule export functionality in flow
I am exporting a dataset to Power BI and would like to schedule the export daily so that Power BI always has the latest data. How can I achieve that?
-
How to stop a failed scenario
Hello, I have not been able to find a solution, so I am turning to you. I have two scenarios: the first one builds a flow (no problem here), and the second one consists of two steps. The first step increments "selected_MOIS" (example: 202201), then the second step checks the following condition in order to launch the…