-
dataiku failed to load data from snowflake(large dataset)
Exception: Reading dataset failed: failed to read data from table, caused by: SnowfalkeSQLLoggedException: JDBC driver internal error: Timeout waiting for the download of #chunk0(Total chunks: 79)
-
I'm somewhat embarrassed as this should be basic and incredibly simple but ...
I'm working with CSV files and developed a couple of recipes with one dataset until they did what I wanted. I assumed that replacing the development dataset would be dead simple and I would just click to replace the old data set and be able to specify the new csv file and run the recipes with the new data using the UI. The…
-
Create dataset based on criteria
I have a folder of 40k+ images, and I have created a separate dataset from multiple sources that resulted in a list of image names that I want to keep, plus some key attributes. How can I create a new image folder from my big image folder that only contains the images with names that match a column in one of my prepared…
-
Test Amazon S3 connection
I setup a Amazon S3 connection to a Minio server using an http endpoint and using path style option. When i click on the TEST button , the following error appears: Connection failed: Bad Request (Service: Amazon S3; Status Code: 400; Error Code: 400 Bad Request; Request ID: 17EC370F655CA6AA; S3 Extended Request ID:…
-
Question about the install path of Dataiku
Due to the space in my username(Zhao Guanghao)of my laptop, I can not run dataiku correctly. Meanwhile, the filename under the 'User' file in C disk can't be modified, once you rename it directly, the system can not identify the account and it will collapse. Besides, Dataiku can only be installed in the path 'C…
-
dataiku internal client not compatible with Python 3.12
Hi Dataiku Guru, We used the dataiku internal client outside of Dataiku in a python script, but in python 3.12, when import dataiku module an error raised due to the depreciation of imp python module in pyhon 3.12 https://github.com/python/cpython/issues/98040 The dataiku/base/utils.py still uses the imp When will the…
-
Creating a data extract with a SQL and passing a filter parameter
I am trying to create a data extract from SnowFlake . I have selected the connection and written my sql . I want this SQL to be available to multiple users such that they can apply filters to the dataset to retrieve the information . For example, in my dataset I have a "COUNTRY" column. I want a way for users to use the…
-
How can I replace a dataset created from a csv?
I have uploaded a CSV and stored it in the filesystem_folders. I have built several recipes from this dataset. I have now received an updated version of the CSV, but cannot figure out how to upload it and overwrite the original dataset. It seems to require I create a new dataset. If I do create a new dataset, there doesn't…
-
Data Masking
Hi, I have a general question about data masking in Datiku. How well can data masking be implemented and are there tools for this out of the box? Especially static masking as well as deterministic masking. Than you. Best regards Mic
-
How to monitor automatically the status of the API services deployed on one node ?
Today, we have several API services deployed in a production environment used by different IT applications. We would like to monitor the state of these services : Are they active and ok ? Please, do you know a way to automate this check and be notified in case of issue ? If nothing has been designed for this purpose, I was…