-
Delete records based on multiple JOINs
Newbie here. Trying to convert a SQL from HIVE that pulls records partly based on several JOIN conditions but limits those record based on other JOIN conditions. In SQL it is a "WHERE NOT EXISTS" condition. The following is the code - SELECT x FROM y, z Multiple left joins… (and) WHERE NOT EXISTS ( SELECT 1 FROM…
-
Avoid python recipe to change data types
Hello everyone, I would like to prevent python from inferring the data type of my dataframe during a python recipe . For example, I would like an id column to remain in string type rather than dataiku converting it to float. I could, for example, convert each of my columns manually, but this is tedious for datasets with…
-
Group by with empty value and with Null value
Hello everyone ! I have a dataset with empty values in one of the columns (col1) and I use a group by recipe on an other column (col2) without empty values with col1_distinct as aggregation. I get a volume of 21, 199 and 1608 for the 3 col2 fields. But I wanted to add a condition on col1 with a prepare recipe with a…
-
How to leverage locally hosted llm using python scripts
Dataiku Version: 13.3.1 I have several LLMs that I have in my DSS cache from the HuggingFace Connection. I can leverage these models using the prompt recipe. However I am struggling to use them in custom python scripts. For example if I get the LLM id of all the models I have a connection to using the following script, and…
-
Error with instance of APP111ParametersAnalyzer
Hello, I try to create an instance of APP111ParametersAnalyzer, but when I did that I have this problem. "Invalid connection for dataset product_database_prepared_filtered : filesystem_managed (Filesystem) is not freely usable and needs to be remapped." Do you know how I can fixe this issue? Operating system used: Data IKU…
-
Read files in managed folders with shell
Hi, can someone help me please. Given a folder input and a folder output I want to link them with a shell script so that the shell script can read a test.txt file from input folder and write the output.txt file in the output folder with a .sh script but when i use the variables of dataiku it doesn't work. Here an example…
-
Use a Recipe filter where value is NULL
How do I identify a record where the value in a specific field is NULL, in a Recipe filter condition?
-
space character handling in Dataiku dataset
Hi, I have a Greenplum database table as an input for my dataset, let's name this as gp_tableA. From this dataset, I have a Prepare recipe, we name it as compute_gp_tableA_prepared. I only use this recipe to rename some columns and remove unwanted ones. I notice at the output dataset (gp_tableA_prepared), one of the…
-
dataiku failed to load data from snowflake(large dataset)
Exception: Reading dataset failed: failed to read data from table, caused by: SnowfalkeSQLLoggedException: JDBC driver internal error: Timeout waiting for the download of #chunk0(Total chunks: 79)
-
Decimal format
Hello, I would like to round the numbers of my datasets with only 1 or 2 decimals. Does someone know how to do it easily ? Thanks in advance !