Newbie here. Trying to convert a SQL from HIVE that pulls records partly based on several JOIN conditions but limits those record based on other JOIN conditions. In SQL it is a "WHERE NOT EXISTS" condition. The following is the code - SELECT x FROM y, z Multiple left joins… (and) WHERE NOT EXISTS ( SELECT 1 FROM…
Hello everyone, I would like to prevent python from inferring the data type of my dataframe during a python recipe . For example, I would like an id column to remain in string type rather than dataiku converting it to float. I could, for example, convert each of my columns manually, but this is tedious for datasets with…
Hello everyone ! I have a dataset with empty values in one of the columns (col1) and I use a group by recipe on an other column (col2) without empty values with col1_distinct as aggregation. I get a volume of 21, 199 and 1608 for the 3 col2 fields. But I wanted to add a condition on col1 with a prepare recipe with a…
Dataiku Version: 13.3.1 I have several LLMs that I have in my DSS cache from the HuggingFace Connection. I can leverage these models using the prompt recipe. However I am struggling to use them in custom python scripts. For example if I get the LLM id of all the models I have a connection to using the following script, and…
Hi, can someone help me please. Given a folder input and a folder output I want to link them with a shell script so that the shell script can read a test.txt file from input folder and write the output.txt file in the output folder with a .sh script but when i use the variables of dataiku it doesn't work. Here an example…
How do I identify a record where the value in a specific field is NULL, in a Recipe filter condition?
Hi, I have a Greenplum database table as an input for my dataset, let's name this as gp_tableA. From this dataset, I have a Prepare recipe, we name it as compute_gp_tableA_prepared. I only use this recipe to rename some columns and remove unwanted ones. I notice at the output dataset (gp_tableA_prepared), one of the…
Exception: Reading dataset failed: failed to read data from table, caused by: SnowfalkeSQLLoggedException: JDBC driver internal error: Timeout waiting for the download of #chunk0(Total chunks: 79)
Hello, I would like to round the numbers of my datasets with only 1 or 2 decimals. Does someone know how to do it easily ? Thanks in advance !
A new scenario step was added in a recent DSS version which is to execute a Python unit test. I'd like to start using this. However, the documentation is pretty brief: "this step executes one or more Python pytest tests from a project’s Libraries folder using a Pytest selector". Anyone have more details on or an example of…
Create an account to contribute great content, engage with others, and show your appreciation.