Hello everyone, I would like to prevent Python from inferring the data types of my dataframe in a Python recipe. For example, I would like an id column to remain a string rather than have Dataiku convert it to float. I could, for example, convert each of my columns manually, but this is tedious for datasets with…
Hello everyone! I have a dataset with empty values in one of the columns (col1), and I use a group by recipe on another column (col2), which has no empty values, with col1_distinct as the aggregation. I get a volume of 21, 199 and 1608 for the 3 col2 fields. But I wanted to add a condition on col1 with a prepare recipe with a…
Dataiku version: 13.3.1. I have several LLMs in my DSS cache from the HuggingFace connection. I can use these models through the prompt recipe. However, I am struggling to use them in custom Python scripts. For example if I get the LLM id of all the models I have a connection to using the following script, and…
Hi, can someone help me please? Given an input folder and an output folder, I want to link them with a shell script (.sh) so that the script can read a test.txt file from the input folder and write an output.txt file to the output folder, but when I use the Dataiku variables it doesn't work. Here is an example…
How do I identify a record where the value in a specific field is NULL, in a Recipe filter condition?
Hi, I have a Greenplum database table as the input for my dataset; let's call it gp_tableA. From this dataset, I have a Prepare recipe, which we'll call compute_gp_tableA_prepared. I only use this recipe to rename some columns and remove unwanted ones. I notice that in the output dataset (gp_tableA_prepared), one of the…
Exception: Reading dataset failed: failed to read data from table, caused by: SnowflakeSQLLoggedException: JDBC driver internal error: Timeout waiting for the download of #chunk0(Total chunks: 79)
Hello, I would like to round the numbers in my datasets to only 1 or 2 decimal places. Does anyone know an easy way to do this? Thanks in advance!
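For the rounding question above, here is a minimal sketch in plain Python, assuming the values have already been loaded into a list (for instance inside a Python recipe). The helper name `round_values` and the sample data are hypothetical. It rounds halves away from zero via the standard `decimal` module, which differs from the built-in `round`, and leaves empty values untouched.

```python
from decimal import Decimal, ROUND_HALF_UP


def round_values(values, ndigits=2):
    """Round each number to `ndigits` decimal places (halves round away from zero).

    `None` entries (empty cells) are passed through unchanged.
    """
    # Decimal(1).scaleb(-2) is Decimal("0.01"): the precision to quantize to.
    quantum = Decimal(1).scaleb(-ndigits)
    return [
        float(Decimal(str(v)).quantize(quantum, rounding=ROUND_HALF_UP))
        if v is not None
        else None
        for v in values
    ]


# Hypothetical sample column
prices = [3.14159, 2.675, 10.0, None]
rounded = round_values(prices, ndigits=2)
print(rounded)
```

Going through `Decimal(str(v))` avoids surprises from binary floating point, where `round(2.675, 2)` returns `2.67` rather than `2.68`.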
A new scenario step was added in a recent DSS version to execute a Python unit test. I'd like to start using this. However, the documentation is pretty brief: "this step executes one or more Python pytest tests from a project’s Libraries folder using a Pytest selector". Anyone have more details on or an example of…
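As a rough illustration of what such a test could look like, here is a minimal pytest-style module. The file name `test_pricing.py`, the function `add_margin`, and its placement in the project's Libraries folder are all hypothetical; this is a sketch of an ordinary pytest test, not the layout DSS requires.

```python
# Hypothetical file: test_pricing.py, stored somewhere in the project's Libraries folder.
# pytest collects functions whose names start with "test_" from files named test_*.py.


def add_margin(price, rate):
    """Hypothetical library function under test: apply a margin rate to a price."""
    return round(price * (1 + rate), 2)


def test_add_margin_applies_rate():
    # A 20% margin on 100.0 should give 120.0
    assert add_margin(100.0, 0.2) == 120.0


def test_add_margin_zero_rate():
    # A zero margin leaves the price unchanged
    assert add_margin(50.0, 0.0) == 50.0
```

In plain pytest, a single test is addressed by a node ID such as `test_pricing.py::test_add_margin_applies_rate`. Whether the scenario step expects exactly that form for its selector is an assumption to check against the DSS documentation.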
Hello, I'm currently using Dataiku and SQLExecutor2 to run queries on my Oracle database hosted on AWS RDS, port 2484. When I execute the same query from a Docker container on AWS, the query takes about 15 ms. However, when I run it in Dataiku, it takes approximately 1 second, and the whole process, which takes about 8…