-
Dataiku Scenario
I need to run a scenario within the same scenario(recursion or self call). It is basically running the scenario within a loop until a condition is failed or satisfied. Please help.
-
Metrics and Checks
Hey, hope y'all are doing great I have a quick question regarding using metrics and checks. I have a table and I want to do basic checks such as duplication of primary keys, duplication of records, missing values in a particular column etc. I used metrics to compute the count all of these along with checks and have them…
-
Data Quality Checks in a flow
Hi everyone, I have a quick question: I am importing datasets from snowflake, in a proper flow i.e., source -> warehouse -> data mart. 1) can either create them again in dataiku (double work) OR is there a way that this whole staging process goes in a proper sequential manner within the dataiku? 2) In this whole flow i…
-
Retrieve data based on tree structure inside a column
Hi, I'm trying to create a new column, "Parent", based on the 2 columns X and Y. It's kind of a basic tree structure in Y. 2 is the daughter of 1 and so on, as in the example below. In the "Parent", we use the Y column, but with the X value. I really don't know how to do that, I tried to do it with a formula/windows recipe…
-
Show only 1 Labor value per project.
Hi Team Trying to achieve the result by joining and then applying the window function to get Labor only in 1 record for project instead of all records. Attached is the data and the current o/p vs desired output Please help me in getting this answered to derive the solution Operating system used: Browser
-
kubernetes EKS storage file limit
hello I am contacting you because I am having difficulty using Eks S3 on Dataiku. When saving data on the Dataiku server, there is no limit to the number of data, but when saving data on EKS s3, no more than 4000 pieces can be saved. I don't think it's a storage issue because the file formats are all different, but I'd…
-
Trailing zeros in Metrics
I am using the Dashboard facility to show metrics that have been rounded to 1 d.p. within a prepare recipe. The resulting metrics sometimes show several trailing zeros, after the 1st decimal place. Is there a way for me to resolve this? I have checked the underlying dataset (which has just 1 row of data) and the all…
-
Adding images in Prompt Studio for the Gemini Pro Vision model
Hello everyone, I would like to know if it is possible in Gemini Pro Vision, through Prompt Studio, to add images as a parameter. If so, should this image be a URL, a decoded image, bytes? Thank you very much in advance for your help !Ludovic
-
Filter rows in partitioned dataset
Hi All, I have a partitioned dataset which is partitioned by column 'current' and each partitioned contains 2000000 rows. That's why I want to filter rows in the attached dataset so that whenever there is a value in column 'current', I will return its whole corresponding row and all rows located 1 minute before the row…
-
Build several partitions in one go
Hi, I want to synchronize an Oracle table of 1 billion rows to another Oracle table. The query is very long and I end up with the following Oracle error: [11:06:27] [INFO] [dku.output.sql] - appended 178620000 rows, errors=0 [11:06:27] [INFO] [dku.utils] - Closing oracle.jdbc.driver.T4CConnection@7fc1cb4f [11:06:27] [INFO]…