Hello, I would like some help creating a hierarchy, as I have had trouble building one. I have a raw file, laid out as shown in the raw Excel data (colored), with parent and child IDs and names, and I would like to get 3 tables: one with all the parent and child names, one with all their IDs, and one…
I have a screenshot function as part of my R Shiny app. Currently, I'm saving the screenshot as follows in a Dataiku managed folder. This code is working fine, and I'm able to save the image as a PNG file: screenshot_folder <- dkuManagedFolderPath("SCREENSHOT_FOLDER") screenshot_path <- paste0(screenshot_folder,"/")…
I keep getting the error below when I try to load data from Oracle to Snowflake using a Dataiku SQL query recipe: User character length limit (25) exceeded by string 'Gomez Hidalgo del Castillo' File 'snowflake_stage/snowflake_stage/tmp.FBe55Q5TgEnla6Jj/out-s0-c1.csv.gz', line 2939, character 1 Row 2939, column…
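One way to locate the offending values before loading is to scan the source column for strings longer than the target column allows. This is a minimal sketch, assuming the limit of 25 from the error message; the function name `find_too_long` and the sample rows are hypothetical and not part of any Dataiku or Snowflake API.

```python
def find_too_long(values, limit):
    """Return (row_number, value, length) for every string longer than limit."""
    return [
        (row, value, len(value))
        for row, value in enumerate(values, start=1)
        if value is not None and len(value) > limit
    ]


# Hypothetical sample; only the second name comes from the error message.
names = ["Ana Ruiz", "Gomez Hidalgo del Castillo", None]
too_long = find_too_long(names, limit=25)
print(too_long)
```

The string quoted in the error is 26 characters long, one more than the limit, so the target column would need to be widened or the value shortened upstream.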
I have tried looping through the project's scenarios using get_last_finished_run(), but it looks as if that throws an exception if a scenario has never run, which can be the case. Since that loop is already within a try/except block, the coding gets tricky. Any help much appreciated here. Operating system used: AWS
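A possible way to keep the outer try/except simple is to isolate the risky call in a small helper that returns None when a scenario has no finished run. This is only a sketch: it catches a broad `Exception` because the exact exception type is not stated in the post, and the `FakeScenario` class and the `id` attribute are hypothetical stand-ins used for illustration.

```python
def last_finished_runs(scenarios):
    """Map each scenario id to its last finished run, or None if it has none."""
    results = {}
    for scenario in scenarios:
        try:
            run = scenario.get_last_finished_run()
        except Exception:
            # Assumption: the call raises when the scenario has never run.
            # Narrow this to the actual exception type once it is known.
            run = None
        results[scenario.id] = run
    return results


class FakeScenario:
    """Hypothetical stand-in for a scenario object, for illustration only."""

    def __init__(self, id, last_run=None):
        self.id = id
        self._last_run = last_run

    def get_last_finished_run(self):
        if self._last_run is None:
            raise ValueError("no finished run")
        return self._last_run


runs = last_finished_runs([FakeScenario("A", "run-1"), FakeScenario("B")])
print(runs)
```

Scenarios that have never run then show up with a None value instead of interrupting the loop.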
I have dataset A and dataset B. I need the aggregate total from one column called "Total Commission" from B. I want to bring it into A and populate a single column with that value. I know I can do this in Python with two dataframes and I know I can do this with a join if I create a join key in the datasets. Is there a…
Hello, is there a way to check and validate data? I have webshop traffic data in my spreadsheet on a daily basis. These are divided into our different channels like SEA, Price Search Engines, SEO and so on. I'm looking for a way to check if there are major discrepancies in new data compared to previous ones. In this way I…
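One simple approach is to compare each channel's new value with the mean of its previous values and flag large relative changes. The sketch below assumes a 50% threshold and a dictionary layout for the data; both are illustrative choices, and `flag_discrepancies` is a hypothetical helper, not a built-in feature.

```python
from statistics import mean


def flag_discrepancies(history, new_values, threshold=0.5):
    """Return channels whose new value differs from the historical mean
    by more than `threshold`, expressed as a relative change."""
    flagged = {}
    for channel, value in new_values.items():
        past = history.get(channel, [])
        if not past:
            continue  # no previous data to compare against
        baseline = mean(past)
        if baseline == 0:
            continue  # relative change is undefined for a zero baseline
        change = (value - baseline) / baseline
        if abs(change) > threshold:
            flagged[channel] = change
    return flagged


# Hypothetical daily traffic per channel.
history = {"SEA": [100, 110, 90], "SEO": [200, 210, 190]}
new_day = {"SEA": 40, "SEO": 205}
flagged = flag_discrepancies(history, new_day)
print(flagged)
```

Here only SEA is flagged, because its new value is 60% below its previous average while SEO stays within the threshold.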
Hi Team, I have created a scenario which consists of 5 steps. Step 1: SQL query to check the latest data. Step 2: set a project variable as the execution start time using the now() formula. Step 3: build the dataset. Step 4: run the ML model. Step 5: set a project variable as the execution end time using the now() formula. My question is about step 2 and step 5…
I am surprised this is missing from the GUI, but what surprises me more is that it's not even shown in the logs. The fact that one needs to query the API to get this data should be a good indication that both the logging and the GUI need an overhaul. I have posted a Product Idea; feel free to vote for it. It also asks…
Is there a data architecture diagram available for Dataiku that shows a complete project?