-
How to append a dataframe to an existing output dataset
Hello experts, In Dataiku v12.3.0, I am trying to append a dataframe to an existing dataset (with the same schema) using write_dataframe(). But it always overwrites the dataset with the last dataframe, even though the dataset spec is configured like this: dataset.spec_item["appendMode"] = True. The dataset is classified as an output, so it doesn't let me…
-
Changing the default font size for all new charts
I often send screenshots of the charts to my stakeholders over Teams chat to get quick confirmation/alignment on the intermediate results. Feedback I often hear is that they have a hard time reading the legend, axis titles, axis values, etc. I don't want to manually change the font size for all these different…
-
How can I use websockets in a Webapp?
I would like to make a custom Webapp where the Python backend talks to the frontend and vice versa. I see that the Dataiku Answers webapp uses websockets and I would like to do the same. My current attempts using Flask-SocketIO did not work unfortunately, as it seems to use Werkzeug under the hood and I cannot start the…
-
How to load an MLflow model in a script to make predictions
I trained and deployed a model using MLflow in Dataiku. I want to make predictions on a test dataset using this deployed model. However, I don't want to use the "predict" visual recipe. Instead, I want to load the model in a script and make predictions, but I am not able to do it. Operating system used: Linux Operating…
-
Run Scenario after multiple scenarios have completed
Hi - I am trying to create a scenario that will auto-trigger once other time-based scenarios (in other projects) have completed. I think this is possible for one scenario using "Trigger after scenario", which automatically checks the status of a scenario at the frequency you set, but I can't figure out how to do this using…
-
How to Sync Current Date from Greenplum to Oracle with Only the Date Part?
I have added a current_date column to my table in Greenplum using a Prepare recipe (with now() in Formula language). I want to sync this column to an Oracle database, but I need to keep only the date part of the value. For example, I want to convert a value like 2025-01-25T21:50:28.102Z into 2025-01-25 and store it as a…
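The conversion described in this question can be illustrated outside of Dataiku with a minimal Python sketch. It assumes the value arrives as a UTC ISO-8601 string in exactly the format quoted above; the helper name `date_part` is hypothetical, and this is not the Prepare recipe or Oracle-side solution, only a demonstration of dropping the time part.

```python
from datetime import date, datetime


def date_part(timestamp: str) -> date:
    """Parse a UTC timestamp like 2025-01-25T21:50:28.102Z and keep only the date.

    Assumption: the input always ends with a literal 'Z' and has fractional seconds.
    """
    parsed = datetime.strptime(timestamp, "%Y-%m-%dT%H:%M:%S.%fZ")
    return parsed.date()


# The example value from the question.
print(date_part("2025-01-25T21:50:28.102Z").isoformat())  # 2025-01-25
```

Note that the date is taken in UTC; if the target column should reflect a local time zone, the timestamp would need to be converted before truncating.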
-
Split / Join formula
Hi, Is it possible, in a Split recipe (and only a Split recipe, not an SQL one), to use a formula that checks a condition through a join? For example, to check that one of the child lines a of a parent b contains a certain value and, if so, to put all the corresponding lines a ->b* (of which at least one b satisfies the condition) in the split. Best…
-
Folding multiple columns by MULTIPLE patterns
Hello, I have a large number of columns that all follow the same pattern, with 3 parts that can change: Path#_Field_Year. So either the Path #, Field, or Year could vary. For example, I could have: 1_Car_2030, 1_Car_2031, 1_Car_2032, 2_Car_2030, 2_Car_2031, 2_Car_2032. Or I could have: 2_Bus_2040, 2_Bus_2041, 2_Bus_2042, 5_Bus_2030…
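To make the Path#_Field_Year naming pattern concrete, here is a minimal plain-Python sketch of folding such columns into long-format records. It is not the Dataiku Prepare processor; the function name `fold_row`, the `id` column, and the sample values are hypothetical, and the regular expression assumes the field part contains only letters and the year has four digits.

```python
import re

# Assumed column layout: <path number>_<field>_<four-digit year>, e.g. 1_Car_2030.
COLUMN_PATTERN = re.compile(r"^(?P<path>\d+)_(?P<field>[A-Za-z]+)_(?P<year>\d{4})$")


def fold_row(row):
    """Turn every column matching the pattern into one record; skip other columns."""
    records = []
    for name, value in row.items():
        match = COLUMN_PATTERN.match(name)
        if match is None:
            continue  # not a Path#_Field_Year column (e.g. an identifier)
        records.append(
            {
                "path": int(match["path"]),
                "field": match["field"],
                "year": int(match["year"]),
                "value": value,
            }
        )
    return records


# Hypothetical sample row using column names from the question.
row = {"id": "A", "1_Car_2030": 10, "2_Car_2031": 20, "2_Bus_2040": 30}
for record in fold_row(row):
    print(record)
```

Splitting the name into three named groups means any of the three parts can vary independently, which is the situation the question describes.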
-
I want better dataset visualization in a Dataiku dashboard
I need to use a Dataiku dashboard to visualize a dataset with a better look. This is what I want (set different colors for value criteria / bold font / dataframe MultiIndex support / etc.): This is the current visualization of the dataset (quite plain): I am a code designer and can write Python. Is there any solution? Operating…
-
Combining data sources from Blob Storage
Hello, I have a pretty simple problem, but somehow I'm not able to solve it. I'm using Dataiku to do ETL work and I need to pull the data from Blob Storage. Usually I just specify a path, and that is the start of my flow. But in this case, I need to create the first source dataset from multiple blob "folders" that are in a…