Hello, I’ve encountered an unusual error while using the Group By recipe in Dataiku. Here’s a summary of the issue: Context: I created a Group By recipe on three columns, applying three custom aggregations using SQL. Input Data: The recipe takes as input a PostgreSQL (PGSQL) table, which is the output of a JOIN operation…
Hello Team, I hope you are doing well. I am currently working on a project in Dataiku 13.1.2, where I am generating embeddings using LLM Mesh in Python code. At present, I am storing these embeddings in a PostgreSQL dataset. However, I would like to store them directly into a Knowledge Bank using Python code. Key…
Hi guys, I have a flow with 2 different scenarios. I have one variable v_idproduct used in a post filter join recipe in a sql code (id_product IN v_id_product). In each scenario I have a different list of id products. I want to modify one of the scenarios so that this filter is no longer applied, allowing all product IDs…
What functionality exists to show the relationship between flowzones. Say we have 42 flow zones with an average of 50 datasets each. Is there a way to summarize the relationships? I am interested in seeing stuff like: Which flow zones have the same color. Which flow zones have datasets that feed into other ones? Which…
Hi Everyone, I started facing an error since today for recipe which contains two outputs, one Dataset and one Managed Folder. Do we know what changed in the recent Dataiku update for this?
hello, i am trying to use the function to use chrome so the user can logging to a website one the user logs in a web scrapping code will be executed however when i run the function i have this error ? are we allowed to call a chrome login inside dataiku instance ? Chrome binary not found at /usr/bin/chromium-browser
On optimization>Initial learning field I put 0.0001, but the system was showing error. I tried 5640,5638, 100,1000. All shows the same error. Please help
Hi everyone, I have a scenario in Dataiku where I am running step by step one flow and there I need to check four datasets. If any of them are empty, I want to send a custom email notification to users mentioning which datasets are empty. Additionally, the process should continue only if at least two datasets have…
Once the load of table is done and flow is successful add "1" to table value indicating load complete.
Appending to output dataset in python code recipes. Currently the way to do this is with a check box in the settings of the recipes that says "Append instead of overwrite". However- this is limiting and does not have good functionality with respect to potential schema changes (this button has caused significant data loss…
Create an account to contribute great content, engage with others, and show your appreciation.