-
[WEBAPP] How to run a scenario with a local variable
I have a python recipe in my scenario working with a local variable. I have succesfully executed my scenario within the webapp but : How can I change the local variable from the webapp's code (in the Python back-end) ?
-
How to chart time series longer than 20 data points?
I'm creating a dashboard for reporting abnormal data series, looking at the past 90 day sales. I want to create the line chart of the past 90 day sales to monitor spikes and abnormalities. But the line chart only shows about 20 data points and lump all rest into "other" categories. Is this the default setting of chart…
-
How Do I filter the content of a managed folder and output to another managed folder?
This is a question I've asked myself and solved with a little Python code so I thought I'd share. I had a folder with several subfolders, each containing a JPEG for each page of the original PDF (for context this folder is the output of the Greyscale recipe from our Text Extraction plugin). I really only want to parse…
-
Dataiku remove duplicate data
Hi all in dataiku after doing a left join operation I am removing all the duplicate values, but when I exported the dataset as an excel I noticed there are few rows which are exactly matching but are not removed or considered duplicate. Once I did the TRIM operation in excel then duplicate values were considered and…
-
Problem using python recipe
I have a problem when using a python recipe that uses the kubernetes engine when running. I checked the log step by step, the python process should have been completed, but for some reason the kubernetes engine does not show a sign of completion and looks like an unfinished loop. Please help with this error.
-
How to reproduce a groupBy with selected values without aggregations ?
SELECT DISTINCT N1.COLUMN1 as "Column 1", max(N1.COLUMN2) as "Column 2" FROM DB GROUP BY N1.COLUMN1 ORDER BY N1.COLUMN1 Hello, I'm trying to reproduce an sql script with Dataiku recipes, and in that script, there is a groupBy with selected columns with no aggregation, how do I reproduce this with Dataiku recipes and…
-
Batch Processing for Custom API end point
I’ve developed a custom Python API endpoint for regression and successfully predicted outcomes for individual records. However, when I attempt to process a batch of records, I encounter the following error: "Failed: Could not parse a SinglePredictionQuery from request body, caused by: JsonSyntaxException: Expected a…
-
Create API from visual RAG
I'm using visual recipes to create a simple RAG system. Can I create API from my RAG set up? Operating system used: Linux
-
The recipe execution is taking long time due to handling a large volume of data in dataiku
We are experiencing long execution times for a recipe in Dataiku due to handing large datasets, while we have implemented partitioning using a filter on a specific column, it still takes 1.5-2 hours to partitioning 30M records. Is there a more efficient way to handle and process this data quickly and effectively because…
-
Default content for project description
Hello DataIKU friends, I would like to enforce better documentation standard across our DSS project, one way I would like to do this is by having a "template" project description which each author completes, which follows a sec structure. is it possible to give DSS a default value for the "description" field when a new…