-
Issue when I import module
Hello everyone, First of all, I've already checked the "__init__.py" file in the directory and I added the python path in external- librairies.json. The fact is that the problem occurs even when I'm using a notebook provided by dataiku with one of the tutorial (so that must work ??? That why I suppose that I did sth wrong…
-
Decimal point restriction
How to restrict the decimal values upto 4 decimal places in all the double decimal type column in dataiku
-
[WEBAPP] How to run a scenario with a local variable
I have a python recipe in my scenario working with a local variable. I have succesfully executed my scenario within the webapp but : How can I change the local variable from the webapp's code (in the Python back-end) ?
-
How to chart time series longer than 20 data points?
I'm creating a dashboard for reporting abnormal data series, looking at the past 90 day sales. I want to create the line chart of the past 90 day sales to monitor spikes and abnormalities. But the line chart only shows about 20 data points and lump all rest into "other" categories. Is this the default setting of chart…
-
How Do I filter the content of a managed folder and output to another managed folder?
This is a question I've asked myself and solved with a little Python code so I thought I'd share. I had a folder with several subfolders, each containing a JPEG for each page of the original PDF (for context this folder is the output of the Greyscale recipe from our Text Extraction plugin). I really only want to parse…
-
Dataiku remove duplicate data
Hi all in dataiku after doing a left join operation I am removing all the duplicate values, but when I exported the dataset as an excel I noticed there are few rows which are exactly matching but are not removed or considered duplicate. Once I did the TRIM operation in excel then duplicate values were considered and…
-
Problem using python recipe
I have a problem when using a python recipe that uses the kubernetes engine when running. I checked the log step by step, the python process should have been completed, but for some reason the kubernetes engine does not show a sign of completion and looks like an unfinished loop. Please help with this error.
-
How to reproduce a groupBy with selected values without aggregations ?
SELECT DISTINCT N1.COLUMN1 as "Column 1", max(N1.COLUMN2) as "Column 2" FROM DB GROUP BY N1.COLUMN1 ORDER BY N1.COLUMN1 Hello, I'm trying to reproduce an sql script with Dataiku recipes, and in that script, there is a groupBy with selected columns with no aggregation, how do I reproduce this with Dataiku recipes and…
-
Batch Processing for Custom API end point
I’ve developed a custom Python API endpoint for regression and successfully predicted outcomes for individual records. However, when I attempt to process a batch of records, I encounter the following error: "Failed: Could not parse a SinglePredictionQuery from request body, caused by: JsonSyntaxException: Expected a…
-
Create API from visual RAG
I'm using visual recipes to create a simple RAG system. Can I create API from my RAG set up? Operating system used: Linux