-
Include in a bundle only active model and not all saved models
Our bundle became very big (2Gb) and difficult to import in automation. We have realized that these are the saved models that are too heavy. Currently, we have 10 saved models (older versions of the currently active model) that are used in design for testing but are not needed in automation. We would like to include in a…
-
Turn a custom model in the flow into a model object
I was told that it was possible to turn a custom trained model, typically stored in a managed folder, into a visual model object in the flow. Currently our flow looks like this: but we would like to see something like this in the flow: I couldn’t find any documentation on how to do this, so I’m turning to the Dataiku…
-
Formula
how to concatenate for exemple the first 10 characters of the column [CRIBINV], followed by the entire value of [CRD6002], and then the last 10 characters of [CRIBINV]
-
Dashboard rendering with scenario automation
Dear Dataiku user, I am facing a quite annoying issue when running a scenario that automatically sends a dashboard in PDF format to a diffusion list. When editing the dashboard, organizing the tiles, and exporting the dashboard MANUALLY, everything is as I'd like it to be (although I feel like there's a huge lack of…
-
Connection for dataiku-managed-storage
When I was importing project made by others, there's a error below. But this project was made by others so I can't change the dataset. Any ideas or advice? thanks ERROR Missing connection Missing connection: Connection missing for dataset test (not remapped): dataiku-managed-storage (EC2)
-
How to split dataset based on the value of a column and define the number of the output datasets
Hello, I have a dataset with millions rows in the format as below, and want to split it into two datasets A and B. Is it possible to do by using the visual recipes of Dataiku (without coding)? Dataset A: data with only "target_flg" = 1 Dataset B: data with only "target_flg" = 0, but instead of exporting all rows where…
-
Filtering files on a folder based on a external list
Ok, I am beyond (or behind) a newbie on Dataiku so, bear with me on this. I have a folder containing csv files, the folder contains currently 3000 files, size of each file is probably 100KB at most. but all together they go to maybe 15MM. I've created a dataset based on this folder and filter only rows that I needed with a…
-
Add a recipe in the middle of the flow
Dear Dataiku users, I am having trouble finding an answer to my question. I am building a dataiku flow with multiple recipes. Is it possible to add a recipe (let's say a prepare recipe) in the middle of the flow ? I just want to reorganize the order of my columns for the last datasets created in my flow. The only solution…
-
Score Recipe - FileNotFoundError
I'm following the Scoring Basics course, and when I try to run the score recipe, I get the following error: [09:46:39] [INFO] [dku.utils] - *************** Recipe code failed **************[09:46:39] [INFO] [dku.utils] - Begin Python stack[09:46:39] [INFO] [dku.utils] - Traceback (most recent call last):[09:46:39] [INFO]…
-
Integrating with GitHub and Dataiku DSS
Hello, As per the description for working with Github, (Working with Git — Dataiku DSS 12 documentation) I need a DSS user's public SSH key, and if I don't have it, need to generate SSH key, but the example shown on the description is for whom those use Dataiku Cloud. (I don't use Dataiku Cloud but Enterprise.) How do I…