Using Dataiku

Application as Recipe Inputs are Broken (or insanely obtuse to use)
I have a project that I built out to be an Application-As-A-Recipe to upload a Dataiku dataset as a file to our API. I will refer to my Application-as-a-Recipe as my "child process" for brevity's sake. Another Project calls this recipe within its flow. The child process has a scenario to build out all datasets and…
Using Scenarios to automatically retrain models
Hi, I'm new to Dataiku and the community and I'm using Dataiku online. Documentation indicates that scenarios can be used to "Automate the retraining of “saved models” on a regular basis, and only activate the new version if the performance is improved". This is exactly what I need to set up but I can't seem to find…
[Python API] Get path of the file inside a "UploadedFiles" dataset
Hi, I am a looking the name of the input file inserted into an "UploadedFiles" dataset. For a managed folder, I am using the "list_contents" function to do so and it works perfectly. My code is currently the following import dataiku project = dataiku.api_client().get_default_project() dataset =…
Error while running K-means clustering
Hello, I tried to run K-means clustering on version 13.0.2 and got the attached error. Can anybody help please? Operating system used: Mac OS Sonoma 14.5
how to add calculated filters in Charts ?
I want to embed calculated filters (if x >1 set display color to blue, if x > 1.15 set color to yellow etc…) in a chart. Let's say Bar chart for example. I know how to add static filters but is there a way to add filter based on a calculation. here in this chart , each color is a parameter. I want to change parameter color…
File '/data/dss-data/saved_models/xxxxxx/ASB22I5z/versions/17235340x1xxx/user_meta.json' does not
I get this error while using saved ml model. I build my own data to predict (same variables as train) and when I use predict recipe I can't run : I get error
Splitting dataset into groups and process each group seperately
I'm new to dataiku and any help would be appreciated, I have a scenario where I have to send mail to sales representative which contains questions asked by its associated client (customer that he manages) eg Input SR_Mail | Client_name | Question | — - - - - - - - - - - - - - - - - - abc@def.com | SBC | ABCD ? |…
Connect to Oracle NetSuite
Hello, team. A client tells me that his data is hosted on Oracle NetSuite. When he wants to view them, he installs the NeuSuite odbc driver on his Windows machine where he only enters the host, port, and service name. Subsequently, to display information in tables, it uses SQL Server with Link Server where he enters again…
Type after Python recipe
Hello all, I've an issue in DSS. A dataset where i've forced the type in setting, schema is an input of a python recipe. I try with different proposition of the developper guide but i'm lost because each time i've an error. I forced the type of column "MASTER-ID" who is the ref of the object in int. At this first level…
モデルのトレーニングが失敗する
Mac OSでインストールしたDataikuを利用しています。モデルをトレーニングしたところ以下エラーが出ます。エラーに従ってインストールが必要なのでしょうか？ Failed to train : <class 'xgboost.core.XGBoostError'> : XGBoost Library (libxgboost.dylib) could not be loaded. Likely causes: * OpenMP runtime is not installed - vcomp140.dll or libgomp-1.dll for Windows - libomp.dylib for Mac OSX -…

Trending Discussions

Logging in dataiku notebook / recipe ...
Hello Team, I am working on pyspark recipes. I use notebook to build the logic and change it back into recipe. The dataiku and spark operations ( e.g. df.count() ) emits a lot of log statements to the console and makes the notebook very difficult to use. Is there a way for me to supress logging from dataku and spark APIs?…
Defining a global variable in the base name of the output file for a dataset
Hello I am working on a flow that has a python recipe that sets global variables. In the output dataset of the recipe a couple of these variables are being used to set the path and filename of the dataset which is stored in Azure. From researching on how to define the filename it states to set the "Force single output…
i am looking a strange error while accessing my dataflows
flows were working previously but now this error window limits me to use any of my projects

Leaderboard

Turribeach 3581

tgb417 2477

Ignacio_Toledo 1079