-
Assistance Needed: Error Encountered in Running Scored Recipe with "Output Explanations" Option
Hi, We are currently facing an issue while attempting to run a scored recipe with the "Output Explanations" checkbox selected. The error message, as captured in the attached screenshot We have examined the input dataset, but been unable to pinpoint the root cause of the issue. Interestingly, the job completes successfully…
-
2 recipe that output to the same datasets
Hi there, I understand that we cannot use the output of another recipe as another output of a recipe. Is there a work way around this? My situation is as below: one of the recipe would have a condition that if necessary it would need to update an output of another recipe with the datasets. the datasets of the other recipes…
-
Pandas version 2 timeline
Hi, is there any timeline for when Dataiku will be supporting Pandas v2? Thanks
-
Repartition discrete partitions on DSS engine failed, but manual input worked, suggestions?
Hello team, I am trying to redispatch a discrete partition through the sync recipe, using the process described in this documentation. I have tried running on the DSS Engine, however, on my end the recipe fails with the "Job process died (killed - maybe out of memory ?)" error. The Spark, Hive, and/or Imapala engine have…
-
Create a dataiku dataframe from api call
Hi all, I am using API designer for which I created one API which takes some datasets from the flow and takes the API response fields, and some calculation is being done which is then stored in a pandas data frame. Now this data frame will be created every time the API is called with a new set of responses, I want to store…
-
Decision Tree and Random Forest Model Settings and Optimization
I am in a course that is teaching Dataiku as an add-on curriculum feature. I would like to know more about techniques to improve model performance for a classification problem using decision trees and random forest. Also, I see that we are only able to see the test results when running the models. Is there a way to see…
-
Path issues while creating a Keras ImageDataGenerator.flow_from_directory for image classification.
Hello everyone. I am trying to train a simple Cat vs Dog image classification model using Keras in Dataiku DSS. However, I am having certain difficulties in constructing the path for flow_from_directory(). Before we get started, here's a structure of the training data I am using. "training_dataset" is present in the…
-
Printing Dataset Size
Good morning, We would like to print the size of all the datasets in a specific flow zone through Python code in order to monitor the diskspace already taken. Do you know if there is a way to accomplish this task? Operating system used: Linux RedHat
-
UPSERT Sample or statement
Hi, I'm new in Dataiku, I hv 1 new project to perform in dataiku which need to do upsert for existing table or dataset. May I know how can I perform the insert/update statment in SQL? Especially on input value from dataset? Let say I have abc dataset with 10 column data, but need to insert/update only 5 from 10 column? How…
-
Error while using datasets which are not declared as inputs on Python recipe.
# -------------------------------------------------------------------------------- NOTEBOOK-CELL: CODE# -*- coding: utf-8 -*-import dataikuimport pandas as pd, numpy as npfrom dataiku import pandasutils as pduproject_name = "ABC_PEI_2023"project = dataiku.api_client().get_project(project_name)prefix =…