-
Set model as an output in a Python recipe.
Hi, I am new to Dataiku and I would like to know how to save a regressor as an output in a Python recipe. I have seen an option to load a model ("Interaction with saved models", Dataiku DSS 11 documentation). How can I set as an output the model created, for example, in this function: def train_model(X_train: pd.DataFrame,…
-
Authentication token has expired. The user must authenticate again.
When I run recipes on tables built from huge datasets, they usually take a long time to compute. When a single recipe exceeds 5 hours, I get this token-expired error and the job fails. Is there a way to extend the allowed run duration?
-
Limit the size of a dataset with appending behavior
Hi, I want to limit the number of rows in a Dataiku dataset. It should keep only the latest 90 rows and delete the oldest. The dataset is built by appending one row at a time to it. I tried, instead of appending directly to the dataset, to create another dataset with just the newest row and a Python recipe to implement the…
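The "keep only the latest 90 rows" step described above can be sketched in plain pandas. This is a minimal illustration, not a Dataiku-specific solution: the helper name `append_and_trim` and the columns `day` and `value` are hypothetical, rows are assumed to arrive in chronological order, and reading from and writing back to the actual datasets is left out. The output dataset would presumably need to be overwritten rather than appended to for the trimming to take effect.

```python
import pandas as pd

MAX_ROWS = 90  # number of most recent rows to keep


def append_and_trim(history: pd.DataFrame, new_row: pd.DataFrame,
                    max_rows: int = MAX_ROWS) -> pd.DataFrame:
    """Append new_row to history and keep only the last max_rows rows.

    Assumes rows are stored oldest first, so the oldest rows are the
    ones dropped from the top of the frame.
    """
    combined = pd.concat([history, new_row], ignore_index=True)
    return combined.tail(max_rows).reset_index(drop=True)


# Hypothetical data: 90 existing rows plus one newly arrived row.
history = pd.DataFrame({"day": range(90), "value": range(90)})
new_row = pd.DataFrame({"day": [90], "value": [900]})

trimmed = append_and_trim(history, new_row)
print(len(trimmed), trimmed["day"].iloc[0], trimmed["day"].iloc[-1])
```

If the rows are not guaranteed to be in order, sorting on a timestamp column before calling `tail` would be needed.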
-
Assistance Needed: Error Encountered in Running Scored Recipe with "Output Explanations" Option
Hi, we are currently facing an issue while attempting to run a scored recipe with the "Output Explanations" checkbox selected. The error message is captured in the attached screenshot. We have examined the input dataset but have been unable to pinpoint the root cause of the issue. Interestingly, the job completes successfully…
-
Two recipes that output to the same dataset
Hi there, I understand that we cannot use the output of one recipe as an output of another recipe. Is there a workaround for this? My situation is as follows: one of the recipes would have a condition such that, if necessary, it would need to update an output of another recipe with the datasets. The datasets of the other recipes…
-
Pandas version 2 timeline
Hi, is there any timeline for when Dataiku will support Pandas v2? Thanks
-
Repartition discrete partitions on DSS engine failed, but manual input worked, suggestions?
Hello team, I am trying to redispatch a discrete partition through the sync recipe, using the process described in this documentation. I have tried running on the DSS engine. However, on my end the recipe fails with the "Job process died (killed - maybe out of memory ?)" error. The Spark, Hive, and/or Impala engines have…
-
Create a Dataiku dataframe from an API call
Hi all, I am using the API Designer, in which I created an API that takes some datasets from the Flow along with the API response fields, performs some calculations, and stores the result in a pandas DataFrame. This DataFrame is created every time the API is called with a new set of responses. I want to store…
-
Decision Tree and Random Forest Model Settings and Optimization
I am in a course that teaches Dataiku as an add-on curriculum feature. I would like to know more about techniques to improve model performance for a classification problem using decision trees and random forests. Also, I see that we are only able to see the test results when running the models. Is there a way to see…
-
Path issues while creating a Keras ImageDataGenerator.flow_from_directory for image classification.
Hello everyone. I am trying to train a simple Cat vs Dog image classification model using Keras in Dataiku DSS. However, I am having difficulty constructing the path for flow_from_directory(). Before we get started, here is the structure of the training data I am using. "training_dataset" is present in the…