Hi all, I have a question regarding custom Python models for a clustering modelling task. I am trying to do something really basic, like running Agglomerative Clustering using a different metric and l…
With category handling set to Dummy encoding and the option to drop a dummy enabled, Dataiku seems to use the level with the least exposure/volume as the dummy. Q1. Is there a way to set this dummy manually instead of Da…
I have an ongoing project in production that I intend to replace with another project currently in development. As part of this transition, I find myself comparing a dataset that has undergone scoring…
Hey there, I am having trouble training new models on an existing project. If I either update an existing recipe or deploy the newly trained model in a new visual tool in the Flow, whenever I try to…
Hello Dataiku Community, I'm looking for guidance on how to set up a batch inference API for a machine learning model. Specifically, I want to create an API endpoint that can take a batch of data and …
Is this training error, rather than validation (or test) set error? Because the graphs exhibit very little evidence of overfitting, even when model complexity is maximized according to either hyperpar…
I am using the Random Forest algorithm, and I am trying to tune its hyperparameters using Grid Search. Grid Search picks the largest hyperparameter values every time. I am sharing pictures of my probl…
Hi all, when I create a managed folder, its ID is generated as a random number. Is this ID fixed once it is created, or can it change automatically? Operating system used: CentOS
I'm trying to use a custom callback: callbacks = [EarlyStopping(monitor='val_accuracy', min_delta=1e-3, patience=5, mode='max', restore_best_weights=True, verbose=1)]. It's running OK, but on the dashboard not…
Hi, I built a Gaussian Process regressor as a custom model in a Visual Analysis, and deployed it to the Flow and an API. For each model prediction, I need the predicted mean and standard deviation to …