-
Interaction Terms In OLS
Hello, I used the feature generation feature in AUTO ML predictive model and I noticed that my core variables were repeated with "computed" in parenthesis with different regression coefficient for them. I was wondering if there was any documentation on how dataiku handles interactions for feature generation or anyone could…
-
Delete the nth row
Hello, I would like to delete the nth row of my output dataset, yet I don't know how to do that..
-
Parsing xml file, does not handle XPaths to Context to be after the Root element XPath
In the above example Root element XPath = /ORDERS/ORDER/ORD_DETAIL_set/ORD_DETAIL XPaths to context = /ORDERS/ORDER/ORD_NUM/text() → ORD_NUM I have the result ORD_NUM not aligned with the ORD_DETAIL_set but one row lagging because I believe it is expecting XPaths to context to be before Root element XPath. Is there…
-
Checks with custom error/warning messages
Is it possible to create a custom error message for a check? I've created some checks for important datasets but find the default error messaging of "Checks on the output produced 1 error" a bit lacking as this does not give any information on what's wrong to users who are not as versed in Dataiku or programming in…
-
I have an issue using Chroma as a vector store.
When attempting to setup a connection with Chroma I get an error regarding needing sqlite3 >= 3.35.0. vectorstore = Chroma( collection_name="full_documents", embedding_function=OpenAIEmbeddings() ) I get the following error: RuntimeError: Your system has an unsupported version of sqlite3. Chroma requires sqlite3 >=…
-
Project Level API keys when deploying bundles
Hi Dataiku Team, I am trying to create a project level API key which can be used to execute scenarios in that single project from an external application. I created the API ID and secret in the project security in the Design Instance ( or Dev) This secret works and the external application is able to run the scenario on…
-
Retrain the Partition Model
Hi, Initially i have trained the model for partition dataset and deployed the partitioned model (Partitioned on CITY column) . with model Id = 'XYZ' The requirement is to re train the partitioned model for all the partitions (The Number of partitions changes every time the dataset is reloaded) and activate the latest…
-
How to get regression coefficients of each model build on a partitioned dataset with python code?
Hi , I have build and deployed liner regression model on a partitioned data, so there is a regression model for each partition. I want to get the regression coefficients of each of those model with a python code or recipe (basically to automate, I do not want to download the coefficients manually). Does anyone has any idea…
-
How to create a hierarchy
Hello, I would like some help to create a hierarchy. Indeed, I have some issues to create one. I have a a raw file such the way explained on the raw excel data (colored) with parents and child IDs and Names and I would like to be able to get 3 tables : One with all the parent and child name, one with all their ID and one…
-
Saving an image in the box folder connected to datiku
I have a screenshot function as part of my Rshiny app. Currently, I'm saving the screenshot as follows in a datiku managed folder. This code is working fine, and I'm able to save the image as a PNG file screenshot_folder <- dkuManagedFolderPath("SCREENSHOT_FOLDER") screenshot_path <- paste0(screenshot_folder,"/")…