-
How to access specific check results in a scenario
I have a simple dataset containing 1 row and about 20 columns. Each column represents a Boolean value of whether or not a particular threshold has been breached. If the result is true for any of the 20 columns, then a message is sent to a designated MS Teams Channel that a breach has occurred. I would like to supplement…
-
Validation failed: Query failed: ERROR: column PR.proj_zz_prline does not exist
Hello community, I have a SQL query that works fine. It has 2 inputs, both Redshift tables. I have added 2 more columns to one of the tables. Now, when I try to add those columns to the query, I get a validation error saying they are not available in the table. In fact, they are. The schema is consistent.…
-
Partial Dependency plot of a lab model
Hi, I am working on building a prediction model in Dataiku. I am building models in the Lab, and I'm trying to build a dashboard that compares different versions of the model in the Lab. I'm accessing the different versions and comparing various aspects of each. I'm accessing the Lab model as follows.…
-
Error running Spark recipes on Kubernetes - Initial job has not accepted any resources
I use DSS v13 to push execution of visual recipes to containerized execution on a Kubernetes (k8s) cluster, using Spark as the execution engine. I pushed two images to the registry: dku-exec-base and dku-spark-base. However, when I run the recipe, it runs forever (creating and deleting pods in k8s). I found this line…
-
Best practice hosting hugging face LLMs as a service?
Hi all, generally speaking, what are the best ways to host, e.g., an instruct fine-tuned Falcon 7B model in Dataiku? Would it be building a code studio and using vLLM or something along those lines? Or is there a capability for this as part of the LLM Mesh? We'd like to host open source models that are instruct fine…
-
Interaction Terms In OLS
Hello, I used the feature generation option in an AutoML predictive model and noticed that my core variables were repeated with "computed" in parentheses, each with a different regression coefficient. I was wondering if there was any documentation on how Dataiku handles interactions for feature generation or anyone could…
-
Delete the nth row
Hello, I would like to delete the nth row of my output dataset, but I don't know how to do that.
-
Parsing XML file does not handle XPaths to context that come after the Root element XPath
In the above example: Root element XPath = /ORDERS/ORDER/ORD_DETAIL_set/ORD_DETAIL; XPaths to context = /ORDERS/ORDER/ORD_NUM/text() → ORD_NUM. The resulting ORD_NUM is not aligned with the ORD_DETAIL_set but lags by one row, I believe because it expects the XPaths to context to come before the Root element XPath. Is there…
-
Checks with custom error/warning messages
Is it possible to create a custom error message for a check? I've created some checks for important datasets but find the default error messaging of "Checks on the output produced 1 error" a bit lacking as this does not give any information on what's wrong to users who are not as versed in Dataiku or programming in…
-
I have an issue using Chroma as a vector store.
When attempting to set up a connection with Chroma, I get an error saying it needs sqlite3 >= 3.35.0. vectorstore = Chroma(collection_name="full_documents", embedding_function=OpenAIEmbeddings()) I get the following error: RuntimeError: Your system has an unsupported version of sqlite3. Chroma requires sqlite3 >=…