-
Run a Time Series Forecasting Model
I get the following error message Error message: Failed to train : <class 'ImportError'> : libcuda.so.1: cannot open shared object file: No such file or directory Operating system used: 13.1.4
-
Docs for "pandasutils"?
Hello, My apologies if this is a remedial question, but at the start of every Python recipe the boilerplate code includes an import of: from dataiku import pandasutils as pdu Is there documentation for pandasutils? Is it a package that can be used in Python recipes? I've tried looking through the Dataiku Developer Guide,…
-
Identifying the Node Type in a DSS Notebook using Python
In Python, in a DSS notebook, I want to know if the code is running in the design node or the automation node. How can I do that?
-
Issues about text in learning path
Yesterday I could see text and video in the Core course. Todday I can only see video but not th text. Could you help me ?
-
RAG LLM for multiple datasets
Greetings, While working with the embedding recipe, we faced a limitation where we have two datasets, we want to apply the rag on, how can we apply the knowledge bank on them specifically? Regards
-
automation of insights export to pdf from notebook
I have a Dataiku notebook in a flow that creates various visualizations. After the visualizations are generated, I manually publish and export them as a PDF. I would like to automate this entire process, from generating the visualizations to publishing and exporting the PDF. Could someone guide me on how to automate this…
-
Connect Databricks Catalog from Dataiku API Designer
I have created a connector to query databricks catalogs from dataiku. this works fine when i test the same inside a python notebook. but from an API designer, this is not working and asking for project key. from dataiku import SQLExecutor2 executor = SQLExecutor2(connection="NAME") sql_query = f"""select * from…
-
"NumberFormatException: For input string" in scenario with integer partitioned dataset
I have a non-partitioned D1 dataset. The first column "dt_partition" will be used to partition the next dataset. dt_partition is of type integer, representing the month (for example 202409). It only contains one value at a time, so the data will go into a single partition. My database is Snowflake. At the output of D1,…
-
How to make TF-IDF vectorization on a textual column?
Hello, I am searching how to make a TF-IDF vectorization on a textual column in order to combine it after with a cosine similarity. Thanks.
-
Using the dataiku API to reverse the design of a ML Task
lo, For a specific case we would like to use the Dataiku API to revert the design of a visual analysis to a design which was used for a specific model. E.g. we have three sessions, the last trained session (session 3) had a different design then session 1. I would like to revert the design back to session 1 and this is…