Using Dataiku

Export monthly files
I want to export files 9 months back from today's date, but always the 1st of that month. I have this working but when the month changes, it doesn't automatically include November for example when doing testing. Is there a way to automate this so I don't have to manually change the time period?
"NumberFormatException: For input string" in scenario with integer partitioned dataset
I have a non-partitioned D1 dataset. The first column "dt_partition" will be used to partition the next dataset. dt_partition is of type integer, representing the month (for example 202409). It only contains one value at a time, so the data will go into a single partition. My database is Snowflake. At the output of D1,…
Run a Time Series Forecasting Model
I get the following error message Error message: Failed to train : <class 'ImportError'> : libcuda.so.1: cannot open shared object file: No such file or directory Operating system used: 13.1.4
Docs for "pandasutils"?
Hello, My apologies if this is a remedial question, but at the start of every Python recipe the boilerplate code includes an import of: from dataiku import pandasutils as pdu Is there documentation for pandasutils? Is it a package that can be used in Python recipes? I've tried looking through the Dataiku Developer Guide,…
Issues about text in learning path
Yesterday I could see text and video in the Core course. Todday I can only see video but not th text. Could you help me ?
RAG LLM for multiple datasets
Greetings, While working with the embedding recipe, we faced a limitation where we have two datasets, we want to apply the rag on, how can we apply the knowledge bank on them specifically? Regards
automation of insights export to pdf from notebook
I have a Dataiku notebook in a flow that creates various visualizations. After the visualizations are generated, I manually publish and export them as a PDF. I would like to automate this entire process, from generating the visualizations to publishing and exporting the PDF. Could someone guide me on how to automate this…
Connect Databricks Catalog from Dataiku API Designer
I have created a connector to query databricks catalogs from dataiku. this works fine when i test the same inside a python notebook. but from an API designer, this is not working and asking for project key. from dataiku import SQLExecutor2 executor = SQLExecutor2(connection="NAME") sql_query = f"""select * from…
How to make TF-IDF vectorization on a textual column?
Hello, I am searching how to make a TF-IDF vectorization on a textual column in order to combine it after with a cosine similarity. Thanks.
Using the dataiku API to reverse the design of a ML Task
lo, For a specific case we would like to use the Dataiku API to revert the design of a visual analysis to a design which was used for a specific model. E.g. we have three sessions, the last trained session (session 3) had a different design then session 1. I would like to revert the design back to session 1 and this is…

Trending Discussions

Embedding a link to download a dataset in scenario email report?
Hi, Our workflows take a user-defined data payload and create N datasets. Is it possible to add a hyperlink with the dataset name as the text and the link being the URL to download the dataset as a CSV file? thx Operating system used: Windows 10
Getting the below error when trying to ping of the SFTP folders in the dataiku instance:
Getting the below error when trying to ping of the SFTP folders in the dataiku instance: SSH error connecting to ftp1.str.com, caused by: JSchAlgoNegoFailException: Algorithm negotiation fail: algorithmName="server_host_key"…
"CASE types double precision and character varying cannot be matched" error in prepare recipe
I get the above error in prepare recipe, not trying to match anything.

Leaderboard

Turribeach 3583

tgb417 2477

Ignacio_Toledo 1079