Using Dataiku

Update/replace dataset
Hello, I'm a beginner in Dataiku. I have built a few flows and done some trainings. Now I'm stuck on a problem: I built a flow (prepare, filter, group, etc.) and now I've got some updated data. I would like to replace the very first dataset with my new data. The number of columns is the same, but some column names have changed. Is it possible? I know…
How to concat multiple columns by index?
I have 2 datasets with different column names but the same length. Dataset 1 has columns A and B, and its length is 10. Dataset 2 has columns C and D, and its length is 10. I want to get Dataset 3 by concatenating Datasets 1 and 2, so that its columns are A, B, C, D and its length is 10. I have tried join and stack, but I can't get what I expect. Operating system used:…
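What is described here is a positional (side-by-side) concatenation: row i of Dataset 1 is glued to row i of Dataset 2. A minimal sketch in plain Python, for example inside a Python recipe, assuming both datasets are already loaded in the same row order. The helper name `concat_columns_by_position` and the sample values are made up for illustration.

```python
def concat_columns_by_position(rows_1, rows_2):
    """Glue two equally long lists of row dicts side by side, matching rows by position."""
    if len(rows_1) != len(rows_2):
        raise ValueError("both datasets must have the same number of rows")
    if rows_1:
        # Refuse to silently overwrite a column that exists on both sides.
        overlap = set(rows_1[0]) & set(rows_2[0])
        if overlap:
            raise ValueError(f"column names clash: {sorted(overlap)}")
    # zip pairs row i of the first dataset with row i of the second.
    return [{**left, **right} for left, right in zip(rows_1, rows_2)]


# Hypothetical sample data: columns A, B and C, D, 10 rows each.
dataset_1 = [{"A": i, "B": i * 10} for i in range(10)]
dataset_2 = [{"C": f"c{i}", "D": i % 2 == 0} for i in range(10)]

dataset_3 = concat_columns_by_position(dataset_1, dataset_2)
print(list(dataset_3[0]))  # ['A', 'B', 'C', 'D']
print(len(dataset_3))      # 10
```

With pandas DataFrames the same idea is usually written as `pd.concat([df1.reset_index(drop=True), df2.reset_index(drop=True)], axis=1)`. Resetting the index matters because `pd.concat` with `axis=1` aligns rows on index labels rather than on position.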
PROTECT SCHEMA OF POSTGRES TABLES
Hello, When I launch recipes, it appears that output datasets connected to Postgres tables erase all parameters of those tables, including schema, constraints, foreign keys, etc. I would imagine there is a way to manage conflicts between the existing table and the Dataiku output. Is there any possibility to protect these…
Failed to create the tutorial - Advanced Designer Exam
Dear Dataiku community, I started my Advanced Designer exam with a mistake in the project, and to be able to answer the question correctly, I deleted the project from Dataiku. An error message came up, but when I opened the homepage, the project was gone. I then tried to add the project again from DSS Tutorials but I am…
recipe zone manipulation
I would like to create a file (from Python) in a specific zone. The problem is that I could not find any way to read the zone attribute of an existing file, let alone change or set that attribute to a different zone, from Python.
Exporting a geographic chart
Hello, I can't manage to export a map from Dataiku to Excel. I have a lot of stores (points on the map) that form sectors, and the goal is to see whether any sectors intersect or overlap and, if so, exactly where. Apart from taking screenshots, which look very ugly, I have not…
Reusing few-shot classification prompts in prompt studio?
Hi Dataiku'ers, Playing around with the few-shot classification prompts in prompt studio using OpenAI GPT, it looks like the entire prompt is sent to the LLM for each row of data to be classified. With zero-shot labels that might make sense, but for few-shot classification prompts that include example texts, definitions…
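To make the observation above concrete, here is a minimal sketch of what per-row few-shot prompting looks like in general. It is not Dataiku's implementation: the function `build_few_shot_prompt`, the prompt layout, and the sample texts are all hypothetical. It only illustrates that the instructions and examples are repeated in every request, one per row.

```python
def build_few_shot_prompt(instructions, examples, text):
    """Assemble one complete prompt: instructions, labeled examples, then the text to classify."""
    lines = [instructions, ""]
    for example_text, label in examples:
        lines.append(f"Text: {example_text}")
        lines.append(f"Label: {label}")
        lines.append("")
    lines.append(f"Text: {text}")
    lines.append("Label:")
    return "\n".join(lines)


# Hypothetical instructions, examples, and rows.
instructions = "Classify each text as positive or negative."
examples = [
    ("I love this product", "positive"),
    ("This was a waste of money", "negative"),
]
rows = ["Works as described", "Broke after two days", "Would buy again"]

# One full prompt per row: the examples are included every time.
prompts = [build_few_shot_prompt(instructions, examples, row) for row in rows]

for prompt in prompts:
    print(len(prompt), "characters sent for one row")
```

Because each row is classified by an independent request, the shared part of the prompt is sent again with every row in this sketch.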
Insert data into Hive table for every periodic execution of recipe to maintain timeseries of result
Use case: A complicated, data-intensive quantitative model is executed every month. The result of each periodic execution should be saved in a Hive table for reporting. The model is complicated and seems suitable for a PySpark recipe. It produces multiple result sets, and each result set should be saved in its respective…
How to re-run a failed job from automation monitoring
Hi, After looking around what feels like everywhere, I can't find the "correct" way to re-run a failed job from the automation monitoring. If I open a failed run and go to the failed job, there is a "Retry this job" button. This re-runs the job, but it won't show up as a run in the automation monitoring so you…
Run python Recipe with Scenario
Hi, I have a Python recipe that takes two datasets as inputs and produces a dataset as output. Now I want to run this recipe with a scenario, every day at a specific time. How can I run this recipe? Thanks

Trending Discussions

Docs for "pandasutils"?
Hello, My apologies if this is a remedial question, but at the start of every Python recipe the boilerplate code includes this import: from dataiku import pandasutils as pdu. Is there documentation for pandasutils? Is it a package that can be used in Python recipes? I've tried looking through the Dataiku Developer Guide,…
Run a Time Series Forecasting Model
I get the following error message: Failed to train : <class 'ImportError'> : libcuda.so.1: cannot open shared object file: No such file or directory. Operating system used: 13.1.4
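The ImportError means the dynamic loader could not open libcuda.so.1, the CUDA driver library that ships with the NVIDIA driver, in the environment where the training ran. A minimal diagnostic sketch, assuming it is run from a notebook or recipe in the same environment as the training job. The helper name `can_load_shared_library` is made up; the check only reports whether the library can be loaded and does not fix anything.

```python
import ctypes


def can_load_shared_library(name):
    """Return True if the dynamic loader can open the named shared library, else False."""
    try:
        ctypes.CDLL(name)
    except OSError:
        # Raised when the library is missing or cannot be opened.
        return False
    return True


cuda_available = can_load_shared_library("libcuda.so.1")
print("libcuda.so.1 loadable:", cuda_available)
```

If this prints False, the environment has no usable NVIDIA driver library on its loader path, which matches the error above.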
Identifying the Node Type in a DSS Notebook using Python
In Python, in a DSS notebook, I want to know if the code is running in the design node or the automation node. How can I do that?
