-
How to Automate Clustering with Anomaly Detection for Each Partition in Dataiku?
Hello Dataiku Community, I’m working on a project where I’ve partitioned my dataset by category and year. For example, my partitions look like this: Category A | 2021 Category A | 2022 Category A | 2023 Category A | 2024 Category B | 2021 Category B | 2022 Category B | 2023 Category B | 2024 Category C | 2021 Category C |…
-
How to Handle Missing Data for Seasonal Analysis in Dataiku?
Hi Dataiku Community, I’m working on a dataset containing daily commodity prices over multiple years (2021-2024). However, there are significant gaps in the data, which are affecting my ability to analyze trends. Here are the details: Daily Data The dataset records daily prices for various commodities, but many days are…
-
Can we use multiple data sources/tables to create knowledge bank for a RAG model?
-
Issue with Rendering Images in HTML from Temporary Folder path in Dataiku
Hi everyone, I am facing issue while trying to render images in a HTML file that i have generated from a word document using mammoth library in Python in dataiku. Here's what i have done so far: a) Extracted images from the word document and saved them in to a managed folder in dataiku named "images" as i didn't want to go…
-
Windows recipe
Hello, I am currently working on a window recipe (to add a row number), and I would like this row number to be in the first column without using a Prepare recipe exclusively for that. Do you think it is possible ? Thank you !
-
How to accumulate numbers from largest to smallest
As sample data shows, the data has customer ID column and revenue by customer column, and I would like to accumulate the revenue from largest revenue to smallest revenue to identify which customers are in top80% market and which customers are in bottom 20%. How can I calculate like orange-colored column in DSS?
-
Use Case : sync data
example table a most updated data. data b has not been updated. but examples there are data inconsistencies. suppose table a has 25 data. now table b only has 15 data: 1. table A 1 - 10 suppose it has the same id as in table b. but table b data has not been updated even though the id is the same. 2. Table A 11 - 25 the…
-
Admin Academy
Hi, i want ask about become a admin. i saw my friend have a admin learning path
-
how we can achieve Data Lineage in DataIku
How we can achieve table and column decencies in recipes.
-
How to load a pre-trained model into a codenv (Resources Directory) in a no-internet-access instance
Hi. I am looking for using some pretrained model (for example embeddings model) within my project. The DSS instance I am working on cannot access Internet. Still i was able to retrieve the models at some point…and now I want to re-use them. I was also able to upload the model in a managed folder and use it in a code recipe…