-
Using the Dataiku DSS Python API for Interfacing with SQL Databases - Watch on Demand
In most companies, SQL databases are a primary source of data for data science projects. The seamless access to a broad range of SQL databases is a key feature of Dataiku DSS. DSS builds on this capability by providing a Python API for interfacing with SQL tables. This functionality is a boon for Data Scientists who use…
-
Threshold optimization for what if? fuction in AutoMLprediction
I used "what if?" function in AutoMLprediction to do simulation, and I would like to change the threshold DSS automatically used. In the userguide, there is below description. But I could not find any button/icon to change a threshold. Please let me know how to change it. Prediction settings — Dataiku DSS 13 documentation…
-
How to Automate Clustering with Anomaly Detection for Each Partition in Dataiku?
Hello Dataiku Community, I’m working on a project where I’ve partitioned my dataset by category and year. For example, my partitions look like this: Category A | 2021 Category A | 2022 Category A | 2023 Category A | 2024 Category B | 2021 Category B | 2022 Category B | 2023 Category B | 2024 Category C | 2021 Category C |…
-
How to Handle Missing Data for Seasonal Analysis in Dataiku?
Hi Dataiku Community, I’m working on a dataset containing daily commodity prices over multiple years (2021-2024). However, there are significant gaps in the data, which are affecting my ability to analyze trends. Here are the details: Daily Data The dataset records daily prices for various commodities, but many days are…
-
Dataiku users from Romania
Are there any other community members from Romania? PS: > also started this thread as a log for tracing personal progress. >Day one-13.01.2025 - Installed application x Data Preparation Quick Start 8 of 8 lessons completed (100%)
-
Can we use multiple data sources/tables to create knowledge bank for a RAG model?
-
Issue with Rendering Images in HTML from Temporary Folder path in Dataiku
Hi everyone, I am facing issue while trying to render images in a HTML file that i have generated from a word document using mammoth library in Python in dataiku. Here's what i have done so far: a) Extracted images from the word document and saved them in to a managed folder in dataiku named "images" as i didn't want to go…
-
Windows recipe
Hello, I am currently working on a window recipe (to add a row number), and I would like this row number to be in the first column without using a Prepare recipe exclusively for that. Do you think it is possible ? Thank you !
-
How to accumulate numbers from largest to smallest
As sample data shows, the data has customer ID column and revenue by customer column, and I would like to accumulate the revenue from largest revenue to smallest revenue to identify which customers are in top80% market and which customers are in bottom 20%. How can I calculate like orange-colored column in DSS?
-
Use Case : sync data
example table a most updated data. data b has not been updated. but examples there are data inconsistencies. suppose table a has 25 data. now table b only has 15 data: 1. table A 1 - 10 suppose it has the same id as in table b. but table b data has not been updated even though the id is the same. 2. Table A 11 - 25 the…
-
Admin Academy
Hi, i want ask about become a admin. i saw my friend have a admin learning path