-
Use first row as column headers/column names
I have an excel file whose format is somewhat different. I need to skip first 9 lines and use 10th line as column names. I have tried using "parse next line as column headers" but it is not working for me. Has anyone faced this type of error. Please let me know how to resolve it. I am pasting input file format below:
-
Outer Join
I am trying to convert a HIVE SQL that includes a left outer join. I don't see that in a JOIN recipe. How do I add it? Operating system used: Windows
-
Is there a repository where I can report issues?
Operating system used: macOS Operating system used: macOS
-
How does the evaluation store threshold actually work?
In the documentation for the evaluation store, when doing a two-class (binary) classification, there is a slider for the threshold used. The documentation for this threshold reads in part: When doing binary classification, most models don’t output a single binary answer, but instead a continuous “score of being positive”.…
-
How to Automate Clustering with Anomaly Detection for Each Partition in Dataiku?
Hello Dataiku Community, I’m working on a project where I’ve partitioned my dataset by category and year. For example, my partitions look like this: Category A | 2021 Category A | 2022 Category A | 2023 Category A | 2024 Category B | 2021 Category B | 2022 Category B | 2023 Category B | 2024 Category C | 2021 Category C |…
-
How to Handle Missing Data for Seasonal Analysis in Dataiku?
Hi Dataiku Community, I’m working on a dataset containing daily commodity prices over multiple years (2021-2024). However, there are significant gaps in the data, which are affecting my ability to analyze trends. Here are the details: Daily Data The dataset records daily prices for various commodities, but many days are…
-
Issue with Rendering Images in HTML from Temporary Folder path in Dataiku
Hi everyone, I am facing issue while trying to render images in a HTML file that i have generated from a word document using mammoth library in Python in dataiku. Here's what i have done so far: a) Extracted images from the word document and saved them in to a managed folder in dataiku named "images" as i didn't want to go…
-
Use Case : sync data
example table a most updated data. data b has not been updated. but examples there are data inconsistencies. suppose table a has 25 data. now table b only has 15 data: 1. table A 1 - 10 suppose it has the same id as in table b. but table b data has not been updated even though the id is the same. 2. Table A 11 - 25 the…
-
403 Forbidden on Jupyter notebooks after updating from 13.0.0 to 13.0.3
Hello, I'm using a custom installed dataiku, on debian 11, free license (with advanced features trial). I'm getting "403 Forbbiden" when opening jupyter notebooks after updating from 13.0.0 to 13.0.3. I've noticed that I don't get them when connecting to the Dataiku instance directly from my home network, only when doing…
-
Can we use a Kubernetes cluster with the free edition of Dataiku?
Hi, Can we use a Kubernetes cluster with the free edition of Dataiku? Let's say we have a Linux VM in AWS or Azure environment where we have deployed the free edition of DSS Ver 12 or later. Is it possible to use a Kubernetes cluster in this environment to reduce model training time? Thank you. Taka Operating system used:…