-
Use first row as column headers/column names
I have an excel file whose format is somewhat different. I need to skip first 9 lines and use 10th line as column names. I have tried using "parse next line as column headers" but it is not working for me. Has anyone faced this type of error. Please let me know how to resolve it. I am pasting input file format below:
-
Issue with get_connection in Python Endpoint for Snowflake – API Node
Hi all, I'm currently working on a Python endpoint in Dataiku API Node, where I need to dynamically generate and execute SQL queries against a Snowflake connection. These SQL queries are generated via OpenAI, so I cannot use a predefined SQL endpoint (as those require fixed queries). I followed the official Dataiku…
-
WARNING: Retrying (Retry(total=1, connect=None, read=None, redirect=None, status=None)) after connec
In my Dataiku UI, I created two Python environments under Code Envs: One for Python 3.12 One for Python 3.9 Each environment installs the Python libraries listed in “packages to install” by retrieving them from repositories hosted on an Artifactory. I imported the Artifactory certificate into the keystore of my Dataiku VM…
-
How to get information about jobs' CPU and ressource usage?
Hello, i need to make a repport about all the jobs and all the scenarios running in dataiku and i was wondering if there is a command that will allow me to have informations about the ressources used by every one of them. And if not is there another way to gather these informations?
-
Outer Join
I am trying to convert a HIVE SQL that includes a left outer join. I don't see that in a JOIN recipe. How do I add it? Operating system used: Windows
-
Image Rebuild Error
Hi, Our instance are running with DSS version as 13.4.0. I tried rebuilding the image to remove lower versions of python like 2.7,3.6,3.7,3.8 using the below command. Unfortunately its failing at below stage. ./bin/dssadmin build-base-image --type container-exec --without-py27 --without-py37 --without-py38 --with-py39…
-
How to Automate Clustering with Anomaly Detection for Each Partition in Dataiku?
Hello Dataiku Community, I’m working on a project where I’ve partitioned my dataset by category and year. For example, my partitions look like this: Category A | 2021 Category A | 2022 Category A | 2023 Category A | 2024 Category B | 2021 Category B | 2022 Category B | 2023 Category B | 2024 Category C | 2021 Category C |…
-
How to Handle Missing Data for Seasonal Analysis in Dataiku?
Hi Dataiku Community, I’m working on a dataset containing daily commodity prices over multiple years (2021-2024). However, there are significant gaps in the data, which are affecting my ability to analyze trends. Here are the details: Daily Data The dataset records daily prices for various commodities, but many days are…
-
Use Case : sync data
example table a most updated data. data b has not been updated. but examples there are data inconsistencies. suppose table a has 25 data. now table b only has 15 data: 1. table A 1 - 10 suppose it has the same id as in table b. but table b data has not been updated even though the id is the same. 2. Table A 11 - 25 the…
-
Identify Product ICC using Attribute Values
I am new to Dataiku, how can i identify/predict ICC of a product using attribute values based on raw data using Dataiku AI/ML Capabilities