-
SQL Compilation for Date-Partitioned Datasets for recipes that require all Partitions
Hi, this is my first time working with partitions so please let me know if I'm making a mistake. I have partitioned various datasets by a date column (set partition timeframe to DAY) and have set things up so that my scheduled scenarios for initial recipes are only doing transformations on the latest few days worth of data…
-
regex that will remove everything between begin: and end:
Hi I've got dataset with regex patterns in one column and python code that uses it to make replacement. I need regex pattern that will remove everything between "begin:" and "end:" Tried to use "begin:[\s\S]*?end: " but it doesn't work. in between I can have more then 3000 chars including special ones. Thank you Ela…
-
Is there a way to get the current Scenario id?
I'm buiilding a macro step for an scenario in which i need to obtain the last successful run of that scenario. I've been able to get the las successful time, but I'm struggling to limit this to only the current scenario. any suggestions on how to tackle this?
-
Please, use Marimo Notebook as replacement Jupyter Notebook
This is not a question, but more like opinion and/or feature request. Marimo Notebook definitely would be a perfect match for DSS rather than dataiku, you don't need to convert python recipe to a jupyter notebook, back and forth when editing it, the notebook is really git friendly since it is literally a python script…
-
Not seeing partitions list in the explore dataset tab
Does anyone know why partitions that are being extracted still don't appear in the partition list under the Explore tab of the dataset? Operating system used: windows
-
Error when renewing license on Dataiku Govern
Hi, I've encounter an error while trying to renew license in Dataiku Govern. My current license is expiring in 30th July 2025. What do I need to do to resolve this? Preferably another way without upgrading the DSS. Thanks. Sean Operating system used: Windows
-
How to get information about jobs' CPU and ressource usage?
Hello, i need to make a repport about all the jobs and all the scenarios running in dataiku and i was wondering if there is a command that will allow me to have informations about the ressources used by every one of them. And if not is there another way to gather these informations?
-
DataIKU not recognizing locally installed python
I am trying to create a new Python Code Env in my locally installed instance of Dataiku. The only option is Python 3.7. All the other options (Python 3.8, 3.9, 3.10, 3.11 and 3.12) are listed as "Not available" in the dropdown. I have Python 3.10 and Python 3.12 locally installed, but somehow Dataiku is not recognizing it.…
-
Impossible to use Custom Python UI for an application
Hello, I'm doing an application with the application designer. I'm trying to retrieve values from a dataset and put it inside a SELECT json list. To do that, I want to use the custom Python UI with the do() method, but i'm unable to save it, and therefore to launch it. In fact, i'm unable to save the app if I write…
-
how to get all runs of scenarios an their informations?
For my internship i have a project where i have to create a dataset with all the scenarios and their runs, but when i try and import them the informations such as the last run, the progress percentage, number of runs type of triggers and stuff like that remains empty or are marked with a 0 . I hve the id and the name of…
-
Need help with efficient BigQuery to Snowflake data transfer in Dataiku
Hi there! I'm setting up a pipeline to move data from BigQuery to Snowflake and really need some advice from folks who've done this before. Right now I'm using the standard DSS engine but it's painfully slow for larger datasets - takes forever to process even moderate amounts of data. I'm not sure what route is actually…
-
how can I use the LLM api hosted in dataiku?
we have some models running on dataiku. But currently I can only find the python client. Even I get the rest apis, the response from these apis are so different from the officials api, like sonnet. This is not good if I want to integrate the api to other tools. Like, I would like to initegrate the API to vscode plugin or…
-
how to display a running.. while running a scenario by using webapp
how to display a running.. while running a scenario by using webapp and also how to refresh that webapp page after scenario run?
-
ModuleNotFoundError When Accessing Knowledge Bank via REST API SDK
Hi everyone, I'm running into an issue while trying to access a Dataiku Knowledge Bank from an external Python script using the Dataiku REST API client. When I execute my script, I get the following error:ModuleNotFoundError: No module named 'dataiku.core.vector_stores.lifecycle' Here are my environment details: Dataiku…
-
Trouble creating custom Python code environments
I am trying to create a new Python Code Env in my locally installed instance of Dataiku. The only non-conda based option is Python 3.7. All the other options (Python 3.5, 3.6, 3.8, 3.9, 3.10, 3.11) are listed as "Not available" in the dropdown. I have Python 3.10.11 locally installed, but somehow Dataiku is not recognizing…
-
How to prevent users from accessing filesystem files?
I am installing Dataiku on a baremetal ubuntu 22 OS. However I noticed that users can access the root filesystem quite easily by managed folders or jupyter notebooks for example. Is there a way to completely prevent users from accessing any filesystem related files? If so what is the most efficient way to do so? Operating…
-
How to create a code studio template programmatically
I have an airgapped environment for which I prepare my Dataiku VM beforehand by a cloud-init script. This means all setup is done automatically via python or dsscli/dssadmin cli commands. This VM already contains the container image for container execution and visual recipes so custom python jobs, jupyter notebooks, visual…
-
SSL Error in containerized visual recipes despite no SSL used
I have an error when running visual recipe jobs in containerized execution that I don't have when executing locally in DSS. This job has S3 input and output in http, the DSS instance is in HTTP. The "Test" button in Settings > Containerized Execution works properly. Here is the error: Failed to list files, caused by:…
-
how to add calculated filters in Charts ?
I want to embed calculated filters (if x >1 set display color to blue, if x > 1.15 set color to yellow etc…) in a chart. Let's say Bar chart for example. I know how to add static filters but is there a way to add filter based on a calculation. here in this chart , each color is a parameter. I want to change parameter color…
-
Problems to run Macro to download the dataiku tensorflow resnet model
Hello, Dataiku community I am trying to follow this tutorial but I have problems to download the dataiku tensorflow resnet model using macro. I select the option "Download pre-trained model" in "deeplearning image cpu" section and I get the error: Oops: an unexpected error occurred Cannot run program…
-
clickhouse support
Hello, is it planned to add clikhouse as a supported database, if not, is it possible to add it ?
-
Enhance Excel output for "Export to folder"
I would like to request an enhancement to the "Export to Folder" recipe when exporting datasets to Excel format. Specifically, it would be extremely helpful if the export could support: Freezing the header row Adding auto filters to the header These features are commonly used in Excel for better data readability and…
-
Ctrl + Enter to run a recipe
It would be great to be able to use the shortcut key combination Ctrl + Enter to run a recipe while in the recipe editor screen. This keyboard shortcut would be consistent with what you can do in both Jupyter Notebooks and in SQL Notebooks. I realize that there is a current keyboard shortcut for running a recipe (@ run)…
-
how can write def do(payload, config, plugin_config, inputs): code ?
to interactive with the UI of macros using "type": "SELECT" in "type": "OBJECT_LIST"? { "name": "test", "label": "test.", "type": "OBJECT_LIST", "itemLabel": "test", "subParams": [ { "name": "obj", "label": "Object", "type": "SELECT", "forceReloadOnValueChange": true, "getChoicesFromPython": true } ] }
-
bloomberg Data Analytics in Dku
Hello, I've Excel sheet having connectors to bloomberg, this typically uses Bloomberg formulas (like BDP, BDH, etc.) to pull data into Excel. Do you know if there is any chance that this is work with dataiku without/with using Excel? Anyone has worked on it? thanks
-
Significant Improvements
I work as a data scientist manager at Pfizer. I have some feedback for DSS. 1. Does not return to the exact location in the Flow - I have a dataset in zoneA and it is shared to another zoneB. - I explore the dataset in zoneB - When I come back to the Flow, the flow focuses on the dataset in zoneA instead. 2. Cannot compare…
-
Disappearing training log
I am training a deep learning model on a Dataiku Notebook but the training has disappeared and I don't why. The output of my training grow up as if the log was "write in white" Operating system used: Windows Operating system used: Windows
-
Code Studio - Not authorized: Not owner of the Code Studio
My friend has created a Code Studio. We are under the same Dataiku project. Are there any settings that needs to be configured in order for me to gain access to the Code Studio?
-
Using dataiku DSS, How to get back to Designer for starting a New Project? Thanks.
Pursuing the Developer Track on the free version of Studio. Mapping my progress along side navigationg the DSS is a bit challenging. I am copying the Lesson onto Note and using another screen to run the exercises. These are quite intuitive for the most part, but with a few hiccups like getting back to the Hope Screen or…
-
Dataiku to Greenplum: Performance Lag on Large Data Loads & Batch Read Control
Hello, During a Proof of Concept (PoC), we're experiencing performance degradation when loading 20 million rows of data with 500 columns into GPDB (Greenplum Database). We've observed in the Dataiku logs that it continuously reads data in batches of 2000 rows. We're looking for a setting to adjust this batch size. We've…
-
Running Hugging Face models in Dataiku
Hi, I am trying to run some hugging face models in Dataiku DSS. The task requires to use an on-premise server where I have some useful GPUs to run these hugging face models. The server has an NVIDIA H100 Tensor Core GPU and I want to know: If Dataiku DSS can be installed in that server, if there are no problems with…
-
How to show total in a stacked bar chart and order by total
Hi Dataiku experts, I have a chart to show actual spend and remains to spend. would like to show total value of each stacked bar and order the bars by total value. I have a workaround solution but visual is not perfect- I added a third dataset Lifetime Total and uncolor the bar, but the result is there is too much white…
-
Free Edition Registration Error
I have installed DSS on linux Centos 7 this morning. DSS was started without error. When access the DSS URL the first time I was asked to select license. At page 2- Select your license, I selected Free Edition and entered the registration info. When clicked "Next" button, I got this error: Network error An attempt to…
-
About Shapley calculations
Hi, we're facing big problems with Shapley calculation with a customer. I've found some useful documentation at https://doc.dataiku.com/dss/latest/machine-learning/supervised/explanations.html and a few messages in the community (https://community.dataiku.com/t5/Using-Dataiku/SHAP-Shapley-values-in-Dataiku/m-p/22241,…
-
Run python Recipe with Scenario
Hi, I have a Python recipe that takes two datasets as inputs and provides a dataset as output, now I want to run this recipe with a scenario and run it every day at a specific time. How can I run this recipe? Thanks
-
Neo4J Connect through SSO
Hi! Is there a way to connect to Neo4J using SSO? We are currently investigating the corresponding plugin but the sign-in is only available through a username-password pair. Thanks in advance for any help!
-
Question about the install path of Dataiku
Due to the space in my username(Zhao Guanghao)of my laptop, I can not run dataiku correctly. Meanwhile, the filename under the 'User' file in C disk can't be modified, once you rename it directly, the system can not identify the account and it will collapse. Besides, Dataiku can only be installed in the path 'C…
-
dataiku library messagesender Email CC
Hi All, I am trying to use the MessageSender to send some emails in custom recipes. I was hoping if there is a way to pass the CC email list in the send function Please do let me know if there any suggestions or alternatives from dataiku.core.message_sender import MessageSender s = MessageSender(channel_id='SMTP',…
-
Saving Vector Store as KB
I was wondering if there was any way of saving a FAISS vector store I create in a python notebook as a knowledge bank I can use later on? I created a vector store (see code below) which has summaries as the embedded objects, and the parent documents as the retrieved documents. I did this based on LangChain's…
-
Using date in DataIKU
Hi, Despite going through documentation multiple times, I still don't really understand how dates work in DSS. I'm importing dataset from a connection. Without turning on any of the options in Date & Time handling, this is how data looks like: It says that the data type is string, while in the database itself it is, in…
-
SOLVED. Cannot replicate GLM predictions
SOLVED. It was the offset - needed to take the natural log of it before calibrating. Hello, I built a model using the GLM Classification plugin. The AUC is ~0.8 so it's fitting my data well but when I implement the GLM formula manually into Tableau the predictions are far too low despite having the correct shape. The model…
-
how to turn on chart zoom in&out feature in dashboard
Hi community, When I plot chart, there is a nice feature of zoom in&out by date (in the bottom of the chart), as shown below. However, when I publish the chart into dashboard, it seems that this feature is dropped by dataiku (see below chart). Is there a way to turn on such timeline/date zoom in&out feature in dashboard…
-
How to extract rows flagged by a custom Python rule in the Data Quality tab ?
Hi everyone, I'm working with Dataiku DSS version 13.5, and I'm using the Data Quality tab on datasets to define validation rules. When I use standard rules (e.g., missing values, uniqueness, etc.), I can easily export the rows in error. However, when I define a custom Python rule, I can see the column status marked as…
-
Load Balancer IP Address
I'm working with my DKU platform (running in Azure) to configure my kubernetes cluster with the goal of ensuring my load balancer node runs with the same IP address every time - e.g. 10.1.1.100. My AKS plugin configuration has a node pool named API-LoadBalancer. It has one node and does not autoscale. The admin panel's…
-
how to define helper in python code in Dataiku project
In Dataiku project I've got python code - but I need also to declare additional python code that will be than used like library from HelperLibrary.library1.codev1 import testprocessing Is it possible? Operating system used: Windows Operating system used: Windows
-
Design Node Launch Failure
I have set up a Dataiku on AWS and configured it to automatically start/stop the Fleet Manager and design nodes using AWS functionality. Yesterday, both the EC2 instances and DSS started successfully. However, today, while the EC2 instances started successfully, the DSS failed to start. For reference, the version of…
-
Timeseries forecasting with GPU / cuda 11
Hello, I am now trying to train a model with timeseries forecast by using GPU. OS: Ubuntu 22.04 Installed with apt-get on OS: libcudnn9-cuda-11 cuda-toolkit-11-8 libnccl2 I then created a new python env : when i use that environment in the model, I can see at first that it's fine since it shows me my GPU card : but when I…
-
Error while importing project to DataIKU: java.lang.IllegalStateException
Hi All, I am trying to import a project to DataIKU but getting below error - "Oops: an unexpected error occurred java.lang.IllegalStateException: Expected BEGIN_OBJECT but was BEGIN_ARRAY at path $, caused by: IllegalStateException: Expected BEGIN_OBJECT but was BEGIN_ARRAY at path $ Please see our options for getting help…
-
Sync Recipe from Redshift to Oracle RDS
I am trying to replicate a table in Redshift to a table in Oracle RDS using a sync recipe. I am getting the correct number of records created in the target Oracle RDS table, but all of the fields are empty (null). Operating system used: Linux Operating system used: Linux
-
Sharepoint output mismatch
I have 3 sharepoint connections. The datasets writing onto the dataiku sharepoint folders, write correctly- but they do not write the files correctly on the sharepoint site. I have 3 separate files writing to the same place in sharepoint (3 different export to recipes and 3 different dataiku sharepoint folders with the…
-
Can I use python to read Excel files without openpyxl
I have no permission to install openpyxl in my organization, is there a way to read Excel files in python?
-
How to Run Daily-Parameterized Flows in Parallel Without Global Variable Conflicts in Dataiku?
Hello, This is a question about executing the same Flow in parallel. I have a Flow that is designed to process data on a daily basis, and several recipes within the Flow refer to a project-level global variable called Job_Date. Each daily run takes about 30 minutes, making it a long-running Flow. Now, I have a requirement…
-
Isochrones limitations
Hello, I tried to use Isochrones (geo-router plugin) to scout for potential customers around our sites, but it looks isochrones are limited. I ideally woudl have like to get isochrones of 3 hours aorund our sites, but already if i set 1h, I don't get any result. For isochrones of 45 mins, they get calculated within 15…
-
Calling APIs and licensing requirements
Hi all, We are building an API that could be called by many people running local apps and processes. I understand API keys are the way to go. The issue I'm having is that my security team wants separate keys for every user and process - and a process for managing those keys. Another problem is there is no guarantee that a…
-
S3 Connection Problems
Hello, I am having problems connecting to an S3 bucket. I get the following error: Access Denied (Service: Amazon S3; Status Code: 403; Error Code: AccessDenied; Request ID: ###############; S3 Extended Request ID: ###############################################) I have tried applying policy at both the IAM User and bucket…
-
Bug - CTRL+Z Erases Text on Dataiku
I accidentally pressed ctrl + Z after opening a Wiki article and it caused all the text to be removed. I'm on DSS 8.0.2. This also happens within SQL queries and the Library Editor. Is this intended behavior? Here are the steps to reproduce with the Wiki: 1. Create Wiki article with some text and save 2. Navigate away from…
-
Dataiku Outbound IP Address Control - Proxy or Otherwise?
My team works with a data supplier that requires a white-listing process for IP addresses. When we attempt to use Dataiku to fetch the data our IP address associated with the call occasionally changes. We have access to a proxy IP address system, and we have made requests through the proxy to the data supplier when we run…
-
Add option to support non-pandas dataframes (e.g. polars) in Python recipes
Hi, There are many pandas alternatives. One that is new and very fast is polars. Polars is built on Rust so it is memory safe and runs in parallel by design. I use polars in one of my recipes but have to convert it to pandas to write the dataset. thx
-
Dynamic rename of column headers
Hi I'm migrating workflows from alteryx to dataiku. I need to migrate a dynamic rename tool. What it does is take the values from the left dataset and assign it headers from the right dataset. Is there any way to do this in dataiku without using python? Operating system used: Windows
-
ModuleNotFoundError: No module named 'dataiku.langchain'
Hi, I was recently trying to install the dataiku api locally for some testing purposes but was met with some errors. Installation seemed to be working fine but the moment I tried to run this code:`from dataiku.langchain.llm import DKUChatModel` , it seemed to crash and be unable to find the dataiku.langchain module. What…
-
Use of global variables
hello i would like to optimise my current data flows and data traitement! we have to day a filtrer which is applied late in of the recipes 1/69 of specif zone in my project how can i use the date variables to remaine with only the current month and only the previous one?
-
File move from one folder to other in network
Each day, I want to check if a file exists in a network folder that ends with today's date in its filename. If the file is found, I want to move it to another network folder at a scheduled time. This kind of task is possible in Alteryx using a combination of directory tools and conditional logic. I'm looking to implement a…
-
Customize excel sheet name in the output
hello how do i Customize excel sheet name in the output using multisheet EXCEL reciepe ? i want a no code solution
-
Multiple Issuers in JWT authorization
Hello, Is it possible to define multiple Issuers in JWT authorization method for API ? Comma separated values does not work. Sincerely Fabrice Operating system used: Redhat Linux
-
Make Dataiku Managed Datasets Less Opinionated (aka stop dropping my tables)
After 11.4.0 (or earlier as we upgraded from 11.0.3), Dataiku not defaults to dropping and re-creating by default when using Dataset python APIs if for some reason the dataset schema and underlying table do not match. It will do this silently and pass jobs, where later we find out that we've lost our history in the base…
-
Renaming a dataset using Python API
Dear Community, I am trying to rename a dataset from a project using the python API using the rename method from the dataikuapi.dss.dataset.DSSDataset class (https://developer.dataiku.com/latest/api-reference/python/datasets.html#dataikuapi.dss.dataset.DSSDataset.rename) but I get an AttributeError: 'DSSDataset' object has…
-
Dataiku DSS start failed : Download failed: unable to get local issuer certificate
Dataiku DSS start failed : Download failed: unable to get local issuer certificate Dataiku is not starting in the Macos 12.3.1 , the installation file is downloaded again from the Dataiku website tried reinstall multiple times 2022-06-21 11:01:40.749 +0400] INFO: Getting latest Dataiku DSS info from…
-
Unable to install dataiku in mac oss
I keep getting the below error; i have tried the cacert.pem and tried with optons like export NODE_TLS_REJECT_UNAUTHORIZED=0 but no luck. is there any way i can download offline and install ? pls help. thanks. Detailed error message below. ================================== [2025-05-11 10:21:56.403 -0500] INFO: Starting…
-
My dataset gets wiped (seemingly) periodically
Hello, I've been working with Dataiku for a few months, learning all about it. I created a flow to forecast upcoming values, from historical data. The method I used to import this historical data into Dataiku was: I created a Google BigQuery dataset → I put the SQL query in it → Loaded all the data → Synced it into another…
-
How can I send some information to a microsoft teams channel?
I have prepared the work flow in teams and i have the url from the webhook (using this tutorial ) Now i will like to send some information from a dataset like text to a channel in microsoft teams . How can i do it ? Operating system used: windows 11
-
How to enable Story AI in Dataiku?
I'm trying to use the new Dataiku Stories feature introduced in version 13.4.0, specifically the Story AI capability. According to the official documentation below, the activation path should be: From the waffle menu of the Design node, navigate to Administration > Settings. In the left panel, scroll down to Other and…
-
Alteryx to DataIKU migration Accelerator
Hi, Has anyone used Alteryx to DataIKU Converter tool ( provided to DataIKU service partners only). I think tool is still evolving. Kindly share if there are any feedback pointers about this tool. Thanks, Operating system used: Windows
-
Can't get license
Hi, I have to take this ML cert as it is a requirement from my company, but I have tried refreshing it and tried disconnecting and connecting my network but there is still error as below, I hope I can get clarification for this. Operating system used: Windows
-
can not register to DSS , after first time of installation
I am facing this error: Network Error: An attempt to communicate with DSS failed. Please check your network connectivity. Operating system used: RHEL-8.9 Operating system used: RHEL-8.9
-
send attachment from dataiku to teams channel.
Looking for options to send attachment from dataiku to teams channel. Anyone setup this before?
-
Display updated dashboards on dataiku govern
Hi I would like to display dashboards on a govern custom page. I implemented custom charts using matplotlib on a python recipe in the design node. I managed to send these custom charts in the “insights” folder of the project. I can access these charts using a custom HTML page in the govern node. However, these charts are…
-
Using other python version
hello i want to use other python version my environment is on-premise and OS is Rocky 9.5 so default python version 3.9 Since python3.11 and python3.12 can be installed from the Rocky 9.5 dnf repo, I installed them with dnf install and restarted dss. Then go to code-env and select python3.11 or python3.12 and click the…
-
Parse Date
one column has the date format like "2022-11-04 07:44:32.634000+00:00". select "Parse date…" in prepare recipe and show "Aww ...Smart Date could not find any format matching your data. Please try a custom format". Open Formula editor panel, input "concat(datePart(START_DATE,"year"),"_",datePart(START_DATE,"month")), and…
-
Automatically Updating Release Date Filter in Charts
Hi Team, I have a workflow that automatically extracts data from the database, generates Vertical Bar Charts, and sends an email to the team. The chart setup includes: X-axis: Products Y-axis: Power Consumption (Watt) Filter: Release Date (Single Value Selection) Issue: The charts are successfully generated, but the…
-
Visibility condition for Sections within Application Designer
In Application Designer, I have below configuration. Section1 - Edit Project Variable tile (Dropdown with values as A or B) Section2 - Multiple tiles within this Section3 - Multiple tiles within this If Section1 Project Variable value is A, then only Section2 should be displayed. If Section1 Project Variable value is B,…
-
How can I define the messaging channel?
-
Application as a recipe(using application designer)
Hi, I am converting my Alteryx workflow to dataiku. I have used macro in Alteryx because there are set of operations which were repeating. So, I used macro for that. now I am using dataiku. Want to know how to use application as a recipe to avoid replicating same steps multiple times. is there a way to achieve this in…
-
Categorical columns get distorted after using recipes like join, pivot, or group
Categorical columns get distorted after using recipes like join, pivot, or group, even with clean data and unique IDs. Anyone else facing this? Operating system used: Windows
-
Issue Sending Dashboard Pages via "Send Email Recipe"
Hi Team, I am trying to send dashboard pages via email using the "Send Email Recipe," similar to how I send CSV and Excel files. However, while CSV and Excel attachments are working fine, the dashboard pages are not being sent through email. Could you please help me troubleshoot this issue? Any insights or solutions would…
-
Modify project variables through a hook in advanced govern node
Hi everyone, I am currently designing a custom blueprint with the blueprint designer on Dataiku Govern. I am configuring a hook to communicate with a project in the design node and I would like the hook to access and modify the global variables of the design node project (based on variables I have stored in the artifact…
-
Data lineage on a whole dataset
Hello, is it possible to use data lineage on a whole dataset rather than column by column ? Thanks in advance !
-
is there any recipe that will parse different types of datetime formats
for example: 1. 07/31/2025 8:00am 2. 31/07/2025 16:30
-
Dark Mode
Every developer needs a dark mode A dark theme for the flow, datasets, and recipe configs would go a long way toward making Dataiku fit into workflows that involve many other dark mode tools. Dataiku is definitely very bright when swapping from other tools which operate in dark mode. Extensions like Dark Reader do a pretty…
-
Advanced container settings for R code environments
Like the Python code environments can have dockerfiles defined to be applied when building code env we need the same for R. We find our self modifying the base DSS image to accommodate some features needed in an R code environment.
-
Longer Connection text box on New Snowflake dataset page as needed
Request for the text box for Connection on New Snowflake dataset page to get longer to fit the full connection text if the connection text is longer than the current text box length. Our organization has a standard prefix for connections based on division/team/project, so I have multiple connections with the same prefix…
-
data analysis
I am in the data collection and analysis phase of my project. I am not sure which data tools and analyses are recommended for my data. Can anyone have the expertise to provide hisher feedback? If yes, i will provide further details
-
How to set up Exporting PDF dashborad?
I have installed and used the functions in the document below on on-prem linux. reference : https://doc.dataiku.com/dss/latest/installation/custom/graphics-export.html However, I am curious about how to set up the node created through AWS in fleetmanager. Should I connect directly to the design node (ec2) with ssh and…
-
Dataiku CRU(Compute Resource usage) wiki
Hi I have a question The wiki of dataiku CRU has zones and datasets for llm usage, but the actual CRU does not. Is this gone? Operating system used: rocky Operating system used: rocky
-
How to Analyze Entire Datasets in Dataiku Instead of Samples
I’m working in Dataiku to create a usable dataset by combining internal data with public data. Currently, each dataset is loaded with only a sample of about 10,000 rows, and I used Visual Recipes to build the final dataset. However, it seems that the final output was also generated based only on the sample data. How can I…
-
Where to track manual changes in an editable dataset?
I created an editable dataset and selected the option "Keep track of manual changes". However, I'm not sure where I can actually review those changes. In the "History" tab, I only see some commits related to changes in the dataset, but not all. For example, when I add a row or edit a cell value, it doesn't always create a…
-
Column count dropping in dataiku
I have one dataset where I have 11 columns, then in the preparation recipe one extra column is created and then immediately in the next dataset the column count drops to 7. How come is this possible? The first dataset The preparation recipe The third dataset
-
Support Panel Webapps in DataIku
Panel is a powerful #python #mlapp framework build on top of Bokeh. A high-level app and dashboarding solution for Python — Panel 0.12.4 documentation (holoviz.org) Panel can be supported similarly to Bokeh. The Panel server is build on top of Bokeh, therefore it should be very easy to support. For example `bokeh serve` is…
-
Error while computing large number
I've got an error while computing a large number (ie somtehing like 10^18). The formula "MJ*1000000" retruns negative value in column "J" for some rows and the correct value for others… Does someone as an explanation and a solution ? Thanks very much in advance
-
Removing Incorrectly Registered Automation/API Node Infrastructure in Dataiku
Hi, I've mistakenly registered some automation node and API node infrastructure with incorrect information in our Dataiku design node. I've carefully reviewed the Web UI but haven't been able to find a menu option to remove these. Could someone please advise on how to remove this incorrectly registered infrastructure? Any…
-
where to find confusion matrix, ROC, Accuracy and other in standalone evaluation recipe results
Operating system used: windows