-
Merge/Group rows based on metrics
I am trying to merge/group rows based on a metric range. Present format and desired format below.
-
Disabling "Update output schemas" in a Python scenario
Hi, how do I disable the option that updates output schemas in a Python scenario? I can't find the option in the API reference. It would be the equivalent of the option circled below. Thanks. Operating system used: Windows
-
Ability to package environment/local variables with an API service
It would be very helpful if Dataiku allowed for packaging variables (either environment or local variables) with the capability to remap local variables as part of the deployment. Ideally there would also be an option to encrypt a variable. We have several API services that connect to other systems and require environment…
-
Integrating Dataiku with Denodo
Has anyone created an integration with Denodo? If so, did you use a JDBC connection? How did you manage user permissions? Thanks! Operating system used: Red Hat
-
Snowflake connection works fine from Dataiku, but fetching the table preview throws an error
Failed to read data from table Failed to read data from table, caused by: SnowflakeSQLException: JDBC driver internal error: exception creating result java.lang.NoClassDefFoundError: Could not initialize class net.snowflake.client.jdbc.internal.apache.arrow.memory.RootAllocator at…
-
How do I send an email to the user based on a condition on count of records in a dataset?
Hi, after reading the documentation I came across the "Compute metrics" step in scenarios, but how do I retrieve the count of the dataset using ${stepOutput_the_metrics}? Then, if the count is more than 0, I want to trigger an email to the user. I am on DSS version 13. Any help is appreciated. Thanks. Operating system…
-
"Training failed" Error in Machine Learning Quick Start Project
I just installed Dataiku 13.3.3 (Free Edition) on a MacBook Pro (MacOS 15.3) using Chrome as a browser. I am going through the Machine Learning Quick Start project following instructions in Dataiku Academy. I am getting a "Training Failed" error when training the model with the error message below. I suspect the issue has…
-
How to make LLM Mesh work with LiteLLM
Hi, I'm working on an agentic Gen AI project using the crewai package, which uses LiteLLM as the engine to connect to various Gen AI models. I would like to use the Dataiku LLM Mesh, but it seems that it's not compatible with LiteLLM. I tried to use DKULLM and DKUChatLLM, but neither of them works. I'm on…
-
Offer a Dataiku templating engine based on Python Mako or Jinja
Hi, Python-based templating engines like Jinja and Mako allow users to 'print' text in various formats, using conditional logic such as if-else statements and for loops. I think Dataiku should offer an off-the-shelf Python-based templating engine that would allow users to upload their template(s) and pass a `context dict` to…
-
How do I preserve Chinese text during a CSV to Dataiku load?
I'm using Dataiku version 13.1. I have a text dataset with around 2,400 rows; mostly it's in English, but around 100 rows contain Chinese characters. My data is in CSV format. I need to perform a GenAI task on my dataset and load it back to CSV. Chinese characters are being converted to English characters while loading data from CSV…
-
Data not writing to dataset in Python recipe
I created a Python recipe that gets data from an API. I save the response to a pandas dataframe, but when I use Dataset.write_from_dataframe to write the dataframe to my output dataset, the rows are written but empty when I check the dnb_duns_detail_json dataset. Operating system used: MacOS
-
Best Match for Fuzzy Matching
Hi, I am new to DataIku and trying to find areas of overlap in 2 datasets using fuzzy matching. Is there a way to get a numerical ranking for how close matches are, so I can identify the highest matches and remove duplicate suggestions if needed? Thanks,
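One stdlib way to get a numerical ranking for how close matches are (outside any recipe) is difflib's similarity ratio; this is only a sketch, and the names below are made up for illustration:

```python
from difflib import SequenceMatcher

def similarity(a: str, b: str) -> float:
    """Return a 0-1 similarity ratio between two strings (case-insensitive)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()

# Hypothetical candidate pairs, ranked by closeness so the highest
# matches surface first and near-duplicates can be reviewed.
pairs = [("Acme Corp", "ACME Corporation"), ("Globex", "Initech")]
for a, b in sorted(pairs, key=lambda p: similarity(*p), reverse=True):
    print(f"{a!r} vs {b!r}: {similarity(a, b):.2f}")
```

The same ratio can be used as a cutoff to drop weak suggestions.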
-
Error in python process: <class 'ValueError'>: Numeric feature score_2 is empty
Trying to run a Recommendation system, I received this error not allowing to run the job. Received this error message, checked the database and all fields are with information. [01:16:18] [INFO] [dku.utils] - *************** Recipe code failed **************[01:16:18] [INFO] [dku.utils] - Begin Python stack[01:16:18]…
-
Issue with Python script appending data in Dataiku project
Hello, I have an issue with my Dataiku project. I wrote a Python script that appends new data from the input dataset to the output dataset. I think the problem may be related to recursion in Dataiku. Could you please suggest a solution? Thank you in advance!
-
Issues with the Google Drive plugin: file not appearing in the Managed Folder in Dataiku
Hello everyone! I’m using the Google Drive plugin in Dataiku to read and write files directly from my Google Drive account. I managed to set it up, and it works correctly when creating datasets from Drive. However, I’m having trouble creating a Managed Folder in Dataiku that points to a file in Drive. Here’s what I do:…
-
Import from PowerBI to Dataiku
Hi community! I have a strange question. Is there any way to import data from a Power BI workspace into Dataiku? I can do it the other way around, exporting a dataset into a workspace, but I was wondering if there is an obvious plugin I am missing. Many thanks :) *Also, are there any limitations? i.e. Gen2 Dataflow only, semantic…
-
How to write the R Markdown report created in Dataiku to a managed folder?
-
How to dynamically name rmarkdown reports in scenarios?
I'm working on a project that requires me to send an R Markdown report to a Box folder via email daily. However, the emailed file has the same name each day, leading to overwrites and versioning issues (e.g., v1, v2, v3). I'd like to automate this process by adding the current date to the filename of the emailed report.…
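A minimal sketch of the date-stamping part, assuming the filename is assembled in a scenario Python step before mailing; the base name and extension are placeholders:

```python
from datetime import date

def dated_filename(base: str, ext: str = "pdf") -> str:
    """Append today's date so each day's emailed report gets a unique name."""
    return f"{base}_{date.today():%Y-%m-%d}.{ext}"

print(dated_filename("daily_report"))  # e.g. daily_report_2025-01-31.pdf
```

The resulting name can then be used when copying or attaching the rendered report.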
-
How to append dataframe in existing output dataset
Hello experts, in Dataiku v12.3.0 I was trying to append a dataframe using write_dataframe() to an existing dataset (with the same schema), but it always overwrites with the last dataframe even though the dataset spec is configured with: dataset.spec_item["appendMode"] = True. The dataset is classified as an output, so it doesn't let me…
-
C compiler error while running python module 'pymc'
Hi, I am using the pymc module to develop an MMM model in Dataiku. While working on it, I am experiencing this somewhat well-known C compiler error but couldn't find any solution on Dataiku yet. Code where the error occurred: import warnings import arviz as az import matplotlib.pyplot as plt import numpy as np import pandas…
-
How do I use training and prediction dataset together in Dataiku
Hi, I'm using Dataiku version 13.1. I need to do text prediction using BERT, for which I have a training dataset. After training, I need to score a prediction dataset. I'm doing BERT in a Python code recipe. Can you suggest the steps to score the prediction dataset?
-
Changing the default font size for all new charts
I often send screenshots of the charts to my stakeholders over Teams chat to get quick confirmations/alignment on the intermediary results. A feedback I heard often is that they are having a hard time reading the legend, axis titles, values on axis etc. I don't want to manually change the font size for all these different…
-
How can I use websockets in a Webapp?
I would like to make a custom Webapp where the Python backend talks to the frontend and vice versa. I see that the Dataiku Answers webapp uses websockets and I would like to do the same. My current attempts using Flask-SocketIO did not work unfortunately, as it seems to use Werkzeug under the hood and I cannot start the…
-
How to load a mlflow model in a script to make predictions
I trained and deployed a model using MLflow in Dataiku. I want to make predictions on a test dataset using this deployed model. However, I don't want to use the "Predict" visual recipe. Instead, I want to load the model in a script and make predictions, but I am not able to do it. Operating system used: Linux
-
Run Scenario after multiple scenarios have completed
Hi - I am trying to create a scenario that will auto-trigger once other time-based scenarios (in other projects) have completed. I think this is possible for 1 scenario using "Trigger after scenario" which automatically checks the status of a scenario at the frequency you set but I can't figure out how to do this using…
-
How to Sync Current Date from Greenplum to Oracle with Only the Date Part?
I have added a current_date column to my table in Greenplum using a Prepare recipe (with now() in Formula language). I want to sync this column to an Oracle database, but I need to keep only the date part of the value. For example, I want to convert a value like 2025-01-25T21:50:28.102Z into 2025-01-25 and store it as a…
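As a sketch of the conversion itself, the time component can be dropped before syncing; the format string below assumes the ISO-8601 form shown in the question, and a date-formatting step in a Prepare recipe can achieve the same:

```python
from datetime import datetime

def date_part(ts: str) -> str:
    """Drop the time component of an ISO-8601 timestamp like 2025-01-25T21:50:28.102Z."""
    return datetime.strptime(ts, "%Y-%m-%dT%H:%M:%S.%fZ").strftime("%Y-%m-%d")

print(date_part("2025-01-25T21:50:28.102Z"))  # 2025-01-25
```

On the Oracle side, storing the result in a DATE column (rather than TIMESTAMP) keeps only the day.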
-
Split / Join formula
Hi, is it possible in a Split recipe (and only a Split recipe, not an SQL one) to check a condition with a formula via a join? For example, to check that one of the child lines a of a parent b contains a certain value. If so, put all the corresponding lines a->b* (of which at least one b satisfies the condition) in the split. Best…
-
Folding multiple columns by MULTIPLE patterns
Hello, I have a large number of columns that all follow the same pattern, with 3 parts that can change: Path#_Field_Year So either the Path #, Field, or Year could vary. For example, I could have: 1_Car_2030 1_Car_2031 1_Car_2032 2_Car_2030 2_Car_2031 2_Car_2032 Or I could have: 2_Bus_2040 2_Bus_2041 2_Bus_2042 5_Bus_2030…
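Before folding, it can help to verify that every column name really decomposes into the three varying parts; a small regex sketch, where the pattern is an assumption based on the examples above:

```python
import re

# Assumed shape: numeric path, alphanumeric field, 4-digit year,
# joined by underscores, e.g. "1_Car_2030".
COLUMN_RE = re.compile(r"^(?P<path>\d+)_(?P<field>[^_]+)_(?P<year>\d{4})$")

def parse_column(name: str):
    """Split a 'Path_Field_Year' column name into its three parts, or None."""
    m = COLUMN_RE.match(name)
    return m.groupdict() if m else None

for col in ["1_Car_2030", "2_Bus_2041", "not_a_match"]:
    print(col, "->", parse_column(col))
```

The same capture groups can serve as the multiple patterns in a fold-with-regex step.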
-
I want Better Dataset Visualization in Dataiku Dashboard
I need to leverage the Dataiku Dashboard to visualize a dataset with a better look. This is what I want (set different colors for value criteria / bold font / dataframe multi-index support / etc.); this is the current visualization of the dataset (quite plain). I am a code designer and can write Python; by the way, is there any solution? Operating…
-
Combining data sources from Blob Storage
Hello, I have a pretty simple problem, but somehow I'm not able to solve it. I'm using Dataiku to do ETL stuff and I need to pull the data from Blob Storage. Usually I just specify a path, and this is the start of my flow. But in this case, I need to create the first source dataset from multiple blob "folders" that are in a…
-
Beginner Help: Deploying an API Service with Pickle Model from Jupyter Notebook in Dataiku
Hello Dear Community, I am a complete beginner in Dataiku and have created a Jupyter Notebook as a mini test model. I used Pickle to save the model and vectorizer into a managed folder named "Models". My goal is to make this model available as an API service, but I’m struggling with the process and would greatly appreciate…
-
I can't get this formula to calculate the difference between two dates to work
Here is the formula: if(diff(DateDeSortie_MAJ2,val("DateEntréeDIR2S_MAJ"))>=0,"",DateDeSortie_MAJ2) Thanks for your help
-
update DSS license - improve the documentation
Dataiku's documentation says there are 3 ways to update a DSS license but then only lists 2: """ Copy your new license file into DATA_DIR/config/license.json (needs restart of DSS) Log into Dataiku DSS, click the “Administration” gear, then click “Enter license” and enter the contents of the license file. """ I suppose the…
-
Container configuration mapping in bundle deployments
This request is to add mapping options for container configuration in bundle deployments. This would allow for repointing in the event that the container configurations are not named the same in the design node vs automation nodes.
-
Per-user credentials in LLM connections
This request is to add support for per-user credentials in LLM connections. We use OpenAI and set up API keys per project so that we can track spend and budgets at the project level. Currently we have to set up a separate OpenAI connection for each project but ideally we would be able to pass the API key in either through…
-
Selenium issues with Chrome.exe and Chromedriver.exe
I get this error: Chrome binary found at: C:\\Program Files\\Google\\Chrome\\Application\\chrome.exe. Execute permission for C:\\Program Files\\Google\\Chrome\\Application\\chrome.exe is granted. Failed to launch Chrome binary at C:\\Program Files\\Google\\Chrome\\Application\\chrome.exe: /bin/sh: C:\Program…
-
Bug - CTRL+Z Erases Text on Dataiku
I accidentally pressed ctrl + Z after opening a Wiki article and it caused all the text to be removed. I'm on DSS 8.0.2. This also happens within SQL queries and the Library Editor. Is this intended behavior? Here are the steps to reproduce with the Wiki: 1. Create Wiki article with some text and save 2. Navigate away from…
-
How to chart time series longer than 20 data points?
I'm creating a dashboard for reporting abnormal data series, looking at the past 90 days of sales. I want to create a line chart of the past 90 days of sales to monitor spikes and abnormalities, but the line chart only shows about 20 data points and lumps all the rest into an "other" category. Is this the default setting of chart…
-
How to get Sigmoid Platt scaling parameters used in Dataiku visual analysis Probability Calibration?
Hi Team, in the current MLOps pipeline which we are using for model development, we often need to re-calibrate the propensity so that we have correct scaling. For this we are currently using Platt scaling, which can be switched on or off from the portal, as can be seen here. However, for this feature we…
-
Spark installation in standalone mode in DSS
Hi, our Dataiku instance is not connected to a Hadoop or Kubernetes cluster. Can I still install Spark on the local DSS server?
-
How do I get created and modified dates for an artifact in the Dataiku Govern API?
I can see this information in the timeline in the UI, but doesn't seem to be in the data returned by: ar = client.get_artifact('ar.26421') print(ar.get_definition().get_raw()) Operating system used: Windows
-
Threshold optimization for the "what if?" function in AutoML prediction
I used the "what if?" function in AutoML prediction to run a simulation, and I would like to change the threshold DSS automatically used. The user guide has the description below, but I could not find any button/icon to change the threshold. Please let me know how to change it. Prediction settings — Dataiku DSS 13 documentation…
-
Collaborative use of code studio template
We have recently started working with Code Studio templates in order to create Streamlit webapps on Dataiku. We quickly realized that while each (non-admin) user can create their own Code Studio instance from a specific Code Studio template without any issues, it is not possible for multiple (non-admin) users to have…
-
Using large context for a Gen AI prompt
Hi, I'm trying to create a prompt to ask questions to an LLM and get an answer based on 5,000 reviews for a product. I know there are ways to classify or perform sentiment analysis, but what I want to do is ask an LLM a question about the whole bunch of reviews. I tried using RAG, but it is my understanding that this…
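One common workaround is map-reduce summarization: split the reviews into chunks that fit the context window, summarize each chunk with the LLM, then ask the final question over the combined summaries. A rough sketch of just the chunking step, where the character budget is a made-up stand-in for a real token budget:

```python
def chunk_reviews(reviews, max_chars=8000):
    """Greedily pack reviews into chunks under a rough character budget."""
    chunks, current, size = [], [], 0
    for r in reviews:
        if current and size + len(r) > max_chars:
            chunks.append(current)  # close the full chunk, start a new one
            current, size = [], 0
        current.append(r)
        size += len(r)
    if current:
        chunks.append(current)
    return chunks

# Invented sample data: 300 short reviews packed into small chunks.
reviews = [f"review {i}: " + "x" * 100 for i in range(300)]
print(len(chunk_reviews(reviews, max_chars=2000)), "chunks")
```

Each chunk then gets its own prompt, and a final prompt reasons over the per-chunk answers.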
-
Trigger on scenarios
Hello, I want to build a dataset using an append mode at the end of each month, triggered by a scenario. For this, I’ve created a custom SQL trigger that checks if the essential tables are updated by returning 0 or 1. However, this SQL code takes around 15 minutes to execute. To optimize this process, I’d like the trigger…
-
Performance issue using 'Group by' and 'Join' with CSV file
Hi, I have migrated from SAS to Dataiku but am having some performance issues. Is there a way to solve the performance issues when using Dataiku's 'Group by' and 'Join' recipes with CSV files?
-
Collating evaluation stores.
Hi, I have 6 different models, and for those models I have 6 different evaluation stores. Is there a way to collate these 6 evaluation stores into a single evaluation store? Operating system used: Windows
-
I need to find a string in the text and truncate the text until this string
Operating system used: Windows
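Assuming "truncate until this string" means keeping the text before the first occurrence of the search string, a minimal Python sketch (a Prepare-recipe formula could do the same):

```python
def truncate_before(text: str, marker: str) -> str:
    """Return the text up to (but not including) the first occurrence of
    marker; if the marker is absent, return the text unchanged."""
    head, sep, _tail = text.partition(marker)
    return head if sep else text

print(truncate_before("keep this STOP drop this", "STOP"))  # 'keep this '
```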
-
Does the Dataiku API Designer support uploading files?
https://community.dataiku.com/discussion/12910/how-to-upload-a-file-through-api-designer asked the same question back in 2021: "You are correct that API services will only accept the query parameters as JSON objects, so uploading a file using multipart/form-data is not possible at the moment." I'm not sure whether, now in 2025, any…
-
How to install TEXT PREPARATION PLUGIN
Dear Community Friends, can anybody please help me solve the problem "How to install the Text Preparation plugin step by step?", including installing dependency modules, the Profibus compiler, and the regex module in the Python environment. I am using Windows 10 with Dataiku 13 Free Edition. I have repeatedly failed to install TP. When…
-
Initial install - Connecting this DSS instance to a AKS cluster
Failed to start cluster : <class 'azure.core.exceptions.HttpResponseError'> : (AuthorizationFailed) The client 'xxxx-xxx-xxx-xx' with object id 'xxxx-xxx-xxx-xx' does not have authorization to perform action 'Microsoft.ContainerService/managedClusters/read' over scope…
-
How to Visualize Anomalies in Dataiku Similar to This Chart?
Hi everyone, I'm trying to create a chart in Dataiku to visualize anomalies in my dataset, similar to the example below: The chart highlights three categories: Good Anomaly (blue dots) Expected values (gray dots) Bad Anomaly (red dots) I have the following columns in my dataset: Date/Time (for the x-axis) Sales (for the…
-
List all connection calls
Hi guys, is there any way to list all calls made to a Dataiku connection? For example, I have a connection pg-teste-op and I need to know how many times it was used, that is, how many datasets use this connection, with the highest possible level of detail, such as which job used the connection.
-
[Samsung Fire & Marine] Action is needed to prevent logins from sessions logged in from other IPs
If a session logged in from an IP address called A is tampered with by a user logged in from an IP address called B through the developer tool in IE Edge, the user information will be changed. This needs to be improved as it risks allowing regular users to escalate their privileges to administrator status and manipulate…
-
[Samsung Fire & Marine] Need to improve the performance of Join and Group recipes
Samsung Fire & Marine Insurance has been using a statistical analysis tool called SAS for the past several years. This time I'm trying to replace SAS with Dataiku, but there is a major obstacle: the performance of the Join and Group recipes. In our tests, when performing a join on about 10 million…
-
Ollama on DSS
Pardon my ignorance on the subject, but is there a way to connect DSS to a model like Llama 2 or Mistral running on Ollama locally or on another k8s cluster? Operating system used: SUSE 15
-
How to Automate Clustering with Anomaly Detection for Each Partition in Dataiku?
Hello Dataiku Community, I’m working on a project where I’ve partitioned my dataset by category and year. For example, my partitions look like this: Category A | 2021 Category A | 2022 Category A | 2023 Category A | 2024 Category B | 2021 Category B | 2022 Category B | 2023 Category B | 2024 Category C | 2021 Category C |…
-
How to Handle Missing Data for Seasonal Analysis in Dataiku?
Hi Dataiku Community, I’m working on a dataset containing daily commodity prices over multiple years (2021-2024). However, there are significant gaps in the data, which are affecting my ability to analyze trends. Here are the details: Daily Data The dataset records daily prices for various commodities, but many days are…
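One hedged approach to the gaps is to expand the sparse series onto a full daily calendar and carry the last known price forward before any seasonal analysis; a stdlib sketch with invented sample values:

```python
from datetime import date, timedelta

def forward_fill_daily(prices, start, end):
    """Expand sparse daily prices (ISO date -> price) to a full calendar,
    carrying the last known price forward across gaps."""
    filled, last = {}, None
    d = start
    while d <= end:
        key = d.isoformat()
        last = prices.get(key, last)  # keep previous price when the day is missing
        if last is not None:
            filled[key] = last
        d += timedelta(days=1)
    return filled

sparse = {"2024-01-01": 10.0, "2024-01-04": 12.0}
print(forward_fill_daily(sparse, date(2024, 1, 1), date(2024, 1, 5)))
```

Whether forward-fill, interpolation, or aggregation to weekly/monthly is appropriate depends on how the gaps arose.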
-
Running the script in analysis using the API
I have multiple dataiku models deployed in the flow. I want to refresh the model periodically using the API. The currently deployed model has custom features defined inside the analysis section of the lab. When refreshing the model, I want to copy these steps in the deployed model to the current model trial, create the…
-
How can I change the default location where a .conf file is created to any custom location?
Hello Community, I am using the Great Expectations package in a DSS project for my data quality checks. I have already installed it in my code env and am using it in a Python script for the time being. Even though the package runs properly in a Python notebook, when I save it back to the recipe I get the following error: "Job…
-
Joining the files in 'Join With' recipes
Dear All, is there any possibility to join more than 2 files at a time? Or does the application itself behave like this: join the 1st and 2nd files first, then join the result with the 3rd file? Even the Venn diagram in the Join option shows only 2 circles, not 3. Operating system used: DATAIKU
-
Dropdown list of Presets
Hello All, I want to create a plugin connector. The plugin will have two types of presets for simplicity will be A and B. the connector.json I implemented is this: "params": [ { "name": "preset_type", "label": "Preset Type", "type": "SELECT", "selectChoices": [ { "value": "A", "label": "A" }, { "value": "B", "label": "B" }…
-
Automatic refresh or reload of source file from local /shared drive path in pc
I need to use Dataiku as an ETL tool. We have uploaded our source data in Excel format using the 'Upload your files' option. The issue is that whenever the file gets replaced in the root path of our local/shared drive folder on the PC, Dataiku does not pick up the updated file automatically. If there is any way to resolve this or another…
-
Generate encrypted passwords manually
I'm trying to set up automatic configuration in my dataiku instance. I can do that through overwriting the files in "dss_dir/config". One issue I'm currently having is how to generate encrypted passwords, for example in the connections file we have: This is "password"…
-
Dataiku password security
Does Dataiku use AES-256 encryption to store third-party passwords? Is there any way to configure these algorithms? Also, how frequently are the encryption keys rotated?
-
Extend the "Rebuild Code Studio templates" option to non-admins when updating a code environment
I was pleasantly surprised to discover the "Rebuild Code Studio templates" option in the "Containerized Execution" settings of a code environment. This feature enables the rebuilding of Code Studio templates that rely on a given code environment, effectively killing two birds with one stone. However, after investigating…
-
Launchpad
Hello, I can't get into Launchpad, please help me.
-
Dataiku users from Romania
Are there any other community members from Romania? PS: I also started this thread as a log for tracking personal progress. Day one, 13.01.2025: installed the application; Data Preparation Quick Start, 8 of 8 lessons completed (100%).
-
delete vs drop
Hi, I am currently working on a project in which I have to collect data; it should be new and have the same schema as the previous data, but without keeping the past data. How can I do this? At first I thought the delete option would work for me, but in the end it didn't do what I was looking for. Thanks in advance. Operating…
-
Got the error "API Key not provided" when call to python API endpoint
I have created a Python function API endpoint in the API Designer; when I run the test query, it works well. After that, I created an API key for this API service, but when I call the public API endpoint via Postman (I found the guide here: https://doc.dataiku.com/dss/latest/apinode/first-service-apideployer.html): URL:…
-
Can we use multiple data sources/tables to create knowledge bank for a RAG model?
-
My dashboard is not working; it shows a 404 error
-
Evaluate LLM recipe
I want to evaluate an LLM like in the picture above, but while the tutorial shows a recipe called Evaluate LLM, I can't see that recipe. How do I evaluate the LLM?
-
Updating models in scenarios
Hello, I set up a model with DSS machine learning modules. In my scenario, I've made a special step to run the model training queries. Will this step re-train the model and modify the model coefficients and metrics of the model or not?
-
Correcting Typos - Text Preparation Plugin?
Hi everyone, Context: I have data from a survey. One question in the survey is multiple choice with predefined answers, but there is no data validation built into the survey. As a result, I have typos in the data. For example, a column "Genre" can include "Rcok, Clasisc, Jaz". Question: Is there a smart/quick way to…
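A quick stdlib option is to snap each answer to the closest predefined choice with difflib; this is only a sketch, and the choice list and cutoff below are assumptions for illustration:

```python
from difflib import get_close_matches

# Assumed list of valid predefined answers.
VALID = ["Rock", "Classic", "Jazz", "Pop"]

def fix_typo(value: str, choices=VALID, cutoff=0.6):
    """Snap a misspelled answer to its closest predefined choice;
    return the original value when nothing is close enough."""
    match = get_close_matches(value, choices, n=1, cutoff=cutoff)
    return match[0] if match else value

print([fix_typo(v) for v in ["Rcok", "Clasisc", "Jaz"]])  # ['Rock', 'Classic', 'Jazz']
```

Tuning the cutoff trades false corrections against missed ones, so spot-checking the mapping on real data is worthwhile.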
-
Error when using Embed recipe
Hi Team, When I am trying to embed a dataset in Dataiku with textual data, I encounter this error: Error in Python process: <class 'Exception'>: At least one embedding request failed: HTTP code: 429, response: { "statusCode": 429, "message": "Token limit is exceeded. Try again in 48 seconds." } Could you guys please help…
-
K-Modes supported
Hi, I noticed that Dataiku supports k-means clustering but couldn't find support for k-modes. Am I missing some documentation? If not, are there any plans to support k-modes clustering? Thanks. Operating system used: Windows 10
-
Questions on quick modeling prediction
I have questions on the quick modeling part of Dataiku. I am completing an assignment, but I find that the column in my labeled data used to calculate the cost does not appear in the unlabeled data. This problem means I'm unable to predict on the unlabeled data with the model I trained. I would like to ask how to…
-
What am I doing wrong with a simple date sort in the prepare recipe?
I have dates that are not parsed. The data starts with a group of dates from 2015-01-01 to 2024-12-01 and keeps repeating over and over for each categorical variable. I try to sort the date as-is, as a string, and nothing happens. I parse the date (and get a bunch of minute/second junk at the end, which I'll never want, ever…
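The underlying issue can be reproduced outside DSS: unparsed date strings sort lexicographically, which is only correct for zero-padded ISO formats. A small sketch with assumed MM/DD/YYYY strings:

```python
from datetime import datetime

raw = ["02/01/2024", "11/15/2015", "06/30/2020"]  # assumed MM/DD/YYYY strings

# String sort puts 2024 before 2015, because "02" < "11" lexicographically.
print(sorted(raw))
# Sorting on the parsed value gives true chronological order.
print(sorted(raw, key=lambda s: datetime.strptime(s, "%m/%d/%Y")))
```

This is why parsing first (then reformatting to drop the unwanted time component) is the reliable route.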
-
Error on Extra Tree model in DSS 12
Hi everyone! I'm getting the following error trying to run the XT model. How can I fix it?
-
SAP HANA with Dataiku
Hi everyone, How can I connect to SAP HANA in Dataiku?
-
Handling Empty or Missing Datasets Dynamically in Dataiku Python Recipes
Hi everyone, I'm working on a dynamic process in Dataiku where I pull data from Athena using SQL, then do an Athena unload, and use the resulting dataset as input for a PySpark recipe. The challenge is that for certain scenarios (e.g., specific therapeutic areas), the Athena query returns an empty dataset. So I want it…
-
illegalargumentexception: Example has 2 inputs but prompt has 1 input
Hi, I'm new to Dataiku. I'm using Dataiku 13.1. I have a dataset with an area name, and I have to find the country to which each area belongs. I have to do it using an AI prompt. My dataset is around 2,100 records. I want one country value for each region, so in my Prompt & LLM settings I'm providing the prompt "Find the country in…
-
How to auto-update the "Variable display" tile in the Application Designer
I am using project variables displayed in the "Variable display" tile. Even after updating the project variables, the tile does not display the updated values; I need to explicitly refresh the browser to see the latest values. How…
-
Improve scenario history: make it useful for comparing step changes
As we know, DSS has project version control as a built-in Git-based version control, and a kind of lite version of that for any recipe and editable object, known as "History", inside which we can check each commit and compare them easily. This seems to work well for any kind of recipe with the various…
-
How to get metadata from a managed folder in Dataiku?
I am trying to get the last modification date from the metadata of a managed folder in Dataiku, but it throws the error below: Exception: None: b"Failed to convert value of type 'java.lang.String' to required type 'boolean'; nested exception is java.lang.IllegalArgumentException: Invalid boolean value [/GSX Daily Pos.xlsx], caused…
-
Issue with Rendering Images in HTML from Temporary Folder path in Dataiku
Hi everyone, I am facing an issue while trying to render images in an HTML file that I generated from a Word document using the mammoth library in Python in Dataiku. Here's what I have done so far: a) Extracted images from the Word document and saved them into a managed folder in Dataiku named "images", as I didn't want to go…
-
Automate project deployments to Prod Instance
Hello all, I’m exploring ways to optimize and automate processes within Dataiku and have a few questions: Is there a way to automate the deployment of projects to Ops (operational environments) within Dataiku? If so, what are the best practices or tools/plugins to achieve this? What are some recommended approaches for…
-
Design node and API node have been powering up for half an hour
I am trying to open the Dataiku Launchpad. After turning them on, the design node and API node have been powering up for 45 minutes now. Is that a normal processing time, or is there some error? What is the solution? Operating system used: Windows
-
Can somebody help me in this regard?
Hi Dataiku Community, please provide a solution for this problem, which I have been facing for a long time. Operating system used: Windows 10
-
How do I import data from Dataiku to SAP Hana
Hi, I'm new to Dataiku. I'm using Dataiku version 13.1. SAP HANA integration is done with my Dataiku. I'm using an SAP HANA dataset for my ML use case. I need to import my ML output table into SAP HANA. Can you please provide the steps to import data from Dataiku to SAP HANA? Operating system used: Windows
-
Can we connect to SAP Hana using JDBC driver using Dataiku?
-
Writing to partitioned dataset using the writer
I'm trying to overwrite a table using data from another table (with the same schema). I keep running into the issue that both datasets are partitioned and the writer does not like that (same case with the copy_to function). Here is what I'm trying to do: ERROR:dataiku.core.dataset_write:Exception caught while…
-
Window recipe
Hello, I am currently working on a Window recipe (to add a row number), and I would like this row number to be in the first column without using a Prepare recipe exclusively for that. Do you think it is possible? Thank you!
-
How often does elapsed time on a job update?
On the list of currently running jobs in Dataiku, how frequently does the elapsed time of a running job update to reflect that more time has passed? Is it less frequent for long jobs?
-
How to accumulate numbers from largest to smallest
As the sample data shows, the data has a customer ID column and a revenue-by-customer column, and I would like to accumulate the revenue from largest to smallest to identify which customers are in the top 80% of the market and which are in the bottom 20%. How can I calculate something like the orange-colored column in DSS?
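A sketch of the cumulative-share logic (in DSS, a Window recipe with a cumulative sum plus a formula step can produce the same column); the sample figures below are invented:

```python
def label_top_share(revenues, threshold=0.8):
    """Sort (customer, revenue) pairs descending, accumulate revenue, and
    label each customer 'top80' until the running total passes the threshold."""
    total = sum(r for _, r in revenues)
    running, labels = 0.0, {}
    for cust, rev in sorted(revenues, key=lambda x: x[1], reverse=True):
        labels[cust] = "top80" if running < threshold * total else "bottom20"
        running += rev
    return labels

data = [("A", 500), ("B", 300), ("C", 150), ("D", 50)]
print(label_top_share(data))  # {'A': 'top80', 'B': 'top80', 'C': 'bottom20', 'D': 'bottom20'}
```

Here A and B together account for exactly 80% of the 1,000 total, so C and D fall into the bottom 20%.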
-
Configurable Timezone Display for Date Columns (Beyond UTC-only)
Current Situation Dataiku DSS has specific behaviors when handling time columns: When it recognizes time-related columns (e.g., date, timestamp_tz, or timestamp_ntz), it displays them as Date columns, rendering them in timestamp format (with both date and time components). A significant limitation is that Date columns…
-
PYSPARK_PYTHON environment variable issue in PySpark
Hi, I am facing the issue below for a PySpark recipe. Exception: Python in worker has different version 2.7 than that in driver 3.6; PySpark cannot run with different minor versions. Please check environment variables PYSPARK_PYTHON and PYSPARK_DRIVER_PYTHON are correctly set. I have set the environment variables using…