-
WARNING: Retrying (Retry(total=1, connect=None, read=None, redirect=None, status=None)) after connec
In my Dataiku UI, I created two Python environments under Code Envs: one for Python 3.12 and one for Python 3.9. Each environment installs the Python libraries listed in “packages to install” by retrieving them from repositories hosted on an Artifactory. I imported the Artifactory certificate into the keystore of my Dataiku VM…
-
VISUAL RECIPE
Hi, is there a way to exclude the column used to split the dataset in the Split recipe without using a Prepare recipe or a precomputed column?
-
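Customer inquiry from Shinhan Bank, Korea
Dataiku installed version: 14.0.2 OS: RHEL 8.10 1. VS Code Studio: among the features Dataiku provides, can the feature that deploys Visual Studio Code like a webapp also be used in a Docker environment? We installed and checked three versions (13.5.2, 13.5.4 and 14.0.2), but the logs show an attempt to start a Kubernetes pod, so it does not seem to run in a Docker environment. Regarding whether this is related to the version and whether it cannot be used without a Kubernetes cluster…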
-
Please use Marimo Notebook as a replacement for Jupyter Notebook
This is not a question, but more like opinion and/or feature request. Marimo Notebook definitely would be a perfect match for DSS rather than dataiku, you don't need to convert python recipe to a jupyter notebook, back and forth when editing it, the notebook is really git friendly since it is literally a python script…
-
How to retrieve the test dataset used in the trained model with Python?
Hello everyone, I am working on Dataiku, primarily using their API. I have trained my model and would like to retrieve the dataset that was used for testing via the API methods. Despite trying several methods, including get_train_info(), I am unable to obtain the test dataset. I don't want to export it; I just want to…
-
What is a window function in dataiku
-
Guidance Needed: Building a SQL Chatbot in Dataiku
Dear Dataiku Experts, I’m working on a Dataiku project where I want to enable natural language interaction with a production SQL database — essentially, a chatbot I call the “Production Analyst.” The idea is that a user can type queries in natural language, and the chatbot will interpret them to perform production…
-
Issue with get_connection in Python Endpoint for Snowflake – API Node
Hi all, I'm currently working on a Python endpoint in Dataiku API Node, where I need to dynamically generate and execute SQL queries against a Snowflake connection. These SQL queries are generated via OpenAI, so I cannot use a predefined SQL endpoint (as those require fixed queries). I followed the official Dataiku…
-
Advanced LLM Mesh add-on available on Free version?
I couldn't find the answer. I have a community version installed in my server. When I try to use the Agents tools, I get this Usage of Agents requires the Advanced LLM Mesh add-on. Please get in touch with your Dataiku Account Executive or Customer Success Manager to learn more about the Advanced LLM Mesh Is this only…
-
Custom Agent Dynamic SELECT Param
Hello Dataiku Team, I am creating a custom agent, and I am trying to add a python script for the SELECT parameter, but it is not working. Here is what I have in my agent.json: Here is how my tree looks like: And the compute_llm_choices.py is like this: Please, could you spot any reason why this is not working from the…
-
Change Auto-Typing to an off or on option with default “Off”
Would like to have the Auto-Typing setup as an option that can be turned off and on with the default being “Off”. This feature is changing my unit serial numbers (230836735F) to a Float (2.30836735E8) which causes me to lose records when joining on the unit serial numbers field in a following step. This will cause my…
-
Unknown error - Code 403 on a subset of recipes.
Hello, we've been experiencing issues on several recipes and notebooks. A few SQL recipes show "Unknown error - Code 403", and a webapp does the same, even though scenarios run perfectly fine. Whenever we edit a recipe and try to validate it, we get the same error. It happens for some recipes, not all of them. Operating…
-
How to get the list of items of default zone using python?
This is my code. 'Data Pool' is my default zone and it has more than 10 tables, but the code does not fetch the list of items for the default zone. It works for other zones, just not for this default zone. Operating system used: Windows
-
Is it appropriate to display a "public" web app from Dataiku on a retail website?
I have never really exposed Dataiku web apps in “Public” mode, i.e., open from the instance. Dataiku seems to make it clear that this is not very secure and that the service could be compromised if someone finds its URL. However, I need to expose a web app quickly on my website, so I'm wondering what the risks are of…
-
Paste list in interim table filter
I would like to be able to copy a list of data from excel and paste it in the interim table filter when using the "Is any of the strings" option instead of having to enter them one at a time. Helps in troubleshooting workflows when you are looking for multiple records.
-
Ability to resize section within a view
Can we get the ability to resize the sections within any of the views in DSS? I like having the info in the different section but a lot of times I wish I could shrink one down a little to get better view of another section within the same view. Attached are a couple of views but would be nice on all.
-
Marimo Notebooks Integration in DSS
I'd like to propose the integration of Marimo notebooks alongside the existing Jupyter notebooks in DSS. Marimo is an innovative notebook environment that addresses several limitations of traditional Jupyter notebooks while maintaining compatibility. Here are some key advantages of Marimo notebooks: Code quality : Marimo…
-
Is there a way to change bar width for a chart?
I feel like I'm going crazy trying to find something that seems like it should be a simple formatting option. I'm making a dashboard that has a lot of graphs like the one pictured, with lots of stacked bars. For some reason, these seem to be as skinny as possible and I can not for the life of me figure out how to change…
-
Make Bar Chart widths adjustable
Right now, there's no good way to adjust bar chart width. I think this should be a formatting option for charts, since it's really hard to make bar charts look nice when there's a lot of data.
-
error com.dataiku.dip.datasets.fs.FilesystemDatasetTestHandler are in unnamed module of loader 'app'
I get an error when trying to get dataset https://www2.census.gov/geo/tiger/TIGER2019/COUNTY/tl_2019_us_county.zip or ftp://ftp2.census.gov/geo/tiger/TIGER2019/COUNTY/tl_2019_us_county.zip from New HTTP dataset or New FTP dataset respectively. An error occurred class com.dataiku.dip.datasets.fs.HTTPDatasetTestHandler…
-
Prepare Recipe : Format date with custom format - Multiple columns
As of DSS v. 13.5, the "Format date with custom format" processor in the Prepare recipe does not allow applying the same format to multiple columns at a time. It would be very useful to perform this step across multiple columns, just like in the Parse date processor : Best regards,
-
Add Charts to dataikuapi
I'm working with Agentic workflows and stuff in Dataiku 14. One of the ideas I'm toying with is using Dataiku's API module to automatically make very basic Prepare recipes, for example. The next thing I planned on doing was looking at a dataset and creating charts in a similar way, but there's no way currently (that I know…
-
Show API deployment comments on "Last updates" tab
When you deploy an API or API version from the API Designer to the Local Deployer, you can enter a comment for this version. I would love to see this comment field represented on the new "Last updates" tab on the deployer for a service in order to have a quick overview on why each update to the service was made or what…
-
Data download
Is there a way to download the filtered dataset when it is queried via the “Explore” option with a filter applied through “Sample Settings”? Currently the download contains the entire dataset as is, not the one generated after applying the filter.
-
Incomplete and possibly incorrect documentation for cgroups, can't persist the settings at reboot
Hi, I'm following the Using cgroups for resource control guide, and the Making the settings persistent at reboot is missing instructions for legacy systemv-based init scripts. Our DSS version is now 14.0.2, but we originally installed something pre-v13. Our nodes don't have the /etc/dataiku/ directory, but have…
-
Using large context for a Gen AI prompt
Hi, I'm trying to create a prompt to ask questions to an LLM and get an answer based on 5,000 reviews for a product. I know there are ways to classify or perform sentiment analysis, but what I want to do is to ask an LLM a question about the whole bunch of reviews. I tried using RAG, but it is my understanding that this…
-
what is the instance Authorization level for getChoicesFromPython or shared Managed Folders?
I have a python recipe that is simply pulling from a managed folder from a different project and loading the files as choices into the plugin UI. Come to find out that other Data Scientist permission level users do not have access to select something from the dropdown. that being said I've looked into permissions, I am…
-
Can I use the R API to trigger a dataset build/job?
I want to create an R Shiny application to query data. The data exists in dataiku where a pipeline runs nightly. I would like to add a button to my app that would rebuild my data on an ad-hoc basis. Is this possible? In python it looks like this works. Is there an R equivalent? import dataiku project =…
-
Use pickle to create API service
I have a Python code recipe that already creates the pickle model and saves the pickle to filesytem_folder, and it is already in a scenario for a weekly pickle update. I then want to create an API that uses this pickle to classify. I use Custom prediction (Python) as the Endpoint type and set the Working folder (optional) to the…
-
Beginner Help: Deploying an API Service with Pickle Model from Jupyter Notebook in Dataiku
Hello Dear Community, I am a complete beginner in Dataiku and have created a Jupyter Notebook as a mini test model. I used Pickle to save the model and vectorizer into a managed folder named "Models". My goal is to make this model available as an API service, but I’m struggling with the process and would greatly appreciate…
-
Missing +Add In button
I'm a new user in the Excel to Dataiku Quick Start Tutorial. I've downloaded the Excel workbook. I'm on the Flow screen and the picture and screen look exactly the same with one exception - the + Add In blue button in the upper right side of the screen is missing. What do I need to do to have the button added to the screen…
-
How to Integrate an Exported Next.js App with Dataiku (v14.0.2)?
Hello everyone, I’m a junior developer and I’m having trouble integrating Next.js with Dataiku. I tried exporting my Next.js application to HTML using the output: export method, but I’m still encountering issues. I’m currently using Dataiku version 14.0.2. Here’s an example of the error I’m seeing: Could anyone advise if…
-
Permission to edit Scenarios only
We have use cases where we do not want someone to have edit or admin permissions to a project, but we would like them to be able to enable/disable scenarios entirely or the steps within them. This is in addition to being able to run the scenario of course. We are thinking about the case where there is a failure or need to…
-
Is it possible to have multiple source of provisioning user ?
Hi Dataiku Community! Let me explain my context: today, and for an undetermined period, we have two different apps to provision our DSS group users (LDAP and Azure AD, precisely). However, we do not have the same information in these two sources … My question: is it possible to set up the two different sources to provision a single…
-
Error "Code env not usable"
Hi everyone, A project I created is currently shared with other users in my department as an application, allowing them to instantiate the project. They can run it, but they are not able to make any changes to the flow or the recipes within. After setting a new code env in my project, when a colleague of mine tries to…
-
Does DSS have a recipe for imbalanced sample? Like SMOTE?
-
Use Project Variables in Applications (HTML tile)
Hello, I am building an application, and part of the process is for the user to visit a particular notebook (to look at plots and choose parameters). I am trying to build a simple "url link" message in the Application UI, but I can't find how to use variables in it. * I chose the "Variable Display" tile * Then in HTML, I…
-
Dataiku Outbound IP Address Control - Proxy or Otherwise?
My team works with a data supplier that requires a white-listing process for IP addresses. When we attempt to use Dataiku to fetch the data our IP address associated with the call occasionally changes. We have access to a proxy IP address system, and we have made requests through the proxy to the data supplier when we run…
-
How to send an email from a plugin scenario step using the Python API (Messaging Channels)?
Hi everyone, I’m building a custom scenario step plugin in DSS 12.2. The step collects scenario information from multiple projects, generates an HTML report, and saves it as a project variable. Now, I would like the same plugin step to send this HTML report as an email using an existing Messaging Channel (configured in DSS…
-
Is there a way to get the current Scenario id?
I'm building a macro step for a scenario in which I need to obtain the last successful run of that scenario. I've been able to get the last successful time, but I'm struggling to limit this to only the current scenario. Any suggestions on how to tackle this?
-
where to find confusion matrix, ROC, Accuracy and other in standalone evaluation recipe results
Operating system used: windows
-
How can I access Dataiku Inbox using Dataiku Python API
I want to access the Dataiku Inbox to see who raised access requests using Dataiku Python API. Can you help me how to do that? I want to approve requests in an automated way using Python.
-
Display updated dashboards on dataiku govern
Hi I would like to display dashboards on a govern custom page. I implemented custom charts using matplotlib on a python recipe in the design node. I managed to send these custom charts in the “insights” folder of the project. I can access these charts using a custom HTML page in the govern node. However, these charts are…
-
How to display a "running.." indicator while running a scenario from a webapp
How can I display a "running.." indicator in a webapp while a scenario is running, and how can I refresh that webapp page after the scenario run?
-
SQL Compilation for Date-Partitioned Datasets for recipes that require all Partitions
Hi, this is my first time working with partitions so please let me know if I'm making a mistake. I have partitioned various datasets by a date column (set partition timeframe to DAY) and have set things up so that my scheduled scenarios for initial recipes are only doing transformations on the latest few days worth of data…
-
Course Checkpoint: Time Series Preparation
Hi, can you pls review the answers for Course Checkpoint: Time Series Preparation. I think I am getting 100% right but it still fails me.
-
Failed to create Tutorial - v13.4
Hello! I am trying to create tutorial: Core Designer → Create Your First Project. However, I keep receiving error: I also tried to download the tutorial from and import it, however I am receiving this error: I also tried creating tutorial via Chrome and Edge, both did not work. Could you please help me? Thanks!
-
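I want to avoid a proliferation of tables in the DB
Running a recipe in DSS always requires specifying a dataset, so as many datasets are created as there are recipes. Intermediate tables that are not needed as final results end up remaining, and as the number of recipes grows in a complex flow they put pressure on DB storage. Are there solutions such as turning intermediate tables into views, or running recipes consecutively without creating intermediate tables?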
-
ERROR message : Conflicting scenario when updating project bundle
Hi, I am using the local deployer to deploy a project on an automation node. The first deployment goes well, but when I try to do an update and deploy a second version I get this error message : Automation node http://xxxxxxxx:xxxx returned an error when trying to activate bundle v1_prod on project xxxxxxx:…
-
Using SQLExecutor2 inside shared library
Hi, I would like to execute some raw sql queries like insert the rows directly into the oracle database. Based on the various community discussions, I chose to use SQLExecutor2. My code is as below: from dataiku import SQLExecutor2 import dataiku def test(): # get the needed data to prepare the query # for example, load…
-
Unable to create Python code environment after upgrading to DSS 14
Setup: AWS EC2 (Dataiku Cloud Stack) DSS version: 14.0.0 → upgraded again today to 14.0.2 Issue: When I try to create a new Python environment via “Administration ▸ Code envs ▸ New Python env,” the following error appears: Environment creation failed Command failed: '/opt/dataiku-dss-14.0.2/scripts/_create-virtualenv.sh'…
-
Significant Improvements
I work as a data scientist manager at Pfizer. I have some feedback for DSS. 1. Does not return to the exact location in the Flow - I have a dataset in zoneA and it is shared to another zoneB. - I explore the dataset in zoneB - When I come back to the Flow, the flow focuses on the dataset in zoneA instead. 2. Cannot compare…
-
How to get all runs of scenarios and their information?
For my internship I have a project where I have to create a dataset with all the scenarios and their runs, but when I try to import them, information such as the last run, the progress percentage, the number of runs, the type of triggers and so on remains empty or is marked with a 0. I have the id and the name of…
-
"schema cannot be empty when catalog is present"
Hello, I am new to dataiku. I was trying to play around my data but cannot complete simple data preparation receiving this message: schema cannot be empty when catalog is present. Even though I tried to perform different type of data preparation, it sent me this message without telling me exact error I had. Any idea or…
-
Allow nested flow zones
Hi, I use flow zones a lot and appreciate the value. Why not extend the capability and allow nested flow zones, i.e. a flow zone within a flow zone? thx
-
Not seeing partitions list in the explore dataset tab
Does anyone know why partitions that are being extracted still don't appear in the partition list under the Explore tab of the dataset? Operating system used: windows
-
Created an account NO 🚫 NEW PROJECT PAGE located
Note: I selected Dataiku version 10 👇 below in order to submit this question. I have no idea what version of Dataiku. I logged in created an account that is all. Create account Username Password Top Left: 🐦, Dataiki, profile Top Center: My name is large black font with option to edit Bottom half: Dataiku Academy Dataiku…
-
WINDOW RECIPE - SUM
Hello, I'm working with the Window recipe. I have one simple dataset and I want to compute the sum and avg for each group. In the window definition I set up partitioning by the variable "type", and in the window aggregations I chose sum and avg for the variable "value". I expected Dataiku to compute the sum and avg for each "type" group (i.e if…
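For reference, here is a minimal plain-Python sketch of what a partition-only window (no ordering, no frame limit) is generally expected to produce: every row keeps its own values and gains the sum and average of its "type" group. This is not Dataiku's implementation; the function name and the output column names `value_sum` and `value_avg` are made up for illustration.

```python
from collections import defaultdict


def window_sum_avg(rows, partition_key="type", value_key="value"):
    """Attach the per-partition sum and average to every row."""
    totals = defaultdict(float)
    counts = defaultdict(int)
    # First pass: accumulate the sum and row count of each partition.
    for row in rows:
        totals[row[partition_key]] += row[value_key]
        counts[row[partition_key]] += 1
    # Second pass: copy each row and add the aggregates of its partition.
    return [
        {
            **row,
            "value_sum": totals[row[partition_key]],
            "value_avg": totals[row[partition_key]] / counts[row[partition_key]],
        }
        for row in rows
    ]


rows = [
    {"type": "A", "value": 10},
    {"type": "A", "value": 20},
    {"type": "B", "value": 5},
]
result = window_sum_avg(rows)
```

In SQL-style window functions generally, adding an ordering or a bounded frame turns these into running aggregates, so differing values within one group are often a sign that an order or frame setting is active.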
-
How to get help
How to get help online
-
How can I get a paid version
Hello, my 14-day trial version was renewed but it expires tomorrow. How can I get the paid version? I need to keep using this tool. Operating system used: Windows 11
-
SSL Error in containerized visual recipes despite no SSL used
I have an error when running visual recipe jobs in containerized execution that I don't have when executing locally in DSS. This job has S3 input and output in http, the DSS instance is in HTTP. The "Test" button in Settings > Containerized Execution works properly. Here is the error: Failed to list files, caused by:…
-
regex that will remove everything between begin: and end:
Hi, I've got a dataset with regex patterns in one column and Python code that uses them to make replacements. I need a regex pattern that will remove everything between "begin:" and "end:". I tried "begin:[\s\S]*?end: " but it doesn't work. In between I can have more than 3000 chars, including special ones. Thank you Ela…
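A minimal sketch with Python's standard `re` module, assuming the markers themselves should be removed along with the text between them. One possible reason the quoted pattern fails is the trailing space after `end:`, which only matches when a space really follows the marker; that is a guess, since the input data is not shown. The function name `strip_block` is made up for illustration.

```python
import re

# re.DOTALL lets "." match newlines as well, so the span may cover many lines.
# The non-greedy ".*?" stops at the first "end:" instead of the last one.
PATTERN = re.compile(r"begin:.*?end:", re.DOTALL)


def strip_block(text: str) -> str:
    """Remove every 'begin:' ... 'end:' span, markers included."""
    return PATTERN.sub("", text)


sample = "keep begin: line one\nline two $%^ end: tail"
cleaned = strip_block(sample)
```

If the pattern is stored in a dataset column and compiled later, the inline flag form `(?s)begin:.*?end:` carries the same DOTALL behaviour inside the pattern string itself.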
-
Error when renewing license on Dataiku Govern
Hi, I've encountered an error while trying to renew the license in Dataiku Govern. My current license expires on 30th July 2025. What do I need to do to resolve this? Preferably some way that does not require upgrading DSS. Thanks. Sean Operating system used: Windows
-
How to get information about jobs' CPU and resource usage?
Hello, I need to make a report about all the jobs and all the scenarios running in Dataiku, and I was wondering if there is a command that would give me information about the resources used by each of them. If not, is there another way to gather this information?
-
DataIKU not recognizing locally installed python
I am trying to create a new Python Code Env in my locally installed instance of Dataiku. The only option is Python 3.7. All the other options (Python 3.8, 3.9, 3.10, 3.11 and 3.12) are listed as "Not available" in the dropdown. I have Python 3.10 and Python 3.12 locally installed, but somehow Dataiku is not recognizing it.…
-
Impossible to use Custom Python UI for an application
Hello, I'm building an application with the application designer. I'm trying to retrieve values from a dataset and put them inside a SELECT json list. To do that, I want to use the custom Python UI with the do() method, but I'm unable to save it, and therefore to launch it. In fact, I'm unable to save the app if I write…
-
Need help with efficient BigQuery to Snowflake data transfer in Dataiku
Hi there! I'm setting up a pipeline to move data from BigQuery to Snowflake and really need some advice from folks who've done this before. Right now I'm using the standard DSS engine but it's painfully slow for larger datasets - takes forever to process even moderate amounts of data. I'm not sure what route is actually…
-
how can I use the LLM api hosted in dataiku?
We have some models running on Dataiku, but currently I can only find the Python client. Even when I get the REST APIs, the responses from these APIs are very different from the official APIs, like Sonnet. This is not good if I want to integrate the API into other tools. For example, I would like to integrate the API into a VS Code plugin or…
-
ModuleNotFoundError When Accessing Knowledge Bank via REST API SDK
Hi everyone, I'm running into an issue while trying to access a Dataiku Knowledge Bank from an external Python script using the Dataiku REST API client. When I execute my script, I get the following error:ModuleNotFoundError: No module named 'dataiku.core.vector_stores.lifecycle' Here are my environment details: Dataiku…
-
Trouble creating custom Python code environments
I am trying to create a new Python Code Env in my locally installed instance of Dataiku. The only non-conda based option is Python 3.7. All the other options (Python 3.5, 3.6, 3.8, 3.9, 3.10, 3.11) are listed as "Not available" in the dropdown. I have Python 3.10.11 locally installed, but somehow Dataiku is not recognizing…
-
How to prevent users from accessing filesystem files?
I am installing Dataiku on a baremetal ubuntu 22 OS. However I noticed that users can access the root filesystem quite easily by managed folders or jupyter notebooks for example. Is there a way to completely prevent users from accessing any filesystem related files? If so what is the most efficient way to do so? Operating…
-
How to create a code studio template programmatically
I have an airgapped environment for which I prepare my Dataiku VM beforehand by a cloud-init script. This means all setup is done automatically via python or dsscli/dssadmin cli commands. This VM already contains the container image for container execution and visual recipes so custom python jobs, jupyter notebooks, visual…
-
how to add calculated filters in Charts ?
I want to embed calculated filters (if x > 1 set display color to blue, if x > 1.15 set color to yellow, etc…) in a chart, say a bar chart for example. I know how to add static filters, but is there a way to add a filter based on a calculation? Here in this chart, each color is a parameter. I want to change parameter color…
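The two thresholds in the question overlap, since any x above 1.15 is also above 1, so the order of the checks matters. Below is a minimal Python sketch of a precomputed color category that a chart could then be colored by. The function name and the "default" bucket are assumptions for illustration, and whether charts can evaluate such rules natively is not stated in the post.

```python
def color_bucket(x: float) -> str:
    """Map a value to a color name using the thresholds from the question."""
    # Check the highest threshold first; otherwise "x > 1" would also
    # capture every value above 1.15 and yellow would never be returned.
    if x > 1.15:
        return "yellow"
    if x > 1:
        return "blue"
    # Hypothetical fallback for values at or below 1.
    return "default"


colors = [color_bucket(v) for v in (0.9, 1.1, 1.2)]
```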
-
Problems to run Macro to download the dataiku tensorflow resnet model
Hello, Dataiku community I am trying to follow this tutorial but I have problems to download the dataiku tensorflow resnet model using macro. I select the option "Download pre-trained model" in "deeplearning image cpu" section and I get the error: Oops: an unexpected error occurred Cannot run program…
-
ClickHouse support
Hello, is it planned to add ClickHouse as a supported database? If not, is it possible to add it?
-
Enhance Excel output for "Export to folder"
I would like to request an enhancement to the "Export to Folder" recipe when exporting datasets to Excel format. Specifically, it would be extremely helpful if the export could support: Freezing the header row Adding auto filters to the header These features are commonly used in Excel for better data readability and…
-
Ctrl + Enter to run a recipe
It would be great to be able to use the shortcut key combination Ctrl + Enter to run a recipe while in the recipe editor screen. This keyboard shortcut would be consistent with what you can do in both Jupyter Notebooks and in SQL Notebooks. I realize that there is a current keyboard shortcut for running a recipe (@ run)…
-
How can I write the def do(payload, config, plugin_config, inputs): code?
How do I interact with the macro UI using "type": "SELECT" inside "type": "OBJECT_LIST"? { "name": "test", "label": "test.", "type": "OBJECT_LIST", "itemLabel": "test", "subParams": [ { "name": "obj", "label": "Object", "type": "SELECT", "forceReloadOnValueChange": true, "getChoicesFromPython": true } ] }
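A minimal sketch of such a function, under two assumptions that are not confirmed by the post: that the expected return value is a dict with a "choices" list of value/label pairs, and that the payload carries the requesting parameter's name under a "parameterName" key. The choice values are placeholders, and whether DSS calls this function for subParams nested in an OBJECT_LIST is not established here.

```python
def do(payload, config, plugin_config, inputs):
    """Return the dropdown choices for the SELECT parameter named 'obj'."""
    # Assumption: the payload identifies which parameter is asking for choices.
    if payload.get("parameterName") == "obj":
        names = ["alpha", "beta"]  # placeholder values for illustration
        return {"choices": [{"value": n, "label": n.title()} for n in names]}
    # Any other parameter gets an empty list rather than an error.
    return {"choices": []}
```

Called as `do({"parameterName": "obj"}, {}, {}, [])`, this returns two choices; any other payload yields an empty list.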
-
bloomberg Data Analytics in Dku
Hello, I have an Excel sheet with connectors to Bloomberg; it typically uses Bloomberg formulas (like BDP, BDH, etc.) to pull data into Excel. Do you know if there is any chance this could work with Dataiku, with or without Excel? Has anyone worked on this? Thanks
-
Disappearing training log
I am training a deep learning model in a Dataiku notebook, but the training log has disappeared and I don't know why. The output of my training keeps growing, as if the log were "written in white". Operating system used: Windows
-
Code Studio - Not authorized: Not owner of the Code Studio
My friend has created a Code Studio. We are under the same Dataiku project. Are there any settings that need to be configured in order for me to gain access to the Code Studio?
-
Using dataiku DSS, How to get back to Designer for starting a New Project? Thanks.
I am pursuing the Developer Track on the free version of Studio. Mapping my progress alongside navigating DSS is a bit challenging. I am copying the Lesson onto Note and using another screen to run the exercises. These are quite intuitive for the most part, but with a few hiccups like getting back to the Home screen or…
-
Dataiku to Greenplum: Performance Lag on Large Data Loads & Batch Read Control
Hello, During a Proof of Concept (PoC), we're experiencing performance degradation when loading 20 million rows of data with 500 columns into GPDB (Greenplum Database). We've observed in the Dataiku logs that it continuously reads data in batches of 2000 rows. We're looking for a setting to adjust this batch size. We've…
-
Running Hugging Face models in Dataiku
Hi, I am trying to run some hugging face models in Dataiku DSS. The task requires to use an on-premise server where I have some useful GPUs to run these hugging face models. The server has an NVIDIA H100 Tensor Core GPU and I want to know: If Dataiku DSS can be installed in that server, if there are no problems with…
-
How to show total in a stacked bar chart and order by total
Hi Dataiku experts, I have a chart to show actual spend and remains to spend. would like to show total value of each stacked bar and order the bars by total value. I have a workaround solution but visual is not perfect- I added a third dataset Lifetime Total and uncolor the bar, but the result is there is too much white…
-
Free Edition Registration Error
I have installed DSS on linux Centos 7 this morning. DSS was started without error. When access the DSS URL the first time I was asked to select license. At page 2- Select your license, I selected Free Edition and entered the registration info. When clicked "Next" button, I got this error: Network error An attempt to…
-
About Shapley calculations
Hi, we're facing big problems with Shapley calculation with a customer. I've found some useful documentation at https://doc.dataiku.com/dss/latest/machine-learning/supervised/explanations.html and a few messages in the community (https://community.dataiku.com/t5/Using-Dataiku/SHAP-Shapley-values-in-Dataiku/m-p/22241,…
-
Run python Recipe with Scenario
Hi, I have a Python recipe that takes two datasets as inputs and provides a dataset as output, now I want to run this recipe with a scenario and run it every day at a specific time. How can I run this recipe? Thanks
-
Neo4J Connect through SSO
Hi! Is there a way to connect to Neo4J using SSO? We are currently investigating the corresponding plugin but the sign-in is only available through a username-password pair. Thanks in advance for any help!
-
Question about the install path of Dataiku
Because of the space in my laptop username (Zhao Guanghao), I cannot run Dataiku correctly. Meanwhile, the folder name under the 'User' folder on the C drive can't be modified: once you rename it directly, the system cannot identify the account and it will collapse. Besides, Dataiku can only be installed in the path 'C…
-
dataiku library messagesender Email CC
Hi All, I am trying to use the MessageSender to send some emails in custom recipes. Is there a way to pass the CC email list in the send function? Please let me know if there are any suggestions or alternatives. from dataiku.core.message_sender import MessageSender s = MessageSender(channel_id='SMTP',…
-
Saving Vector Store as KB
I was wondering if there was any way of saving a FAISS vector store I create in a python notebook as a knowledge bank I can use later on? I created a vector store (see code below) which has summaries as the embedded objects, and the parent documents as the retrieved documents. I did this based on LangChain's…
-
Using date in DataIKU
Hi, Despite going through documentation multiple times, I still don't really understand how dates work in DSS. I'm importing dataset from a connection. Without turning on any of the options in Date & Time handling, this is how data looks like: It says that the data type is string, while in the database itself it is, in…
-
SOLVED. Cannot replicate GLM predictions
SOLVED. It was the offset - needed to take the natural log of it before calibrating. Hello, I built a model using the GLM Classification plugin. The AUC is ~0.8 so it's fitting my data well but when I implement the GLM formula manually into Tableau the predictions are far too low despite having the correct shape. The model…
-
how to turn on chart zoom in&out feature in dashboard
Hi community, When I plot chart, there is a nice feature of zoom in&out by date (in the bottom of the chart), as shown below. However, when I publish the chart into dashboard, it seems that this feature is dropped by dataiku (see below chart). Is there a way to turn on such timeline/date zoom in&out feature in dashboard…
-
How to extract rows flagged by a custom Python rule in the Data Quality tab ?
Hi everyone, I'm working with Dataiku DSS version 13.5, and I'm using the Data Quality tab on datasets to define validation rules. When I use standard rules (e.g., missing values, uniqueness, etc.), I can easily export the rows in error. However, when I define a custom Python rule, I can see the column status marked as…
-
Load Balancer IP Address
I'm working with my DKU platform (running in Azure) to configure my kubernetes cluster with the goal of ensuring my load balancer node runs with the same IP address every time - e.g. 10.1.1.100. My AKS plugin configuration has a node pool named API-LoadBalancer. It has one node and does not autoscale. The admin panel's…
-
how to define helper in python code in Dataiku project
In a Dataiku project I have Python code, but I also need to declare additional Python code that will then be used like a library: from HelperLibrary.library1.codev1 import testprocessing. Is it possible? Operating system used: Windows
-
Design Node Launch Failure
I have set up a Dataiku on AWS and configured it to automatically start/stop the Fleet Manager and design nodes using AWS functionality. Yesterday, both the EC2 instances and DSS started successfully. However, today, while the EC2 instances started successfully, the DSS failed to start. For reference, the version of…
-
Timeseries forecasting with GPU / cuda 11
Hello, I am now trying to train a model with timeseries forecast by using GPU. OS: Ubuntu 22.04 Installed with apt-get on OS: libcudnn9-cuda-11 cuda-toolkit-11-8 libnccl2 I then created a new python env : when i use that environment in the model, I can see at first that it's fine since it shows me my GPU card : but when I…