-
Change to Google Drive: Will this impact the Dataiku Google Drive Plugins?
All, I just received a notice from Google that they are change the way that Google Drive by API works. (See below for the note.). I'm wondering if this change is going to impact the way that any of the Google Plugins work. And if we think thatit will make a difference. What type of changes are folks considering to make. It…
-
Free for testing LLM API
Hi, do you know any of the LLM providers that proposes a free API with free prompts through Dataiku LLM mesh (on limited basis of course) so using one of the LLM recipes of Dataiku the reason why i ask this : i am teaching to students data science using Dataiku , i would like to have my students practicing Gen AI through…
-
Does anyone know how to add a RecordID column in dataiku. I want to add serial number to each row.
-
Group by with empty value and with Null value
Hello everyone ! I have a dataset with empty values in one of the columns (col1) and I use a group by recipe on an other column (col2) without empty values with col1_distinct as aggregation. I get a volume of 21, 199 and 1608 for the 3 col2 fields. But I wanted to add a condition on col1 with a prepare recipe with a…
-
Allow user email configuration in profile for on-demand email alerts when recipes complete
Hi, I know Dataiku has scenarios and I use them daily. But when I am debugging flow recipes it would be great if I could configure Dataiku how to send me an email in my profile. Then in a recipe I could just switch a radio button 'EMAIL ON COMPLETION' to ON and walk away. When I am developing in dataiku, many of my flows…
-
ModuleNotFoundError: No module named 'modellightgbm'
I'm training a LightGBM classification model on Dataiku using a Python 3.6 environment with the following packages on dataiku 13.3.1 version : scikit-learn>=0.20,<0.21 scipy>=1.2,<1.3 xgboost==1.5.2 statsmodels>=0.10,<0.11 jinja2>=2.10,<2.11 flask>=1.0,<1.1 cloudpickle>=1.3,<1.6 matplotlib==3.3.4 numpy==1.19.5…
-
Dataiku Free Edition - Installation issues
Hi, I have a Windows laptop and I am trying to install the Dataiku free edition on it. When I run the Dataiku Launcher Setup it freezes at the 'Loading' stage with the following message box displayed: Can you assist please? Thank you Victor Operating system used: Windows 11 Home 64 bit Operating system used: Windows 11…
-
Governing old projects : easing the archive or deletion
Hi, we have hundreds of dataiku projects on our instances, many of them are just old versions or failed experiments and are wondering if someone already worked on this. Today we are listing projects through the Python API, exposing to end users a list of projet keys, associated with the owner and lastModifiedOn timestamp…
-
Extract tables from PDF
Hello community, to perform RAG, I want to extract tables from PDFs. I would like to do this using Dataiku plugins, but the quality is not what I expect. Do you know of other methods to do this? Thanks !
-
Dynamic Column
I have an excel input file. Col A to Col T till row T26, I have data where Col T have latest month data. Col V to Col AO have second set of data till AO50. Now, its dynamic data, every month, a new column is added. For example, for next month, Col u will have Dec month's data. same with second set of data, I will be…
-
how can I make a Django app in Dataiku ?
I'm looking for information regarding coding a Django application in Dataiku. Any information on how to achieve this ? thanks
-
Is the "Admin" privilege necessary to create branches in a project?
Hello, I am an administrator on our DSS servers and can create branches in project without issue. However it seems that unless a non-admin user created the project they cannot create branches (or switch branches) in a project unless they are made "Admin" under the "Security" tab, either by giving the permission directly or…
-
Terraform deployment
Hello, I'm wondering if anyone can share terraform code for dataIku infra. Thanks I'm thinking to use terrafy to convert the Azure ARM template to terraform but I thought to ask the community incase someone has done it before.
-
Custom trigger to run itself?
Hi everyone, I want to execute scenario again if its fail, so it can try 3 times, sometimes kubernetes fails or, spark fails could be fixed after run again so, i dont miss time range between fail and fix manually. This is the code created with LLM, it used python for that. I changed project name to variable "project name",…
-
How to use "Execute Python unit test" scenario step
A new scenario step was added in a recent DSS version which is to execute a Python unit test. I'd like to start using this. However, the documentation is pretty brief: "this step executes one or more Python pytest tests from a project’s Libraries folder using a Pytest selector". Anyone have more details on or an example of…
-
Dataiku cloudでSparkを使う方法について
GCP環境で立ち上げたDataiku cloudでsparkを使おうとするとエラーが出てしまいます。 ドキュメントにはDataiku cloudではデフォルトでSparkが使えると書いてあるのですがPysparkレシピを実行すると画像のようなエラーが出ます。何か設定をする必要があるのでしょうか。 Operating system used: Windows Operating system used: Windows
-
Custom trigger does not executes python code
Hi, I am quite new in Dataiku and I am interested why the following code does not work as expected. Namely, I am trying to define custom trigger that will check if the folder is empty. Both from dataiku.scenario import Trigger t=Trigger() folder = dataiku.Folder("folder_id") files = folder.list_paths_in_partition() if…
-
Add Venn diagram and UpSet plot to Charts
I'm encountering some use cases where I want to easily visualize the number of records belonging to one or several groups and their overlap where group membership is spread over multiple 1/0 columns. Would be super handy to have Venn diagrams in the Charts or, sometimes even better, UpSet plots.
-
I want to adapt OpenVPN's functionality to the API
Do you have a request to extend OpenVPN functionality to APIs, rather than just DBs and storage with connectors? We would like to use OpenVPN to connect via API from an application operating in a closed NW environment, but the current functionality does not allow us to connect. If you have the same request, we would be…
-
schema propagation problem
In the dataset explore, I can define 'description' using 'edit column schema'. And I can propagate the schema to the following 'flow' using 'schema propagation'. However, sometimes it is not inherited (if there is a 'prepare recipe' in the middle) I want to know how to inherit it normally.
-
DataScienceStudio.app not updating to DSS Release 13.4.0
Hi, usually the DataScienceStudio.app detects new DSS releases and asks the user to update. But this time, I am getting any update notifications even a newer version of Dataiku is available. My current version: 13.3.0 Latest version: 13.4.0 Operating system used: MacOs 15
-
How to use Notebook
Hi Team, I have signed in with my gmail account and not able to use environment. Can someone help me how to start using this enviornment or notebooks to work a sample hacthaton
-
vector database
How to use vector database in dataiku for premises LLMS Operating system used: Operating System: Red Hat Enterprise Linux 8.5 (Ootpa) CPE OS Name: cpe:/o:redhat:enterprise linux:8::baseos Kernel: Linux 4.18.0-348.20.1.el8 5.x86 64 Architecture: x86-64
-
Google Workspace as SAML SSO provider for DSS?
Is there anyone out there using a Google Workspace Domain to set up a single sign-on environment for Dataiku DSS? Operating system used: Linux
-
How does the evaluation store threshold actually work?
In the documentation for the evaluation store, when doing a two-class (binary) classification, there is a slider for the threshold used. The documentation for this threshold reads in part: When doing binary classification, most models don’t output a single binary answer, but instead a continuous “score of being positive”.…
-
Dash Long Callbacks Not Working
Hi all, I'm struggling to get long callbacks to work in Dataiku. When I initalize the app = dash.Dash() instance, the application does not run at all. When I remove it, the application runs but the callback does not work at all. Currently, it only works with a regular callback but I need it to work with a long callback.…
-
Scenario steps documented in Project Documentation
I see that the Scenarios are not documented in the auto created project documentation. This feature will greatly help to document how our automations are orchestrated
-
Dashboard Improvements on Reference Lines
Looking for 2 Dashboard enhancements on the Reference Lines tuning Reference Line Value Currently, if "Constant" is chosen as the source, a manual value must be entered. It would be beneficial to allow the use of a global variable in the value field. Ability add an aggregation of a different dataset column Have a dynamic…
-
can not register to DSS , after first time of installation
I am facing this error: Network Error: An attempt to communicate with DSS failed. Please check your network connectivity. Operating system used: RHEL-8.9 Operating system used: RHEL-8.9
-
How to set up a random forest regression?
-
Select Columns Outside of Join Recipe
I would like to be able to select the columns of data outside of a join recipe. A couple of examples: 1 - Usage of "unmatched rows". The column selection occurs after the join does not apply to data that isn't joined. In this case I am using both sets of data so need the option to select columns from both sets. 2 - Removal…
-
Option to display short descriptions on flow
Hi All, Forgive me if this has been discussed before, or if it is a polarizing topic as far as visual design goals. In evaluating Dataiku against other products, and ultimately deciding on Dataiku due to its many strengths, one thing my team lamented was that it was not possible to display descriptions of flow elements on…
-
Dataiku users from Romania
Are there any other community members from Romania? PS: > also started this thread as a log for tracing personal progress. >Day one-13.01.2025 - Installed application x Data Preparation Quick Start 8 of 8 lessons completed (100%)
-
Ommiting quotes around scenario string variables in a Freemarker email template
Hi, I have a scheduled project scenario that sends an email on some condition. The scenario contains a step that sets scenario variables based on values in a dataset. Here's that step: import dataiku import dataiku.scenario # Read the dataset df = dataiku.Dataset("node-disk-usage").get_dataframe() use_percentage_threshold…
-
About deployer infrastructure setting
I would like to know how to set K8s-related settings in the infrastructure of dataiku deployer and make the ingress controller (NGINX) option visible in the service exposition. Of course, I know that the ingress controller has been set up in K8s and needs to be done. However, I would like to know how to make the Ingress…
-
Is the person I'm talking to on what's app really from dataiku
Im not convinced
-
SQL Step to copy table
Hi, was trying to run insert statement as step in scenario , it's loading only 10 records can you please guide me what could be the issue?
-
How to check for consecutive monthly buys
i have a dataset that has purchasing history for many items for the past 10 years, i want to pull out only the items that have been purchased every month for the last 10 years. how do i go about this? Operating system used: macos
-
How do I use training and prediction dataset together in Dataiku
Hi I'm using Dataiku version 13.1. I need to do text prediction using BERT for that I have training dataset. After training I need to score it using prediction dataset. I'm doing BERT using Python code recipe. Can you suggest me the steps to score the prediction dataset?
-
Allow nested flow zones
Hi, I use flow zones a lot and appreciate the value. Why not extend the capability and allow nested flow zones, i.e. a flow zone within a flow zone? thx
-
Cannot Create Published API Service - Already have API node
Hi, I'm trying to publish my model in API node, i'm currently running in Development side, my dev side already consist of Designer node, Gov Node, and API node, i kept getting error "not authorized - Cannot Create Published API Service" How to address this issue? or anything to do with my User License? Thanks Operating…
-
ProcessDiedException - can't access the Flow
Hi, I was migrating a project from a localhost Dataiku to a shared local network instance and after importing the project, I can't access the flow screen. All I get is "HTTP code: 500, type: com.dataiku.dip.exceptions.ProcessDiedException" error message. Is there a way to debug this and find the underlaying issue ? I found…
-
Error While Accessing visual flow
"detailedMessage": "Cannot run program \"/data/dataiku/bin/jek\": error\u003d0, Failed to exec spawn helper: pid: 2472572, exit value: 1, caused by: IOException: error\u003d0, Failed to exec spawn helper: pid: 2472572, exit value: 1" I receive this error massage while tried to delete rows in visual recipe and affecting all…
-
Why does Dataiku allow two web-apps with the same name?
I was surprised to find Dataiku allows two web-apps with the same name to exist. Why? The expected behavior would be to ask the user if they want to overwrite a published web-app when name collision occurs. thx Operating system used: WIndows 10
-
Integrating Dataiku with Denodo
Has anyone created an integration with Denodo? If so, did you use a JDBC connection? How did you manage user permissions? Thanks! Operating system used: Red Hat
-
disabling Update output schemas in Python Scenario
Hi how do I disable the option that updates output schema in a Python scenario? Can't find the option in the API reference. Would be equivalent of circled below. Thanks Operating system used: Windows Operating system used: Windows
-
API Service: Python Prediction Endpoint vs Python Function
Hi, I have a general question regarding the difference between the python prediction and python function endpoints in the API Service in regards to serving a custom python model. From my understanding, the only advantage that the python prediction endpoint has over the python function endpoint is the ability to…
-
Merge/Group rows based on metrics
I am trying to merge/Group rows based on metric range. Present format Desired format
-
How do I set the logging level for Python recipes?
import logging logging.basicConfig(level=logging.ERROR) I have already tried the code above, but my recipe Python output still shows logs DEBUG and INFO. It means it is very hard for me to find the output from my tqdm progress bar. I am using DSS 13.3.3 Operating system used: Linux Operating system used: Linux
-
Ability to package environment/local variables with an API service
It would be very helpful if Dataiku allowed for packaging variables (either environment or local variables) with the capability to remap local variables as part of the deployment. Ideally there would also be an option to encrypt a variable. We have several API services that connect to other systems and require environment…
-
snowflake connection is working fine from Dataiku , however fetching the table preview throws error
Failed to read data from table Failed to read data from table, caused by: SnowflakeSQLException: JDBC driver internal error: exception creating result java.lang.NoClassDefFoundError: Could not initialize class net.snowflake.client.jdbc.internal.apache.arrow.memory.RootAllocator at…
-
How do I send an email to the user based on a condition on count of records in a dataset?
hi, After reading the documentation, I came cross the "Compute metrics" step in Scenarios, but how do i retrieve the count of the dataset using ${stepOutput_the_metrics}? And then if the count is more than 0, I want to trigger an email to the user. I am on DSS version 13 Any help is appreciated. Thanks Operating system…
-
"Training failed" Error in Machine Learning Quick Start Project
I just installed Dataiku 13.3.3 (Free Edition) on a MacBook Pro (MacOS 15.3) using Chrome as a browser. I am going through the Machine Learning Quick Start project following instructions in Dataiku Academy. I am getting a "Training Failed" error when training the model with the error message below. I suspect the issue has…
-
How to use LLM Mesh work with LiteLLM
Hi, I'm working on an Agentic Gen AI project using the crewai package, which uses LiteLLM as the engine to connect to various Gen AI models. I would like to use Dataiku LLM Mesh, but it seems that it's not compatible with the LiteLLM. I tried to use the DKULLM and DKUChatLLM, but both of them are not working. I'm on…
-
Have a dataiku templating engine based on Python mako or jinja
Hi, Python based templating engines like jinja and mako allow users to 'print' text in various formats, using conditional logic statements like if-else and for loops. I think dataiku should offer an off the shelf Python based templating engine that would allow users to upload their template(s) and pass a `context dict` to…
-
How do Preserve chienese text format during CSV to dataiku load?
I'm using Dataiku version 13.1. I have a text dataset with around 2400 rows, mostly it's in english but around 100 rows contains chines character. My data is in csv format. I need to perform GenAI task on my dataset & load back to it to CSV Chinese characters are converting to english characters while loading data from CSV…
-
Data not writing to dataset in Python recipe
I created a Python recipe that I get data from an API. I save the response to a panda dataframe, but when I use the DataSet.write_from_dataframe to write the dataframe result my output dataset. The rows is written, but the row is empty when I check the dnb_duns_detail_json dataset. Operating system used: MacOS
-
Best Match for Fuzzy Matching
Hi, I am new to DataIku and trying to find areas of overlap in 2 datasets using fuzzy matching. Is there a way to get a numerical ranking for how close matches are, so I can identify the highest matches and remove duplicate suggestions if needed? Thanks,
-
Error in python process: <class 'ValueError'>: Numeric feature score_2 is empty
Trying to run a Recommendation system, I received this error not allowing to run the job. Received this error message, checked the database and all fields are with information. [01:16:18] [INFO] [dku.utils] - *************** Recipe code failed **************[01:16:18] [INFO] [dku.utils] - Begin Python stack[01:16:18]…
-
Issue with Python script appending data in Dataiku project
Hello, I have an issue with my Dataiku project. I wrote a Python script that appends new data from the input dataset to the output dataset. I think the problem may be related to recursion in Dataiku. Could you please suggest a solution? Thank you in advance!
-
Issues with the Google Drive plugin: file not appearing in the Managed Folder in Dataiku
Hello everyone! I’m using the Google Drive plugin in Dataiku to read and write files directly from my Google Drive account. I managed to set it up, and it works correctly when creating datasets from Drive. However, I’m having trouble creating a Managed Folder in Dataiku that points to a file in Drive. Here’s what I do:…
-
Import from PowerBI to Dataiku
Hi community! I have a strange question. Is there any way to import data from a poweri workspace into Dataiku? I can do the other way around, export a dataset into a workspace but was wondering if there was an obvious plugin I am missing. Many thanks :) *Also, is there any limitation? i.e. Gen2 Dataflow only, semantic…
-
How write the rmarkdown report created in dataiku to a managed folder?
-
How to dynamically name rmarkdown reports in scenarios?
I'm working on a project that requires me to send an R Markdown report to a Box folder via email daily. However, the emailed file has the same name each day, leading to overwrites and versioning issues (e.g., v1, v2, v3). I'd like to automate this process by adding the current date to the filename of the emailed report.…
-
How to append dataframe in existing output dataset
Hello experts, In dataiku v12.3.0, I was trying to append dataframe using write_dataframe() in existing dataset (with same schema). But it always overwrites with last dataframe even though the dataset spec is configured like: dataset.spec_item["appendMode"] = True The dataset is classified as output so it doesn't let me…
-
C compiler error while running python module 'pymc'
Hi, I am using pymc module to develop a MMM model in Dataiku. While working on it, I am experiencing this somewhat well-known error on C compiler but couldn't find any solution on Dataiku yet. Code where the error occured: import warnings import arviz as az import matplotlib.pyplot as plt import numpy as np import pandas…
-
Changing the default font size for all new charts
I often send screenshots of the charts to my stakeholders over Teams chat to get quick confirmations/alignment on the intermediary results. A feedback I heard often is that they are having a hard time reading the legend, axis titles, values on axis etc. I don't want to manually change the font size for all these different…
-
How can I use websockets in a Webapp?
I would like to make a custom Webapp where the Python backend talks to the frontend and vice versa. I see that the Dataiku Answers webapp uses websockets and I would like to do the same. My current attempts using Flask-SocketIO did not work unfortunately, as it seems to use Werkzeug under the hood and I cannot start the…
-
How to load a mlflow model in a script to make predictions
I trained and deployed a model using mlflow in Dataiku. I want to make predictions on a test dataset using this deployed model. However, I don't want to use the "predict" visual recipe. Instead, I want to load the model in a script and make predictions. But I am not able to do it. Operating system used: Linux Operating…
-
Run Scenario after multiple scenarios have completed
Hi - I am trying to create a scenario that will auto-trigger once other time-based scenarios (in other projects) have completed. I think this is possible for 1 scenario using "Trigger after scenario" which automatically checks the status of a scenario at the frequency you set but I can't figure out how to do this using…
-
How to Sync Current Date from Greenplum to Oracle with Only the Date Part?
I have added a current_date column to my table in Greenplum using a Prepare recipe (with now() in Formula language). I want to sync this column to an Oracle database, but I need to keep only the date part of the value. For example, I want to convert a value like 2025-01-25T21:50:28.102Z into 2025-01-25 and store it as a…
-
Split / Join formula
Hi, Is it possible in a split (and only a split recipe not an sql one) with formula to check a condition by join. For example, to check that one of the child lines a of a parent b contains a certain value. If so, put all the corresponding lines a ->b* (of which at least one b satisfies the condition) in the split. Best…
-
Folding multiple columns by MULTIPLE patterns
Hello, I have a large number of columns that all follow the same pattern, with 3 parts that can change: Path#_Field_Year So either the Path #, Field, or Year could vary. For example, I could have: 1_Car_2030 1_Car_2031 1_Car_2032 2_Car_2030 2_Car_2031 2_Car_2032 Or I could have: 2_Bus_2040 2_Bus_2041 2_Bus_2042 5_Bus_2030…
-
I want Better Dataset Visualization in Dataiku Dashboard
I need to leverage Dataiku Dashboard to visualize Dataset with a better look. this is what I want (set different colors for value criteria / bold font / dataframe multindex support /etc): this is visualization of dataset currently (quite plain): I am code designer and can write python, btw, is there any solution? Operating…
-
Combining data sources from Blob Storage
Hello, I have a pretty simple problem, but somehow I'm not able to solve it. I'm using Dataiku to do ETL stuff and I need to pull the data from Blob Storage. Usually I just specify a path this is the start of my flow. But in this case, I need to create the first source dataset from multiple blob "folders" that are in a…
-
Beginner Help: Deploying an API Service with Pickle Model from Jupyter Notebook in Dataiku
Hello Dear Community, I am a complete beginner in Dataiku and have created a Jupyter Notebook as a mini test model. I used Pickle to save the model and vectorizer into a managed folder named "Models". My goal is to make this model available as an API service, but I’m struggling with the process and would greatly appreciate…
-
Je n'arrive pas à faire fonctionner cette formule pour calculer la différence entre deux dates
voici la formule: if(diff(DateDeSortie_MAJ2,val("DateEntréeDIR2S_MAJ"))>=0,"",DateDeSortie_MAJ2) merci pour votre aide
-
update DSS license - improve the documentation
Dataiku's documentation says there are 3 ways to update a DSS license but then only lists 2: """ Copy your new license file into DATA_DIR/config/license.json (needs restart of DSS) Log into Dataiku DSS, click the “Administration” gear, then click “Enter license” and enter the contents of the license file. """ I suppose the…
-
Container configuration mapping in bundle deployments
This request is to add mapping options for container configuration in bundle deployments. This would allow for repointing in the event that the container configurations are not named the same in the design node vs automation nodes.
-
Per-user credentials in LLM connections
This request is to add support for per-user credentials in LLM connections. We use OpenAI and set up API keys per project so that we can track spend and budgets at the project level. Currently we have to set up a separate OpenAI connection for each project but ideally we would be able to pass the API key in either through…
-
Selenium issues with Chrome.exe and Chromedriver.exe
I get this error Chrome binary found at: C:\\Program Files\\Google\\Chrome\\Application\\chrome.exeExecute permission for C:\\Program Files\\Google\\Chrome\\Application\\chrome.exe is granted. Failed to launch Chrome binary at C:\\Program Files\\Google\\Chrome\\Application\\chrome.exe: /bin/sh: C:\Program…
-
Bug - CTRL+Z Erases Text on Dataiku
I accidentally pressed ctrl + Z after opening a Wiki article and it caused all the text to be removed. I'm on DSS 8.0.2. This also happens within SQL queries and the Library Editor. Is this intended behavior? Here are the steps to reproduce with the Wiki: 1. Create Wiki article with some text and save 2. Navigate away from…
-
How to chart time series longer than 20 data points?
I'm creating a dashboard for reporting abnormal data series, looking at the past 90 day sales. I want to create the line chart of the past 90 day sales to monitor spikes and abnormalities. But the line chart only shows about 20 data points and lump all rest into "other" categories. Is this the default setting of chart…
-
How to get Sigmoid Platt scaling parameters used in Dataiku visual analysis Probability Calibration?
Hi Team, In the current MLOps pipeline which we are using for Model Development, we often have the need of re-calibrating the propensity so that we can have correct scaling. For this we are current using Platt Scaling which can be switched on or off from the portal as can be seen here - However , for this feature we…
-
Spark installation in standalone mode in DSS
Hi, Our Dataiku is not connected to Hadoop or Kubernetes cluster. Can I still install spark in local DSS server?
-
How do I get created and modified dates fro an artifact in Dataiku Govern API
I can see this information in the timeline in the UI, but doesn't seem to be in the data returned by: ar = client.get_artifact('ar.26421') print(ar.get_definition().get_raw()) Operating system used: Windows
-
Threshold optimization for what if? fuction in AutoMLprediction
I used "what if?" function in AutoMLprediction to do simulation, and I would like to change the threshold DSS automatically used. In the userguide, there is below description. But I could not find any button/icon to change a threshold. Please let me know how to change it. Prediction settings — Dataiku DSS 13 documentation…
-
Collaborative use of code studio template
We have recently starting working with code studio templates in order to create streamlit webapps on Dataiku. We quickly realized that while each (non admin) user can create their proper code studio instance from a specific code studio template without any issues, it is not possible for multiple (non admin) users to have…
-
Using large context for a Gen AI prompt
Hi, I'm trying to create a prompt to ask questions to a LLM and get an answer based on 5,000 reviews for a product. I know there are ways to classify or perform sentiment analysis, but what I want to do is to ask an LLM a question about the whole bunch of reviews. I tried using RAG, but it is my understanding the this…
-
Trigger on scenarios
Hello, I want to build a dataset using an append mode at the end of each month, triggered by a scenario. For this, I’ve created a custom SQL trigger that checks if the essential tables are updated by returning 0 or 1. However, this SQL code takes around 15 minutes to execute. To optimize this process, I’d like the trigger…
-
Performance issue using 'Group by' and 'Join' with CSV file
Hi, I have migrated from SAS to Dataiku, but having some performance issue. is there a way to solve the performance issue, using Dataiku for data 'group by' and 'join' recipes with CSV file?
-
Collating evaluation stores.
Hi I have 6 different models and for those models I have 6 different evaluation stores, is there a way to collate all these 6 evaluation stores and create a single collated evaluation store? Operating system used: Windows Operating system used: Windows
-
I need to find string in the text and truncate text untill this string
Operating system used: windows Operating system used: windows
-
dataiku API designer support upload files?
https://community.dataiku.com/discussion/12910/how-to-upload-a-file-through-api-designer is same question like this one, in 2021, You are correct that API services will only accept the query parameters as JSON objects, so uploading a file using multipart/form-data is not possible at the moment. I not sure now is 2025 any…
-
How to install TEXT PREPARATION PLUGIN
Dear Community Friends, Can anybody please help me in solving problem " How to install Text Preparation Plugin step by step ?" including installing dependency modules ,Profibus compiler, Regex Module in Python environment. I am using Windows10 with Dataiku 13 Free Edition . I am repeatedly failed in installing TP. when…
-
While Using Dataiku
Dear Community Friends, Can anybody please help me in solving problem " How to install Text Preparation Plugin step by step ?" including installing dependency modules ,Profibus compiler, Regex Module in Python environment. I am using Windows10 with Dataiku 13 Free Edition . I am repeatedly failed in installing TP. when…
-
Initial install - Connecting this DSS instance to a AKS cluster
Failed to start cluster : <class 'azure.core.exceptions.HttpResponseError'> : (AuthorizationFailed) The client 'xxxx-xxx-xxx-xx' with object id 'xxxx-xxx-xxx-xx' does not have authorization to perform action 'Microsoft.ContainerService/managedClusters/read' over scope…
-
How to Visualize Anomalies in Dataiku Similar to This Chart?
Hi everyone, I'm trying to create a chart in Dataiku to visualize anomalies in my dataset, similar to the example below: The chart highlights three categories: Good Anomaly (blue dots) Expected values (gray dots) Bad Anomaly (red dots) I have the following columns in my dataset: Date/Time (for the x-axis) Sales (for the…
-
delete
n/a
-
List all connection calls
Hi guys, Is there any way to list all calls that a dataiku connections. For exemple, i have a connection pg-teste-op and i need to know many times it was used, that is, how many datasets use this connection and the highest possble level of detail, such as wich job used the connection.