-
UPSERT Sample or statement
Hi, I'm new to Dataiku. I have a new project in Dataiku that needs to upsert into an existing table or dataset. How can I perform the insert/update statement in SQL, especially with input values from a dataset? Let's say I have an abc dataset with 10 columns of data, but need to insert/update only 5 of the 10 columns. How…
-
Dash and Standard Webapp
Hi community, I am currently using Dataiku version 13.4, where I don’t have access to the API services. About my agent code: I have created a multi-agent system in Python where each agent performs a specific task sequentially: Agent 1 – Searches for relevant content based on user input. Agent 2 – Extracts the required context…
-
Ollama on DSS
Pardon my ignorance on the subject, but is there a way to connect DSS to a model like llama2 or mistral running on Ollama, either locally or on another k8s cluster? Operating system used: Suse 15
-
How to split and fold while escaping the comma in strings
I have the same task as the person in the dataiku discussion linked below. I'm using Sarina's solution at the bottom of that page. However, since some of the strings in my arrays have ", " (comma space) as part of the string, I can't use ", " as a separator, otherwise the array will split in the middle of a string. I…
-
Limit the decimal places
I want to round these numbers to 4-5 decimal places. I am using the round formula to achieve this, but the result is not reflected in the next steps. Can anyone help me with this?
-
refresh partitions in dss via API
Hi, using the Python API we have added a new dataset to the project, pointing it to an existing location in HDFS where partition folders are stored. (This location is managed by another DSS instance.) This kind of "import" of a read-only dataset works, but I did not find a way to "refresh" the list of partitions, i.e.…
-
How to programmatically refresh input dataset partitions with Snowflake?
Hi, I’m working with a Snowflake-partitioned dataset that serves as an input in my project flow. I’d like to automate the refresh of the partition listing, which is normally done manually using the "REFRESH PARTITIONS" button in the Metrics tab. We previously managed to do this with S3 using the…
-
How to shrink an over-extended disk by Fleet manager
Hello, We manage a Fleet Manager and Design Node deployed using "Cloud Stacks for AWS". As a result of generating a large number of container images such as code envs and code studio templates, the disk size of the Design node (EBS gp2) became enormous. By deleting old DSS versions and unused images, we were able to free up a…
-
We have a count function in Alteryx. Can somebody tell me the equivalent count function in Dataiku?
We have a count function in Alteryx. Can somebody tell me the equivalent count function in Dataiku? Operating system used: 2023.2.1
-
Check column name values in sharepoint file using dataiku
I want to retrieve SharePoint files from the library one by one and check whether the column "Project Name" is the same; if not, email the user to check the file. I have used the plugin and the files-in-folder option to retrieve the files, and a Group By to check the count. But how do I set up an email scenario to check count and project…
-
data truncated
I have a Prepare recipe with a dataset as output, and I store text in one field. I don't understand why the preview in the Prepare recipe shows the full text while the text in the output dataset seems truncated. Any ideas?
-
Using Dataiku
Hi all, I'm working on a Python recipe to automate file validation in Dataiku using managed folders. My goal is to: Scan a "validation" folder for Excel or CSV files. Check that they contain the exact column headers that I defined. Route them to either an "inprogress" or "rejected" folder based on the result. I’m using…
-
Dashboard doesn't refresh in time before it's exported through a scenario
Hi - Please see below my goal, methodology, and the challenge I am facing. Thanks in advance for your help. GOAL: I want to get the export of my dashboard every Monday morning with the most up-to-date data. METHODOLOGY: I set up a scenario that is time triggered to run on Monday 8AM. First step of the scenario is to build…
-
Dark Mode
Every developer needs a dark mode. A dark theme for the flow, datasets, and recipe configs would go a long way toward making Dataiku fit into workflows that involve many other dark mode tools. Dataiku is definitely very bright when swapping from other tools which operate in dark mode. Extensions like Dark Reader do a pretty…
-
Recipe execution takes a long time due to a large volume of data in Dataiku
We are experiencing long execution times for a recipe in Dataiku due to handling large datasets. Although we have implemented partitioning using a filter on a specific column, it still takes 1.5-2 hours to partition 30M records. Is there a more efficient way to handle and process this data quickly and effectively because…
-
Ask for resolution about "java.util.Properties org.eclipse.jetty.util.log.Log.getProperties()"
Hello. I am following the academy curriculum. In the machine learning section, I cannot proceed because of the following error message: 'java.util.Properties org.eclipse.jetty.util.log.Log.getProperties()'. I would appreciate it if you could provide a solution. Operating system used: Machine Learning Quick Start
-
Weird behavior in Datetime
I am reading an Excel file (.xlsx) which contains a column with date and time values stored in a custom format (mmm d, yyyy h:mm:ss AM/PM). When the file is uploaded, Dataiku automatically subtracts one second from the value. This does not happen for every row, only for seemingly random ones. E.g. Input Value - Jan 31,2025 10:00:00 AM Dataiku read…
-
Bug when exporting dataiku dashboard
Hi Dataiku specialist team, When I export a Dataiku dashboard (to PDF) that contains embedded web content or a webapp that contains maps, that particular tile is either ignored or shown as a broken link. I already configured the tile setting to "delay image capture by some seconds" but that didn't fix the…
-
Using Modern Angular for Custom UI settings in Dataiku Plugins?
I'm developing custom forms for Dataiku plugins and noticed the documentation shows examples using AngularJS (Angular 1.x) with controllers and $scope. Since AngularJS has reached end-of-life, I'm wondering if it's possible to use modern Angular (2+) with component-based architecture for these custom plugin interfaces…
-
Unable to Create Evaluation Store in Time Series Forecasting (DSS 12.5.2)
Hello, I am currently using Dataiku DSS version 12.5.2 and working with the Time Series Forecasting feature. After completing the model training, I attempted to run an evaluation, but I noticed that the Evaluation Store is not being created. I’ve confirmed that when using the AutoML Prediction functionality under Visual…
-
Check if SharePoint Excel file column has unique values, then store data, else email user.
I am trying to develop a mechanism that retrieves SharePoint Excel files from the library one by one once the user uploads them, and checks whether a duplicate value is present in one column (SKU). If yes, email the user "Duplicate values found in SKU, please check."; otherwise store the dataset. Thanks.
-
How to convert the number "35.048,00-" to read as "-35,048.00"
I want the sales amounts to be read as: -35,048.00 2925.00 4000.00 etc
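A minimal sketch in plain Python of one way to do this conversion, assuming the source values use "." as the thousands separator, "," as the decimal separator, and an optional trailing "-" for negatives (as in "35.048,00-"). The function names `parse_trailing_minus` and `format_amount` are hypothetical helpers, not Dataiku built-ins; the same logic could be placed in a Python recipe or a Python step.

```python
def parse_trailing_minus(text):
    """Convert e.g. '35.048,00-' to -35048.0 (assumes European-style input)."""
    cleaned = text.strip()
    negative = cleaned.endswith("-")
    if negative:
        cleaned = cleaned[:-1].strip()
    # Drop the thousands separators, then turn the decimal comma into a point.
    cleaned = cleaned.replace(".", "").replace(",", ".")
    value = float(cleaned)
    return -value if negative else value


def format_amount(value):
    """Render with ',' as thousands separator and two decimals."""
    return f"{value:,.2f}"


print(format_amount(parse_trailing_minus("35.048,00-")))  # -35,048.00
```

If the column should stay numeric for later calculations, keeping only the parsed float and leaving the display format to the chart or export settings is probably the simpler choice.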
-
I am trying to pass a parameter from a dash web app to a scenario to use in a flow.
This is what I have tried: trigger_fire = scenario.run() scenario.set_scenario_variables(run_id=run_trigger.id, File = "your_value") scenario_run = trigger_fire.wait_for_scenario_run() but I am getting this error: 'DSSScenario' object has no attribute 'set_scenario_variables'
-
Azure AI Search Configuration
I have a problem using Azure AI Search in a knowledge bank. When setting up the connection, there is no menu to grant access permission. Has anyone had the same problem?
-
Update and Insert into existing Oracle table
Hi all, I have two tasks to do. 1. I want to update an existing table in Oracle; is there any option to do that? 2. Then I want to insert into an existing table in Oracle. I saw the "append instead of overwrite" option, but how do I specify the existing table as the output? Please guide me or suggest a way. Thanks in advance
-
Provide ability to export Insights to images in Scenario Steps and the Python API
Currently only Dashboards can be exported to images in Scenario Steps (Export Dashboard step). While there is an export option in the GUI to export Insights to images this is not possible to do via Scenario Steps nor the Python API. So please add support for this. And also extend the Python API to allow Dashboard exports…
-
Per-user folders setting is not working.
According to the following documentation, when a user creates a project, a subfolder based on the user ID is created: Project folders - defaults and access (Dataiku DSS 13 documentation). When I tried this in my environment, the project was created but the subfolder was not. Could you tell me which permissions are required? Permissions of the user: Create projects: yes; Write in root project folder: no. Location: under Project/Groupe_Folder…
-
how to add in flow
How do I drop NA values in all columns in Dataiku? Operating system used: Windows
-
Update Records In Dataiku
Hi Folks, I'm trying to find a way in Dataiku to update specific records in a table based off of specified criteria. Some context: I process data on a daily basis that needs to be transformed to a workable state for a multitude of different reports. There are user-generated date values inside of the report that are invalid…
-
Upsert of daily data in Dataiku DSS
I work on data that is fed daily, and I am trying to find a way in Dataiku to do an upsert, that is, to update existing records and insert new records at the same time. What are the different ways to do this, please?
-
Setting up Stages in Snowflake to work with Dataiku
In Dataiku DSS when working with Snowflake there is an option to use a stage. This apparently speeds up performance by increasing the number of different types of processes one can do inside Snowflake without having to ship data back to the DSS server for processing. Are folks using this feature? What has your experience…
-
List managed folders from project
Currently, the only way to view which managed folders are associated with a project is to check the flow. However, on large projects, the flow is too large to load. (On my project of just 7,000 datasets, the flow crashes the browser tab). Datasets and recipes can be listed in the datasets and recipes pages, but managed…
-
Avoid data trim
Hello, When importing a CSV file, the data are trimmed. Raw data must not be truncated or trimmed during the import process. I've attached a simple example CSV file for testing and illustration. Does anyone have a solution to avoid this? Operating system used: Windows
-
How to display value labels in a chart of type Line
As in the title: how do I display value labels in a chart (chart type is Line)? Operating system used: windows 10
-
Time Series training hyperparameter optimization
How do I read this chart? I used seasonal trend with a range setting for the seasonality. Does this mean that when the seasonality setting is 30, the MASE is the smallest? So, for the next iteration of training, should I set the seasonality to 30? Operating system used: Windows
-
Is there any documentation on how to implement an autoencoder for clustering within Dataiku?
Can you provide some examples of implementing autoencoders for detecting patterns in data without any labels? Operating system used: v12.6.3
-
Guidance Needed: Building a SQL Chatbot in Dataiku
Dear Dataiku Experts, I’m working on a Dataiku project where I want to enable natural language interaction with a production SQL database — essentially, a chatbot I call the “Production Analyst.” The idea is that a user can type queries in natural language, and the chatbot will interpret them to perform production…
-
Custom recipe: Selected values not save for values from API
Hi, In a plugin I cannot save the selected values for fields in recipe.json when the values come from an API. Below is the user's description: “these values don’t persist once you leave the recipe screen and return. As an example, you can see after we add values for multiple other values, and then you can see in the upper right I saved. Then…
-
Custom recipe: Default value does not work for values from API
In a plugin I cannot see the default value for a field in recipe.json when the values come from an API. In recipe.json I have this field: { "name": "field", "label": "Field", "type": "SELECT", "getChoicesFromPython": true, "mandatory": true, "defaultValue": "Default", }, In the above code the default value is set to "Default", but it is not set in…
-
Comments in Formula
User Story: As a creator of formulas in Dataiku, I would like to be able to add comments in formulas, this would allow me to leave information in formulas about why formulas are configured the way that they are, increasing trust and communications, and it would allow the ability to "comment out" chunks of code while…
-
Way to determine Alteryx workflow inventory list
Is there any automated way to count the Alteryx workflows on users' systems, so that we get the entire inventory to migrate to Dataiku? There are 50 users, so contacting each one individually is not feasible.
-
how to plot a chart by row
I have a sample dataset with 5 columns shown below: In dataiku how can I plot by each row, to get result like below chart in excel? Right now I can only plot something like below chart, which is NOT what I want ( I need 1M, 3M, 6M and 1Y to appear on x-axis): Could someone help? Many thanks!
-
[Application Designer]How to display custom variables according to a dynamic value
I'm trying an application where I have some custom variables to edit. One of them is a dropdown menu that can be either "local" or "remote". According to this value I want different fields in custom variables to be displayed. For example for "local" we will only have fields "user" "path" and "fileType". For "remote" there…
-
Github Copilot on Code Studio
Hi everyone ! I tried my best to find an answer to this and I'm sorry if it's a duplicated question. Is there some kind of workaround to get Github Copilot working on the code studios or a way to link local VS Code to the code-server ?
-
How to call AI agent with python code?
Hello, everybody. I have already built a visual agent. I want to build a Dash webapp "calling" that visual agent. I saw this tutorial, but here tools are in a utils.py file. But I have created my tools as plugins (like mentioned here). I used these tools creating a visual agent. Is it possible to call my visual agent in a…
-
Dataset to Microsoft teams
I have been sending automated datasets by email in scenarios using mail reporters. How do I send automated datasets to Microsoft Teams using the Microsoft Teams reporter? Thanks. Operating system used: Windows
-
Using data from a dataset in a Send Message scenario step. (Scenario Variables?)
I've built a project that determines a set of upcoming worker hires and sends an email (with an Excel attachment) to a group that begins working their access needs. I want to make the body of the email dynamic by adding a summary of the count of workers and the range of start dates. I have that data in a separate, single…
-
How to execute a recipe after an empty dataset?
Is there any possible way of checking the readiness of a dataset? I have a dataset that might be empty after a Hive query. It shouldn't be a problem, but since it is (I cannot use it in a left join...) I decided to build another dataset that would contain either the result if it exists or a dummy line if it does not. All this…
-
Can Dataiku prompt engineering use the Claude Sonnet 3.7 model now?
That would be nice.
-
Can the hyper-parameter change for each new training model with each new dataset
Dear Dataikers, thanks for reading my question. My problem is that when I re-train my model with a different dataset (for example, my first dataset is from 12/10/2024 to 12/10/2025 and my second dataset is from 30/11/2024 to 30/11/2025) and then deploy the second model, I check the hyperparameters of each version and I see all of them…
-
Scenario questions&ideas
I want to modify the scenarios we use in production. As a developer, I don’t have access to the automation (prod) node, so let me explain the case: We have 8 different scenarios, each corresponding to a specific DOE type—for example, data ingestion, transportation calculation, dashboard output, etc. The first scenario is…
-
Append a pandas dataframe to an already existing Dataset within a plugin
I'm creating a custom plugin containing a recipe that evaluates a machine learning model and outputs a DSS Dataset with performance metrics (it is very similar to the in-built Evaluate recipe). However, each time I train the model, I would like to append the new performance record to the already-existing Dataset rather…
-
Error while deploying to the Automation node
I am trying to deploy my project to the automation node in my dev1 or testing environment, but I am getting this error. I am not using any API; there is only a scenario with Python code, in which I use this code: client = dataiku.api_client() project_key = dataiku.default_project_key() project = client.get_project(project_key) An error occurred on…
-
Allow datasets to automatically reload schema when jobs run.
Currently, if columns in a dataset source are added or removed, jobs and scenarios that read from that dataset will fail until you reload the schema from the table, even if nothing downstream depends on the changed columns. We would like to see a setting to allow datasets to always reload schema when…
-
Run sql recipe with partition dataset
Hello, I need to write a SQL recipe that builds a dataset on a specific partition set as a project variable. In my flow I have dozens of recipes that work this way: I enter the partition ${execution_date} (next to the Run button) and it works. For the SQL recipe, when I give the argument 2022-01-20 the query succeeds, but when…
-
jupyter-run data directory
Hello, some of my jupyter-run directories of single notebooks have a huge size. What exactly is stored in these files? Is it an active session of a Jupyter notebook? Some of these directories are very large for notebooks I haven't used for months. I also unloaded the session for these notebooks. I just found the description…
-
How to visualize project variables in dashboard?
I want to add a dynamic time label on a monthly dashboard, like Jan-25, Feb-25, which is already stored in project variables. How can I choose the proper tile, as below, and configure it in the slide? Operating system used: Windows
-
How to Automatically Create an Up-to-Date Dataset from Data Quality Rules ?
Hi everyone, I'm working on a project where I've applied several Data Quality rules to a dataset (MAST_prepared). Using the Data Quality tab, I clicked "Create dataset from rules data", which generated a new dataset (MAST_prepared_rules). 👉 Issue: this dataset is static. It does not update automatically when I rebuild the…
-
How to execute a MS-SQL stored procedure in Dataiku
Not a question but an answer, as I couldn't find any relevant posts. I solved this problem using a SQLExecutor2 in a Python recipe: from dataiku import SQLExecutor2 executor = SQLExecutor2(connection="connection name") sql_str = """Execute sp_name 'param1','param2', 'param3'""" output_df = executor.query_to_df(sql_str,…
-
Custom script for Container execution to push the base images
I'm running into an error ''' Error while checking kubernetes logs, returned 1 Error from server (BadRequest): container "c" in pod "dataiku-exec-python-afmkmmh" is waiting to start: image can't be pulled ''' The main reason here is because of the access issue for pushing the image to the internal repository. I was…
-
Ability to choose input data set for copied and pasted subflows
I often have to copy a portion of a flow to use in a different section. Having the ability to define my input data would make things more efficient and eliminate some human error. In the use case I have, I want to copy the portion circled in red and paste it to where the green circle is, but I don't want it to branch off…
-
How to update data from database and run other processes using Scenario and pipeline with SQL DB con
Hi, I have a connection to an Oracle database and need to run a Dataiku pipeline periodically. On each run, new data should be loaded from the database (by simply rerunning the query) and the result sent on for further processing. I want to use the built-in Connection option. Operating system used: Windows
-
Read a file, outside the API folder, from a DSS API
Hello, We would like to implement a DSS API with a python function which reads some data files stored in a distant server (not the DSS API Node). This data file name will be an input parameter of the API. The server, where the data file is stored, is known. The data files are maintained by business users, this is why we…
-
Issue with Dropdown Not Showing Options in Dataiku Application Designer
Hi everyone, I'm working on a Dataiku Application Designer project where I need a dropdown to let users select between two dataset options: dataset1 and dataset2. I tried adding the following JSON in the Edit Parameters section: [ { "name": "dataset", "type": "SELECT", "mandatory": true, "canSelectForeign": false,…
-
How to create recipe using create_recipe function from Dataiku Python API?
Hello, I tried to use the Dataiku Python API to create recipes given both .json and .shaker files. I load the .json file and use it in the recipe_proto argument of the create_recipe function (https://doc.dataiku.com/dss/latest/python-api/projects.html). Similarly, I load the .shaker file and use it in the creation_settings argument of…
-
Can't register free edition
I’m having trouble registering a free edition account for Dataiku 13. Every time I try, I’m redirected to my.dataiku.com, where I can’t proceed because I don’t have an existing account. It feels like a logical loop—I’m required to have an account in order to create one. Operating system used: Linux
-
How to set up an Athena connection using an S3 connection
Hello, I've already set up an S3 connection which is working perfectly. I'd like to use it to set up an Athena connection. My S3 connection uses the sts-assume-role type. I use Dataiku Cloud Stacks (AWS). Can you tell me why things do not work? Do I need specific policies in the roles for S3 and in the instance profile? I get the error shown in the screenshots.
-
Exporting to Windows Network Drive Folder Location
How can I configure an output tool similar to Alteryx, where my data exports as an Excel file to a folder on a Windows network drive? Right now, I'm only seeing output to SharePoint or folders within Dataiku as options.
-
Dataiku Scenario: how to control the scenario steps using a variable?
Hi everyone, I have designed my dataset so that it always has a single value, either 'true' or 'false'. In my Dataiku scenario, I want to control the flow based on this output. If the dataset contains 'true', then the next step (building datasets in the project) should proceed, and an email notification needs to be triggered if…
-
Divide a column by another one
Hi, I am supposed to get the GDP per capita. I have a column GDP and a column population. How can I divide GDP by population?
-
Managed-datasets Metadata Synchronization Across Multiple DSS Instances
Use Case As an organization, we utilize three distinct DSS instances to manage our data analytic and ML workflows: * Self-Service and Data Products Consumption Instance: For end-users to consume data products, and work independently by having access to curated data. * Design and Development Instance: For designing and…
-
Send dataset to Teams message
Right now I use Reporter + Teams Workflows to send a scenario-finished report to a Teams chat. I see that in a scenario step we can send an email that includes a dataset as an HTML variable. It can also keep the color rules of the Dataiku table. I'm wondering if this is possible via Reporter? My problem is I can't send to…
-
Dataiku flow monitoring dashboard
Please help me create a live visual dashboard in Power BI where we can see whether Dataiku flows ran successfully or not, i.e. basic project monitoring and tracking of jobs. I am new to Dataiku, so I am not sure how to export the job status data to Power BI or SQL.
-
External data catalog integration
Hi everyone, I'm looking for a way to integrate Dataiku with a standalone data catalog tool, for example DataHub. This stems from the fact that some initial data loading and transformation happens inside the DWH through an orchestration tool like Airflow and a transformation tool like dbt. This creates initial datasets that are…
-
How to create my own custom recipe?
Hello Team, Hope you are doing good. I am using Dataiku version 13.4.2 and have written a Python script. I want to create a custom recipe (similar to built-in recipes like Sort, Prepare, or Embed) so that anyone can drag, drop, and use it easily. However, I do not have admin-level access to create a plugin. How can I…
-
Using Spark in DSS to process CSV files
Hi! I'm currently testing out the possibilities for leveraging Spark in our ETL pipelines. My use case is that in 90% of cases I start with just raw text files in Azure Blob Storage (usually CSV or TXT files). How can I plug in Spark to read and process those files? If I select the Spark engine or run a custom Spark…
-
Is there a way to add a calculated column in Dataiku?
I have a date column; some of the fields are empty in this column. Is there a way to "count" if there is a value in the field for that row and place into a new column? Something like below: Operating system used: Windows 11
-
ORDER BY in SQL recipe does nothing, yet it works in SQL notebook
My team uses Dataiku to write flows consisting mostly of SQL recipes with data stored on an Amazon Redshift server. I have noticed the odd behavior that ORDER BY clauses in SQL recipes do not seem to have any effect on the resulting Dataiku dataset. They do, however, still work inside SQL notebooks, which suggests that…
-
Copying a zone keeps the name of the zone managed folders
Hi, I am currently using DSS version 13.4.0. When I copy a flow zone A, the names of the managed folders inside zone A are the same as the ones in the copied zone B, though their hash ids differ. The hash id I am talking about is obtained in Python with dataiku.Folder("ProjectKey.NameOfFolder").get_id() It is…
-
Can a project variable be used as a scenario variable?
Hi, I have a SQL query, and when it returns no data I need to be able to change the email content of my scenario reporter. I am currently setting a project variable and have a couple of questions: 1. Can I use the project variable to conditionally change which email reporter to use? 2. If no to #1, what is the best way…
-
Dataiku compare data between two columns
We have two datasets that I uploaded from my local machine. I join them on a key and merge the columns. Then, to compare two column values, I use a formula like: if(colA=ColB, "Match", "No Match") Even though some column values are exactly the same, the result still shows No Match. How to fix the…
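One common cause of this symptom, though not confirmed for this dataset, is invisible differences such as trailing spaces, non-breaking spaces, or a number stored as text on one side. A minimal sketch in plain Python, assuming that is the cause; `normalize` and `compare` are hypothetical helper names, and the cleanup rules are only one reasonable choice.

```python
import unicodedata


def normalize(value):
    """Return a comparable text form of a cell value (assumed cleanup rules)."""
    if value is None:
        return ""
    # NFKC folds compatibility characters, e.g. a non-breaking space becomes a space.
    text = unicodedata.normalize("NFKC", str(value))
    # Trim the ends and collapse internal runs of whitespace to single spaces.
    return " ".join(text.split())


def compare(col_a, col_b):
    """Mimic the Match / No Match logic on normalized values."""
    return "Match" if normalize(col_a) == normalize(col_b) else "No Match"


print(compare("ACME Corp ", "ACME\u00a0Corp"))  # Match
```

Applying the same cleanup to both columns before the comparison, for instance by trimming them in an earlier step, would show quickly whether hidden whitespace is the culprit.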
-
Configuring Code env at project via Dataikuapi
Hi, I'm trying to configure code env programmatically via dataikuapi using the below mentioned commands and I have trouble setting it up. project_settings = Eng_project.get_settings() project_settings.set_code_env(code_env_name) project_settings.save() I'm getting this issue. AttributeError: 'DSSProjectSettings' object has…
-
How to pass the variables values from dash Webapp to the main project
My project has local variables. I created a webapp using Dash for the project to render a UI dropdown. I can see the layout, but the selected values are not reflected in the project, so I still need to hard-code the local variables. I have a Python recipe in my main Dataiku project working successfully with a local variable in JSON format…
-
Ability to zip files from one folder to another
A business user in my team is trying to upload daily pulls to an SFTP. These files are created by separate Snowflake queries, then merged using a Merge Folder recipe. The business user would like to be able to zip these files into a single folder before uploading to SFTP (3rd party requirement). Currently they are using a…
-
Error in running sync recipe
Hi, I am running a sync recipe from an MSSQL dataset to a custom managed Python dataset. The sync recipe has the option to append the data instead of overwriting it. While running the recipe, I get the error below: Job failed: Unsupported APPEND mode for Custom python datasets An invalid argument has been…
-
DSS and Ubuntu compatibility
Hi, is there an overview of which DSS versions support which Ubuntu versions? We are currently working with DSS 12.3.1 and Ubuntu 20.04 but Ubuntu 20.04 is almost EOL so we want to upgrade to Ubuntu 22.04. Is that possible with our current DSS version? We are planning on upgrading to DSS 13 soon but want to take some…
-
Passed Core Designer Certificate exam but I have not received any email with link to certification
Hello, I passed the Core Designer Certificate exam on 29th March, but as of today I have not received any email with a link to the certification, nor is it showing under my academy profile. Please see attached. How much time does it take to issue the certificate? Regards, Shiraz butt
-
Does Dataiku support multi-node GPU environment for LLM features (finetune, serving, etc.)?
Q1 Does Dataiku support multi-node GPU environment for LLM features (finetune, serving, etc.)? If so, how should it be set up? I want to know if it is supported in a container execution environment and with a local GPU. Q2 Also, when I run local LLM as a container execution, the pod runs, responds, and disappears…
-
Error when trying to use Dataiku Python APIs in API endpoint (API designer)
I have created an API endpoint of type Python code. Inside, I want to run a scenario: the idea is that when an API request is received, a scenario run is triggered. The problem is that if I run this example code: import dataiku def api_py_function(): client = dataiku.api_client() project = client.list_project_keys() return…
-
Closed nodes
I have been working with Dataiku for a while thanks to my teacher's invitation link. My problem is that I cannot create a new project because the design, automation, and API nodes are stopped.
-
Issue when I import module
Hello everyone, First of all, I've already checked the "__init__.py" file in the directory and added the Python path in external-libraries.json. The problem occurs even when I'm using a notebook provided by Dataiku with one of the tutorials (so that must work? That is why I suppose that I did something wrong…
-
Decimal point restriction
How do I restrict values to at most 4 decimal places in all columns of double/decimal type in Dataiku?
-
[WEBAPP] How to run a scenario with a local variable
I have a Python recipe in my scenario working with a local variable. I have successfully executed my scenario from within the webapp, but how can I change the local variable from the webapp's code (in the Python back-end)?
-
Multiple Tabs Within a Project
Hi, My name is Yusuf Afolabi, I work for Caterpillar as a data scientist. I use Flow Zones a lot and they have been very helpful. Recently, I have seen situations where navigating to a specific Flow Zone becomes problematic. Think of having like 10 different Flow Zones in a project: you would have to scroll back and…
-
How to chart time series longer than 20 data points?
I'm creating a dashboard for reporting abnormal data series, looking at the past 90 days of sales. I want to create a line chart of the past 90 days of sales to monitor spikes and abnormalities. But the line chart only shows about 20 data points and lumps all the rest into an "other" category. Is this the default setting of chart…
-
Can't get license
Hi, I have to take this ML cert as it is a requirement from my company. I have tried refreshing the page and disconnecting and reconnecting my network, but there is still an error, as shown below. I hope I can get clarification on this. Operating system used: Windows
-
Disk storage usage info in logs?
I am new to Dataiku, but loving the Resource Usage Monitor project that enables easy viz of compute and processing resources. However, I cannot find how to integrate storage usage into this project from the logs. I found this associated post from a few years ago which directed to use another solution other than Dataiku. Is…
-
Programmatically configuring Code env within Python Recipe for executing a scenario
Is there a way within Dataiku to programmatically configure the code env within a Python recipe for executing a scenario? This documentation doesn't describe that functionality. Operating system used: Windows
-
How Do I filter the content of a managed folder and output to another managed folder?
This is a question I've asked myself and solved with a little Python code so I thought I'd share. I had a folder with several subfolders, each containing a JPEG for each page of the original PDF (for context this folder is the output of the Greyscale recipe from our Text Extraction plugin). I really only want to parse…
-
How to set project variables inside process running on Thread?
Hi there, I am trying to set some project variable using set_variable() method of Dataiku API. But it somehow only runs once. Below is my DASH webapp code snippet for reference - import threading import dataiku import time def start_execution_publish_to_pbi(config_id): try: project = dataiku.Project() variables =…