-
Using Charts in Dataiku for Visualization
Hello! I am facing following issues while visualizing data on Charts tab in Dataiku - 1. Can we display Comma Separated Values (like 1,234,567) instead of 1.2M or 1234k in pivot table? 2. Can we edit Y- Axis Title of a Bar Chart? 3. There is the option for colouring the table cells in the pivot chart, but it considers all…
-
SHAP (Shapley values) in Dataiku
I am still new to Dataiku and am learning about its capabilities and am wondering if there is a way to replicate the global interpretability of the SHAP algorithm in Dataiku. I am familiar with SHAP in Python and have used it to show the positive and negative relationships of predictors with the target variable across all…
-
Monthly Partitioning changes partition column value
I am trying to setup monthly partitioning on a date column in my snowflake database. I have the source table and output dataset set as monthly partitioning. In the middle I have a prepare recipe where I use the time range to get a month (screenshot below), the output of the posting_date field changes from an actual date,…
-
Flagging Duplicate Rows
Scenario: I have a dataset named "XYZ". It has 5 columns: ID, A, B, C, D. I have uploaded this dataset to dataiku. Now i want to create a new column named "DUPLICATE_FLAG" that will have value of 1 if a duplicate record exists or 0 if does not exist. Rows will be compared and values will be added if everything except ID…
-
How to download a folder dataiku with python code ?
I want to download few folders from dataiku filesystem dataset but in python code. Can i get a hint or something that i can download these folders ?
-
RAM and CPU
When Dataiku execute processes in memory, what resources are used? : 1.-RAM on the server 2.- or CPU resources Is there a way to know these metrics?
-
Error while using get_predictor method
I am facing an error while using the get_predictor. import dataikufrom dataiku import pandasutils as pduimport pandas as pd # Example: load a DSS dataset as a Pandas dataframemodel = dataiku.Model("model_id") model.get_info()model.get_predictor() I get this error.…
-
Prediction on a saved model from python API endpoint
I am trying to predict a value from a python APi endpoint as per the below code. I have a saved model in DSS after training on Dataiku and need to predict by sending single row of data to the API. Is this the right way? client = dataikuapi.DSSClient(host, apiKey)client._session.verify = Falseproject =…
-
'Edit in notebook' in python recipe on partitioned dataset
For an application we have a high priority on transparency and understandability of the code. For that reason we use the 'Edit in notebook' function on our python recipes so that new team members and others can easily skim through the code as if a notebook and study the outputs of each step. However, on a partitioned…
-
Scenarios to Run Code Recipe that appends to data set.
How do I set up a Scenario to produce cumulative results over many runs of a Python Code Recipie? I have a very simple Flow with a Python Recipe and 1 SQL based dataset. I'm tracking the results over time in the WalkupResults Table which is stored in PostgreSQL. I want long term cumulative results in this table so for the…