-
RAM and CPU
When Dataiku executes processes in memory, which resources are used: 1. RAM on the server, or 2. CPU resources? Is there a way to see these metrics?
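In-memory work generally consumes both: the data sits in RAM while the CPU does the computation. As a rough illustration only, the sketch below measures both for a single Python function using the standard library. It is not a Dataiku API; `measure` and `build_squares` are hypothetical names, and `tracemalloc` only tracks memory allocated by Python itself, not the whole process.

```python
import time
import tracemalloc


def measure(func, *args, **kwargs):
    """Run func and return (result, CPU seconds, peak Python-level memory in bytes)."""
    tracemalloc.start()
    cpu_start = time.process_time()
    try:
        result = func(*args, **kwargs)
        cpu_seconds = time.process_time() - cpu_start
        # get_traced_memory() returns (current, peak) in bytes.
        _, peak_bytes = tracemalloc.get_traced_memory()
    finally:
        tracemalloc.stop()
    return result, cpu_seconds, peak_bytes


def build_squares(n):
    # Hypothetical workload: builds a list entirely in memory.
    return [i * i for i in range(n)]


result, cpu_seconds, peak_bytes = measure(build_squares, 100_000)
print(f"CPU time: {cpu_seconds:.4f} s, peak memory: {peak_bytes / 1024:.0f} KiB")
```

The same wrapper could be placed around the body of a code recipe to get a first impression of where the cost lies, assuming the work is done in Python.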
-
Error while using get_predictor method
I am facing an error while using get_predictor:
    import dataiku
    from dataiku import pandasutils as pdu
    import pandas as pd
    # Example: load a DSS dataset as a Pandas dataframe
    model = dataiku.Model("model_id")
    model.get_info()
    model.get_predictor()
I get this error.…
-
Prediction on a saved model from Python API endpoint
I am trying to predict a value from a Python API endpoint with the code below. I have a saved model in DSS after training on Dataiku and need to predict by sending a single row of data to the API. Is this the right way?
    client = dataikuapi.DSSClient(host, apiKey)
    client._session.verify = False
    project =…
-
'Edit in notebook' in python recipe on partitioned dataset
For an application we place a high priority on the transparency and understandability of the code. For that reason we use the 'Edit in notebook' function on our Python recipes, so that new team members and others can easily skim through the code as if it were a notebook and study the outputs of each step. However, on a partitioned…
-
Scenarios to Run a Code Recipe that appends to a dataset
How do I set up a Scenario to produce cumulative results over many runs of a Python Code Recipe? I have a very simple Flow with a Python Recipe and 1 SQL-based dataset. I'm tracking the results over time in the WalkupResults table, which is stored in PostgreSQL. I want long-term cumulative results in this table so for the…
-
Dataiku Scenario
I need to run a scenario from within the same scenario (recursion or self-call). It is basically running the scenario in a loop until a condition fails or is satisfied. Please help.
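One way to picture the loop described here, as a plain Python sketch rather than a Dataiku feature: call a step repeatedly until a stop condition holds, with a cap on the number of runs so it cannot loop forever. `run_until`, `run_once`, `should_stop`, and `fake_run` are hypothetical names; `fake_run` stands in for "run the scenario once and return a value to test".

```python
def run_until(run_once, should_stop, max_runs=10):
    """Call run_once repeatedly until should_stop(result) is true or max_runs is reached."""
    results = []
    for _ in range(max_runs):
        result = run_once()
        results.append(result)
        if should_stop(result):
            break
    return results


# Hypothetical stand-in for a single scenario run: returns 1, 2, 3, ...
counter = {"n": 0}


def fake_run():
    counter["n"] += 1
    return counter["n"]


history = run_until(fake_run, lambda value: value >= 3)
print(history)  # [1, 2, 3]
```

An explicit loop with a `max_runs` cap is used instead of true recursion so that a condition that never becomes true still terminates.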
-
Metrics and Checks
Hey, hope y'all are doing great. I have a quick question regarding metrics and checks. I have a table and I want to do basic checks such as duplication of primary keys, duplication of records, missing values in a particular column, etc. I used metrics to compute the count of all of these along with checks and have them…
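The three counts mentioned in this post can be sketched in plain Python as follows. This is an illustration of the logic only, not Dataiku's metrics engine; `quality_counts`, the column names `id` and `name`, and the sample rows are all made up for the example.

```python
from collections import Counter


def quality_counts(rows, key_column, required_column):
    """Count duplicated keys, duplicated records, and missing values in a list of dict rows."""
    key_counts = Counter(row[key_column] for row in rows)
    # A record is identified by all of its (column, value) pairs.
    record_counts = Counter(tuple(sorted(row.items())) for row in rows)
    return {
        # Number of distinct key values that appear more than once.
        "duplicate_keys": sum(1 for c in key_counts.values() if c > 1),
        # Number of extra copies of fully identical rows.
        "duplicate_records": sum(c - 1 for c in record_counts.values() if c > 1),
        # Rows where the required column is None or an empty string.
        "missing_values": sum(
            1 for row in rows if row.get(required_column) in (None, "")
        ),
    }


# Made-up sample rows for illustration.
rows = [
    {"id": 1, "name": "a"},
    {"id": 1, "name": "a"},
    {"id": 2, "name": None},
    {"id": 3, "name": "c"},
    {"id": 3, "name": "d"},
]
counts = quality_counts(rows, "id", "name")
print(counts)  # {'duplicate_keys': 2, 'duplicate_records': 1, 'missing_values': 1}
```

A check would then compare each count against a threshold, typically zero.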
-
Data Quality Checks in a flow
Hi everyone, I have a quick question: I am importing datasets from Snowflake in a proper flow, i.e., source -> warehouse -> data mart. 1) I can either create them again in Dataiku (double work) OR is there a way for this whole staging process to run in a proper sequential manner within Dataiku? 2) In this whole flow i…
-
Retrieve data based on tree structure inside a column
Hi, I'm trying to create a new column, "Parent", based on the 2 columns X and Y. It's kind of a basic tree structure in Y. 2 is the daughter of 1 and so on, as in the example below. In the "Parent", we use the Y column, but with the X value. I really don't know how to do that, I tried to do it with a formula/windows recipe…
-
Show only 1 Labor value per project.
Hi Team, I am trying to achieve this result by joining and then applying the window function, so that Labor appears in only 1 record per project instead of in all records. Attached are the data and the current output vs. the desired output. Please help me derive the solution. Operating system used: Browser
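Since the attachment is not visible here, the following is only a guess at the intended logic: keep the Labor value on the first row of each project and blank it on the rest. It is a plain Python sketch, not a Dataiku recipe; `keep_first_labor`, the column names `project`, `task`, and `labor`, and the sample rows are hypothetical.

```python
def keep_first_labor(rows, project_key="project", labor_key="labor"):
    """Return copies of rows where labor is kept only on the first row of each project."""
    seen = set()
    output = []
    for row in rows:
        new_row = dict(row)  # copy so the input rows are not modified
        if row[project_key] in seen:
            new_row[labor_key] = None
        else:
            seen.add(row[project_key])
        output.append(new_row)
    return output


# Made-up rows, as they might look after a join repeats labor on every record.
joined = [
    {"project": "P1", "task": "design", "labor": 100},
    {"project": "P1", "task": "build", "labor": 100},
    {"project": "P2", "task": "design", "labor": 40},
    {"project": "P1", "task": "test", "labor": 100},
]
deduplicated = keep_first_labor(joined)
print([row["labor"] for row in deduplicated])  # [100, None, 40, None]
```

In a window-based approach, the equivalent would be to rank rows within each project and keep Labor only where the rank is 1.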