-
Record unmatched rows from two datasets (without code recipe)
I would like to compare Dataset 1 and Dataset 2 (see the sample below) and record all the differences in Dataset 3. I have been exploring the functions in the Join recipe but have had no luck so far. Note that in Dataset 1 the ID field (join key) is unsorted; I could manually sort it in the Prepare recipe, but I would like to…
-
Visual recipe switch?
In visual recipes, is there an existing function that acts as a switch to turn a recipe on or off? For example, I can create table 2 from table 1 using a Prepare recipe by changing the values in column B based on a condition in column C, and table 2 will be used in a downstream process for validation. Now I would like to…
-
Does Schema propagation work on existing columns in a prepare recipe
Hi all, I have a question about whether schema propagation works in the Prepare recipe. For example, if there is a column in the input dataset with a datatype of BIGINT, then during the first run the output dataset will also have the datatype BIGINT. But during a future run, if the datatype of the column in the input dataset…
-
Connect Dataiku to Postgres
Hello Dataiku Team, I made a connection between a Postgres RDS instance and Dataiku, and it worked fine. I then decided to back up the RDS database to a new Ubuntu virtual machine on EC2 and restored it to the Postgres instance installed on that machine. Before restoring, I created a new user and database to restore into. I made the new connection…
-
Not In Function when matching values of 2 datasets
Hello, I'm a very new user of Dataiku, but experienced in writing SQL. I have a use case where I need to match the IDs of Table A to the IDs of Table B and exclude the records in Table A that match Table B. In SQL, I would usually just do a Where Table A.ID NOT IN Table B.ID, but in Dataiku, I don't see any…
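As a rough illustration of the NOT IN logic described in this question, here is a minimal plain-Python sketch of an anti-join. It is not a Dataiku API: the `anti_join` helper, the sample rows, and the `ID` column name are all hypothetical and chosen only for illustration.

```python
def anti_join(table_a, table_b, key="ID"):
    """Return rows of table_a whose key value does not appear in table_b."""
    # Collect the keys of table_b once so each membership check is cheap.
    b_keys = {row[key] for row in table_b}
    return [row for row in table_a if row[key] not in b_keys]


# Hypothetical sample rows, for illustration only.
table_a = [{"ID": 1, "name": "a"}, {"ID": 2, "name": "b"}, {"ID": 3, "name": "c"}]
table_b = [{"ID": 2}, {"ID": 4}]

# Rows of Table A with no matching ID in Table B.
unmatched = anti_join(table_a, table_b)
```

The same idea is often expressed in SQL as a left join that keeps only the rows where the right-hand key is NULL.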
-
Reference global variables in python recipe
In the Global Variables section, I created a numeric global variable: "global_var": 1. I would like to reference this variable in a Python recipe:
import dataiku
from dataiku import pandasutils as pdu
project = dataiku.api_client().get_project("my_project")
projectVars = project.get_variables()
global_var =…
-
Invalid argument when using IF statement on global variable
Background: In a Prepare recipe I created an "if, then, else" statement. Inside this statement, one of the "then" conditions sets a field to a formula, and this formula is itself another "if" statement that references a global variable. Here is the syntax: Field_A = (formula): if(${global_var} == 1, 'Y','Nxxxxxxx'),…
-
Easy solution to allow viewers to access to data
Hi everybody, my team is starting to develop a webapp so that non-technical users with viewer access can handle their own data extraction needs independently. However, during the development of the webapp, we found that it has many limitations: the development requires a lot of time and effort, it consumes a lot of…
-
Deployment: Free Version
Hello all, I was trying to run the MLOps tutorial, but I am on the free version, so the workflow for deployment isn't the same. I wanted to first know if it is possible to deploy using the free version. If so, what can we deploy and what are the limitations? Is there a resource that explains the process? I am currently…
-
Python code environment library installation error (AxiosError: Network Error)
Hello! :] I have a code environment with a bunch of Python libraries installed. I'm using this for a data science project. I'm now trying to run a survival analysis and build a survival model, with the exploration and model prototype happening in a Python notebook. I have tried to install Python libraries such as pysurvival,…