I'm trying to calculate the month-over-month percentage change in Dataiku using the following formula: ((sales / sales_lag) - 1) * 100, where sales is the numerator column and sales_lag is the denominator column containing the lagged (previous month's) values. However, the sales_lag column can have null or zero values, which…
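A minimal sketch of one way to guard that formula against a null or zero denominator, written in plain Python rather than in Dataiku's formula language. The function name `pct_change` and the choice to return `None` for undefined cases are assumptions made for illustration, not something the question specifies.

```python
from typing import Optional


def pct_change(sales: Optional[float], sales_lag: Optional[float]) -> Optional[float]:
    """Month-over-month change in percent, or None when it is undefined."""
    # A missing value or a zero denominator has no meaningful ratio,
    # so return None instead of raising or producing infinity.
    if sales is None or sales_lag is None or sales_lag == 0:
        return None
    return ((sales / sales_lag) - 1) * 100


print(pct_change(150, 100))   # regular case
print(pct_change(100, 0))     # zero denominator
print(pct_change(100, None))  # missing previous month
```

Returning an empty value keeps undefined months distinguishable from a genuine 0% change; substituting 0 instead would hide that difference.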
I have a string value called hours, formatted as "00", and a value called minutes, formatted as "07". I'm trying to use the concat function to create an HH:mm value = 00:07, but for some reason concat gets rid of the leading 0s and results in 0:7. Is there a way around this? If not, is there any other way I can create…
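As an illustration of the desired result, here is a small Python sketch that builds the HH:mm string while forcing two digits on each part. It is not Dataiku formula syntax; the helper name `to_hhmm` is hypothetical, and it assumes both inputs can be converted to integers.

```python
def to_hhmm(hours, minutes) -> str:
    # Convert to int first so "7", "07" and 7 are all accepted,
    # then format each part with a fixed width of two digits.
    return f"{int(hours):02d}:{int(minutes):02d}"


print(to_hhmm("00", "07"))
print(to_hhmm(0, 7))
```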
I think having to create a dataset for every intermediary step in Dataiku is not very efficient, especially from a data storage standpoint. I think it's causing a lot of redundant data to be stored in the process of creating a workflow. Is there any way of combining and executing multiple recipes together or not storing…
I am trying to create an API endpoint for data lookup. When I select the deployment policy "reference", I get the test query results correctly, but when I try to access the API through Postman it shows the error "failed to load JDBC driver". When I select the deployment policy "bundled", it shows the error at the test query as "Dev…
Hello, I am using Dataiku 12.5.2 and currently running Spark 2.4. Dataiku is installed on a server named A, while Spark is installed on a server named B, configured as a standalone Spark installation without Hadoop. Both server A and server B are capable of TCP communication and allow SSH access. How can I use the Spark on…
I have a dataset full of jobs, and each job has a start date. I want to create a formula that filters out my data where the start date hasn't happened yet / is less than the current date. The issue I'm having is that when I use the now() function it gives me the current date and time; I only need the current date in a parsed date…
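The date-only comparison described above can be sketched in Python as follows. This is an illustration outside Dataiku's formula language; the helper name `has_started` is hypothetical, and treating a job that starts today as already started is an assumption.

```python
from datetime import date, datetime
from typing import Optional


def has_started(start: datetime, today: Optional[date] = None) -> bool:
    # Drop the time of day on both sides so only calendar dates are compared.
    if today is None:
        today = date.today()
    # Assumption: a job starting today counts as already started.
    return start.date() <= today


reference_day = date(2024, 1, 1)
print(has_started(datetime(2024, 1, 1, 23, 59), reference_day))
print(has_started(datetime(2024, 1, 2, 0, 0), reference_day))
```

Passing `today` explicitly makes the check reproducible in tests; leaving it out falls back to the current system date.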
It's common to realize there is a better name for a dataset, or for a dataset's name to require revision after further adjustments made elsewhere in the workflow. In such an instance, I renamed the output dataset (which is a table in a SQL database) in Dataiku. The initial name was "backend" and I updated it to…
I have a column called minutes that can range from 0 to 60. What I'm trying to do is add a leading 0 to the single-digit values (e.g. 9 = 09), so that my value is always 2 digits long no matter what. Is that possible? If so, how would I accomplish that? Operating system used: Windows
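For reference, a minimal Python sketch of the padding itself, which could be adapted to a Python recipe. The helper name `pad_minutes` and the sample values are made up for illustration; this is not Dataiku formula syntax.

```python
def pad_minutes(minutes: int) -> str:
    # zfill left-pads the string with zeros up to the requested width
    # and leaves values that are already two digits long unchanged.
    return str(minutes).zfill(2)


sample_values = [0, 9, 10, 60]  # hypothetical values from the minutes column
print([pad_minutes(m) for m in sample_values])
```

Note that the padded result is a string, so the column can no longer be stored as a number without losing the leading zero.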
How do I access the Dataiku dsscli to run a CLI command in a conda environment?
I was trying to start a job in order to build a dataset whose parent recipe is a very simple streaming Python recipe. It doesn't throw any error, but it doesn't build the dataset either. The same works fine with a normal Python recipe, and I'm able to build a dataset. Is there any way I can achieve the same for a streaming…