Hi! I'm currently testing out the possibilities for leveraging Spark in our ETL pipelines. My use case is that in 90% of cases I start with just raw text files in Azure Blob Storage (usually …
I have a date column; some of the fields in this column are empty. Is there a way to "count" whether there is a value in the field for that row and place the result into a new column? Something like below: Operating …
Hi, I am currently using DSS version 13.4.0. When I copy a flow zone A, the names of the managed folders inside zone A are the same as the ones in the copy zone B, though their hash ids differ. The ha…
Hi, when my SQL query returns no data, I need to be able to change the email content of my scenario reporter. I am currently setting a project variable and have a couple of questions: 1. Ca…
My project has local variables. I created a webapp using Dash for the project to render a UI dropdown. Although I see the layout, the selected values are not reflected in the project. I still need to hard-code …
I have created an API endpoint of type Python code. Inside, I want to run a scenario: the idea is that when an API request is received, a scenario run is triggered. The problem is that if I run this e…
I'm creating a dashboard for reporting abnormal data series, looking at the past 90 days of sales. I want to create a line chart of the past 90 days of sales to monitor spikes and abnormalities. But the lin…
I have a problem with a Python recipe that runs on the Kubernetes engine. I checked the log step by step; the Python process should have completed, but for some reason the kuberne…
SELECT DISTINCT N1.COLUMN1 as "Column 1", max(N1.COLUMN2) as "Column 2" FROM DB GROUP BY N1.COLUMN1 ORDER BY N1.COLUMN1 Hello, I'm trying to reproduce a SQL script with Dataiku recipes, and in that s…
I’ve developed a custom Python API endpoint for regression and successfully predicted outcomes for individual records. However, when I attempt to process a batch of records, I encounter the following …