-
How to update data from database and run other processes using Scenario and pipeline with SQL DB con
Hi Having connection to Oracle database I need to run Dataiku pipeline periodically assuming that during the update new data from database will be uploaded (based on query that will be simply rerun) and result send from further manipulations. I want to use build in Connection option Operating system used: Windows
-
Read a file, outside the API folder, from a DSS API
Hello, We would like to implement a DSS API with a python function which reads some data files stored in a distant server (not the DSS API Node). This data file name will be an input parameter of the API. The server, where the data file is stored, is known. The data files are maintained by business users, this is why we…
-
Issue with Dropdown Not Showing Options in Dataiku Application Designer
Hi everyone, I'm working on a Dataiku Application Designer project where I need a dropdown to let users select between two dataset options: dataset1 and dataset2. I tried adding the following JSON in the Edit Parameters section: [ { "name": "dataset", "type": "SELECT", "mandatory": true, "canSelectForeign": false,…
-
How to create recipe using create_recipe function from Dataiku Python API?
Hello, I tried to use Dataiku Python API to create recipes given both .json and .shaker files. I load the .json fle and use it in recipe_proto argument of create_recipe function (https://doc.dataiku.com/dss/latest/python-api/projects.html). Similarly, I load the .shaker file and use it in creation_settings argument of…
-
Exporting to Windows Network Drive Folder Location
How can I configure an output tool similar to Alteryx where my data exports as an excel file to a windows PC network drive folder location? Right now, I'm only seeing ouput to SharePoint or folders within Dataiku as options.
-
Divide a column by another one
Hi, I am supposed to get the GDP per capita. I have a column GDP and a column population. How I can divide GDP by population ?
-
Send dataset to Teams message
Right now I use Reporter + Teams Workflows to send scenario finish reporter to teams chat. I see in scenario step, we can send message to mail including a dataset as HTML variables to email. It can also keep the color rule on the Dataiku table. I'm wondering if this is possible via Reporter? My problem is I can't send to…
-
Using Spark in DSS to process CSV files
Hi! I'm currently testing out the possibilities for leveraging Spark in our ETL pipelines. My usecase is that I have 90% of cases where I start with just raw text files in Azure Blob Storage (usually CSV's or TXT's). How can I plug in Spark to read and process those files? If I select the Spark engine or run a custom Spark…
-
Is there a way to add a calculated column in Dataiku?
I have a date column; some of the fields are empty in this column. Is there a way to "count" if there is a value in the field for that row and place into a new column? Something like below: Operating system used: Windows 11
-
ORDER BY in SQL recipe does nothing, yet it works in SQL notebook
My team uses Dataiku to write flows consisting mostly of SQL recipes with data stored on an Amazon Redshift server. I have noticed the odd behavior that ORDER BY clauses in SQL recipes do not seem to have any effect on the resulting Dataiku dataset. They do, however, still work inside SQL notebooks, which suggests that…