-
How to run/build a flow zone using dataiku python api
Let say that I have created a flow zone with id 'xytg' and I want to trigger / run/ build this flow zone using python api. How can I do that? I tried to look into the python api but could not find it.
-
Export predicted data through code
I need to export predicted data from visual analysis using python. I am using performance on train set as a reference for my monitoring therefore I need to automate downloading the predicted dataset to keep it up-to-date in case I retrain my model Operating system used: Windows
-
The Python process failed (exit code: 2)
Hi, I'm new to this tool and was following tutorials from Quick Start courses. When I was trying build the flow, I got the following error: [18:46:22] [INFO] [dku.flow.activity] - Run thread failed for activity…
-
Delete a partition
Hi, can anyone please help me on deleting a partition, i have created a dataset having many partitions and unfortunately one of the partition is loaded incorrectly and i would like to delete only partition and maintain the remaining same. please help on this, as i am unable to get any options to delete this.
-
Export data to password-protected xls
Hi! Does DSS have an option for setting a password on a newly generated xls-file when exporting data ? A general password that is, needed for opening that file. If a pythonscript is the way to handle this, what would be the preferred library for that ? Thanks in advance for any thoughts on this! Jurre
-
Export Partitoned Dataset
I'm trying to use a python recipe to export a partitioned dataset, when I partition by a specific column(date column(LDD for example)), that column is removed from the dataset. How would I export the partitioned dataset into monthly files based on LDD which was partitioned?
-
Update excell sheet via python script
Hi folks, I want to update specific cells in an Excel sheet using openpyxl.load_workbook in Python. When I run the code, I don’t encounter any errors, but nothing gets updated in the file. How can I solve this problem? Thanks in advance.
-
Repeated Random Splitting and Bootstrapping with XGBoost
I have a dataset that I want to random split into train and test sets with an 80/20 ratio. I aim to repeat this random splitting and bootstrapping of the training data 1,000 times. For each iteration, I'll train an XGBoost model and then export the SHAP values, Gini index for each feature, F1 score, and ROC AUC for the…
-
using pre deployment hook create a pv and pvc?
"I am trying to add a Persistent Volume Claim (PVC) to a Kubernetes deployment using a pre-deployment hook in Dataiku. Could you provide any documentation or steps outlining how to add a volume to a deployment through a pre-deployment hook?" Operating system used: windows Operating system used: windows
-
Looking to replicate a SUM(COUNTIF) formula in Dataiku
I am working on a scorecard in Dataiku and I would like to calculate the percentage of completion in a set number of columns. Basically, I would like to replicate this formula in excel: =SUM(COUNTIF(ColumnX:ColumnXX,"*")/Total Number of Columns) and am having issues. The columns are a mix of strings, integers, and text,…