-
Delete a partition
Hi, can anyone please help me delete a partition? I created a dataset with many partitions, and unfortunately one of them was loaded incorrectly. I would like to delete only that partition and keep the rest unchanged. Please help, as I cannot find any option to do this.
-
Export data to password-protected xls
Hi! Does DSS have an option for setting a password on a newly generated xls file when exporting data? A general password, that is, needed to open the file. If a Python script is the way to handle this, what would be the preferred library? Thanks in advance for any thoughts on this! Jurre
-
Export Partitioned Dataset
I'm trying to use a Python recipe to export a partitioned dataset. When I partition by a specific column (a date column, LDD for example), that column is removed from the dataset. How can I export the partitioned dataset into monthly files based on LDD, the column it was partitioned on?
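One possible approach, sketched under assumptions: read the data with the LDD column still present (for example from the unpartitioned input), then split it by calendar month in pandas. The helper name `export_monthly`, the `export_YYYY-MM.csv` naming, and the sample data are hypothetical, and the sketch assumes the data fits in memory and that LDD parses as a date.

```python
import tempfile
from pathlib import Path

import pandas as pd


def export_monthly(df, date_col, out_dir):
    """Write one CSV per calendar month of `date_col`; return the paths written."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    # Month key such as "2024-01", derived from the date column.
    months = pd.to_datetime(df[date_col]).dt.strftime("%Y-%m")
    written = []
    for month, chunk in df.groupby(months):
        path = out_dir / f"export_{month}.csv"
        chunk.to_csv(path, index=False)  # the date column stays in each file
        written.append(path)
    return written


# Hypothetical sample data standing in for the real dataset.
sample = pd.DataFrame({
    "LDD": ["2024-01-05", "2024-01-20", "2024-02-03"],
    "amount": [10, 20, 30],
})
paths = export_monthly(sample, "LDD", tempfile.mkdtemp())
```

Because the grouping key is computed separately from the DataFrame, the LDD column is kept in every monthly file.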
-
Update Excel sheet via Python script
Hi folks, I want to update specific cells in an Excel sheet using openpyxl.load_workbook in Python. When I run the code, I don’t encounter any errors, but nothing gets updated in the file. How can I solve this problem? Thanks in advance.
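A common cause of this symptom is that the workbook is modified in memory but never written back with `save()`. The question does not show the code, so this is only a guess at the cause. A minimal sketch, with a hypothetical file name and cell:

```python
import tempfile
from pathlib import Path

from openpyxl import Workbook, load_workbook

# Build a small hypothetical workbook to stand in for the real file.
path = str(Path(tempfile.mkdtemp()) / "report.xlsx")
wb = Workbook()
wb.active["A1"] = "old value"
wb.save(path)

# Load the file, change a cell, then save.
# The change exists only in memory until save() is called.
wb = load_workbook(path)
ws = wb.active
ws["A1"] = "new value"
wb.save(path)

# Re-open the file to confirm the change reached disk.
reloaded_value = load_workbook(path).active["A1"].value
```

If `save()` is already being called, it is worth checking that the path passed to `save()` is the same file being inspected afterwards.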
-
Repeated Random Splitting and Bootstrapping with XGBoost
I have a dataset that I want to random split into train and test sets with an 80/20 ratio. I aim to repeat this random splitting and bootstrapping of the training data 1,000 times. For each iteration, I'll train an XGBoost model and then export the SHAP values, Gini index for each feature, F1 score, and ROC AUC for the…
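The resampling loop itself can be sketched independently of the model. The generator below, with the hypothetical name `split_and_bootstrap`, yields row indices for each iteration: a fresh random 80/20 split, then a bootstrap sample drawn with replacement from the training part only. Model training and metric export are left out and would go inside the loop that consumes it.

```python
import numpy as np


def split_and_bootstrap(n_rows, n_iter, test_frac=0.2, seed=0):
    """Yield (bootstrap_train_idx, test_idx) index arrays for each iteration."""
    rng = np.random.default_rng(seed)
    n_test = int(round(n_rows * test_frac))
    for _ in range(n_iter):
        perm = rng.permutation(n_rows)  # new random split each iteration
        test_idx, train_idx = perm[:n_test], perm[n_test:]
        # Bootstrap: resample the training rows with replacement.
        boot_idx = rng.choice(train_idx, size=train_idx.size, replace=True)
        yield boot_idx, test_idx


# Small demonstration; the question asks for n_iter=1000.
rounds = list(split_and_bootstrap(n_rows=100, n_iter=5))
```

Drawing the bootstrap sample only from the training indices keeps the test rows unseen in every iteration.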
-
Looking to replicate a SUM(COUNTIF) formula in Dataiku
I am working on a scorecard in Dataiku and I would like to calculate the percentage of completion across a set number of columns. Basically, I would like to replicate this Excel formula: =SUM(COUNTIF(ColumnX:ColumnXX,"*")/Total Number of Columns) and am having issues. The columns are a mix of strings, integers, and text,…
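In a Python recipe, one way to approximate this is to count the filled cells in each row and divide by the number of columns. This is a sketch, not the exact Excel semantics: it assumes "complete" means neither missing nor blank, for any data type. The function name `completion_rate` and the sample columns are hypothetical.

```python
import pandas as pd


def completion_rate(df, columns):
    """Return, per row, the fraction of `columns` that are filled in."""
    subset = df[columns]
    # A cell counts as filled if it is not missing and not whitespace-only.
    not_blank = subset.astype(str).apply(lambda s: s.str.strip()).ne("")
    filled = subset.notna() & not_blank
    return filled.sum(axis=1) / len(columns)


# Hypothetical sample mixing text and numeric columns.
sample = pd.DataFrame({
    "name": ["Ann", "", None],
    "score": [3, None, 7],
    "note": ["ok", "late", "  "],
})
rates = completion_rate(sample, ["name", "score", "note"])
```

Multiplying the result by 100 gives the percentage for the scorecard.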
-
Time Series values differ from original data
Starting from an original dataset, I used the Time Series Lab and deployed a model. I then trained the model and scored the original dataset to get forecast data. The forecast data contains date, values, forecast, and percentile columns. I assume values to be the original data. However, when I compare it to my original data it…
-
An invalid argument has been encountered : Unknown DSS variable: dip.projectKey
Hi, I am attempting to use the saved model object to predict a dataset using the score recipe. However, I am encountering the following error message: "Invalid argument - An invalid argument has been encountered: Unknown DSS variable: dip.projectKey." Can you help me resolve this issue? Operating system used: Windows
-
Get Dataiku Projects with Tag "Foo"
Is there a mechanism to retrieve the list of projects in our Dataiku instance that have certain specific tags? Thanks!
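The filtering step can be sketched in plain Python. The assumption here, which should be checked against your DSS version, is that the project listing (for example from `dataiku.api_client().list_projects()`) is a list of dicts with `projectKey` and `tags` entries. The helper `projects_with_tag` and the sample listing are hypothetical.

```python
def projects_with_tag(projects, tag):
    """Return the keys of projects whose tag list contains `tag`."""
    return [
        p["projectKey"]
        for p in projects
        if tag in p.get("tags", [])  # assumed shape: "tags" is a list of strings
    ]


# Hypothetical listing standing in for the real API response.
listing = [
    {"projectKey": "SALES", "tags": ["Foo", "prod"]},
    {"projectKey": "SANDBOX", "tags": []},
    {"projectKey": "CHURN", "tags": ["Foo"]},
    {"projectKey": "LEGACY"},
]
matches = projects_with_tag(listing, "Foo")
```

Using `p.get("tags", [])` keeps the filter from failing on entries that carry no tag list at all.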
-
How can I export graphs and outputs from a Dataiku notebook into Google Sheets?