Plugins & Extending Dataiku
-
Predictions made using Dataiku Snowpark API Yield Single Class for Multi-Class Classification model
Hello community, I am facing an issue when using Dataiku's Snowpark API to make predictions from a multi-class classification model. The model is trained with Dataiku and a Snowflake table via Visual ML. The predictions only return one class, while the same table loaded as a pandas DataFrame returns correct predictions…
-
Problem with np.bool dependency in "dataiku" package in python
Hi, I'm having a problem importing the "dataiku" package in python in the following environments since update to 11.2 python 3.6 python 3.7 python 3.9 # Example: load a DSS dataset as a Pandas dataframemydataset = dataiku.Dataset("PWHRn6UM") import dataiku…
-
Hierarchical Models
For my current situation I am looking to set up hierarchical models to describe the data I have where it is a standard situation of having multiple observations per customer across multiple days where one can easily see that the behavior of each customer is more similar to their own than others and therefore wanting to…
-
Importing code from a plugin into python notebook
I'd like to import python functions from a (self-made) plugin into a python notebook. I've found this part in the User's Guide, but it's lacking examples and I can't get it to work: https://doc.dataiku.com/dss/latest/plugins/reference/other.html If I have a plugin with plugin id "xyz" and the following folder structure xyz…
-
Can't get the predictor from a clustering model handler using the python API
I've been having success manipulating saved models using the python public API for both prediction and classification models to get more information from them and use the predictors in recipes. But I'm having trouble doing the same with clustering models trained in VisualML and deployed to the flow. I usually do import…
-
Multi-worksheet xlsx Exporter [SOLUTION]
Hey, At the current moment, there is no way of me to export multi-worksheet Excel file to a folder without installing a plugin, so decided to create one. Very simple script and can be found here: https://github.com/akmalsoliev/dataiku_multi_sheet Coming across some message boards noticed that the only purposed option was…
-
Is there a limit to a directory structure that .list_paths_in_partition() can traverse
All, I've been working on a project with a fairly deep data structure stored in a remotely mounted DSS "managed folder". In one of the directories, I've got ~140,000 files. In another, directory I've got ~170,000 files. In a smaller directory python code like this seems to correctly pull all of my file paths. # -*- coding:…
-
Creating Plugins to extend Dataiku DSS
Hi, I have been using Dataiku DSS for some time and love it's functionalities although at times I would like to add something to it. Has anyone created some sort of a cheat sheet of different things we can do to extend DSS, one of them would be to write our own plugins. Any resources (blogs, Dataiku docs, etc...) to do…
-
Creating a project with python in project folder
Hi community, just a small question. If I would like to create a new project from python, I can use the create_project() function from the dssclient.py. The definition is def create_project(self, project_key, name, owner, description=None, settings=None, project_folder_id=None) However, I am not sure how I can set the…