-
Limiting per project jobs for all projects
Hi, Is it possible to limit the number of jobs to say 5 for each project on a DSS instance? The documentation indicates I can do it for specific projects but doesn't make it clear if it's possible for all projects. I'd like to set the global limit to 20 and limit every project so no project exceeds 5 jobs. It's too…
-
Empty Hive table when processing with pyspark
Hello, I have a problem with a Hive table : - when trying to process the table using Pyspark (ex : df.count() ) i get 0 rows which means an empty DataFrame. - then when trying to investigate, using a Hive query (SELECT COUNT(*) FROM TABLE) i get all the data in that table. Does anyone have a solution to that or knows why…
-
Limiting the number of concurrent activities for all users
From https://doc.dataiku.com/dss/latest/flow/limits.html I see that I can setup a limit for a given user To define a limit for a given user, the key must follow the pattern “user/XXXX” (where XXXX is the username of a user). I was looking for a way to establish a general limit for al users, saying each individual user has…
-
Dataiku python library install
hello Dataikers I have a question. 1. Is it possible to add python library to dataiku in closed network on-premises? If possible, how should I perform it? 2. Is there a limited library when installing dataiaku Python library through pip Or is there a library that cannot be installed? (For example, in Anaconda, library…
-
SQL intermediate dataset isolation
So I'm trying to admin a dataiku platform. Until recently for security reasons we weren't cleared to use SQL (Postgres) datasets, so I've only recently started to experiment with those. Now I've noticed that by default all the intermediate tables appear for everyone sharing the connexion. There are quite a few problems…
-
Disable selection of backend in recipe compute configuration
How to set containerized environment as the only option available for the user? We wanted to disable the selection of backend from the options so that none of the jobs submitted will go to server backed. Operating system used: Linux RHEL
-
Is it Possible to Create Custom Notification Email for Govern Node For Requesting Sign Off
Hello fellow Dataiku users, Hope all is well! We are using the Dataiku Govern Node and have enabled notification emails for Sign Off Requests for example for bundles. We wanted to know if anyone has tried creating a custom notification or if this is possible? For example we would like the email to include additional…
-
Change recipe default engine
Hi all, I'm wondering if there's a way to change the recipe default engine from DSS to Spark as I'm using S3 datasets. I know it can be done for a specific recipe by clicking the icon below the Run button, but I want to change it for the whole project. Thanks in advance!
-
Setting up Event Server
I have an enterprise Design Node instance controlled through a Fleet Manager. I have followed these instructions: https://doc.dataiku.com/dss/latest/operations/audit-trail/eventserver.html I have also seen several of the other posts here: https://community.dataiku.com/t5/Setup-Configuration/DSS-event-server-setup/m-p/22948…
-
Training for fleet manager for AWS.
All, Is there any training on using Fleet manager on AWS to setup Dataiku DSS. Particularly around best practices on setting up Kubernetes. Over the coming months I hope to be setting up a research environment of DSS, that will be used to support researchers and automate workflows, and support some model building.…