-
How to find un-used shared-in datasets of a project with python API ?
Thanks for your time at the beginning. I have a project and I want to know which datasets are shared-in from other projects (black icons) with Python API. However, those shared-in datasets could be seperated into 2 group: ① unused: just showing in in the zone for checking ② used: being used through recipe for analyzing I…
-
Meet "Connection xxx not found" when exporting a project
Thanks for your time at the beginning. I am currently exporting a project, only check the 4 default options. However, it failed with a warning "An invalid argument has been encountered : Connection 'SF_VAW_PROD_ATP_MED' does not exist"Then I tried to use Python API to find this connection, but failed again: import dataiku…
-
Get a list of code environments that use a specific python package and the version specified?
Hi, We have an internal python package that gets used by dozens of code environments and we need to know which code environments those are and what version is present. Is there a way to do this with DSS? thx in advance Operating system used: Windows 10
-
Como hacer Naive Bayes en Dataiku
-
ModuleNotFoundError When Accessing Knowledge Bank via REST API SDK
Hi everyone, I'm running into an issue while trying to access a Dataiku Knowledge Bank from an external Python script using the Dataiku REST API client. When I execute my script, I get the following error:ModuleNotFoundError: No module named 'dataiku.core.vector_stores.lifecycle' Here are my environment details: Dataiku…
-
Renaming a dataset using Python API
Dear Community, I am trying to rename a dataset from a project using the python API using the rename method from the dataikuapi.dss.dataset.DSSDataset class (https://developer.dataiku.com/latest/api-reference/python/datasets.html#dataikuapi.dss.dataset.DSSDataset.rename) but I get an AttributeError: 'DSSDataset' object has…
-
Exception: Unable to fetch schema for PROJECT.dataset: b'Ticket not given or unrecognized
Hi there, I encounter the sudden issue of not being able to load datasets into a Jupyter Notebook. Changing environment/Kernel doesn't help. System reboot doesn't help. Force reloading doesn't help neither. Nothing was changed in the code. Flow still runs, so it runs as a receipt but not when trying to work in the…
-
The recipe execution is taking long time due to handling a large volume of data in dataiku
We are experiencing long execution times for a recipe in Dataiku due to handing large datasets, while we have implemented partitioning using a filter on a specific column, it still takes 1.5-2 hours to partitioning 30M records. Is there a more efficient way to handle and process this data quickly and effectively because…
-
How to execute a recipe after an empty dataset ?
Is there any possible way of checking readyness of a dataset? I have a dataset that might be empty after a Hive query, it shouldn't be a problem but since it is (I cannot use it in a left join...) I decided to build another dataset that would contain either the result if it exists or a dummy line if it does not. All this…
-
Can a project variable be used as a scenario variable?
Hi, I have a SQL query that when it returns no data, I need to be able to change the email content of my scenario reporter. I am currently setting a project variable and have a couple questions: 1. Can I use the project variable to conditional change which email reporter to use? 2. If no to #1, what is the best way…