-
Get a list of code environments that use a specific python package and the version specified?
Hi, We have an internal python package that gets used by dozens of code environments and we need to know which code environments those are and what version is present. Is there a way to do this with DSS? thx in advance Operating system used: Windows 10
-
Como hacer Naive Bayes en Dataiku
-
ModuleNotFoundError When Accessing Knowledge Bank via REST API SDK
Hi everyone, I'm running into an issue while trying to access a Dataiku Knowledge Bank from an external Python script using the Dataiku REST API client. When I execute my script, I get the following error:ModuleNotFoundError: No module named 'dataiku.core.vector_stores.lifecycle' Here are my environment details: Dataiku…
-
Renaming a dataset using Python API
Dear Community, I am trying to rename a dataset from a project using the python API using the rename method from the dataikuapi.dss.dataset.DSSDataset class (https://developer.dataiku.com/latest/api-reference/python/datasets.html#dataikuapi.dss.dataset.DSSDataset.rename) but I get an AttributeError: 'DSSDataset' object has…
-
Exception: Unable to fetch schema for PROJECT.dataset: b'Ticket not given or unrecognized
Hi there, I encounter the sudden issue of not being able to load datasets into a Jupyter Notebook. Changing environment/Kernel doesn't help. System reboot doesn't help. Force reloading doesn't help neither. Nothing was changed in the code. Flow still runs, so it runs as a receipt but not when trying to work in the…
-
The recipe execution is taking long time due to handling a large volume of data in dataiku
We are experiencing long execution times for a recipe in Dataiku due to handing large datasets, while we have implemented partitioning using a filter on a specific column, it still takes 1.5-2 hours to partitioning 30M records. Is there a more efficient way to handle and process this data quickly and effectively because…
-
How to execute a recipe after an empty dataset ?
Is there any possible way of checking readyness of a dataset? I have a dataset that might be empty after a Hive query, it shouldn't be a problem but since it is (I cannot use it in a left join...) I decided to build another dataset that would contain either the result if it exists or a dummy line if it does not. All this…
-
Can a project variable be used as a scenario variable?
Hi, I have a SQL query that when it returns no data, I need to be able to change the email content of my scenario reporter. I am currently setting a project variable and have a couple questions: 1. Can I use the project variable to conditional change which email reporter to use? 2. If no to #1, what is the best way…
-
Problem using python recipe
I have a problem when using a python recipe that uses the kubernetes engine when running. I checked the log step by step, the python process should have been completed, but for some reason the kubernetes engine does not show a sign of completion and looks like an unfinished loop. Please help with this error.
-
How to run integration tests on flows with Python recipes
I've recently started to use the "Run integration test" scenario step for testing. It's definitely some work to create the test reference datasets but it once set up it's great to be able to run this test after later code changes to confirm the process works as expected. Our flows typically mostly use SQL script recipes.…