HI all, i would like to create a dataset based on data quality rules where it failed. I would like to list (create table) all of the failed rows to able to send it to the team what needs to be changed. Failed or not is not enough I need to be able to collect every line where the rules are failed. Did not find and option.…
Hi - Please see below my goal, methodology, and the challenge I am facing. Thanks in advance for your help. GOAL: I have a flow that takes a project variable while running. The output is then visualized on the dashboard. I would like to try different variable values and export the resulting dashboards. I have 10 different…
We want to test database connections in all our instances since our internal security policy is to change database passwords yearly and this invariably leads to some user connections being missed and this invariably leads to some user database connections being missed, the password expiring, flows failing, data not being…
Hello, I’m trying to use the sample.py after unzipping the archive of a model I extracted. The model is a light gbm with a feature selection step. The version of the dss is 12.6.5 However the python script crash after the dummifier step with the error : Indexed_matrix.py Line 35 in _ remap _ key Remapped_key = (key[0],…
hello, everyone I'd like to ask about the cause of the result from executing the following code: client = dataiku.api_client() prj_key = dataiku.Project().project_key project = client.get_project(prj_key) scena = project.get_scenario(scenario_id) The result is: { … 'progress ': { … , 'fekport' : nan, …} … } In other…
Hello We would like to install a package A (ultralytics in our case) but without its dependency B (opencv-python) or to be able to install this package A and then remove one of its dependencies B. The reason behind is that ultralytics doesn't work properly with opencv-python, we need to remove it and install…
Hi. I want to extract data from 7 AM to 10 AM, regardless of the date, in Datetime, but I don't know how to do it. Thank you.
Hello, In my project, I am trying to do two different things with labelings tasks. 1- Automatically create a labeling class according with a registry dataset with the classes that should exists. 2- Get the URL of the labeling class though a Python API to send to slack webhook. This is a issue since I need to send the…
I have a connection to a postgres database. In this database there are two schemes. Is there a way to specify which schema the datasets should be written to? Operating system used: Ubuntu Operating system used: Ubuntu
I am facing a strange issue where even after a dataset is getting created, the code is saying that dataset doesn't exists. Following is the function that I am using which has 3 steps - 1. Creating a table in the database 2. Creating a dataset in DSS and connecting it to the table created in 1 3. Writing a pandas dataframe…
Create an account to contribute great content, engage with others, and show your appreciation.