Using Dataiku

How to get the value before a specific delimiter
If I have a string value formatted as firstname_middlename_lastname_initials_age_gender and I only need everything before age, is there a formula I can use to give me everything before the fourth instance of my delimiter "_"? I know I can probably use the split function to split them all apart and then concat together the…
How to pivot from column to rows
I have data that has a column with the days of the week, source and season. I'm trying to split my days column to rows. I have picture of my current input and expected output. I tried to use the split function but that split the days into different columns not rows. is there a way to get my desired output using dataiku?…
How to filter dates less than current day
I have a dataset that has a list of dates and days of the week, is there a way I can filter my dataset for dates that are less than or equal to the current date? I tried the formula date = now() but I don't think that's working, it just filters all my data out. Operating system used: windows
Suggestions on deploying an End Point
I have a dataset with content and URLs, I have created embeddings for this content and saved in a new dataset, so it now has the embedded content and the URLs, I want to create an API which can send content and I embed this content and perform a similarity search on the embedded dataset and give the top 3 matched URLs, I…
Defining a variable before scenario starts
Hello everyone! This past week, I've been working on a project where I've implemented a scenario reliant on database modifications. However, the issue is encountered when you need to redefine the same variable every time the scenario runs. This process involves loading a huge dictionary and AI models, consuming 2-3 minutes…
Conditionnal formating - Dashboards
Hello, I would like to add some colurs into published dashboards of my project. For instance, if the number is equal to 0, the cell should be highlighted in green... Does someone know how to do that ? Thank you !
Importing code from Git in global shared code
I noticed that it is now possible to sync per-project libraries with a git remote, however I did not see the feature for the global shared code folder. Is it a bug or you wouldn't recommend to sync the global shared code with a remote?
How to unzip files in a remote Managed Folder (as Azure File Share) using python?
I have zip files in a remote Managed Folder (as Azure File Share) and I need to unzip these files using Dataiku (Python) in this same folder to proceed with the flow. However, I was not able to do it following the examples showed in the API documentation (Managed folders — Dataiku DSS 12 documentation) Is it possible to do…
Torch backend for visual deep learning
Hi, As of now, "visual deep learning" interface implies that the model must use keras, and is forced to a tensorflow backend. Keras can use other backends, such as torch (About Keras 3) ; following this article I ran into various issues, leading me to believe that it is not supported by dataiku at the time of writing. Is…
Flag bank holiday
Hello Dataiku community, do you know if the bank holiday flag will work for the coming years (2, 3 years)?

Trending Discussions

Python API for fleet manager
Hi all, I am trying to connect to the FMClientAWS using the documentation here : But I am not sure what to use as key_id and key_secret. I could not find anything related to this in the FM UI. Can anyone help me with how to get these two values for my FM? Thanks! Operating system used: linux Operating system used: linux
SSH error connecting to ftp1.str.com, caused by: JSchAlgoNegoFailException: Algorithm negotiation fa
Getting the below error when trying to ping of the SFTP folders in the dataiku instance: SSH error connecting to ftp1.str.com, caused by: JSchAlgoNegoFailException: Algorithm negotiation fail: algorithmName="server_host_key"…
Embedding a link to download a dataset in scenario email report?
Hi, Our workflows take a user-defined data payload and create N datasets. Is it possible to add a hyperlink with the dataset name as the text and the link being the URL to download the dataset as a CSV file? thx Operating system used: Windows 10

Leaderboard

Turribeach 3582

tgb417 2477

Ignacio_Toledo 1079