-
Dataiku Online - How to know resource consumption (CPU, memory, most consuming jobs, ..)?
Hello!!! Context: * We are currently using Dataiku Online DSS, in the cloud. * My role is: Space Admin. * We are targeting moving to a EC2 machine by next year due to our license ending * Our license is Ikigai (we are a non-profit org) We would like to know how to measure resource consumption stuff like: * Most demanding…
-
When working with a Partitioned dataset is there a way to determine which partition the record is in
I'm working with several partitioned datasets. I've run into a problem that data in one of the partitions is partially corrupt. (Lots of extra spaces added to a field.) Going forward, I can put steps into my recipes to correct this before the data is put into the partitioned data set. However, I would like to clear and…
-
Read CSVs from S3 folder, process, write processed CSVs to S3 folder
How should I: 1. Read CSVs from an S3 folder 2. Process these CSVs with custom python code 3. Write these processed CSVs to an S3 folder. A different folder to the input I guess. Thanks in advance
-
Drop Data
When I delete a dataset, Dataiku always asks whether I wish to delete the data. I understand that if this is dataset built from data I uploaded that the question makes sense. But if I am deleting a dataset produced by a recipe, what is the meaning of the "drop data" option? I do not understand. What data is being deleted.…
-
Export Documentation
In the documentation, I read the instructions to create a flow document: * You can generate a document from a project with the following steps: * Go to the flow of the project * Click the Flow Actions button on the bottom-right corner * Select Export documentation * On the modal dialog, select the default template or…
-
M1 libraries
Dataiku does not run natively on the M1 Mac. But I am wondering if my library code can use libraries compiled for the M1 chip while the Dataiku framework continues to run on the Intel Chip through Rosetta. Thanks. Operating system used: Mac Ventur
-
How to use DSS formula functions inside Reporters txt message
Hi, I can't seem to be able to use any of the dss functions inside the reporters text editor. example: ParseJson(${variable}) or ArrayLen(${variable)) - I get the text printed out as is. I've the template type option set to DSS.
-
Create a writable Dataset
I have created a custom algorithm, and in its class constructor, I would like to write a data frame to an empty Dataset created by me in the Flow window. I tried to create a Dataset via the Dataset Menu, by choosing `Editable`. That creates a read-only Dataset. My question is how to create a writable Dataset. Is it…
-
Custom top-N model
I am having issues with my custom model. From what I read, models should derive from `scikit-learn` and I should make sure I have a `classes_` attribute in my model. However, my model is a top-N recommender, and I am using `rankfm`, a factorization machine model available in Python/C on github. I have included all the…
-
How to produce summary of results from R coding in Dataiku?
Hello, I perform propensity score matching and try to produce a summary of results. How can I see these results inside Dataiku? In other words, can I request to produce a dashboard from my "summary(match.itzs)" code line? Here is my code: library(dataiku) library(MatchIt) library(dplyr) library(optmatch) # Recipe inputs…