-
Multi-label classification
Hello community! Is it currently possible to perform multi-label classification using the in-built recipes, lab (or any other) features in DSS?
-
Add custom checks to dataset programatically
Hi, Is there a way to add custom checks in the same way as adding metrics to a dataset through the dataset api ? Thanks
-
Model deploy
For deploying a model from DSS, is it currently mandatory to go via the Lab feature? As in, do we have to train a model from inside the Lab?
-
Job failed: Error in python process: At line 16: Python process is running remotely
Dear Experts, currently using Dataiku online. I am trying to read the cleaned dataset , train it and get the model stored in the folder I am new in using Dataiku api, would like to take help here . Then use it in dataiku to test and create api. #…
-
Prepare recipe
Hello, just wanted to know if this data transformation is possible out of the box. text label text label abc ['A', 'B'] => abc ['A', 'B'] def C def ['C']
-
Implementing SCD2 (slowly changing dimension) in Dataiku
Hi, I want to implement SCD type 2 in Dataiku. Can it be possible to implement using Dataiku visual recipes. And what are the alternatives as well to implement the same. Regards, Ankur
-
How to get row count & dataset size using project.get_dataset() API?
Hi Team, We have a requirement wherein we need to log the updated dataset size and row count for all datasets of different projects. We tried two approaches, both of which takes huge amounts of computing time. It would be really helpful if you can let us know if there is a better, optimized way to perform this given we…
-
How to apply a partition-trained model to all partitions
Hello, I am trying to train a partitioned model on all partitions of a dataset and then apply this model to make predictions on another partitioned dataset. Dataiku presents an option to train on 'all partition' but when I try to score, using the 'all available' partitions parameter, it fails (I get the error "Path does…
-
Split my dataset from key column auto-recognition
Hello, Bonjour, I have a dataset and I want to spilt it in 2 parts. Data from a key column (2000 rows) In this key column is a full sentence (exemple : "hello how r u", "how r you", "hello fine") And when a key word is detected ("exemple : hello") it'll split automatically in one new dataset and when it's not detected…
-
Parsing xml file
Hi, I want to read xml file from dataiku into csv structure as output. When I upload xml file by using type xml in format and preview. It showing 6 columns in schema. Where 3 columns is having nested array and in each content it was append with xml_ text. When I prepare recipe I used unfold the array option for the nested…