-
Montonic constraints in Xgboost with hyperparameter search
Hi there, I'm currently using Dataiku to train a model for a prediction task. I need to impose directionality constraints on certain columns, but the current version of xgboost in Dataiku doesn't support these kinds of constraints. I've learned that we can write custom Python models with monotonic constraints as detailed…
-
Recipe failed but notebook runs (same code environment)
My recipe failed to resolve a pip package I installed on code env. The same code runs well on Notebook area but failed on recipe using the same code env. I suspect the recipe uses old version of the code env but I have no way of forcing it to use the latest code env build. How? Operating system used: Windows Operating…
-
How to delete preparation processor
Hello, while following this link, https://knowledge.dataiku.com/latest/plugins/development/tutorial-first-plugin-recipe-processor.html#test-the-preparation-processor I would like to completely delete the Processors Library called Hide colors created during the Test the preparation processor process. Even if delete the json…
-
Cannot use a managed folder as an input of a python recipe created from python code using builder
Hi team, I currently encountered an urgent issue with our project automation pipeline. In a Python code recipe, I used builder to create a new python recipe, and have added a dataset as input, 3 new datasets as output. In order to save some model objects to a managed folder, I need to add an existing folder "Pickle_Files"…
-
Executing a .exe in dataiku dss
I have a folder of data which can be converted to a desired format using a .exe application. The input data and .exe isgiven by 3rd party. How can I do this in DSS? Operating system used: Cloud
-
Creating FTP Datasets
Hello Dataiku Community. I am connecting to an FTP folder to build a dataset. The data I got is the union of all the data in all the files in that directory. What I want is to process only the new files coming to that directory. I was looking at the advanced options but I couldn't find any documentation about the…
-
How do i create a categorisation model for a reviews dataset
Hi there - new to dataiku, Lets say i have an excel sheet of 2 columns where one has app reviews and the other has dates they were posted. Is there a video tutorial anywhere or example where i can create a model to categorise the app reviews into categories eg) ux/ui problem or customer service problem as well as include…
-
Output of Python Recipe Embeddings Not Writing With a Comma Delimiter
Hi, I am using a python recipe to get embeddings from my text features in Dataiku and everything worked out fine, but the embeddings did not come out with comma delimiters when I write the output to a dataframe. I needed that format to perform a similarity search on the dataframe. Does anyone knows a way around this?
-
Error with "Optical Character Recognition (OCR)"
Hello, When I try to use the recipe "Optical Character Recognition (OCR)" on a folder containing grayscaled pictures (obtained with "Greyscale" recipe), it fails. The error type is: "Error in Python process: At line 22: <class 'ImportError'>: libGL.so.1: cannot open shared object file: No such file or directory". Can you…
-
Prepare recipe: the preview is full of data but the results are empty
Hello Community, I am actually helping one of my team work on preparation of data with: - Prepare recipe (running in Partially in database) engine. (Due to multiple data connections) - with some simple script elements (parse data, extract data componants) the preview is as it is as you can see the data is present in the…