-
Logging in dataiku notebook / recipe ...
Hello Team, I am working on pyspark recipes. I use notebook to build the logic and change it back into recipe. The dataiku and spark operations ( e.g. df.count() ) emits a lot of log statements to the console and makes the notebook very difficult to use. Is there a way for me to supress logging from dataku and spark APIs?…
-
Defining a global variable in the base name of the output file for a dataset
Hello I am working on a flow that has a python recipe that sets global variables. In the output dataset of the recipe a couple of these variables are being used to set the path and filename of the dataset which is stored in Azure. From researching on how to define the filename it states to set the "Force single output…
-
Practical Use of Code Studios
I am starting this thread to learn about how others are using code studios (such as VSCode, JupyterLab, and Streamlit) and for what purposes. In our organization, we were initially excited about the feature introduced in DSS v11. However, our enthusiasm was quickly dampened by the fact that users cannot select the…
-
i am looking a strange error while accessing my dataflows
flows were working previously but now this error window limits me to use any of my projects
-
Usage of tags of github in version control
Hi, I've connected my project version control to a remote repository hosted on github. Usually when developping with github I use tags to check releases of my code to move it to production, to being aware of which version of code I've deployed. I would like to use it in Dataiku in the same way, creating a bundle from the…
-
Clarification on handling streams through the API?
In this video: At around the timestamp 3:23, the last line of code shows folder.upload_stream("name_of_file_in_folder", f) Which appears to be incorrect since "folder" is undefined. Should this have been handle.upload_stream(… instead? (Also, please add version 13 as a version option in the ask a question form?) Operating…
-
Reorder columns in a dataset
Hello, I would like to reorder the columns of my dataset without using a Prepare Recipe. Is it possible ? Thank you !
-
Questions on quick modeling prediction
I have questions on the quick modeling part of dataiku. Now I am completing an assignment, but I find that the column of data in my label data used to calculate the cost does not appear in the unlabeled data. This problem caused me to be unable to predict unlabeled data with the model I trained. I would like to ask how to…
-
K-Modes supported
Hi, I noticed that Dataiku supports K-means clustering but couldn't find support for k-modes. Am I missing some documentation. If not, are there any plans to support k-modes clustering? thx Operating system used: Windows 10
-
Overview of all running jobs
Hello Community, Is it possible to get an overview of all running jobs? Is it possible by checking the internal metrics dss_jobs and then filtering on state empty? This seems to provide an overview of all jobs running. However, for example, if the job is partitioned, is it also possible to see all the sub-jobs running?…