-
Export a pandas dataframe directly to a local file
Hello everybody, I am building a Jupyter Notebook in a Dataiku project, and I would like to know if it is possible to export a pandas DataFrame directly to my local computer. I saw discussions in the forum that explained how to export pandas DataFrames into a Dataiku managed folder, but I would like to go one step further.…
-
np.bool versus bool
To the Dataiku developers: Dataiku appears to use `np.bool` or `numpy.bool` in its codebase. This alias has been deprecated since NumPy 1.20. I strongly recommend replacing it with `bool` to remove the many warnings that pollute the detailed job diagnostics. If there is a way to turn these warnings off, could you please…
-
Use reporter in scenarios to send email notifications
Hi, I have set up scenarios using the Python API to send emails based on a dataset to various recipients. My concern is that the number of recipients will increase over time, so is there a possibility that the process may fail partway through? The setup includes a base dataset, a Python script to set the recipient, subject…
-
Custom Checks using history of metric values
Hi, I would like to define a custom check based on the evolution of a metric value, comparing the last run with the previous one. Is it possible to do this in custom Python? By default, the provided function uses a dictionary containing only the last run. Thanks in advance.
-
Using Tesseract to read PDF
Hi all, I have a Python script that uses the Tesseract engine to extract text from scanned PDF files. I have already tried the Tesseract OCR plugin, but the results aren't what I am looking for. The Python script that I wrote on my laptop works fine. However, when I use the same code on the Dataiku server I got…
-
Flow Document Templates
Is there a library of templates for the Flow Document Generator? The default generator is a good start, and I'm aware of the scripting available to customize it, but I would like to avoid reinventing the wheel if I can. Thanks, Dave
-
The automation node DSS license
What is the function of the automation node DSS license?
-
How to partition a non-partitioned dataset using Python
I want to partition a non-partitioned dataset. Re-dispatching is not working, as I want the distinct partition values to come from a column of the non-partitioned dataset.
-
How to Obtain and Utilize Dataiku Username in a SQL Query (Oracle)
I would like to add 2 columns to our most important SQL tables (Oracle) and use them as an "audit log" to track who made the change and when the change was made: AUDIT_USER_NAME and AUDIT_TIMESTAMP. In my SQL Query Recipe, I am using the following columns: SELECT user as audit_user_name, sysdate as audit_timestamp…
-
Dataiku version control vs remote version control
Hi Team, which is recommended for maintaining version control of a project in Dataiku: the built-in version control or remote version control (Bitbucket, Git repo), and why? Please also share a comparison document if you already have one. Thanks in advance.