-
multiply
Hi - I have a very simple request but I am unable to work it out. I have 2 columns on a dataset which I need to multiply one by the other? Example one column is in USD and then I have a rates column and I want to multiply one by the other to give a number?
-
Where Can i watch YouTube Trending Videos
There is one website where you can easily watch what's current trending YouTube videos. Explore the latest trending videos on YouTube in the United States. From gaming and music to sports, movies, news, and tech, discover the top 50 YouTube trending videos USA offers a wide range of content, making it a great source of…
-
pivot recipe
i want to create column in pivot recipe with specific sorting ,any idea how to do it example: col1 col2national failinternationalsuccessnationalsuccessinternationalfail pivot national_success,national_fail,international_success,international_fail so pivot table column col1 ,col2 sorted desc
-
Removing (209) specific words/strings/sp characters from string column
I am preparing a column containing 'Comments/narrative' (string) for a word cloud data set. I have a list of 209 specific words i'd like to remove from the column ('A', 'AN', 'THE', etc). I'd rather not use the 'find and replace' recipe, for obvious reasons. Can anyone recommend a more efficient solution? Thank you!
-
Export a pandas dataframe directly to a local file
Hello everybody, I am building a Jupyter Notebook in a Dataiku project, and I would like to know if it is possible to export a pandas DataFrame directly to my local computer. I saw discussions in the forum that explained how to export pandas Dataframes into a Dataiku Managed folder, but I would like to go one step further.…
-
np.bool versus bool
To the Dataiku developers, Dataiku appears to use `np.bool` or `numpy.bool` in its codebase. This has been deprecated since Python 2.4 or so. I strongly recommend replacing these by `bool` to remove many warning errors that pollute the detailed job diagnostics. If there is a way to turn these warnings off, could you please…
-
Use reporter in scenarios to send email notifications
Hi, I have made a setup of scenaios using python api to send emails based on a dataset to various recipients. My concern is that with tme the number of recipients will increase, so is there a possibility that the process may fail in between? The set up includes a base dataset, python script to set the recipient, subject…
-
Custom Checks using history of metric values
Hi, I would like to define a custom check using the evolution of a metric value comparing last run et previous one. Is it possibe to do this in custom python ? By default, function provided uses a dictionary containing only last run. Thanks in advance.
-
using tesseract to read pdf
Hi all, I am a python script that uses tesseract engine in order to extract text from scanned pdf files. I have already tried to use tesseract OCR plugin but the results aren't what I am looking for. The python script that I wrote in my laptop is working fine. However, When I am using the same code in dataiku server I got…
-
Flow Document Templates
Is there a library of templates for the Flow Document Generator ... the default generator is a good start, and I'm aware of the scripting available to customize - but I would like to avoid reinventing the wheel if I can Thanks, Dave