How to find Max date from previous month from a dataset using sql for scenario variable. required max date of previous month not last date of previous month. Operating system used: windows Operating system used: Windows11
Greetings! I have a data set consisting of various columns, one being 'US State' - All states are represented multiple times. I would like to compile a random sample consisting of 2 samples for each state. I've read up on the different sampling methods and don't see how they will fit my use case. I welcome the Community's…
Hi - I have a very simple request but I am unable to work it out. I have 2 columns on a dataset which I need to multiply one by the other? Example one column is in USD and then I have a rates column and I want to multiply one by the other to give a number?
There is one website where you can easily watch what's current trending YouTube videos. Explore the latest trending videos on YouTube in the United States. From gaming and music to sports, movies, news, and tech, discover the top 50 YouTube trending videos USA offers a wide range of content, making it a great source of…
i want to create column in pivot recipe with specific sorting ,any idea how to do it example: col1 col2national failinternationalsuccessnationalsuccessinternationalfail pivot national_success,national_fail,international_success,international_fail so pivot table column col1 ,col2 sorted desc
I am preparing a column containing 'Comments/narrative' (string) for a word cloud data set. I have a list of 209 specific words i'd like to remove from the column ('A', 'AN', 'THE', etc). I'd rather not use the 'find and replace' recipe, for obvious reasons. Can anyone recommend a more efficient solution? Thank you!
Hello everybody, I am building a Jupyter Notebook in a Dataiku project, and I would like to know if it is possible to export a pandas DataFrame directly to my local computer. I saw discussions in the forum that explained how to export pandas Dataframes into a Dataiku Managed folder, but I would like to go one step further.…
To the Dataiku developers, Dataiku appears to use `np.bool` or `numpy.bool` in its codebase. This has been deprecated since Python 2.4 or so. I strongly recommend replacing these by `bool` to remove many warning errors that pollute the detailed job diagnostics. If there is a way to turn these warnings off, could you please…
Hi, I have made a setup of scenaios using python api to send emails based on a dataset to various recipients. My concern is that with tme the number of recipients will increase, so is there a possibility that the process may fail in between? The set up includes a base dataset, python script to set the recipient, subject…
Hi, I would like to define a custom check using the evolution of a metric value comparing last run et previous one. Is it possibe to do this in custom python ? By default, function provided uses a dictionary containing only last run. Thanks in advance.
Create an account to contribute great content, engage with others, and show your appreciation.