-
Alert/Notification for Job failure
Hello, I want to send an alert/notification when a job fails in Dataiku. How can I implement this? Thanks
-
Object not serializable error on an if then else statement run on spark engine
I get the following error when I run an if-then-else statement on Spark. It runs fine on the local engine. I have if-then-else statements that run just fine in other sections of my workflow in DSS. How should I interpret this, and how do I debug it? Job failed: Task not serializable, caused by:…
-
How to reuse a group of blocks with different inputs
I have a complex process involving several visual recipes and Python code blocks that I'd like to apply to several different datasets, obtaining several output datasets. "Convert into an application-as-recipe" seems the perfect solution but I…
-
How to extract text from doc files?
I have a few .doc files in my managed folder, and I want to extract the text from them using a Python recipe. Please guide me on how I can achieve this. Or is there any way to convert the .doc files into .docx files programmatically and then extract the text from the converted files? Thank you in advance. Operating system used:…
-
Python recipe Excel packages
Hello everyone, I have an issue while building an Excel file with a Python recipe. I need to use specific packages (xlsxwriter, xlwings, openpyxl and so on) that work very well in Anaconda. But in a Dataiku Python recipe that's not the case. I assumed that a Python recipe could do whatever Python usually does. Example below of a line that…
-
Convert scientific numbers
Hello, I would like to convert numbers in scientific format (1,5E7 for example) into a plain number format (15 000 000,00). Does someone know how to do that? Thank you!
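As an illustration of the conversion being asked about, here is a minimal sketch in plain Python. It is not Dataiku-specific and assumes the values arrive as strings with a comma as the decimal separator, and that the target format uses a space for thousands and a comma for decimals. The function name `format_scientific` is hypothetical. Note that 1,5E7 is 15 000 000.

```python
from decimal import Decimal


def format_scientific(text: str) -> str:
    """Convert a string such as '1,5E7' to '15 000 000,00'.

    Assumption: input uses a decimal comma; output uses a space as the
    thousands separator and a comma as the decimal separator.
    """
    # Decimal parses the exponent exactly; swap the decimal comma for a dot first.
    value = Decimal(text.strip().replace(",", "."))
    # Format with comma thousands and dot decimals, then swap the separators.
    us_style = f"{value:,.2f}"
    return us_style.replace(",", " ").replace(".", ",")


print(format_scientific("1,5E7"))  # 15 000 000,00
```

`Decimal` is used instead of `float` so that the digits are not disturbed by binary rounding before formatting.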
-
Dataset overwritten instead of error
When building datasets I have seen that, on changes to the schema given by the recipe, the dataset is fully overwritten, data and all. This means that, when a recipe suddenly does not return the correct schema, all previous data is lost… Previously we did get an error message if this was the case and we would not lose any…
-
Sorting dates in Pivot Tables
Hey everyone! I created one Pivot Table as an Insight to include it in my Dashboard. The Dataset has a date column. It is parsed correctly and the Insight has recognised the field as date. In the dataset, the table is ordered by the date field Descending. However, when I create the Insight with the Pivot Table, the only…
-
Possible to set importLibrariesFromProjects using Python
Hi, I need to be able to set importLibrariesFromProjects using Python. Is this possible? Thanks. Operating system used: Windows 10
-
How to shift data in a column
I have a main question and, as part of my solution to it, a follow-up question. Main question: I have a column in my time series data (let's call it status) that is populated with on/off binary data. I need to find a way to create a column that counts the days since the last time status was on. So basically when the status…
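One way to picture the "days since status was last on" column is the following sketch in plain Python. It assumes rows are already sorted by date in ascending order, that a row where status is on counts as 0, and that rows before the first "on" get no value. The function name `days_since_last_on` and the sample data are hypothetical, since the original question is cut off.

```python
from datetime import date


def days_since_last_on(dates, statuses):
    """For each row, days elapsed since the most recent row where status was on."""
    result = []
    last_on = None  # date of the most recent "on" row seen so far
    for day, status in zip(dates, statuses):
        if status:
            last_on = day
        # Assumption: None until the first "on" appears, 0 on an "on" day itself.
        result.append(None if last_on is None else (day - last_on).days)
    return result


# Hypothetical sample: six consecutive days, status on for day 2 and day 5.
dates = [date(2023, 1, d) for d in range(1, 7)]
statuses = [0, 1, 0, 0, 1, 0]
print(days_since_last_on(dates, statuses))  # [None, 0, 1, 2, 0, 1]
```

Because the count is computed from the dates themselves, gaps in the time series are handled without any extra logic.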