-
ETL Datastage to Dataiku Migrations
Hello, we have a legacy ETL system that uses IBM DataStage jobs to perform the ETL. Is there an automated way to migrate these DataStage jobs to Dataiku flows? We can export the DataStage jobs in JSON format, but we are not sure whether that export can be leveraged for the migration.
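I can't point to an official converter with certainty, but a low-risk first step toward any semi-automated migration is to inventory the exported jobs and their stage types, to estimate which Dataiku recipes each job maps onto. A minimal sketch, assuming a hypothetical export structure (real DataStage exports differ by version, so treat the JSON shape below as an illustration only):

```python
import json

# Hypothetical DataStage JSON export: the real schema varies by
# DataStage version, so this structure is an assumption.
export = json.loads("""
{
  "jobs": [
    {"name": "load_customers",
     "stages": [{"type": "SequentialFile"}, {"type": "Transformer"},
                {"type": "DB2Connector"}]},
    {"name": "daily_sales",
     "stages": [{"type": "OracleConnector"}, {"type": "Aggregator"},
                {"type": "SequentialFile"}]}
  ]
}
""")

def inventory(export):
    """Count stage types per job, to estimate the Dataiku recipes needed."""
    summary = {}
    for job in export["jobs"]:
        counts = {}
        for stage in job["stages"]:
            counts[stage["type"]] = counts.get(stage["type"], 0) + 1
        summary[job["name"]] = counts
    return summary

print(inventory(export))
```

From a summary like this, stage types can be mapped by hand to Dataiku equivalents (e.g. connector stages to datasets, Transformer/Aggregator stages to prepare or group recipes); the mapping itself would still be manual.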
-
How to show Spark progress within Jupyter Notebook?
I'm used to working in Jupyter on standard AWS EC2 instances and via WSL. In both of these, PySpark displays progress while performing queries / transformations. Is there a way to get the same behaviour in Dataiku's Jupyter implementation? As always, I have set "spark.ui.showConsoleProgress" to "true"; however, it does not…
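For reference, outside Dataiku this is governed by a plain Spark property that can be set in `spark-defaults.conf` (or per-session); whether Dataiku's managed notebook kernels surface the console output is exactly the open question here:

```
# spark-defaults.conf
spark.ui.showConsoleProgress   true
```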
-
How do I get the model to automatically re-train after the dataset is updated
When the dataset is updated (including when the input schema changes), how do I get the model to automatically adapt to this change and re-train? For now, what's happening is that, since I also use tools like VIF to pre-filter the columns, when the data is updated, the columns that are filtered out will…
-
Basics 103 Tutorial - Data Preparation... Module - Error message
Hi, I have been working through the tutorials from Basics 101 to Basics 103, but I get stuck on the Data Preparation and Visualization in the Lab module (Basics 103). I get the following error message when I try to build the deployed script (see attachment). Thoughts on this one? Operating system used: Windows 10
-
Dynamic Snowflake schema not working with recipe variables that override project variables
Hi, we have a Snowflake connection that lets users set up project variables to supply the connection details:
{
  "snowflake_db": "MYDB",
  "snowflake_role": "MYUSER",
  "snowflake_wh": "MYWH",
  "snowflake_schema": "MYSCHEMA"
}
When we create a dataset in Dataiku, it looks like this: This works great, unless I override the Snowflake…
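To pin down the setup being described: Dataiku substitutes `${...}` variable references in connection/dataset settings, and a more local variable scope is expected to shadow the project-level one. A sketch of the intended layering (the override value below is a hypothetical illustration, not from the post):

```
Project variables:
{ "snowflake_schema": "MYSCHEMA" }

Recipe-level variables (override, hypothetical value):
{ "snowflake_schema": "SANDBOX_SCHEMA" }

Dataset settings then reference:
schema: ${snowflake_schema}
```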
-
Output in the same plugin recipe
I send data to a Qlik server through a plugin recipe. Now I want to add a new Python recipe before my data is sent to the Qlik server, not as a separate recipe as shown in the figure. Is that possible? Operating system used: Windows
-
Writing to dataset iteratively
Got a job that is IO-bound and memory-intensive, and I need to write the result(s) iteratively. The job is essentially parsing data from Excel files, filtering, aggregating, feature engineering, etc. Source: 1 billion records; result: 1.2 million records. I'm using a Python recipe with a multi-threaded asyncio function on the source and…
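The usual way to keep memory bounded is to flush results in chunks rather than materialising everything first; in a Dataiku Python recipe this maps onto the writer returned by `dataiku.Dataset.get_writer()`, but the core pattern is API-independent. A minimal, self-contained sketch (the `StubWriter` class and the chunk size are stand-ins, not Dataiku API):

```python
from itertools import islice

def chunked(iterable, size):
    """Yield successive lists of at most `size` items from `iterable`."""
    it = iter(iterable)
    while True:
        chunk = list(islice(it, size))
        if not chunk:
            return
        yield chunk

class StubWriter:
    """Stand-in for a dataset writer; real code would write each chunk
    through a dataiku.Dataset writer instead."""
    def __init__(self):
        self.rows_written = 0
    def write_rows(self, rows):
        self.rows_written += len(rows)

# Simulate a large, lazily-produced result stream being flushed in
# chunks, so only one chunk is ever held in memory at a time.
results = ({"id": i, "value": i * 2} for i in range(10_000))
writer = StubWriter()
for chunk in chunked(results, 1_000):
    writer.write_rows(chunk)

print(writer.rows_written)  # → 10000
```

With the real API, each chunk would go to the dataset writer inside a `with dataset.get_writer() as writer:` block, if I recall the Dataiku API correctly, so partial results land on disk as they are produced.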
-
Git-Dataiku Integration Best practices
Hi, I'm looking for best practices for working with Git integrated with Dataiku. I have read about a few limitations around merging and about keeping a copy of the whole project to create branches in Dataiku. Hence, I'm reaching out in case somebody has already done this for their project or knows about it.
-
Error exploring missing partition in partitioned data set
Hi Dataiku, while fixing some bugs in my flow, I ran into the following problem:
- Explore a specific partition of my data set, which has some error
- Realise I don't want to include this partition anyway, so I go upstream in the flow and remove it from the list of partitions to generate
- Clear and rebuild the problem…
-
Cannot create these recipes (geojoin, update, CustomCode_forward_geocoding)
Hi there, when I tried to create the following recipe types using the Python API, I got a NoneType builder and could not create them. How can I create these types of recipes? Thanks.
builder1 = my_project.new_recipe('geojoin')  # builder1 is NoneType
builder2 = my_project.new_recipe('update')  # builder2 is NoneType
builder3 =…