Using Dataiku

How to search/replace a column based on another table?
Hello, I'm looking for a way to do a search/replace on a column based on a reference table that contains two columns: Refrence_Table (WrongValueToMatch, Replace); Table_Data (Col1, Col2, Col3, Col4). I want to do a search and replace on "Col1" of the Table_Data table. I'll have to search for the values of the…
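A minimal sketch of one way to approach the question above, assuming exact (whole-value) matches and plain Python rows rather than any particular Dataiku API. The function name `replace_from_reference` and the sample values are hypothetical; only the column names WrongValueToMatch, Replace and Col1 come from the post.

```python
def replace_from_reference(data_rows, reference_rows, column="Col1"):
    """Return copies of data_rows with exact matches in `column` replaced."""
    # Build a lookup: wrong value -> replacement value.
    mapping = {ref["WrongValueToMatch"]: ref["Replace"] for ref in reference_rows}
    # Values with no match in the reference table are left unchanged.
    return [{**row, column: mapping.get(row[column], row[column])} for row in data_rows]


# Hypothetical sample data, for illustration only.
reference_table = [
    {"WrongValueToMatch": "N.Y.", "Replace": "New York"},
    {"WrongValueToMatch": "L.A.", "Replace": "Los Angeles"},
]
table_data = [
    {"Col1": "N.Y.", "Col2": 1},
    {"Col1": "Boston", "Col2": 2},
    {"Col1": "L.A.", "Col2": 3},
]

cleaned = replace_from_reference(table_data, reference_table)
print([row["Col1"] for row in cleaned])
```

The lookup is built once, so each row costs a single dictionary access. If the intent is substring replacement rather than whole-value matching, this sketch would need a different approach.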
Excel uploading is not working as expected
Strange things happen when trying to upload a simple Excel file (.xlsx) to Dataiku from a local computer. I will try to clearly explain how I tried to upload the file and what went wrong: * I click the "+Dataset" button and select "Upload your files". * Drag a simple Excel file with two columns. * In the "Format/Preview" window I select "skip…
Creating chart with Snowflake data throws an error
Hello, I'm trying to create a pivot table from Snowflake data. Sometimes I receive the following error (on a random basis) Operating system used: Windows 10
New Jupyter notebook error 403 Forbidden
I've been using Dataiku and notebooks just fine before. My profile/user was recently closed since the (licence?) slot needed to be borrowed/transferred for a while. Then it was put back on with all the same authorization levels as all the other users. Now when I try to open any Jupyter notebook on a project I get "403…
TensorFlow slices method using containerized execution
Hi Experts, I am using TensorFlow slices to batch process my images for a CNN model. The snapshot of the code is as follows, and it runs very well in a Jupyter notebook in Dataiku using local execution. When I run the same code in Dataiku containerized execution it gives the following errors. I have seen documentation on…
Python Package Import Error
I am unable to import from my Python module even though I have properly defined the module in the Dataiku Python Library. I even added an __init__.py to the folder to make sure Dataiku "saw" it as a module folder. It didn't work for me with standard user permissions, and it doesn't work for me even after being elevated to…
Update/replace dataset
Hello, I'm a beginner in Dataiku. I have built a few flows and done the trainings. Now I'm stuck with a problem: I built a flow (prepare, filter, group, etc.) and now I've got some updated data. I would like to replace the very first dataset with my new data. The number of columns is the same, but some column names have changed. Is it possible? I know…
How to concat multiple columns by index?
I have 2 datasets with different column names but the same length. Dataset 1 has columns A, B and a length of 10. Dataset 2 has columns C, D and a length of 10. I want to get Dataset 3 by concatenating Datasets 1 and 2, so that its columns are A, B, C, D and its length is 10. I have tried join and stack, but I can't get what I expect. Operating system used:…
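A small sketch of the positional (side-by-side) concatenation described above, assuming both datasets are already in the same row order and are held as plain Python rows. The function name `concat_columns_by_position` and the sample values are hypothetical and not part of any Dataiku API.

```python
def concat_columns_by_position(rows_a, rows_b):
    """Pair row i of rows_a with row i of rows_b and merge their columns."""
    if len(rows_a) != len(rows_b):
        raise ValueError("both datasets must have the same number of rows")
    # zip pairs rows purely by position; no join key is involved.
    return [{**a, **b} for a, b in zip(rows_a, rows_b)]


# Hypothetical sample data: 10 rows each, columns A,B and C,D.
dataset1 = [{"A": i, "B": i * 10} for i in range(10)]
dataset2 = [{"C": str(i), "D": i % 2 == 0} for i in range(10)]

dataset3 = concat_columns_by_position(dataset1, dataset2)
print(len(dataset3), list(dataset3[0].keys()))
```

With pandas DataFrames, the comparable operation would be `pd.concat([df1.reset_index(drop=True), df2.reset_index(drop=True)], axis=1)`. Either way, the result is only meaningful if the row order of both inputs is stable.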
Protect schema of Postgres tables
Hello, When I launch recipes, it appears that output datasets connected to Postgres tables erase all parameters of those tables, including schema, constraints, foreign keys, etc. I would imagine a possibility to manage conflicts between the existing table and the Dataiku output. Is there any possibility to protect these…
Failed to create the tutorial - Advanced Designer Exam
Dear Dataiku community, I had started my advanced designer exam with a mistake in the project and to be able to answer the question correctly, I deleted the project from Dataiku. An error message came up but when I opened the homepage, the project was gone. I then tried to add the project again from DSS Tutorials but I am…

Trending Discussions

Logging in Dataiku notebook / recipe ...
Hello Team, I am working on PySpark recipes. I use a notebook to build the logic and then change it back into a recipe. The Dataiku and Spark operations (e.g. df.count()) emit a lot of log statements to the console and make the notebook very difficult to use. Is there a way for me to suppress logging from Dataiku and Spark APIs?…
Defining a global variable in the base name of the output file for a dataset
Hello, I am working on a flow that has a Python recipe that sets global variables. In the output dataset of the recipe, a couple of these variables are used to set the path and filename of the dataset, which is stored in Azure. From researching how to define the filename, the guidance is to set the "Force single output…
I am seeing a strange error while accessing my dataflows
Flows were working previously, but now this error window prevents me from using any of my projects.

Leaderboard

Turribeach 3581

tgb417 2477

Ignacio_Toledo 1079