-
Fast appending of a dataset into a historical dataset with an identical schema
Hello, I am trying to append a dataset that is generated weekly to a historical dataset (identical schema) that contains years' worth of data. Let's call the first one data1w and the historical one datahist. Dataiku will not accept two input datasets (data1w and datahist) while the output is datahist (i.e. same as one of the…
-
ModuleNotFoundError: No module named 'dku_utils'
I am using a Python recipe for one of the projects from the Dataiku gallery (RFM segmentation), and I am getting this error. If the dku_utils module has changed, please guide me. Thanks. Operating system used: Windows 10
-
How to add data to an existing dataset with Python?
I have a dataset named weather_data, and I want to add data to it every day. How can I do this with Python?
-
Accelerating Data Science with Snowflake and Dataiku
Job failed: SQL compilation error: Stage 'PC_DATAIKU_DB.PUBLIC.DATAIKU_DEFAULT_STAGE' does not exist or not authorized. Operating system used: Windows
-
Creating many jobs with a python recipe
Hi folks! I have a scenario in which a block of Python code performs a task. I also have a list of items, and I want to run the same job for each item in the list: the exact same code, just with the item name injected in. I could do this with a Python loop, but my list contains about 30 items, and if one…
-
Zip Codes 5 and 9 digits
I have a dataset with a mixture of US zip codes (both 5 and 9 digits long) and some postal codes from Canada, the UK, and other countries. Is there a formula or another way for Dataiku to add the hyphen to 9-digit US zip codes without affecting the other codes?
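One possible approach, sketched in plain Python rather than as a Dataiku formula: treat a value as a 9-digit US zip code only when it consists of exactly nine ASCII digits, and leave everything else untouched. The function name `format_zip` is hypothetical, and the assumption that no non-US code in the data is a bare nine-digit string would need to be checked against the actual dataset.

```python
import re

# Assumption: a value made of exactly nine ASCII digits is a US ZIP+4 code
# that is missing its hyphen. Anything else is returned unchanged.
_NINE_DIGITS = re.compile(r"[0-9]{9}")


def format_zip(value: str) -> str:
    """Insert a hyphen into 9-digit US zip codes; leave other values as-is."""
    text = value.strip()
    if _NINE_DIGITS.fullmatch(text):
        return f"{text[:5]}-{text[5:]}"
    return value


if __name__ == "__main__":
    for sample in ["123456789", "12345", "12345-6789", "K1A 0B1", "SW1A 1AA"]:
        print(repr(sample), "->", repr(format_zip(sample)))
```

The same rule (exactly nine digits, nothing else) could be applied per row in a Python recipe or expressed as a regular-expression replacement in a preparation step.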
-
About widgets in Dataiku
Hi Dataiku team, I'm exploring the ipycytoscape package, which can render a graph whose nodes and edges I can drag and drop dynamically. I tried the following example from https://ipycytoscape.readthedocs.io/en/latest/:
from ipycytoscape import CytoscapeWidget
import networkx as nx
G =…
-
Dataiku App MULTISELECT: Select an option multiple times
Hi, I'm using a Dataiku App to apply filters on datasets. Note that the 1st filter is applied to the first dataset, the 2nd filter to the second dataset, etc. The problem: I can't select the same filter twice to apply it to both datasets. What should I change on my Multiselect to do it (if it's…
-
ETL DataStage to Dataiku Migrations
Hello, We have a legacy ETL system that uses IBM DataStage jobs to perform the ETL. Is there an automated way to migrate these DataStage jobs to Dataiku flows? We can export the DataStage jobs in JSON format, but we are not sure whether that can be leveraged for the migration.
-
How to show Spark progress within Jupyter Notebook?
I'm used to working in Jupyter on standard AWS EC2 instances and via WSL. In both of these, PySpark displays progress while performing queries / transformations. Is there a way to get this behaviour in Dataiku's Jupyter implementation? As always, I have set "spark.ui.showConsoleProgress" to "true"; however, it does not…