Outer Join "Not available with this engine"
Hello everyone
I have two datasets that were imported from my PC, so they are not stored in any database. I am trying to outer join these two datasets; however, the option is disabled and shows an error saying: "Not available with this engine".
Is there any way to overcome this error, or at least an alternative method that gives the same result?
Best regards,
Ahmed
Best Answer
-
Turribeach Dataiku DSS Core Designer, Neuron, Dataiku DSS Adv Designer, Registered, Neuron 2023 Posts: 1,987 Neuron
A few solutions for this:
- Install PostgreSQL (https://www.postgresql.org, it’s free) on your PC so you can store your datasets in a SQL database
- Use a cloud database like GCP’s BigQuery or AWS Redshift
- Convert your recipe to Python code
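For the Python option, here is a minimal sketch of a full outer join using pandas. The sample data, the column names (`id`, `name`, `amount`) and the helper `full_outer_join` are made up for illustration. In a Dataiku Python recipe you would normally load the inputs with `dataiku.Dataset("...").get_dataframe()` and write the result with `write_with_schema(...)` instead of building the frames by hand; the dataset names there depend on your own flow.

```python
import pandas as pd


def full_outer_join(left, right, key):
    """Full outer join of two DataFrames on a shared key column.

    Rows that exist on only one side are kept, with NaN in the
    columns that come from the other side. The extra `_merge`
    column records where each row came from.
    """
    return left.merge(right, on=key, how="outer", indicator=True)


# Hypothetical sample data standing in for the two uploaded datasets.
# In a Dataiku recipe these would come from something like
# dataiku.Dataset("my_input").get_dataframe() (name is an assumption).
customers = pd.DataFrame({"id": [1, 2, 3], "name": ["Ann", "Bob", "Cy"]})
orders = pd.DataFrame({"id": [2, 3, 4], "amount": [10.0, 20.0, 30.0]})

joined = full_outer_join(customers, orders, "id")
print(joined)
```

The `_merge` column is optional, but it makes it easy to check which rows matched on both sides and which came from only one dataset. Drop it before writing the output if you don't need it.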
Answers
-
I would like to implement the first method, but after storing my datasets in the database, can I use them in place of the datasets already in the flow?
-
Turribeach Dataiku DSS Core Designer, Neuron, Dataiku DSS Adv Designer, Registered, Neuron 2023 Posts: 1,987 Neuron
You can, but it’s a manual process. First, you need to manually sync the datasets to the database using a Sync recipe. Then multi-select the datasets and select Change Connection in the right pane to move them to the new SQL connection. Then confirm that the datasets have the data in SQL. Finally, remove the Sync recipes and you will have your flow migrated to the new database.