Survey banner
Switching to Dataiku - a new area to help users who are transitioning from other tools and diving into Dataiku! CHECK IT OUT

Saving custom prepare processor

Level 1
Saving custom prepare processor

Hello everyone,

I began using Dataiku a few days ago. I have a lot of "address" data, and I tried to use the Geocoder pluging in order to convert them into usable coordinates and geopoints.

As this plugin induces many java errors on my installation (NullPointerException), I decided to create my own prepare-recipe Python processor which call the Google Geocoding API.

Thus, I have 2 questions : 

  1. Is there a possibility to create multiple columns with a Python processor ? I would like to add latitude, longitude, for example, but I cannot find how to do that.
  2. is there a way to save a Python processor in order to re-use it on other recipes or other projects ?

Thanks a log for your answers.

Operating system used: AlmaLinux (AWS)

0 Kudos
1 Reply

Hi @PeteGore ,

The python processor within a prepared recipe can only apply to one column. 

You can compute both and put them to the targeted column for the python processor and later split it to separate columns with another processor. 

You can perform this in a Python recipe instead. 

You can leverage project libraries if you need to reuse python code

Or package a python recipe as a plugging:


0 Kudos