
Saving custom prepare processor

Hello everyone,

I began using Dataiku a few days ago. I have a lot of "address" data, and I tried to use the Geocoder plugin to convert the addresses into usable coordinates and geopoints.

As this plugin throws many Java errors on my installation (NullPointerException), I decided to create my own Prepare recipe Python processor, which calls the Google Geocoding API.

So I have two questions:

  1. Is it possible to create multiple columns with a Python processor? I would like to add latitude and longitude, for example, but I cannot find how to do that.
  2. Is there a way to save a Python processor so that I can reuse it in other recipes or other projects?

Thanks a lot for your answers.

Operating system used: AlmaLinux (AWS)

Hi @PeteGore ,

The Python processor within a Prepare recipe can only write to one column.

You can compute both values in the Python processor, write them to its single output column (for example as a comma-separated string), and then split that column into separate columns with another processor.
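As a rough illustration of that approach, here is a minimal sketch of a cell-mode Python processor that returns "latitude,longitude" in one cell, ready to be split on the comma by a later step. The column name `address`, the `API_KEY` placeholder, and the helper names `extract_lat_lon` and `geocode` are assumptions made for the example, not something taken from your project; the `geocode_fn` argument is only there so the function can be tried without calling the API.

```python
import json
import urllib.parse
import urllib.request

API_KEY = "YOUR_API_KEY"  # placeholder, replace with your own key
GEOCODE_URL = "https://maps.googleapis.com/maps/api/geocode/json"


def extract_lat_lon(payload):
    """Return (lat, lon) from a Geocoding API JSON payload, or None if no match."""
    results = payload.get("results") or []
    if payload.get("status") != "OK" or not results:
        return None
    location = results[0]["geometry"]["location"]
    return location["lat"], location["lng"]


def geocode(address):
    """Call the Google Geocoding API for one address (hypothetical helper)."""
    query = urllib.parse.urlencode({"address": address, "key": API_KEY})
    with urllib.request.urlopen(f"{GEOCODE_URL}?{query}", timeout=10) as resp:
        return extract_lat_lon(json.load(resp))


def process(row, geocode_fn=geocode):
    """Cell-mode entry point: return one value for the processor's output column."""
    address = row.get("address")  # assumed input column name
    if not address:
        return ""
    coords = geocode_fn(address)
    if coords is None:
        return ""
    lat, lon = coords
    # Pack both values into a single cell; split on "," in a later step.
    return f"{lat},{lon}"
```

A "Split column" step on the comma delimiter after this processor would then give you one column per coordinate.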

Alternatively, you can do this in a Python recipe, which can write latitude and longitude directly to separate columns.
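A minimal sketch of what such a Python recipe could look like is below. The dataset names `addresses` and `addresses_geocoded`, the column name `address`, and the helper names `add_coordinates` and `run_recipe` are assumptions for the example; `geocode_fn` stands for any function that takes an address and returns a `(lat, lon)` tuple or `None`.

```python
import pandas as pd


def add_coordinates(df, geocode_fn, address_col="address"):
    """Return a copy of df with 'latitude' and 'longitude' columns added."""
    # Geocode each address once; each result is a (lat, lon) tuple or None.
    coords = df[address_col].map(geocode_fn)
    out = df.copy()
    out["latitude"] = coords.map(lambda c: c[0] if isinstance(c, tuple) else None)
    out["longitude"] = coords.map(lambda c: c[1] if isinstance(c, tuple) else None)
    return out


def run_recipe(geocode_fn):
    """Recipe body: read the input dataset, add coordinates, write the output."""
    import dataiku  # only available inside Dataiku, so imported here

    df = dataiku.Dataset("addresses").get_dataframe()  # assumed input name
    result = add_coordinates(df, geocode_fn)
    dataiku.Dataset("addresses_geocoded").write_with_schema(result)  # assumed output name
```

Keeping the transformation in `add_coordinates` separate from the dataset reads and writes makes it easy to try on a small DataFrame before running it on the full dataset.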

If you need to reuse Python code, you can put it in the project libraries.
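For instance, a small helper like the one below could live in a module in the project library and be imported from your recipes. The module name `geocoding_helpers` and the function name `to_geopoint` are hypothetical; the sketch assumes you want geopoints written as WKT `POINT(longitude latitude)` strings.

```python
# Hypothetical module, e.g. saved as geocoding_helpers.py in the project library.


def to_geopoint(lat, lon):
    """Format a latitude/longitude pair as a WKT point string."""
    lat = float(lat)
    lon = float(lon)
    if not -90.0 <= lat <= 90.0:
        raise ValueError(f"latitude out of range: {lat}")
    if not -180.0 <= lon <= 180.0:
        raise ValueError(f"longitude out of range: {lon}")
    # WKT puts longitude first, then latitude.
    return f"POINT({lon} {lat})"
```

A recipe would then use it with something like `from geocoding_helpers import to_geopoint`.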

Or you can package a Python recipe as a plugin.
