Deploy a non-"Saved Model" for real-time scoring

RohitRanga Registered Posts: 41 ✭✭✭✭
edited July 16 in General Discussion

Hello Dataiku community!

I have a multi-label text classifier trained with scikit-learn and stored in a managed folder as a .pkl file. I came across this tutorial on how to create a Python model prediction endpoint.

https://doc.dataiku.com/dss/latest/apinode/endpoint-python-prediction.html#classification

The input to the endpoint needs to be something like this:

{'text': 'I have my graduation ceremony and birthday next week.'}

And the output like this:

{'text': 'I have my graduation ceremony and birthday next week.',
 'predictions': [{'values': ['BIRTHDAY', 'GRADUATION']}]}

Any thoughts on how this can be done? I would compute embeddings for the text using my custom logic before it is fed into the model.
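
To make the question concrete, here is a minimal sketch of the scoring logic I have in mind, written as plain Python so it can run outside DSS. Everything named here is an assumption for illustration: `LABELS`, `embed`, `load_model`, `score`, and the `KeywordModel` stand-in are hypothetical, the keyword-based `embed` only stands in for my real embedding logic, and the model is assumed to return one 0/1 indicator row per input, as a scikit-learn multi-label classifier typically does. How the managed folder path reaches the endpoint code is not shown and would need to be checked against the linked documentation.

```python
import pickle

# Hypothetical label order; it must match the columns the model was trained on.
LABELS = ["BIRTHDAY", "GRADUATION", "WEDDING"]


def load_model(path):
    """Unpickle the trained classifier from a local file path."""
    with open(path, "rb") as f:
        return pickle.load(f)


def embed(text):
    """Placeholder for the custom embedding logic: keyword-presence features."""
    lowered = text.lower()
    return [float(label.lower() in lowered) for label in LABELS]


def score(model, text):
    """Embed one text, predict, and build the response shape from the question."""
    # Assumes model.predict returns one 0/1 indicator row per input row.
    indicator_row = model.predict([embed(text)])[0]
    values = [label for label, flag in zip(LABELS, indicator_row) if flag]
    return {"text": text, "predictions": [{"values": values}]}


class KeywordModel:
    """Stand-in for the unpickled scikit-learn model, for local testing only."""

    def predict(self, rows):
        return [[int(v > 0) for v in row] for row in rows]


result = score(KeywordModel(), "I have my graduation ceremony and birthday next week.")
print(result)
```

In the real endpoint, `KeywordModel()` would be replaced by `load_model(...)` pointing at the .pkl file, loaded once at startup rather than on every request.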
