Docs for "pandasutils"?

JustinMinnion
JustinMinnion Registered Posts: 2 ✭✭

Hello,

My apologies if this is a remedial question, but at the start of every Python recipe the boilerplate code includes an import of:

from dataiku import pandasutils as pdu

Is there documentation for pandasutils? Is it a package that can be used in Python recipes?

I've tried looking through the Dataiku Developer Guide, and I even tried seeing if the dataiku package had its source code published anywhere (I could only find dataiku-api-client-python ), but no luck.

It's actually been (in a funny way) challenging to look for these docs because so many unrelated code samples include this import line.

Any help is appreciated, thanks in advance!
-Justin

Operating system used: OS X

Best Answer

  • Turribeach
    Turribeach Dataiku DSS Core Designer, Neuron, Dataiku DSS Adv Designer, Registered, Neuron 2023 Posts: 2,024 Neuron
    Answer ✓

    The "dataiku" package is a bit of kludge if you ask me (not to be confused with the dataiku-api-client, see here for Dataiku Python APIs). First of all it isn't available on PyPi but you can install it from a tar bundle you can get from your DSS node. But even that bundle doesn't have all the modules that a full Dataiku installation will have as I noted on this thread. The dataiku module gets imported via sys.path.append(). It lives on the folder where you unpacked the Dataiku installer. In a MacOS installation that will be different since MacOS has a separate installer which leaves the module under:

    /Users/[your MacOS ID]/Library/DataScienceStudio/kits/dataiku-dss-[your current DSS version]-osx/python/dataiku

    You can find the pandasutils module under ./core/pandasutils.py

Answers

Setup Info
    Tags
      Help me…