Whats the meaning of the columns in the internat stats commits data?

Options
hisyam
hisyam Dataiku DSS Core Designer, Dataiku DSS ML Practitioner, Dataiku DSS Adv Designer, Registered Posts: 16

dataikustats.PNG

This dataset is pulled from internal stats commits which is produced by Dataiku. But I'm finding it hard to understand the last 5 columns it produced. Does anyone know what they mean or link me to reference that I can use?


Operating system used: Linux

Tagged:

Best Answer

  • Turribeach
    Turribeach Dataiku DSS Core Designer, Neuron, Dataiku DSS Adv Designer, Registered, Neuron 2023 Posts: 1,757 Neuron
    Answer ✓
    Options

    I think you have misunderstood this functionality. It's not meant to track data rows inserts, deletes, etc but metadata changes (definition changes done to your flow objects). Have a look at the Version Control menu in your project and you will be able to see the changes and review how they are persisted.

Answers

  • Turribeach
    Turribeach Dataiku DSS Core Designer, Neuron, Dataiku DSS Adv Designer, Registered, Neuron 2023 Posts: 1,757 Neuron
    Options

    The last 5 columns show you the changes the user has done in the commit. Most of the Dataiku objects in GUI are stored as XML or JSON files so these columns are indicating if new objects have been added or changed.

  • hisyam
    hisyam Dataiku DSS Core Designer, Dataiku DSS ML Practitioner, Dataiku DSS Adv Designer, Registered Posts: 16
    Options

    Some of the columns are pretty straightforward, for example

    added files: adding files into the project

    removed files: files that were removed from the project

    changed files: if any of the existing files in the project has been changed (might be wrong on this one)

    But what I don't understand is the other two, namely added lines and removed lines. I uploaded a dataset that only contains 4 rows of data and only changed the column names. However, if you noticed on the added lines column in the picture, you can see that the it says 343 added lines were made. This is the part that I have trouble on understanding. I don't fully understand why it said 343 lines were added whereas I've only changed the column names.

    Would be glad if anyone can help to clarify on this.

  • hisyam
    hisyam Dataiku DSS Core Designer, Dataiku DSS ML Practitioner, Dataiku DSS Adv Designer, Registered Posts: 16
    Options

    Thank you! The answer I was just looking for. I didn't know about the version control.

Setup Info
    Tags
      Help me…