Improve user experience for hierarchical data

Nate · April 2021

In Pandas, tables with hierarchical indexed are rendered gracefully, enabling users to quickly visualize their datasets in detail and perform mass transformations to data deep within hierarchies. In Dataiku, hierarchical data is currently visualized as a string. These strings can be hovered over to expand, but don't get formatted neatly and are often truncated.

Dataiku does already have some fairly powerful processors to manipulate hierarchical data, but lacks visualization capability to make these processors quickly useful for exploration and preparation.

I think it would improve Dataiku tremendously for the capability to visualize hierarchical data in some way to be incorporated. Pandas has a couple styles for this:

pands-multiindex-initial flyers TZli0

There are some other styles for displaying this type of data as well:

39775506-51a08c70-52bb-11e8-86a6-1ef4ba7f497a VaDbh 6So12 0049174e57d3cc2b3763ab4d8842eb41 Erovc 35484781-f5faabd0-0422-11e8-9144-b622394f96a1 nested-jquery-grids joins_4.max-1400x1400 79d187cbae1c1dc3fdff3b21290e03dd GeneratedReport

Even simple rendering of json objects on hover would be a big improvement, but it would be very cool if some type of deeper rendering enabling all of the advantages of Dataiku column metadata could be enabled, especially where all the fields in a json object or array happen to match a strict schema. Then it also becomes much easier to apply prepare recipe processors to hierarchical data without needing to flatten the schema. A common pattern in prepare recipes is to flatten a layer of hierarchical data, process it, then to reformat it as json so another json column's data can then be processed without creating a product join in the process. I think this could be less laborious if nested fields were recognized as part of a table's schema, allowing users to quickly apply processors multiple layers deep in objects while maintaining the original structure of a table.

Ashley · May 2021

Thanks for your idea, @natejgardner

Your idea meets the criteria for submission, we'll reach out should we require more information.

If you’re reading this and would love to see this capability in Dataiku DSS be sure to kudos the original post or leave a comment!

Take care,

Ashley

Improve user experience for hierarchical data

In the Backlog · Last Updated April 2021

Comments

Categories

Setup Info

Tags