Improve user experience for hierarchical data

In Pandas, tables with hierarchical indexed are rendered gracefully, enabling users to quickly visualize their datasets in detail and perform mass transformations to data deep within hierarchies. In Dataiku, hierarchical data is currently visualized as a string. These strings can be hovered over to expand, but don't get formatted neatly and are often truncated.

Dataiku does already have some fairly powerful processors to manipulate hierarchical data, but lacks visualization capability to make these processors quickly useful for exploration and preparation.

I think it would improve Dataiku tremendously for the capability to visualize hierarchical data in some way to be incorporated. Pandas has a couple styles for this:

 

pands-multiindex-initialflyersTZli0

 

There are some other styles for displaying this type of data as well:

39775506-51a08c70-52bb-11e8-86a6-1ef4ba7f497aVaDbh6So120049174e57d3cc2b3763ab4d8842eb41Erovc35484781-f5faabd0-0422-11e8-9144-b622394f96a1nested-jquery-gridsjoins_4.max-1400x140079d187cbae1c1dc3fdff3b21290e03ddGeneratedReport

 

Even simple rendering of json objects on hover would be a big improvement, but it would be very cool if some type of deeper rendering enabling all of the advantages of Dataiku column metadata could be enabled, especially where all the fields in a json object or array happen to match a strict schema. Then it also becomes much easier to apply prepare recipe processors to hierarchical data without needing to flatten the schema. A common pattern in prepare recipes is to flatten a layer of hierarchical data, process it, then to reformat it as json so another json column's data can then be processed without creating a product join in the process. I think this could be less laborious if nested fields were recognized as part of a table's schema, allowing users to quickly apply processors multiple layers deep in objects while maintaining the original structure of a table.

1 Comment
AshleyW
Dataiker

Thanks for your idea, @natejgardner 

Your idea meets the criteria for submission, we'll reach out should we require more information.

If you’re reading this and would love to see this capability in Dataiku DSS be sure to kudos the original post or leave a comment!

Take care,

Ashley

Status changed to: In the Backlog

Thanks for your idea, @natejgardner 

Your idea meets the criteria for submission, we'll reach out should we require more information.

If you’re reading this and would love to see this capability in Dataiku DSS be sure to kudos the original post or leave a comment!

Take care,

Ashley