Sign up to take part
Registered users can ask their own questions, contribute to discussions, and be part of the Community!
Added on June 23, 2022 10:43AM
Likes: 9
Replies: 3
Team members: Ryan Moore & Tony Olson
Country: United States
Organization: Snow Fox Data
Snow Fox Data is a premier data strategy, data science, and analytics solutions provider. Headquartered in Wisconsin and serving customers worldwide, we provide a vast landscape of knowledge that supports your success through data-driven decision-making. A passionate team of data architects, data scientists, data engineers, and data analysts, Snow Fox Data empowers you to make clearer decisions through clever data solutions.
Awards Categories:
At Snow Fox Data, we work with numerous customers who utilize Dataiku in their Data Science and Analytics practice. Many of these organizations and analytics groups have not yet invested in an enterprise data cataloging tool or data lineage tool, which are often cost-prohibitive.
As part of the productionalization process for these customers, we have often witnessed them creating "homegrown" data cataloging solutions that typically consist of a combination of spreadsheets, Dataiku, and their preferred visualization tool. Their “homegrown” data cataloging solutions are labor-intensive to maintain and do not integrate with their developers, who are hands-on with the Dataiku projects.
Additionally, our clients struggle with data lineage. They are creating numerous downstream datasets in Dataiku. We often experience them saying “where did that column come from?” Without upstream data lineage visibility, our clients lose trust in the data and ultimately the solution’s business outcomes.
Because of this cataloging and lineage challenge, Snow Fox Data has created a free Dataiku plugin called THREAD™. THREAD™ is a lightweight catalog and lineage tool that directly integrates with Dataiku and its datasets. This tool allows for a single location to document data connected to Dataiku and to consume the catalog's contents in a manner that is easy and efficient for business practices.
THREAD™ is implemented as a Dataiku web app plugin that has a very easy installation process and has the ability to securely scan an entire (or partial) Dataiku node to allow for lineage view and documentation. The indexes and metadata that are generated by THREAD™ are saved as Dataiku datasets in a project flow, making it very easy to export indexes and metadata for exposure in 3rd party visualization tools such as PowerBI or Tableau.
Use Case Stage: In Production
THREAD™ has already been deployed on 100s of projects at multiple joint Snow Fox Data and Dataiku clients. Here are some areas of business value THREAD™ users have obtained:
THREAD™ is built on top of Dataiku! All the value THREAD™ creates is an extension of and possible because of Dataiku.
Dataiku’s flexible and extensible platform allows the community to contribute and share solutions across organizations and industries easily. The ability to write custom plugins and integrate with the Python API provide the capability to achieve exceptional business value through custom integrations.
The native security integration removes governance concerns on building application solutions on top of Dataiku and thus increases the speed of the innovation process.
Amazing work, congratulations! By the way, is it possible to install the plugin somehow?
Great work! Are external users able to download this plug-in?
Hi @Ignacio_Toledo
@harsh9127
- the Dataiku team is reviewing the plugin for approval in the store right now. Send me a DM if you'd like to get "early access", I can get you the plugin directly.