Sign up to take part
Registered users can ask their own questions, contribute to discussions, and be part of the Community!
Registered users can ask their own questions, contribute to discussions, and be part of the Community!
Hi
we use Dataiku as our primary tool to create data sources for Tableau and any other analytical stuff. I just joined the team and I am finding it difficult to understand the Bigquery target table names. They have everything named as stacked, joined, prepared and the final table sometimes has weird names too.
I would like to have a standard format for namings. I know dataiku doesn't have any rules but can anyone post any tips on how you guys name your datasets in your org ?
need tips, suggestions any ideas.
Hi there and welcome to the Dataiku community!
This is just my opinion.. but it is generally good practice to use a dataset naming convention that is both simple in nature but also prescriptive enough for others so that they can quickly get a good idea of what the data or table might contain without spending too much time trying to figure this out. In my experience, names like table_name_stacked, table_name_joined, table_name_prepared, and table_name_final can be fairly common as it tells you what operations or recipe was used previously to produce this dataset/table along with where it belongs in the Flow (table_name_final likely being the final output of the data transformation, cleansing, preparation, wrangling, etc. that was done and should thus be used as the input for your BI tool/layer).
However, this is merely one perspective and can indeed differ quite greatly based on the individual's or organization's preferences and needs/requirements.
Best,
Andrew