Submit your innovative use case or inspiring success story to the 2023 Dataiku Frontrunner Awards! LET'S GO

Data validation compared to previous data

SaschaS
Level 2
Data validation compared to previous data

Hello,

is there a way to check and validate data?
I have webshop traffic data in my spreadsheet on a daily basis. These are divided into our different channels like SEA, Price Search Engines, SEO and so on.
I'm looking for a way to check if there are major discrepancies in new data compared to previous ones.
In this way I would like to check whether we have tracking and/or transmission errors and then send a message to the responsible employee if necessary.
Maybe someone has a good idea how I can implement this.

Thanks and Greetings
Sascha

0 Kudos
1 Reply
kathyqingyuxu

Hey @SaschaS ,

 

As a start you could try looking into the following:

1. Creating a scenario with specific data checks:Concept: Metrics & Checks — Dataiku Knowledge Base, and then send a email or use a webhook to send an alert (Concept: Scenarios — Dataiku Knowledge Base). In your scenario as a first step you could run the checks and only if the checks are approved continue to run the remainder of the flow or else send a message to users warning on a data change.

2. You could also take a look at the Dataiku Data Drift functionalities: Input Data Drift — Dataiku DSS 11.0 documentation

 

Hope this helps!

 

Best,

Kathy

0 Kudos