Stéphane Raguideau, GEM IS Digital & Data Accelerator
Yasine Meridja, GEM IS Digital & Data Accelerator
Organization: ENGIE - Global Energy Management & Sales
Global Energy Management & Sales (GEMS) is one of ENGIE’s Global Entity. At the heart of the energy value chain, we optimize the Group’s assets portfolio including electricity, renewable technologies, natural gas, environmental products and bulk commodities such as biomass. We also develop our own external commercial franchise worldwide and rely on four main expertise to offer tailor-made, innovative and competitive solutions. We provide services in energy supply & global commodities, energy transition services, risk management & market access, and asset management. With a staff of 1,400, offices in 15 countries including 8 main spots, GEMS has an extended geographical coverage in Europe, the US and Asia-Pacific.
Data science is at the core of our activities at ENGIE - Global Energy Management & Sales. Users across departments manage various sources of data, including:
This data is leveraged by the business for many purposes, including:
But access to the data was limited due to its sheer volume, security considerations, and tooling segmentation. In addition, coding skills were required for accessing it, which excluded many users who did not have a technical background.
Users needed to manually retrieve the data through a variety of applications, which caused several issues:
We implemented Dataiku in 2018. As with every new tool, Dataiku requires specific onboarding to maximize its benefits. At GEMS, our users have different profiles and backgrounds, hence they are not all familiar with data manipulation and analysis.
It is therefore important to provide them with training opportunities, regardless of their division (trading, risk, back office, finance, IT, etc.). This includes:
This training path is set to two months, after which users are given autonomy to access the data, manipulate it for their day-to-day needs, and most importantly, are able to explore new areas to gain more insights from their data, which is a key pillar of data democratization within GEMS.
We then empower these users across business teams and monitor their (and our) success through two key elements:
An important key to success is to transmit the desire to share knowledge, to collaborate and to facilitate user learning. To make this possible, we started a Dataiku Community with different communication channels:
Yammer, where we post articles around new Dataiku version, internal trainings, Quiz about Dataiku usage, Dataiku white paper and even the movie Data Science Pioneers: Conquering the Next Frontier. Every post has a relaxed tone and humoristic GIF and encourages readers to add comments.
A dedicated community Dataiku instance where are stored Internal trainings, plugins how-to's, and Dataiku project to pass Dataiku certification exam. All projects contain various descriptions, very detailed Wiki with many screenshots and comments and are structured with Flow Zone and Tags.
Becoming a data scientist or a data engineer doesn’t happen overnight though, hence we’ve developed a tool to manage our Dataiku instances, monitor all projects created, and ensure they’re following established governance and best practices - including data connections, scenarios, data sharing, partitioning, plugin types, etc.) All users are therefore able to produce insights safely!
Dataiku Watcher has been designed to easily manage all our Dataiku Instances and monitor:
For each of our Dataiku Instance installed on our servers, we are able to monitor:
Each project is analyzed (datasets, recipes, metadata, connections, plugins usages, automation, interaction with other projects & team, etc.) and a score is computed over 3 different lines: Collaboration/Design/Automation.
The score reflects the fact that the project matches the internal guidance and best practices promoted internally.
Thanks to this score, we can detect potential problems in advance and work with the teams by providing them with tailor-made training.
How Dataiku is used across the organization? How many hours do our users spend on the platform and how many times do they log in per month?
All this questions, and more, are essentials to ensure that Dataiku is still used, fit our needs, and is still relevant in our data tools framework. Dataiku Watcher provide all the required metrics on a daily basis.
Dataiku Watcher provides tools to ensure every license purchased is used. Inactive Data Scientist are detected (no activity over the last 60 days) allowing us to save on licensing costs by reallocating unused licenses (if any) instead of purchasing new ones.
Dataiku enables us to drive positive impact through the following features:
Dataiku enables business teams to scale their impact and save time across a broad range of operations:
“We save 1-2 hours a day when producing our liquidity risk indicators and we now have the possibility to focus our efforts on enriching our liquidity analysis instead”
Treasury and Liquidity Risk Officer
“We've automatized and accelerated our month-end closing process from 5 days to 15 minutes”
FinOps Project Manager
“The number of mispriced derivatives has decreased by approximately 66%, which saves us about 3 hours per day”
Lead Weather Derivatives Structurer
“Dataiku reduces the time for back testing our models from 1 week to 10 minutes. Additionally, the platform helps us understand each step of the analysis”
Weather Derivatives Structurer