Sign up to take part
Registered users can ask their own questions, contribute to discussions, and be part of the Community!
Added on August 1, 2023 3:43PM
Likes: 1
Replies: 0
Team members:
Do Van Nhan, Data Scientist, with:
Country: Vietnam
Organization: FPT Software
A subsidiary of FPT Corporation – the leading ICT Group in Asia, FPT Software is a global technology and IT services provider with headquarter in Vietnam. Its decades of experiences in the global market have seen FPT Software empowering digital transformation for businesses worldwide, from various industries: Healthcare, BFSI, Manufacturing and Automotive, Communications, Media and Services, Aerospace and Aviation, Logistics and Transportation, Utilities and Energy, Consumer Packaged Goods, and Public Sector.
Awards Categories:
Last year, 2023, was a turbulent year with numerous occurrences such as the Ukraine War, the Energy Crisis, and Food Security. As a result, this period was a global impediment to economic development and practically all enterprises. We, FPT Software, are no exception.
Luckily, we have signed contracts with several significant organizations during this challenging period, many of them are airlines, not only in Europe but also in Japan and China. The difficulties they are interested in are becoming more diversified, ranging from projecting the number of passengers passing through security gates to assessing the distribution of passenger groups based on behavior, optimizing the problem of fuel consumption and making arrangements - staffing, and so on.
On the client side, they want to know how predictive models function, how algorithms work, how to evaluate them, and how to show a dashboard with alerts (alarms and alarms) to an external chat channel. That is why we selected Dataiku as a solution to these issues. "Dataiku is not a platform, it is a solution"
As we mentioned before, we are confronted with a comprehensive challenge:
1. Maintaining databases on both Redshift and Snowflake
2. Solving problems with different algorithms (MLOps)
Namely, time-series forecasting (for predicting the number of customers passing through security gates every 15 minutes), customer segmentation (segmenting customer behavior) and algorithm optimization (for fuel minimization and working time optimization without sacrificing flight quality/consumer experience).
3. Create dashboards to perform analytics on predicted customers through the gate, fuel consumption, and distribution of customer groups.
Using Dataiku's Dashboard is perfectly suitable in this case, we can flexibly apply both Jupyter Notebook, Scenarios and Webapp right in one dashboard to visualize the entire working flow for each individual problem.
4. Send warning alert messages if the prediction model is significantly deviated.
We used webhook and content from "Automating the Model Lifecycle" to implement and solve this problem. All of these features from Dataiku is really helpful to us to solve these problems.
Business Area Enhanced: Accounting/Finance
Use Case Stage: In Production
Communicating and connecting with people, especially in marketing, is all about understanding their needs, behaviors, and expectations. We are asked to answers questions such as:
To answer these, companies should segment their users by shared similarities in order to establish, nurture, and maintain strong relationships, etc. How to analyze and solve these questions, that is why we use Dataiku to find the best solution.
These problems have helped aviation partners to optimize the number of employees, working hours at security gates, flight diagrams and as well as optimize fuel consumption without affecting service quality or customer experience. A user who does not have much experience as a data scientist can still use Dataiku to experiment with models without having to worry about whether he has a lack of knowledge or not.
Dataiku has an ML Diagnostics function, are designed to identify and help troubleshoot potential problems and suggest possible improvements at different stages of training and building machine learning models. Thus problems like Dataset Sanity Checks, Leakage Detection, Overfitting Detection, Training Reproducibility, etc are no longer difficult for a low-code or even no-code user. With these features, it is easy for them to receive prompts from the platform, and then review the model (in consultation with other users) before issuing the complete reports.
The amount of staff that the customers used (after we completely handed them over) to this process was greatly reduced compared to before they came to Dataiku. The workflow for calculating these models is done automatically, effectively reducing inadvertent human-caused calculation errors over a long period of time. The accuracy of the prediction models is greatly improved without violating any other phenomena such as Overfiting, Data leakage, etc, but also reduces the number of operators, optimizes time and customer experience row. That's what we've been hoping for.
Value Type:
Value Range: Hundreds of thousands of $