The Data Hub provides several services across different use cases:
- Data processing and modelling
- A user interface for loading new data
- Analytics tools for querying data
The framework is centralized, modular and configurable. Through cloud services, it allows:
- Receiving data and processing it through event scheduling
- Processing large amounts of data
- Modelling information so that it is easily accessible through analytics tools
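The event-scheduled ingestion described above can be sketched as a simple dispatcher that routes each incoming event to a registered handler. This is a minimal illustration, not the actual implementation; the event types, field names and handlers are all assumptions.

```python
# Minimal sketch of event-driven ingestion: a handler is registered per
# event type and invoked when a matching event arrives.
from typing import Any, Callable, Dict


class EventRouter:
    def __init__(self) -> None:
        self._handlers: Dict[str, Callable[[dict], Any]] = {}

    def register(self, event_type: str, handler: Callable[[dict], Any]) -> None:
        self._handlers[event_type] = handler

    def dispatch(self, event: dict) -> Any:
        handler = self._handlers.get(event["type"])
        if handler is None:
            raise ValueError(f"no handler for event type {event['type']!r}")
        return handler(event)


router = EventRouter()
# Illustrative handler: a real one would trigger a processing flow.
router.register("file_arrived", lambda e: f"ingesting {e['path']}")
result = router.dispatch({"type": "file_arrived", "path": "sales.csv"})
```

New event types can then be supported by registering a new handler, without modifying the dispatch logic.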
The framework includes several processing layers, each with its own goal:
- First layer: receives and checks incoming data to automatically identify source-related issues
- Second layer: performs formal/technical checks and enforces data types
- Third/core layer: applies historisation logic
- Fourth/modeled layer: contains the models used for querying
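The four layers above can be pictured as a chain of transformations. The sketch below is purely illustrative: each function stands in for one layer, and the specific checks, field names and historisation rule are assumptions, not the framework's actual logic.

```python
# Illustrative four-layer pipeline: data flows from raw input to a
# query-friendly model. Each function represents one processing layer.

def first_layer(raw):
    # Receive and check: drop records flagged as source issues (here, None).
    return [r for r in raw if r is not None]

def second_layer(records):
    # Formal/technical checks: enforce data types on each field.
    return [{"id": int(r["id"]), "amount": float(r["amount"])} for r in records]

def core_layer(records):
    # Historisation: tag each record with a load date (hypothetical rule).
    return [dict(r, load_date="2024-01-01") for r in records]

def modeled_layer(records):
    # Model for querying: index records by id for fast lookup.
    return {r["id"]: r for r in records}

raw = [{"id": "1", "amount": "9.5"}, None, {"id": "2", "amount": "3.0"}]
model = modeled_layer(core_layer(second_layer(first_layer(raw))))
```

Each layer only depends on the output of the previous one, which mirrors the separation of goals described above.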
These layers are implemented with different technologies, chosen according to the needs and goals of each use case. In particular, the tools are:
- Postgres for relational data
- A key-value store
- Hadoop for large volumes of data
The framework has a modular structure: each component is a separate module with a specific goal. This modular approach brings benefits in terms of extending the solution and integrating new functionality. Each module is implemented with open-source technologies, such as PySpark, to encourage reuse.
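One common way to realize such modularity is to give every module the same interface, so new modules can be plugged into a pipeline without touching existing ones. The sketch below assumes a simple `run()` contract; the module names and operations are invented for illustration.

```python
# Sketch of the modular approach: every module implements the same
# interface, so the pipeline is just an ordered list of modules.
from abc import ABC, abstractmethod


class Module(ABC):
    @abstractmethod
    def run(self, data: list) -> list:
        """Transform the input data and return the result."""


class Deduplicate(Module):
    def run(self, data: list) -> list:
        seen, out = set(), []
        for item in data:
            if item not in seen:
                seen.add(item)
                out.append(item)
        return out


class Uppercase(Module):
    def run(self, data: list) -> list:
        return [s.upper() for s in data]


# Extending the solution means appending a new Module to this list.
pipeline = [Deduplicate(), Uppercase()]
data = ["a", "b", "a"]
for module in pipeline:
    data = module.run(data)
```

The same idea carries over to PySpark jobs, where each module would wrap one transformation step.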
The entire framework is driven by centralized metadata structures, which speed up the creation of new processing flows and make the processes easier to control. The metadata is stored in a Postgres database.
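Metadata-driven processing can be sketched as an engine that iterates over a metadata table and launches only the flows it describes, instead of hard-coding each flow. The sketch below stands an in-memory list in for the Postgres table; the column names (`flow`, `source`, `target`, `active`) are assumptions.

```python
# Sketch of metadata-driven flow execution: each metadata row describes
# one processing flow; the engine reads rows rather than hard-coding flows.
metadata = [
    {"flow": "sales_daily",  "source": "sales.csv", "target": "core.sales", "active": True},
    {"flow": "stock_weekly", "source": "stock.csv", "target": "core.stock", "active": False},
]


def run_flows(rows):
    executed = []
    for row in rows:
        if row["active"]:
            # A real implementation would launch the configured job here,
            # e.g. a PySpark job reading row["source"] into row["target"].
            executed.append(row["flow"])
    return executed


executed = run_flows(metadata)
```

Adding a new processing flow then amounts to inserting one row into the metadata table, which is what makes the approach fast to extend and easy to monitor centrally.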