Technology Reply dealt with the study, the design and the implementation of the data acquisition and integration components of the connected vehicle within the Corporate Data Lake: a distributed, cloud-based data platform, which processes approximately 1.5 billion records daily with more than 5,000 data integration processes taken from multiple sources.
The acquisition of the connected vehicle data, starting from the heterogeneous source systems, was carried out through a market-leading ETL tool and furthermore the realization of data integration and structures was designed in order to promote front-end solutions oriented to interactive reports and self-service analysis.
Four distinct design streams have been organized for the design, design and implementation of the solution:
- Data Ingestion & Architecture, responsible for defining the reference architecture, setup and coordination of data integration activities
- Data Modeling & Use Case definition, responsible for designing business scenarios and feeding rules for indicators of interest
- Data Governance, responsible for Data Quality areas, Metadata Management and the definition of a common Business Glossary
- Data Privacy, responsible for the definition and implementation of Security and Privacy logics such as Data Retention, anonymization of sensitive data