The client is a dynamic and forward-thinking mid-tier bank in the US, committed to delivering exceptional financial solutions and customer service. With a focus on innovation and agility, this bank is driving growth and success in a highly competitive industry.
THE CHALLENGE / REQUIREMENT
The client faced significant challenges related to their data ecosystem, including a high total cost of ownership and exponential data growth in their data warehouse. With data doubling every two years at a rate of 100+ TB, controlling and managing the data growth in their SQL server-based data warehouse was extremely challenging.
To compound these challenges, the client also experienced performance issues in the consumption layer due to the data volume and increased consumption/data usage patterns. The data sharing process was difficult, and the client needed a simplified development process to manage their complex data ecosystem effectively.
We began by migrating the existing data warehouse on SQL server to Snowflake on AWS, leveraging our expertise in cloud migration to streamline workflows and enhance performance. We retrofitted the ETL/consumption layer to consume data from the modernized data store, ensuring that the client's data ecosystem was up-to-date and efficient.
We took a phased approach to ensure success and minimize disruption to the clients' business operations and successfully completed a migration project in three phases over a 2-year roadmap for the client.
Technologies used : Snowflake | SQL Server | AWS Cloud| Informatica | Terraform | Tableau
Multiple tools were used to bring automation and accuracy through this process :
Polyjuice ETL retrofitting tool has led to more than 60% reduction in conversion-built effort.
Custom tools developed for SQL translation and data migration, resulted in more than 35% reduction in effort. This helped to streamline workflows, reduce complexity, and allocate resources to areas that drive business growth.
Complete regression test automation framework has led to more than a 40% reduction in effort, ensuring that the clients' data ecosystems are efficient, up-to-date, and reliable.