Data Warehouse Infrastructure Migration And Creating Data Mart For The World’s Largest Job Marketplace

March 7, 2023 by
The Result

Reduced cost of data storage and support due to the migrated corporate data warehouse infrastructure that simplified data management.

Increased performance of data querying and data availability for analytics due to execution of the most critical workloads on top of Snowflake’s multi-cluster shared data architecture.

The created (from scratch) data mart for the marketing department.

The Challenge

Mellivora data engineers were approached by a client with the request to identify valuable business logic and to separate it from the outdated part of code.

The request included zero stopping time for moving the data and data quality checks supporting the entire data migration process.

The Solution

Mellivora experts executed data migration from the PrestoDB data warehouse to Snowflake.

Some ETL workflows from Apache Airflow were moved to RunDeck or to Snowflake Data Integration directly.

The outdated integration with Oracle Responsys was replaced by modern technology stack NeuStar + Tealium + Braze.

Moreover, with a view of supporting different needs for the marketing team, a unified professional dictionary was developed with pre-aggregated values and expressions. 

Technology Stack

  • Snowflake
  • Apache Airflow
  • Rundeck by PagerDuty
  • Tealium, Braze, Neustar
  • Python, bash, shell, SQL