Infrastructure
We are implementing a tightly integrated Data Lake to support our analytics work.
Our Tech Stack
Data Lake
We have built an integrated Data Lake for Advanced Analytics. All our ETL processes are orchestrated via Databricks Jobs and draw on a variety of data sources.
- Databricks (Spark)
- Azure Data Lake Storage Gen2
- MongoDB
- PostgreSQL
- R (Tidyverse) for Analytics and Advanced Analytics
- Python for Machine Learning
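To illustrate the general shape of an ETL step that such a job might run, here is a minimal sketch in plain Python. It is not our actual pipeline: the sample data, field names, and function names (`extract`, `transform`, `load`, `run_job`) are all hypothetical, and the standard library stands in for Spark and the real sources so that the example runs anywhere.

```python
import csv
import io
import json

# Hypothetical sample inputs standing in for two kinds of sources:
# JSON documents (as a document store might return them) and CSV rows
# (as a relational export might look). Nothing here reflects real data.
JSON_SOURCE = (
    '[{"id": 1, "region": "north", "value": "10.5"},'
    ' {"id": 2, "region": "south", "value": null}]'
)
CSV_SOURCE = "id,region,value\n3,north,4.5\n4,south,7.0\n"


def extract():
    """Read raw records from both hypothetical sources into plain dicts."""
    records = json.loads(JSON_SOURCE)
    records += list(csv.DictReader(io.StringIO(CSV_SOURCE)))
    return records


def transform(records):
    """Drop records without a value and normalise the field types."""
    cleaned = []
    for rec in records:
        if rec["value"] in (None, ""):
            continue  # skip incomplete records
        cleaned.append(
            {
                "id": int(rec["id"]),
                "region": rec["region"],
                "value": float(rec["value"]),
            }
        )
    return cleaned


def load(records):
    """Group records by region, mimicking a partitioned layout in a lake."""
    partitions = {}
    for rec in records:
        partitions.setdefault(rec["region"], []).append(rec)
    return partitions


def run_job():
    """Run the three stages in order, as a single scheduled task would."""
    return load(transform(extract()))


if __name__ == "__main__":
    for region, rows in sorted(run_job().items()):
        print(region, len(rows))
```

Keeping the three stages as separate functions makes each one testable on its own. In a Spark-based job the same split would typically apply, with DataFrames replacing the lists of dicts.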
Web Services (Frontend and Backend Development)
- Kubernetes
- Docker
- Flask/FastAPI
- Angular
- Datawrapper