Azure Data Analytics Class Project - Data Engineering project on Azure using the medaillon architecture
This project is an example of a Azure Data Pipeline involving multiples data source and azure services such as Azure Synapse, Azure Data Factory, and Azure Key Vault.
It uses Los Angeles crime data and weather data from the OpenWeather API
- Weather Data: https://openweathermap.org/ (API)
- LA crime data: https://data.lacity.org/Public-Safety/Crime-Data-from-2020-to-Present/2nrs-mtv8/about_data (.CSV)
In the openweather bronze module use /config/dt to set your parameters and API Key.
Raw data gets loaded into the bronze layer. After that, gets transformed and moved to silver and finally joined in gold using Azure Synapse (PySpark). Final visualisation can be made in a analytics tool like PowerBI
Secrets are handled with Azure Key Vault
This is a class project. I worked together with 3 of my classmateds
