Weather Forecasting with LSTM-based neural network

This project was created as part of the Deep Machine Learning course at Óbuda University. Its goal is to implement a deep learning model that, based on meteorological data from several European capitals, can predict the daily maximum temperature.

Topic selection and objective

The aim of the project is to create a forecasting pipeline using an LSTM-based model with the help of an existing open meteorological dataset. The task includes:

querying and processing raw data,
preparing time-based features based on data from multiple cities,
creating training and validation sets in the form of time-windowed sequences,
training and optimizing an LSTM model,
evaluating and visualizing the model.

Data source

The data was queried via the public API of Meteostat.
The data has daily resolution and includes the following features:
- daily average temperature (tavg)
- minimum temperature (tmin)
- maximum temperature (tmax)
- precipitation (prcp)
- wind speed (wspd)

Data was only used if it was fully available for the given city and feature throughout the entire examined time interval (2000–2024).

Model and pipeline

The implementation is based on an LSTM (Long Short-Term Memory) neural network.
The input data consists of multiple features from several European capitals, in the form of time-windowed sequences.
The target variable is the prediction of the tmax value for a selected city.

Model parameters

hidden_size: 16–128 (recommended: 32)
batch_size: 64–4096 (recommended: 1024)
learning_rate: 0.0001–0.003 (recommended: 0.001)
num_layers: 1 or 2 (recommended: 1)
window_size: 7 days (not tested with other values)

Early stopping and saving of training curves were used during training.

Evaluation

Models were compared based on several metrics:

Mean Absolute Error (MAE)
Mean Squared Error (MSE)
training time (in seconds)

The evaluation results are available in .csv format in the training_log.csv file, and the training curves for each epoch are also logged.

Project structure

oe_deep-ml/
├── logs/ # Log files for training curves and metrics
├── models/ # Trained models (.pt)
├── plots/ # Plots of training curves
├── training_log.csv # Hyperparameter tuning results
├── worldcities.csv # City coordinates (SimpleMaps)
├── european_capitals_weather.csv # Merged, processed data
├── weather_data_train.ipynb # Data collection and preparation & model training and logging
├── weather_inference.ipynb # Model loading and inference
├── results_analysis.ipynb # Evaluation of hyperparameter tuning results

Key takeaways

Optimization of batch_size and learning_rate has a significant impact on performance.
Too large hidden layers led to overfitting and poor generalization.
Standardizing the tmax target variable caused biased predictions, so the final model was trained on the original scale.
Training curves and early stopping helped avoid overfitting.

Result

Train Loss: 4.497498
Mean Absolute Error (MAE): 2.054539
Mean Squared Error (MSE): 6.802113
Training time (in seconds): 30.13 s

Requirements

Python 3.10+
numpy
pandas
torch
matplotlib
scikit-learn
meteostat
geopy

License

MIT License — freely usable for educational and research purposes.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Weather Forecasting with LSTM-based neural network

Topic selection and objective

Data source

Model and pipeline

Model parameters

Evaluation

Project structure

Key takeaways

Result

Requirements

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
logs		logs
models		models
plots		plots
.gitignore		.gitignore
README.md		README.md
Result.png		Result.png
results_analysis.ipynb		results_analysis.ipynb
training_log.csv		training_log.csv
weather_data_train.ipynb		weather_data_train.ipynb
weather_inference.ipynb		weather_inference.ipynb
worldcities.csv		worldcities.csv

Folders and files

Latest commit

History

Repository files navigation

Weather Forecasting with LSTM-based neural network

Topic selection and objective

Data source

Model and pipeline

Model parameters

Evaluation

Project structure

Key takeaways

Result

Requirements

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages