Skip to content
View AlejandroBeldaFernandez's full-sized avatar

Block or report AlejandroBeldaFernandez

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Alejandro Belda Fernández — Portfolio

Portfolio en Español


About me

Computer Science graduate from the University of Murcia with a Master's degree in Artificial Intelligence. I spent one year working as a researcher (PDI) handling complex biological datasets, managing Linux server infrastructure, and collaborating in academic research environments. This gave me a strong foundation in data quality, methodological rigour, and working with noisy real-world data. I am now transitioning into industry roles in data science and data analysis.

I enjoy the full pipeline: understanding the business problem, cleaning and transforming messy data, exploring it with the right questions, building models that actually work in context, and communicating results clearly. I care about honest evaluation, documented decisions, and conclusions that non-technical stakeholders can act on.

Languages: Spanish (native) · English (B2)

Contact: alejandrobeldafernandez@gmail.com

LinkedIn

Skills

Machine Learning

Python scikit-learn Pandas Matplotlib Seaborn

MLOps & Deployment

Docker FastAPI Git MLflow Linux Prefect

Databases & Tools

SQL PyPI BI Tools Spreadsheets


Open source

calm-data-generator

Python library for synthetic data generation, developed during my time as a researcher. Published on PyPI and maintained independently. Designed to generate realistic tabular datasets for development, testing, and prototyping. PyPI · GitHub

GitHub


Portfolio projects


01 — Madrid Traffic Accident Injury Prediction

Type: Classification

Stack: Python · pandas · scikit-learn · CatBoost · Optuna · SHAP

Dataset: Open data from the Madrid City Council (2019–2023)

Predicts whether a traffic accident in Madrid will result in at least one injured person, using information available at the time the accident is reported. The project covers person-to-accident aggregation, feature engineering, three models with Optuna tuning, SHAP explainability, and actionable recommendations for emergency resource allocation.

Best model (CatBoost): ROC AUC 0.873 · Balanced Accuracy 0.801 · Macro F1 0.73

What does this mean in practice? Out of every 10 accidents the model predicts as involving injuries, roughly 8 to 9 are correct. It also correctly identifies most accidents where no one is hurt, which avoids unnecessary resource dispatch. The remaining errors are mostly false alarms rather than missed injuries, which in an emergency context is the safer type of mistake.

GitHub


More projects coming soon.

Repositories in progress — publishing progressively throughout 2026.

Popular repositories Loading

  1. Calm-Data-Generator Calm-Data-Generator Public

    CALM-Data-Generator is a comprehensive Python library for synthetic data generation with advanced features

    Python 4 1

  2. madrid-traffic-accidents madrid-traffic-accidents Public

    Supervised machine learning project to predict whether a traffic accident in Madrid results in any injury, using open data from the Madrid City Council (2019–2023).

    Jupyter Notebook

  3. AlejandroBeldaFernandez AlejandroBeldaFernandez Public