Aaron Spring logoAaron Spring

Hi, I'm Aaron Spring

Freelance ML engineer, AI engineer and geospatial data engineer based in Hamburg. I build:

  • Production ML systems
  • LLM-powered applications
  • Analysis-ready climate datasets
Download CV ↗

Aaron Spring

Get in touch

Interested in working together? Reach out and I'll get back to you.

GitHubLinkedIn
Value Proposition

ML Engineer

  • Identify use cases for automated decision-making
  • Curate training datasets
  • Train models for automated decision-making
  • Deploy and monitor models

AI Engineer

  • Identify GenAI use cases from #weNeedToDoAI
  • Turn business ideas into verifiable prototypes, fast
  • AI evaluations: human annotation design, error analysis and automated verification
  • Consult on how to integrate AI tooling into your team

Geospatial Data Engineer

  • Identify datasets for cloud-based workflows in climate risk and energy domains
  • Build analysis-ready, cloud-optimised (ARCO) climate datasets with inherent safety, consistency and reproducibility
  • Orchestrate scheduled geospatial cloud pipelines
  • Optimise storage, chunking, throughput and cost
Recent Projects
Climate array data in the cloud

ARCO ERA5

Business Challenge

Access the best climate reanalysis based on the most recent available data for maps and time-series use-cases.

Solution

Built an analysis-ready cloud-optimised (ARCO) ERA5 dataset on Earthmover Arraylake with dual chunking optimized for maps and time-series applications, leveraging Icechunk for ACID transactions and an attached SLA covering data freshness and availability.

Climate
ARCO
Zarr
Icechunk
AWS S3
Python
AI Recruiting Agent funnel

AI Recruiting Agent

Business Challenge

How to speed up recruiter candidate review load at scale?

Solution

Recruiting agent narrowing 20M profiles to 1,000 candidates via embeddings retrieval, down to ~200 prescreened candidates with a fine-tuned LLM, returning ~10 qualified & interested applicants via personalised outreach per job posting.

Job Candidate Matching
Embeddings
Retrieval
qdrant
Deep Learning
LLM
Fine-tuning
pytorch
Tracing
langfuse
Data Curation
metaflow
Personalisation
Python
Scala
Ad user personalization

Ad Personalisation

Business Challenge

Maximize revenue per impression by selecting the ad most relevant for the user.

Solution

Improved CTR and revenue-per-impression by 2% by upgrading from logistic regression to deep neural network enabling ad-user personalisation.

Ad Tech
Personalisation
ML
scikit-learn
mlflow
Deep Learning
tensorflow
Data Curation
Python
Scala
Climate data forecasting with AI

S2S Forecast Challenge

Business Challenge

Run a Kaggle-style challenge for sub-seasonal climate forecasts with low entry barriers

Solution

Curated training datasets and a verification pipeline on git & notebook-based renkulab.io for clean-data ML forecast submissions with standardised metrics.

Climate
ARCO
Zarr
ML
scikit-learn
Deep Learning
tensorflow
Data Curation
Python
© Copyright 2026 by Aaron Spring.