Jonas Vos

Jonas
Vos.

Based in The Netherlands

Transforming raw data into deployable intelligence. I build systems spanning machine learning, data engineering, and full-stack development.

scroll

Crafting intelligence
from raw data.

Jonas Vos

I'm Jonas, a third-year Data Science & AI student at Breda University of Applied Sciences. I'm passionate about building intelligent systems that solve real-world problems, from computer vision pipelines to full-stack AI platforms.

Across 10+ projects I've worked with deep learning, NLP, reinforcement learning, computer vision, and full-stack development. I thrive at the intersection of research and engineering, turning raw data into deployable, impactful solutions.

When I'm not training models or wrangling data, you'll find me producing electronic music, diving into the latest AI research papers, or cycling through the North Brabant countryside. I thrive in teams that value curiosity and aren't afraid to experiment.

10+
Projects
3
Years DS&AI
5
Teams Led

Data Science & ML

Statistical modeling, deep learning, NLP, and computer vision. From BERTje to U-Net to reinforcement learning agents.

Data Engineering

ETL pipelines, REST APIs, Docker deployments, and cloud infrastructure on Azure. Building systems that scale.

AI Applications

Full-stack AI platforms: from LLM-powered recruitment tools to smart campus analytics. End-to-end from prototype to production.

Tech stack.

Python3 yrs · 9 projs
SQL2 yrs · 4 projs
JavaScript1 yr · 2 projs
HTML / CSS1 yr · 2 projs
LaTeX2 yrs · 3 projs
TensorFlow / Keras2 yrs · 5 projs
PyTorch1 yr · 2 projs
Scikit-learn3 yrs · 6 projs
Transformers (BERT)1 yr · 2 projs
Reinforcement Learning1 yr · 1 proj
Computer Vision2 yrs · 3 projs
NLP2 yrs · 2 projs
FastAPI2 yrs · 4 projs
Docker1 yr · 3 projs
MongoDB1 yr · 1 proj
Git / GitHub Actions3 yrs · 9 projs
ETL Pipelines1 yr · 2 projs
REST APIs2 yrs · 4 projs
Flask1 yr · 1 proj
SQLite1 yr · 1 proj
Azure1 yr · 2 projs
MLflow / WandB1 yr · 2 projs
Streamlit2 yrs · 3 projs
Power BI1 yr · 1 proj

Selected work.

Currently Building · Feb – June 2026

Smart Campus Analytics

Predictive analytics system forecasting campus occupancy at BUas. Time series ML models integrating data from cameras, WiFi, and scheduling systems. REST API with real-time endpoints feeding dashboards and automated staff planning.

Data EngineerTime SeriesFastAPIDockerETLExplainable AI
01
Feb – June 2026

PASCO: Smart Campus Analytics

Predictive occupancy forecasting system for BUas campus operations. Ingests data from entrance cameras, WiFi, TimeEdit room bookings, KNMI weather, and NS train disruptions into a 112-column hourly feature table. Trains a Temporal Fusion Transformer alongside LightGBM, XGBoost, and CatBoost models to forecast building occupancy up to 7 days ahead, feeding FastAPI endpoints and Power BI dashboards used by catering, facilities, and cleaning teams.

112
Features
4
Models
<15%
Target MAPE
PythonTFTLightGBMXGBoostFastAPIAirflowDockerPower BIETLTime Series
02
Feb 2026 – Present

Vibe Splitter

Web app that automatically clusters a Spotify library into vibe-based playlists using Last.fm genre tags, MiniLM semantic embeddings, and optional Spotify audio features. Runs hourly, routes new tracks via cosine similarity, and features a real-time SSE-powered dashboard.

Hourly
Sync
Auto
Routing
Docker
Deploy
PythonFlaskSpotify APIfastembedSQLiteDockerSSE
03
Year 3 · Full Semester

ObjectivEye

AI-powered recruitment platform for Dutch businesses. Uses GPT-4o for bias detection in job postings, CV parsing, ESCO-based skill matching, and EU AI Act compliance. Includes an ML Judge framework with 80 test cases for systematic prompt evaluation.

80
Test Cases
EU AI Act
Compliance
5 people
Team
FastAPIAzure OpenAIMongoDBMLflowTailwind CSS
04
Year 2 · Block D

Plant Root Analyzer

Deep learning image segmentation pipeline analyzing plant root images using a U-Net CNN. Features CLI, web interface, and REST API, deployed both on-premise and on Azure Container Apps with full CI/CD.

3
Interfaces
Azure
Deploy
Automated
Tests
TensorFlowFastAPIDockerAzurePytest
05
Year 2 · Block B

NPEC Computer Vision & Robotics

Computer vision + robotics pipeline for the Netherlands Plant Eco-phenotyping Centre. Root segmentation (F1: 0.85), root system architecture extraction, and reinforcement learning agents controlling a liquid handling robot.

0.85
F1 Score
4 people
Team
NPEC
Client
Computer VisionReinforcement LearningPID ControlWandB
06
Year 2 · Block C

Dutch Emotion Detection

NLP pipeline for Dutch language emotion classification. Transcribes video/audio via Whisper, classifies 7 emotions using fine-tuned BERTje/RoBERTa. Achieved 84.7% accuracy with SMOTE-balanced datasets.

84.7%
Accuracy
7
Emotions
RoBERTa
Model
BERTjeRoBERTaWhisperSMOTENLP
More Projects

Road Safety Predictor

2024

Machine learning system predicting road accident risk levels (low / medium / high) for Breda. Merges the ANWB Safe Driving Dataset with six years of Breda accident records, trains ensemble and neural-network models, and deploys via Streamlit.

Scikit-learnStreamlitSMOTENeural NetworksFigma

Biome Explorer App

2024

Deep learning image classifier that identifies 7 global biomes from photos. Built through 4 iterative model experiments culminating in a VGG16 transfer-learning model (~90% test accuracy), with GradCAM + LIME interpretability analysis and a user-tested Figma mobile app prototype.

TensorFlowVGG16GradCAMLIMEFigma

NAC Breda Player Analytics

2024

End-to-end data analysis and ML pipeline for NAC Breda's player recruitment. Cleaned and explored a dataset of 16,535 footballers across 140 features, segmented players by position, and built a Logistic Regression classifier to identify high-potential forward prospects.

PandasScikit-learnMatplotlibSeabornLogistic Regression

SDG Dashboard

2023

Interactive dashboard investigating UN SDG progress, focusing on SDG 14 (Life Below Water), examining how marine ecosystem protection influences fish stock populations.

PythonPower BILinear RegressionEDAData Viz

Chatbot Trust & Data Security Research

2024

Academic research paper investigating how personal data security influences customer trust in AI chatbot interactions. Combines qualitative thematic analysis with quantitative survey data, correlation and regression analysis.

Mixed MethodsThematic AnalysisStatisticsSurvey DesignLaTeX

The path so far.

Let's build something
extraordinary.

Whether you have a data challenge, a collaboration idea, or just want to connect, I'd love to hear from you.

or reach me directly at Jonasvos01@gmail.com