Articles - Alex Goldhoorn

Technical Articles

Technical writings on data science, AI evaluation, simulation, and logistics optimization.

When LLMs Meet Structured Data: The Evaluation Challenge

January 2026 • Technical Article • BibTeX

Building an evaluation framework for LLM agents at Meight. When extracting structured shipping data from documents, we learned that evaluation requires both strict metrics (for production readiness) and LLM-as-a-judge (for semantic correctness).

Read Article →

System 1 vs System 2: Testing LLMs with Riddles

December 2025 • Article + Interactive Challenge + Raw Outputs • BibTeX

An experimental evaluation of how 8 models (6 cloud, 2 local) perform on logic puzzles, revealing the gap between pattern matching and first-principles reasoning. Includes complete raw model responses.

🎯 Try Interactive Challenge → Read Full Analysis → 📝 Raw Outputs →

How to Simulate a Global Delivery Platform

February 2021 • Medium • BibTeX

Deep dive into building a large-scale discrete event simulation system for Glovo's global delivery network, covering architecture decisions, performance optimization, and real-world validation.

Read on Medium →

Contact: alex (at) goldhoorn.net