Available · May 2026

Benjamin
Holderbein

I build retrieval and LLM systems that actually ship. Currently at Asurion replacing a 3-year-old semantic search with a modular RAG pipeline serving 10,000+ daily queries, while finishing my M.S. in Data Science & AI at USF.

Benjamin Holderbein
01About

AI engineer with a research bent and a builder’s instinct.

I'm an AI Engineer based in San Francisco. My day-to-day is RAG plumbing: ingestion, chunking, embeddings, retrieval, evals — making sure the thing that comes out the other end is actually better than what was there before.

Right now I'm at Asurion, where I built a modular RAG system that replaces a three-year-old semantic search serving 10,000+ daily customer queries. Before that I shipped a React Native app at StudyStudio.ai, an NLP matching system at USF's Data Institute that's still in production, and time-series pipelines for residential energy research at Frontier Energy.

Outside of work I ran USF's 100+ member rock climbing club for two years. I like things that are precise, durable, and slightly understated.

02NowMay 2026
  • 01Building the next iteration of Asurion's retrieval stack — eval harness over BGE / Qwen3 / Nemotron / Gemini.
  • 02Wrapping up my M.S. in Data Science & AI at USF (June 2026).
  • 03Reading: Sebastian Raschka — Build a Large Language Model.
03Selected work3 projects
2025

Connected Vehicle Data Pipeline

End-to-end pipeline that synthesizes connected-vehicle transactions, lands them in Postgres, runs feature engineering and a fraud-detection model, and surfaces results in a real-time Streamlit dashboard.

PythonPostgreSQLscikit-learnStreamlit
2025

LLM From Scratch

Implementing a transformer-based LLM in PyTorch from first principles, following Sebastian Raschka's Build a Large Language Model. A working understanding of every line, not just the API.

PyTorchPython
2024

Cellular Microscopy Counting

U-Net CNN for image segmentation that automates cell counting for biomedical research. Mean error of 1.4 cells, beating the 3-cell target by 2x and replacing manual counting.

PyTorchU-NetCV
04Experience
2025 — Now

AI Engineer, Intern

Asurion · San Francisco
Built a modular RAG system replacing a 3-year-old semantic search serving 10,000+ daily customer queries. Architected an ingestion pipeline supporting four embedding models (BGE, Qwen3, Nemotron, Gemini) and a Redis vector store with automated preprocessing, chunking, and embedding workflows. Integrated DeepEval to benchmark embedders and retrieval strategies.
2025

Software Engineer, Intern

StudyStudio.ai · San Francisco
Developed and deployed a cross-platform mobile app using React Native, TypeScript, and Clerk for iOS and Android. Delivered a polished prototype mirroring core web functionality and ready for store deployment.
2024

AI Engineer, Intern

USF Data Institute · San Francisco
Designed and shipped an NLP algorithm that automates internship/student matching at USF — clustering qualifications and matching them against employer requirements with LLMs. Built a preprocessing pipeline that cut runtime and API cost by 40%. Still in production.
2022 — 2023

Data Scientist, Intern

Frontier Energy · Davis, CA
Analyzed 40+ data channels monitoring residential building performance across California pilot sites. Built an Azure Data Explorer dashboard and a Python pipeline that cleaned, validated, and resampled time-series data from 1-second to 5-minute intervals.
05ContactSay hello