About Me

I'm a Machine Learning Engineer and Data Science Manager at Capital One, where I lead Generative AI for the contact centers — LLM-powered, agentic tools that augment thousands of servicing agents live, mid-conversation. Over six years I've taken applied AI from prototype to production: real-time NLP, RAG, low-latency streaming inference, and the credit-risk ML that decisions loans at large scale.

What drives me is making a product measurably better for the people using it. Outside work: science fiction (Dune above all), strategy games, and time with my wife Amy and our dog Huck.

More about me

Selected Work

Flagship generative-AI systems I've taken from prototype to production at Capital One — starting with a public, clickable rebuild you can try yourself.

Live demo · you can try this one

Live Call Copilot

A public rebuild of the shape of the three systems below, running live in your browser. Talk into your mic — or play a sample call — and watch a two-voice transcript, self-drafting notes, RAG-retrieved procedure docs, and a sub-second frustration alert, all from a single WebSocket. Deepgram + OpenAI composed into a real-time product; no proprietary anything.

Next.js Deepgram OpenAI pgvector Railway

Try the live demo Read the write-up

Live Call Copilot demo — two-voice transcript, self-drafting notes, RAG procedure docs, and a frustration alert

Generative AI

Real-Time Call Summarization

An agentic GPT reasoning pipeline on Kafka that auto-drafts servicing-agent notes mid-call, with an LLM "agent-as-a-judge" review stage before anything reaches the agent.

GPT-4 Kafka Python AWS Lambda Read the write-up →

Real-timeMid-call notes

AgenticDraft + review

In pilotGoverned rollout

Retrieval · RAG

Live Procedure RAG

A retrieval-augmented generation system that runs vector-embedding retrieval over live call transcripts to surface the right procedure and training documents to agents in real time.

RAG pgvector Embeddings Python Read the write-up →

Real-timeMid-conversation

Vector searchProcedure docs

Real-Time NLP

Frustrated-Customer Detection

Capital One's first real-time AI alert system — streaming DistilBERT and LLM sentiment over live calls at scale, alerting managers to de-escalate in real time.

DistilBERT PyTorch Kafka LLMs Read the write-up →

Real-timeLive alerting

At scaleAcross the business

See all projects

Generative AI

Agentic LLM pipelines, RAG, and prompt/eval systems that augment thousands of users live, with measurable business outcomes.

Real-Time NLP

Sentiment, complaint, and intent detection over streaming call transcripts — with sub-second alerting for de-escalation.

LLM Evaluation

LLM-as-a-judge, offline & online evaluation, and observability for non-deterministic systems in production.

ML Systems at Scale

Low-latency, real-time streaming inference on Kafka, AWS Lambda, DynamoDB, and Snowflake.

Applied ML & Credit

Auto-loan underwriting models decisioning billions, plus reinforcement-learning credit-policy optimization.

Technical Leadership

Leading ML engineers and data scientists, setting technical objectives with VP- and EVP-level stakeholders.

GenAI & LLMs

Agentic Pipelines RAG & Vector Search LLM-as-a-Judge Live Demo

Claude Code OpenAI RAG pgvector LLM-as-Judge

ML & NLP

Real-Time Sentiment Complaint Detection Emotion Classification Anomaly Detection

PyTorch Hugging Face scikit-learn DistilBERT TensorFlow

Systems & Leadership

Real-time streaming Auto-Loan Underwriting RL Credit Policy Teaching

Kafka AWS Lambda DynamoDB Snowflake Splunk

Harrison Jansma

Machine Learning Engineer & Data Science Manager

Harrison Jansma

About Me

Selected Work

Live Call Copilot

Real-Time Call Summarization

Live Procedure RAG

Frustrated-Customer Detection

What I Work On

Generative AI

Real-Time NLP

LLM Evaluation

ML Systems at Scale

Applied ML & Credit

Technical Leadership

Skills & Tooling

GenAI & LLMs

ML & NLP

Systems & Leadership

Find Me Online

LinkedIn

GitHub

Medium