Tag: nlp
-
Deploying self-hosted LLMs in Prod

A practical walkthrough of deploying a self-hosted LLM on AWS EKS with dedicated GPU node groups, taints/tolerations for isolation, separate inference and app services, model-weight caching via volumes, and sizing guidance for VRAM/concurrency and warm GPU capacity.
-
Reference Architecture: An Agentic CLI Application

While writing the previous post in this series, I hit a very unglamorous problem: my laptop disk was full. Not “almost full”. Full enough that everything started to feel brittle. I did what I always do: open a couple of folders, run a few du commands, check caches, look for the usual suspects, delete stuff,…
-
Building and Securing Publicly Facing AI Agents

A CV is compressed text. It is easy to skim, easy to misread, and hard to validate. So instead of publishing my background as static pages, I built a public, live AI agent on top of my Notion database. The goal is simple: don’t just read my CV. Interact with it. Ask questions in your…
