April 2, 2026LLM Serving and the Bus That Never StopsIn-flight batching is the trick that keeps LLM serving from wasting GPU seats.machine-learningllminference
November 12, 2025Word Embedding is Magic!Word embedding is a magic trick that allows computers to understand language.machine-learningword-embedding