Blog
Recent Posts

TurboQuant - Compressing KV Caches to 3 Bits

LoRA & Parameter-Efficient Fine-Tuning - Adapting Giants on a Budget

Mixture of Experts - Scaling Without the Compute Tax

Model Quantization - Squeezing Giants into Laptops

RLHF - Teaching Language Models to Follow Human Intent

Speculative Decoding - Making LLMs Think Faster

Flash Attention - Breaking the Memory Wall

KV Caching - Making Transformers Actually Fast

Attention Is All You Need - A Visual Story

Language Modeling & Recurrent Networks

Regularization & Stability - Training Networks That Generalize

Optimizers & Training - Making Neural Networks Learn Faster

Deep Learning from First Principles

Blog covers powered by GPT-4o

PG 101 - Building Postgres Extensions
