LLMs, Vision & Systems

Notes from ongoing research, field experiments, and production learnings around agentic AI and computer vision.

SAM3 · Dec 21, 2025 · 12 min

MLX SAM3

A worklog sharing notes on SAM3, Meta's latest segmentation model, ported to MLX to run on Apple Silicon, along with a demo application.

View code on GitHub
LLMs · Nov 18, 2025 · 12 min

KV Cache in Large Language Models

A practical guide to understanding and implementing the KV cache in large language models. The implementation uses MLX, a NumPy-like array framework optimized for Apple Silicon's unified memory architecture.

View code on GitHub