Gemma3.c: Gemma 3 Pure Inference in C
Summary
The Gemma3 project presents a pure C11 CPU inference engine for the Gemma 3 4B IT model, demonstrating that LLM inference can run without Python, PyTorch, or GPUs. The repository covers build steps, model download, memory usage, and a CLI/library API, with portability across Linux/macOS and Windows via WSL/MinGW.