Writing an optimizing tensor compiler from scratch
Summary
Mykhailo Moroz profiles TensorFrost, a static optimizing tensor compiler that blends NumPy-like operations with shader-style control flow. The post covers architecture (kernel fusion, IR, autodiff), Python frontend, host backends, and multiple examples (fluid simulation, path tracer, N-body), plus performance observations and a roadmap of future work.