Show HN: Sweep, Open-weights 1.5B model for next-edit autocomplete

January 21, 2026 at 23:22

Quality: 8/10 Relevance: 9/10

Summary

Sweep Next-Edit 1.5B is a compact, on-device model for next-edit autocomplete, with 1.5B parameters and distributed as an 8-bit GGUF quantization. The post claims sub-500ms inference on a laptop and better results than larger models on next-edit tasks. The model is built on Qwen2.5-Coder, and the post provides clear usage steps, benchmarks, and a surrounding deployment ecosystem (JetBrains plugin, Apache 2.0 license). It also shows what this means in practice for local inference and lightweight code-editing workloads.
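
As a rough illustration of why an 8-bit quantization suits laptop inference, the sketch below estimates the storage needed for the weights alone. It takes the "1.5B" figure at face value and assumes exactly 8 bits per weight; real GGUF files also carry quantization scales and metadata, so actual file sizes will differ. The helper name `estimate_weight_bytes` is hypothetical and not part of any Sweep tooling.

```python
def estimate_weight_bytes(num_params: float, bits_per_weight: float) -> float:
    """Approximate storage for the model weights alone, in bytes."""
    return num_params * bits_per_weight / 8


# Assumption: "1.5B" is treated as exactly 1.5e9 parameters.
NOMINAL_PARAMS = 1.5e9

# Compare a 16-bit baseline with the 8-bit quantization mentioned in the summary.
for label, bits in [("16-bit", 16), ("8-bit", 8)]:
    gigabytes = estimate_weight_bytes(NOMINAL_PARAMS, bits) / 1e9
    print(f"{label}: ~{gigabytes:.2f} GB")
```

Under these assumptions the 8-bit weights take about half the space of a 16-bit copy, which is the main reason a model of this size can fit comfortably in laptop memory.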
