Show HN: Sweep, Open-weights 1.5B model for next-edit autocomplete
Summary
Sweep Next-Edit 1.5B is a compact, open-weights model for next-edit autocomplete, distributed as an 8-bit quantized GGUF file for on-device use. The post claims inference in under 500 ms on a laptop and better results than larger models on next-edit tasks. The model is built on Qwen2.5-Coder, is released under the Apache 2.0 license, and comes with usage steps, benchmarks, and a JetBrains plugin for deployment. The post illustrates what local inference makes practical for lightweight code-editing workloads.
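As a rough sanity check on the "runs on a laptop" claim, the stored weight size can be estimated from the parameter count and the bit width. This is a back-of-the-envelope sketch, not a figure from the post: the helper name `approx_weight_bytes` is hypothetical, and the estimate ignores quantization metadata (per-block scales), the KV cache, and activation memory, so the real file and runtime footprint will be somewhat larger.

```python
def approx_weight_bytes(n_params: float, bits_per_weight: float) -> float:
    """Rough size of the stored weights in bytes.

    Assumption: every parameter takes exactly `bits_per_weight` bits.
    Real quantized formats add some per-block overhead on top of this.
    """
    return n_params * bits_per_weight / 8


GIB = 1024 ** 3  # bytes per gibibyte

if __name__ == "__main__":
    # 1.5B parameters at 8 bits vs. an unquantized 16-bit copy for comparison.
    for bits in (8, 16):
        size = approx_weight_bytes(1.5e9, bits)
        print(f"{bits}-bit: about {size / GIB:.2f} GiB of weights")
```

Under these assumptions the 8-bit weights come to roughly 1.5 GB, about half of a 16-bit copy, which is consistent with the model fitting comfortably in laptop memory.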