🤖 TurboQuant on a MacBook: The KV Cache Killer You've Been Ignoring KV cache on a 70B model at 32k tokens? That's 40GB+ in FP16, dooming your MacBook. TurboQuant compresses it ruthlessly—without touching model quality. 4 min read 4 weeks ago