KVarN: Variance-Based KV-Cache Quantization Reduces Error Accumulation3. June 2026AI Models, Claude CodeKVarN reduces error accumulation when quantizing KV-caches to 2-bit precision through improved token-scale normalization and achieves state-of-the-art results on MATH500, AIME24, and HumanEval. Share on: