InfoKV: Entropy-Based KV-Cache Compression for Long Reasoning Sequences

26 June 2026
AI Models, Claude Code

InfoKV combines attention scores with uncertainty signals for KV-cache compression and outperforms purely attention-based methods on long reasoning tasks by measurable margins.