Tag #training-infra

1 post tagged training-infra.

deep-dive
KV Cache Compression Is Now an Alignment Problem
A new preprint argues that compressing the KV cache during RL rollouts silently biases the policy you ship. For teams treating RLHF as a defensive control, the off-policy bug matters more than the throughput win.
May 11, 2026