GRAIL: Enhanced Reinforcement Learning for Mathematical Reasoning in LLMs4. June 2026AI Models, Claude AI, Claude CodeGRAIL uses gradient activation saliency to train relevant reasoning steps more strongly than irrelevant tokens, achieving 3.60% accuracy improvement without separate process-level supervision. Share on: