Sumi: Uniform-Diffusion Language Model with 7 Billion Parameters Trained from Scratch
18 June 2026 | AI Models
Sumi is the first openly available Uniform-Diffusion language model trained from scratch at the 7-billion-parameter scale and addresses a research gap between established autoregressive and masked diffusion approaches.

ZPPO: Teacher Models as Prompts Instead of Gradients
17 June 2026 | AI Models, Claude AI
ZPPO integrates teacher models as prompt components instead of gradients, improving generalization in knowledge transfer to smaller models.

ICALens: Interpretability Method for Language Models Without Training Additional Autoencoders
11 June 2026 | AI Models, Claude AI
ICA-based analysis enables rapid exploration of interpretable directions in language models without expensive training of additional autoencoders.