ARM: Autoregressive Model for Unified Image and Text Processing10. June 2026AI ModelsARM combines discrete visual tokens with a 7-billion-parameter model to solve image and text tasks uniformly as token predictions. Share on: