Skip to content

Granite 4.1 LLMs: How They Are Built

Granite 4.1 are compact language models from IBM with 3B, 30B and 83B parameters, trained on 15 trillion tokens with a 512K context window. The 8B Instruct model outperforms the larger predecessor model through optimized dense architecture and advanced fine-tuning and reinforcement learning techniques.

Share on:

Building Blocks for Foundation Model Training and Inference on AWS

Foundation model development today scales across three channels: pre-training, post-training, and test-time compute, with AWS showing how its infrastructure—accelerators, networking, and storage—works with open-source tools like PyTorch, Kubernetes, and Prometheus to enable efficient training and inference.

Share on: