The Gemma 4 family, in three variants (31B dense, 26B-A4B MoE, and E2B compact), is available as a fully managed service on Amazon Bedrock, with native reasoning, function calling, and multimodal support.
NVIDIA and Microsoft combine specialized hardware (RTX Spark, DGX Station for Windows), secure runtimes (OpenShell), and open-source models (Nemotron, Cosmos) into an end-to-end stack for deploying agentic AI, from local Windows devices to Azure Cloud.