NVIDIA Nemotron 3 Ultra is now available on Ollama’s cloud. It’s a 550 billion parameter (55B active) open model from NVIDIA built for long-running, agentic workflows with fast and affordable performance across hundreds of tool calls.
Download Ollama, then run Nemotron 3 Ultra with your tool of choice.
Claude Code
ollama launch claude --model nemotron-3-ultra:cloud
Hermes Agent
ollama launch hermes --model nemotron-3-ultra:cloud
OpenClaw
ollama launch openclaw --model nemotron-3-ultra:cloud
General chat
ollama run nemotron-3-ultra:cloud
See more integrations.
Nemotron 3 Ultra leads on accuracy across agent productivity, instruction following, and long-context tasks, while delivering leading throughput—saving up to 30% on costs compared to other leading open models.
Figure 1: Nemotron 3 Ultra leads among open models on agentic benchmarks for agent productivity, coding, and instruction following.
Figure 2: Nemotron 3 Ultra is in the most attractive quadrant with leading accuracy and leading throughput among open models.
Figure 3: Nemotron 3 Ultra saves up to 30% in costs and leads on the cost efficiency frontier.