Welcome! Type "help" for available commands.

$

Welcome! Type "help" for available commands.

$

~/bookmarks

William's Bookmark Library

/**/

NVIDIA Nemotron-3-Super: How To Run Guide | Unsloth Documentation

unsloth.aiSaved March 17, 20265 min

Technical Documentation

Summary

This guide explains how to run and fine-tune NVIDIA Nemotron-3-Super-120B-A12B locally using Unsloth. It details hardware requirements, inference settings, and token handling for this hybrid MoE model.

Highlights

Supports local inference on devices with 64GB RAM/VRAM and fine-tuning via Unsloth.
Optimized for multi-agent AI with a 1M-token context window and high throughput.
Requires specific inference parameters like temperature 1.0 for general chat.
Uses special tokens for reasoning and requires max_position_embeddings adjustment due to NoPE.

auto-generated

Preview of NVIDIA Nemotron-3-Super: How To Run Guide | Unsloth Documentation

via Unsloth Documentation

Context

Audience

AI Developers

DomainMachine Learning

Formattechnical guide

Accessfree online

Topics

NVIDIA Nemotron-3-Super MoE Models Local Inference Model Fine-Tuning Llama.cpp Guide

Visit Site All Bookmarks

NVIDIA Nemotron-3-Super: How To Run Guide | Unsloth Documentation

unsloth.aiSaved March 17, 20265 min

Technical Documentation

Summary

This guide explains how to run and fine-tune NVIDIA Nemotron-3-Super-120B-A12B locally using Unsloth. It details hardware requirements, inference settings, and token handling for this hybrid MoE model.

Highlights

Supports local inference on devices with 64GB RAM/VRAM and fine-tuning via Unsloth.
Optimized for multi-agent AI with a 1M-token context window and high throughput.
Requires specific inference parameters like temperature 1.0 for general chat.
Uses special tokens for reasoning and requires max_position_embeddings adjustment due to NoPE.

auto-generated

via Unsloth Documentation

Context

Audience

AI Developers

DomainMachine Learning

Formattechnical guide

Accessfree online

Topics

NVIDIA Nemotron-3-Super MoE Models Local Inference Model Fine-Tuning Llama.cpp Guide

Visit Site All Bookmarks

~/bookmarks

NVIDIA Nemotron-3-Super: How To Run Guide | Unsloth Documentation

Summary

Highlights

Context

Topics

Related

NVIDIA Nemotron-3-Super: How To Run Guide | Unsloth Documentation

Summary

Highlights

Context

Topics

Related

~/bookmarks

NVIDIA Nemotron-3-Super: How To Run Guide | Unsloth Documentation

Summary

Highlights

Context

Topics

Related

Discover Similar Content

NVIDIA Nemotron-3-Super: How To Run Guide | Unsloth Documentation

Summary

Highlights

Context

Topics

Related