NVIDIA: Nemotron-4 340B Instruct

nvidia/nemotron-4-340b-instruct

Nemotron-4-340B-Instruct is an English-language chat model optimized for synthetic data generation. This large language model (LLM) is a fine-tuned version of Nemotron-4-340B-Base, designed for single and multi-turn chat use-cases with a 4,096 token context length.

The base model was pre-trained on 9 trillion tokens from diverse English texts, 50+ natural languages, and 40+ coding languages. The instruct model underwent additional alignment steps:

Supervised Fine-tuning (SFT)
Direct Preference Optimization (DPO)
Reward-aware Preference Optimization (RPO)

The alignment process used approximately 20K human-annotated samples, while 98% of the data for fine-tuning was synthetically generated. Detailed information about the synthetic data generation pipeline is available in the technical report(opens in new tab).

Model weights

Modalities

Context

Low

4K

Released

Jun 23, 2024

Knowledge Cutoff

Jun 2023

NVIDIA: Nemotron-4 340B Instruct