NVIDIA Fine-Tunes Llama3.1 Model to Beat GPT-4o and Claude 3.5 Sonnet with Only 70 Billion Parameters

17.10.2024 в 11:21,
Hard news

NVIDIA has officially released its Llama-3.1-Nemotron-70B-Instruct model. Based on META's Llama 3.1 70B, the Nemotron model is a large language model customized by NVIDIA in order to improve the helpf

ulness of LLM-generated responses. NVIDIA uses fine-tuning structured data to steer the model and allow it to generate more helpful responses. With only 70 billion parameters, the model is punching fa ...

Автор: AleksandarK
Источник: https://www.techpowerup.com/327796/nvidia-fine-tunes-llama3-1-model-to-beat-gpt-4o-and-claude-3-5-sonnet-with-only-70-billion-parameters
×