In this Quick Start tutorial, we’ll be training Qwen3 14B Instruct to play 2048, a simple game that requires forward planning and basic math skills.Documentation Index
Fetch the complete documentation index at: https://openpipe-art-austin-megatron-models.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
Reading time: 15 minTraining time: 2 hoursTotal cost: Free!
Step 1: Provision W&B API key
ART is an open source library and works across infra and observability providers. To keep things simple in this tutorial, we’ll exclusively use Weights & Biases services, which means we’ll only need to provision one API key. We’ll use these services:- W&B Training - autoscale GPUs for inference and training
- W&B Models - record metrics like reward
- W&B Weave - record your model’s traces as it generates completions
- W&B Artifacts - store and manage your model’s checkpoints