One of the supported base models: Qwen3.5 (0.8B/2B/4B/9B/35B-A3B/122B-A10B/397B-A17B), Llama 1B/3B Instruct, NVIDIA Nemotron 30B/120B, OpenAI gpt-oss 20B/120B.
environment
string
Environments Hub slug for the RL environment driving the run.
config
object
Run configuration generated by `prime train init`.
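As a rough illustration of how the fields above fit together, the sketch below assembles them into a JSON body and checks the documented types (`environment` as a string, `config` as an object). It makes several assumptions: the field name `model` for the base model parameter, the helper `build_run_request`, and all example values are hypothetical and do not come from this reference. The real contents of `config` are whatever `prime train init` produces.

```python
import json
from typing import Any


def build_run_request(model: str, environment: str, config: dict[str, Any]) -> str:
    """Assemble the fields into a JSON string (hypothetical helper, not part of any SDK)."""
    # `model` is an assumed field name for the base model parameter.
    if not isinstance(model, str) or not model:
        raise ValueError("model must be a non-empty string")
    # `environment` is documented as a string (an Environments Hub slug).
    if not isinstance(environment, str) or not environment:
        raise ValueError("environment must be a non-empty string")
    # `config` is documented as an object, which maps to a dict here.
    if not isinstance(config, dict):
        raise TypeError("config must be an object (dict)")
    return json.dumps({"model": model, "environment": environment, "config": config})


# Placeholder values only; none of these identifiers come from the documentation.
body = build_run_request(
    model="example-org/example-model",
    environment="example-owner/example-env",
    config={},  # in practice, the configuration generated by `prime train init`
)
print(body)
```

The type checks fail early on a malformed value, for example a list passed where the `config` object is expected.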