Agent Skill : LLM Training

Train and fine-tune language models using TRL (Transformer Reinforcement Learning) on Hugging Face Jobs infrastructure.

Covers SFT, DPO, GRPO and reward modeling training methods, plus GGUF conversion for local deployment. Includes guidance on the TRL Jobs package, UV scripts with PEP 723 format, dataset preparation and validation, hardware selection, cost estimation, Trackio monitoring, Hub authentication, and model persistence.

Skill Name: model-trainer

Version: 1.0.0

License: Apache-2.0

Main Documentation: skills/model-trainer/SKILL.md

Core Documentation

Training Scripts

Production-ready training scripts with Trackio integration, LoRA/PEFT, and Hub saving:

Reference Guides

Detailed guides for specific aspects of training:

Plugin Configuration