Supervised Fine-Tuning — Loss Function, Packing, Memory, and LoRA