Ali


Getting the Hang of Instruction Tuning

November 25, 2024
  • language models
  • tutorial
A hands-on programming tutorial on instruction tuning: I take a base Gemma 2B model and fine-tune it on the Alpaca dataset using a small GPU, so that the model learns to follow user instructions.
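To make the idea concrete, here is a minimal sketch of how a single Alpaca record might be turned into a training example. It assumes the record has the `instruction`, `input`, and `output` fields of the Alpaca dataset and uses the prompt wording commonly associated with Alpaca; the helper name `format_example` is hypothetical, and the template used in the full tutorial may differ.

```python
# Prompt templates in the style commonly used with Alpaca (an assumption here).
PROMPT_WITH_INPUT = (
    "Below is an instruction that describes a task, paired with an input that "
    "provides further context. Write a response that appropriately completes "
    "the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Input:\n{input}\n\n"
    "### Response:\n"
)
PROMPT_NO_INPUT = (
    "Below is an instruction that describes a task. Write a response that "
    "appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Response:\n"
)


def format_example(record):
    """Hypothetical helper: return (prompt, target) for one Alpaca-style record."""
    if record.get("input", "").strip():
        # The record carries extra context, so use the template with an Input section.
        prompt = PROMPT_WITH_INPUT.format(
            instruction=record["instruction"], input=record["input"]
        )
    else:
        prompt = PROMPT_NO_INPUT.format(instruction=record["instruction"])
    return prompt, record["output"]


if __name__ == "__main__":
    # A made-up record for illustration, not taken from the dataset.
    example = {
        "instruction": "Translate the sentence to French.",
        "input": "Good morning.",
        "output": "Bonjour.",
    }
    prompt, target = format_example(example)
    print(prompt + target)
```

During fine-tuning, the prompt and target would typically be concatenated and tokenized, with the model trained to predict the tokens of the target that follow the `### Response:` marker.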