Skip to content

Llama-recipes v.0.0.3 Release Notes

Compare
Choose a tag to compare
@mreso mreso released this 23 Jul 17:43
· 363 commits to main since this release
2845306

Llama 3.1 Integration

This release accompanies the release of Llama 3.1 which included new versions of the Llama 8B and 70B models as well as the new 405B version. To get started with the new models you can find information in the official documentation or the on the HuggingFace hub. Further details can also be found in the model card and the Llama 3.1 paper. For this release we updated the documentation and made sure all components work with the new models.

New Features

We also added new features like FSDP + QLoRA fine-tuning and H2O algorithm for long context inference.

Additional Examples

Besides, we added new examples to get you up and running quickly with the Llama models

  • Add Groq/Llama3 recipes (cookbook and command line examples) by @dloman118 in #553
  • [WIP] Peft Finetuning Quickstart Notebook by @mreso in #558
  • 4 notebooks ported from 4 DLAI agent short courses using Llama 3 by @jeffxtang in #560
  • [lamini] Add lamini text2sql memory tuning tutorial by @powerjohnnyli in #573
  • colab links fixed for dlai agents notebooks by @jeffxtang in #593
  • Port of DLAI LlamaIndex Agent short course lessons 2-4 to use Llama 3 by @jeffxtang in #594

Codebase Refactor

We also refactored our codebase to improve discoverability of our documentation and recipes

Misc improvements and bugfixes

New Contributors

Full Changelog: v0.0.2...v0.0.3

We would like to thank all who contributed to this release and are looking forward to future contributions!