Skip to content

v0.1.4

Latest
Compare
Choose a tag to compare
@pculliton pculliton released this 25 Mar 18:04

What's Changed

  • Refactor Gemma ctor and improve pool NUMA support by @copybara-service in #520
  • Fix the prompt wrapping of gemma3-1b by @ufownl in #523
  • Add note on attention length and SFP by @copybara-service in #521
  • Add support for a secondary EOS token by @copybara-service in #525
  • Update app argument documentation by @copybara-service in #526
  • Set the secondary EOS for Gemma2 by @ufownl in #527

Full Changelog: v0.1.3...v0.1.4