What's Changed
- Refactor Gemma ctor and improve pool NUMA support by @copybara-service in #520
- Fix the prompt wrapping of gemma3-1b by @ufownl in #523
- Add note on attention length and SFP by @copybara-service in #521
- Add support for a secondary EOS token by @copybara-service in #525
- Update app argument documentation by @copybara-service in #526
- Set the secondary EOS for Gemma2 by @ufownl in #527
Full Changelog: v0.1.3...v0.1.4