Releases: ngxson/llama.cpp
b4034
llama : add <|tool_call|> formatting to Granite template (#10177)
Branch: GraniteToolCallTemplate
Signed-off-by: Gabe Goodhart <[email protected]>
b4033
ggml : fix arch check in bf16_to_fp32 (#10164)
b4027
cuda : clear error after changing peer access (#10153)
b4024
CANN : adjust for the backend registry refactor (#10158)
Remove `buffer->iface.get_name`, which was used by the CANN backend, as it was removed in the backend registry refactor PR.
b4023
sync : ggml
b4020
ggml : move CPU backend to a separate file (#10144)
b4019
metal : minor fixup in FA kernel (#10143)
- metal : minor fixup in FA kernel ggml-ci
- metal : use the unrolled loop variable
- metal : remove unused var
b4016
server : fix slot selection by lru (#10126)
- server : fix slot selection by lru, migrate lcs to `size_t`
- minor debug log fix
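The slot-selection fix above combines prefix matching with least-recently-used eviction. A minimal sketch of that policy, with a hypothetical simplified `Slot` struct (the real server state holds much more), taking `lcs` to mean the length of the prompt prefix shared with the incoming request:

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical simplified slot record; illustrative only.
struct Slot {
    int64_t t_last_used; // timestamp of last use (smaller = older)
    size_t  lcs;         // shared-prefix length with the incoming prompt
};

// Prefer the slot whose cached prompt shares the longest prefix with the
// request; break ties by evicting the least recently used slot.
// Returns -1 if there are no slots.
int pick_slot(const std::vector<Slot> & slots) {
    int best = -1;
    for (size_t i = 0; i < slots.size(); ++i) {
        if (best < 0 ||
            slots[i].lcs > slots[best].lcs ||
            (slots[i].lcs == slots[best].lcs &&
             slots[i].t_last_used < slots[best].t_last_used)) {
            best = (int) i;
        }
    }
    return best;
}
```

Storing `lcs` as `size_t` (rather than a signed int) matches the type tokenizer and container sizes use, avoiding sign-comparison pitfalls.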
b4014
llama : adjust default context size + print warnings (#10136)
- llama : adjust default context size + print warnings ggml-ci
- ggml-ci : add missing gpu-layers + adjust context sizes
b4013
simple-chat : only add bos on first prompt (#10129)
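The fix above ensures the BOS token is prepended only once per conversation: later turns append to a context that already begins with BOS. A minimal sketch under that assumption, with a hypothetical stand-in tokenizer (token id 1 for BOS is illustrative, not the actual vocabulary):

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-in tokenizer: maps bytes to ids and optionally
// prepends a BOS token. Real tokenization in llama.cpp differs.
std::vector<int> tokenize(const std::string & text, bool add_bos) {
    std::vector<int> toks;
    if (add_bos) {
        toks.push_back(1); // illustrative BOS token id
    }
    for (char c : text) {
        toks.push_back((int) (unsigned char) c);
    }
    return toks;
}

// In a chat loop, only the very first user turn gets BOS; duplicating
// it on later turns would corrupt the model's view of the context.
std::vector<int> encode_turn(const std::string & text, bool is_first_turn) {
    return tokenize(text, /*add_bos=*/is_first_turn);
}
```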