-
Notifications
You must be signed in to change notification settings - Fork 12.2k
Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration #10133
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
33 commits
Select commit
Hold shift + click to select a range
f66c75a
rwkv6: rename to wkv6
zhiyuan1i b4254c5
rwkv6: support avx2 avx512 armv8 armv9
zhiyuan1i e198f7b
rwkv6: update cuda file name
zhiyuan1i 3f75f12
rwkv6: rename params
zhiyuan1i 2fc42b6
wkv on sycl
zhiyuan1i bee1cec
sycl: add some ops
zhiyuan1i 1c58096
sycl: Enhance OP support judgment
zhiyuan1i 042c3e0
Merge branch 'ggerganov:master' into master
zhiyuan1i 811aa87
wkv6: drop armv9 and tranfer to GGML style
zhiyuan1i 4d26631
flake.lock: Update (#10146)
ggerganov b189630
metal : minor fixup in FA kernel (#10143)
ggerganov 89812b1
ggml : move CPU backend to a separate file (#10144)
slaren 8050d02
metal : fix minor string leaks (ggml/1004)
pminev eb5711c
cmake : make it possible linking ggml as external lib (ggml/1003)
ykhrustalev 153251f
sync : ggml
ggerganov 5f79214
Merge branch 'ggerganov:master' into master
zhiyuan1i 61c665b
fix: update changes to upstream
zhiyuan1i 9ea34a7
fix: add defualt
zhiyuan1i 8c7b4ec
Update ggml/src/ggml-sycl/outprod.cpp
zhiyuan1i bb0685f
Update ggml/src/ggml-sycl/wkv6.cpp
zhiyuan1i 81cb301
update the function to use appropriate types
zhiyuan1i a878502
fix define error
zhiyuan1i b816024
Update ggml/src/ggml-cpu.c
zhiyuan1i 72e4432
add appropriate asserts
zhiyuan1i 35a1a2d
move element-wise functions outside
zhiyuan1i 6a1e977
Update ggml/src/ggml-sycl/concat.cpp
zhiyuan1i a749ba7
put the declaration outside the loop
zhiyuan1i 4693b46
rewrite to be more inline with the common pattern for distributing th…
zhiyuan1i 4574795
use recommended way GGML_TENSOR_LOCALS
zhiyuan1i acb1b9d
Merge branch 'ggerganov:master' into master
zhiyuan1i e264c35
remove some codes
zhiyuan1i 623db3b
update lint
zhiyuan1i 98e070c
Merge branch 'ggerganov:master' into master
zhiyuan1i File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file was deleted.
Oops, something went wrong.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,5 @@ | ||
#include "common.cuh" | ||
|
||
#define CUDA_WKV_BLOCK_SIZE 64 | ||
|
||
void ggml_cuda_op_rwkv_wkv6(ggml_backend_cuda_context & ctx, ggml_tensor * dst); |
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.