-
Notifications
You must be signed in to change notification settings - Fork 598
Pull requests: pytorch/FBGEMM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
add ckpt and restore with feature evict metaheader
cla signed
#4342
opened Jun 13, 2025 by
lalala-2
Loading…
Fixing reading from EmbeddingRocksDB connection
cla signed
fb-exported
#4341
opened Jun 13, 2025 by
Raahul46
Loading…
Making create_rocksdb_hard_link_snapshot function a no_op
cla signed
fb-exported
#4340
opened Jun 13, 2025 by
Raahul46
Loading…
Implement a stat library for fbgemm embedding
cla signed
fb-exported
#4339
opened Jun 13, 2025 by
Kaiweitu
Loading…
[fbgemm_gpu] Upgrade benchmark workflows
cla signed
module: rocm
#4337
opened Jun 12, 2025 by
q10
Loading…
Add initial version of TuningCache and scripts for heuristic + kernel (#4289)
cla signed
fb-exported
#4336
opened Jun 12, 2025 by
cthi
Loading…
Adding a separate utils file for KVTensorMetaData (#4298)
cla signed
fb-exported
#4335
opened Jun 12, 2025 by
Raahul46
Loading…
Fix int_nbit inference int8 nobag kernel meta function
cla signed
fb-exported
#4333
opened Jun 12, 2025 by
spcyppt
Loading…
Tune FP8 grouped GEMM for Llama4 shapes
cla signed
fb-exported
#4326
opened Jun 11, 2025 by
jiawenliu64
Loading…
fix output dtype issue in merge_pooled_embeddings when input tensors are all empty
cla signed
fb-exported
#4325
opened Jun 11, 2025 by
842974287
Loading…
NVFP4 quantization emulation kernels as reference
cla signed
fb-exported
#4324
opened Jun 11, 2025 by
summerdengfb
Loading…
Use local counter for TBE boundary check warinings to improve performance
cla signed
fb-exported
#4316
opened Jun 10, 2025 by
yoyoyocmu
Loading…
Support prefetch pipeline in bounds_check_indices
cla signed
fb-exported
#4312
opened Jun 9, 2025 by
sryap
Loading…
Improve heuristic for Cutlass FP8 Grouped GEMM
cla signed
fb-exported
#4309
opened Jun 9, 2025 by
cthi
Loading…
Support tuning cache for Cutlass FP8 Grouped GEMM
cla signed
fb-exported
#4308
opened Jun 9, 2025 by
cthi
Loading…
[fbgemm_gpu] TBE microbenchmark upgrades
cla signed
module: rocm
#4307
opened Jun 9, 2025 by
q10
Loading…
tbe cpu nobag dispatch and backward pass kernel impl
cla signed
fb-exported
#4303
opened Jun 9, 2025 by
yabalaban
Loading…
tbe cpu nobag dispatch and forward pass kernel impl
cla signed
fb-exported
#4302
opened Jun 9, 2025 by
yabalaban
Loading…
Support tuning cache for Cutlass FP8 GEMM
cla signed
fb-exported
#4301
opened Jun 9, 2025 by
cthi
Loading…
Add new kernels for Cutlass BF16 grouped GEMM for tuning cache
cla signed
fb-exported
#4300
opened Jun 9, 2025 by
cthi
Loading…
Support tuning cache for Cutlass BF16 grouped GEMM
cla signed
fb-exported
#4299
opened Jun 9, 2025 by
cthi
Loading…
put feature_evict definition in cpp file
cla signed
fb-exported
#4294
opened Jun 8, 2025 by
chenyuzhcy
Loading…
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.