-
Notifications
You must be signed in to change notification settings - Fork 744
Pull requests: pytorch/FBGEMM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Make float16/bfloat16 distinct types
cla signed
#5736
opened May 5, 2026 by
cyyever
Contributor
Loading…
Switch bf16 quantize ops to at::kBFloat16
cla signed
#5735
opened May 5, 2026 by
cyyever
Contributor
Loading…
TLX IKBO FA benchmarking with latest commit hash + bug fix (#5734)
cla signed
fb-exported
meta-exported
#5734
opened May 5, 2026 by
liptds
Contributor
Loading…
make stats logging a dictionary
cla signed
fb-exported
meta-exported
#5733
opened May 5, 2026 by
xywang9334
Loading…
Add KVZCH inference read-time hit rate metrics via fb303 ODS counters (#5730)
cla signed
fb-exported
meta-exported
#5730
opened May 4, 2026 by
hy-NJU
Loading…
remove uneccesarry field for FixedBlockPool in inference (#5729)
cla signed
fb-exported
meta-exported
#5729
opened May 4, 2026 by
hy-NJU
Loading…
Add SVE-FP16 version of EmbeddingSpMDMNbit (#5728)
cla signed
fb-exported
meta-exported
#5728
opened May 4, 2026 by
ShuyangLiu
Loading…
Enable AMD tests for ZCH & Fix OSS
cla signed
fb-exported
meta-exported
#5727
opened May 4, 2026 by
Ali-Tehrani
Contributor
Loading…
Enable device-side assertions on ROCm
ciflow/rocm
cla signed
module: rocm
#5723
opened May 2, 2026 by
cyyever
Contributor
Loading…
Fix TalnetAudioClassifier_Tests in asan ubsan mode
cla signed
fb-exported
meta-exported
#5719
opened Apr 30, 2026 by
gbahimeta
Loading…
Optimize jagged_unique_indices_cuda (binary-search length + custom cub pipeline)
cla signed
fb-exported
meta-exported
#5718
opened Apr 30, 2026 by
AlbertDachiChen
Contributor
Loading…
fbcode/deeplearning/fbgemm/fbgemm_gpu/test/tbe/cache
cla signed
fb-exported
meta-exported
#5717
opened Apr 30, 2026 by
meta-codesync
Bot
Loading…
FBGEMM: Add JK gate to bypass batch_index_select_dim0 backward CTA path
cla signed
fb-exported
meta-exported
#5716
opened Apr 30, 2026 by
Zhihan-Lu
Loading…
Fix pyspark_deps.par crash (coredump f160mb8e201359l3) (#5713)
cla signed
fb-exported
meta-exported
#5713
opened Apr 29, 2026 by
excelle08
Contributor
Loading…
TBE backward hip_mixed_d warp kernel for ROCm (#5074)
ciflow/rocm
cla signed
fb-exported
meta-exported
module: rocm
#5712
opened Apr 29, 2026 by
spcyppt
Contributor
Loading…
TBE backward CTA kernel optimization for ROCm (#5711)
ciflow/rocm
cla signed
fb-exported
meta-exported
module: rocm
#5711
opened Apr 29, 2026 by
spcyppt
Contributor
Loading…
Add OSS benchmark scripts for TBE training benchmarks
cla signed
fb-exported
meta-exported
#5710
opened Apr 29, 2026 by
spcyppt
Contributor
Loading…
Fix backward cache flush on col_tile change in group_index_select
cla signed
fb-exported
meta-exported
module: rocm
#5707
opened Apr 28, 2026 by
q10
Contributor
Loading…
Fix sign-extension bug in fbgemm MX4 Python reference dequantize (#5706)
cla signed
fb-exported
meta-exported
#5706
opened Apr 28, 2026 by
purvisa-at-meta
Loading…
Fix sign-extension bug in fbgemm MX4 Triton dequantize kernel (#5705)
cla signed
fb-exported
meta-exported
#5705
opened Apr 28, 2026 by
purvisa-at-meta
Loading…
Add unit tests for FusedNBitRowwise fp32 intermediate precision and fp16→int4→bf16 roundtrip
cla signed
fb-exported
meta-exported
#5703
opened Apr 28, 2026 by
zhaozhul
Contributor
Loading…
Remove GPU sync stalls in _prefetch zero-row invalidation
cla signed
#5699
opened Apr 27, 2026 by
EddyLXJ
Contributor
Loading…
[ROCm] support warpSize 32 and 64 in the same build
ciflow/rocm
cla signed
module: rocm
#5696
opened Apr 25, 2026 by
jeffdaily
Loading…
Remove unnecessary if __name__ == "__main__": unittest.main() boilerplate in deeplearning/fbgemm/fbgemm_gpu/test (#5689)
cla signed
fb-exported
meta-exported
#5689
opened Apr 24, 2026 by
meta-codesync
Bot
Loading…
Add diagnostic output to debug OSS CI torch import failure
cla signed
fb-exported
meta-exported
#5686
opened Apr 23, 2026 by
gchalump
Contributor
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.