Pull requests: NVIDIA/Fuser
Fix expected BAR instructions in AmpereModifiersSharedMemoryEpilogue
#3663
opened Jan 2, 2025 by
jacobhinkle
Generalize IRBFSWithPermissiveDependence to BFSWithPermissiveDependence
#3662
opened Dec 31, 2024 by
naoyam
EmbeddingOp node with same functionality as F.embedding
#3649
opened Dec 26, 2024 by
Priya2698
Split Hopper MMA by warp-tile before instruction tile
#3642
opened Dec 24, 2024 by
jacobhinkle
Ring Allgather + GEMM Overlap HostIR Implementation
Multi-GPU
#3626
opened Dec 20, 2024 by
nsarka
cacheInputs propagates allocation only for matmul schedulers.
#3621
opened Dec 19, 2024 by
wujingyue
Support outer reduction scheduler with SOL autotuning
Autotune
Generate heuristics through machine learning models.
Lower distributed matmul to pipelined algorithm for fine-grained overlap
Multi-GPU
#3606
opened Dec 18, 2024 by
samnordmann
2 tasks done
[wgmma] Insert commit_group and wait_group after mma_async
Matmuls
#3573
opened Dec 11, 2024 by
jacobhinkle
Draft