-
Notifications
You must be signed in to change notification settings - Fork 11
Pull requests: AMD-AGI/TraceLens
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add deterministic marker identification evals
#656
opened May 21, 2026 by
Ahmedhasssan-aig
Contributor
Loading…
3 tasks done
fix(jaxopkeys): route TE-adjacent buffer-init kernels to TE
#655
opened May 20, 2026 by
gphuang
Contributor
Loading…
3 tasks done
Changes to TraceLens files for comparative analysis
#654
opened May 19, 2026 by
unagiboi
Collaborator
Loading…
feat(perfmodel): add varlen aiter FA + aten::_flash_attention_forward perf models (Wan 2.2 coverage)
#651
opened May 19, 2026 by
gphuang
Contributor
Loading…
4 tasks done
fix: ops_summary shows GPU kernel names for orphan HIP launches (#641)
#642
opened May 14, 2026 by
gabeweisz
Collaborator
Loading…
feat(perfmodel): add primus_turbo and aiter MXFP4 GEMM and quantize perf models
#638
opened May 11, 2026 by
gphuang
Contributor
Loading…
6 tasks done
Include atleast one prefill step in the split if available (Bug fix)
#636
opened May 8, 2026 by
devalshahamd
Contributor
Loading…
PerfModel: simplify torch op categorization registry
#630
opened May 6, 2026 by
ajassani
Collaborator
Loading…
Support fp8 quantization for KV cache in Inference perf models
#625
opened May 6, 2026 by
devalshahamd
Contributor
Loading…
Feat/add gpu op uids to report
enhancement
New feature or request
#624
opened May 5, 2026 by
unagiboi
Collaborator
Loading…
Add op category breakdown to perf report comparison (#331)
#622
opened May 5, 2026 by
ajassani
Collaborator
Loading…
2 of 4 tasks
Perfmodel for torch.compile (triton)
enhancement
New feature or request
#621
opened May 5, 2026 by
janmatai
Contributor
Loading…
Runtime improvement for pseudo op extension
#617
opened Apr 30, 2026 by
devalshahamd
Contributor
Loading…
fix(treeperf): preserve kernel time when perf model fails
#611
opened Apr 29, 2026 by
unagiboi
Collaborator
Loading…
EventReplay: extend beyond aten ops with auto-import and custom initializers
#607
opened Apr 28, 2026 by
ajassani
Collaborator
Loading…
2 tasks done
Adding Claude.md file for Tracelens
documentation
Improvements or additions to documentation
#606
opened Apr 28, 2026 by
janmatai
Contributor
Loading…
fix(trace2tree): attach "bleeding" host events to a containing ancestor
#603
opened Apr 23, 2026 by
unagiboi
Collaborator
Loading…
[Feat] Idle Time Analysis for the GPU Kernel Stream
#601
opened Apr 22, 2026 by
spandoesai
Collaborator
•
Draft
Fix/trace2tree: fix cross-rank GPU attribution and merged-trace hang
#577
opened Apr 1, 2026 by
brieflynn
Contributor
Loading…
456 add support for traces generated by jax 08 suppress warning
#572
opened Mar 28, 2026 by
devalshahamd
Contributor
Loading…
Add perf model coverage for DeepEP EP communication ops
perf_model
Add performance model for calculating TFLOPS/s and TB/s
#520
opened Mar 10, 2026 by
gphuang
Contributor
Loading…
get_df_xla_perf raises KeyError for FP8 and s8 dtypes in XLA kernel operands
#447
opened Dec 12, 2025 by
brieflynn
Contributor
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.