Pull requests: THUDM/slime

Pull requests list

Add timeout configuration for on policy distillation HTTP session.
#1970 opened May 28, 2026 by qqwqqw689 Contributor
Fix PYTHONBUFFERED typo to PYTHONUNBUFFERED=1
#1967 opened May 27, 2026 by Chasing1020
fix: drop incorrect critic GPU add to rollout_num_gpus in colocate mode
#1950 opened May 27, 2026 by aoshen02
3 of 4 tasks
Optimize CP sequence KL communication for GSPO/OPSM
#1948 opened May 26, 2026 by zzdeae86
Add Trackio rollout trace logging
#1935 opened May 21, 2026 by abidlabs
Feat/minimax m2.5 support
#1929 opened May 21, 2026 by xs1997zju
fix: avoid applying rollout temperature to critic values
#1928 opened May 21, 2026 by Baiyu-Su
feat: add SFT entropy logging and validation loss monitoring
#1925 opened May 19, 2026 by none0663 Contributor
fix: make OPSM reject whole off-policy sequences
#1917 opened May 18, 2026 by haoyang9804
Support custom rollout-proxy TIS hooks in bypass mode
#1912 opened May 15, 2026 by sjtushenhai
[docs] fix reverse KL formula
#1911 opened May 14, 2026 by underspirit
fix: add eval-before-train to train_async.py (parity with train.py)
#1906 opened May 13, 2026 by Taosheng-ty
4 tasks done
feat: filter logits by loss_mask before log_probs/entropy computation
#1905 opened May 13, 2026 by Taosheng-ty
5 of 6 tasks
Filter zero-advantage samples in convert_samples_to_train_data
#1901 opened May 11, 2026 by nanjiangwill Collaborator
Add SwanLab tracking support
#1898 opened May 9, 2026 by asckaya
fix: add fallback for --save-hf when Megatron-Bridge lacks model support
#1881 opened Apr 30, 2026 by WangHong-yang Contributor
3 tasks done
feat(profile): safer torch.profiler defaults + per-grad-step capture
#1879 opened Apr 29, 2026 by leofan-lab Contributor