Pull requests: modelscope/ms-swift

Pull requests list

[bugfix] fix acc metrics
#7026 opened Dec 12, 2025 by Jintao-Huang
[bugfix] fix missing generate method for InternVL-2.5
#7019 opened Dec 12, 2025 by xwy-bit
1 of 4 tasks
collect npu profiling data
#6977 opened Dec 10, 2025 by OneMondy
1 of 4 tasks
[feat] Add Support Cut-Cross-Entropy (CCE)
#6971 opened Dec 9, 2025 by w1ida
[megatron] Update megatron shells
#6967 opened Dec 9, 2025 by Jintao-Huang
support deepspeed elastic
#6955 opened Dec 8, 2025 by meichangsu1
2 of 4 tasks
[WIP] [v4] refactor model_type & template
#6944 opened Dec 8, 2025 by Jintao-Huang
add muon clip optimizer
#6662 opened Nov 19, 2025 by vx120
1 task
Add conditional distillation support for GKD trainer
#6542 opened Nov 11, 2025 by woshixiaobai2019
3 tasks
[WIP][Exp]Support ray dpo
#6395 opened Nov 1, 2025 by tastelikefeet
1 of 4 tasks
[megatron] update megatron_args default_val
#6252 opened Oct 22, 2025 by Jintao-Huang
feat: Enable for exporting unmerged HF Lora Adapter
#6225 opened Oct 20, 2025 by jason9693
1 of 4 tasks
[WIP] refactor template
#6085 opened Oct 11, 2025 by Jintao-Huang
update docs
#5691 opened Sep 6, 2025 by Jintao-Huang
[model] update minicpmv-4.5 video processor (label: stale)
#5679 opened Sep 5, 2025 by hjh0119
Bug fix: eval OOM due to deepcopy of torch model (label: stale)
#5607 opened Aug 29, 2025 by hellopahe
1 task done
[init]support gptq grpo in colocate mode (label: stale)
#5569 opened Aug 27, 2025 by ItGirls
1 of 4 tasks
Update dataset_info.json (label: stale)
#3723 opened Mar 31, 2025 by sandeep-sm
3 tasks
[WIP] support reasoning_content
#3159 opened Feb 18, 2025 by Jintao-Huang
loss_scale bug when meeting <image>
#3036 opened Feb 8, 2025 by mangoyuan Draft
1 of 4 tasks