Skip to content

DeepSpeed parity hacks#502

Closed
bigximik wants to merge 25 commits into
mainfrom
gspo
Closed

DeepSpeed parity hacks#502
bigximik wants to merge 25 commits into
mainfrom
gspo

gspo: drop the temperature field, use base logits_scale_factor

227f8bc
Select commit
Loading
Failed to load commit list.