Make GSPO loss length-proportional by jlamypoirier · Pull Request #544 · ServiceNow/Fast-LLM

jlamypoirier · 2026-06-17T19:12:47Z

Codex GPT-5 note:

Summary

make GSPO apply segment loss once per labeled token instead of once per document
keep per-document label counts for the geometric-mean ratio and advantage computation
update the GSPO reference test to match the length-proportional loss

Rationale

PipelineRL DeepSpeed GSPO uses length-proportional sequence weighting when group_normalization=false: each segment contributes in proportion to its labeled-token count. Fast-LLM was still using mask / num_labels_in_seq as both the geometric-mean normalizer and the loss weight, making each document contribute uniformly regardless of length. This keeps num_labels_in_seq for the ratio/advantage means but uses the label mask as the loss/gradient weight.

Test

FAST_LLM_TEST_RESULTS_PATH=/tmp/fast_llm_tests/gspo_length_proportional /Users/joel.lamy-poirier/Projects/Fast-LLM/venv/bin/python -m pytest -v -n 4 tests/layers/test_lm_losses.py -k gspo

jlamypoirier added 2 commits June 17, 2026 15:11

Make GSPO loss length-proportional

68aadad

Add GSPO uneven-document regression test

17d3ccb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make GSPO loss length-proportional#544

Make GSPO loss length-proportional#544
jlamypoirier wants to merge 2 commits into
mainfrom
jlp_gspo-length-proportional

jlamypoirier commented Jun 17, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

jlamypoirier commented Jun 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Rationale

Test

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

jlamypoirier commented Jun 17, 2026 •

edited

Loading