@kmehant commented on Nov 24, 2025

In this PR, we add context parallel (CP) support for mamba by swapping the HF mamba module with a CP-capable implementation (see the external dependencies summarized below).

## Results

### Legend

| abbreviation | meaning |
| --- | --- |
| cp | context parallel degree |
| ep | expert parallel degree |
| dp | data parallel degree |
| gas | gradient accumulation steps |
| ebs | effective batch size |
| s | sequence length |

## Ablations

### Parity Experiments

| model | experiment setting | loss | tps per GPU |
| --- | --- | --- | --- |
| ibm-granite/granite-4.0-h-tiny | cp8-ebs4-s8192-gas1 | 0.8059140625 | 973.6 |
| ibm-granite/granite-4.0-h-tiny | cp8-ebs4-s8192-gas1-ep8 | 0.80224609375 | 2367.6 |
| ibm-granite/granite-4.0-h-tiny | cp8-ebs4-s8192-gas2 | 0.8059765625 | NA |
| ibm-granite/granite-4.0-h-tiny | cp4-dp2-ebs4-s8192-gas1 | 0.802953125 | 953.4 |
| ibm-granite/granite-4.0-h-tiny | cp1-dp4-ep4-ebs4-s8192-gas1 | 0.7967056884765625 | 2576 |

### Long Context (sequence length 131072, i.e. 128k)

| model | experiment setting | tps per GPU | GPU memory util ratio |
| --- | --- | --- | --- |
| ibm-granite/granite-4.0-h-tiny | cp8-ebs1-s131072-gas1-ep8 | 1462.8 | 0.5140136719 |
| ibm-granite/granite-4.0-h-small | cp8-ebs1-s131072-gas1-ep8 | 682.7 | 0.9887207031 |

## Training Resumption

Settings used: mk-cp8-ebs4-s8192-gas1

*(Screenshot of the resumed run attached in the PR, captured 2025-11-28 at 1:25 PM.)*

## Summary of external dependencies

fsdp2-nov: https://github.com/kmehant/transformers.git (additional changes not in upstream)

Changes:

  1. Preparing batches to be compatible with CP when CP is enabled.
  2. Wrapping the training loop with the torch CP context (see the sketch after this list).
  3. Preparing shifted labels for correct loss calculation.
  4. Specific loss reduction when a combination of CP and DP is used.
  5. Model saving fix.
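As a concrete picture of items 1 to 4, here is a minimal sketch, assuming torch's experimental `context_parallel` API, a 1-D `DeviceMesh` named `cp_mesh` over the CP ranks, and an HF-style causal LM. `cp_train_step`, `batch`, and `IGNORE_INDEX` are placeholder names, and this is not the fork's actual code.

```python
# Minimal sketch of items 1-4; illustrative only, not the fork's implementation.
import torch
import torch.distributed as dist
import torch.nn.functional as F
from torch.distributed.tensor.experimental import context_parallel

IGNORE_INDEX = -100  # HF's default ignored label id


def cp_train_step(model, batch, cp_mesh):
    input_ids, labels = batch["input_ids"], batch["labels"]  # (bs, seq)

    # (3) Shift labels BEFORE sequence sharding, so the target token sitting
    # just past each shard boundary is not lost to a per-shard shift later.
    shift_labels = F.pad(labels, (0, 1), value=IGNORE_INDEX)[:, 1:]

    # (1)+(2) context_parallel shards the listed buffers in place along their
    # sequence dim and routes attention through the CP-aware implementation.
    with context_parallel(cp_mesh,
                          buffers=[input_ids, shift_labels],
                          buffer_seq_dims=[1, 1]):
        logits = model(input_ids).logits  # local shard: (bs, seq/cp, vocab)
        loss_sum = F.cross_entropy(logits.flatten(0, 1), shift_labels.flatten(),
                                   ignore_index=IGNORE_INDEX, reduction="sum")

        # (4) Normalize by the GLOBAL number of label tokens, not the local
        # shard's count. With CP x DP the exact scaling must also match how
        # gradients are reduced across ranks, which is what the fork handles.
        n_tokens = (shift_labels != IGNORE_INDEX).sum()
        dist.all_reduce(n_tokens, group=cp_mesh.get_group())
        loss = loss_sum / n_tokens
        loss.backward()
    return loss.detach()
```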

fsdp2-fix: https://github.com/kmehant/accelerate.git (additional changes not in upstream)

Changes:

  1. Mixed precision fix when using FSDP2 (a sketch of the policy it configures follows).
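For context, a typical FSDP2 mixed-precision setup looks like the sketch below, assuming torch >= 2.6 (where `fully_shard` and `MixedPrecisionPolicy` are exported from `torch.distributed.fsdp`) and a decoder-style `model` with a `model.layers` ModuleList; the fork's actual fix lives in accelerate's FSDP2 plumbing and is not reproduced here.

```python
# Sketch of the FSDP2 mixed-precision policy the fix concerns; assumes a
# decoder-style `model` and an initialized process group. Illustrative only.
import torch
from torch.distributed.fsdp import MixedPrecisionPolicy, fully_shard

mp_policy = MixedPrecisionPolicy(
    param_dtype=torch.bfloat16,   # params are cast to bf16 for compute/comm
    reduce_dtype=torch.float32,   # gradients are reduced in fp32 for stability
)
for block in model.layers:        # shard each transformer block...
    fully_shard(block, mp_policy=mp_policy)
fully_shard(model, mp_policy=mp_policy)  # ...then the root module
```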


mamba-cp: https://github.com/kmehant/fms-acceleration.git (will be main after merging #164)

  1. Enables CP for mamba layers to go hand in hand with self-attention CP (see the module-swap sketch below).
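The swap itself follows the usual recursive module-replacement pattern. The sketch below is generic and hypothetical: `should_swap` and `build_replacement` are placeholders, and the real CP-capable mamba class and its constructor live in the fms-acceleration plugin.

```python
# Generic module-swap helper of the kind such a plugin uses; illustrative only.
import torch.nn as nn


def swap_modules(model: nn.Module, should_swap, build_replacement) -> nn.Module:
    """Replace every submodule for which should_swap(module) is True."""
    for name, module in list(model.named_modules()):
        if name and should_swap(module):
            parent_name, _, child_name = name.rpartition(".")
            parent = model.get_submodule(parent_name) if parent_name else model
            setattr(parent, child_name, build_replacement(module))
    return model


# Hypothetical usage: swap HF mamba mixers for a CP-aware drop-in.
# `CPMambaMixer` is a stand-in name, not the plugin's real class.
# swap_modules(model,
#              should_swap=lambda m: "Mamba" in type(m).__name__,
#              build_replacement=lambda m: CPMambaMixer.from_hf(m, cp_group))
```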

mamba-cp: https://github.com/garrett361/mamba (thanks to Garrett)

  1. The mamba_ssm kernels must be installed from this fork and branch to leverage CP.

## Summary of PRs merged into HF repos to enable CP and FSDP2

  1. feat: add ignored_params support for fsdp2 huggingface/accelerate#3731
  2. fix: CPU RAM efficient loading for nd or HSDP parallelisms huggingface/accelerate#3740
  3. feat: allow mixed precision policy as dtype huggingface/accelerate#3751
  4. refactor: nit change for get_parameters_from_modules (code debt) huggingface/accelerate#3815
  5. nit: needed sanity checks for fsdp2 huggingface/accelerate#3499
  6. feat: support tensor parallel & Data loader huggingface/accelerate#3173 (dataloader part is reused for CP)
  7. fix: fsdp sharded state dict wont work for save_only_model knob huggingface/transformers#36627

@kmehant changed the title from "CP support for mamba layer" to "feat: CP support for mamba layer" on Nov 24, 2025.
@kmehant force-pushed the mamba-cp branch 2 times, most recently from 3c0f767 to 1990451 on November 27, 2025 17:40.
Review comment (Collaborator): can we remove this?

Signed-off-by: Mehant Kammakomati <[email protected]>
@kmehant merged commit d451073 into foundation-model-stack:main on Nov 28, 2025. 9 checks passed.