[Feature] Enable non-timm encoders with explicit embed_dim by kitewatermelon · Pull Request #426 · galilai-group/stable-pretraining

kitewatermelon · 2026-05-20T15:36:35Z

Description

Fixes two failure modes that prevented custom (non-timm) backbones from being used with SSL methods:

1. Missing .embed_dim attribute
Many encoders (e.g. MONAI ViT) do not expose the .embed_dim attribute that timm models provide. Passing such a module previously raised an AttributeError at construction time.

2. Hardcoded 2D dummy forward pass
Several CLS-token methods inferred embed_dim by running backbone(torch.zeros(1, 3, 224, 224)). This silently breaks any 3D encoder (e.g. one that takes volumetric MRI input of shape (1, 1, 96, 96, 96)).

Fix — explicit embed_dim parameter across 13 CLS-token methods:

# timm string → embed_dim inferred automatically (no change for existing users)
model = SimCLR("vit_base_patch16_224", projector_dims=(2048, 256))

# Custom nn.Module → embed_dim provided explicitly
model = SimCLR(my_monai_vit, projector_dims=(2048, 256), embed_dim=768)
model = SimCLR(my_3d_vit,    projector_dims=(2048, 256), embed_dim=512)
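
The resolution order described here (use the backbone's attribute when it exists, otherwise require the caller to pass a value) can be sketched in plain Python. This is a minimal illustration, not the library's code: `resolve_embed_dim`, `TimmLikeViT`, and `CustomVolumeEncoder` are hypothetical names, plain classes stand in for nn.Module backbones so the sketch runs without PyTorch, and letting an explicit `embed_dim` take precedence over the attribute is an assumption.

```python
from typing import Any, Optional


def resolve_embed_dim(backbone: Any, embed_dim: Optional[int] = None) -> int:
    """Hypothetical helper: choose the feature width used to size the projector."""
    # Assumption: an explicitly passed value wins over the backbone attribute.
    if embed_dim is not None:
        return int(embed_dim)
    # timm models expose .embed_dim; some custom modules may as well.
    inferred = getattr(backbone, "embed_dim", None)
    if inferred is not None:
        return int(inferred)
    # Neither source is available: fail early with an actionable message
    # instead of probing the backbone with a fixed-shape dummy input.
    raise ValueError(
        f"{type(backbone).__name__} has no .embed_dim attribute; "
        "pass embed_dim=<int> explicitly."
    )


class TimmLikeViT:
    """Stand-in for a timm model, which exposes .embed_dim."""

    embed_dim = 768


class CustomVolumeEncoder:
    """Stand-in for a custom 3D encoder that has no .embed_dim attribute."""


print(resolve_embed_dim(TimmLikeViT()))                # attribute is used
print(resolve_embed_dim(CustomVolumeEncoder(), 512))   # explicit value is used
```

Because no forward pass is run, the lookup does not depend on the input shape the backbone expects, which is what makes it usable for 3D encoders.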

Also — encoder_name API unification:
MaskedEncoder, IJEPA, MAE, and LeJEPA previously took the encoder argument as model_or_model_name. It is now named encoder_name, matching the other methods in the library.

Affected methods: BarlowTwins, BYOL, DINO, MoCov2, MoCov3, NNCLR, PIRL, SimCLR, SimSiam, SwAV, TiCO, VICReg, WMSE, MaskedEncoder, IJEPA, MAE, LeJEPA

Note: Although model_or_model_name may be a more descriptive name, encoder_name was chosen to match the existing convention already used across the majority of the codebase.

Together, encoder_name and embed_dim make the methods API more predictable and consistent for users bringing custom backbones.
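
The description does not say whether the old keyword is still accepted after the rename. For downstream code that still passes model_or_model_name, one option is a small wrapper on the caller's side. The sketch below is hypothetical (`resolve_encoder_arg` is not part of the library) and only shows the mapping from the old keyword to the new one with a deprecation warning.

```python
import warnings
from typing import Any, Optional


def resolve_encoder_arg(
    encoder_name: Optional[Any] = None,
    model_or_model_name: Optional[Any] = None,
) -> Any:
    """Hypothetical shim: accept the old keyword and return the encoder argument."""
    if model_or_model_name is not None:
        if encoder_name is not None:
            # Both spellings given: refuse rather than guess which one is meant.
            raise TypeError(
                "Pass encoder_name only; model_or_model_name is its old name."
            )
        warnings.warn(
            "model_or_model_name was renamed to encoder_name",
            DeprecationWarning,
            stacklevel=2,
        )
        return model_or_model_name
    if encoder_name is None:
        raise TypeError("encoder_name is required")
    return encoder_name


# New-style call sites pass straight through without a warning.
print(resolve_encoder_arg(encoder_name="vit_base_patch16_224"))
```

The returned value can then be forwarded as encoder_name to whichever method is being constructed.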

Checklist

I have read the Contributing document.
The documentation is up-to-date with the changes I made (check build artifacts).
All tests passed, and additional code has been covered with new tests.
I have added the PR to the RELEASES.rst file.

- Rename encoder_name → model_or_model_name to reflect dual str/Module input (consistency with mae and ijepa)
- Infer embed_dim automatically for timm str; require explicit embed_dim for custom encoder
- Add embed_dim and model_or_model_name params to docstring

…methods
- Add optional `embed_dim` parameter to all CLS-token SSL methods (SimCLR, BYOL, DINO, BarlowTwins, SwAV, MoCov2, MoCov3, SimSiam, WMSE, VICReg, TiCO, NNCLR, PIRL): inferred automatically from backbone.embed_dim for timm strings, or from the module attribute if present, with a clear error when neither is available. This enables custom encoders without .embed_dim
- Replace all BYOL-style dummy forward passes (torch.zeros(1,3,224,224)) with direct backbone.embed_dim access, removing 2D input shape hardcoding
- Rename model_or_model_name → encoder_name in IJEPA, MAE, MaskedEncoder, and all related tests for consistency with the rest of the library

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

kitewatermelon added 4 commits May 15, 2026 18:45

Merge branch 'galilai-group:main' into main

232544a

[BugFix] model_or_model_name to encoder_name in LeJEPA

00d82e8

kitewatermelon requested a review from RandallBalestriero as a code owner May 20, 2026 15:36

kitewatermelon and others added 4 commits May 21, 2026 16:33

[Docs] Add embed_dim param docstring to all CLS-token SSL methods

9c142be

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

[BugFix] embed_dim type annotation int -> Optional[int] in LeJEPA

affe31a

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Merge branch 'galilai-group:main' into main

35bf935

Merge branch 'galilai-group:main' into main

b3fde26

kitewatermelon wants to merge 8 commits into galilai-group:main from kitewatermelon:main
