Cross Zamirski Model Training by MattsonCam · Pull Request #26 · WayScience/nuclear_speckles_analysis

MattsonCam · 2026-06-08T21:11:35Z

This PR adds the Cross-Zamirski model and the structure needed for training. It also includes per-batch logging and removes irrelevant code. Because of the CUDA memory constraint, the batch size has been reduced, and this code may change in the future to allow training on Alpine.

wasserstein GAN GP models

Wrap the discriminator-step generator forward in torch.no_grad() and remove the now-redundant detach on the fake samples passed to the critic loss. This preserves the two-step WGAN-GP training behavior while avoiding construction of an unnecessary generator autograd graph during the critic update, reducing memory and compute overhead.
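The effect described above can be checked with a minimal sketch. The two `nn.Linear` modules are hypothetical stand-ins for the PR's generator and critic, and the gradient penalty is omitted for brevity, so this is an illustration of the `torch.no_grad()` behavior under those assumptions rather than the trainer's actual code.

```python
import torch
from torch import nn

torch.manual_seed(0)

# Hypothetical stand-ins for the PR's generator and critic.
generator = nn.Linear(4, 4)
critic = nn.Linear(4, 1)

inputs = torch.randn(8, 4)
targets = torch.randn(8, 4)

# Critic step: run the generator without recording an autograd graph.
with torch.no_grad():
    fake = generator(inputs)

# Plain Wasserstein critic term (gradient penalty omitted in this sketch).
critic_loss = critic(fake).mean() - critic(targets).mean()
critic_loss.backward()

# Only the critic receives gradients; the generator was never part of the graph.
critic_has_grads = all(p.grad is not None for p in critic.parameters())
generator_has_grads = any(p.grad is not None for p in generator.parameters())
print(fake.requires_grad, critic_has_grads, generator_has_grads)
```

Because `fake` is created inside `torch.no_grad()`, it already has `requires_grad=False`, which is why a separate `.detach()` on the fake samples becomes redundant.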

review-notebook-app · 2026-06-08T21:11:39Z

Check out this pull request on ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

wli51

LGTM! Maybe the trainer should support different stepping frequencies for the discriminator and the generator?

wli51 · 2026-06-08T21:45:37Z

+                with torch.no_grad():
+                    fake_targets_for_discriminator = self.image_postprocessor(
+                        self.generator(inputs)
+                    )
+                discriminator_outputs = self.discriminator_loss(
+                    critic=self.discriminator,
+                    real_samples=targets,
+                    fake_samples=fake_targets_for_discriminator,
+                )
+                discriminator_loss, discriminator_components = self._detach_components(
+                    discriminator_outputs
+                )
+
+                self.discriminator_optimizer.zero_grad()
+                discriminator_loss.backward()
+                self.discriminator_optimizer.step()
+
+                generated_predictions = self.image_postprocessor(self.generator(inputs))
+                fake_classification_outputs = self.discriminator(generated_predictions)
+                generator_outputs = self.generator_loss(
+                    fake_classification_outputs=fake_classification_outputs,
+                    generated_predictions=generated_predictions,
+                    targets=targets,
+                    epoch=epoch,
+                    loss_mask=batch_data.get("loss_mask"),
+                )
+                generator_loss, generator_components = self._detach_components(
+                    generator_outputs
+                )
+
+                self.generator_optimizer.zero_grad()
+                generator_loss.backward()
+                self.generator_optimizer.step()


I don't remember the Cross-Zamirski trainer implementation that well. Is an equal update frequency for both networks what they decided on? I believe that in classical WGAN training the discriminator is updated more frequently than the generator for stability.

I think you're right; I will update this. I know it degrades the loss contribution by normalizing by the epoch.
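One way the suggested update ratio could be expressed is as a small scheduling helper, sketched here. The names `update_schedule` and `n_critic` are hypothetical and do not come from the PR's trainer; the default of 5 critic steps per generator step is the value commonly used with WGAN-GP, not something this thread settles on.

```python
from typing import Iterator, Tuple


def update_schedule(num_batches: int, n_critic: int = 5) -> Iterator[Tuple[int, str]]:
    """Yield (batch_index, role) pairs: the critic steps on every batch,
    the generator only on every n_critic-th batch."""
    if n_critic < 1:
        raise ValueError("n_critic must be at least 1")
    for batch_index in range(num_batches):
        yield batch_index, "critic"
        if (batch_index + 1) % n_critic == 0:
            yield batch_index, "generator"


schedule = list(update_schedule(num_batches=10, n_critic=5))
critic_steps = sum(role == "critic" for _, role in schedule)
generator_steps = sum(role == "generator" for _, role in schedule)
print(critic_steps, generator_steps)  # 10 2
```

With `n_critic=1` the schedule reduces to the one-to-one alternation in the diff above, so a trainer built around such a helper would keep the current behavior as a special case.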

Cameron Mattson added 8 commits June 8, 2026 13:21

This branch is for the cross zamirski

54fdc40

wasserstein GAN GP models

Add unconditional WGAN-GP training stack

2d0814f

Remove obsolete L1 trainer path

8463f27

Reduce Optuna max batch size

550f48d

Updated to mention batch size purpose when splitting

53d29e5

Removed old amp references

18feba2

Changed batch size to accommodate memory constraint

8b43851

MattsonCam requested a review from wli51 June 8, 2026 21:14

gwaybio approved these changes Jun 8, 2026

wli51 approved these changes Jun 8, 2026

MattsonCam wants to merge 8 commits into wgan_gp_cross_zamirski from wgan_gp_cross_zamirski_review

MattsonCam Jun 10, 2026
