Fix DiceFocalLoss to apply activation before removing background by luomi16 · Pull Request #8947 · Project-MONAI/MONAI

luomi16 · 2026-06-24T20:11:16Z

Summary

This PR fixes a bug in DiceFocalLoss where using include_background=False with softmax=True or sigmoid=True for binary segmentation would cause the activation function to be ignored.

Problem

When using include_background=False with softmax=True in DiceFocalLoss for binary segmentation, the Dice part would produce this warning and not apply softmax when calculating the Dice loss:

UserWarning: single channel prediction, `softmax=True` ignored.

This happened because the background channel was removed before the activation was applied, leaving only one channel for the Dice loss to process.

Solution

The fix applies the activation (softmax/sigmoid/other_act) before removing the background channel, ensuring that the activation is applied to all channels as intended.

Changes made:

Store sigmoid, softmax, and other_act as instance variables in DiceFocalLoss.__init__()
Apply activation in forward() before removing background
Disable activation in the internal DiceLoss instance to avoid double application

Testing

All existing tests pass
Verified that the fix produces correct results by comparing with manual computation
Verified that no warnings are produced when using include_background=False with softmax=True or sigmoid=True

Related Issue

Fixes: #5697

When using include_background=False with softmax=True or sigmoid=True in DiceFocalLoss for binary segmentation, the activation was being ignored because the background channel was removed before the activation was applied. This fix applies the activation (softmax/sigmoid/other_act) BEFORE removing the background channel, ensuring that the activation is applied to all channels as intended. The fix: 1. Stores sigmoid, softmax, and other_act as instance variables 2. Applies activation in forward() before removing background 3. Disables activation in the internal DiceLoss instance to avoid double application Fixes: Project-MONAI#5697

coderabbitai · 2026-06-24T20:11:38Z

📝 Walkthrough

Walkthrough

DiceFocalLoss now builds its internal DiceLoss with activation disabled, stores its own sigmoid/softmax/other_act settings, and applies that activation in forward before optionally removing the background channel. The one-hot target conversion and single-channel handling remain unchanged.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~10 minutes

🚥 Pre-merge checks | ✅ 3 | ❌ 2

❌ Failed checks (2 warnings)

Check name	Status	Explanation	Resolution
Linked Issues check	⚠️ Warning	The PR fixes DiceFocalLoss activation handling, not the macOS quick-py3 sampler failure in [`#5697`].	Update the PR to reproduce and fix the DistributedWeightedRandomSamplerTest failure, then confirm quick-py3 on macOS passes.
Out of Scope Changes check	⚠️ Warning	The DiceFocalLoss refactor is unrelated to the linked macOS sampler failure in [`#5697`].	Either align the code change with [`#5697`] or relink the PR to the actual DiceFocalLoss bug it addresses.

✅ Passed checks (3 passed)

Check name	Status	Explanation
Title check	✅ Passed	It clearly states the main change: applying activation before removing the background channel.
Description check	✅ Passed	It covers the bug, fix, testing, and linked issue, though it doesn't use the template's exact sections.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

⚔️ Resolve merge conflicts

Resolve merge conflict in branch fix/dice-focal-loss-include-background

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands.}

coderabbitai

Actionable comments posted: 2

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

monai/losses/dice.py (1)

852-872: 🎯 Functional Correctness | 🟠 Major | ⚡ Quick win

Keep logits for FocalLoss.

input is converted to probabilities, then passed to self.focal(...). That breaks the documented logits contract for FocalLoss.

Proposed fix

+        dice_input = input
+        focal_input = input
+
         # Apply activation before removing background to ensure softmax/sigmoid works correctly
         if self.sigmoid:
-            input = torch.sigmoid(input)
+            dice_input = torch.sigmoid(dice_input)
         elif self.softmax:
             if n_pred_ch == 1:
                 warnings.warn("single channel prediction, `softmax=True` ignored.")
             else:
-                input = torch.softmax(input, 1)
+                dice_input = torch.softmax(dice_input, 1)
         elif self.other_act is not None:
-            input = self.other_act(input)
+            dice_input = self.other_act(dice_input)
 
         if not self.include_background:
             if n_pred_ch == 1:
                 warnings.warn("single channel prediction, `include_background=False` ignored.")
             else:
                 # if skipping background, removing first channel
                 target = target[:, 1:]
-                input = input[:, 1:]
+                dice_input = dice_input[:, 1:]
+                focal_input = focal_input[:, 1:]
 
-        dice_loss = self.dice(input, target)
-        focal_loss = self.focal(input, target)
+        dice_loss = self.dice(dice_input, target)
+        focal_loss = self.focal(focal_input, target)

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@monai/losses/dice.py` around lines 852 - 872, Keep logits intact for the
focal path in the dice loss implementation: in the section that applies
activation and then calls self.dice(...) and self.focal(...), avoid reusing the
activated input for FocalLoss. Compute the dice input with the existing
sigmoid/softmax/other_act handling, but pass the original logits (or a separate
untouched tensor) into self.focal so the FocalLoss contract remains correct. Use
the dice() and focal() calls in dice.py to locate the change.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@monai/losses/dice.py`:
- Line 857: The warning emitted in the single-channel prediction branch of
`DiceLoss` needs an explicit stack level so the caller sees the correct source
location. Update the `warnings.warn` call in `dice.py` to pass `stacklevel=2`,
keeping the existing message intact and making sure the change is applied in the
`DiceLoss` logic where `softmax=True` is ignored.
- Around line 825-827: DiceLoss is no longer enforcing mutually exclusive
activation settings, so invalid configs like sigmoid=True and softmax=True can
slip through when the activation path is skipped. Restore the exclusivity
validation in DiceLoss initialization or setup logic by checking the sigmoid,
softmax, and other_act flags together and raising an error when more than one
activation is enabled; use the DiceLoss constructor/validation flow to locate
and fix this.

---

Outside diff comments:
In `@monai/losses/dice.py`:
- Around line 852-872: Keep logits intact for the focal path in the dice loss
implementation: in the section that applies activation and then calls
self.dice(...) and self.focal(...), avoid reusing the activated input for
FocalLoss. Compute the dice input with the existing sigmoid/softmax/other_act
handling, but pass the original logits (or a separate untouched tensor) into
self.focal so the FocalLoss contract remains correct. Use the dice() and focal()
calls in dice.py to locate the change.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: 6c110aae-3952-4268-976c-23e1523294ce

📥 Commits

Reviewing files that changed from the base of the PR and between 557ffaa and 773f5fc.

📒 Files selected for processing (1)

monai/losses/dice.py

coderabbitai · 2026-06-24T20:16:15Z

+        self.sigmoid = sigmoid
+        self.softmax = softmax
+        self.other_act = other_act


🎯 Functional Correctness | 🟠 Major | ⚡ Quick win

Preserve activation exclusivity validation.

With DiceLoss activation disabled, configs like sigmoid=True, softmax=True are now silently accepted and resolved by if/elif.

Proposed fix

self.to_onehot_y = to_onehot_y self.include_background = include_background + if int(sigmoid) + int(softmax) + int(other_act is not None) > 1: + raise ValueError("Only one of sigmoid=True, softmax=True, or other_act may be specified.") self.sigmoid = sigmoid self.softmax = softmax self.other_act = other_act

As per path instructions, "Examine code for logical error or inconsistencies".

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

self.sigmoid = sigmoid

self.softmax = softmax

self.other_act = other_act

self.to_onehot_y = to_onehot_y

self.include_background = include_background

if int(sigmoid) + int(softmax) + int(other_act is not None) > 1:

raise ValueError("Only one of sigmoid=True, softmax=True, or other_act may be specified.")

self.sigmoid = sigmoid

self.softmax = softmax

self.other_act = other_act

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@monai/losses/dice.py` around lines 825 - 827, DiceLoss is no longer enforcing mutually exclusive activation settings, so invalid configs like sigmoid=True and softmax=True can slip through when the activation path is skipped. Restore the exclusivity validation in DiceLoss initialization or setup logic by checking the sigmoid, softmax, and other_act flags together and raising an error when more than one activation is enabled; use the DiceLoss constructor/validation flow to locate and fix this.

Source: Path instructions

coderabbitai · 2026-06-24T20:16:15Z

+            input = torch.sigmoid(input)
+        elif self.softmax:
+            if n_pred_ch == 1:
+                warnings.warn("single channel prediction, `softmax=True` ignored.")


📐 Maintainability & Code Quality | 🟡 Minor | ⚡ Quick win

Set stacklevel on the new warning.

Ruff flags this warnings.warn call; use stacklevel=2 so callers see their call site.

Proposed fix

- warnings.warn("single channel prediction, `softmax=True` ignored.") + warnings.warn("single channel prediction, `softmax=True` ignored.", stacklevel=2)

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

warnings.warn("single channel prediction, `softmax=True` ignored.")

warnings.warn("single channel prediction, `softmax=True` ignored.", stacklevel=2)

🧰 Tools

🪛 Ruff (0.15.18)

[warning] 857-857: No explicit stacklevel keyword argument found

Set stacklevel=2

(B028)

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@monai/losses/dice.py` at line 857, The warning emitted in the single-channel prediction branch of `DiceLoss` needs an explicit stack level so the caller sees the correct source location. Update the `warnings.warn` call in `dice.py` to pass `stacklevel=2`, keeping the existing message intact and making sure the change is applied in the `DiceLoss` logic where `softmax=True` is ignored.

Source: Linters/SAST tools

luomi16 requested review from KumoLiu, Nic-Ma and ericspod as code owners June 24, 2026 20:11

coderabbitai Bot reviewed Jun 24, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix DiceFocalLoss to apply activation before removing background#8947

Fix DiceFocalLoss to apply activation before removing background#8947
luomi16 wants to merge 1 commit into
Project-MONAI:devfrom
luomi16:fix/dice-focal-loss-include-background

luomi16 commented Jun 24, 2026

Uh oh!

coderabbitai Bot commented Jun 24, 2026 •

edited

Loading

Walkthrough

Estimated code review effort

❌ Failed checks (2 warnings)

Uh oh!

coderabbitai Bot left a comment

Uh oh!

coderabbitai Bot Jun 24, 2026

Uh oh!

coderabbitai Bot Jun 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	warnings.warn("single channel prediction, `softmax=True` ignored.")
	warnings.warn("single channel prediction, `softmax=True` ignored.", stacklevel=2)

Uh oh!

Conversation

luomi16 commented Jun 24, 2026

Summary

Problem

Solution

Changes made:

Testing

Related Issue

Uh oh!

coderabbitai Bot commented Jun 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Estimated code review effort

❌ Failed checks (2 warnings)

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 24, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 24, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

coderabbitai Bot commented Jun 24, 2026 •

edited

Loading