Modify Dice, Jaccard and Tversky losses #8138

zifuwanggg · 2024-10-10T19:55:15Z

Description

The Dice, Jaccard and Tversky losses in monai.losses.dice and monai.losses.tversky are modified based on JDTLoss and segmentation_models.pytorch.

In the original versions, when squared_pred=False, the loss functions are incompatible with soft labels. For example, with a ground truth value of 0.5 for a single pixel, the Dice loss is minimized when the predicted value is 1, which is clearly erroneous. To address this, the intersection term is rewritten as $\frac{|x|_p^p + |y|_p^p - |x-y|_p^p}{2}$. When $p$ is 2 (squared_pred=True), this reformulation becomes the classical inner product: $\langle x,y \rangle$. When $p$ is 1 (squared_pred=False), the reformulation has been proven to retain equivalence with the original versions when the ground truth is binary (i.e. one-hot hard labels). Moreover, since the new versions are minimized if and only if the prediction is identical to the ground truth, even when the ground truth include fractional numbers, they resolves the issue with soft labels [1, 2].

In summary, there are three scenarios:

[Scenario 1] $x$ is nonnegative and $y$ is binary: The new versions are the same as the original versions.
[Scenario 2] Both $x$ and $y$ are nonnegative: The new versions differ from the original versions. The new versions are minimized if and only if $x=y$, while the original versions may not, making them incorrect.
[Scenario 3] Either $x$ or $y$ is negative: The new versions differ from the original versions. The new versions are minimized if and only if $x=y$, while the original versions may not, making them incorrect.

Due to these differences, particularly in Scenarios 2 and 3, some tests fail with the new versions:

The target is non-binary: test_multi_scale
The input is negative: test_dice_loss, test_tversky_loss, test_generalized_dice_loss, test_masked_loss, test_seg_loss_integration

The failures in test_multi_scale are expected since the original versions are incorrectly defined for non-binary targets. Furthermore, because Dice, Jaccard, and Tversky losses are fundamentally defined over probabilities—which should be nonnegative—the new versions should not be tested against negative input or target values.

Example

import torch
import torch.linalg as LA
import torch.nn.functional as F

torch.manual_seed(0)

b, c, h, w = 4, 3, 32, 32
dims = (0, 2, 3)

pred = torch.rand(b, c, h, w).softmax(dim=1)
soft_label = torch.rand(b, c, h, w).softmax(dim=1)
hard_label = torch.randint(low=0, high=c, size=(b, h, w))
one_hot_label = F.one_hot(hard_label, c).permute(0, 3, 1, 2).float()

def dice_old(x, y, ord, dims):
    cardinality = LA.vector_norm(x, ord=ord, dim=dims) ** ord + LA.vector_norm(y, ord=ord, dim=dims) ** ord
    intersection = torch.sum(x * y, dim=dims)
    return 2 * intersection / cardinality

def dice_new(x, y, ord, dims):
    cardinality = LA.vector_norm(x, ord=ord, dim=dims) ** ord + LA.vector_norm(y, ord=ord, dim=dims) ** ord
    difference = LA.vector_norm(x - y, ord=ord, dim=dims) ** ord
    intersection = (cardinality - difference) / 2
    return 2 * intersection / cardinality

print(dice_old(pred, one_hot_label, 1, dims), dice_new(pred, one_hot_label, 1, dims))
print(dice_old(pred, soft_label, 1, dims), dice_new(pred, soft_label, 1, dims))
print(dice_old(pred, pred, 1, dims), dice_new(pred, pred, 1, dims))

print(dice_old(pred, one_hot_label, 2, dims), dice_new(pred, one_hot_label, 2, dims))
print(dice_old(pred, soft_label, 2, dims), dice_new(pred, soft_label, 2, dims))
print(dice_old(pred, pred, 2, dims), dice_new(pred, pred, 2, dims))

# tensor([0.3345, 0.3310, 0.3317]) tensor([0.3345, 0.3310, 0.3317])
# tensor([0.3321, 0.3333, 0.3350]) tensor([0.8680, 0.8690, 0.8700])
# tensor([0.3487, 0.3502, 0.3544]) tensor([1., 1., 1.])

# tensor([0.4921, 0.4904, 0.4935]) tensor([0.4921, 0.4904, 0.4935])
# tensor([0.9489, 0.9499, 0.9503]) tensor([0.9489, 0.9499, 0.9503])
# tensor([1., 1., 1.]) tensor([1., 1., 1.])

References

[1] Dice Semimetric Losses: Optimizing the Dice Score with Soft Labels. Zifu Wang, Teodora Popordanoska, Jeroen Bertels, Robin Lemmens, Matthew B. Blaschko. MICCAI 2023.

[2] Jaccard Metric Losses: Optimizing the Jaccard Index with Soft Labels. Zifu Wang, Xuefei Ning, Matthew B. Blaschko. NeurIPS 2023.

Types of changes

Non-breaking change (fix or new feature that would not break existing functionality).
Breaking change (fix or new feature that would cause existing functionality to change).
New tests added to cover the changes.
Integration tests passed locally by running ./runtests.sh -f -u --net --coverage.
Quick tests passed locally by running ./runtests.sh --quick --unittests --disttests.
In-line docstrings updated.
Documentation updated, tested make html command in the docs/ folder.

Signed-off-by: Zifu Wang <[email protected]>

ericspod · 2024-10-12T21:52:31Z

Hi @zifuwanggg thanks for the contribution. I have an issue with this change in that the behaviour of the losses is very different now as seen in the CICD errors. I would instead suggest adding new loss functions in a "soft_losses.py" file or something like that instead of changing existing losses. Other uses may rely on existing behaviour, and in situations where non-binarises values are accidentally used due to incorrect postprocessing there is less feedback about the problem.

zifuwanggg · 2024-10-14T21:27:06Z

Hi @ericspod, thank you for reviewing my code. While adding new loss functions as separate .py files could be a workaround, my concern is that this approach would lead to a lot of duplicated code, as the core differences are only in 2-3 lines.

Would it make sense to add an attribute to the existing loss classes and create a new helper function, so that the default behavior remains unchanged? Something like the following.

class DiceLoss(_Loss):
    def __init__(
        ...
        binary_label: bool = True,
    ):
        ...
        self.binary_label = binary_label

    def forward(...):
        ...
        f = compute_score(self.binary_label)
        ...


class GeneralizedDiceLoss(_Loss):
    def __init__(
        ...
        binary_label: bool = True,
    ):
        ...
        self.binary_label = binary_label
    
    def forward(...):
        ...
        f = compute_score(self.binary_label)
        ...


def compute_score(binary_label):
    if binary_label == True:
        ...
    else:
        ...

ericspod · 2024-10-18T11:11:15Z

Hi @ericspod, thank you for reviewing my code. While adding new loss functions as separate .py files could be a workaround, my concern is that this approach would lead to a lot of duplicated code, as the core differences are only in 2-3 lines.

Hi @zifuwanggg I appreciate wanting to reduce duplicate code, we have too much of that in these loss functions as it stands so yes adding more isn't great. I think we can try to parameterise the loss functions in some way, either a function as you suggest or some other way, so long as the default behaviour is preserved. If you want to have a go at refactoring to do that we can return to it, I think in the future we do need to refactor all these loss functions to reduce duplication anyway.

zifuwanggg · 2024-10-21T09:12:37Z

Hi @ericspod, I've created losses/utils.py and put a helper function that is shared by both dice.py and tversky.py.

Unit tests pass, but mypy tests fail. This seems related to #8149 and #8161.

Signed-off-by: Zifu Wang <[email protected]>

zifuwanggg · 2024-10-22T11:28:58Z

Hi @ericspod, all CICD tests pass. @KumoLiu, thanks for the commit.

ericspod

I have a minor comment about the check in the helper function being expensive to compute, but otherwise we do also need tests for soft labels to ensure that formulation of the losses works. I do want to get others to review this as well to be doubly sure the changes are compatible. Thanks again.

monai/losses/utils.py

monai/losses/dice.py

for more information, see https://pre-commit.ci

zifuwanggg · 2024-11-30T21:14:25Z

Hi @ericspod, sorry for the late response.

I remove the costly check and modify the description of soft_label as you suggested. I also add some test cases. When input is the same as target, the loss value becomes zero when soft_label=True.

Hi @KumoLiu @csudre @Nic-Ma, could you please kindly review the changes?

KumoLiu

Thanks for the contribution! Overall looks good to me.
Jus leave one comment and please sign off to fix the DCO error.
https://github.com/Project-MONAI/MONAI/pull/8138/checks?check_run_id=33736878625

monai/losses/utils.py

I, Zifu Wang <[email protected]>, hereby add my Signed-off-by to this commit: 3f74183 I, Zifu Wang <[email protected]>, hereby add my Signed-off-by to this commit: a778e58 I, Zifu Wang <[email protected]>, hereby add my Signed-off-by to this commit: aeef0af I, Zifu Wang <[email protected]>, hereby add my Signed-off-by to this commit: 58c5396 Signed-off-by: Zifu Wang <[email protected]>

KumoLiu · 2024-12-02T15:02:40Z

/build

Modify Dice, Jaccard and Tversky losses

087cf74

Signed-off-by: Zifu Wang <[email protected]>

ericspod requested review from Nic-Ma, KumoLiu and csudre October 12, 2024 21:52

zifuwanggg added 2 commits October 20, 2024 19:45

Merge remote-tracking branch 'upstream/dev' into 8094-modify-dice-loss

cfd2d1e

Add helper function

3f74183

KumoLiu and others added 4 commits October 22, 2024 15:40

Merge branch 'dev' into 8094-modify-dice-loss

ea8a240

Merge branch 'Project-MONAI:dev' into 8094-modify-dice-loss

8c3a746

Fix mypy error

3e4f714

Signed-off-by: Zifu Wang <[email protected]>

Fix mypy error

f3ab679

Signed-off-by: Zifu Wang <[email protected]>

ericspod reviewed Oct 24, 2024

View reviewed changes

monai/losses/utils.py Outdated Show resolved Hide resolved

monai/losses/dice.py Outdated Show resolved Hide resolved

zifuwanggg and others added 5 commits November 30, 2024 20:31

Merge branch 'dev' into 8094-modify-dice-loss

60c7b36

Add test cases

a778e58

Modify args description and remove check

aeef0af

[pre-commit.ci] auto fixes from pre-commit.com hooks

2f77b1f

for more information, see https://pre-commit.ci

Fix code format

58c5396

KumoLiu reviewed Dec 2, 2024

View reviewed changes

monai/losses/utils.py Show resolved Hide resolved

zifuwanggg and others added 3 commits December 2, 2024 11:15

Merge branch 'dev' into 8094-modify-dice-loss

f07925e

Merge branch 'dev' into 8094-modify-dice-loss

a6c7cf3

KumoLiu approved these changes Dec 3, 2024

View reviewed changes

KumoLiu merged commit 9808ce2 into Project-MONAI:dev Dec 3, 2024
28 checks passed

zifuwanggg deleted the 8094-modify-dice-loss branch December 3, 2024 10:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Modify Dice, Jaccard and Tversky losses #8138

Modify Dice, Jaccard and Tversky losses #8138

zifuwanggg commented Oct 10, 2024 •

edited

Loading

ericspod commented Oct 12, 2024

zifuwanggg commented Oct 14, 2024

ericspod commented Oct 18, 2024

zifuwanggg commented Oct 21, 2024 •

edited

Loading

zifuwanggg commented Oct 22, 2024

ericspod left a comment

zifuwanggg commented Nov 30, 2024 •

edited

Loading

KumoLiu left a comment

KumoLiu commented Dec 2, 2024

Modify Dice, Jaccard and Tversky losses #8138

Modify Dice, Jaccard and Tversky losses #8138

Conversation

zifuwanggg commented Oct 10, 2024 • edited Loading

Description

Example

References

Types of changes

ericspod commented Oct 12, 2024

zifuwanggg commented Oct 14, 2024

ericspod commented Oct 18, 2024

zifuwanggg commented Oct 21, 2024 • edited Loading

zifuwanggg commented Oct 22, 2024

ericspod left a comment

Choose a reason for hiding this comment

zifuwanggg commented Nov 30, 2024 • edited Loading

KumoLiu left a comment

Choose a reason for hiding this comment

KumoLiu commented Dec 2, 2024

zifuwanggg commented Oct 10, 2024 •

edited

Loading

zifuwanggg commented Oct 21, 2024 •

edited

Loading

zifuwanggg commented Nov 30, 2024 •

edited

Loading