Fix intensity normalizations #981

qin-yu · 2024-07-25T13:54:07Z

During the process of packaging the custom Cellpose model for bioimage.io, I identified two potential issues with the intensity normalization implementation in Cellpose. This PR proposes fixes for:

* The current normalization with the `lowhigh` argument is not channel-wise. In this PR, `lowhigh` takes either a single 2-tuple `(low, high)` applied to all channels, or an `nchan`-tuple of such 2-tuples, one per channel.
* The current invert mechanism does not necessarily invert the channel to be segmented (`--chan`); instead, it inverts the last channel. Depending on the `--chan2` value, this might not be the intended channel. In this PR, all channels are inverted if `invert` is set.

Original PR message on 25 Jul 2024

Original PR

Hi @carsen-stringer!

During the process of packaging the custom Cellpose model for bioimage.io, I identified two potential issues with the intensity normalization implementation in Cellpose. This PR proposes fixes for:

* The current min-max normalization with the `lowhigh` argument is not channel-wise, and the argument itself might be misused.
* The current invert mechanism does not necessarily invert the channel to be segmented (`--chan`); instead, it inverts the last channel. Depending on the `--chan2` value, this might not be the intended channel.

Please review these changes as I might have misunderstood some parts.

Fix for Min-Max Normalization by `31bc5be`

For cellpose.transforms.normalize_img() (channel-wise normalization), the lowhigh option is provided. I assume this corresponds to min-max normalization where lowhigh is a tuple representing (new minimum, new maximum).

cellpose/cellpose/transforms.py

Lines 600 to 601 in 84344b0

    lowhigh (tuple, optional): The lower and upper bounds for normalization. If provided, it should be a tuple
        of two values. Defaults to None.

If lowhigh represents (old minimum, old maximum), then normalization is not channel-wise if lowhigh has a length of 2:

cellpose/cellpose/transforms.py

Lines 624 to 626 in 84344b0

    if lowhigh is not None:
        assert len(lowhigh) == 2
        assert lowhigh[1] > lowhigh[0]

In either case, the implementation needs fixing:

cellpose/cellpose/transforms.py

Lines 640 to 643 in 84344b0

    if lowhigh is not None:
        for c in range(nchan):
            img_norm[...,
                     c] = (img_norm[..., c] - lowhigh[0]) / (lowhigh[1] - lowhigh[0])

The current implementation is essentially: img_norm = (img_norm - lowhigh[0]) / (lowhigh[1] - lowhigh[0])

This has been fixed by commit 31bc5be.
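The equivalence claimed above can be checked with a small NumPy sketch. The array, its shape, and the `lowhigh` values are made up for illustration; only the loop body is taken from the quoted lines.

```python
import numpy as np

# Hypothetical 2-channel image whose channels have very different ranges.
rng = np.random.default_rng(0)
img = np.stack(
    [rng.uniform(0, 10, (4, 4)), rng.uniform(100, 1000, (4, 4))], axis=-1
)
lowhigh = (100.0, 1100.0)  # illustrative values
nchan = img.shape[-1]

# Per-channel loop as quoted from transforms.py at 84344b0.
looped = img.copy()
for c in range(nchan):
    looped[..., c] = (looped[..., c] - lowhigh[0]) / (lowhigh[1] - lowhigh[0])

# One rescale over the whole array, with no channel loop.
global_rescale = (img - lowhigh[0]) / (lowhigh[1] - lowhigh[0])

# Every channel gets the same (low, high), so the two results coincide.
same = np.allclose(looped, global_rescale)
```

Because the same pair is applied to every channel, the loop adds nothing over the single expression.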

Fix for Invert by `91469d1`

For cellpose.transforms.normalize_img() (channel-wise normalization), there is an invert option for cases where "cells are dark instead of bright":

cellpose/cellpose/transforms.py

Lines 598 to 599 in 84344b0

    invert (bool, optional): Whether to invert the image. Useful if cells are dark instead of bright.
        Defaults to False.

In Cellpose, the invert function is performed after reshape(), which moves the main channel to channel 0 and the auxiliary channel to channel 1. Therefore, the primary use case is to invert the first channel (channel 0) instead of the last one (current implementation):

cellpose/cellpose/transforms.py

Lines 666 to 667 in 84344b0

    if (tile_norm_blocksize > 0 or normalize) and invert:
        img_norm[..., c] = -1 * img_norm[..., c] + 1

The variable c retains the value from a previous loop, being 1 if nchan is 2 or 2 if nchan is 3.

This has been fixed by commit 91469d1, but further improvements can be made to allow for a user-specified list of channels to be inverted.
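The stale loop variable is ordinary Python behaviour: a `for` loop leaves its variable bound to the last value after the loop ends. The sketch below reproduces the effect on a made-up 3-channel array; the empty loop is a stand-in for the earlier per-channel normalization loop, not Cellpose code.

```python
import numpy as np

# Hypothetical 3-channel image, already normalized to [0, 1].
img_norm = np.full((2, 2, 3), 0.25)
nchan = img_norm.shape[-1]

for c in range(nchan):
    pass  # stand-in for the earlier per-channel normalization loop

# After the loop, c is still bound to the last index (nchan - 1).
buggy = img_norm.copy()
buggy[..., c] = -1 * buggy[..., c] + 1  # inverts only the last channel

# Inverting channel 0 explicitly instead of relying on the leftover c.
first_only = img_norm.copy()
first_only[..., 0] = 1 - first_only[..., 0]
```

Here `c` ends up as 2, so `buggy` has only its last channel flipped from 0.25 to 0.75, while `first_only` flips channel 0.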

1. Make it actually channel-wise;
2. Before, it should take `(old_min, old_max)` of the input image (otherwise it doesn't make sense), but now it takes `(new_min, new_max)` of the expected normalized output.

Before this fix, if `lowhigh` is provided, channel-wise linear normalization is performed by:

```python
for c in ...:
    img_norm[..., c] = (img_norm[..., c] - lowhigh[0]) / (lowhigh[1] - lowhigh[0])
```

which has a redundant `[..., c]` and is not "channel-wise", i.e. it is equivalent to

```python
img_norm = (img_norm - lowhigh[0]) / (lowhigh[1] - lowhigh[0])
```

This parameter, `lowhigh`, is never used in Cellpose itself, however.

Previously, only the last channel was inverted. Now, channel 0 is inverted, which typically holds the cells to be segmented. This covers the common use case where the biological structure of interest is in the first channel. Future improvements could allow for user-specified channels.

carsen-stringer · 2024-09-07T14:19:15Z

thanks @qin-yu ! the lowhigh is meant to represent the values to set to 0 and 1 (e.g. pixel value 100->0 and 1100->1 corresponds to lowhigh = [100,1100]). I think the fix here would be to check if lowhigh is a list of lists (per channel) or a single list (for all channels). It's a good point that this use-case isn't covered currently.

for the inversion, thanks for catching that, I think it would make sense to apply for all channels?
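One way to read the `lowhigh` suggestion above is to accept either a single `(low, high)` pair or one pair per channel, and expand both forms to a per-channel table before rescaling. The following is a minimal NumPy sketch under that assumption; `expand_lowhigh` and `rescale` are hypothetical names, not Cellpose functions, and this is not necessarily how the merged code handles it.

```python
import numpy as np

def expand_lowhigh(lowhigh, nchan):
    """Return an (nchan, 2) array of (low, high) pairs (hypothetical helper)."""
    pairs = np.asarray(lowhigh, dtype=float)
    if pairs.shape == (2,):
        # A single pair is shared by all channels.
        pairs = np.tile(pairs, (nchan, 1))
    elif pairs.shape != (nchan, 2):
        raise ValueError("lowhigh must be a 2-tuple or an nchan-tuple of 2-tuples")
    if np.any(pairs[:, 1] <= pairs[:, 0]):
        raise ValueError("each high must be greater than its low")
    return pairs

def rescale(img, lowhigh):
    """Map low -> 0 and high -> 1 per channel; channels are on the last axis."""
    pairs = expand_lowhigh(lowhigh, img.shape[-1])
    low, high = pairs[:, 0], pairs[:, 1]
    return (img - low) / (high - low)  # broadcasts over the channel axis

# Channel 0 holds the values 100 and 1100; channel 1 holds 0 and 50.
img = np.array([[[100.0, 0.0], [1100.0, 50.0]]])
same_for_all = rescale(img, (100, 1100))
per_channel = rescale(img, [(100, 1100), (0, 50)])
```

With `lowhigh = (100, 1100)`, pixel value 100 maps to 0 and 1100 maps to 1, matching the example in the comment; the per-channel form lets channel 1 use its own range.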

codecov · 2024-09-09T11:34:03Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 53.70%. Comparing base (19b65ce) to head (4df2731).
Report is 14 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #981      +/-   ##
==========================================
+ Coverage   52.76%   53.70%   +0.93%     
==========================================
  Files          18       18              
  Lines        4126     4132       +6     
==========================================
+ Hits         2177     2219      +42     
+ Misses       1949     1913      -36


qin-yu · 2024-09-09T15:36:14Z

> thanks @qin-yu ! the lowhigh is meant to represent the values to set to 0 and 1 (e.g. pixel value 100->0 and 1100->1 corresponds to lowhigh = [100,1100]). I think the fix here would be to check if lowhigh is a list of lists (per channel) or a single list (for all channels). It's a good point that this use-case isn't covered currently.

Thanks for explaining! That makes sense, working on it.

> for the inversion, thanks for catching that, I think it would make sense to apply for all channels?

Yes you are right, inverting all channels by default if invert == True seems natural to me. But maybe this could also be either a bool or tuple(bool, ...)?
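The "bool or tuple(bool, ...)" idea floated here could look roughly like the sketch below. It only illustrates the proposal; `invert_channels` is a hypothetical name and the function is not taken from Cellpose.

```python
import numpy as np

def invert_channels(img_norm, invert):
    """Invert selected channels of an image normalized to [0, 1].

    Hypothetical helper: `invert` is either one bool applied to all
    channels, or one bool per channel (channels on the last axis).
    """
    nchan = img_norm.shape[-1]
    flags = (invert,) * nchan if isinstance(invert, bool) else tuple(invert)
    if len(flags) != nchan:
        raise ValueError("invert must be a bool or have one entry per channel")
    out = img_norm.copy()
    for c, flag in enumerate(flags):
        if flag:
            out[..., c] = 1 - out[..., c]
    return out

img_norm = np.full((2, 2, 2), 0.25)  # made-up 2-channel image
all_inverted = invert_channels(img_norm, True)
first_inverted = invert_channels(img_norm, (True, False))
```

A plain `True` behaves like inverting every channel, so the simpler all-channels behaviour is a special case of this form.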

carsen-stringer · 2024-09-09T18:20:11Z

Thanks so much, regarding the invert function I think it's fine with all channels, I'm not sure how much it's being used with multichannel images

qin-yu · 2024-09-10T10:19:10Z

> Thanks so much, regarding the invert function I think it's fine with all channels, I'm not sure how much it's being used with multichannel images

You are right. Let's keep it simple

I have finalized the PR, including two fixes along with the corresponding tests. Please take a look when you have a moment! @carsen-stringer

qin-yu · 2024-09-10T10:43:09Z

The failed check seems to be incidental and maybe unrelated to the PR.

qin-yu · 2024-09-10T12:30:07Z

All checks passed 🥳

qin-yu added 2 commits July 24, 2024 01:47

qin-yu mentioned this pull request Jul 25, 2024

[BUG] Invert after intensity normalisation #982

Closed

Merge branch 'MouseLand:main' into fix-intensity-normalization

a29f121

qin-yu added 3 commits September 10, 2024 03:16

refactor normalize_img(): channel-wise lowhigh, volume-wise invert

5eb40b7

* `lowhigh` could be either a 2-tuple or a `nchan`-tuple of 2-tuple
* `invert` inverts all channels

Note that `lowhigh` shouldn't be combined with pre-smoothing or -sharpening.

update tests for normalize_img() and format code

93b050e

* Tested the changes made in transform.py
* Removed unused import and formatted whitespace

Add tests to cover exception handling in normalize_img()

4df2731

carsen-stringer merged commit 385891c into MouseLand:main Sep 10, 2024
12 checks passed
