Review docstring and possibly raise a userwarning for pearsonr based on preserve_dims=all #703

tennlee · 2024-09-27T01:27:45Z

Issue: pearsonr may be mathematically undefined when all dimensions are preserved, resulting in NaNs being returned. This isn't exactly wrong, but is perhaps not what a user would be expecting and could be made clearer

Solution: This should at least be documented, and possibly a user warning should also be raised in the code

See also: KGE uses pearsonr in its calculation so is also affected - it has this described in the docstring

nicholasloveday · 2024-10-14T02:22:42Z

I think a user warning is a good idea

nikeethr · 2025-03-14T11:57:18Z

NSE raises an error. Its not whether or not the computation is wrong - a generic tool like numpy can return NaNs when interacting with NaNs (i.e. when the input itself is invalid), but a score is not a generic tool - here the input argument itself is invalid for a score. (for this particular scenario, scores that need to have at least one dimension reduced, must have at least one dimension reduced - this is the case in NSE, and allowing for this computation - where this criteria (>=1 reduced dim) fails - makes the entire score meaningless and may obfuscate genuine computational errors).

NSE does raise a warning however for partially correct data - in the event that the parts that are correct can be used and the rest can be double checked.

In NSE in particular if I'm allowed to preserve all dimensions, I can't compute obs variance - it means I fundamentally don't understand how NSE works or something in my data pipeline is busted - this isn't a warning its a fundamental user input error. A warning is more acceptable if I did specify reduce dims, but I still had insufficient data to do the computation.

I think warnings should be used sparingly - e.g. for partially valid inputs like described earlier where the user may still be interested in proceeding with a computation. For data that is clearly or completely invalid, raising an error should be the default - they aren't road blocks, they just mean the result is incorrect or invalid and needs to be handled appropriately either by handling the exception itself or fixing the source of the error, because any further operation with this result without appropriate handling, is meaningless.

tennlee added documentation Improvements or additions to documentation enhancement New feature or request good first issue Good for newcomers labels Nov 22, 2024

nikeethr linked a pull request Mar 15, 2025 that will close this issue

(Requires design discussion) handling all NaNs - #703 #776

Draft

23 tasks

nikeethr linked a pull request Mar 24, 2025 that will close this issue

(Requires design discussion) handling all NaNs - #703 #776

Draft

23 tasks

nikeethr mentioned this issue May 15, 2025

Add a UserWarning (or maybe add an exception) for negative weights #829

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Review docstring and possibly raise a userwarning for pearsonr based on preserve_dims=all #703

Review docstring and possibly raise a userwarning for pearsonr based on preserve_dims=all #703

tennlee commented Sep 27, 2024

nicholasloveday commented Oct 14, 2024

Uh oh!

nikeethr commented Mar 14, 2025 •

edited

Loading

Uh oh!

Review docstring and possibly raise a userwarning for pearsonr based on preserve_dims=all #703

Review docstring and possibly raise a userwarning for pearsonr based on preserve_dims=all #703

Comments

tennlee commented Sep 27, 2024

nicholasloveday commented Oct 14, 2024

Uh oh!

nikeethr commented Mar 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

nikeethr commented Mar 14, 2025 •

edited

Loading