Feat/test devices #74

Oisin-M · 2025-11-06T17:39:22Z

Close #73
Close #65

Description

Test devices as well as backends. Uses eku refactor/backend_for_testing_only branch.

test every available device for each backend
migrate testing code to new eku structure
bugfixed mismatched device errors
bugfixed some non array-api compliant code

Issues to note, but finally decided not to tackle here:

for mps, need to convert to float32 inputs (float64 not supported). Could consider having a default dtype per device+backend
issues with mps device giving odd results
issues with tolerances after changing to float32
issues with nan locations after changing to float32

Other points:
For now, I decided to use directly the namespaces and not the backends (I added array-api-compat versions of xp.isclose and xp.allclose, but we may want to reconsider that later if we want to have e.g. default dtype per device handling)

Contributor Declaration

By opening this pull request, I affirm the following:

All authors agree to the Contributor License Agreement.
The code follows the project's coding standards.
I have performed self-review and added comments where needed.
I have added or updated tests to verify that my changes are effective and functional.
I have run all existing tests and confirmed they pass.

Oisin-M · 2025-11-07T09:42:54Z

Already fixed many failures, but here's a log of the remaining failing tests...

Testing locally, therefore for the following combinations

numpy, torch-cpu, torch-mps

Test status:

extreme

tests/extreme/test_extreme.py::test_highlevel_efi, ::test_efi_core for torch mps - no error, just results are totally off. Not solveable via tolerance changes e.g.

E       AssertionError: assert tensor(False, device='mps:0')
E        +  where tensor(False, device='mps:0') = isclose(tensor(-8.1199, device='mps:0'), tensor(-0.1838, device='mps:0'))

solar

tests/solar/test_solar.py::test_cos_solar_zenith_angle_1, ::test_cos_solar_zenith_angle_integrated, ::test_toa_incident_solar_radiation for torch mps with error TypeError: unsupported operand type(s) for *: 'Tensor' and 'Tensor'
tests/solar/test_solar.py::test_cos_solar_zenith_angle_integrated, ::test_toa_incident_solar_radiation for torch mps - no error, just results are off. Check if tolerance issue

vertical

All passing!

score

tests/score/test_score.py::test_crps_meteo_missing, ::test_crps_meteo for torch mps - no error, just results are off. Check if tolerance issue

stats

tests/stats/test_stats.py::test_nanaverage for torch mps - RuntimeError: Expected all tensors to be on the same device, but found at least two devices, mps:0 and cpu!
tests/stats/test_stats.py::test_quantiles_core, ::test_quantiles_nans for torch mps - IndexError: Dimension specified as -1 but tensor has no dimensions
tests/stats/test_stats.py::test_quantiles_core for torch mps - no error, just results are off. Check if tolerance issue

thermo

tests/thermo/test_thermo.py::test_wet_bulb_potential_temperature, ::test_wet_bulb_temperature, ::test_temperature_on_moist_adiabat, ::test_saturation_ept, ::test_relative_humidity_from_specific_humidity, ::test_specific_humidity_from_relative_humidity, ::test_saturation_specific_humidity_slope_number, ::test_saturation_specific_humidity_slope_1, ::test_saturation_mixing_ratio_slope_numbers, ::test_saturation_mixing_ratio_slope_1, ::test_saturation_vapour_pressure_slope, ::test_saturation_specific_humidity, ::test_saturation_mixing_ratio, ::test_saturation_vapour_pressure_1, ::test_saturation_vapour_pressure_2 for torch mps - RuntimeError: Expected all tensors to be on the same device, but found at least two devices, mps:0 and cpu!
tests/thermo/test_thermo.py::test_wet_bulb_temperature for all arrays and devices - issue with nan locations not matching anymore after converting to float32. Not solveable via tolerance changes

Oisin-M · 2025-11-07T12:47:59Z

The mps issues seem unrelated to the code and might be an issue we can't fix here. Since torch-cpu is passing, I have decided for now to ignore the issues with mps with a view to revisiting later. Will test cuda-based backend/device combos instead

…and torch

… cuda

Oisin-M · 2025-11-07T14:10:21Z

Tests are passing for all the following combos*

numpy-cpu
cupy-cuda:0
torch-cpu
torch-cuda:0

*with some exceptions that we skip for now

torch.histogramdd not supported for torch-cuda, only for cpu. Therefore, ignoring torch-cuda for tests/wind/test_wind.py::test_windrose_1.
cp.percentile does not match torch or numpy nan axis semantics, so ignoring cupy for test_quantiles_nans

Oisin-M added 6 commits November 6, 2025 18:36

feat: refactor tests to test device and use new eku format

2fa888e

feat: bugfix mismatched device arrays

5dd88f8

fix: float32 casting to avoid issues with mps device

d1f2337

fix: update tolerances so tests pass

636e92e

fix: do not use negative indices and switch to array-api compat flip

e8e2c0a

fix: update tolerances for float32 tests to pass

b5055b0

Oisin-M added 5 commits November 7, 2025 10:47

fix: device mismatch in stats/nanaverage

c8d9015

fix: device mismatch issue in solar

65415f7

fix: bugfix device handling in thermo

defa84e

chore: prefer zeros_like to zeros

ced3acb

fix: revert to float64 and old tolerances

e437691

Oisin-M marked this pull request as ready for review November 7, 2025 12:48

Oisin-M added 4 commits November 7, 2025 14:21

fix: device mismatch in stats xp.take index

eb5e3f7

fix: fix cupy indexing mismatch

ed0d29a

fix: avoid cupy nan quantile test due to different behaviour than np …

89667cf

…and torch

fix: skip test since histogram functionality not implemented on torch…

420e70f

… cuda

Oisin-M requested a review from sandorkertesz November 7, 2025 14:11

Oisin-M added 2 commits November 7, 2025 14:31

feat: add nightly ci gpu hpc

3e28c59

fix: use python3 module

a743aad

Oisin-M mentioned this pull request Nov 7, 2025

Cpf fails with torch due to slicing with negative strides #73

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feat/test devices #74

Feat/test devices #74

Uh oh!

Oisin-M commented Nov 6, 2025 •

edited

Loading

Uh oh!

Oisin-M commented Nov 7, 2025 •

edited

Loading

Uh oh!

Oisin-M commented Nov 7, 2025

Uh oh!

Oisin-M commented Nov 7, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Feat/test devices #74

Are you sure you want to change the base?

Feat/test devices #74

Uh oh!

Conversation

Oisin-M commented Nov 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Contributor Declaration

Uh oh!

Oisin-M commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Oisin-M commented Nov 7, 2025

Uh oh!

Oisin-M commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Oisin-M commented Nov 6, 2025 •

edited

Loading

Oisin-M commented Nov 7, 2025 •

edited

Loading

Oisin-M commented Nov 7, 2025 •

edited

Loading