[WIP] Updated land calibration pipeline #1210

ph-kev · 2025-07-07T19:42:57Z

This PR rewrites the land calibration pipeline to use the latest snowy land model, the new ObservationRecipe in ClimaCalibrate, and ClimaAnalysis for data preprocessing and transformations. With these additions, the pipeline should

be simpler to understand
offload the covariance matrix computation to ClimaCalibrate
be less error prone when adding priors, modifying or adding observations, changing the calibration configuration, etc.

However, the updated land calibration pipeline is brittle in several respects, and these issues also affect the other calibration pipelines. The issues specific to the land calibration pipeline are listed below.

Overwriting parameters is painful. There is no easy way to overwrite parameters; see the excerpt below from the current land calibration pipeline.

ClimaLand.jl/experiments/calibration/forward_model_land.jl

Lines 52 to 75 in ccc3115

    
    p_names = collect(keys(params))
    p_values = [params[name]["value"] for name in p_names]
    params = (; zip(Symbol.(p_names), p_values)...)
    (;
        #        pc,
        #        sc,
        #        K_sat_plant,
        #        a,
        #        h_leaf,
        #        α_snow,
        #        α_soil_dry_scaler,
        #        τ_leaf_scaler,
        #        α_leaf_scaler,
        #        α_soil_scaler,
        α_0,
        Δα,
        k,
        beta_snow,
        x0_snow,
        gamma_snow,
        beta_0,
        #        beta_min,
        z0_snow,
    ) = params
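
For illustration only, here is one possible way to avoid commenting names in and out of the destructuring: merge the calibrated values into a NamedTuple of defaults. This is a minimal sketch in plain Julia, not what the pipeline does. The function `overwrite_params`, the constant `DEFAULT_PARAMS`, and the default values are all hypothetical; the only thing taken from the excerpt above is the assumption that each parameter maps to a dictionary with a `"value"` entry.

```julia
# Hypothetical defaults: the names mirror the excerpt above, the values are made up.
const DEFAULT_PARAMS = (; α_0 = 0.6, Δα = 0.1, k = 2.0)

# Merge calibrated values into `defaults`. Assumes `params` maps each
# parameter name to a dictionary holding a "value" entry.
function overwrite_params(defaults::NamedTuple, params::AbstractDict)
    # Build a NamedTuple of overrides from the name => value pairs.
    overrides = (; (Symbol(name) => entry["value"] for (name, entry) in params)...)
    # Fail loudly on names that do not exist in the defaults (e.g. typos).
    unknown = [name for name in keys(overrides) if !haskey(defaults, name)]
    isempty(unknown) || error("Unknown parameters: $(join(unknown, ", "))")
    # Values in `overrides` take precedence over those in `defaults`.
    return merge(defaults, overrides)
end

calibrated = Dict("k" => Dict("value" => 3.5))
params = overwrite_params(DEFAULT_PARAMS, calibrated)
(; α_0, Δα, k) = params  # every name is always present, calibrated or not
```

With this shape, choosing which parameters to calibrate only changes the contents of the dictionary, and the destructuring stays fixed.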

Configuring the calibration is not straightforward. All the settings are centralized in a single file as functions, but making the user modify what those functions return does not seem ideal. Furthermore, it is not clear whether functions are the best way to pass that information to the worker processes.
Adding a variable is difficult and not obvious. To add a new variable, you need to modify three different files to specify what the simulation and observational data are, how they should be preprocessed (e.g. unit conversion, shifting dates), and which additional data transformations to apply (e.g. seasonal averages). This process is error prone, and the transformations can easily get out of sync.
Getting the land-sea mask. To get a land-sea mask, you need to go through ClimaCore and ClimaAnalysis. This is not intuitive and can easily lead to errors if the diagnostics change.
Mask-aware replace and flatten. In the calibration code, there is a hack that replaces all NaN values on land with the average non-NaN value on land. Furthermore, flatten removes all NaNs regardless of where they are. This means that any NaN on land can completely stop a calibration.
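
To make the last item concrete, the sketch below shows one reading of "mask-aware" replace and flatten on plain arrays. It is not the ClimaAnalysis API and not the code in this PR: `replace_nan_on_land` and `flatten_land` are hypothetical names, and the mask is assumed to be a Boolean array with `true` on land. The point is that flattening selects points by the mask rather than by dropping NaNs, so the output length depends only on the mask.

```julia
using Statistics: mean

# Replace NaNs on land with the mean of the non-NaN land values.
# Points outside the mask are left untouched.
function replace_nan_on_land(data::AbstractArray{<:Real}, land_mask::AbstractArray{Bool})
    size(data) == size(land_mask) ||
        throw(DimensionMismatch("data and land_mask must have the same size"))
    valid = filter(!isnan, data[land_mask])
    isempty(valid) && error("No non-NaN values on land to average")
    fill_value = mean(valid)
    out = float.(data)  # floating-point copy, so the input is not mutated
    out[land_mask .& isnan.(data)] .= fill_value
    return out
end

# Flatten by selecting land points only; the length is always count(land_mask).
flatten_land(data::AbstractArray, land_mask::AbstractArray{Bool}) = data[land_mask]

data = [1.0 NaN; NaN 3.0]
land_mask = [true true; false true]  # the (2, 1) entry is ocean

filled = replace_nan_on_land(data, land_mask)
flat = flatten_land(filled, land_mask)
```

Here the NaN over the ocean never reaches the flattened vector, and the NaN on land is filled before flattening, so the vector length stays equal to the number of land points.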

Update to packages

New release of ClimaAnalysis (needed for better masking functions)
New release of ClimaCalibrate (needed for updates to ObservationRecipes)
New release of EnsembleKalmanProcesses (needed for metadata for observations)

ph-kev added 3 commits July 3, 2025 16:36

Working land calibration pipeline

33991ce

Fix to land calibration

b36f741

Disable nan check [skip ci]

b6e3103

ph-kev marked this pull request as draft July 7, 2025 19:43

Working ClimaLand calibration [skip ci]

51a8e47

ph-kev force-pushed the kp/pipeline branch from f2b6bc7 to 51a8e47 Compare July 17, 2025 17:59
