(dev-validation)= # Validation Tasks should typically include validation of variables and/or timers. This validation is a critical part of running suites and comparing them to baselines. ## Validating variables against a baseline The easiest type of validation you can add is against a baseline if one is provided during setup (see {ref}`dev-polaris-setup` or {ref}`dev-polaris-suite`). To do this, simply add a list of variables in the keyword argument `validate_vars` to the :py:meth:`polaris.Step.add_output_file()` method. As an example: ```python from polaris import Step class Init(Step): def __init__(self, task): super().__init__(task=task, name='init') self.add_output_file('initial_state.nc', validate_vars=['temperature', 'salinity', 'layerThickness']) ``` Here, we add `initial_state.nc` as an output of the `init` step, and indicated that the variables `temperature`, `salinity`, and `layerThickness` should be compared against a baseline, if one is provided, after the step as run. ## Validating variables In addition to baseline validation, it is often useful to compare files between steps of a run. This is done by adding later step to perform the validation. This validation step will use the function {py:func}`polaris.validate.compare_variables()` to compare variables in a file with a given relative path (`filename1`) with the same variables in another file (`filename2`). As a compact example of creating a validate step for a restart run: ```python from polaris import Step from polaris.validate import compare_variables class Validate(Step): def __init__(self, task): super().__init__(task=task, name='validate') self.add_input_file(filename='output_full_run.nc', target=f'../full_run/output.nc') self.add_input_file(filename='output_restart_run.nc', target=f'../restart_run/output.nc') def run(self): super().run() variables = ['temperature', 'salinity', 'layerThickness', 'normalVelocity'] all_pass = compare_variables(variables, filename1='output_full_run.nc', filename2='output_restart_run.nc', logger=self.logger) if not all_pass: raise ValueError('Validation failed comparing outputs between ' 'full_run and restart_run') ``` The 2 files `../full_run/output.nc` and `../restart_run/output.nc` are symlinked locally and compared to make sure the variables `temperature`, `salinity`, `layerThickness`, and `normalVelocity` are identical between the two. By default, the output is "quiet". If you set `quiet=False`, typical output will look like this: ```none Beginning variable comparisons for all time levels of field 'temperature'. Note any time levels reported are 0-based. Pass thresholds are: L1: 0.00000000000000e+00 L2: 0.00000000000000e+00 L_Infinity: 0.00000000000000e+00 0: l1: 0.00000000000000e+00 l2: 0.00000000000000e+00 linf: 0.00000000000000e+00 1: l1: 0.00000000000000e+00 l2: 0.00000000000000e+00 linf: 0.00000000000000e+00 2: l1: 0.00000000000000e+00 l2: 0.00000000000000e+00 linf: 0.00000000000000e+00 ** PASS Comparison of temperature between /home/xylar/data/mpas/test_nightly_latest/ocean/baroclinic_channel/10km/threads_test/1thread/output.nc and /home/xylar/data/mpas/test_nightly_latest/ocean/baroclinic_channel/10km/threads_test/2thread/output.nc Beginning variable comparisons for all time levels of field 'salinity'. Note any time levels reported are 0-based. Pass thresholds are: L1: 0.00000000000000e+00 L2: 0.00000000000000e+00 L_Infinity: 0.00000000000000e+00 0: l1: 0.00000000000000e+00 l2: 0.00000000000000e+00 linf: 0.00000000000000e+00 1: l1: 0.00000000000000e+00 l2: 0.00000000000000e+00 linf: 0.00000000000000e+00 2: l1: 0.00000000000000e+00 l2: 0.00000000000000e+00 linf: 0.00000000000000e+00 ** PASS Comparison of salinity between /home/xylar/data/mpas/test_nightly_latest/ocean/baroclinic_channel/10km/threads_test/1thread/output.nc and /home/xylar/data/mpas/test_nightly_latest/ocean/baroclinic_channel/10km/threads_test/2thread/output.nc Beginning variable comparisons for all time levels of field 'layerThickness'. Note any time levels reported are 0-based. Pass thresholds are: L1: 0.00000000000000e+00 L2: 0.00000000000000e+00 L_Infinity: 0.00000000000000e+00 0: l1: 0.00000000000000e+00 l2: 0.00000000000000e+00 linf: 0.00000000000000e+00 1: l1: 0.00000000000000e+00 l2: 0.00000000000000e+00 linf: 0.00000000000000e+00 2: l1: 0.00000000000000e+00 l2: 0.00000000000000e+00 linf: 0.00000000000000e+00 ** PASS Comparison of layerThickness between /home/xylar/data/mpas/test_nightly_latest/ocean/baroclinic_channel/10km/threads_test/1thread/output.nc and /home/xylar/data/mpas/test_nightly_latest/ocean/baroclinic_channel/10km/threads_test/2thread/output.nc Beginning variable comparisons for all time levels of field 'normalVelocity'. Note any time levels reported are 0-based. Pass thresholds are: L1: 0.00000000000000e+00 L2: 0.00000000000000e+00 L_Infinity: 0.00000000000000e+00 0: l1: 0.00000000000000e+00 l2: 0.00000000000000e+00 linf: 0.00000000000000e+00 1: l1: 0.00000000000000e+00 l2: 0.00000000000000e+00 linf: 0.00000000000000e+00 2: l1: 0.00000000000000e+00 l2: 0.00000000000000e+00 linf: 0.00000000000000e+00 ** PASS Comparison of normalVelocity between /home/xylar/data/mpas/test_nightly_latest/ocean/baroclinic_channel/10km/threads_test/1thread/output.nc and /home/xylar/data/mpas/test_nightly_latest/ocean/baroclinic_channel/10km/threads_test/2thread/output.nc ``` If `quiet=True` (the default), there is only an indication that the comparison passed for each variable: ```none temperature Time index: 0, 1, 2 PASS /home/xylar/data/mpas/test_20210616/further_validation/ocean/baroclinic_channel/10km/threads_test/1thread/output.nc /home/xylar/data/mpas/test_20210616/further_validation/ocean/baroclinic_channel/10km/threads_test/2thread/output.nc salinity Time index: 0, 1, 2 PASS /home/xylar/data/mpas/test_20210616/further_validation/ocean/baroclinic_channel/10km/threads_test/1thread/output.nc /home/xylar/data/mpas/test_20210616/further_validation/ocean/baroclinic_channel/10km/threads_test/2thread/output.nc layerThickness Time index: 0, 1, 2 PASS /home/xylar/data/mpas/test_20210616/further_validation/ocean/baroclinic_channel/10km/threads_test/1thread/output.nc /home/xylar/data/mpas/test_20210616/further_validation/ocean/baroclinic_channel/10km/threads_test/2thread/output.nc normalVelocity Time index: 0, 1, 2 PASS /home/xylar/data/mpas/test_20210616/further_validation/ocean/baroclinic_channel/10km/threads_test/1thread/output.nc /home/xylar/data/mpas/test_20210616/further_validation/ocean/baroclinic_channel/10km/threads_test/2thread/output.nc temperature Time index: 0, 1, 2 PASS /home/xylar/data/mpas/test_20210616/further_validation/ocean/baroclinic_channel/10km/threads_test/1thread/output.nc /home/xylar/data/mpas/test_20210616/baseline/ocean/baroclinic_channel/10km/threads_test/1thread/output.nc salinity Time index: 0, 1, 2 PASS /home/xylar/data/mpas/test_20210616/further_validation/ocean/baroclinic_channel/10km/threads_test/1thread/output.nc /home/xylar/data/mpas/test_20210616/baseline/ocean/baroclinic_channel/10km/threads_test/1thread/output.nc layerThickness Time index: 0, 1, 2 PASS /home/xylar/data/mpas/test_20210616/further_validation/ocean/baroclinic_channel/10km/threads_test/1thread/output.nc /home/xylar/data/mpas/test_20210616/baseline/ocean/baroclinic_channel/10km/threads_test/1thread/output.nc normalVelocity Time index: 0, 1, 2 PASS /home/xylar/data/mpas/test_20210616/further_validation/ocean/baroclinic_channel/10km/threads_test/1thread/output.nc /home/xylar/data/mpas/test_20210616/baseline/ocean/baroclinic_channel/10km/threads_test/1thread/output.nc temperature Time index: 0, 1, 2 PASS /home/xylar/data/mpas/test_20210616/further_validation/ocean/baroclinic_channel/10km/threads_test/2thread/output.nc /home/xylar/data/mpas/test_20210616/baseline/ocean/baroclinic_channel/10km/threads_test/2thread/output.nc salinity Time index: 0, 1, 2 PASS /home/xylar/data/mpas/test_20210616/further_validation/ocean/baroclinic_channel/10km/threads_test/2thread/output.nc /home/xylar/data/mpas/test_20210616/baseline/ocean/baroclinic_channel/10km/threads_test/2thread/output.nc layerThickness Time index: 0, 1, 2 PASS /home/xylar/data/mpas/test_20210616/further_validation/ocean/baroclinic_channel/10km/threads_test/2thread/output.nc /home/xylar/data/mpas/test_20210616/baseline/ocean/baroclinic_channel/10km/threads_test/2thread/output.nc normalVelocity Time index: 0, 1, 2 PASS /home/xylar/data/mpas/test_20210616/further_validation/ocean/baroclinic_channel/10km/threads_test/2thread/output.nc /home/xylar/data/mpas/test_20210616/baseline/ocean/baroclinic_channel/10km/threads_test/2thread/output.nc ``` ## Norms In circumstance where you would like to allow comparison to pass with non-zero differences between variables, you can supply keyword arguments `l1_norm`, `l2_norm` and/or `linf_norm` to give the desired maximum values for these norms, above which the comparison will fail, raising a `ValueError`. If you do want certain norms checked, you can pass their value as `None`. If you want different nonzero norm values for different variables, the easiest solution is to call {py:func}`polaris.validate.compare_variables()` separately for each variable and with different norm values specified. You will need to "and" together the results from calling {py:func}`polaris.validate.compare_variables()`. When you specify a nonzero norm, you may want polaris to print the norm values it is using for comparison when the results are printed. To do so, use the optional `quiet=False` argument. ## Datasets In some cases, a comparison cannot be made directly between the datasets loaded from the two files to be compared. Instead, the datasets require manipulation for some reason. Currently, this is the case for datasets from the Omega model, which need to have their variables renamed to the MPAS-Ocean names for use in Polaris. The `ds1` and `ds2` keyword arguments are used to supply datasets corresponding to `filename1` and `filename2`, respectively, in such circumstances.