Sometimes, data files need to be updated or replaced to address problems discovered after they were initially published. As of April 2019, all files in the NA-CORDEX archive should have two global attributes, "version" and "tracking_id" that can be used to differentiate whether two files with the same name have different contents. Major version numbers track changes in the primary data in the file. Minor version numbers track changes in metadata or ancillary data. The tracking_id is a unique identifier (UUID) assigned to each file before publication.
When the CRCM5 dataset was first published, there was only one set of simulations, from Katja Winger at UQAM. The files were published with the name of the RCM given as "CRCM5". Subsequently, Sébastien Biner at OURANOS ran a second and complementary set of simulations using CRCM5. However, the configuration of CRCM5 is not the same between the two sets of simulations: the lake fractions differ and one (CRCM5-OUR) is nudged. (See RCM Characteristics for details.) Therefore, we refer to the two models as CRCM5-UQAM and CRCM5-OUR, distinguishing between them by the modeling center. We have republished the files originally named CRCM5 as CRCM5-UQAM.
Some simulations had to be re-run because of problems that were detected only after the data had been post-processed and published.
In all cases, the original simulations were retracted and replaced with output from the re-runs.
If there is doubt regarding whether a file is from the original run or the re-run, check the global attribute "version" in the netCDF headers; re-runs are verison 2 or higher.
Some errors in the post-processing workflow were discovered only after the data had been published. In these cases, we corrected the errors and republished the data. The following problems affected the actual data in the files (not just metadata or ancillary data) and resulted in a version number of 2 or higher.