SEAS5: the new ECMWF seasonal forecast system

Johnson, Stephanie J.; Stockdale, Timothy N.; Ferranti, Laura; Balmaseda, Magdalena A.; Molteni, Franco; Magnusson, Linus; Tietsche, Steffen; Decremer, Damien; Weisheimer, Antje; Balsamo, Gianpaolo; Keeley, Sarah P. E.; Mogensen, Kristian; Zuo, Hao; Monge-Sanz, Beatriz M.

doi:https://doi.org/10.5194/gmd-12-1087-2019

Articles | Volume 12, issue 3

https://doi.org/10.5194/gmd-12-1087-2019

Articles | Volume 12, issue 3

Model description paper

22 Mar 2019

Model description paper |

| 22 Mar 2019

SEAS5: the new ECMWF seasonal forecast system

Stephanie J. Johnson, Timothy N. Stockdale, Laura Ferranti, Magdalena A. Balmaseda, Franco Molteni, Linus Magnusson, Steffen Tietsche, Damien Decremer, Antje Weisheimer, Gianpaolo Balsamo, Sarah P. E. Keeley, Kristian Mogensen, Hao Zuo, and Beatriz M. Monge-Sanz

Abstract

In this paper we describe SEAS5, ECMWF's fifth generation seasonal forecast system, which became operational in November 2017. Compared to its predecessor, System 4, SEAS5 is a substantially changed forecast system. It includes upgraded versions of the atmosphere and ocean models at higher resolutions, and adds a prognostic sea-ice model. Here, we describe the configuration of SEAS5 and summarise the most noticeable results from a set of diagnostics including biases, variability, teleconnections and forecast skill.

An important improvement in SEAS5 is the reduction of the equatorial Pacific cold tongue bias, which is accompanied by a more realistic El Niño amplitude and an improvement in El Niño prediction skill over the central-west Pacific. Improvements in 2 m temperature skill are also clear over the tropical Pacific. Sea-surface temperature (SST) biases in the northern extratropics change due to increased ocean resolution, especially in regions associated with western boundary currents. The increased ocean resolution exposes a new problem in the northwest Atlantic, where SEAS5 fails to capture decadal variability of the North Atlantic subpolar gyre, resulting in a degradation of DJF 2 m temperature prediction skill in this region. The prognostic sea-ice model improves seasonal predictions of sea-ice cover, although some regions and seasons suffer from biases introduced by employing a fully dynamical model rather than the simple, empirical scheme used in System 4. There are also improvements in 2 m temperature skill in the vicinity of the Arctic sea-ice edge. Cold temperature biases in the troposphere improve, but increase at the tropopause. Biases in the extratropical jets are larger than in System 4: extratropical jets are too strong, and displaced northwards in JJA. In summary, development and added complexity since System 4 has ensured that SEAS5 is a state-of-the-art seasonal forecast system which continues to display a particular strength in the El Niño Southern Oscillation (ENSO) prediction.

How to cite

How to cite.

Dates

Received: 12 Sep 2018 – Discussion started: 01 Oct 2018 – Revised: 18 Dec 2018 – Accepted: 04 Jan 2019 – Published: 22 Mar 2019

1 Introduction

The European Centre for Medium-Range Weather Forecasts (ECMWF) has been running real-time seasonal forecast systems since 1997. The seasonal system has been upgraded at approximately 5-year intervals during this time. SEAS5, ECMWF's fifth generation seasonal forecast system, became operational in November 2017, replacing its predecessor System 4 (hereafter SEAS4; Molteni et al., 2011) which had been operational since 2011.

SEAS4 was a state-of-the-art seasonal forecast system, which maintained competitive performance over the 6 years it was operational. One particular feature was high El Niño Southern Oscillation (ENSO) forecast skill (Molteni et al., 2011). It also displayed good performance in the prediction of the stratosphere and quasi-biennial oscillation (QBO; e.g. Scaife et al., 2014). As with many other seasonal forecast systems, mid-latitude skill remained limited, although some skill was demonstrated in predicting southern European summer temperatures (Molteni et al., 2011) and the sign of the Arctic Oscillation in Northern Hemisphere winter (Stockdale et al., 2015). Measures of overall skill in SEAS4 showed progress over previous systems (Molteni et al., 2011; Weisheimer and Palmer, 2014).

SEAS5 benefits from recent developments in its component models and initial condition generation. The Integrated Forecast System (IFS) atmosphere model has improved since SEAS4 was implemented, especially in the representation of tropical convection (e.g. Bechtold et al., 2014), and there has been a substantial increase in horizontal resolution. The ocean model has also been upgraded with improved physics, increased horizontal and vertical resolution, and a corresponding ocean and sea-ice reanalysis with up-to-date reprocessed observational datasets. SEAS4 lacked a prognostic sea-ice model, which is now considered an important ingredient for seasonal forecasting and has been included in SEAS5.

The benefits and challenges of a seamless forecasting system have been well documented in the literature (e.g. Brown et al., 2012). Development of a new seasonal forecast model at ECMWF has always used a recent version of the medium-range weather forecast model, with components added as needed to allow forecasting of longer timescales. Some of the components originally developed for the seasonal forecast system have later been adopted in the medium-range forecast model, most notably an initialised ocean model (Janssen et al., 2013). Consequently, the fundamental differences between the seasonal and medium-range forecast configurations have reduced over time. This trend has continued with the introduction of SEAS5. The starting point for SEAS5 was the forecast model configuration used in the ECMWF's extended range ensemble forecast, which is targeted at forecasting the time range of 10 to 46 days. A few changes that were demonstrated to improve seasonal forecast skill were made to create the final SEAS5 configuration. Some of these changes have already been adopted by subsequent versions of the medium-range forecast systems, and in other cases the convergence is planned for the future.

The purpose of this paper is to document SEAS5 and outline its strengths and weaknesses compared to its predecessor SEAS4. Given the very large number of metrics, scores, processes, geographical regions and modes of variability that we assess when introducing a new system, it is not feasible to document all of them or expect that every single aspect of forecast performance be improved. However, it is important to present metrics that summarise performance and illustrate any changes in the characteristics of the forecast system. In Sect. 2, we describe SEAS5 including the forecast and re-forecast production (Sect. 2.1), the atmosphere and ocean model configurations (Sect. 2.2) and initial conditions for atmosphere and ocean (Sect. 2.3). Section 3 discusses the scope of our assessment and the statistical methods used in this paper. Section 4 uses diagnostics to describe SEAS5's mean state climatology and the inter-annual variability of processes such as ENSO. Section 5 presents verification of the global performance of the system. We summarise the results in Sect. 6.

2 Description of SEAS5

2.1 Re-forecast and forecast production

The “long-range” forecast consists of a 51-member ensemble initialised every month on the first day of the month (see Sect. 2.3), and integrated for 7 months. On each 1 February, 1 May, 1 August and 1 November, 15 of the 51 ensemble members are extended a further 6 months for a total forecast length of 13 months. These “annual-range” forecasts were designed primarily to give an outlook for ENSO.

To verify the system and calibrate the forecast, SEAS5 uses a set of retrospective seasonal forecasts for past dates that can be compared to the historical record. This set of re-forecasts (also sometimes known as hindcasts) start on the first of every month for years 1981 to 2016 and have 25 ensemble members. This is a substantial increase on the SEAS4 operational re-forecast set, which included 15 members initialised from 1981 to 2010. On 1 February, 1 May, 1 August and 1 November, 15 of the 25 SEAS5 re-forecast ensemble members are extended a further 6 months to provide a re-forecast set for the annual-range forecasts. The entire re-forecast set is used to verify the forecast system (see Sect. 3), but only a subset of this re-forecast data, from years 1993 to 2016, is used in the calculation of forecast anomalies. Using this more recent period avoids the long-term trend of climate change from overly affecting the forecast products, and also coincides with the calibration period used in the Copernicus Climate Change Service's multi-system seasonal forecast. SEAS5 became operational at the beginning of November 2017. In addition to the re-forecast set, 51-member forecasts were computed for all start dates in 2017 to allow assessment of SEAS5 on any initialisation date from 1 January 1981 to the current date.

2.2 Model configuration

Table 1 summarises the configuration of SEAS5 and compares it to SEAS4. SEAS5 uses updated versions of the atmosphere and ocean models and adds a new interactive sea-ice model, and each of these components are described in detail below.

Table 1Table comparing the configuration of SEAS4 and SEAS5. Abbreviations are defined in the text.

Download Print Version | Download XLSX

2.2.1 Atmosphere model and forcing

SEAS5 uses ECMWF's IFS atmosphere model cycle 43r1. A brief description of the parameterisations in the IFS is provided here, and the most significant changes between IFS cycle 36r4 (SEAS4) and 43r1 (SEAS5) are highlighted.

The radiation code is based on the Rapid Radiation Transfer Model (RRTM; Mlawer et al., 1997; Iacono et al., 2008). Cloud–radiation interactions are taken into account using the McICA (Monte Carlo Independent Column Approximation) method (Morcrette et al., 2008). For computational efficiency, the radiation calculations are only called every 3 h, which gives a poor representation of the diurnal cycle. In cycle 43r1, this is mitigated by approximate updating at higher time frequency, reducing biases in stratospheric temperature and errors in the diurnal cycle of near-surface temperature (Hogan and Bozzo, 2015; Hogan and Hirahara, 2016).

The parameterisation of convection is based on the mass-flux approach (Tiedtke, 1989; Bechtold et al., 2008). The convective parameterisation evolves with each cycle, and in SEAS5 it has a modified Convective Available Potential Energy (CAPE) closure leading to an improved diurnal cycle of convection (Bechtold et al., 2014) and a revised formulation of detrainment and convective momentum transport improving the tropical flow. The cloud and large-scale precipitation scheme (Tiedtke, 1993; Forbes et al., 2011; Forbes and Tompkins, 2011) has an improved representation of mixed-phase clouds in cycle 43r1 (Forbes and Ahlgrimm, 2014). In addition, there were numerous other improvements to the parameterisation of microphysics, particularly for warm-rain processes (Ahlgrimm and Forbes, 2014), but also ice-phase processes and ice supersaturation. The combination of changes in the cloud and convection schemes between SEAS4 and SEAS5 substantially reduces biases in tropical temperature throughout the troposphere, as will be seen in Sect. 4.2.

The orographic gravity wave drag is parameterised following Lott and Miller (1997) and Beljaars et al. (2004), and the non-orographic gravity wave drag parameterisation is as described in Orr et al. (2010). The turbulent mixing scheme follows the eddy-diffusivity mass-flux (EDMF) framework, with a K-diffusion turbulence closure and a mass-flux component to represent the non-local eddy fluxes in unstable boundary layers (Köhler et al., 2011). In cycle 43r1, the degree of turbulent mixing in stable conditions has been reduced to improve the representation of low-level jets. This change combined with an increase in the orographic drag led to a significantly better representation of the large-scale circulation (Sandu et al., 2014). The representation of near-surface winds was also improved by a revision of the roughness length (Sandu et al., 2011).

The surface-exchange parameterisation is based on a tiled approach (HTESSEL; Viterbo and Beljaars, 1995; Van den Hurk et al., 2000; Balsamo et al., 2009; Dutra et al., 2010 a; Boussetta et al., 2013) representing different sub-grid surface types for vegetation, bare soil, snow and open water. The hydrology for soil infiltration and run-off is described by Balsamo et al. (2009) and the representation of surface snow is described in Dutra et al. (2010 a). For cycle 43r1, a representation of inland-water bodies that can carry significant thermal storage and anomalies in the forecasts has been introduced (Mironov et al., 2010; Dutra et al., 2010 b; Balsamo et al., 2012). In cycle 43r1, the skin temperature for ocean points takes account of the cool skin effect and a diurnal warm layer effect (Zeng and Beljaars, 2005).

SEAS5 was developed following a “seamless” approach, so the atmospheric component of SEAS5 is nearly identical to the IFS cycle 43r1 configuration used for the ENS extended forecast (IFS, 2016), which was operational for medium- and extended-range forecasting from 22 November 2016 to 11 July 2017. The atmospheric model uses a two-time-level semi-Lagrangian scheme, with spectral horizontal resolution of T319 and a 20 min time step. The model physical parameterisations are calculated in physical space on a reduced O320 Gaussian grid, which has a grid spacing of approximately 36 km. There are 91 levels in the vertical, with a model top in the mesosphere at 0.01 hPa or around 80 km. The ECMWF wave model is used at 0.5^∘ resolution (IFS, 2016, Part VII) with the same time step as the atmosphere. One change to the cycle 43r1 model settings was introduced for SEAS5. In SEAS5 the tropical amplitude of the non-orographic gravity wave drag was considerably reduced compared to the default settings in cycle 43r1 in order to improve the modelling of the QBO and the climate mean stratospheric winds. The impact of this change is described in Sect. 4.4.

Greenhouse gas radiative forcing consists of a zonally averaged seasonally varying climatology derived from the Monitoring Atmospheric Composition and Climate reanalysis (MACC reanalysis; Inness et al., 2013) which is scaled to capture the long-term trend in greenhouse gas emissions using CMIP5 historical greenhouse gases from 1981 to 2000 and CMIP5 RCP 3-PD from 2000 on as in ERA5. A new prognostic ozone scheme (Monge-Sanz et al., 2011) replaces the scheme used in SEAS4 and the default 43r1 configuration (Cariolle and Déqué, 1986; Cariolle and Teyssèdre, 2007); but as part of the seamless strategy used to develop SEAS5, prognostic ozone is not radiatively interactive as it was in SEAS4. Instead, the radiation scheme sees the same ozone climatology used in the cycle 43r1 ENS extended forecasts. Tropospheric sulfate aerosol follows the decadally varying CMIP5 climatology, rather than the time-invariant climatology that is default in cycle 43r1. Volcanic stratospheric sulfate aerosol is still treated by the method used for SEAS4; the initial load of volcanic aerosol is prescribed using GISS data (2012 update¹). The forecast is initialised using the GISS values from the month before the forecast starts, and then evolved in time with damped persistence (timescale 400 days). The vertical distribution follows a prescribed profile that is dependent on the depth of the stratosphere. The horizontal distribution is approximated by three numbers: the Northern Hemisphere, tropical and Southern Hemisphere amounts. SEAS5 cannot predict volcanic eruptions; but after a major eruption occurs, manual estimates of the volcanic aerosol, based in part on the Copernicus Atmosphere Monitoring Service (CAMS) SO₂ analyses, could be included in future real-time forecasts. The new prognostic ozone scheme is used to determine the tropopause height for application of volcanic aerosol.

2.2.2 Ocean and cryosphere models

SEAS5 uses the Nucleus for European Modelling of the Ocean model (NEMO, Madec and the NEMO team, 2016) version 3.4.1 developed by the NEMO European consortium, which is an upgrade from the NEMO v3.0 model used in SEAS4. It contains upgrades to aspects of ocean-surface wave interaction (Breivik et al., 2015) originally introduced at ECMWF, including estimating momentum flux from the dissipation term (accounting for the intensity of breaking waves), accounting for the energy flux from breaking waves in surface boundary conditions of the turbulent kinetic energy equation (Craig and Banner, 1994), and introducing the Coriolis–Stokes forcing term in the momentum equation.

The ocean model horizontal resolution increases from ORCA1^∘ in SEAS4 to ORCA0.25^∘ (developed by the DRAKKAR international research network) in SEAS5, which improves the representation of sharp fronts and ocean transports in SEAS5. The number of ocean vertical levels increases from 42 to 75, including an increase from 5 to 18 levels in the uppermost 50 m of the ocean. This reduces the depth of the surface layer of the ocean model from 10 to 1 m, which improves the representation of the diurnal cycle of SSTs. The ocean model time step is 20 min.

The Louvain-la-Neuve sea-ice model version 2 (LIM2; Fichefet and Maqueda, 1997), developed at the Belgian Université catholique de Louvain, is added in SEAS5. Introducing a prognostic sea-ice model allows the sea-ice cover to respond to changes in the atmosphere and ocean states, enabling SEAS5 to provide seasonal outlooks of sea-ice cover. At the same time, prognostic sea ice has the potential to improve forecasts of the atmospheric state and circulation by virtue of improved surface fluxes of heat, moisture and momentum. LIM2 is part of the NEMO modelling framework and uses the same tripolar ORCA0.25^∘ grid as the ocean, but has an hourly time step. It is a dynamic–thermodynamic model with a single thickness category. The model is used within SEAS5 to simulate the evolution of the fractional ice cover (sea-ice concentration), and only this variable is coupled to the atmosphere surface scheme. LIM2 simulates the conductive heat flux within the ice based on two vertical layers in the ice with varying thickness and a single snow layer on top of the ice, which determines the basal ice growth rate during winter. The surface heat flux at the sea-ice–atmosphere interface, however, is determined by an ice conductive heat flux computed by the atmosphere model. This leads to thermodynamic inconsistencies at the surface, resulting in an overestimation of the basal ice growth rate in winter, as seen in Sect. 4.3. The model also does not simulate the formation or evolution of melt ponds, which is important for summer surface energy balance. Ice velocities are computed by solving an appropriate momentum balance equation using a viscous-plastic rheology; sea-ice velocities are important because they give rise to the transport of sea-ice properties by advection.

2.2.3 Coupling

Some of the model components are tightly coupled: the land component, being on the same grid as the atmosphere model and requiring only vertical physics, has always been embedded within the atmosphere model; the ocean and sea-ice components are also tightly coupled to each other. A coupling interface then computes exchanges of information between three distinct modules that use three different horizontal grids: the atmosphere and land, the ocean and sea ice, and the wave model. The atmosphere and wave models exchange fluxes of heat, momentum, freshwater and turbulent kinetic energy with the ocean and sea ice, while the ocean and sea-ice models communicate SST, surface currents and sea-ice concentration to the atmosphere and wave models. There is no coupling between land and ocean.

The coupling interface in SEAS5 is implemented as a single executable, whereas SEAS4 used the OASIS3 coupler (Valcke, 2013). Details on the single executable coupling interface can be found in Mogensen et al. (2012 b). As in SEAS4 (Molteni et al., 2011), a Gaussian method is used for interpolation between the atmosphere and ocean models in both directions, primarily due to the complexity of the ORCA0.25^∘ grid. The Gaussian method automatically accounts for the different coast lines of the atmosphere and ocean models – values at land points are never used in the coupling since these can be physically very different to conditions over water. The atmosphere and ocean are coupled hourly to allow the diurnal cycle to be resolved.

2.3 Model initialisation

Table 2 summarises the main datasets used to initialise SEAS5 and compares them to those used in SEAS4. The model used to calculate SEAS5 forecasts and re-forecasts is identical, but forecasts must be initialised differently from re-forecasts in order to make use of near-real-time observational data. Forecasts and re-forecasts should be initialised and calculated as similarly as possible to ensure accurate bias correction. We describe the initialisation of both re-forecasts and forecasts here, including any adjustments made to improve consistency between re-forecast and forecast initialisation.

Table 2Table summarising the initialisation of SEAS4 and SEAS5. Abbreviations are defined in the text.

Download Print Version | Download XLSX

2.3.1 Atmosphere and land

In SEAS5 re-forecasts (prior to 1 January 2017) the atmosphere is initialised from ERA-Interim (Dee et al., 2011). ERA-Interim analysis is not available in time for SEAS5 forecast initialisation, so forecasts (1 January 2017 and later) are initialised from ECMWF operational analyses instead.

The inter-annual variability in ozone in ERA-Interim is affected by changes in satellite instruments over time, and does not represent the true inter-annual variability in ozone in the atmosphere (Dee et al., 2011). Consequently, the prognostic ozone scheme in both re-forecasts and forecasts is initialised with a seasonally varying climatology produced by the ozone model (Monge-Sanz et al., 2011) within an integration where an enhanced vertical resolution version of the IFS (cycle 42r1, L137) is nudged to ERA-Interim vorticity (12 h timescale) and tropopause temperature (5-day timescale, which is needed to control biases in lower stratosphere temperature).

Land-surface initial conditions for the re-forecasts are generated by the cycle 43r1 version of the HTESSEL scheme run in offline mode for the re-forecast period at the same resolution as SEAS5. In offline mode, HTESSEL is forced with ERA-Interim (precipitation, solar radiation, near-surface temperature, winds and humidity) following the method described in Balsamo et al. (2015).

The land surface in SEAS5 forecasts is initialised from ECMWF operational analysis, which includes a dedicated land data assimilation as described in de Rosnay et al. (2014). The SEAS5 land initial conditions are then interpolated from the HRES O1280 grid onto the O320 SEAS5 grid. This interpolation can result in locally large differences compared to initial conditions prepared directly at the lower resolution. Consequently, a limiter is used to prevent the real-time land-surface values taking inconsistent values relative to those used in the re-forecasts. The limits are defined as the maximum and minimum values observed at that point and calendar date for the 36-year re-forecast period, plus an additional margin specified as a global constant for each field. For more details please refer to the SEAS5 user guide².

2.3.2 Ocean

SEAS5 ocean and sea-ice initial conditions for forecasts and re-forecasts are provided by the new operational ocean analysis system, OCEAN5 (Zuo et al., 2019), which is made up of the historical ocean reanalysis (ORAS5) and the daily real-time ocean analysis (OCEAN5-RT). OCEAN5 uses the same ocean and sea-ice model as the coupled forecasts in SEAS5. OCEAN5 is conducted with NEMOVAR (Mogensen et al., 2012 a) in its 3D-Var FGAT (First-guess at appropriate time) configuration. Compared to its predecessor ORAS4 (Balmaseda et al., 2013), OCEAN5 has higher resolution, updated data assimilation and observational datasets, and provides sea-ice initial conditions.

ORAS5 is based on Ocean Reanalysis Pilot 5 (ORAP5; see Zuo et al., 2017 b; Tietsche et al., 2017), but using updated observational datasets. The ocean in situ temperature and salinity comes from the recent quality-controlled EN4 (Good et al., 2013), which has higher vertical resolution and better spatial coverage than the previous version EN3. The altimeter sea-level data have also been updated to the latest version (AVISO DT2014, Pujol et al., 2016) from CMEMS (Copernicus Marine Environmental Monitoring Services). The underlying SST analysis before 2008 comes from the HadISSTv2 dataset (Titchner and Rayner, 2014), which was the historical SST dataset most consistent with the Operational Sea Surface Temperature and Sea Ice Analysis (OSTIA) SST product used in operations at ECMWF. The sea-ice concentration comes from ERA-40 before 1985 and from an OSTIA (Donlon et al., 2012) reprocessed product between 1985 and 2008. From 2008 onwards both SST and sea-ice concentration are given by the OSTIA operational product, which is the same as used in the ECMWF operational atmospheric analysis. Further details of the OCEAN5 configuration and its sensitivities are discussed in Zuo et al. (2019).

2.4 Ensemble generation

2.4.1 Initial condition perturbations

Initial condition perturbations are applied to atmosphere and ocean initial conditions to represent uncertainty in the initial state and increase ensemble spread. Ensemble member 0 is initialised from unperturbed atmospheric initial conditions; in other members all upper air fields and a limited set of land fields (soil moisture, soil temperature, snow, sea-ice temperature and skin temperature) are perturbed. As in the operational ENS, perturbations from an ensemble of data assimilations (EDA) and perturbations constructed from the leading singular vectors are applied (IFS, 2016, Part V). EDA perturbations are only available for the later years in the re-forecast set; so to preserve consistency across the hindcast set and forecasts, the EDA perturbations from 2015 were applied to the initial conditions for all forecast and re-forecast years. The EDA perturbations are new in SEAS5, while singular vector perturbations were also used in SEAS4 with settings from IFS cycle 36r4.

OCEAN5 contains a 5-member ensemble analysis. The perturbation scheme used to generate this ensemble consists of two distinct elements: perturbations to the assimilated observations, both at the surface and at depth, and perturbations to the surface forcing fields. The forcing perturbations used to generate the ocean re-analyses are monthly realisations of SST errors, wind stress, solar radiation and fresh water flux sampling analysis error, as described in Zuo et al. (2017 a). While monthly perturbations are used to create the analysis ensemble, pentad perturbations of SST from HadISSTv2 are used to further augment the number of initial ocean conditions. First, each SEAS5 ensemble member is assigned one of the OCEAN5 ensemble members: OCEAN5 member 0 for SEAS5 member 0 counting up to OCEAN5 member 4 for SEAS5 member 4, and starting again at OCEAN5 member 0 for SEAS5 member 5. Then further perturbations, drawn from the HadISSTv2 pentad analysis error repository and unique to each ensemble member (Zuo et al., 2017 a, Section 4), are applied to the upper 22 levels of the ocean temperature, decreasing with depth. This perturbation is not applied to ensemble member 0. The pentad perturbations applied to the forecast initial conditions sample the fast analysis error, while the monthly perturbations applied to the ocean re-analyses sample errors with longer (1-month) decorrelation timescales. There are several differences between ocean initial condition perturbations in SEAS4 and SEAS5, the main differences are in the perturbation repository and the introduction of two temporal decorrelation scales; for details, see Zuo et al. (2017 a).

2.4.2 Stochastic model perturbations

In addition to perturbing the initial conditions, perturbations to the atmospheric model are applied to represent uncertainty from missing or unresolved sub-grid-scale processes (e.g. convection, clouds, radiation, turbulence) which have to be parameterised (Palmer, 2012). ECMWF has been using stochastic parameterisation schemes to explicitly account for these uncertainties in its forecasting systems from the medium-range to seasonal forecasts for many years (Buizza et al., 1999; Palmer et al., 2009) and the schemes that are used in SEAS5 are identical to those used in the shorter forecast ranges in cycle 43r1 (see Leutbecher et al., 2017). The stochastically perturbed physical tendency (SPPT) scheme introduces flow-dependent multiplicative noise to the total tendencies of the prognostic variables temperature, horizontal wind and humidity at model levels. The noise has a spatial and temporal correlation structure with three distinct scales representing small-scale fast perturbations, large-scale slow perturbations and an intermediate scale. A tapering in the boundary layer and the upper-most model levels effectively switches off the SPPT perturbations in these regions. The version of SPPT used here is based on a mass, energy and moisture conservation fix that was originally developed by the EC-Earth consortium (see Lang et al., 2016). The stochastic kinetic energy backscatter (SKEB) scheme aims at improving the upscale energy cascade from the sub-grid scales to the resolved scales (Shutts, 2005), but has been found to have a smaller overall impact in the ECMWF system (Weisheimer et al., 2014). Both of these schemes were also used in SEAS4, with settings from IFS cycle 36r4. For details of the schemes and performances, see Palmer et al. (2009), Lang et al. (2016), Leutbecher et al. (2017) and Weisheimer et al. (2014). Stochastic perturbations from both SPPT and SKEB are applied to all ensemble members; SEAS5 does not have a control forecast.

3 Assessment scope and evaluation methods

In order to compare the SEAS5 skill with the previous operational system (SEAS4), we could work with the largest common period for which the re-forecasts from SEAS4 and SEAS5 are available (namely 1981 to 2010). Since a key component of the seasonal forecast skill is the ability to forecast ENSO, it is important to consider a long verification period to include sufficient numbers of ENSO events. To allow a longer verification period we have included the operational forecasts for SEAS4 for 2011 to 2016, giving an overall comparison period of 1981 to 2016. This choice is not perfect since there are inconsistencies in the land-surface initialisation between the SEAS4 re-forecasts and SEAS4 real-time forecasts. Comparison of the SEAS5 and SEAS4 score differences for 1981 to 2010 and 1981 to 2016 (not discussed in this paper) shows no sign of this slight inconsistency affecting the results presented here. Consequently, the assessment in this paper is based on this 36-year re-forecast period unless otherwise mentioned (see Sect. 3.2), which is consistent with the SEAS5 verification available on the ECMWF website³.

SEAS5 has an increased operational re-forecast ensemble size compared to SEAS4; however, the real-time ensemble size is the same in both systems. Since we are interested in the comparative skill of the real-time forecast system, throughout this article we compare the two forecast systems using the same ensemble size. Since the implementation of SEAS4, extra ensemble members have been added to quarterly re-forecast dates (February, May, August, November), allowing us to compare the 25-member SEAS5 re-forecast set to 25 ensemble members from SEAS4. When only 15 SEAS4 ensemble members are available, we compare them to the first 15 members from SEAS5.

Our assessment is performed on monthly means. “Forecast lead time” is defined here to be the months elapsed since forecast initialisation but prior to the month being discussed, while “forecast month” includes the month being discussed, one more than forecast lead. For example, if a forecast is initialised on 1 January, February has 1-month forecast lead time and is month 2 of the forecast.“Verification month” is defined as the calendar month that the forecast is issued for. Unless otherwise mentioned, diagnostics are seasonal means at 1-month lead time (i.e. a DJF SST map is from a 1 November start date), which corresponds to months 2 to 4 of the forecast.

3.1 Evaluation and verification metrics

The seasonal forecast performance has been evaluated using a wide range of deterministic and probabilistic scores. For ENSO forecasts and other SST statistics we use deterministic metrics such as anomaly correlation and root mean square error. For the skill of atmospheric variables we also use probabilistic metrics such as the continuous ranked probability score and reliability diagrams.

3.1.1 Anomaly correlation

Anomaly correlations are calculated in accordance with established practice for scoring ENSO forecasts. First, bias-corrected anomalies for each forecast date and lead time in the re-forecast dataset are created using cross validation (i.e. the bias correction is calculated only from other re-forecast years, not the one being bias corrected). Anomalies for tropical ocean indices are calculated with respect to a standard 30-year reference climate period, which is 1981 to 2010. All other anomalies are calculated with reference to the full validation period of 1981 to 2016. The correlation is then calculated between the ensemble mean forecast and observed anomaly time series. The cross-validation procedure affects the correlation negatively, leading in theory to a small but systematic underestimate of expected future forecast skill.

3.1.2 Amplitude ratio

The ratio of the forecast anomaly amplitudes to observed amplitudes is calculated from the cross-validated bias-corrected individual ensemble member anomalies, computed with respect to 1981 to 2010. The standard deviation of the forecast anomalies is calculated from the mean square amplitude of all ensemble members and all start years (for a given start month and lead time), and then compared with the standard deviation of observations.

3.1.3 Root mean square error

For tropical ocean and QBO indices, the root mean square error (RMSE) is calculated from the cross-validated bias-corrected ensemble mean of the forecasts.

3.1.4 CRPSS

The continuous ranked probability skill score (CRPSS; Wilks, 2011) is calculated for each variable's seasonal average at each grid point for each year of the whole re-forecast period. It follows that the CRPSS map is estimated over 36 independent events. A climatology computed over the 36-year re-forecast period is used as the reference forecast. Therefore the CRPSS gives an indication of the added value of a forecasting system over simply forecasting climatology: a value of 1 indicating perfect forecasts, 0 showing no improvement over climatology and negative values indicating a failing forecasting system. Significance testing for the CRPSS differences between SEAS5 and SEAS4 is evaluated at a 5 % significance level with a Z test on pairwise bootstrapped CRPSS differences. For this Gaussian-approximated bootstrap method, we resample the forecasts and ensemble members over 1000 repetitions (with replacement) to capture the uncertainty both in time and in the ensemble.

3.1.5 Reliability

Reliability diagrams are used to summarise whether the forecast probabilities agree with the observed frequency of occurrence of a binary event (e.g. temperature in the upper tercile). To create the reliability diagrams used in this paper, each forecast at every grid point within a selected region is binned into 1 of 26 bins (one more than the number of ensemble members) according to the forecasted likelihood of occurrence of the chosen event. This likelihood is then plotted against the frequency with which the event actually occurred for this subset of forecasts and grid points. In a perfectly reliable system, the forecast probability will equal the frequency of occurrence and the values for each bin will lie along a straight diagonal line in the reliability diagram. Uncertainties are computed by bootstrapped resampling over years and ensemble members.

3.2 Datasets

For most variables the ERA-Interim reanalysis was used for verification (Dee et al., 2011), which is also the atmosphere initialisation data for SEAS4 and SEAS5. To verify precipitation we use the Global Precipitation Climatology Project (GPCP) monthly precipitation analysis 2.2 (Adler et al., 2003). Since GPCP 2.2 data are not available for the whole re-forecast period, precipitation verification statistics are based on the 1981 to 2014 period.

The depth of the surface layer of the ocean model decreases from 10 m in SEAS4 to 1 m in SEAS5, which changes the depth that SST is calculated from. To ameliorate the impact of this difference on the SST biases, we initially compare SST maps in each system to the analysis it was initialised from, ORAS4 (Balmaseda et al., 2013) or ORAS5 (Zuo et al., 2019). Later, area-averaged SST indices are compared to the OI.v2 reanalysis (OIv2; Reynolds et al., 2002), or ERA-Interim reanalysis, to measure both systems against the same standard. As will be seen in Sect. 4, when averaging over large regions, consistent conclusions are reached regardless of which observational dataset is used.

ERA-Interim sea ice is not temporally consistent, and is not recommended as a sea-ice verification dataset. Instead, we use the EUMETSAT Ocean and Sea Ice Satellite Application Facilities' (OSI SAF) global sea-ice concentration climate data record (OSI-450)⁴. OSI-450 is the second major version of the OSI SAF Global Sea Ice Concentration Climate Data Record. The sea-ice concentration is computed from the SMMR (1979–1987), SSM/I (1987–2008) and SSMIS (2006–2015) instruments. The OSI-450 product is available from 1979 to 2015; but because of gaps in the satellite record, data are not available for every day. We have taken the choice that if five consecutive days of data are missing from any season, that season is left out of our evaluation of sea-ice concentration. Consequently, in JJA we exclude 1984, 1986 and 2016; in DJF we exclude 1986, 1987, 1990, 2015 and 2016; in MAM we exclude 1981, 1986 and 2016; and in SON we exclude 2016.

4 SEAS5 diagnostics: climate and inter-annual processes

In this section we use diagnostics of inter-annual processes to assess SEAS5 and compare it to SEAS4. We first discuss the tropics, with a focus on tropical SST variability (Sect. 4.1). Then we discuss the northern extratropics, with a particular focus on the North Atlantic SST (Sect. 4.2). Finally we discuss the impact of introducing the prognostic sea-ice model LIM2 (Sect. 4.3) and the representation of the stratosphere (Sect. 4.4), before going on to discuss the global verification of SEAS5 in the next section.

https://www.geosci-model-dev.net/12/1087/2019/gmd-12-1087-2019-f01

Figure 1DJF and JJA SST bias in the tropics at 1-month forecast lead for SEAS4 (a, b) and SEAS5 (c, d) compared to the analysis they were initialised from (ORAS4, ORAS5). The regions discussed in detail later in this section are outlined in grey here.

SEAS5: the new ECMWF seasonal forecast system

2.1 Re-forecast and forecast production

2.2 Model configuration

2.2.1 Atmosphere model and forcing

2.2.2 Ocean and cryosphere models

2.2.3 Coupling

2.3 Model initialisation

2.3.1 Atmosphere and land

2.3.2 Ocean

2.4 Ensemble generation

2.4.1 Initial condition perturbations

2.4.2 Stochastic model perturbations

3.1 Evaluation and verification metrics

3.1.1 Anomaly correlation

3.1.2 Amplitude ratio

3.1.3 Root mean square error

3.1.4 CRPSS

3.1.5 Reliability

3.2 Datasets

4.1 Tropics

4.2 Northern extratropics

4.3 Arctic

4.4 Stratosphere and QBO

5.1 Anomaly correlation

5.2 CRPSS

5.3 Reliability