Decadal evaluation of regional climate , air quality , and their interactions using WRF / Chem Version 3 . 6 . 1

Introduction Conclusions References Tables Figures

consistent annual biases exist for most of the years from 2001 to 2010.Ozone mixing ratios are slightly overpredicted at both urban and rural locations but underpredicted at rural locations.PM 2.5 concentrations are slightly overpredicted at rural sites, but slightly underpredicted at urban/suburban sites.In general, the model performs relatively well for chemical and meteorological variables, and not as well for aerosol-cloud-radiation variables.Cloud-aerosol variables including aerosol optical depth, cloud water path, cloud optical thickness, and cloud droplet number concentration are generally underpredicted on average across the continental US.Overpredictions of several cloud variables over eastern US result in underpredictions of radiation variables and overpredictions of shortwave and longwave cloud forcing which are important climate variables.
While the current performance is deemed to be acceptable, improvements to the biascorrection method for CESM downscaling and the model parameterizations of cloud 1 Introduction Regional atmospheric models have been developed and applied for high resolution climate, meteorology, and air quality modeling in the past few decades.Comparing to global models with a coarser domain resolution (Leung et al., 2003) those regional models have advantages over global models because they can more accurately represent mesoscale variability (Feser et al., 2011), and also better predict the local variability of concentrations of specific species such as black carbon and sulfate (Petikainen et al., 2012).General circulation models (GCMs) and global chemical transport models (GCTMs) are usually downscaled to regional meteorological models such as the Weather Research and Forecasting model (WRF) (Caldwell et al., 2009;Gao et al., 2012), regional climate models such as REMO-HAM (Petikainen et al., 2012), the regional modeling system known as Providing Regional Climates for Impacts Studies (PRECIS) (Jones et al., 2004;Fan et al., 2014), and a number of European models described in Jacob et al. (2007), as well as regional CTMs such as the Community Multiscale Air Quality Model (CMAQ) (Penrod et al., 2014;Xing et al., 2015).These regional models are used for climate/meteorology or air quality simulations.Some are applied for more than ten years (Caldwell et al., 2009;Warrach-Sagi et al., 2013;Xing et al., 2015).However these regional models either lack the detailed treatment of chemistry (e.g., in WRF), or use prescribed chemical concentrations (e.g., REMO-HAM uses monthly mean oxidant fields for several chemical species), or do not have online-coupled meteorology and chemistry (e.g., in CMAQ).In addition, the past regional model simulations and analyses have mainly focused on meteorological parameters such as surface temperature and precipitation, cloud variables such as net radiative cloud forcing, and chemical constituents such as ozone.Regional climate model simulations tend to focus on significant climatic events such as extreme tem-Figures

Back Close
Full peratures (very cold or very hot) (Dasari et al., 2014), heat waves, heavy precipitation, drought, and storms (Beniston et al., 2007), rather than the important air quality and climate interactions.In addition, the impacts of complex chemistry-aerosol-cloudradiation-climate feedbacks on future climate change remain uncertain, and these feedbacks are most accurately represented using online-coupled meteorology and chemistry models (Zhang, 2010;IPCC, 2013).An online-coupled meteorology and chemistry model, however, is more computationally expensive compared to an offline-coupled model (Grell et al., 2004), and thus requires significant computing resources for their long-term (a decade or longer) applications.With rapid increases in the availability of high performance computing resources on the petaflop scale, however, long term simulations using online-coupled models have become possible in recent years.
The online-coupled WRF model with Chemistry (WRF/Chem) has been updated with a suite of physical parameterizations from the Community Atmosphere Model version 5 (CAM5) (Neale et al., 2010) so that the physics in the global CAM5 model is consistent with the regional model for downscaling purposes (Ma et al., 2014).There are also limited applications of dynamical downscaling (Gao et al., 2013) under the new Intergovernmental Panel on Climate Change (IPCC) Fifth Assessment Report's Representative Concentration Pathway (RCP) scenarios (van Vuuren et al., 2011).Gao et al. (2013) applied dynamic downscaling to link the global-climate-chemistry model CAM-Chem with WRF and CMAQ using RCP 8.5 and RCP 4.5 emissions to study the impacts of climate change and emissions on ozone (O 3 ).Molders et al. (2014) downscaled the Community Earth System Model (CESM) (Hurrell et al., 2013) to drive the online-coupled WRF/Chem model over Southeast Alaska using RCP 4.5 emissions; however, their study did not address the feedback processes between chemistry and meteorology.This study evaluates the online-coupled regional WRF/Chem model, which takes into account gas and aerosol-phase chemistry, as well as aerosol direct and indirect effects.WRF/Chem is used to simulate the "current" climate scenario for 10 years, from 2001 to 2010 using the RCP 8. over continental US (CONUS) (He at al., 2015;Glotfelty et al., 2015) with a focus on air-quality and climate interactions.Both CESM and WRF/Chem include similar gasphase chemistry and aerosol treatments.To our best knowledge, this study is the first to report the WRF/Chem simulation, evaluation, and analyses over a period of 10 years (i.e., 2001-2010) to assess if the model is able to accurately simulate decadal long air quality and climatology by taking into account feedback processes between chemistry and meteorology.This study also assesses whether the RCP8.5 emissions for the 10 year period are robust enough to produce satisfactory performance against observations with WRF/Chem.
2 Model set-up and evaluation protocol

Model configurations and simulation design
The model used is the modified WRF/Chem v3.6.1 with updates similar to those implemented into WRF/Chem v3.4.1 as documented in Wang et al. (2015a).The main updates include the implementation of an extended Carbon Bond 2005 (CB05) of Yarwood et al. (2005) gas-phase mechanism with chlorine chemistry of Sarwar et al. (2007), which is coupled with the Modal for Aerosol Dynamics in Europe/Volatility Basis Set (MADE/VBS) (Ahmadov et al., 2012).MADE/VBS incorporates a modal aerosol size distribution, and includes an advanced secondary organic aerosol (SOA) treatment based on gas-particle partitioning and gas-phase oxidation in volatility bins.The CB05-MADE/VBS option has also been coupled to existing model treatments of various feedback processes such as the aerosol semi-direct effect on photolysis rates of major gases, and the aerosol indirect effect on cloud droplet number concentration (CDNC) and resulting impacts on shortwave radiation.The main physics and chemistry options used in this study as well as their corresponding references can be found in   Full Mexico, and a vertical resolution of 34 layers from the surface to 100 mb.Considering the decadal applications of WRF/Chem in this work which is much longer than many past WRF/Chem applications, the simulations are reinitialized monthly (rather than 1-4 days used in most past WRF/Chem applications to short-term episodes that are on an order of months up to 1 year, e.g., Zhang et al., 2012a, b;Yahya et al., 2014Yahya et al., , 2015b) ) to constrain meteorological fields toward National Centers for Environmental Prediction (NCEP) reanalysis data while allowing chemistry-meteorology feedbacks within the system.As discussed in Sects.3.1 and 3.3, the reinitialization frequency of 1 month may be too large to constain some of the meteorological fields such as moistures, which in turn affect other parameters, and a more frequent reinitialization may be needed to improve the model performance.The impact of the frequency of the reinitialization on simulated meteorological and cloud parameters will be further discussed in Sects.3.1 and 3.2.

Processing of emissions and initial conditions (ICs)/boundary conditions (BCs)
Global RCP emissions are available as monthly-average emissions for 2000, 2005, and for every 10 years between 2010 and 2100, at a grid resolution of 0.5×0.5 • (Moss et al., 2010;van Vuuren et al., 2011).The RCP emissions in 2000, 2005, and 2010 are used to cover the 10 year emissions needed for WRF/Chem simulations, i.e., the periods of 2001-2003, 2004-2006, and 2007-2010, respectively. Processing global RCP emissions in 2000, 2005, and 2010 into regional, hourly emissions needed for the 10 year WRF/Chem simulations requires essentially three main tasks.These include (1) mapping the RCP species to CB05 speciation used in WRF/Chem, (2) re-gridding the RCP emissions from 0.5 • ×0.5 • grid resolution to the 36 km×36 km grid resolution used for regional simulation over North America; and (3) applying species and location dependent temporal allocations (i.e., emissions variation over time) to the re-gridded RCP emissions.Table S1 shows the species mapping between RCP species and CB05 species.
To map the RCP species to CB05 speciation, some assumptions are made due the Introduction

Conclusions References
Tables Figures

Back Close
Full relatively detailed speciation required by CB05.Some of the CB05 species are directly available in RCP; however, others are lumped into RCP groups, for example, the "other alkanals" and "hexanes and higher alkanes" in the RCP groups can be considered to approximately represent the acetaldehyde and higher aldehydes emissions required by CB05, respectively (Table S1).For the CB05 species such as ethanol, methanol, internal and terminal olefin carbon bonds in the gas-phase, and elemental and organic carbon in the accumulation mode of the aerosol particles, other RCP groups are used to approximate these emissions (Table S1).(Pouliot et al., 2015).To re-grid the RCP emissions, the RCP rectilinear grid is first interpolated to a WRF/Chem curvilinear grid using a simple inverse distance weighting (NCAR Command Language Function -rgrid2rcm), and a subset of the RCP grid that covers the WRF/Chem CONUS domain is then extracted.
To derive a temporal allocation for monthly-averaged RCP emissions, hourly emissions profiles are taken from in-house WRF/Chem simulations over CONUS during 2001 (Yahya et al., 2015a), and 2006and 2010(Yahya et al., 2014, 2015b).For those existing in-house simulations, the emissions were generated with the Sparse Matrix Operator  , 2001-2003, 2004-2006, and 2007-2010, respectively represent the original monthly-averaged RCP emissions rates, the monthly-averaged WRF/Chem emissions rates, and the hourly WRF/Chem emission rates, respectively, which are valid at each model time t, layer z, and lat and lon grid points.The RCP elevated source emissions for sulfur dioxide (SO 2 ), sulfate (SO 2− 4 ), elemental carbon (EC) and organic carbon (OC) were also incorporated into the model-ready emissions for WRF/Chem using steps (1-3) and Eq.(1) above.Lastly, RCP aircraft source emissions for EC, nitric oxide (NO), and nitrogen dioxide (NO 2 ) are directly injected into the closest model layers.No temporal allocations are applied to the RCP aircraft source emissions.
Biogenic emissions are calculated online using the Model of Emissions of Gases and Aerosols from Nature version 2 (MEGAN2) (Guenther et al., 2006).Emissions from dust are based on the online Atmospheric and Environmental Research Inc. and Air Force Weather Agency (AER/AFWA) scheme (Jones and Creighton, 2011).Emissions from sea salt are generated based on the scheme of Gong et al. (1997).
The chemical and meteorological ICs/BCs come from the modified CESM/CAM5 version 1.2.2 with updates by He et al. (2014) and Glotfelty et al. (2015) developed at the North Carolina State University (CESM_NCSU).In addition to similar gas-phase bias correction technique based on Xu and Yang (2012).Temperature, water vapor, geopotential height, wind, and soil moisture variables from the NCEP Final Reanalyses (NCEP FNL) dataset, which is available every 6 h, are used to correct the ICs and BCs for WRF/Chem simulations generated by CESM_NCSU.The differences between the NCEP FNL data and the CESM_NCSU derived ICs/BCs are obtained from 2001 to 2010, and are then averaged to produce 10 year average differences of 6 hourly meteorological ICs/BCs.The 10 year average differences of 6 hourly meteorological ICs/BCs are used to correct CESM_NCSU meteorological ICs/BCs for each of the 10 years.This bias correction technique can also be applied to future year simulations where NCEP FNL data is not available.

Model evaluation protocol
The focus of the model evaluation is mainly to assess whether the model is able to adequately reproduce the spatial and temporal distributions of key meteorological and chemical variables as compared to observations on a climatological time scale.A scientific question to be addressed in this work is, is WRF/Chem sufficiently good for regional climate and air quality simulations on a decadal scale?A climatological month refers to the average of the month for all the 10 years.For example, January refers to the average for all the months of January from 2001 to 2010.Statistical evaluations such as mean bias (MB), Pearson's correlation coefficient (R), normalized mean bias (NMB), normalized mean error (NME) and Index of Agreement (IOA) ranging from 0 to 1 (Willmott et al., 1981) for major chemical and meteorological variables are included.
The spatial and temporal analyses include spatial plots of MB over CONUS, spatial overlay plots of averaged simulated and observational data, monthly climatologicallyaveraged time series of major meteorological and chemical variables, annual average time series; probability distributions of major meteorological and chemical variables, and spatial plots of major aerosol and cloud variables compared with satellite data.A summary of the observational data from surface networks and satellite retrievals can be found in Table S2.The variables that are analyzed in this study include O 3 , par-Introduction

Conclusions References
Tables Figures

Back Close
Full ticulate matter with diameter less than and equal to 2.5 and 10 µm (PM 2.5 and PM 10 , respectively), and PM 2.5 species including SO 2− 4 , ammonium (NH + 4 ), nitrate (NO − 3 ), EC, OC, and total carbon (TC = EC + OC), temperature at 2 m (T2), relative humidity at 2 m (RH2), and wind speed at 10 m (WS10), wind direction at 10 m (WD10), precipitation, aerosol optical depth (AOD), cloud fraction (CLDFRA), cloud water path (CWP), cloud optical thickness (COT), CDNC, cloud condensation nuclei (CCN), downward shortwave radiation (SWDOWN), net shortwave radiation (GSW), downward longwave radiation (GLW), outgoing longwave radiation at the top of atmosphere (OLR), and shortwave and longwave cloud forcing (SWCF and LWCF).While uncertainties exit in all the observational data used, systemetic uncertain analysis/quantification is beyond the scope of this work.In this work, all observational data are considered to be the true values in calculating the performance statistics.The information on the accuracy of most data used in the model evaluation has been provided in Table 2 of Zhang et al. (2012a).Uncertainties associated with some of the observational data are discussed in Sect.3.

Meteorological predictions
Table 2 summarizes the statistics for T2, RH2, WS10, WD10, and precipitation.The model performs very well for a 10 year average T2 with a slight underprediction (an MB of −0.(Pleim and Gilliam, 2009) or errors in the green vegetation fraction in the National Center for Environmental Prediction, Oregon State University, Air Force and Hydrologic Research Lab (NOAH) Land Surface Model (LSM) (Refslund et al., 2013).RH2 and WS10 are slightly overpredicted.Precipitation is largely overpredicted, consistent with overpredictions in precipitation from WRF and WRF/Chem simulations reported in literatures.For example, Caldwell et al. (2009) attributed the overprediction in precipitation to overprediction in precipitation intensity but underprediction in precipitation frequency.Otte et al. (2012) also reported that the precipitation predicted by WRF is too high compared to the North American Regional Reanalyses (NARR) data throughout the whole CONUS domain over a period of 1988-2007. Nasrollahi et al. (2012) ) examined 20 combinations of microphysics and cumulus parameterization schemes available in WRF and found that most parameterization schemes overestimate the amount of rainfall and the extent of high rainfall values.In this study, while Grell 3-D Ensemble cumulus parameterization contributes in part to the overpredictions of precipitation, most overpredictions occur at high thresholds as shown in Fig. 3d and they are attributed to possible errors in the Morrison two moment scheme because the overpredictions of non-convective precipitation dominate the overpredictions of total precipitation.Nudging and reinitialization have been most commonly used methods to control such errors.This work uses reinitialization but the frequency of reinitialization is monthly rather than every 1-4 days used in other studies (e.g., Zhang et al., 2012a, b;Yahya et al., 2014Yahya et al., , 2015b)), which led to a buildup of storm systems, especially over the warm Atlantic.This buildup in turn influences mainly non-convective precipitation over land, especially in the east coast.Simulations Introduction

Conclusions References
Tables Figures

Back Close
Full with a more frequent reinitialization tend to perform better than those with less frequent reinitialization (Lo et al., 2008).
Figure 1 shows the spatial distributions of MB for 10 year average predictions of T2, RH2, WS10, and precipitation.Figure 2 shows the time series of 10 year average monthly and annual average T2, WS10, RH2, precipitation, O 3 , and PM 2.5 against observational data and IOA statistics.T2 (Fig. 1a) tends to be underpredicted over eastern and western US and overpredicted over the central US.The bias correction method itself may also contribute to the slight biases in T2.A single temporally averaged (2001-2010) NCEP reanalysis file is applied to the 6 hourly BCs for each individual year, which would in some cases contribute to the biases in the climatological 10 year evaluation.T2 also tends to be underpredicted during the cooler months but overpredicted during the warmer months (Fig. 2a).While the bar charts in Fig. 2 show domain-average mean observed and mean simulated T2, IOA performance takes into account the proportion of differences between mean observed and mean simulated values at different sites.IOA can be calculated as, where O i and S i denote time-dependent observations and predictions at time and location i , respectively N is the number of samples (by time and/or location), O denotes mean observation and S denotes mean predictions over all time and locations, they can be calculated as: IOA values range from 0-1, with a value of 1 indicating a perfect agreement.Introduction

Conclusions References
Tables Figures

Back Close
Full The model performance in terms of IOA for T2 is slightly worse during the warmer months as compared to the cooler months; however, IOA values for all months are ≥ 0.9.The poorer IOA statistics for the warmer months are possibly influenced to a certain extent by the fact that the IOA tends to be more sensitive towards extreme values (when temperatures are maximum) due to the squared differences used in calculating IOA (Legates and McCabe, 1999).As shown in Figs.1b and 2b, the spatial distributions of MBs for RH2 follow closely the spatial distributions of MBs for T2, where T2 is underpredicted, RH2 is overpredicted and vice versa.Unlike T2, the IOA for RH2 is the highest during the warmer months and the lowest during the winter months, but IOA for RH2 is generally high (> 0.7) for all months.WS10 is also generally overpredicted along the coast, over eastern US and some portions over the western US (Fig. 1c), consistent with overpredictions of T2 over the coast, and partially due to unresolved topographical features.In this case the topographic correction for surface winds used to represent extra drag from sub-grid topography (Jimenez and Dudhia, 2012) is used as an option in the 10 year WRF/Chem simulations; however, WS10 is still overpredicted except for the areas of flat undulating land in the central US.Jimenez and Dudhia (2012) also suggested that the grid points nearest to the observational data might not be the most appropriate or most representative, and that the selection of nearby grid points can help to reduce errors in surface wind speed estimations.In this study, as the evaluation is conducted over the whole CONUS, the nearest grid points are used for evaluation, which could also result in errors in wind speed evaluation.The positive T2 and WS10 bias along the coast could be due to the fact that the model grids for temperatures and wind speeds are located over the ocean, however, the observation points are located slightly inland.As shown in Fig. 2, WS10 performs well on average for the months of April, May, and June, and is overpredicted for the other months.
Nonetheless the climatological NMB for WS10 overall is low at 7.7 % (Table 2).WS10 has higher IOA values during the spring months and the lowest IOA during the summer months and in November.The model performs relatively well in predicting WD10 variability with a Corr of 0.6, indicating overall a more southerly direction domain-wide Introduction

Conclusions References
Tables Figures

Back Close
Full predicted by the model compared to observations.Precipitation is overpredicted for all months except for June, especially during the summer months of July to August.Even with the inclusion of radiative feedback effects from the subgrid-scale clouds in the radiation calculations, precipitation is still overpredicted with the Grell 3-D scheme, which is consistent with the results shown by Alapaty et al. (2012).Precipitation mainly has lower IOAs during the summer compared to other months, except in June which actually exhibits the largest IOA of all months.Even though June is considered a summer month, it does not show overprediction in precipitation compared to the other summer months.It is possible that in June, the overall atmospheric moisture content is low.This is consistent with simulated RH2 as June is the only month where RH2 is underpredicted compared to observations.In general the model is able to reproduce the monthly trends in meteorological variables; for example, the predicted trend in T2 closely follows the observed trends by the National Climatic Data Center (NCDC).The observed RH2 decreases from January to a minimum in April, and then increases from April to December.Although the model predicts a similar pattern in RH2, there is a lag in the RH2 minimum occurring two months later in June (Fig. 2b).For WS10, the observation peaks in April, as compared to the simulated peak in March.The model correctly predicts the observed WS10 minimum occurring in August.The model trend in precipitation is similar to observations, except during the summer months of July through September, where a large overprediction leads to a sharp increase in July, followed by a gradual decrease through December.
Figure 2e-h shows the annual time series trends for T2, RH2, WS10, and precipitation.The model performs relatively well in predicting the annual mean T2 for most years (with MBs of < 0.5 • C; Fig. 2e).T2 also does not show an obvious decreasing or increasing T2 trend between 2001 and 2010.The IOA for annual T2 for all years are > 0.95.However for 2002, mean simulated T2 is ∼ 0.7 Figure 3 shows the probability distributions of T2, RH2, WS10, and precipitation against NCDC and NADP for 10 years.The observed and simulated variables are averaged at each site for the 10 year period, and the pairs are then distributed into a probability distribution over 30 bins of observed and simulated values of T2.For T2, the simulated and observed probability distributions are very similar (Fig. 3a), consistent with the statistics for T2 which shows only a small cold bias.The model overpredicts T2 at sites where temperatures are very low.The probability distribution curve for simulated RH2 is also shifted to the right of the observed RH2 (Fig. 3b), with an observed and modeled peak 74 and 78 % respectively.The probability distribution of simulated WS10 is narrower (between 2 and 6 m s −1 ) compared to that of observed WS10 (between 1 and 7 m s −1 ).The model thus overpredicts when near-surface wind speeds are low, but underpredicts when wind speeds are very high.This suggests that the surface drag parameterization is still insufficient to help predict low wind speeds; however, it might have contributed to the reduction in the simulated high wind speeds (Mass, 2012).The probability distribution for simulated precipitation against NADP also shows a shift to the right, consistent with the statistics for overpredicted precipitation and also with the probability curve of RH2.Introduction

Conclusions References
Tables Figures

Back Close
Full   4a) but underpredicted at CASTNET sites for all months except for October with the largest underpredictions occurring in April and May where IOA statistics are the lowest (Fig. 4b).IOA statistics for all climatological months range from 0.5 to 0.6 forAIRS-AQS and from 0.4 to 0.9 for CASTNET.In general, IOA values tend to be higher for CASTNET compared toAIRS-AQS during the fall and winter months of October to March.The IOA values forAIRS-AQS are rather steady on average over the 12 months compared to CASTNET.This can be attributed to the larger dataset ofAIRS- Figure 5 shows the probability distributions of maximum 1 and 8 h O 3 mixing ratios against CASTNET and AQS.The probability distributions of the observed and simulated O 3 mixing ratios are very similar.The model is able to simulate the range and probabilities of O 3 mixing ratios relatively well at both CASTNET andAIRS-AQS sites.At the CASTNET sites as shown in Fig. 5a and b, the model accurately predicts the peak maximum 1 h O 3 mixing ratio centered at ∼ 60 ppb, however, slightly underpredicts the peak maximum 8 h O 3 mixing ratio by a few ppb.At theAIRS-AQS sites as shown in Fig. 5c and d, the predicted probability distribution curve is slightly shifted to the right of the observations for both maximum 1 and 8 h O 3 mixing ratios.It is also interesting to note that the probability distributions for CASTNET andAIRS-AQS are quite different.O 3 at theAIRS-AQS sites has a unimodal normal distribution, while O 3 at the CASTNET sites has a bi-modal distribution, with a tail of the distribution extending toward lower O 3 mixing ratios (0-20 ppb).The peak distribution occurs at around 10 ppb, because the O 3 mixing rations are low at most CASTNET sites.The second peak at ∼ 60 ppb for CASTNET occurs mainly around the summer months during which O 3 is produced through photochemistry involving its precursors.These distributions are attributed to the nature of the sites' locations, where theAIRS-AQS network includes a mixture of urban, suburban and rural sites, leading to a normal distribution of O 3 mixing ratios centered at relatively higher O 3 mixing ratios, while the CASTNET network includes mostly rural sites that exhibit a low maximum 1 and 8 h O 3 mixing ratios, thus leading to a distribution with a tail skewed towards the lower O 3 mixing ratios.

Conclusions References
Tables Figures

Back Close
Full Figure 6 shows the diurnal variation of O 3 concentrations and IOA statistics for the four climatological seasons against CASTNET (Fig. 6a-d) andAIRS-AQS (Fig. 6eh) (Winter -January, February and December (JFD); Spring -March, April, and May (MAM); Summer -June, July, and August (JJA); Fall -September, October, and November (SON).Figure 6a shows that in more rural sites (CASTNET) in winter O 3 tends to be underpredicted during the morning (01:00-09:00 local standard time (LST)) and evening hours (18:00-24:00 LST).However, Fig. 6b shows that in general for allAIRS-AQS sites including urban sites, O 3 is systematically overpredicted for all hours of the day.The diurnal trends for CASTNET andAIRS-AQS are completely opposite for winter.As CASTNET sites are located in areas where urban influences are minimal, most of these sites are likely to be NO x -limited sites (Campbell et al., 2015).Underpredicted NO x emissions in rural areas can lead to underpredictions in O 3 concentrations in NO x -limited areas.As shown in Fig. 2a), T2 is generally overpredicted during the winter months, which explains the overpredictions in O 3 for most sites against AQS.As shown in Fig. 6a-c, for CASTNET, the diurnal variations of O 3 in MAM and JJA are similar to that in JFD.As shown in Fig. 6d, slight overpredictions during the daylight hours of 10:00 to 17:00 LST occur in SON at the CASTNET sites, however the trends are similar for morning and evening hours as compared to the other seasons.Similar to SON at the CASTNET sites, forAIRS-AQS sites, overpredictions during daylight hours occur in JJA and SON (Fig. 6g and h), and also to a much lesser extent in MAM (Fig. 6f).This is probably due to the overpredictions of T2, which are the smallest during MAM compared to other months as shown in Fig. 2a.
Figure 7 compares the spatial distributions of 10 year average of the predicted and observed hourly O 3 mixing ratios.The O 3 mixing ratios tend to be underpredicted in eastern and northeastern US, where most of the CASTNET sites are located (Fig. 7a).This is consistent with the diurnal trends from Fig. 6a-d  photochemical reactivities would also be reduced due to reduced T2.O 3 mixing ratios are, however, overpredicted over northwestern US, and also near the coastline of western US.The overprediction of O 3 mixing ratios in northwestern US can be attributed to an overprediction in the chemical BCs from CESM, as indicated by the high O 3 mixing ratios near the northwestern region of the domain boundary.

Particulate matter
The 10 year average PM 2.5 concentrations are overpredicted with an NMB of 23.3 % against IMPROVE, and underpredicted with an NMB of −10.8 % against the Speciated Trends Network (STN) (Table 2).In addition, the IOA trend in Fig. 4c shows very good performance for PM 2.5 against the Interagency Monitoring of Protected Visual Environments (IMPROVE) with IOA values > 0.8.IOA values for PM 2.5 against STN are high (∼ 0.6-0.8)during the spring and summer months, but lower (∼ 0.4) during the winter months (Fig. 4d).The IMPROVE surface network covers generally rural areas and national parks while the STN surface network covers urban sites.The horizontal resolution of 36 km × 36 km used in this study may be too coarse to resolve the locally high PM 2.5 concentrations at urban sites in STN which are in proximity of significant point sources, especially during the fall and winter.During these colder seasons, PM 2.5 concentrations over the US in general tend to be higher due to an extensive use of woodstove and cold temperature inversions, which trap particulates near the ground (EPA, 2011).As shown in Table 2, the concentrations of PM 2.5 species such as SO 2− 4 , OC, and TC are overpredicted at the IMPROVE sites, while the concentrations of the other main PM 2.5 species NO − 3 , NH + 4 , and EC are underpredicted at both IMPROVE and STN sites.TC concentrations, which are the sum of OC and EC, are overpredicted due to larger overpredictions of OC compared to the underpredictions of EC.The model also simulates both primary organic aerosol (POA) and secondary organic aerosol (SOA).OC is calculated as the sum of POA and SOA divided by the ratio of OA/OC, which is assumed to be a constant of 1.4 (Aitken et al., 2008).This calculation of OC using a constant of 1.4 is an approximation, which is subject to uncertainties Introduction

Conclusions References
Tables Figures

Back Close
Full when comparing simulated OC against observational data, as the ratio of OA/OC can be different in different environments (Aitken et al., 2008).
As shown in Table 2, at the STN sites, the model slightly overpredicts the concentrations of SO 2− 4 , while underpredicting those of NO − 3 , NH + 4 , and EC.The overpredictions of SO 2− 4 are likely due to the uncertainties that arise from processing of the RCP SO 2 emissions.The RCP SO 2 emissions are only available as a total emission flux, and they are not vertically distributed to the important point sources such as furnaces and stacks.In this work, two steps are taken to resolve the RCP elevated SO 2 emissions in each emission layer.First, a set of factors are derived from the fraction of the elevated emissions in each layer to the vertical sum of emissions for NEI used by default in the SMOKE model with the NEI data.Second, these factors are applied to the total RCP emissions to obtain SO 2 emissions in each emission layer.The total RCP SO 2 emissions were higher than the total NEI emissions, resulting in higher surface and elevated SO 2 emissions.Figure 4g and h compares the modeled annual average time series for PM 2.5 against IMPROVE and STN observations, respectively.In general, the model performs well for PM 2.5 at the IMPROVE (IOA > 0.8) and STN (IOA ∼ 0.5-0.7)sites.
A declining trend in PM 2.5 observed and simulated concentrations are also observed over the years.For the later years (2007 to 2010), the model performs significantly better against IMPROVE compared to STN.As 2010 NEI emissions are used for the years 2007 to 2010, there are not many variations in the simulated PM 2.5 concentrations over these 4 years.
Figures 7 and 8 show the spatial plots of 10 year average of simulated 24 h average, PM 10 , PM 2.5 , and PM 2.5 species concentrations, overlaid with observations from both STN and IMPROVE.The underpredictions of PM 10 are dominated by an underprediction in the wind-blown dust emissions, especially in western US (Fig. 7b).This is confirmed in Table 2,  < 20 µg m −3 ).The AER/AFWA dust module (Table 1) does not produce sufficient dust in this case, even though WS10 is overpredicted and is proportional to the dust emissions.The sea-salt emission module by Gong et al. (1997), however, seems to produce a reasonable amount of sea-salt as shown by the similar concentrations between simulated and observational data for PM 10 near the coastlines.In addition, the MADE/VBS module in WRF/Chem does not explicitly simulate the formation/volatilization of coarse inorganic species.The coarse inorganic species are available, however, in the emissions and are transported and deposited in a manner that is similar to non-reactive tracers.
The model performs well for PM 2.5 over eastern US (Fig. 7c), where modeled concentrations are close to the observations; however, over the western US there are underpredictions in PM 2.5 , especially in central to southern California.Even though Table 2 shows in general an overprediction of SO 2− 4 against STN sites, the model underpredicts SO 2− 4 in regions of elevated SO 2− 4 concentrations, in particular, where concentrations are above 10 µg m −3 in the vicinity of significant point sources of SO 2 and SO 2− 4 over eastern US (Fig. 7d).This is likely due to the coarse resolution (0.5 • × 0.5 • ) of RCP emissions, which probably results in a general overprediction of SO 2 emissions over a grid but cannot resolve point sources smaller than the grid resolution.A similar pattern is found for NH + 4 over eastern US due to underpredictions of high concentrations of SO 2− 4 (Fig. 8a).There are also large underpredictions in NH + 4 over the western US.The underpredictions in NH + 4 are likely due to underpredictions of NH 3 emissions from RCP.The NH 3 emissions from RCP are much lower than those of NEI emissions over western US, by more than a factor of 5, especially over portions of California.Large underpredictions occur over both eastern and western US for NO − 3 , EC, and TC (Fig. 8b-d).The underpredictions in NO − 3 are more likely influenced by the underpredictions of NH + 4 rather than NO x emissions.NO x emissions for NEI are higher than those of RCP for a number of point sources, however, in general RCP has higher NO x emissions.The statistics for IMPROVE TC indicate overpredic-Introduction

Conclusions References
Tables Figures

Back Close
Full tions; however the statistics for STN TC indicate larger underpredictions with an MB of −2.0 µg m −3 , which would explain the large underpredictions in PM 2.5 concentrations over western US.The large underpredictions are in part impacted by uncertainties in emissions as well as due to uncertainties in the precursor gas emissions for these species, especially for TC.The RCP emissions of EC and POA are lower when compared to those of NEI.NEI emissions have a higher spatial resolution, and thus more adequately represent the emissions from point sources compared to RCP.The underpredictions of TC are also more likely due to underpredictions in EC as compared to OC, as shown in underpredictions of EC by Fig. 8c.As T2 is slightly underpredicted, these could have resulted in underpredictions in the isoprene and terpene, which are major gas precursors of biogenic SOA, resulting in lower SOA and OC concentrations.
In addition, the emissions of anthropogenic VOC species from RCP which are also of a lower spatial resolution compared to their emissions in the NEI tend to also be lower than NEI levels especially at point sources.The underpredictions for these particulate species, especially for water-soluble species including NH + 4 and NO − 3 are also likely impacted by overpredictions in precipitation (Fig. 2d), which leads to an overprediction in their wet deposition rates and thus a reduction of their ambient concentrations.
The overpredictions in WS10 also help contribute to the deposition of PM 2.5 and PM 2.5 species onto the ground (Sievering et al., 1987).

Aerosol, cloud, and radiation predictions
There are uncertainties in the satellite retrievals of various aerosol-cloud-radiation variables from the Clouds and the Earth's Radiant Energy System (CERES) and the Moderate Resolution Imaging Spectroradiometer (MODIS).Loeb et al. (2009) reported that the major uncertainties of the top of atmosphere radiative fluxes from CERES are derived from instrument calibration (with a net error of 4.2 W m −2 ), and the assumed value of 1 W m −2 for total solar irradiance.However, there is good correlation (R > 0.8) between the model and CERES for the radiation variables SWDOWN, GSW, and GLW, which are all measured at the surface (Table 2).Modeled OLR at the top of the at-Introduction

Conclusions References
Tables Figures

Back Close
Full mosphere also has relatively good correlation (R ∼ 0.6).SWDOWN and GLW are both slightly overpredicted due to influences from biases in PM concentrations and clouds, but GSW and OLR are slightly underpredicted.
The overpredictions of the surface radiation variables are also impacted by the underpredictions in AOD and COT.AOD is underpredicted with an NMB of −24.0 %, and COT is underpredicted with an NMB of −44.3 %.These underpredictions indicate that less radiation is attenuated (i.e., absorbed or scattered) or reflected while traversing through the atmospheric column and clouds, thus allowing more radiation to reach the ground.Using the CESM model, He et al. (2015) also showed underpredictions in AOD and COT over CONUS against MODIS satellite retrievals.Figure 9 compares the spatial distributions of the 10 year average predictions of AOD (a and b) against the satellite retrieval data from MODIS.The simulated AODs show relatively large values over eastern US, due to the relatively higher PM concentrations in this region of the US.The MODIS AOD, however, shows slightly elevated AOD over eastern US, but the magnitudes are not as high as the simulated AOD over eastern US, and are also not as high as the MODIS-derived AOD over the western US.The differences between the MODIS AOD and the simulated AOD are likely due to the differences in the algorithms used to retrieve AOD based on MODIS measurements and calculate AOD in WRF/Chem.For MODIS, AOD is calculated by matching the spectral reflectance observations with a lookup table based on a set of aerosol parameters including the aerosol size distributions from a variety of aerosol models, which differ based on seasons and locations (Levy et al., 2007).There are also different algorithms for dark land, bright land, and over oceans (Levy et al., 2013).The MODIS data are aggregated into a global 1 • gridded (Level-3) dataset with monthly (MOD08_M3) temporal resolution (https://www.earthsystemcog.org/site_media/projects/obs4mips/TechNote_MODIS_L3_C5_Aerosols.pdf).The inaccuracies for the calculation of AOD in WRF/Chem include biases in aerosol size distribution, aerosol composition, aerosol water content, and reflectances.They can also arise from parameterizations in the calculations including the assumption of an internally-mixed Introduction

Conclusions References
Tables Figures

Back Close
Full aerosol composition.Therefore, caution should also be taken when comparing simulated AOD with the satellite-derived AOD products.Toth et al. (2013) compared Aqua MODIS AOD products over the mid to high latitude Southern Ocean where a band of enhanced AOD is observed, to cloud and aerosol products produced by the Cloud-Aerosol Lidar with Orthogonal Polarization (CALIOP) project; and AOD data from the Aerosol Robotic Network (AERONET) and the Maritime Aerosol Network (MAN).They concluded that the band of enhanced AOD is not detected in the CALIOP, AERONET, or MAN products.The enhanced AOD band is attributed to stratocumulus and low broken cumulus cloud contamination, as well as the misidentification of relatively warm cloud tops compared with surrounding open seas.Figure 9a and b shows the spatial distributions of the 10 year average predictions of AOD compared against the satellite retrieval data from MODIS.The simulated AODs show relatively large values over eastern US, due to the relatively higher PM concentrations in this region of the US.The MODIS AOD, however, does not show a similar spatial pattern.Figure 9 also shows spatial distributions of the 10 year average predictions of CDNC (c and d), CWP (e and f), and COT (g and h), compared against the satellite retrieval data from MODIS.The cloud variables CDNC, CWP, and COT tend to be underpredicted for most of the regions over the US.However, CWP is largely overpredicted over the Atlantic ocean.This is also likely due to the infrequent monthly reinitialization of the WRF/Chem simulations in this study, which results in a build-up of moisture over the Atlantic ocean, also influencing precipitation as mentioned previously.CDNC is overpredicted over some regions in eastern US, but there are also relatively large areas of underpredictions over both the land and ocean.This leads to an average domain-wide underprediction for CDNC (Table 2).This is likely due to the differences in deriving CDNC in the model and in the satellite retrievals.CDNC in the model is calculated served data.As indicated by Bennartz (2007), the errors in CDNC can be up to 260 %, especially for regions with low CF (< 0.1).The model and MODIS spatial patterns are similar for CWP and COT over land, although the model values are underpredicted.King et al. (2013) reported that the MODIS retrieval of cloud effective radius when compared to in-situ observations is overestimated by 13 % on average.Combined with overestimations in COT, this leads to overestimation of liquid water path.In addition, there can also be differences in satellite-derived cloud products from different satellites.For example, Shan et al. (2011) showed that the derived CLDFRA from MODIS and another satellite, the Polarization and Directionality of Earth Reflectances (POLDER) can differ with a global average of 10 %.
Figure 10 shows similar spatial plots for modeled vs. CERES derived SWDOWN, OLR, SWCF, and LWCF.We note that modeled SWCF is calculated based on the differences between the net cloudy sky and net clear sky shortwave radiation at the top of atmosphere, which in turn are dependent on cloud properties including the CLDFRA, COT, cloud asymmetry parameter, and cloud albedo.It is possible that due to the overprediction of CLDFRA, the magnitudes of the simulated SWCF are greater than those from CERES (Fig. 10c and g), even though the other cloud variables are underpredicted.LWCF is calculated based on the differences in clear-sky OLR and cloudy-sky OLR, which in turn are dependent on CLDFRA, COT, and absorbance and radiance due to atmospheric gases.The underprediction of total-sky OLR (Table 2 and Fig. 10b and f) leads to an overprediction in LWCF.SWCF is largely overpredicted over eastern US and especially over the Atlantic ocean (Fig. 10c and g).LWCF is also overpredicted by the model in similar locations as SWCF, such as in southeastern US, and over the ocean in the eastern portion of the domain (Fig. 10d and h).This is further confirmed by the underpredictions in SWDOWN over the Atlantic ocean and in general over the eastern portion of the domain, as increased clouds (as a consequence of overpredicted AOD, CWP and COT) and SWCF lead to less SWDOWN reaching the ground (Fig. 10a and e) which also eventually leads to a reduction in the OLR also over the eastern portion of the domain.The larger negative SWCF and positive LWCF in the

Conclusions References
Tables Figures

Back Close
Full model compared to CERES, however, lead to an overall good agreement with CERES for the net cloud forcing (SWCF + LWCF; not shown).The mean bias for SWCF against CERES of 7.8 W m −2 and that for LWCF against CERES of 6.9 W m −2 are comparable to the results from the CMIP5 models of −10 to 10 W m −2 over CONUS region (Fig. 9.5 in Flato et al., 2013).The evaluation of 10 year averaged predictions of aerosol-cloudradiation variables is similar to the results from the WRF/Chem simulations in 2006 and 2010 by Yahya et al. (2014Yahya et al. ( , 2015)).For example WRF/Chem generally performs well for cloud fraction but AOD, CDNC, CWP and COT are underpredicted in both studies, which possibly indicate consistent biases for every year contributing to climatological biases.

Summary and conclusions
Overall, the model slightly underpredicts T2 with a mean bias of ∼ −0.performance when predicting chemical variables such as O 3 and PM 2.5 .Biases in O 3 and PM 2.5 concentrations can be attributed to biases in any of the meteorological and chemical variables.The model performs generally well for radiation variables, as well as for the main chemical species such as O 3 and PM 2.5 , which indicates that the processed RCP 8.5 emissions are reasonably accurate to produce acceptable results for the concentrations of chemical species.
Modeled O 3 mixing ratios at the CASTNET sites are slightly underpredicted, but are slightly overpredicted atAIRS-AQS sites, in part due to the fact that the CASTNET sites are classified as rural, while theAIRS-AQS sites are classified as both urban and rural.O 3 mixing ratios at theAIRS-AQS sites tend to be overpredicted during the colder fall and winter seasons, and annually, O 3 mixing ratios are overpredicted every year from 2001 to 2010.O 3 mixing ratios at the CASTNET sites are underpredicted for all climatological months, while the largest underpredictions are observed from January to May.However, on a decadal time scale, WRF/Chem adequately represents the different O 3 probability distributions at theAIRS-AQS and CASTNET sites.This study also showed that peak O 3 mixing ratios are observed over April and May rather than June to August, which is consistent with Cooper et al. (2014) who attributed this to emission reductions and opposite trends in O 3 mixing ratios over eastern and western US over the last 20 years.Modeled PM 2.5 concentrations tend to be overpredicted at the IMPROVE sites but underpredicted at the STN sites.PM 2.5 at the IMPROVE sites tend to be underpredicted in spring and summer but overpredicted in fall and winter, while PM 2.5 concentrations against STN are persistently underpredicted for all climatological months.The IMPROVE and STN sites are classified as rural and urban, respectively.Due to the relatively coarse horizontal resolution of the model (36 km×36 km), the model is unable to capture the locally higher PM 2.5 concentrations at the STN sites.In general, however, the model performs relatively well for total PM 2.5 concentrations at the IMPROVE and STN sites with NMBs of within ±25 %, although larger biases exist for PM 2.5 species.Model performance for PM 10 should be improved, as PM 10 also has important impacts on climate through influencing the radiative budget both directly and indirectly due to its

Conclusions References
Tables Figures

Back Close
Full work, they use somewhat different physics schemes which may contribute to such uncertainties.The development of scale-aware parameterizations that can be applied at both global and regional scales would help reduce uncertainties associated with the use of different schemes for global simulations and downscaled regional simulations.
The Supplement related to this article is available online at doi:10.5194/gmdd-8-6707-2015-supplement.Introduction

Conclusions References
Tables Figures

Back Close
Full Discussion Paper | Discussion Paper | Discussion Paper | 5 emissions and boundary conditions from an updated version of CESM with advanced chemistry and aerosol treatments Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Kernel Emissions (SMOKE) model version 2.3 for 2002 NEI and SMOKE version 3.4 for 2008 NEI with year-specific sector emissions for 2006 and 2010, which prepare the spatially, temporally, and chemically speciated "model-ready" emissions.Since NEI is updated and released every three years, the temporal profiles of emissions used in SMOKE for 2002, 2006 and 2010 are assumed to be valid for 3-4 years around the NEI years, i.e.
chemistry and aerosol treatments, CESM and WRF/Chem use the same shortwave and longwave radiation schemes (i.e., the Rapid and accurate Radiative Transfer Model for GCM (RRTMG)), though they use different cloud microphysics parameterizations, PBL, and convection schemes.As GCMs generally contain systematic biases which can influence the downscaled simulation, the meteorological ICs/BCs predicted by CESM are bias corrected before they are used by WRF/Chem using the simple Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | 3 • C).This is better or consistent with other studies which tend to report underpredictions in simulated T2.Brunner et al. (2015) reported a range of monthly MBs for T2 of −2 to 1 • C for simulations using a number of CTMs over individual years for 2006 and 2010 with reanalysis meteorological ICs/BCs.Seasonal temperature biases of −1.8 to −2.3 • C were reported from an ensemble of regional climate models (RCMs) for a simulation period of 1971 to 2000 over the Northeast (Rawlins et al., 2012).He et al. (2015) also showed biases of 0 to −3 • C over CONUS when compared against Discussion Paper | Discussion Paper | Discussion Paper | NCEP reanalysis data.Kim et al. (2013) compared the results of a number of RCMs over CONUS over a climatological period of 1980 to 2003 against Climatic Research Unit (CRU) surface analysis data at a 0.5 • resolution and reported T2 biases of −5 to 5 • C. Figure 9.2 from Flato et al. (2013) shows that the Coupled Model Intercomparison Project Phase 5 (CMIP5) models tend to underpredict T2 for the period of 1980 to 2005 over western US by up to −3 • C. The slight bias in T2 can be attributed to errors in soil temperature and soil moisture Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | AQS (> 1000 stations) compared to CASTNET (< 100 stations), the high and low undulations in O 3 averages at the CASTNET sites tend to be smoothed or averaged out in O 3 averages at theAIRS-AQS sites given largerAIRS-AQS dataset.The observed data fromAIRS-AQS and CASTNET also show the highest monthly O 3 mixing ratios over April and May.This result is consistent with the findings ofCooper et al. (2014), who reported the highest mass of tropospheric O 3 for the Northern Hemisphere in April and May based on the Ozone Monitoring Instrument (OMI) measurements in 2004, which suggested that the column mass of O 3 is not necessarily proportional to nitrogen oxide (NO x ) emissions that peak during the summer.In addition, Cooper et al. (2014) attributed a shift in the seasonal O 3 cycle observed at many rural mid-latitude monitoring sites to emissions reductions in the US.The same study also reported that the summertime O 3 mixing ratios were lower in eastern US between 2005 and 2010 when compared to previous years, while remaining relatively constant in spring.Discussion Paper | Discussion Paper | Discussion Paper | the summer O 3 maximum during 2001-2004 was replaced by a broad spring/summer peak in 2005-2010.Both the observed and simulated O 3 mixing ratios do not decrease forAIRS-AQS and CASTNET from 2001 to 2010 (Fig.4e and f).This is somewhat consistent withCooper et al. (2014) which showed that surface and lower tropospheric O 3 has a decreasing trend over eastern US but an increasing trend over the western US from 1990-1999 to 2010.The predicted annual average O 3 mixing ratios are consistent from 2001 to 2010, with overpredictions and IOAs of ∼ 0.6 at theAIRS-AQS sites, and underpredictions and IOAs of ∼ 0.6 to 0.8 at the CASTNET sites.
Discussion Paper | Discussion Paper | Discussion Paper | which also show underpredictions for CASTNET sites.From Fig. 1a, T2 is underpredicted on average over northeastern US, which results in underpredictions in biogenic emissions in the rural areas from MEGAN2.This would in turn reduce O 3 mixing ratios in VOC-limited areas.O 3 Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | which shows an MB of −11.5 µg m −3 and an NMB of −51.2 % against PM 10 observations atAIRS-AQS sites.The observational data indicate the elevated concentrations of dust over portions of Arizona and California (> 50 µg m −3 ), which are not reproduced by the simulations (the simulated concentrations are much lower, Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | based on the activation parameterization by Abdul Razzak and Ghan (2000) based on the aerosol size distribution, aerosol composition, and the updraft velocity.The MODISderived CDNC from Bennartz (2007) is calculated based on cloud effective radius and COT, which would explain the differences in spatial patterns between model and ob-Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | 3• C, which is consistent or better than other studies based on chemical transport models and regional climate models.The underpredictions in T2 correlate to the overpredictions in RH2.WS10 biases are likely due to issues with unresolved topography or due to inaccuracies in the selection of representative grid points.There are seasonal biases in precipitation, where overpredictions tend to occur largely over the summer months; however, precipitation is overpredicted every year between 2001 and 2010 likely due to uncertainties in WRF microphysics and cumulus parameterization schemes as well as the accumulation of moisture due to the monthly reinitialization.More frequent reinitializations would help to reduce the biases in moisture, precipitation, and related cloud and radiation variables, however, a balance would have to be achieved between running continuous climate simulations and the frequency of reinitializations.A satisfactory model performance for meteorological variables is important and necessary when simulating future years, as data evaluation is not possible.Meteorological variables such as temperature, humidity, wind speed and direction, PBL height, and radiation have a strong impact on chemical predictions, and thus are critical to the satisfactory model Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper |larger size and higher concentrations.The choice of observational networks for model evaluation are therefore important as both networks can show positive and negative biases depending on the type and location of the sites (e.g., O 3 againstAIRS-AQS and CASTNET, and PM 2.5 against STN and IMPROVE).The major uncertainties lie in the predictions of cloud-aerosol variables.As demonstrated in this study, large biases and error in simulating cloud variables even in the most advanced models such as WRF/Chem, indicating a need for future improvement in relevant model treatments such as cloud dynamics and thermodynamics, as well as aerosol-cloud interactions.In addition, there are large uncertainties in satellite retrievals of cloud variables for evaluation.In this study, most of the cloud-aerosol variables including AOD, COT, CWP, and CDNC are on average underpredicted across the domain; however, the overpredictions of cloud variables including COT and CWP over the Atlantic ocean and eastern US lead to underpredictions in radiation and overpredictions in cloud forcing, which are important parameters when simulating future climate change.In summary, the model is able to predict O 3 mixing ratios and PM 2.5 concentrations relatively well with regards to decadal scale air quality and climate applications.The model is able to predict meteorological variables satisfactorily and with results comparable to RCM and GCM applications from literatures.Possible reasons behind the chemical and meteorological biases identified through this work should be taken into account when simulating longer climatological periods and/or future years.Aerosolcloud-radiation variables are important for climate simulations, the performance of these variables are not as good as that of the chemical and meteorological variables.They contain consistent biases in single-year evaluations of WRF/Chem.However, magnitudes of biases for SWCF and LWCF are comparable to those from literature, which suggests that model improvements should be made in terms of bias correction of downscaled ICs/BCs as well as aerosol-cloud-radiation parameterizations in the model.In addition, having consistent physical and chemical mechanisms between the GCM and RCMs could help to reduce uncertainties in the results(Ma et al., 2014).Although CESM and WRF/Chem use similar chemistry and aerosol treatments in this Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | for summertime secondary organic aerosols over the eastern United States in 2006, J. Geophys.Res.117, D06301, doi:10.1029/2011JD016831,2012.Beniston, M., Stephenson, D. B., Christensen, O. B., Ferro, C. A. T., Frei, C., Goyette, S., Halsnaes, K., Holt, T., Jylha, K., Koffi, B., Palutikof, J., Scholl, R., Semmler, T., and Woth, K.: Future extreme events in European climate: an exploration of regional climate model projections, Clim.Change, 81, 71-95, doi:10.1007/s10584-006-9226-z,2007.Bennartz, R.: Global assessment of marine boundary layer cloud droplet number concentration from satellite, J. Geophys.Res.-Atmos., 112, D02201, doi:10.1029/2006JD007547,2007.Brunner, D., Savage, N., Jorba, O., Eder, B., Giordano, L., Badia, A., Balzarini, A., Baro, R., Bianconi, R., Chemel, C., Curci, G., Forkel, R., Jimenez-Guerrero, P., Hirtl, M., Hodzic, A., Discussion Paper | Discussion Paper | Discussion Paper |

Table
For the remaining CB05 species that are not available in RCP, the 2000 emissions are based on the 2002 National Emission Inventory (NEI) (version 3, http://www.epa.gov/ttn/chief/emch/), while the 2005 and 2010 emissions are based on the 2008 NEI (version 2), with year-specific updates for on/off road transport, wildfires and prescribed fires, and Continuous Emission Monitoring-equipped point sources observed and mean simulated value but not influence IOA significantly.RH2 is consistently overpredicted by the model with the largest overprediction in 2009.With the exception of 2009, observed RH2 is rather steady (65-70 %) from 2001 to 2010.IOA is also steady for RH2, except for 2009.As mentioned earlier, WRF tends to overpredict WS10 in general.Figure2gshows that observations indicate weaker wind speeds from 2001 to 2007.Model performance is better from 2007 to 2010 with higher IOAs compared to previous years.WRF has worse performance especially at weaker wind speeds as is the case from 2001 to 2007.Model performance for precipitation is more variable year-to-year, with IOAs ranging from 0.4 to 0.7; however, there is a systematic positive bias during the 10 year period.
• C higher than the observational data.IOA is still high for 2002 which indicates probably good performance of T2 at most sites, however with large overpredictions at a few sites which could skew the mean Introduction Discussion Paper | Discussion Paper | Discussion Paper | Discussion Paper | Table2summarizes the statistics for major chemical species.The model overpredicts hourly O 3 mixing ratios on average against the Aerometric Information Retrieval System (AIRS) -Air Quality System (AQS) with an NMB of 9.7 % and an NME of 22.4 %, but underpredicts O 3 mixing ratios against the Clean Air Status and Trends Network (CASTNET) with an NMB of −8.8 % and an NME of 19.8 %.The O 3 mixing ratios are overpredicted at AIRS-AQS sites for all climatological months except for April and May (Fig.

Table 1 .
Model configurations and set-up.