Point by Point Response to the "White Paper" by Skamarock and Baldwin

Zavisa Janjic and Tom Black, NCEP

 

 

 

There are established mechanisms for independent evaluation of the WRF cores within the WRF project.  Thus, the effort volunteered by Skamarock and Baldwin was welcome but not necessary.

 

 

Interestingly, this one, and some of the subsequent claims by Skamarock and Baldwin were made on the basis of a single example of an NMM precipitation forecast.  We do not know the origin of this NMM forecast example.  However, in the upper panel of Fig. 1 we show an example in which contrary to the claims by Skamarock and Baldwin, the NMM precipitation forecast has more variance than the forecast by the Mass core (MC).  These examples demonstrate that it is not possible to make valid general conclusions on the basis of selected fragmentary evidence.

 

 

As will be explained later on, we disagree with this claim.

 

 

Note that all this analysis is done on a single example.  As already pointed out, we do not know the origin of this example.  The NMM was not yet operational at that time and has changed since then.

 

We have demonstrated in one of our previous responses that the rather smooth appearance of some of our precipitation forecasts is mostly a matter of tuning of precipitation physics, and not a result of numerical filtering.  The explanation of the alleged properties of the precipitation spectra by damping in the dynamics is unfounded.  Moreover, as can be seen from the random example from the Western Domain shown in Fig. 1, our precipitation forecasts are not necessarily smoother than the forecasts by the MC (upper two panels) as claimed by Skamarock and Baldwin.  On the contrary, the NMM forecast presented here shows more structure.  Needless to say, neither the precipitation example by Skamarock and Baldwin nor our example is sufficient for making general conclusions on the behavior of the dynamics of WRF cores as Skamarock and Baldwin do.

 

Use of the spectrum as a measure of success of precipitation forecasts is very unusual and its relevance is questionable.  The most important aspects of precipitation forecasts are ultimately how well models predict the location, timing, and amount of precipitation.  The success of precipitation forecasts concerning the location and the amount of predicted precipitation can be measured, respectively, e.g., by equitable threat score and by the bias score. 

 

The statistics on these two scores for the Western domain for November 2003 is shown in the lower panel of Fig. 1 for the Eta (red solid line), the GFS (dashed dark blue line), operational NMM (dashed green line), experimental WRF NMM (dashed light blue line) and MC (dashed purple line).  As can be seen from the plots, despite the fact that the MC generally shows less structure than the operational NMM and the experimental WRF NMM in this domain, the MC lags considerably behind other models in both the threat and the bias scores.  It is particularly disturbing that, as can be seen from the bias score, the MC tends to significantly over-predict the precipitation amounts.  This indicates that the model has problems with maintaining the hydrological balance, and more importantly, that the energy input by the phase changes of water is overestimated by a considerable amount.  This spurious energy input adversely affects both the model physics and dynamics.

 

Two other points can be made regarding Fig. 1.  The forecast from the MC predicted a large amount of widespread heavy precipitation over Missouri and northeast Oklahoma where virtually none actually fell.  It also greatly exaggerates the amount of rainfall in northwest Wisconsin while totally missing what did fall in east central Iowa.  The NMM and Eta forecasts did not produce these errors or did so to a far lesser degree.  The simple fact that the MC precipitation forecast generates structure certainly does not in itself translate into skill in forecasting the nature of convection, timing, location, or amount of rainfall.  Secondly we have demonstrated in one of our previous responses that the rather smooth appearance of some of the NMM precipitation forecasts is mostly a matter of tuning of precipitation physics and not a result of numerical filtering.  This is demonstrated again in Fig. 2 that was produced by modifying parameters within the BMJ convection scheme.  The result is that the detail and structure in the field have been greatly enhanced although some of the errors made by the MC forecast are also beginning to appear.  As development proceeds we want to provide as much detail to the forecasters as possible but not simply for the sake of detail itself.

 

 

The data that are supposed to support this claim are not available from the web page to which Skamarock and Baldwin refer.  We will come back to the question of dissipation later.

 

 

The claim that "direct measure of dissipation in the model's dynamics is its kinetic energy spectra" is untenable in the current context.  The kinetic energy spectrum certainly is not a direct measure of dissipation in the model's dynamics.  The spectrum produced by a mesoscale model in a short-range run in a limited area domain depends on the spectrum of initial data, on the synoptic situation, on the presence and characteristics of physical and spurious sources and sinks of energy, the size of the integration domain, the length of the integration, and, of course, on how well the model dynamics simulate the nonlinear energy cascade.

 

But let us first clarify what is the subject of the discussion.  Namely, Nastrom and Gage (1985) examined measurements made by commercial aircraft and found that one-dimensional kinetic energy spectra along their flight-paths in the lower stratosphere and in the upper troposphere follow the –5/3 slope in the range from several hundred kilometers to several kilometers.  As of now, there is no universally accepted explanation of this spectral shape.  Several possible explanations have been proposed (e.g. Gage, 1979; Lilly, 1983; Gage and Nastrom, 1986; Tung and Orlando, 2003).  They include the downscale nonlinear energy cascade and an inverse cascade from smaller to larger scales.

 

Using two-parameter quasi-geostrophic dynamics, Tung and Orlando (2003) demonstrated that given enough time to reach the statistical equilibrium, the –5/3 spectral range can be generated through downscale cascade of energy.  Similar statistical properties of the spectra were obtained in extended simulations using the GFDL SKYHI model with higher than usual yet still quite modest horizontal resolution for a climate model (e.g., Hamilton et al., 1999; Kosyik and Hamilton, 2001).

 

Concerning the properties of the motions in the –5/3 spectral range, as can be seen from the Nastrom and Gage (1985) spectrum, the spectral amplitudes of the waves with the largest energy are 103 to 106 times larger than the amplitudes in the –5/3 range.  So if the amplitudes in the large scale part of the spectrum correspond to a wind speed on the order of 10 m s-1, then the amplitudes in the –5/3 range correspond to wind speeds of a few cm s-1 to a few tens of cm s-1.  Apart from being representative for lower stratosphere and upper troposphere, this is obviously not enough to explain, e.g. an MCS and many other important mesoscale motions.  Thus, this part of the spectrum cannot be directly related to severe mesoscale phenomena as Skamarock and Baldwin repeatedly do.

 

Generally, it should come as no surprise if a mesoscale model develops the –5/3 spectrum considering that models with much simpler dynamics and coarser resolution have successfully done that (e.g., Hamilton et al., 1999; Kosyik and Hamilton 2001; Tung and Orlando, 2003).  However, the mesoscale runs typically are made on much shorter time scales than those considered in most previous studies.  Namely, the statistical properties of atmospheric spectra typically are investigated in extended integrations (tens or hundreds of days), and the spectra are averaged over long periods (tens or hundreds of days) in order to ensure that statistical equilibrium is reached.  The need for extended integrations and long averaging periods arise due to the time scale of the nonlinear cascade.

 

Despite the problem with the time scale, in short range runs the model spectrum perhaps still can be close to the statistical atmospheric spectrum if the initial data are well balanced and do not deviate too much from observed statistics.  However, consider the possibility that the initial data are not well balanced, and that they do deviate considerably from the observed atmospheric spectrum in the –5/3 range.  This could be due to the imperfections of the driving model or due to the set-up of the data assimilation system for example.  In addition, the size of the integration domain in mesoscale runs is typically smaller than the size of the large-scale atmospheric disturbances that feed the downscale nonlinear cascade.  Thus it appears that physical or spurious sources of energy other than the downscale nonlinear cascade from the large-scale motions are needed in order to generate and maintain the –5/3 spectra in mesoscale atmospheric models.

 

Possible sources that can generate and maintain the –5/3 spectrum in mesoscale models may be (a) physically justified mesoscale forcing, (b) early collapse of the spectrum due to spurious computational nonlinear cascade, (c) other small-scale computational errors such as the errors due to the representation of topography, etc.  In order to clarify the situation several tests have been performed using two versions of the NMM, namely the parallel version on the E grid (WRF-NMM) and the PC version on the B grid (NMM-B).  Both versions were designed applying the same principles in the discretization of the model dynamics and share the same physical package (Janjic et al., 2001; Janjic, 2003).

 

It should be noted that the WRF-NMM and the NMM-B are well qualified for investigating atmospheric spectra.  Their conserving of energy and enstrophy generally improves the accuracy of the nonlinear dynamics.  In particular, the energy and enstrophy conservation controls the nonlinear energy cascade and restricts an early spurious energy transfer toward smaller scales by nonlinear interactions.  The energy conservation improves the stability of the model and eliminates the need for excessive dissipation (either explicit or built into the finite-difference schemes) that could affect the spectra generated by the model.  In addition, the WRF-NMM and the NMM-B use a hybrid pressure-sigma vertical coordinate so that except for the errors propagating from below, in the upper troposphere and in the stratosphere they are relatively free of the sigma coordinate errors that are largest at higher altitudes.  Finally, explicit formulation of major dissipative processes allows precise “dosage” of dissipation.

 

The topography used in the tests is defined as grid-box means of the USGS 30’’ global data.  Except in ten rows along the lateral boundaries, no smoothing or filtering is applied.  The test with the operational set-up of the WRF-NMM over the Central Domain, but in the sigma coordinate, generated the –5/3 spectral slope on constant pressure surfaces in the upper troposphere and in the stratosphere.  An example of the spectrum (blue diamonds) obtained at the 300 hPa level, and averaged over forecast times from 24 to 36 hours with 3 hour intervals, is shown in Fig. 3.  As can be seen from the figure, the model develops the spectrum (blue diamonds) following the –3 (purple squares) and the –5/3 (yellow triangles) slopes that is in excellent agreement with observations.

 

However, there are problems with interpretation of this result.  Namely, when the operational set-up of the WRF-NMM is run in the hybrid coordinate, the spectrum at the small scales remained steeper than the –5/3 slope and approached the –3 slope (not shown).  Another reason for concern is that, as can be seen from Fig. 4, the spectrum of the square of unfiltered topography in the Central Domain (blue diamonds) follows the –5/3 law in the mesoscale range.  This result, together with the lack of the –5/3 range in the hybrid coordinate runs, challenges the interpretation of the result obtained in the sigma coordinate over mountainous areas as a legitimate Nastrom and Gage (1985) spectra.  Namely, the –5/3 spectrum in the upper troposphere and in the stratosphere could be generated by nonlinear cascade of spurious energy due to the sigma coordinate errors.  In this case, the computational noise would be mistaken for the Nastrom and Gage (1985) spectrum.  On the positive side one could argue that, although for a wrong reason, the nonlinear dynamics of the model still performed well by generating the –5/3 spectrum.  However, considering the shape of the mountain spectrum, the –5/3 spectrum observed in model integrations over mountainous areas could be simply a projection of the spectrum of topography and thus may have nothing to do with the nonlinear dynamics.

 

In order to investigate the possible effect of topography, the NMM-B was run using a resolution of 15 km in the horizontal and 32 levels in the vertical in a domain of the same size as the Central Domain, but over Atlantic Ocean.  In this way the possibility of mountains influencing the energy spectrum is eliminated.  In addition, as a major deviation from the operational set-up, the lateral diffusion was turned off in order to facilitate and hasten the accumulation of energy at the small scales.  However, weak divergence damping was still used.

 

The evolution of the NMM-B spectra at 300 hPa over Atlantic Ocean (blue lines) is shown in Fig. 5 at 6 hour intervals (6-24 hours, top to bottom in the left column and 30-48 hours, top to bottom in the right column).  The –3 (purple lines) and –5/3 (yellow lines) slopes are shown for comparison.  The time average over 36-48 hours of the spectra is shown in Fig. 6.  In the case considered, there was a small but vigorous extratropical cyclone in the northern part of the integration domain.  As can be seen from Fig. 6, by the end of the forecast, the model (blue diamonds) developed the –3 (purple squares) and the –5/3 (yellow triangles) spectral ranges that agree well with observations.  However, as can be seen from Fig. 5, it needed 24-36 hours to do so.  In the run with the same initial and boundary conditions, but with the physical package turned off, the –5/3 spectral range did not develop which indicates that in this case the physical processes provided the energy needed for spinning up the –5/3 spectrum.  It may be that the generation of the –5/3 spectrum was also aided by a more vigorous downscale nonlinear energy cascade since there was more energy than usual on the scales smaller than the size of the integration domain.

 

The WRF-NMM run in the Eastern Domain for the case of hurricane Isabel (initial data September 17, 18Z, from the Eta data) with the resolution of 8 km and 60 levels, and with no lateral diffusion, showed consistent behavior.  The evolution of the WRF-NMM spectra (blue lines) at 300 hPa in the Eastern Domain is shown in Fig. 7 at 6-hour intervals (6-24 hours, top to bottom in the left column, 30-48 hours, top to bottom in the right column).  The –3 (purple lines) and –5/3 (yellow lines) slopes are shown for comparison.  The time average over 36-48 hours of the spectra (blue diamonds) is shown in Fig. 8 together with the –3 (purple squares) and –5/3 (yellow triangles) slopes.  As can be seen from Fig. 8, by the end of the forecast the WRF-NMM also generated well developed –3 and–5/3 spectral ranges in agreement with the observations.  However, as can be seen from Fig. 7, the WRF-NMM needed some time to do so, but perhaps somewhat less than in the previous run of the NMM-B over the Atlantic Ocean.  Again, it may be that the generation of the –5/3 spectrum was also aided by a more vigorous downscale nonlinear energy cascade since there was more energy on the scales smaller than the size of the integration domain.

 

As expected on the basis of theoretical considerations, the presented results demonstrate that the nonlinear dynamics used in the NMM has been successful in reproducing the observed mesoscale atmospheric spectra, even at a rather modest resolution of 15 km, and even without the forcing by the mountains.  Whether the energy in the small-scale part of the spectrum comes from legitimate physical sources or from computational noise due to model imperfections is another issue that requires further investigation.  In other words, there is still no guarantee that the –5/3 spectrum of the model forecasts is generated by the same mechanisms as the Nastrom and Gage –5/3 spectrum observed in nature.  If it turns out that the model forecast spectra in short-range integrations are physically legitimate, and if the model simulates the nonlinear energy cascade well, the shape of the spectrum produced by the model could perhaps be used as guidance for more precise tuning of the model’s dissipation parameters.

 

The most important point here is that, contrary to the claims by Skamarock and Baldwin, the NMM dynamics apparently can and do spin-up the atmospheric spectrum virtually perfectly.  Interestingly, Skamarock and Baldwin have ignored these results.

 

 

 

It is not clear why Skamarock and Baldwin chose to show the results in their Figs. 4-6.  It is even more puzzling that they advertise these apparently problematic results as resounding success.  Namely, none of the spectra they show resembles the observed Nastrom and Gage (1985) spectrum, or the successfully simulated spectra by the NMM shown in our Figs. 3, 6 and 8.

 

For example, in our Fig. 9 we reproduced their Fig. 4, except that we added for convenience a few more lines in order to facilitate evaluation of the agreement of their spectrum with observed spectral slopes.  As can be seen from Fig. 9, the MC spectrum (red line) is significantly steeper than the –5/3 slope in the range where the –5/3 slope is observed (see the light blue line).  In addition, there is a sharp drop-off at the small-scale end of the spectrum that is not observed in nature.  But perhaps the most disturbing is the difficulty that the model apparently had in developing the –3 spectral range (see the dark blue line).  Namely, instead of following the –3 slope, and then gradually transitioning to the –5/3 slope, as in the case of the NMM spectra shown in Figs. 3, 6 and 8 and in the observations, their spectrum generally follows a relatively straight line (purple line in Fig. 9) that is both significantly shallower than –3 and steeper than –5/3.  This indicates that the MC has significant problems with computational nonlinear energy cascade.

 

It is also important that all spectra shown are computed over the central US.  We have pointed out to Skamarock and Baldwin that in this area the spectrum of topography squared follows the –5/3 slope (Fig. 4).  Thus, the deviation of the MC spectrum from the –3 slope may have been aided by the sigma coordinate errors (if the spectrum is indeed at the height of the Nastrom and Gage data), or even worse, generated in the postprocessing of the projection of topography on the sigma coordinate surfaces.

 

We also reproduced their Fig. 5 as our Fig. 10 except that we added for convenience a few more lines in order to facilitate comparison of their spectra with observed spectral slopes.  As can be seen from Fig. 10, the spectra again qualitatively follow straight lines throughout the entire spectral range (brown straight line on top of the red 22 km spectrum, dark green straight line on top of the green 10 km spectrum, and dark blue straight line on top of the blue 4 km spectrum) with slopes in between –5/3 and –3.  However, the slopes of the spectra become shallower with resolution. 

 

In comparison with the NMM, it should be noted that the MC cannot reproduce the –5/3 spectral range even with the 4 km resolution, while the NMM can do so with the 15 km resolution.  These results underscore the importance of energy and enstrophy conservation for proper simulation of atmospheric spectra.

 

Skamarock and Baldwin Fig. 6 does not add any new information.

 

 

Hamilton and Koshik showed climatological spectra, and not spectra spun up in short-range runs.  As already pointed out, the MC spectra are very problematic and therefore cannot serve as a standard for comparison.

 

 

Skamarock and Baldwin cannot know that the deviation from the –3 slope in their spectrum in the central domain is not due to the sigma coordinate errors unless they run their model using an efficient technique for controlling the sigma coordinate errors in the stratosphere and in the upper troposphere.  Such a technique is the application of the hybrid sigma-pressure coordinate used in the NMM.  The claimed similarity of the spectra over the ocean and over the land does not prove anything because the spectra in the two domains could have been spun up from different energy sources.  However, the Skamarock and Baldwin spectra over the land and over the sea are not quite similar.  It is encouraging to note that there is a slight deviation from the straight line in the MC ocean spectrum which supports the speculation about spurious forcing by topography in the runs over land.

 

 

 

 

Skamarock and Baldwin do not offer any scientifically valid evidence for their claim that “Neither of these arguments suggest that the spin-up time should be long.”  If this claim is true, why are the statistical spectra computed after spinning them up for hundreds of days and then averaging them over hundreds of days (e.g. Tung and Orlando 2003)?  The experimental evidence produced by the MC is not acceptable because Skamarock and Baldwin have not demonstrated that the MC can reproduce a realistic atmospheric spectrum (see our Figs. 9 and 10).

 

Concerning the eddy turnover time argument, take for example the scale of 1000 km.  From the Nastrom and Gage (1985) spectrum we find that the characteristic wind speed for the motions on this scale is of the order 1 m s-1.  Then, 1000 km/1 m s-1 = 1 000 000 s = 11.574 days.  For the scale of 100 km, the wind speed is of the order of 0.1 m s-1 so that the eddy turnover time is about the same as for 1000 km. 

 

Concerning the inverse cascade, there have been difficulties with simulating it in numerical experiments, and this hypothesis is still being debated (e.g. Tung and Orlando, 2003).

 

 

We have already pointed out that Skamarock and Baldwin erroneously associate the –5/3 spectral range with severe mesoscale weather phenomena.  Apparently there is not enough energy in the –5/3 range to explain such phenomena.  The bulk of the energy in the spectrum is too high up.

 

 

The “significant input of energy from external or physical mechanisms outside the dynamical equations” is precisely what was responsible for spinning up the –5/3 spectrum in our short-range runs, and that is why it is inappropriate to associate the –5/3 spectrum entirely with the dynamics in the present context.

 

 

Why would the frontal collapse affect the spectrum in the stratosphere?  Also, much of the forcing responsible for cyclogenesis belongs to the –3 range and not to the –5/3 range.

 

 

We also believe that mesoscale models should reproduce the observed –5/3 spectral slope.  Moreover, we have demonstrated that the NMM dynamics can and does produce spectra that agree perfectly well with the observations provided there is a sufficiently strong physical or spurious source of energy, and given enough time (see Figs 3, 6 and 8).  In contrast to that, and contrary to the claims by Skamarock and Baldwin, the evidence they presented does not show that the MC can spin up a spectrum that resembles the observed one at the resolutions at which the NMM can, or at any other resolution they showed (see Figs 9 and 10).  Before we can start a meaningful discussion on the present topic Skamarock and Baldwin need to demonstrate that the MC is capable of reproducing the atmospheric spectrum in mesoscale applications.

 

However, we disagree that the statistical properties of the atmosphere are necessarily reproduced in each realization in small limited area domains, and that mesoscale models should necessarily spin up the –5/3 spectral range in less than 6 hours if the slope in this wavenumber range is much steeper in the initial data.  We are unaware of the parts of turbulence theory and mesoscale dynamics that contradict our view.

 

As already pointed out, comparing the amplitudes of motions in the –5/3 spectral range with the amplitudes of the large scale motions, we arrive at the conclusion that the –5/3 spectral range is dynamically consistent with motions with wind speeds in the range from several cm s-1 to several tens of cm s-1.  Such motions are hardly visible with conventional methods for presenting meteorological information and certainly do not represent phenomena such as severe storms as Skamarock and Baldwin appear to believe.

 

 

The other two primary filtering mechanisms in the MC used at NCEP following NCAR recommendation are:

 

smdiv (default value is 0.)

         This parameter specifies the divergence damping (0.1 is typical – and used at NCEP).

         Used in RK schemes with time-splitting to selectively damp

         acoustic noise.

 

emdiv (default value is 0.)

         This parameter specifies the external mode damping (whatever that may be) in mass model.

         Use it only in real-data cases in mass model (0.01 is typical – and used at NCEP).

 

 

Our intention has been to represent the dissipation in the atmosphere in a physically justified way.  The physical models of dissipation we have chosen, and particularly the tuning parameters that control the intensity of dissipation, are open for discussion and will be addressed in due course in future parallel tests since the NMM has been frozen for a considerable time as the transition to WRF has taken place.

 

 

Takemi and Rotunno (2003) seemed to be largely unsuccessful in reproducing the –5/3  spectral slope for w2 with the MC in their 3D turbulence test.  In contrast to that, the NMM-B produced the –5/3 range in a similar experiment without difficulties.

 

 

The key issue in our misunderstandings appears to be the belief of Skamarock and Baldwin that we are removing the noise from our forecasts by heavy numerical filtering.  That impression is incorrect.  While conserving energy almost exactly in the spatial finite-differencing (except for weak built-in divergence damping), our model dynamics do not generate small scale noise because it was designed not to do so following physical principles that govern the nonlinear dynamics of the real atmosphere (energy and enstrophy conservation in case of rotational flow).  Namely, following Arakawa, our modeling principle has been to prevent generation of computational noise rather than to employ continuous removal of the noise by numerical filters after it is generated.  Thus, it is our contention that our sophisticated numerical schemes based on physical considerations, and not the numerical filtering, are the main reason why our forecasts are to a large extent noise free.  At the same time, despite the fact that some forecasts have a smoother general appearance, we have presented examples demonstrating that our model dynamics produces spectra that agree extremely well with observations in both the –3 and the –5/3 ranges, even with 15 km resolution.  Actually, improved accuracy of the nonlinear energy cascade is probably one of the reasons the NMM produces such spectra.

 

The effect of dissipation in the operational NMM can be clearly seen in comparison with the WRF-NMM forecasts shown at http://wwwt.emc.ncep.noaa.gov/mmb/mmbpll/nestpage/.  Namely, the WRF-NMM is run with very little lateral diffusion.  Still, the forecasts by the operational NMM and by the WRF-NMM are similar in appearance.  The differences in the forecasts between the two models are mainly due to the different specifications of initial and boundary conditions and to subtle differences in their respective physical packages.  In contrast to the two NMM’s, the MC forecast that is also available at the site shows a much higher level of computational noise.

 

Extrapolating experiences gained with the MC to our models is simply not justified.  NCAR has chosen a discretization approach based on higher-order formal accuracy and it is not surprising that the experiences of Skamarock and Baldwin are different.  Namely, this approach is not without problems.  Experience with fitting higher-order polynomials to noisy data indicates that the results of discretization based on higher order formal accuracy may be locally very inaccurate in case of large amplitudes of small scale motions such as those forced by the sigma coordinate errors over rugged topography, or by convection.  In addition, higher order finite-differences require additional computational boundary conditions which also generate computational noise.  Finally, higher order formal local accuracy does not give any guarantee about the accuracy of energy transport by nonlinear interactions and the only means for controlling the nonlinear cascade in the MC is computational filtering.

 

 

The amplification factors sent to Skamarock and Baldwin say otherwise (see Fig. 11).  So, the dissipation in the MC is not smaller than in the Eta and in the NMM, and in contrast to the dissipative MC advection schemes, the dissipation in the Eta and the NMM is not hardwired.  Therefore the dissipation in the Eta and the NMM can be easily reduced further while the dissipation in the MC can be reduced only by increasing resolution because of the dissipative advection schemes.

 

 

The spectrum is not evidence of damping.  Again, the spectrum produced by a mesoscale model in a short-range run in a limited area domain depends on the spectrum of initial data, on the synoptic situation, on physical and spurious sources and sinks of energy, the size of the integration domain, the length of the integration, and, of course, on how well the model dynamics simulate the nonlinear energy cascade.  And, again, the small-scale noise may not be there because it is not generated by the inherent flaws in numerical algorithms.

 

 

It is imperative to note that second order schemes are not all equal.  In particular, we have been using a compact energy and enstrophy conserving scheme (Janjic, 1984) which controls spurious nonlinear downscale cascade.  This scheme performs differently from the most straightforward second order scheme used in these tests and does not even reduce to such a scheme when linearized.  Since we are dealing with spectra, we should expect that the energy and enstrophy conservation matters.  And as can be seen from Skamarock and Baldwin Fig. 11, it does.  As in previous examples, the MC spectrum is somewhere in between the –3 and –5/3 in the entire spectral range except at the end where it falls off sharply.  This should be a reason for concern since even the large scale part of the MC spectrum is too shallow presumably due to a spurious nonlinear energy cascade.  It is well known that the dissipation is the primary means for controlling the spectrum in model formulations that disregard energy and enstrophy conservation.  The dissipation appears to be helpful in the case of MC since it at least improves the shape of the spectrum at the large scales where it should approach the –3 slope.

 

From the description of the experiments we understand that the tests with the second order centered advection were done as follows:

 

a.  “we use 2nd order centered advection (both vertical and horizontal, to remove the upwind filtering from the standard mass-core WRF configuration), we have turned off all other computational spatial filters,” (presumably the red line in the plot)

 

b.  as in a. but with “we use the lower-bound viscosity from NMM in a second order horizontal damping term” (presumably the green line),

 

c.  as in b. but with “we use horizontal divergence damping with the same coefficient used in NMM” (presumably the blue line).

 

Thus, in this experiment series only the sensitivity of the second order MC spectra to dissipation alone was examined.  This experiment bears no relevance to dissipation properties of the NMM.

 

 

Our point is that the NMM forecasts have generally smooth appearance primarily because its sophisticated nonlinear schemes produce little computational noise.

 

 

We view lateral diffusion as a legitimate physical process, not as a numerical filter, and thus model it accordingly.  It is well known that there is no justification based on physical considerations for using diffusion operators higher than second order. 

 

It is also important to note that our lateral diffusion is not a second order filter because it is not linear.  Whether the sixth order filter is more selective than the nonlinear lateral diffusion is not obvious, and in contrast to using nonlinear lateral diffusion, there is no physical justification for using hardwired numerical filters.

 

Besides, the lateral diffusion is also used in the MC in addition to numerical filters.  At NCEP we had to double the recommended MC diffusion coefficient in order to keep the integrations of the MC stable with our physical package.

 

The reason why we don’t use high order numerical filters is not because we haven’t heard of them.  The reason is that we believe that we are doing better without using them at all.

 

 

Historically, the lateral diffusion coefficient in the Eta was being adjusted to damp the two-grid-interval wave in a given time.  This has been a common practice in NWP models (see e.g. how it used to be done in the UK MetOffice models at http://www.cgam.nerc.ac.uk/um/doc/umug/html/node110.htm).  This practice implies that the lateral diffusion is actually little more than a numerical filter, and for this reason we consider it inappropriate.  However, the lateral diffusion coefficients obtained in this way were not excessive.

 

The NMM dissipation was tentatively set in such a way as to mimic the total dissipation in the Eta pending a more detailed study when parallel runs become available.  We do not claim that the dissipation in the current operational NMM may not be overestimated, but we want to approach this issue in a more rigorous way.

 

 

Skamarock and Baldwin again erroneously consider the –5/3 spectrum synonymous with significant mesoscale phenomena.  As we have indicated, there is not enough energy in the –5/3 range to explain such phenomena as MCS’s.

 

 

This is not true.  As can be seen from Fig. 3, consistent with theory, the NMM as operationally set up can reproduce the –5/3 spectrum provided a sufficiently strong physical or spurious energy source is present, and given enough time.  Furthermore, the example shown in Fig. 1 demonstrates that the operational setup of the NMM can produce more meso-scale structure than the MC, and, moreover, that this additional structure verifies as can be seen from NMM’s significantly better threat and bias scores in the Western Domain.

 

In contrast, the evidence presented indicates that the MC has a serious problem in reproducing the observed atmospheric spectrum.  A typical MC spectrum approximately follows a single straight line throughout the spectral domain, except at the short-wave end of the spectrum where it sharply drops off.  The slope of this straight line lies in between –3 and –5/3 and decreases with increasing resolution.

 

 

This is perhaps the most fantastic conclusion that has been reached by Skamarock and Baldwin in view of the numbers themselves and in attempting to consider perceived smoothness.  With the same spatial resolution, the NMM is about three times faster than the MC.  Considering that the NMM code can be further optimized, this factor may be increased to about four.  On the other hand, as can be seen from the sharp drop-off of the MC spectra, its nominal resolution is reduced by a factor of about two by its filters.  Smoothness is completely irrelevant to efficiency since the amount of structure in the NMM precipitation fields can be increased easily through the existing physics packages.

 

In other words, in the quasi-operational runs at NCEP with the same resolution and using NCEP’s standard skill scores, the MC produced forecasts that are inferior in important aspects but at three times the computational cost of the NMM forecasts.

 

 

 

Nobody at NCEP is advocating “running high resolution forecasts using numerical filters that systematically remove the resolution gained by the refined grid.” Our experience at NCEP has been that the forecasts have been systematically improving with increasing resolution, which would not have happened if we have been “systematically removing the resolution gained by the refined grid.” 

 

On the other hand, as a matter of principle, we do not see justification for producing and retaining in the forecasts computational noise that is difficult to distinguish from legitimate mesoscale processes.  This statement should not be interpreted as the advocating of numerical filtering.  Namely, our modeling principle has been to prevent generation of computational noise rather than to continually remove the noise by numerical filters after it is generated.

 

We have been trying to represent the dissipation in the atmosphere in a physically justified way.  The physical models of dissipation we have chosen, and particularly the tuning parameters that control the intensity of dissipation, are open for discussion and will be addressed in due course in future parallel tests since the NMM has been frozen for a long time.

 

 

Skamarock and Baldwin advocate changing the verification rules in such a way that the verification versus observed data be reduced in importance in favor of comparison with statistical properties such as the Nastrom and Gage (1985) spectrum despite the fact that comparison with observations remains one of the most fundamental means of verifying the forecasts.  Their approach is inapplicable in day to day verification, since it is inappropriate to use statistical properties of the atmosphere to verify a particular realization, as well as potentially misleading, since the –5/3 range of the spectrum can be spun-up by computational errors.  Thus, a model may be rewarded for computational noise it makes and penalized for forecasts that agree with observations.

 

Comparison of conventional scores for the forecasts run at NCEP shows that the MC scores are consistently lower than the scores of the Eta and the NMM.

 

The experience at NCEP indicates that the conventional skill scores are still being improved with increasing resolution at horizontal resolutions of 10 km or so.

 

We look forward to the use of new methods of measuring skill in high resolution forecasts.  Until such methods are available and widely accepted, the current ones continue to provide a valuable tool in gauging forecast skill.

 

 

 

The MC appeared to be largely unsuccessful in reproducing the –5/3 spectral slope in the Takemi and Rotunno (2003) tests on cloud scales.  In contrast, the NMM had no difficulties with the –5/3 spectral range in a similar test.

 

 

Generally, this is not true.  The NMM dynamics has been tested on the cloud resolving scales in many runs performed primarily with the NMM-B.  The model passed successfully all the tests to which it has been subjected.

 

            Still, this claim by Skamarock and Baldwin is perhaps partly understandable considering that we have not published much about our results.  Due to resource constraints, the efforts at NCEP have been focused on the resolutions that will be most relevant for NCEP’s NWP applications in the near future.  Nevertheless, we are confident that the NMM will perform well at the cloud-resolving scale should researchers want to test this model application.

 

 

This paragraph does not make sense.  No one would run a model with the same dissipation parameters at 10 km resolution and at 1 km resolution.  At 1 km resolution even the parameterization of dissipation should be different (3-dimensional).

 

 

Although our very limited resources have not allowed us to publish much, particularly about the results on the cloud resolving scales, some relevant results have been presented to the scientific community.  For example, the results of explicit simulations of convection using different microphysics schemes were presented in the discussion at the Convection Workshop held at NASA in December 2001.  These runs were made with the horizontal resolution of 1 km, and 32 layers in the vertical.  The –5/3 spectrum spun up in a test analogous to that of Takemi and Rotunno (2003) (that the MC had difficulties with) was shown at the SRNWP meeting on nonhydrostatic modeling in Bad Orb this year.  Finally, the NMM is running daily with 2 km and 4 km resolutions in Switzerland, so that presumably there is more experience with the NMM forecasts run at 2 km and 4 km resolutions than with those produced by the MC.

 

 

Why before?

 

 

The same applies to the MC considering its notable tendency to overpredict hurricanes.

 

 

As demonstrated many times in this text, the claim that the NMM WRF is heavily damped, and in particular, that it is more damped than the MC WRF, is arbitrary and unfounded.  Even if there is too much damping in the current operational set-up of the NMM WRF, this damping is not hardwired as in the MC WRF finite-difference algorithm and can be easily reduced if needed.  In contrast, the dissipation in the MC due to its high order filters can be reduced only by increasing resolution.  The simple enhancement of detail in such fields as precipitation in the NMM can be done easily through the current physics packages.

 

REFERENCES

 

Gage, K.S., 1979:  Evidence far a k−5/3 Law Inertial Range in Mesoscale Two-Dimensional Turbulence. J. Atmos. Sci.  36, 1950–1954.

Gage, K.S., Nastrom, G.D., 1986: Theoretical Interpretation of Atmospheric Wavenumber Spectra of Wind and Temperature Observed by Commercial Aircraft During GASP. J. Atmos. Sci, 43, 729–740.

Hamilton, Kevin, Wilson, R. John, Hemler, Richard S., 1999: Middle Atmosphere Simulated with High Vertical and Horizontal Resolution Versions of a GCM: Improvements in the Cold Pole Bias and Generation of a QBO-like Oscillation in the Tropics.  J. Atmos. Sci, 56, 3829–3846.

Janjic, Z. I., 2003:  A Nonhydrostatic Model Based on a New Approach.  Meteorol. Atmos. Phys., 82, 271-285.

Janjic, Z. I., J. P. Gerrity, Jr. and S. Nickovic, 2001: An Alternative Approach to Nonhydrostatic Modeling.  Mon. Wea. Rev., 129, 1164-1178.

Koshyk, John N., Hamilton, Kevin, 2001: The Horizontal Kinetic Energy Spectrum and Spectral Budget Simulated by a High-Resolution Troposphere–Stratosphere–Mesosphere GCM. J. Atmos. Sci, 58, 329–348.

Lilly, D.K., 1983: Stratified Turbulence and the Mesoscale Variability of the Atmosphere. J. Atmos. Sci, 40, 749–761.

Nastrom, G.D., Gage, K.S., 1985:  A Climatology of Atmospheric Wavenumber Spectra of Wind and Temperature Observed by Commercial Aircraft. J. Atmos. Sci, 42, 950–960.

Takemi, T., Rotunno, R., 2003:  The Effects of Subgrid Model Mixing and Numerical Filtering in Simulations of Mesoscale Cloud Systems.  Mon. Wea. Rev, 131, 2085-2101.

Tung, Ka Kit, Orlando, Wendell Welch, 2003: The k−3 and k−5/3 Energy Spectrum of Atmospheric Turbulence: Quasigeostrophic Two-Level Model Simulation.  J. Atmos. Sci, 60, 824–835.

 

 

 

 


 

Fig. 1.  Examples of the NMM (upper lef panel) and the MC (upper right panel) forecasts and threat (upper part of lower panel) and bias (lower part of lower panel) scores for November 2003 for the Eta (red solid line), the GFS (dashed dark blue line), operational NMM (dashed green line), experimental WRF NMM (dashed light blue line) and MC (dashed purple line).


 

Fig. 2


 

 

Fig. 3.  The WRF-NMM spectrum (blue diamonds) at 300 hPa averaged over forecast times from 24 to 36 hours with 3 hour intervals produced with operational set-up of the NMM in Central Domain, but in the sigma coordinate.  The –3 (purple squares) and –5/3 (yellow triangles) slopes are shown for comparison.

 

Fig. 4.  The spectrum (blue diamonds) of the square of unfiltered topography height in the NMM-B Central Domain with 15 km resolution.  The –3 (purple squares) and –5/3 (yellow triangles) slopes are shown for comparison.

 


 

Fig. 5.  Time evolution of the NMM-B spectra (blue lines) at 300 hPa over Atlantic Ocean.  .  The –3 (purple lines) and –5/3 (yellow lines) slopes are shown for comparison Run starting from 12 UTC, 09/07/2003, GFS data, 15 km, 32 levels resolution.  No lateral diffusion, weak mass divergence damping.  Plots every 6 hours, top to bottom, 6-24 left column, 30-48 right column.

 

 

Fig. 6.  Time average over 36-48 hours of the NMM-B spectra (blue diamonds) at 300 hPa over Atlantic Ocean.  The –3 (purple squares) and –5/3 (yellow triangles) slopes are shown for comparison.  Run starting from 12 UTC, 09/07/2003, GFS data, 15 km, 32 levels resolution.  No lateral diffusion, weak mass divergence damping.


Fig. 7.  Time evolution of the WRF-NMM spectra (blue lines) at 300 hPa in the Eastern Domain.  Run starting from 18 UTC, 09/17/2003 (Isabel), Eta data, 8 km, 60 levels resolution.  The –3 (purple lines) and –5/3 (yellow lines) slopes are shown for comparison.  No lateral diffusion, weak mass divergence damping.  Plots every 6 hours, top to bottom, 6-24 left column, 30-48 right column.

 

Fig. 8.  Time average over 36-48 hours of the WRF-NMM spectra (blue diamonds) at 300 hPa in the Eastern Domain.  The –3 (purple squares) and –5/3 (yellow triangles) slopes are shown for comparison.  Run starting from 18 UTC, 09/17/2003 (Isabel), Eta data, 8 km, 60 levels resolution.  No lateral diffusion, weak mass divergence damping.

 

 

 

Fig. 9.  Skamarock and Baldwin Fig. 4, except that light blue (-5/3 slope), dark blue (-3 slope) and purple (fit to the Skamarock and Baldwin spectrum) lines were added in order to facilitate comparison of the Skamarock and Baldwin spectrum (red line) with observed spectral slopes.  Instead of following the –3 slope, and then gradually transitioning to the –5/3 slope, the Skamarock and Baldwin spectrum generally follows a straight line (purple) that is both significantly shallower than –3 and steeper than –5/3.

 

 

Fig.  10. Skamarock and Baldwin Fig. 5 except that brown (fit to their 22 km spectrum), dark green (fit to their 10 km spectrum) and dark blue (fit to their 4 km spectrum) straight lines were added for convenience.  The spectra qualitatively follow straight lines throughout the entire spectral range with slopes in between –5/3 and –3, and generally become shallower with resolution. 

 

 

Fig. 11.  Amplification factors of the Mass Core RK vertical advection scheme (upper left panel) and the Mass Core horizontal advection scheme (lower left panel).  The Courant number for flow along the coordinate axes is 0.5, and it is 0.707 along the diagonal.  For convenience, the missing contour labels are added in red and marked by z.j. with the aid of another plot from the same source (a presentation by Wicker).