Atmospheric modeling, data assimilation and predictability

by Eugenia Kalnay in 2003

Because of their higher resolution, regional models have the advantage of higher accuracy and the ability to reproduce smaller-scale phenomena such as fronts, squall lines, and much better orographic forcing than global models. On the other hand, regional models have the disadvantage that,they are not “self-contained” because they require lateral boundary conditions at the borders of the horizontal domain. These boundary conditions must be as accurate as possible, because otherwise the interior solution of the regional models quickly deteriorates. Therefore it is customary to nest the regional models within another model with coarser resolution, whose forecast provides the boundary conditions. For this reason, regional models are used only for short-range forecasts.

a latitude-longitude model with a typical resolution of 1 degree and 20 vertical levels would have 360x180x20 = 1.3 M grid points. Each grid will have to carry the values of at least 4 prognostic variables (wu,wv, T, RH), and surface pressure for each column.

It is necessary to use additional information (background or first guess) to prepare initial conditions. Th model forecast is interpolated to the observation location, and if they are different, converted from model variables to observed variables y^o.

The analysis x^a is obtained by adding the correction:

$x^a=x^b+W[y^a-H(x^b)]$

Threat score TS = (P & O) /(P | O). It is also known as critical success index (CSI), as a particularly useful score for quantities that are relatively rare.

The forecasters also have access to several forecasts, and they use their judgment in assessing which one is more accurate in each case. This constitutes a major source of the “value-added” by the human forecasters.

The human forecasts are on the average significantly more skillful than the numerical guidance, but it is the improvement in NWP forecasts that drives the improvements in the subjective forecasts.

Since 1994, NCEP has been running 17 global forecasts per day, each out to 16 days, with initial perturbations obtained using the method of breeding growing dynamical perturbations in the atmosphere, which are also present in the analysis errors. The ECMWF ensemble contains 50 members.

Ensemble forecasting has 2 goals:

components of the forecast that are most uncertain tend to be averaged out
provide forecasters with an estimation of reliabilitylity of the forecast.

Slowing varying surface forcing, especially from the tropical ocean and from land-surface anomalies, can produce atmospheric anomalies that are longer lasting and more predictable than individual weather patterns. A most notable example is the ENSO produced by unstable oscillations of the coupled ocean-atmosphere system, with a frequency of 3-7 years. Because of their long time scale, the ENSO oscillations should be predictable a year or more in advance.

Governing equations

V. Bjerknes was a professor of applied mechanics and mathematical physics at the University of Stockholm. He elucidated the fundamental interaction between fluid dynamics and thermodynamics. In 1904, he pointed out the primitive equations which are used in climate models. Basic,ally it is 7 equations with 7 unknown variables:

velocity vector (u, v, w)
Temperature T
pressure P
Density rho: $p= \rho RT$
water vapor mixing ratio q: $\frac{dq}{dt}=E-C$

It can be grouped into 3 sets of equations:

conservation of mass (continuity euqation): $\frac{\rho}{dt}=\nabla (\rho v)$
conservation of momentum (Newton 2nd law): $\frac{dv}{dt}=F/m$ , must consider rotating frame of reference, pressure gradient force, gravitational acceleration, frictional force, Coriolis force and centrifugal force
conservation of energy (thermodynamic energy equation)

Interestingly, his son is also a meteorologist, who help to pick the best date to throw the atomic bomb in Japan in 1945.

spherical coordinates

3 velocity components:

Zonal: along a latitudinal circle, west-east direction, u
Meridional: along logitudinal lines, v
Vertical: positive up, w

Basic wave oscillations:

sound
gravity
slower weather wave

they have profound implications for the present use of hydrostatic and nonhydrostatic models. Different approximations (hydrostatic, quasi-geostrophic, and the anelastic approximations) are designed to filter out some of them.

Assume the solutions have plan wave form, the specific type of wave can be determined by deriving the FDR (frequency dispersion relationship), frequency, phase speed, group velocity.

pure sound waves, speed = c_s = 320 m/s, propagating in any direction.
Lamb waves (horizontally propagating sound waves)
vertical gravitational oscillations
inertia oscillations (due to basic rotation)
Lamb waves in the presence of rotation and geostrophic modes. There will be 2 solutions: inertia Lamb waves and rossy waves (Coriolis force changes with latitude)

General wave solution of the perturbation equations in a resting, isothermal atmosphere.

Filtering approximations

Neglect the time derivative of one of the euqations of motions, we convert it from a prognostic equation into a diagnostic equation
Physically, we eliminate a restoring force that supports a certain type of wave
Most global models and some regional models use the hydrostatic approximation, whic filters sound waves.

3 numerical discretization of the equations of motion

classification of partial differential equations (PDEs):

wave equation(hyperbolic)
diffusion equation (parabolic)
Laplace’s or Poisson’s equations (elliptic)

well-posedness, initial and boundary conditions

a well-posed initial/boundary condition problem has a unique solution that depends continuously on the initial/bounary conditions
If too many initial/boundary conditions are specified, there will be no solution.
If too few are specified, the solution will not be unique.
If the number of initial/boundary condictions is right, but they are specified at the wrong place or time, the solution will be unique, but it will not depend smoothly on initial/boundary conditions. i.e., small errors in the initial/boundary conditions will produce huge errors in the solution.
We can never find a numerical solution of a problem that is ill posed: the computer will show its disgust by blowing up.

One method of solving simple SDEs is the method of separation of variables, but unfortunately in most cases it is not possible to use it, hence the need for numerical models.

3.3.2 Galerkin and spectral space representation

p94

Spatial finite differences introduces errors in the space derivatives, resulting in a computational phase speed slower than the true phase speed, especially for short waves.

Galerkin approach uses a sum of basis functions. The basis functions are usually the eigensolutionsof the Laplace equation. For spherical coordiantes, the spherical harmonics are used.

The spatial resolution is uniform throughout the sphere. This is a major advantage over finite differences based on a latitude-longitude grid, where the convergence of the meridians at the poles requires very small time steps.

4. Introduction to the parameterization of subgrid scale physical processes

Despite the continued increase of resolution, many important processes and scales of motion in the atmosphere can not be explicitly resolved with present or future models. They include turbulent motions (0.01 m to a model grid), molecular scale (condensation, evaporation, friction and radiation)

These processes are called “sub grid-scale processes”.

to reproduce the interaction of the grid and sub grid-scale processes, the sub grid-scale phenomena are parameterized, i.e., their effect is formulated in terms of the resolved fields.

5. data assimilation

Currently, operational NWP centers produce initial conditions through a statistical combination of observations and short-range forecasts.

Spatial interpolation of obervations is not enough:

not enough data are available to initialize current models. Number of degrees of freedom in a modern NWP model is of the order of 10^7, but the total number of conventional observations of the variables used in the models is of the order of 10^4.
remote sensing data such as satellite and radar observation do not measure directly measure the model variables (wind, temperature, moisture, and surface pressure)
data distribution in space and time is very nonuniform. North America and Eurasia are relatively data-rich, others are much more poorly observed.

Solution:

have a complete first guess estimate of the state of the atmosphere at all the grid points in order to generate the initial conditions. The first guess should be our best estimate of the state of the atmosphere prior to the use of the observations.
climatology, or a combination of climatology and a short forecast were used as a first guess.
As forecasts became better, the use of short-range forecast as a first guess was universally adopted in operational systems in what is called an “analysis cycle”.

3 statistical interpolation methods(3D-Var, OI(Optimal interpolation), and PSAS), have been shown to formally solve the same problem. In practice, OI requires the introduction of a number of approximations, and local solution of the analysis, grid point by grid point, or small volume by small volume.

optimal analysis: minimize the analysis error variance, finding the optimal weights through a least squared approach
variational approach, find the analysis that minimizes a cost function measuring its distance to the background and to the observations.

Ensemble Kalman filtering: All the cycles assimilate the same real obervations, but in order to maintain them realistically independent, different sets of random perturbations are added to the observations assimilated in each member of the ensemble data assimilations.

4D var. The cost function includes a term measuring the distance tothe background at the beginning of the interval, and a summation over time of the cost function for each observational increment computed with respect to the model integrated to the observation time. … 4D var seeks an initial condition such that the forecast best fits the observations within the assimilation interval. However, the fact that the 4D var method asumes a perfect model is a disadvantage since, for example, it will give the same credits to older observations at the beginning of the interval as to newer observations at the end of the interval.

quality control is based on a comparison between observations and some kind of expected value (from climatology, an average of nearby observations, or the first guess).

Collins(1998): Most common human errors have a simple structure: a single digit or a sign is wrong or missiong.

6 Atmospheric predictability and enseble forecasting

Lorenz (1993):

The initial round-off errors were the culprits; they were steadily amplifying until they dominated the solution. In today’s terminology, there was chaos. .. It soon struct me that, if the real atmosphere behaved like the simple model, long-range forecasting would be impossible.

The early hsitory of NWP

The 1st real-time, operational NWP was run in Sweden in September 1954 (to 72h at 500 hPa), half a year before the USA.

Two reasons:

In 1954, the Swedes has the world’s most poweful computer, BESK.
Rossby moved to Sweden.

Interestingly, Rossby was seen as a troublemaker and was not elected as the director of Swedish Meteorolgoical office. What an internal political conflict! Anyway, Rossby seek support from Military Meteorolgocial Service

Yuchao's blogspot

Wednesday, June 27, 2018

book, Atmospheric modeling, data assimilation and predictability