Search for notes by fellow students, in your own course and all over the country.
Browse our notes for titles which look like what you need, you can preview any of the notes via a sample of the contents. After you're happy these are the notes you're after simply pop them into your shopping cart.
Title: Autocorrelation correction
Description: How to correct autocorrelation, tests for autocorrelation and definition of autocorrelation.
Description: How to correct autocorrelation, tests for autocorrelation and definition of autocorrelation.
Document Preview
Extracts from the notes are below, to see the PDF you'll receive please use the links above
3 Autocorrelation
Autocorrelation refers to the correlation of a time series with its own past and future values
...
Positive
autocorrelation might be considered a specific form of “persistence”, a tendency for a system to
remain in the same state from one observation to the next
...
Geophysical time series are
frequently autocorrelated because of inertia or carryover processes in the physical system
...
Or the slow drainage of groundwater reserves might impart
correlation to successive annual flows of a river
...
Autocorrelation complicates the
application of statistical tests by reducing the number of independent observations
...
g
...
Autocorrelation can be exploited
for predictions: an autocorrelated time series is predictable, probabilistically, because future
values depend on current and past values
...
3
...
1)
...
Positive autocorrelation might show up in a time series plot as
unusually long runs, or stretches, of several consecutive observations above or below the mean
...
Because the
“departures” for computing autocorrelation are relative the mean, a horizontal line plotted at the
sample mean is useful in evaluating autocorrelation with the time series plot
...
Statistical tests based on the observed number of runs above and
below the mean are available (e
...
, Draper and Smith 1981), though none are covered in this
course
...
If nothing else, this inspection might show that the persistence is much more
prevalent in some parts of the series than in others
...
2 Lagged scatterplot
The simplest graphical summary of autocorrelation in a time series is the lagged scatterplot,
which is a scatterplot of the time series against itself offset in time by one to several time steps
(Figure 3
...
Let the time series of length N be x i , i 1,
...
The lagged scatterplot for lag k
is a scatterplot of the last N k observations against the first N k observations
...
A random scattering of points in the lagged scatterplot indicates a lack of autocorrelation
...
Alignment from lower left to upper right in the lagged scatterplot
indicates positive autocorrelation
...
Figure 3
...
Time series plot illustrating signatures of persistence
...
Figure 3
...
Lagged scatterplots of tree-ring series MEAF
...
1 with itself offset by 1, 2, 3 and 4 years
...
05)
...
At an offset of 3
years, the juxtaposition of high-growth 1999 with low-growth 2002 exerts high influence
(point in red rectangle)
...
An assumption of linear dependence is not necessary
...
Such nonlinear dependence might not be effectively summarized by other
methods (e
...
, the autocorrelation function [acf], which is described later)
...
The scatter plot in Figure 3
...
Influence of outliers would not be detectable from the acf alone
...
A straight line can be fit to the points in a lagged scatterplot to facilitate
evaluation linearity and strength of relationship of current with past values
...
g
...
Correlation coefficient and 95% significance level
...
It
is helpful to compare the computed correlation coefficient with critical level of correlation
required to reject the null hypothesis that the sample comes from a population with zero
correlation at the indicated lag
...
It follows that the approximate threshold, or critical, level of
correlation for 95% significance ( 0
...
9 5 0 2 / N , where N is the sample size
...
3)
...
3 Autocorrelation function
(correlogram)
Figure 3
...
Critical level of correlation
coefficient (95 percent significance) as a function
of sample size
...
20 for a sample size of 100 to r=0
...
x
r
xi x
Notes_3, GEOS 585A, Spring 2013
i
2
x
1/ 2
An important guide to the persistence in
a time series is given by the series of
quantities called the sample autocorrelation
coefficients, which measure the correlation
between observations at different times
...
An analogy can be drawn between the
autocorrelation coefficient and the productmoment correlation coefficient
...
The correlation coefficient between x and
y is given by
yi
y
yi y
2
1/ 2
(1)
3
where the summations are over the N observations
...
Instead of two different time series, the correlation is computed between one time series and the
same series lagged by one or more time units
...
The first-order autocorrelation coefficient is the simple correlation coefficient of the
first N 1 observations, x t , t 1, 2 ,
...
, N
...
As the correlation coefficient given by (2) measures correlation between
successive observations, it is called the autocorrelation coefficient or serial correlation
coefficient
...
First, the difference between the sub-period means x (1 ) and x ( 2 ) can be ignored
...
Accordingly, r1 can be approximated by
N 1
x
xt 1
x
r1
x
t
x
t 1
(3)
N
t
x
2
t 1
N
where
x
xt
is the overall mean
...
The plot of the
autocorrelation function as a function of lag is also called the correlogram
...
The correlation coefficients for the lagged
scatterplots at lags k 1, 2 ,
...
Notes_3, GEOS 585A, Spring 2013
4
Link between acf and autocovariance function (acvf)
...
By analogy the autocovariance of a time series is defined as
the average product of departures at times t and t+k
ck
1
N
N k
x
t
x
xt k
x
(5)
t 1
where c k is the autocovariance coefficient at lag k
...
By combining equations (4) and (5), the autocorrelation at lag k can be written in terms
of the autocovariance:
(6)
rk c k c 0
Alternative equation for autocovariance function
...
The acvf is sometimes
computed with the alternative equation
1
N k
(7)
xt x xt k x
N k t 1
The acvf by (7) has a lower bias than the acvf by (5), but is conjectured to have a higher mean
square error (Jenkins and Watts 1968, chapter 5)
...
4 Testing for randomness with the correlogram
The first question that can be answered with the correlogram is whether the series is
random or not
...
It can be shown that if x1
...
The 95%
confidence limits for the correlogram can therefore be plotted at
further approximated to 0 2
N
1 / N 2
N
, and are often
...
2 0
...
Factors that must be considered in judging whether a sample autocorrelation outside the
confidence limits indicates an autocorrelated process or population are (1) how many lags are
being examined, (2) the magnitude of rk , and (3) at what lag k the large coefficient occurs
...
And a large rk at a low lag (e
...
, k 1 ) is more likely to represent persistence in most
physical systems than an isolated large rk at some higher lag
...
5 Large-lag standard error
While the confidence bands described above are horizontal lines above and below zero on the
correlogram, the confidence bands you see in the assignment script may appear to be narrowest at
Notes_3, GEOS 585A, Spring 2013
5
lag 1 and to widen slightly at higher lags
...
8)
...
This interdependence makes it difficult to assess just at how
many lags the correlogram is significant
...
The variance of rk , with the adjustment, is given by
1
2
1 2 ri
N
i 1
K
V a r( rk )
(10)
where K k
...
8)
...
For example, the variance of the lag-3 autocorrelation coefficient,
V a r r3 , is greater than 1 / N by an amount that depends on the autocorrelation coefficients at
lags 1 and 2
...
Assessment of the significance of lag-k
autocorrelation by the large-lag standard error essentially assumes that the theoretical
autocorrelation has “died out” by lag k, but does not assume that the lower-lag theoretical
autocorrelations are zero (Box and Jenkins 1976, p
...
Thus the null hypothesis is NOT that
the series is random, as lower-lag autocorrelations in the generating process may be non-zero
...
4)
...
Thus, the autocorrelation at lag 5, say, is
judged significant under the null hypothesis that the series is random, but is not judged significant
if the theoretical autocorrelation function is considered to not have died out until lag 5
...
4
...
Dotted line is
simple approximate confidence interval at ± 2 / N , where N is
the sample size
...
Notes_3, GEOS 585A, Spring 2013
6
3
...
The first-order
autocorrelation coefficient, r1 , can be tested against the null hypothesis that the corresponding
population value 1 0
...
g
...
For the one-tailed hypothesis, the
alternative hypothesis is usually that the true first-order autocorrelation is greater than zero:
(11)
H1 : 0
For the two-tailed test, the alternative hypothesis is that the true first-order autocorrelation is
different from zero, with no specification of whether it is positive or negative:
(12)
H1 : 0
Which alternative hypothesis to use depends on the problem
...
g
...
Otherwise, the two-sided test is best
...
9 5
1 1
...
More generally, following Salas et al
...
6 4 5
N k 1
N k
o n e s id e d
(14)
rk ( 9 5 % )
1 1
...
Equation (13) comes from substitution of k=1 into
equation (14)
...
7 Effective Sample Size
If a time series of length N is autocorrelated, the number of independent observations is fewer
than N
...
The reduction in number of
independent observations has implications for hypothesis testing
...
The way of circumventing the
problem of autocorrelation is to adjust the sample size for autocorrelation
...
Below is an equation for computing so-called “effective” sample size, or sample size adjusted for
autocorrelation
...
The equation was derived based on the assumption that the autocorrelation in the
series represents first-order autocorrelation (dependence on lag-1 only)
...
Computation of the effective sample
Notes_3, GEOS 585A, Spring 2013
7
size requires only the sample size and first-order sample autocorrelation coefficient
...
The ratio 1
r1 is
the first-order
1 r1 is a scaling factor multiplied by the original
sample size to compute the effective sample size
...
50 has an adjusted sample size of
N ' 100
(1 0
...
5
r1
3 3 ye a rs
1
...
5
The adjustment to effective sample size becomes less important the lower the autocorrelation,
but a first-order autocorrelation coefficient as small as r1=0
...
5)
...
L
...
Statistics, v
...
1, p
...
Anderson, O
...
182 pp
...
5
...
For a given first-order
autocorrelation, the scaling factor is multiplied by the
original time series
...
E
...
, and Jenkins, G
...
,
1976,
Time
series
analysis:
forecasting and control: San
Francisco, Holden Day, p
...
Chatfield, C
...
Dawdy, D
...
, and Matalas, N
...
, 1964, Statistical and probability analysis of hydrologic data,
part III: Analysis of variance, covariance and time series, in Ven Te Chow, ed
...
8
...
90
...
M
...
G
...
Salas, J
...
, Delleur, J
...
, Yevjevich, V
...
, and Lane, W
...
, 1980, Applied modeling of
hydrologic time series: Littleton, Colorado, Water Resources Publications, 484 pp
...
79: Climatic Change, WMO-No,
195
...
100, Geneva, 80 pp
Title: Autocorrelation correction
Description: How to correct autocorrelation, tests for autocorrelation and definition of autocorrelation.
Description: How to correct autocorrelation, tests for autocorrelation and definition of autocorrelation.