{{more footnotes|date=December 2012}}
In [[statistics]], the '''Durbin–Watson statistic''' is a [[test statistic]] used to detect the presence of [[autocorrelation]] (a relationship between values separated from each other by a given time lag) in the [[errors and residuals in statistics|residuals]] (prediction errors) from a [[regression analysis]]. It is named after [[James Durbin]] and [[Geoffrey Watson]]. The [[Sample size|small-sample]] distribution of this ratio was derived by [[John von Neumann]] (von Neumann, 1941). Durbin and Watson (1950, 1951) applied this statistic to the residuals from [[Ordinary least squares|least squares]] regressions and developed bounds tests for the [[null hypothesis]] that the errors are serially uncorrelated against the alternative that they follow a first-order [[autoregressive]] process. Later, [[John Denis Sargan]] and [[Alok Bhargava]] developed several von Neumann–Durbin–Watson type test statistics for the null hypothesis that the errors of a regression model follow a process with a [[unit root]] against the alternative hypothesis that the errors follow a stationary first-order [[autoregression]] (Sargan and Bhargava, 1983). One drawback of the Durbin–Watson statistic is that it is not pivotal: its distribution under the null hypothesis depends on unknown parameters, although it is pivotal asymptotically.<ref name=pivotal>{{cite book |last=Chatterjee |first=Samprit |last2=Simonoff |first2=Jeffrey |title=Handbook of Regression Analysis |year=2013 |publisher=John Wiley & Sons |isbn=1118532813}}</ref>
 
==Computing and interpreting the Durbin–Watson statistic==
 
If ''e<sub>t</sub>'' is the [[errors and residuals in statistics|residual]] associated with the observation at time ''t'', then the [[test statistic]] is
 
: <math>d = {\sum_{t=2}^T (e_t - e_{t-1})^2 \over {\sum_{t=1}^T e_t^2}},</math>
 
where ''T'' is the number of observations. Since ''d'' is approximately equal to 2(1&nbsp;−&nbsp;''r''), where ''r'' is the sample autocorrelation of the residuals,<ref name="Gujarati 2003">Gujarati (2003), p. 469.</ref> ''d''&nbsp;=&nbsp;2 indicates no autocorrelation. The value of ''d'' always lies between 0 and 4. If the Durbin–Watson statistic is substantially less than 2, there is evidence of positive serial correlation: small values of ''d'' indicate that successive error terms are, on average, close in value to one another. As a rough rule of thumb, a value below 1.0 may be cause for alarm. If ''d''&nbsp;>&nbsp;2, successive error terms are, on average, much different in value from one another, i.e., negatively correlated; in regressions, this can imply an underestimation of the level of [[statistical significance]].
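
A minimal sketch of the computation in Python (the helper below is illustrative; ready-made implementations are listed later in this article):

<syntaxhighlight lang="python">
import numpy as np

def durbin_watson(e):
    """Durbin-Watson statistic for a vector of regression residuals e."""
    e = np.asarray(e)
    return np.sum(np.diff(e) ** 2) / np.sum(e ** 2)

# Residuals with positive serial correlation give d well below 2.
rng = np.random.default_rng(0)
e = np.zeros(200)
for t in range(1, 200):
    e[t] = 0.7 * e[t - 1] + rng.normal()

d = durbin_watson(e)
r = np.sum(e[1:] * e[:-1]) / np.sum(e ** 2)  # sample autocorrelation of the residuals
print(d, 2 * (1 - r))  # approximately equal, both well below 2
</syntaxhighlight>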
 
To test for '''positive autocorrelation''' at significance ''α'', the test statistic ''d'' is compared to lower and upper critical values (''d<sub>L,α</sub>'' and ''d<sub>U,α</sub>''):
:*If ''d'' < ''d<sub>L,α</sub>'', there is statistical evidence that the error terms are positively autocorrelated.
:*If ''d'' > ''d<sub>U,α</sub>'', there is '''no''' statistical evidence that the error terms are positively autocorrelated.
:*If ''d<sub>L,α</sub>'' < ''d'' < ''d<sub>U,α</sub>'', the test is inconclusive.
 
Positive serial correlation is serial correlation in which a positive error for one observation increases the chances of a positive error for another observation.
 
To test for '''negative autocorrelation''' at significance ''α'', the test statistic (4&nbsp;−&nbsp;''d'') is compared to lower and upper critical values (''d<sub>L,α</sub>'' and ''d<sub>U,α</sub>''):
:*If (4 − ''d'') < ''d<sub>L,α</sub>'', there is statistical evidence that the error terms are negatively autocorrelated.
:*If (4&nbsp;−&nbsp;''d'') > ''d<sub>U,α</sub>'', there is '''no''' statistical evidence that the error terms are negatively autocorrelated.
:*If ''d<sub>L,α</sub>'' < (4 − ''d'') < ''d<sub>U,α</sub>'', the test is inconclusive.
 
Negative serial correlation implies that a positive error for one observation increases the chance of a negative error for another observation, and a negative error for one observation increases the chance of a positive error for another.
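
Together, the two bounds tests amount to a simple decision rule. A minimal sketch in Python (illustrative; the critical values ''d<sub>L,α</sub>'' and ''d<sub>U,α</sub>'' must be supplied from published tables, as discussed below):

<syntaxhighlight lang="python">
def dw_bounds_test(d, d_L, d_U):
    """Apply both Durbin-Watson bounds tests, given the tabulated critical
    values d_L and d_U for the chosen significance level, sample size and
    number of predictors. Returns the two conclusions as strings."""
    if d < d_L:
        positive = "evidence of positive autocorrelation"
    elif d > d_U:
        positive = "no evidence of positive autocorrelation"
    else:
        positive = "inconclusive"

    if 4 - d < d_L:
        negative = "evidence of negative autocorrelation"
    elif 4 - d > d_U:
        negative = "no evidence of negative autocorrelation"
    else:
        negative = "inconclusive"
    return positive, negative
</syntaxhighlight>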
 
The critical values, ''d<sub>L,α</sub>'' and ''d<sub>U,α</sub>'', vary by level of significance (''α''), the number of observations, and the number of predictors in the regression equation. Their derivation is complex—statisticians typically obtain them from the appendices of statistical texts.
 
If the [[design matrix]] <math>\mathbf{X}</math> of the regression is known, exact critical values for the distribution of <math>d</math> under the null hypothesis of no serial correlation can be calculated. Under the null hypothesis, <math>d</math> is distributed as
 
: <math>
\frac
{\sum_{i=1}^{n-k} \nu_i \xi_i^2}
{\sum_{i=1}^{n-k} \xi_i^2},
</math>
 
where ''n'' is the number of observations and ''k'' is the number of regression variables; the <math> \xi_i </math> are independent standard normal random variables; and the <math> \nu_i </math> are the nonzero eigenvalues of <math>( \mathbf{I} - \mathbf{X} ( \mathbf{X}^T \mathbf{X} ) ^{-1} \mathbf{X}^T ) \mathbf{A},</math> where <math>\mathbf{A}</math> is the matrix that transforms the residuals into the <math>d</math> statistic, i.e. <math>d = \mathbf{e}^T \mathbf{A} \mathbf{e}</math>.<ref name="Durbin 1971">{{cite journal |last1=Durbin |first1=J. |last2=Watson |first2=G. S. |year=1971 |title=Testing for Serial Correlation in Least Squares Regression, III |journal=Biometrika |volume=58 |issue=1 |pages=1–19}}</ref> A number of computational algorithms for finding percentiles of this distribution are available.<ref name="Farebrother 1980">{{cite journal |last1=Farebrother |first1=R. W. |year=1980 |title=Algorithm AS 153: Pan's Procedure for the Tail Probabilities of the Durbin–Watson Statistic |journal=Journal of the Royal Statistical Society, Series C |volume=29 |issue=2 |pages=224–227}}</ref>
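
One crude way to approximate these percentiles is Monte Carlo simulation of the distribution above. A sketch in Python, assuming the usual residual-maker matrix and the first-difference form of <math>\mathbf{A}</math> (the function name and defaults are illustrative; published algorithms such as Pan's procedure compute the tail probabilities directly):

<syntaxhighlight lang="python">
import numpy as np

def dw_null_critical_value(X, alpha=0.05, n_sim=100_000, seed=0):
    """Simulate the null distribution of d for a given design matrix X and
    return the lower-tail critical value (test for positive autocorrelation)."""
    n, k = X.shape
    # A satisfies d = e'Ae / e'e, with A = D'D for the (n-1) x n
    # first-difference matrix D.
    D = np.diff(np.eye(n), axis=0)
    A = D.T @ D
    # Residual-maker matrix M = I - X (X'X)^{-1} X'
    M = np.eye(n) - X @ np.linalg.solve(X.T @ X, X.T)
    # Nonzero eigenvalues nu_i of M A; k of the n eigenvalues are (near) zero.
    nu = np.sort(np.linalg.eigvals(M @ A).real)[k:]
    rng = np.random.default_rng(seed)
    xi2 = rng.chisquare(1, size=(n_sim, n - k))  # xi_i^2 are chi-square(1)
    d = (xi2 @ nu) / xi2.sum(axis=1)             # draws from the null distribution
    return np.quantile(d, alpha)
</syntaxhighlight>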
 
Although serial correlation does not affect the consistency of the estimated regression coefficients, it does affect our ability to conduct valid statistical tests. First, the F-statistic to test for overall significance of the regression may be inflated under positive serial correlation because the mean squared error (MSE) will tend to underestimate the population error variance. Second, positive serial correlation typically causes the ordinary least squares (OLS) standard errors for the regression coefficients to underestimate the true standard errors. As a consequence, if positive serial correlation is present in the regression, standard linear regression analysis will typically lead us to compute artificially small standard errors for the regression coefficients. These small standard errors will cause the estimated t-statistics to be inflated, suggesting significance where perhaps there is none. The inflated t-statistics may, in turn, lead us to incorrectly reject null hypotheses about population values of the parameters of the regression model more often than we would if the standard errors were correctly estimated.
 
If the Durbin–Watson statistic indicates the presence of serial correlation in the residuals, this can be remedied by using the [[Cochrane-Orcutt procedure]].
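
A minimal sketch of one iteration of that procedure, assuming first-order autoregressive errors (illustrative only; in practice the step is iterated until the estimate of the autocorrelation parameter converges):

<syntaxhighlight lang="python">
import numpy as np

def cochrane_orcutt_step(y, X, e):
    """One Cochrane-Orcutt iteration: estimate rho from the current OLS
    residuals e, quasi-difference y and X, and re-estimate by OLS."""
    rho = np.sum(e[1:] * e[:-1]) / np.sum(e[:-1] ** 2)
    y_star = y[1:] - rho * y[:-1]   # quasi-differenced response
    X_star = X[1:] - rho * X[:-1]   # quasi-differenced regressors
    beta, *_ = np.linalg.lstsq(X_star, y_star, rcond=None)
    return rho, beta
</syntaxhighlight>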
 
Note that although the Durbin–Watson statistic is displayed by many regression analysis programs, it is not applicable in many situations. For instance, if the error distribution is not normal, if there is higher-order autocorrelation, or if a lagged form of the dependent variable appears among the independent variables, it is not an appropriate test for autocorrelation. A test that does not have these limitations is the [[Breusch–Godfrey test]] (serial correlation LM test).
 
==Durbin h-statistic==<!-- This section is linked from [[Autocorrelation]] -->
 
The Durbin–Watson statistic is [[bias of an estimator|biased]] for [[autoregressive moving average model]]s, so that autocorrelation is underestimated. For large samples, however, one can easily compute the unbiased, [[Normal distribution|normally distributed]] ''h''-statistic:
 
:<math>h = \left( 1 - \frac {1} {2} d \right) \sqrt{\frac {T}  {1-T \cdot \widehat {\operatorname{Var}}(\widehat\beta_1\,)}},</math>
 
using the Durbin–Watson statistic ''d'' and the estimated variance

: <math>\widehat{\operatorname{Var}}(\widehat\beta_1)</math>

of the regression coefficient of the lagged dependent variable, provided

: <math>T \cdot \widehat{\operatorname{Var}}(\widehat\beta_1) < 1.</math>
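
A minimal sketch of this computation (the function name is illustrative):

<syntaxhighlight lang="python">
import numpy as np

def durbin_h(d, T, var_beta1):
    """Durbin h-statistic from the Durbin-Watson d, the sample size T and
    the estimated variance of the lagged-dependent-variable coefficient."""
    if T * var_beta1 >= 1:
        raise ValueError("h is undefined unless T * Var(beta1) < 1")
    return (1 - d / 2) * np.sqrt(T / (1 - T * var_beta1))

# Under the null of no autocorrelation, h is approximately standard normal,
# so |h| > 1.96 rejects at the 5% level.
</syntaxhighlight>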
 
==Durbin–Watson test for panel data==
For [[panel data]] this statistic was generalized as follows by [[Alok Bhargava]] et al. (1982):
 
:If ''e<sub>i, t</sub>'' is the [[errors and residuals in statistics|residual]] from an OLS regression with fixed effects for each panel ''i'', associated with the observation in panel ''i'' at time ''t'', then the test statistic is
 
:: <math>d_{pd}=\frac{\sum_{i=1}^N \sum_{t=2}^T (e_{i,t} - e_{i,t-1})^2}  {\sum_{i=1}^N \sum_{t=1}^T e_{i,t}^2}.</math>
 
This statistic can be compared with tabulated rejection values (see [[Alok Bhargava]] et al. (1982), page 537). These values depend on ''T'' (the length of the balanced panel, i.e. the number of time periods over which the individuals were surveyed), ''K'' (the number of regressors) and ''N'' (the number of individuals in the panel). This test statistic can also be used for testing the null hypothesis of a [[unit root]] against stationary alternatives in fixed effects models, using another set of bounds (Tables V and VI) tabulated by Bhargava et al. (1982).
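
A minimal sketch of the computation for a balanced panel of residuals (illustrative):

<syntaxhighlight lang="python">
import numpy as np

def durbin_watson_panel(e):
    """Bhargava et al. (1982) statistic for an N x T array e of
    fixed-effects residuals from a balanced panel."""
    e = np.asarray(e)
    num = np.sum(np.diff(e, axis=1) ** 2)  # sum_i sum_t (e_{i,t} - e_{i,t-1})^2
    return num / np.sum(e ** 2)
</syntaxhighlight>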
 
==Implementations in statistics packages==
#R: the dwtest function in the lmtest package and the durbinWatsonTest function in the car package.
#MATLAB: the dwtest function in the Statistics Toolbox.
#Mathematica: the Durbin–Watson (''d'') statistic is included as an option in the LinearModelFit function.
#SAS: a standard output when using proc model, and an option (dw) when using proc reg.
#EViews: automatically calculated when using OLS regression.
#Stata: the command -estat dwatson-, following -regress- in time series data. Engle's LM test for autoregressive conditional heteroskedasticity (ARCH), a test for time-dependent volatility, the Breusch–Godfrey test, and Durbin's alternative test for serial correlation are also available. All of these (except -estat dwatson-) test for higher-order serial correlation. The Breusch–Godfrey test and Durbin's alternative test also allow regressors that are not strictly exogenous.
#Excel: although Microsoft Excel 2007 does not have a specific Durbin–Watson function, the ''d''-statistic may be calculated using <code>=SUMXMY2(x_array,y_array)/SUMSQ(array)</code>.
#Minitab: the option to report the statistic in the Session window can be found under the "Options" box under Regression and via the "Results" box under General Regression.
#Python: a durbin_watson function is included in the statsmodels package (statsmodels.stats.stattools.durbin_watson); see the example after this list.
#SPSS: included as an option in the Regression function.
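
For example, a minimal usage of the statsmodels implementation noted above:

<syntaxhighlight lang="python">
import numpy as np
import statsmodels.api as sm
from statsmodels.stats.stattools import durbin_watson

# Fit an OLS regression on simulated data and test its residuals.
rng = np.random.default_rng(0)
x = rng.normal(size=100)
y = 1.0 + 2.0 * x + rng.normal(size=100)

results = sm.OLS(y, sm.add_constant(x)).fit()
print(durbin_watson(results.resid))  # near 2 for uncorrelated errors
</syntaxhighlight>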
 
==See also==
* [[Time-series regression]]
* [[Autocorrelation|ACF]] / [[PACF]]
* [[Correlation dimension]]
 
==Notes==
{{Reflist}}
 
==References==
{{refbegin|60em}}
*{{cite journal |last=Bhargava |first=Alok |last2=Franzini |first2=L. |last3=Narendranathan |first3=W. |year=1982 |title=Serial Correlation and the Fixed Effects Model |journal=[[Review of Economic Studies]] |volume=49 |issue=4 |pages=533–549 |doi=10.2307/2297285 }}
*{{cite journal |last=Durbin |first=J. |last2=Watson |first2=G. S. |year=1950 |title=Testing for Serial Correlation in Least Squares Regression, I |journal=[[Biometrika]] |volume=37 |issue=3–4 |pages=409–428 |jstor=2332391 }}
*{{cite journal |last=Durbin |first=J. |last2=Watson |first2=G. S. |year=1951 |title=Testing for Serial Correlation in Least Squares Regression, II |journal=Biometrika |volume=38 |issue=1–2 |pages=159–179 |jstor=2332325 }}
*{{cite book |last=Gujarati |first=Damodar N. |first2=Dawn C. |last2=Porter |year=2009 |title=Basic Econometrics |edition=5th |location=Boston |publisher=McGraw-Hill Irwin |isbn=9780073375779 }}
*{{cite journal |last=Neumann |first=John von |year=1941 |title=Distribution of the ratio of the mean square successive difference to the variance |journal=[[Annals of Mathematical Statistics]] |volume=12 |issue=4 |pages=367–395 |jstor=2235951 }}
*{{cite journal |last=Sargan |first=J. D. |authorlink=Alok Bhargava |first2=Alok |last2=Bhargava |year=1983 |title=Testing residuals from least squares regression for being generated by the Gaussian random walk |journal=[[Econometrica]] |volume=51 |issue=1 |pages=153–174 |jstor=1912252 }}
*{{cite book |last=Verbeek |first=Marno |title=A Guide to Modern Econometrics |edition=2nd |location=Chichester |publisher=John Wiley & Sons |year=2004 |pages=102f. |isbn=0470857730 }}
*{{cite book |last=DeFusco |first=Richard A. |last2=McLeavey |first2=Dennis W. |last3=Pinto |first3=Jerald E. |last4=Runkle |first4=David E. |chapter=Multiple Regression and Issues in Regression Analysis |title=CFA Program Curriculum, Level II }}
{{refend}}
 
==External links==
* [http://www.stanford.edu/~clint/bench/dwcrit.htm Table for high ''n'' and ''k'']
 
{{Statistics|analysis}}
 
{{DEFAULTSORT:Durbin-Watson statistic}}
[[Category:Econometrics]]
[[Category:Statistical tests]]
[[Category:Time series analysis]]
