AUTOREG Procedure

From SAS Wiki

Jump to: navigation, search

The AUTOREG procedure estimates and forecasts linear regression models for time series data when the errors are autocorrelated or heteroscedastic. The autoregressive error model is used to correct for autocorrelation, and the generalized autoregressive conditional heteroscedasticity (GARCH) model and its variants are used to model and correct for heteroscedasticity.

When time series data are used in regression analysis, often the error term is not independent through time. Instead, the errors are serially correlated or autocorrelated. If the error term is autocorrelated, the efficiency of ordinary least squares (OLS) parameter estimates is adversely affected and standard error estimates are biased.

The autoregressive error model corrects for serial correlation. The AUTOREG procedure can fit autoregressive error models of any order and can fit subset autoregressive models. You can also specify stepwise autoregression to select the autoregressive error model automatically.

The AUTOREG procedure supports the following variations of the GARCH models:

 o generalized ARCH (GARCH) 
 o integrated GARCH (IGARCH) 
 o exponential GARCH (EGARCH) 
 o quadratic GARCH (QGARCH) 
 o threshold GARCH (TGARCH) 
 o power GARCH (PGARCH) 
 o GARCH-in-mean (GARCH-M)

Contents

[edit] Syntax

Syntax: PROC AUTOREG options ; 
    BY variables ; 
    CLASS variables ; 
    MODEL dependent = regressors / options ; 
    HETERO variables / options ; 
    NLOPTIONS options ; 
    RESTRICT equation , ..., equation ; 
    TEST equation , ..., equation / option ; 
    OUTPUT OUT = SAS data set options ;

[edit] Procedure Options

[edit] DATA=

[Syntax: DATA= SAS-data-set] Specifies the input SAS data set. If the DATA= option is not specified, PROC AUTOREG uses the most recently created SAS data set.

[edit] OUTEST=

[Syntax: OUTEST= SAS-data-set] Writes the parameter estimates to an output data set.

[edit] COVOUT

Writes the covariance matrix for the parameter estimates to the OUTEST= data set. This option is valid only if the OUTEST= option is specified.

[edit] PLOTS=

[Syntax: PLOTS<(global-plot-options)> < = (specific plot options)>] Requests that the AUTOREG procedure produce statistical graphics via the Output Delivery System, provided that the ODS GRAPHICS statement has been specified. The global-plot-options apply to all relevant plots generated by the AUTOREG procedure. The global-plot-options supported by the AUTOREG procedure follow. ONLY suppresses the default plots. Only the plots specifically requested are produced. UNPACK | UNPACKPANEL breaks a graphic that is otherwise paneled into individual component plots
ALL
Requests that all plots appropriate for the particular analysis be produced.
ACF
Produces the autocorrelation function plot.
IACF
Produces the inverse autocorrelation function plot of residuals.
PACF
Produces the partial autocorrelation function plot of residuals.
FITPLOT
Plots the predicted and actual values.
COOKSD
Produces the Cook's D plot.
QQ
Q-Q plot of residuals.
RESIDUAL
Plots the residuals.
STUDENTRESIDUAL
Plots the studentized residuals.
STANDARDRESIDUAL
Plots the standardized residuals.
WHITENOISE
Plots the white noise probabilities.
RESIDUALHISTOGRAM
Plots the histogram of residuals.
NONE
Suppresses all plots.

[edit] Procedure Statements

[edit] BY

Syntax: BY variables ; A BY statement can be used with PROC AUTOREG to obtain separate analyses on observations in groups defined by the BY variables.

[edit] CLASS

Syntax: CLASS variables ; The CLASS statement names the classification variables to be used in the analysis. Classification variables can be either character or numeric. In PROC AUTOREG, the CLASS statement enables you to output class variables to a data set that contains a copy of the original data. Class levels are determined from the formatted values of the CLASS variables. Thus, you can use formats to group values into levels.

[edit] MODEL

Syntax: MODEL dependent = regressors / options ; | label: MODEL dependent = regressors / options ; The MODEL statement specifies the dependent variable and independent regressor variables for the regression model. If no independent variables are specified in the MODEL statement, only the mean is fitted. (This is a way to obtain autocorrelations of a series. Models can be given labels of up to eight characters. Model labels are used in the printed output to identify the results for different models.)
CENTER
Centers the dependent variable by subtracting its mean and suppresses the intercept parameter from the model. This option is valid only when the model does not have regressors (explanatory variables).
NOINT
Suppresses the intercept parameter.
NLAG=
[Syntax: NLAG= number | NLAG= (number-list)] Specifies the order of the autoregressive error process or the subset of autoregressive error lags to be fitted. Note that NLAG=3 is the same as NLAG=(1 2 3). If the NLAG= option is not specified, PROC AUTOREG does not fit an autoregressive model.
DIST=
Specifies the distribution assumed for the error term in GARCH-type estimation. If no GARCH= option is specified, the option is ignored. If EGARCH is specified, the distribution is always normal distribution.
T
Specifies Student's t distribution.
NORMAL
Specifies the standard normal distribution. This is the default.
GARCH=
[Syntax: GARCH=( option-list )] Specifies a GARCH-type conditional heteroscedasticity model. The GARCH= option in the MODEL statement specifies the family of ARCH models to be estimated. The GARCH regression model is specified in the following statement: model y = x1 x2 / garch=(q=1,p=1);When you want to estimate the subset of ARCH terms, such as ARCH, you can write the SAS statement as follows: model y = x1 x2 / garch=(q=(1 3)); The following options can be used in the GARCH=( ) option. The options are listed within parentheses and separated by commas. Q= number | Q= (number-list) specifies the order of the process or the subset of ARCH terms to be fitted. P= number | P= (number-list) specifies the order of the process or the subset of GARCH terms to be fitted. TYPE= value specifies the type of GARCH model. The values of the TYPE= option are as follows: EXP -- specifies the exponential GARCH or EGARCH model. INTEGRATED -- specifies the integrated GARCH or IGARCH model. NELSON | NELSONCAO --specifies the Nelson-Cao inequality constraints. This is the default. NONNEG -- specifies the GARCH model with nonnegativity constraints. STATIONARY -- constrains the sum of GARCH coefficients to be less than 1. POWER | PGARCH -- specifies the power GARCH or PGARCH model. QUADR | QUADRATIC | QGARCH -- specifies the quadratic GARCH or QGARCH model. STATIONARY -- constrains the sum of GARCH coefficients to be less than 1. THRES | THRESHOLD | TGARCH -- specifies the threshold GARCH or TGARCH model. MEAN= -- Specifies the functional form of the GARCH-M model. NOINT -- Suppresses the intercept parameter in the conditional variance model. This option is valid only with the TYPE=INTEG option. STARTUP=MSE | ESTIMATE -- Requests whether the positive constant c for the start-up values of the GARCH conditional error variance process is estimated, or the value of the mean squared error is used as the default constant. TR -- Uses the trust region method for GARCH estimation. This algorithm is numerically stable, though computation is expensive. The double quasi-Newton method is the default.
ALL
Requests all printing options.
ARCHTEST=
[Syntax: ARCHTEST=(option-list)] Specifies tests for the absence of ARCH effects. The following options can be used in the ARCHTEST=( ) option. The options are listed within parentheses and separated by commas. QLM | QLMARCH requests the Q and Engle's LM tests. LK | LKARCH requests Lee and King's ARCH tests. WL | WLARCH requests Wong and Li's ARCH tests. ALL requests all ARCH tests, namely Q and Engle's LM tests, Lee and King's tests, and Wong and Li's tests.
BDS=
[Syntax: BDS=(option-list)] Specifies Brock-Dechert-Scheinkman (BDS) tests for independence. The following options can be used in the BDS=( ) option. The options are listed within parentheses and separated by commas. M=number specifies the maximum number of the embedding dimension. The BDS tests with embedding dimension from 2 to M are calculated. M must be an integer between 2 and 20. The default value of the M= suboption is 20. D=number specifies the parameter to determine the radius for BDS test. The BDS test sets up the radius as r=D*σ, where σ is the standard deviation of the time series to be tested. By default, D=1.5. PVALUE=DIST | SIM specifies the way to calculate the p-values. By default or if PVALUE=DIST is specified, the p-values are calculated according to the asymptotic distribution of BDS statistics (that is, the standard normal distribution). Otherwise, for samples of size less than 500, the p-values are obtained though Monte Carlo simulation. Z=value specifies the type of the time series (residuals) to be tested. The values of the Z= suboption are as follows: Y -- specifies the regressand. The default is Z=Y. RO -- specifies the OLS residuals. R -- specifies the residuals of the final model. RM -- specifies the structural residuals of the final model. SR -- specifies the standardized residuals of the final model, defined by residuals over the square root of the conditional variance.
CHOW=
[Syntax: CHOW= ( obs-1 ...obs-n )] Computes Chow tests to evaluate the stability of the regression coefficient. The Chow testis also called the analysis-of-variance test. Each value obs-i listed on the CHOW= option specifies a break point of the sample. The sample is divided into parts at the specified break point, with observations before obs-i in the first part and obs-i and later observations in the second part, and the fits of the model in the two parts are compared to whether both parts of the sample are consistent with the same model. The break points obs-i refer to observations within the time range of the dependent variable, ignoring missing values before the start of the dependent series. Thus, CHOW=20 specifies the 20th observation after the first nonmissing observation for the dependent variable. For example, if the dependent variable Y contains 10 missing values before the first observation with a nonmissing Y value, then CHOW=20 actually refers to the 30th observation in the data set. When you specify the break point, you should note the number of presample missing values.
COEF
Prints the transformation coefficients for the first p observations. These coefficients are formed from a scalar multiplied by the inverse of the Cholesky root of the Toeplitz matrix of autocovariances.
CORRB
Prints the estimated correlations of the parameter estimates.
COVB
Prints the estimated covariances of the parameter estimates.
COVEST=
Specifies the type of covariance matrix for the GARCH or heteroscedasticity model.
OP
The outer product matrix is used to compute the covariance matrix of the parameter estimates. This is the default.
HESSIAN
Produces the covariance matrix by using the Hessian matrix.
QML
Computes the quasi-maximum likelihood estimates.
DW=
[Syntax: DW= n] Prints Durbin-Watson statistics up to the order n. The default is DW=1. When the LAGDEP option is specified, the Durbin-Watson statistic is not printed unless the DW= option is explicitly specified.
DWPROB
Now produces p-values for the generalized Durbin-Watson test statistics for large sample sizes. Previously, the Durbin-Watson probabilities were calculated only for small sample sizes. The new method of calculating Durbin-Watson probabilities is based on the algorithm of Ansley, Kohn, and Shively (1992).
GINV
Prints the inverse of the Toeplitz matrix of autocovariances for the Yule-Walker solution.
GODFREY=
[Syntax: GODFREY=r] Produces Godfrey's general Lagrange multiplier test against ARMA errors.
ITPRINT
Prints the objective function and parameter estimates at each iteration. The objective function is the full log likelihood function for the maximum likelihood method, while the error sum of squares is produced as the objective function of unconditional least squares. For the ML method, the ITPRINT option prints the value of the full log likelihood function, not the concentrated likelihood.
LAGDEP | LAGDV
Prints the Durbin t statistic, which is used to detect residual autocorrelation in the presence of lagged dependent variables.
LAGDEP= | LAGDV=
[Syntax: LAGDEP= name] Prints the Durbin h statistic for testing the presence of first-order autocorrelation when regressors contain the lagged dependent variable whose name is specified as LAGDEP=name.
LOGLIKL
Prints the log likelihood value of the regression model, assuming normally distributed errors.
NOPRINT
Suppresses all printed output.
NORMAL
Specifies the Jarque-Bera's normality test statistic for regression residuals.
PARTIAL
Prints partial autocorrelations.
PCHOW=
[Syntax: PCHOW= (obs-1 ...obs-n)] Computes the predictive Chow test. The form of the PCHOW= option is the same as the CHOW= option.
RESET
Produces Ramsey's RESET test statistics.
RUNS=
[Syntax: RUNS=(Z=value)] Specifies the runs test for independence. The Z= suboption specifies the type of the time series or residuals to be tested. The values of the Z= suboption are as follows: Y specifies the regressand. The default is Z=Y. RO specifies the OLS residuals. R specifies the residuals of the final model. RM specifies the structural residuals of the final model. SR specifies the standardized residuals of the final model, defined by residuals over the square root of the conditional variance.
STATIONARITY=
Specifies tests of stationarity or unit roots. The STATIONARITY= option provides Phillips-Perron, Phillips-Ouliaris, and KPSS tests.
(PHILLIPS)
Produces the Phillips-Perron unit root test when there are no regressors in the MODEL statement. When the model includes regressors, the PHILLIPS option produces the Phillips-Ouliaris cointegration test.
(PHILLIPS=(value ...value))
Produces the Phillips-Perron unit root test when there are no regressors in the MODEL statement. When the model includes regressors, the PHILLIPS option produces the Phillips-Ouliaris cointegration test.
(KPSS)
Produces the Kwiatkowski, Phillips, Schmidt, and Shin (1992) (KPSS) unit root test. The Newey-West kernelis used, in which the default truncation point method is the Schwert with c=4 (SCHW=4).
(KPSS=(KERNEL=TYPE))
Produces the Kwiatkowski, Phillips, Schmidt, and Shin (1992) (KPSS) unit root test. You can specify twotypes of the kernel: KERNEL=NW|BART Newey-West (or Bartlett) kernel KERNEL=QS Quadratic spectral kernel
(KPSS=(KERNEL=TYPE TRUNCPOINTMETHOD))
Produces the Kwiatkowski, Phillips, Schmidt, and Shin (1992) (KPSS) unit root test. You can specify twotypes of the kernel: KERNEL=NW|BART Newey-West (or Bartlett) kernel KERNEL=QS Quadratic spectral kernelYou can set the truncation point s by using three different methods: SCHW=c Schwert maximum lag formula LAG= LAG= manually defined number of lags AUTO Automatic bandwidth selection
(PHILLIPS<=(...)>, KPSS<=(...)>)
Specifies both the Phillips-Perron and KPSS unit root tests.
(ERS)
Provides a class of efficient unit root tests, in the sense that it reduces the size distortion and improve the power compared with traditional unit root tests such as augmented Dickey-Fuller and Phillips-Perron tests.
(ERS=(value))
Two test statistics are provided by the ERS test: the point optimal test and the DF-GLS test, which are originally proposed in Elliott, Rothenberg, and Stock (1996).
(NP)
Provides a class of efficient unit root tests, in the sense that it reduces the size distortion and improve the power compared with traditional unit root tests such as augmented Dickey-Fuller and Phillips-Perron tests.
=(NP=(value))
Four different tests, discussed in Ng and Perron (2001), are reported by NP test. These four tests include the two in the ERS test and two other tests, the modified PP test and the modified point optimal test.
(ADF<=(...)>,ERS<=(...)>, KPSS<=(...)>, NP<=(...)>, PHILLIPS<=(...)> )
The ADF or ADF= suboption produces the augmented Dickey-Fuller unit root test (Dickey and Fuller; 1979).
URSQ
Prints the uncentered regression R². The uncentered regression R² is useful to compute Lagrange multiplier test statistics, since most LM test statistics are computed as T *URSQ, where T is the number of observations used in estimation.
BACKSTEP
Removes insignificant autoregressive parameters. The parameters are removed in order of least significance. This backward elimination is done only once on the Yule-Walker estimates computed after the initial ordinary least squares estimation. The BACKSTEP option can be used with all estimation methods since the initial parameter values for other estimation methods are estimated using the Yule-Walker method.
SLSTAY=
[Syntax: SLSTAY= value] Specifies the significance level criterion to be used by the BACKSTEP option. The default is SLSTAY=.05.
CONVERGE=
[Syntax: CONVERGE= value] Specifies the convergence criterion. If the maximum absolute value of the change in the autoregressive parameter estimates between iterations is less than this amount, then convergence is assumed. The default is CONVERGE=.001. If the GARCH= option and/or the HETERO statement is specified, convergence is assumed when the absolute maximum gradient is smaller than the value specified by the CONVERGE= option or when the relative gradient is smaller than 1E–8. By default, CONVERGE=1E–5.
INITIAL= | START=
[Syntax: INITIAL=(initial-values)] Specifies initial values for some or all of the parameter estimates. The values specified are assigned to model parameters in the same order as the parameter estimates are printed in the AUTOREG procedure output. The order of values in the INITIAL= or START= option is as follows: the intercept, the regressor coefficients, the autoregressive parameters, the ARCH parameters, the GARCH parameters, the inverted degrees of freedom for Student's t distribution, the start-up value for conditional variance, and the heteroscedasticity model parameters η specified by the HETERO statement.
LDW
Specifies that p-values for the Durbin-Watson test be computed using a linearized approximation of the design matrix when the model is nonlinear due to the presence of an autoregressive error process. (The Durbin-Watson tests of the OLS linear regression model residuals are not affected by the LDW option.)
MAXITER=
[Syntax: MAXITER= number] Sets the maximum number of iterations allowed. The default is MAXITER=50.
METHOD=
Requests the type of estimates to be computed.
ML
Specifies maximum likelihood estimates.
ULS
Specifies unconditional least squares estimates.
YW
Specifies Yule-Walker estimates.
ITYW
Specifies iterative Yule-Walker estimates.
NOMISS
Requests the estimation to the first contiguous sequence of data with no missing values. Otherwise, all complete observations are used.
OPTMETHOD=
Specifies the optimization technique when the GARCH or heteroscedasticity model is estimated.
QN
Specifies the quasi-Newton method. This is the default.
TR
Specifies the trust region method.

[edit] HETERO

Syntax: HETERO variables / options ; The HETERO statement specifies variables that are related to the heteroscedasticity of the residuals and the way these variables are used to model the error variance of the regression.
LINK=
Specifies the functional form of the heteroscedasticity model. By default, LINK=EXP. If you specify a GARCH model with the HETERO statement, the model is estimated using LINK= LINEAR only.
EXP
Specifies the exponential link function.
SQUARE
Specifies the square link function.
LINEAR
Specifies the linear function; that is, the HETERO statement variables predict the error variance linearly.
COEF=
Imposes constraints on the estimated parameters η of the heteroscedasticity model.
NONNEG
Specifies that the estimated heteroscedasticity parameters η must be nonnegative.
UNIT
Constrains all heteroscedasticity parameters η to equal 1
ZERO
Constrains all heteroscedasticity parameters η to equal 0.
UNREST
Specifies unrestricted estimation of η.
STD=
Imposes constraints on the estimated standard deviation σ of the heteroscedasticity model.
NONNEG
Specifies that the estimated standard deviation parameter σ must be nonnegative.
UNIT
Constrains the standard deviation parameter σ to equal 1.
UNREST
Specifies unrestricted estimation of σ.
TEST=
Produces a Lagrange multiplier test for heteroscedasticity. The null hypothesis is homoscedasticity; the alternative hypothesis is heteroscedasticity of the form specified by the HETERO statement. The power of the test depends on the variables specified in the HETERO statement.
LM
Specifies a Lagrange multiplier test for heteroscedasticity.
NOCONST
Specifies that the heteroscedasticity model does not include the unit term for the LINK=SQUARE and LINK=LINEAR options.

[edit] NLOPTIONS

Syntax: NLOPTIONS options ; Several SAS/ETS procedures (COUNTREG, ENTROPY, MDC, QLIM, UCM, and VARMAX) use the nonlinear optimization (NLO) subsystem to perform nonlinear optimization.
ABSCONV= | ABSTOL=
[Syntax: ABSCONV= | ABSTOL=r] Specifies an absolute function convergence criterion. The default value of r is the negative square root of the largest double-precision value, which serves only as a protection against overflows.
ABSFCONV= | ABSFTOL=
[Syntax: ABSFCONV=r[n] | ABSFTOL=r[n]] Specifies an absolute function convergence criterion. The default value is r=0. The optional integer value n specifies the number of successive iterations for which the criterion must be satisfied before the process can be terminated.
ABSGCONV= | ABSGTOL=
[Syntax: ABSGCONV=r[n] | ABSGTOL=r[n]] Specifies the absolute gradient convergence criterion. The default value is r=1E-5. The optional integer value n specifies the number of successive iterations for which the criterion must be satisfied before the process can be terminated..
ABSXCONV= | ABSXTOL=
[Syntax: ABSXCONV= | ABSXTOL=r[n]] Specifies the absolute parameter convergence criterion. The default is r=1E-8 for the NMSIMP technique and r=0 otherwise. The optional integer value n specifies the number of successive iterations for which the criterion must be satisfied before the process can terminate.
DAMPSTEP=
[Syntax: DAMPSTEP<=r>] Specifies that the initial step length value a**(0) for each line search (used by the QUANEW, HYQUAN, CONGRA, or NEWRAP technique) cannot be larger than r times the step length value used in the former iteration. If the DAMPSTEP option is specified but r is not specified, the default is r=2. The DAMPSTEP=r option can prevent the line-search algorithm from repeatedly stepping intoregions where some objective functions are difficult to compute or where they could lead to floating point overflows during the computation of objective functions and their derivatives. The DAMPSTEP=r option can save time-costly function calls during the line searches of objective functions that result in very small steps.
FCONV= | FTOL=
[Syntax: FCONV= | FTOL=r[n]] Specifies a relative function convergence criterion.
FCONV2= | FTOL2=
[Syntax: FCONV2= | FTOL2=r[n]] Specifies another function convergence criterion.
FSIZE=
[Syntax:FSIZE=r] Specifies the FSIZE parameter of the relative function and relative gradient termination criteria. The default value is r=0.
GCONV= | GTOL=
[Syntax: GCONV=r | GTOL=r] Specifies a relative gradient convergence criterion. The default value is r=1E-8. The optional integer value n specifies the number of successive iterations for which the criterion must be satisfied before the process can terminate.
HESCAL= | HS=
Specifies the scaling version of the Hessian matrix used in NRRIDG, TRUREG, NEWRAP, or DBLDOG optimization.
0
Specifies that no scaling is done.
1
Specifies the Moré (1978) scaling update
2
Specifies the Dennis, Gay, and Welsch (1981) scaling update
3
Specifies that di is reset in each iteration
INHESSIAN= | INHESS=
[Syntax: INHESSIAN[=r]] Specifies how the initial estimate of the approximate Hessian is defined for the quasi-Newton techniques QUANEW and DBLDOG.
INSTEP=
[Syntax: INSTEP=r] Reduces the length of the first trial step during the line search of the first iterations.
LINESEARCH= | LIS=
[Syntax: LINESEARCH= | LIS=i] Specifies the line-search method for the CONGRA, QUANEW, and NEWRAP optimization techniques.
1
Specifies a line-search method that needs the same number of function and gradient calls for cubic interpolation and cubic extrapolation; this method is similar to one used by the Harwell subroutine library.
2
Specifies a line-search method that needs more function calls than gradient calls for quadratic and cubic interpolation and cubic extrapolation; this method is implemented as shown in Fletcher (1987) and can be modified to an exact line search by using the LSPRECISION= option.
3
Specifies a line-search method that needs the same number of function and gradient calls for cubic interpolation and cubic extrapolation; this method is implemented as shown in Fletcher (1987) and can be modified to an exact line search by using the LSPRECISION= option.
4
Specifies a line-search method that needs the same number of function and gradient calls for stepwise extrapolation and cubic interpolation
5
Specifies a line-search method that is a modified version of LIS=4.
6
Specifies golden section line search (Polak 1971), which uses only function values for linear approximation.
7
Specifies bisection line search (Polak 1971), which uses only function values for linear approximation.
8
Specifies Armijo line-search technique (Polak 1971), which uses only function values for linear approximation.
LSPRECISION= | LSP=
[Syntax: LSPRECISION= | LSP=r] Specifies the degree of accuracy that should be obtained by the line-search algorithms LIS=2 and LIS=3. The default LSPRECISION= values are:
  1. For TECH=QUANEW UPDATE=DBFGS, BFGS: r = 0.4
  2. For TECH=QUANEW UPDATE=DDFP, DFP: r = 0.06
  3. For TECH=CONGRA UPDATE=all r = 0.1
  4. For TECH=NEWRAP NO UPDATE: r = 0.9
MAXFUNC= | MAXFU=
[Syntax: MAXFUNC= | MAXFU=i] Requires the number of function calls to be no larger than i. The default values are:
  1. For TECH= TRUREG, NRRIDG, NEWRAP: i=125
  2. For TECH= DBLDOG, QUANEW: i=500
  3. For TECH= CONGRA: i=1000
  4. For TECH= NMSIMP: i=3000 Note that the optimization can terminate only after completing a full iteration. Therefore, the number of function calls that is actually performed can exceed the number that is specified by the MAXFUNC= option.
MAXITER= | MAXIT=
[Syntax: MAXITER= | MAXIT=i[n]] Requires the number of iterations to be no larger than i. The default values are:
  1. For TECH= TRUREG, NRRIDG, NEWRAP: i=50
  2. For TECH= QUANEW, DBLDOG: i=200
  3. For TECH= CONGRA: i=400
  4. For TECH= NMSIMP: i=1000 These default values are also valid when i is specified as a missing value.
MAXSTEP=
[Syntax: MAXSTEP=r[n]] Specifies an upper bound for the step length of the line-search algorithms during the first n iterations.
MAXTIME=
[Syntax: MAXTIME=r] Requires the CPU time to be no larger than r. The default value of the MAXTIME= option is the largest double floating-point number on your computer. Note that the time specified by the MAXTIME= option is checked only once at the end of each iteration. Therefore, the actual running time can be much longer than that specified by the MAXTIME= option. The actual running time includes the rest of the time needed to finish the iteration and the time needed to generate the output of the results.
MINITER= | MINIT=
[Syntax: MINITER= | MINIT=i] Specifies the minimum number of iterations. The default value is 0. If you request more iterations than are actually needed for convergence to a stationary point, the optimization algorithms can behave strangely. For example, the effect of rounding errors can prevent the algorithm from continuing for the required number of iterations.
NOPRINT
Suppresses the output.
PALL
[Displays all optional output for optimization.
PHISTORY
Displays the optimization history.
PHISTPARMS
Display parameter estimates in each iteration.
PINIT
Displays the initial values and derivatives (if available).
PSHORT
Restricts the amount of default output.
PSUMMARY
Restricts the amount of default displayed output to a short form of iteration history and notes, warnings, and errors.
RESTART= | REST=
[Syntax: RESTART= | REST=i > 0] Specifies that the QUANEW or CONGRA algorithm is restarted with a steepest descent/ascent search direction after, at most, i iterations, i > 0.
SOCKET=
Specifies the fileref that contains the information needed for remote monitoring.
TECHNIQUE= | TECH=
[Syntax: TECHNIQUE= | TECH=name] Specifies the optimization technique.
CONGRA
Chooses one of four different conjugate-gradient optimization algorithms, which can be more precisely defined with the UPDATE= option and modified with the LINESEARCH= option.
DBLDOG
Performs a version of double-dogleg optimization, which uses the gradient to update an approximation of the Cholesky factor of the Hessian.
NMSIMP
Performs a Nelder-Mead simplex optimization.
NEWRAP
Performs a usually stable but, for large problems, memory- and time-consuming Newton-Raphson optimization technique. The algorithm combines a line-search algorithm with ridging, and it can be modified with the LINESEARCH= option.
NRRIDG
Performs a usually stable but, for large problems, memory- and time-consuming Newton-Raphson optimization technique. This algorithm does not perform a line search.
QUANEW
Chooses one of four different quasi-Newton optimization algorithms that can be more precisely defined with the UPDATE= option and modified with the LINESEARCH= option.
TRUREG
Performs a usually very stable but, for large problems, memory- and time-consuming trust-region optimization technique. The algorithm is implemented similar to Gay (1983) and Moré and Sorensen (1983).
NONE
Does not perform any optimization. This option is similar to METHOD=NONE, but TECH=NONE also computes and displays residuals and goodness of fit statistics.
UPDATE= | UPD=
Specifies the update method for the quasi-Newton, double-dogleg, or conjugate-gradient optimization technique.
BFGS
Performs the original Broyden, Fletcher, Goldfarb, and Shanno (BFGS) update of the inverse Hessian matrix.
DBFGS
Performs the dual BFGS update of the Cholesky factor of the Hessian matrix. This is the default update method.
DDFP
Performs the dual Davidon, Fletcher, and Powell (DFP) update of the Cholesky factor of the Hessian matrix.
DFP
Performs the original DFP update of the inverse Hessian matrix.
PB
Performs the automatic restart update method of Powell (1977) and Beale (1972).
FR
Performs the Fletcher-Reeves update (Fletcher 1987).
PR
Performs the Polak-Ribiere update (Fletcher 1987).
CD
Performs a conjugate-descent update of Fletcher (1987).
XCONV= | XTOL=
[Syntax: XCONV= | XTOL=r[n]] Specifies the relative parameter convergence criterion. For all techniques except NMSIMP, termination requires a small relative parameter change in subsequent iterations. The default value is r=1E-8 for the NMSIMP technique and r=0 otherwise. The optional integer value n specifies the number of successive iterations for which the criterion must be satisfied before the process can be terminated.
XSIZE=
[Syntax: XSIZE=r > 0] Specifies the XSIZE parameter of the relative parameter termination criterion. The default value is r=0.

[edit] RESTRICT

Syntax: RESTRICT equation , ..., equation ; The RESTRICT statement provides constrained estimation. The RESTRICT statement places restrictions on the parameter estimates for covariates in the preceding MODEL statement. The AR, GARCH, and HETERO parameters are also supported in the RESTRICT statement. Any number of RESTRICT statements can follow a MODEL statement. Several restrictions can be specified in a single RESTRICT statement by separating the individual restrictions with commas. Each restriction is written as a linear equation composed of constants and parameter names. Refer to model parameters by the name of the corresponding regressor variable. Each name used in the equation must be a regressor in the preceding MODEL statement. Use the keyword INTERCEPT to refer to the intercept parameter in the model. The following is an example of a RESTRICT statement: model y = a b c d; restrict a+b=0, 2*d-c=0;

[edit] TEST

Syntax: TEST equation , ..., equation / option ; The TEST statement tests hypotheses about the covariates in the model estimated by the preceding MODEL statement. The AR, GARCH, and HETERO parameters are also supported in the TEST statement. Each equation specifies a linear hypothesis to be tested. If more than one equation is specified, the equations are separated by commas. Each test is written as a linear equation composed of constants and parameter names. Refer to parameters by the name of the corresponding regressor variable. Each name used in the equation must be a regressor in the preceding MODEL statement. Use the keyword INTERCEPT to refer to the intercept parameter in the model.
TYPE=
Specifies the test statistics to use.
F
Produces an F test. This option is supported for all models specified in MODEL statement.
WALD
Produces a Wald test. This option is supported for all models specified in MODEL statement.
LM
Produces a Lagrange multiplier test. This option is supported only when the GARCH= option is specified (for example, when there is a statement like MODEL Y = C D I / GARCH=(Q=2)).
LR
Produces a likelihood ratio test. This option is supported only when the GARCH= option is specified (for example, when there is a statement like MODEL Y = C D I / GARCH=(Q=2)).
ALL
Produces all tests applicable for a particular model. For non-GARCH-type models, only F and Wald tests are output. For all other models, all four tests (LR, LM, F, and Wald) are computed.

[edit] OUTPUT

Syntax: OUTPUT OUT=SAS-data-set keyword = options ...; ; The OUTPUT statement creates an output SAS data set.
OUT=
[Syntax: OUT= SAS-data-set] Names the output SAS data set containing the predicted and transformed values. If the OUT= option is not specified, the new data set is named according to the DATAn convention.
ALPHACLI=
[Syntax: ALPHACLI= number] Sets the confidence limit size for the estimates of future values of the response time series. The ALPHACLI= value must be between 0 and 1. The resulting confidence interval has 1-number confidence. The default is ALPHACLI=.05, corresponding to a 95% confidence interval.
ALPHACLM=
[Syntax: ALPHACLM= number] Sets the confidence limit size for the estimates of the structural or regression part of the model. The ALPHACLI= value must be between 0 and 1. The resulting confidence interval has 1-number confidence. The default is ALPHACLM=.05, corresponding to a 95% confidence interval.
ALPHACSM=
Specifies the significance level for the upper and lower bounds of the CUSUM and CUSUMSQ statistics output by the CUSUMLB=, CUSUMUB=, CUSUMSQLB=, and CUSUMSQUB= options.
.01
ALPHACSM=.01
.05
ALPHACSM=.05
.10
ALPHACSM=.10
BLUS=
[Syntax: BLUS= variable] Specifies the name of a variable to contain the values of the Theil's BLUS residuals.
CEV= | HT=
[Syntax: CEV= variable] Writes to the output data set the value of the error variance σ²t from the heteroscedasticity model specified by the HETERO statement or the value of the conditional error variance ht by the GARCH= option in the MODEL statement.
CPEV=
[Syntax: CPEV= variable] Writes the conditional prediction error variance to the output data set. The value of conditional prediction error variance is equal to that of the conditional error variance when there are no autoregressive parameters. For the exponential GARCH model, conditional prediction error variance cannot be calculated.
CONSTANT=
[Syntax: CONSTANT= variable] Writes the transformed intercept to the output data set.
CUSUM=
[Syntax: CUSUM= variable] Specifies the name of a variable to contain the CUSUM statistics.
CUSUMSQ=
[Syntax: CUSUMSQ= variable] Specifies the name of a variable to contain the CUSUMSQ statistics.
CUSUMUB=
[Syntax: CUSUMUB= variable] Specifies the name of a variable to contain the upper confidence bound for the CUSUM statistic.
CUSUMLB=
[Syntax: CUSUMLB= variable] Specifies the name of a variable to contain the lower confidence bound for the CUSUM statistic.
CUSUMSQUB=
[Syntax: CUSUMSQUB= variable] Specifies the name of a variable to contain the upper confidence bound for the CUSUMSQ statistic.
CUSUMSQLB=
[Syntax: CUSUMSQLB= variable] Specifies the name of a variable to contain the lower confidence bound for the CUSUMSQ statistic.
LCL=
[Syntax: LCL= name] Writes the lower confidence limit for the predicted value (specified in the PREDICTED= option) to the output data set. The size of the confidence interval is set by the ALPHACLI= option. When a GARCH model is estimated, the lower confidence limit is calculated assuming that the disturbances have homoscedastic conditional variance.
LCLM=
[Syntax: LCLM= name] Writes the lower confidence limit for the structural predicted value (specified in the PREDICTEDM= option) to the output data set under the name given. The size of the confidence interval is set by the ALPHACLM= option.
PREDICTED= | P=
[Syntax: PREDICTED= name] Writes the predicted values to the output data set. These values are formed from both the structural and autoregressive parts of the model.
PREDICTEDM= | PM=
[Syntax: PREDICTEDM= name] Writes the structural predicted values to the output data set. These values are formed from only the structural part of the model.
RECPEV=
[Syntax: RECPEV= variable] Specifies the name of a variable to contain the part of the predictive error variance (vt) that is used to compute the recursive residuals.
RECRES=
[Syntax: RECRES= variable] Specifies the name of a variable to contain recursive residuals. The recursive residuals are used to compute the CUSUM and CUSUMSQ statistics.
RESIDUAL= | R=
[Syntax: RESIDUAL= name] Writes the residuals from the predicted values based on both the structural and time series parts of the model to the output data set.
RESIDUALM= | RM=
[Syntax: RESIDUALM= name] Writes the residuals from the structural prediction to the output data set.
TRANSFORM=
[Syntax: TRANSFORM= variables] Transforms the specified variables from the input data set by the autoregressive model and writes the transformed variables to the output data set. If you need to reproduce the data suitable for reestimation, you must also transform an intercept variable. To do this, transform a variable that is all 1s or use the CONSTANT= option.
UCL=
[Syntax: UCL= name] Writes the upper confidence limit for the predicted value (specified in the PREDICTED= option) to the output data set.
UCLM=
[Syntax: UCLM= name] Writes the upper confidence limit for the structural predicted value (specified in the PREDICTEDM= option) to the output data set. The size of the confidence interval is set by the ALPHACLM= option.

[edit] LAGLIST

This is no longer supported under SAS/ETS 9.22.

[edit] Example Code

DATA _NULL_;
   PUT 'Do you have a great example for AUTOREG?';
   PUT 'You can add it here!';
RUN;

[edit] Usage Tips

Have any tips for using AUTOREG? Add them here!

[edit] See Also

[edit] References





Personal tools
Namespaces
Variants
Actions
Navigation
Toolbox