Title: | Estimation and Diagnostics for Partially Linear Censored Regression Models Based on Heavy-Tailed Distributions |
---|---|
Description: | It estimates the parameters of a partially linear regression censored model via maximum penalized likelihood through of ECME algorithm. The model belong to the semiparametric class, that including a parametric and nonparametric component. The error term considered belongs to the scale-mixture of normal (SMN) distribution, that includes well-known heavy tails distributions as the Student-t distribution, among others. To examine the performance of the fitted model, case-deletion and local influence techniques are provided to show its robust aspect against outlying and influential observations. This work is based in Ferreira, C. S., & Paula, G. A. (2017) <doi:10.1080/02664763.2016.1267124> but considering the SMN family. |
Authors: | Marcela Nunez Lemus, Christian E. Galarza, Larissa Avila Matos, Victor H Lachos |
Maintainer: | Marcela Nunez Lemus <[email protected]> |
License: | GPL (>= 2) |
Version: | 1.39 |
Built: | 2025-02-16 03:16:09 UTC |
Source: | https://github.com/cran/PartCensReg |
It estimates the parameters of a partially linear regression censored model via maximum penalized likelihood through of ECME algorithm. The model belong to the semiparametric class, that including a parametric and nonparametric component. The error term considered belongs to the scale-mixture of normal (SMN) distribution, that includes well-known heavy tails distributions as the Student-t distribution, among others. To examine the performance of the fitted model, case-deletion and local influence techniques are provided to show its robust aspect against outlying and influential observations. This work is based in Ferreira, C. S., & Paula, G. A. (2017) <doi:10.1080/02664763.2016.1267124> but considering the SMN family.
Ferreira, C. S., & Paula, G. A. (2017). Estimation and diagnostic for skew-normal partially linear models. Journal of Applied Statistics, 44(16), 3033-3053.
Ibacache-Pulgar, G., Paula, G. A., & Cysneiros, F. J. A. (2013). Semiparametric additive models under symmetric distributions. Test, 22(1), 103-121.
Ibacache-Pulgar, G., & Paula, G. A. (2011). Local influence for Student-t partially linear models. Computational Statistics & Data Analysis, 55(3), 1462-1478.
dtawage = get(data(PSID1976,package = "AER")) y = dtawage$wage cc = c(rep(0,428),rep(1,325)) tt = dtawage$exper x = cbind(dtawage$education,dtawage$age, dtawage$hhours, dtawage$hwage, dtawage$tax, dtawage$youngkids, dtawage$oldkids) #Normal case by default with only 10 iterations PCR.default1 = Cens.SMN.PCR(x=x, y=y, c=cc, cens="left",tt =tt,iter.max = 10,Diagnostic = FALSE) ## Not run: #This may take few minutes #Normal case by default with full (200) iterations PCR.default2 = Cens.SMN.PCR(x=x, y=y, c=cc, cens="left",tt =tt) #contaminated normal case PCR.CN = Cens.SMN.PCR(x=x, y=y, c=cc, cens="left",tt =tt,type="NormalC", nu = c(0.1,0.1),iter.max = 100) ## End(Not run)
dtawage = get(data(PSID1976,package = "AER")) y = dtawage$wage cc = c(rep(0,428),rep(1,325)) tt = dtawage$exper x = cbind(dtawage$education,dtawage$age, dtawage$hhours, dtawage$hwage, dtawage$tax, dtawage$youngkids, dtawage$oldkids) #Normal case by default with only 10 iterations PCR.default1 = Cens.SMN.PCR(x=x, y=y, c=cc, cens="left",tt =tt,iter.max = 10,Diagnostic = FALSE) ## Not run: #This may take few minutes #Normal case by default with full (200) iterations PCR.default2 = Cens.SMN.PCR(x=x, y=y, c=cc, cens="left",tt =tt) #contaminated normal case PCR.CN = Cens.SMN.PCR(x=x, y=y, c=cc, cens="left",tt =tt,type="NormalC", nu = c(0.1,0.1),iter.max = 100) ## End(Not run)
Return the MPL estimates obtained through of ECME algorithm for partially linear regression models with censored data under scale-mixture of normal (SMN) distributions (some members are the normal, Student-t, slash and contaminated normal distribution). The types of censoring considered are left and right. Graphics for diagnostic analysis such as case-deletion and local influence techniques are provided to show its robust aspect against outlying and influential observations.
Cens.SMN.PCR(x, y, c, cens = "left", tt, nu = NULL, error = 10^-6, iter.max = 200, type = "Normal", alpha.FIX = TRUE, nu.FIX = TRUE, alpha.in = 10^-3, k = 1, Diagnostic = TRUE, a = 2)
Cens.SMN.PCR(x, y, c, cens = "left", tt, nu = NULL, error = 10^-6, iter.max = 200, type = "Normal", alpha.FIX = TRUE, nu.FIX = TRUE, alpha.in = 10^-3, k = 1, Diagnostic = TRUE, a = 2)
x |
Matrix or vector of covariates. |
y |
Vector of responses. |
c |
Vector of censoring indicators. For each observation: 1 if censored and 0 if non-censored. |
cens |
'left' for left censoring and 'right' for rigth censoring. |
tt |
Vector of values of a continuous covariate for the nonparametric component of the model. |
nu |
Initial value of the parameter of the SMN family. In the case of the Student-t and slash is a scalar, in the contaminated normal is a vector bidimensional. |
error |
The convergence maximum error. By default = 10^-6. |
iter.max |
The maximum number of iterations of the ECME algorithm. By default = 200. |
type |
Represents the type of distribution to be used in fitting: 'Normal' for normal, 'T' for Student-t, 'Slash' for slash and 'NormalC' for contaminated normal distribution respectively. By default ='Normal' |
alpha.FIX |
|
nu.FIX |
|
alpha.in |
Initial value of smoothing parameter. |
k |
For the local influence in explanatory variable perturbation, indicates the |
Diagnostic |
|
a |
The value for |
We consider a partial linear model which belongs to the class of semiparametric regression models with vector of response and with errors
which are independent and identically distributed according to a SMN distribution. To be more precise,
for , where
is an
vector with
being the distinct and ordered values of
;
is a
vector of incidence whose
-th element equals the indicator function
for
.
beta |
ECME estimates for the parametric component. |
sigma2 |
ECME estimates for the scale parameter. |
Alpha |
If |
AIC |
AIC criteria for model selection. |
ff |
ECME estimates for the nonparametric component. |
yest |
Predicted values of the model. |
loglik |
Value of the log-likelihood under the fitted model. |
iter |
Number of iterations of the ECME algorithm. |
nu |
If |
MI |
Observed information matrix. |
D |
A list of objects for diagnostic analysis that contains: the Hessian matrix ( |
For the contaminated normal case, if nu parameters were close to the bounds, i.e., close to 0 or 1, computational problems could arrise.
When alpha.FIX = FALSE
the algorithm may take a long time to converge. The package estimates the value in each iteration taking as an estimate the argument that maximizes the actual marginal log-likelihood function, already evaluated in the estimates of
and
. The diagnostic analysis is performed considering the estimated final value of
obtained in the last iteration of the ECME algorithm.
Marcela Nunez Lemus, Christian E. Galarza, Larissa Avila Matos and Victor H. Lachos.
Ferreira, C. S., & Paula, G. A. (2017). Estimation and diagnostic for skew-normal partially linear models. Journal of Applied Statistics, 44(16), 3033-3053.
Ibacache-Pulgar, G., Paula, G. A., & Cysneiros, F. J. A. (2013). Semiparametric additive models under symmetric distributions. Test, 22(1), 103-121.
Ibacache-Pulgar, G., & Paula, G. A. (2011). Local influence for Student-t partially linear models. Computational Statistics & Data Analysis, 55(3), 1462-1478.
dtawage = get(data(PSID1976,package = "AER")) y = dtawage$wage cc = c(rep(0,428),rep(1,325)) tt = dtawage$exper x = cbind(dtawage$education,dtawage$age, dtawage$hhours, dtawage$hwage, dtawage$tax, dtawage$youngkids, dtawage$oldkids) #Normal case by default with only 10 iterations PCR.default1 = Cens.SMN.PCR(x=x, y=y, c=cc, cens="left",tt =tt,iter.max = 10,Diagnostic = FALSE) ## Not run: #This may take few minutes #Normal case by default with full (200) iterations PCR.default2 = Cens.SMN.PCR(x=x, y=y, c=cc, cens="left",tt =tt) #contaminated normal case PCR.CN = Cens.SMN.PCR(x=x, y=y, c=cc, cens="left",tt =tt,type="NormalC", nu = c(0.1,0.1),iter.max = 100) ## End(Not run)
dtawage = get(data(PSID1976,package = "AER")) y = dtawage$wage cc = c(rep(0,428),rep(1,325)) tt = dtawage$exper x = cbind(dtawage$education,dtawage$age, dtawage$hhours, dtawage$hwage, dtawage$tax, dtawage$youngkids, dtawage$oldkids) #Normal case by default with only 10 iterations PCR.default1 = Cens.SMN.PCR(x=x, y=y, c=cc, cens="left",tt =tt,iter.max = 10,Diagnostic = FALSE) ## Not run: #This may take few minutes #Normal case by default with full (200) iterations PCR.default2 = Cens.SMN.PCR(x=x, y=y, c=cc, cens="left",tt =tt) #contaminated normal case PCR.CN = Cens.SMN.PCR(x=x, y=y, c=cc, cens="left",tt =tt,type="NormalC", nu = c(0.1,0.1),iter.max = 100) ## End(Not run)