Article,

Variable selection via nonconcave penalized likelihood and its oracle properties

J. Fan, and R. Li.
Journal of the American Statistical Association, 96 (456): 1348--1360 (December 2001)
DOI: 10.1198/016214501753382273

Abstract

Variable selection is fundamental to high-dimensional statistical modeling, including nonparametric regression. Many approaches in use are stepwise selection procedures, which can be computationally expensive and ignore stochastic errors in the variable selection process. In this article, penalized likelihood approaches are proposed to handle these kinds of problems. The proposed methods select variables and estimate coefficients simultaneously. Hence they enable us to construct confidence intervals for estimated parameters. The proposed approaches are distinguished from others in that the penalty functions are symmetric, nonconcave on (0, ∞), and have singularities at the origin to produce sparse solutions. Furthermore, the penalty functions should be bounded by a constant to reduce bias and satisfy certain conditions to yield continuous solutions. A new algorithm is proposed for optimizing penalized likelihood functions. The proposed ideas are widely applicable. They are readily applied to a variety of parametric models such as generalized linear models and robust regression models. They can also be applied easily to nonparametric modeling by using wavelets and splines. Rates of convergence of the proposed penalized likelihood estimators are established. Furthermore, with proper choice of regularization parameters, we show that the proposed estimators perform as well as the oracle procedure in variable selection; namely, they work as well as if the correct submodel were known. Our simulation shows that the newly proposed methods compare favorably with other variable selection techniques. Furthermore, the standard error formulas are tested to be accurate enough for practical applications.

BibTeX key: fan_variable_2001
entry type: article
year: 2001
month: dec
journal: Journal of the American Statistical Association
number: 456
pages: 1348--1360
volume: 96
issn: 0162-1459
DOI: 10.1198/016214501753382273
urldate: 2016-11-22
url: http://dx.doi.org/10.1198/016214501753382273

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

%0 Journal Article %1 fan_variable_2001 %A Fan, Jianqing %A Li, Runze %D 2001 %J Journal of the American Statistical Association %K Hard LASSO, Nonnegative SCAD, Soft estimator garrote, oracle, sparse thresholding, %N 456 %P 1348--1360 %R 10.1198/016214501753382273 %T Variable selection via nonconcave penalized likelihood and its oracle properties %U http://dx.doi.org/10.1198/016214501753382273 %V 96 %X Variable selection is fundamental to high-dimensional statistical modeling, including nonparametric regression. Many approaches in use are stepwise selection procedures, which can be computationally expensive and ignore stochastic errors in the variable selection process. In this article, penalized likelihood approaches are proposed to handle these kinds of problems. The proposed methods select variables and estimate coefficients simultaneously. Hence they enable us to construct confidence intervals for estimated parameters. The proposed approaches are distinguished from others in that the penalty functions are symmetric, nonconcave on (0, ∞), and have singularities at the origin to produce sparse solutions. Furthermore, the penalty functions should be bounded by a constant to reduce bias and satisfy certain conditions to yield continuous solutions. A new algorithm is proposed for optimizing penalized likelihood functions. The proposed ideas are widely applicable. They are readily applied to a variety of parametric models such as generalized linear models and robust regression models. They can also be applied easily to nonparametric modeling by using wavelets and splines. Rates of convergence of the proposed penalized likelihood estimators are established. Furthermore, with proper choice of regularization parameters, we show that the proposed estimators perform as well as the oracle procedure in variable selection; namely, they work as well as if the correct submodel were known. Our simulation shows that the newly proposed methods compare favorably with other variable selection techniques. Furthermore, the standard error formulas are tested to be accurate enough for practical applications.

@article{fan_variable_2001, abstract = {Variable selection is fundamental to high-dimensional statistical modeling, including nonparametric regression. Many approaches in use are stepwise selection procedures, which can be computationally expensive and ignore stochastic errors in the variable selection process. In this article, penalized likelihood approaches are proposed to handle these kinds of problems. The proposed methods select variables and estimate coefficients simultaneously. Hence they enable us to construct confidence intervals for estimated parameters. The proposed approaches are distinguished from others in that the penalty functions are symmetric, nonconcave on (0, ∞), and have singularities at the origin to produce sparse solutions. Furthermore, the penalty functions should be bounded by a constant to reduce bias and satisfy certain conditions to yield continuous solutions. A new algorithm is proposed for optimizing penalized likelihood functions. The proposed ideas are widely applicable. They are readily applied to a variety of parametric models such as generalized linear models and robust regression models. They can also be applied easily to nonparametric modeling by using wavelets and splines. Rates of convergence of the proposed penalized likelihood estimators are established. Furthermore, with proper choice of regularization parameters, we show that the proposed estimators perform as well as the oracle procedure in variable selection; namely, they work as well as if the correct submodel were known. Our simulation shows that the newly proposed methods compare favorably with other variable selection techniques. Furthermore, the standard error formulas are tested to be accurate enough for practical applications.}, added-at = {2017-01-09T13:57:26.000+0100}, author = {Fan, Jianqing and Li, Runze}, biburl = {https://www.bibsonomy.org/bibtex/282d75ae5c0da99ad3f5e56e730ab39a0/yourwelcome}, doi = {10.1198/016214501753382273}, interhash = {b87c357c42be69c4a72c87e3d7d2b04f}, intrahash = {82d75ae5c0da99ad3f5e56e730ab39a0}, issn = {0162-1459}, journal = {Journal of the American Statistical Association}, keywords = {Hard LASSO, Nonnegative SCAD, Soft estimator garrote, oracle, sparse thresholding,}, month = dec, number = 456, pages = {1348--1360}, timestamp = {2017-01-09T14:01:11.000+0100}, title = {Variable selection via nonconcave penalized likelihood and its oracle properties}, url = {http://dx.doi.org/10.1198/016214501753382273}, urldate = {2016-11-22}, volume = 96, year = 2001 }

BibSonomy

Variable selection via nonconcave penalized likelihood and its oracle properties

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on