High-dimensional variable screening and bias in subsequent inference, with an empirical comparison


Loading...

Date

2014-06

Publication Type

Journal Article

ETH Bibliography

yes

Citations

Altmetric

Data

Abstract

We review variable selection and variable screening in high-dimensional linear models. Thereby, a major focus is an empirical comparison of various estimation methods with respect to true and false positive selection rates based on 128 different sparse scenarios from semi-real data (real data covariables but synthetic regression coefficients and noise). Furthermore, we present some theoretical bounds for the bias in subsequent least squares estimation, using the selected variables from the first stage, which have direct implications for construction of p-values for regression coefficients.

Publication status

published

Editor

Book title

Volume

29 (3)

Pages / Article No.

407 - 430

Publisher

Springer

Event

Edition / version

Methods

Software

Geographic location

Date collected

Date created

Subject

Elastic net; Lasso; Linear model; Ridge; Sparsity; Sure independence screening; Variable selection

Organisational unit

03502 - Bühlmann, Peter L. / Bühlmann, Peter L. check_circle

Notes

It was possible to publish this article open access thanks to a Swiss National Licence with the publisher.

Funding

Related publications and datasets