| Type: | Package | 
| Title: | Variable Selection via Tilted Correlation Screening Algorithm | 
| Version: | 1.1.1 | 
| Date: | 2016-12-22 | 
| Author: | Haeran Cho [aut, cre], Piotr Fryzlewicz [aut] | 
| Maintainer: | Haeran Cho <haeran.cho@bristol.ac.uk> | 
| Description: | Implements an algorithm for variable selection in high-dimensional linear regression using the "tilted correlation", a new way of measuring the contribution of each variable to the response which takes into account high correlations among the variables in a data-driven way. | 
| Depends: | R (≥ 2.14.0), mvtnorm | 
| License: | GPL-2 | GPL-3 [expanded from: GPL (≥ 2)] | 
| LazyLoad: | yes | 
| NeedsCompilation: | no | 
| Packaged: | 2016-12-26 10:05:59 UTC; h | 
| Repository: | CRAN | 
| Date/Publication: | 2016-12-26 12:25:13 | 
Variable Selection via Tilted Correlation Screening Algorithm
Description
Implements an algorithm for variable selection in high-dimensional linear regression using the "tilted correlation", a way of measuring the contribution of each variable to the response which takes into account high correlations among the variables in a data-driven way.
Details
| Package: | tilting | 
| Type: | Package | 
| Version: | 1.1.1 | 
| Date: | 2016-12-22 | 
| License: | GPL (>= 2) | 
| LazyLoad: | yes | 
The main function of the package is tilting.
Author(s)
Haeran Cho, Piotr Fryzlewicz
Maintainer: Haeran Cho <haeran.cho@bristol.ac.ukk>
References
H. Cho and P. Fryzlewicz (2012) High-dimensional variable selection via tilting, Journal of the Royal Statistical Society Series B, 74: 593-622.
Examples
X <- matrix(rnorm(100*100), 100, 100) # 100-by-100 design matrix
y <- apply(X[,1:5], 1, sum)+rnorm(100) # first five variables are significant
tilt <- tilting(X, y, op=2)
tilt$active.hat # returns the finally selected variables
Compute the L2 norm of each column
Description
The function returns a vector containing the L2 norm of each column for a given matrix.
Usage
col.norm(X)
Arguments
| X | a matrix for which the column norms are computed. | 
Value
A vector containing the L2 norm of the columns of X is returned.
Author(s)
Haeran Cho
Select a threshold for sample correlation matrix
Description
The function selects a threshold for sample correlation matrix.
Usage
get.thr(C, n, p, max.num = 1, alpha = NULL, step = NULL)
Arguments
| C | sample correlation matrix of a design matrix. | 
| n | the number of observations of the design matrix. | 
| p | the number of variables of the design matrix. | 
| max.num | the number of times for which the threshold selection procedure is repeated. Usually max.num==1 is used. | 
| alpha | The level at which the false discovery rate is controlled. When alpha==NULL, it is set to be 1/sqrt(p). | 
| step | the size of a step taken when screening the p(p-1)/2 off-diagonal elements of C. | 
Value
| thr | selected threshold. | 
| thr.seq | when max.num>1, the sequence of thresholds selected at each iteration. | 
Author(s)
Haeran Cho
References
H. Cho and P. Fryzlewicz (2012) High-dimensional variable selection via tilting, Journal of the Royal Statistical Society Series B, 74: 593-622.
Compute the least squares estimate on a given index set
Description
The function returns an estimate of the coefficient vector for a linear regression problem by setting the coefficients corresponding to a given index set to be the least squares estimate and the rest to be equal to zero.
Usage
lse.beta(X, y, active = NULL)
Arguments
| X | design matrix. | 
| y | response vector. | 
| active | the index set on which the least squares estimate is computed. | 
Value
An estimate of the coefficient vector is returned as above. If active==NULL, a vector of zeros is returned.
Author(s)
Haeran Cho
Compute the projection matrix onto a given set of variables
Description
The function computes the projection matrix onto a set of columns of a given matrix.
Usage
projection(X, active = NULL)
Arguments
| X | a matrix containing the columns onto which the projection matrix is computed. | 
| active | an index set of the columns of X. | 
Value
Returns the projection matrix onto the columns of "X" whose indices are included in "active". When active==NULL, a null matrix is returned.
Author(s)
Haeran Cho
Select the final model
Description
The function returns the final model as a subset of the active set chosen by Tilted Correlation Screening algorithm, for which the extended BIC is minimised.
Usage
select.model(bic.seq, active)
Arguments
| bic.seq | the sequence of extended BIC at each iteration. | 
| active | the index set of selected variables by Tilted Correlation Screening algorithm. | 
Value
The index set of finally selected variables is returned.
Author(s)
Haeran Cho
Hard-threshold a matrix
Description
For a given matrix and a threshold, the function performs element-wise hard-thresholding based on the absolute value of each element.
Usage
thresh(C, alph, eps = 1e-10)
Arguments
| C | a matrix on which the hard-thresholding is performed. | 
| alph | threshold. | 
| eps | effective zero. | 
Value
Returns the matrix C after hard-thresholding.
Author(s)
Haeran Cho
Variable selection via Tilted Correlation Screening algorithm
Description
Given a design matrix and a response vector, the function selects a threshold for the sample correlation matrix, computes an adaptive measure for the contribution of each variable to the response variable based on the thus-thresholded sample correlation matrix, and chooses a variable at each iteration. Once variables are selected in the "active" set, the extended BIC is used for the final model selection.
Usage
tilting(X, y, thr.step = NULL, thr.rep = 1, max.size = NULL, max.count = NULL,
op = 2, bic.gamma = 1, eps = 1e-10)
Arguments
| X | design matrix. | 
| y | response vector. | 
| thr.step | a step size used for threshold selection. When thr.step==NULL, it is chosen automatically. | 
| thr.rep | the number of times for which the threshold selection procedure is repeated. | 
| max.size | the maximum number of the variables conditional on which the contribution of each variable to the response is measured (when max.size==NULL, it is set to be half the number of observations). | 
| max.count | the maximum number of iterations. | 
| op | when op==1, rescaling 1 is used to compute the tilted correlation. If op==2, rescaling 2 is used. | 
| bic.gamma | a parameter used to compute the extended BIC. | 
| eps | an effective zero. | 
Value
| active | active set containing the variables selected over the iterations. | 
| thr.seq | a sequence of thresholds selected over the iterations. | 
| bic.seq | extended BIC computed over the iterations. | 
| active.hat | finally chosen variables using the extended BIC. | 
Author(s)
Haeran Cho
References
H. Cho and P. Fryzlewicz (2012) High-dimensional variable selection via tilting, Journal of the Royal Statistical Society Series B, 74: 593-622.
Examples
X<-matrix(rnorm(100*100), 100, 100) # 100-by-100 design matrix
y<-apply(X[,1:5], 1, sum)+rnorm(100) # first five variables are significant
tilt<-tilting(X, y, op=2)
tilt$active.hat # returns the finally selected variables