Help for package poolVIM

Type:

Package

Title:

Gene-Based Association Tests using the Actual Impurity Reduction (AIR) Variable Importance

Version:

1.0.0

Date:

2018-07-25

Author:

Stefano Nembrini <stefanonembrini@gmail.com>

Maintainer:

Stefano Nembrini <stefanonembrini@gmail.com>

Description:

Gene-based association tests using the actual impurity reduction (AIR) variable importance. The function aggregates AIR importance measures from a group of SNPs or probes and outputs a p-value for each gene. The procedures builds upon the method described in <doi:10.1093/Bioinformatics/Bty373> and will be published soon.

License:

GPL-2 | GPL-3 [expanded from: GPL (≥ 2)]

LazyData:

TRUE

Imports:

stats, ranger

Depends:

R(≥ 3.3.1), EmpiricalBrownsMethod(≥ 1.6.0), Hmisc(≥ 4.1)

RoxygenNote:

6.0.1

NeedsCompilation:

Packaged:

2018-07-30 07:18:24 UTC; S

Repository:

CRAN

Date/Publication:

2018-07-30 17:30:08 UTC

fisher

Description

fisher

Usage

fisher(p, adjust, R)

Arguments

p

vector of pvalues

adjust

if correlation has to be taken into account

R

correlation matrix

gaussianize null variable importances

Description

gaussianize null variable importances

Usage

gaussianize(x, a)

Arguments

x

distr

a

value to interpolate/extrapolate

meff.

Description

meff.

Usage

m_effective(R)

Arguments

R

after the Actual Impurity Reduction Importance is computed with a Random Forest, pvalues from different probes or SNPs belonging to the same gene can be aggregated in order to obtain a single pvalue for that gene. Correlation between probes can also be taken into account.

Description

after the Actual Impurity Reduction Importance is computed with a Random Forest, pvalues from different probes or SNPs belonging to the same gene can be aggregated in order to obtain a single pvalue for that gene. Correlation between probes can also be taken into account.

Usage

poolVIM(rf, genenames, x, method = "Tippett", adjust)

Arguments

rf

a ranger object with "importance="impurity_corrected"

genenames

a vector of the name of the gene to which each probe or SNP belongs, it has to be of size dim(x)[1]

x

design matrix used by the random forest

method

one of Tippett, Fisher, Kost, EBM

adjust

"no" / "yes" depending if correlation has to be taken into account

Examples

n <- 250
x=replicate(50, runif(n))
dat <- data.frame(y = factor(rbinom(n, 1, .5)), x)
library(ranger)
rf <- ranger(y ~ ., dat, importance = "impurity_corrected",num.trees=100)
genenames=colnames(x)=rep(c("G1","G2"),50/2)
poolVIM(rf,genenames,x,method="Fisher",adjust="no")

tippett.

Description

tippett.

Usage

tippett(p, adjust, R)

Arguments

p

vector of pvalues

adjust

if correlation has to be taken into account

R

correlation matrix