| Type: | Package | 
| Title: | Optimal Transport-Based Kernel Regression | 
| Version: | 1.0.0 | 
| Date: | 2025-03-04 | 
| Description: | Perform optimal transport on somatic point mutations and kernel regression hypothesis testing by integrating pathway level similarities at the gene level (Little et al. (2023) <doi:10.1111/biom.13769>). The software implements balanced and unbalanced optimal transport and omnibus tests with 'C++' across a set of tumor samples and allows for multi-threading to decrease computational runtime. | 
| Encoding: | UTF-8 | 
| Imports: | Rcpp, smarter, ggplot2, survival, MiRKAT | 
| LinkingTo: | Rcpp, RcppArmadillo | 
| RoxygenNote: | 7.2.3 | 
| License: | GPL (≥ 3) | 
| Suggests: | rmarkdown, knitr, reshape2, ggdendro | 
| VignetteBuilder: | knitr | 
| NeedsCompilation: | yes | 
| Packaged: | 2025-03-04 22:23:50 UTC; Admin | 
| Author: | Paul Little [aut, cre] | 
| Maintainer: | Paul Little <pllittle321@gmail.com> | 
| Repository: | CRAN | 
| Date/Publication: | 2025-03-06 17:00:02 UTC | 
kOT_sim_AGG
Description
kOT_sim_AGG
Usage
kOT_sim_AGG(work_dir)
Arguments
| work_dir | A full path to create "sim_ROKET" and subdirectories | 
Value
Nothing. Png files are created within the simulation ROKET directory.
kOT_sim_OT
Description
kOT_sim_OT
Usage
kOT_sim_OT(work_dir, NN, nGENE, nPATH, SCEN, ncores = 1)
Arguments
| work_dir | A full path to create "sim_ROKET" and subdirectories | 
| NN | A positive integer for sample size | 
| nGENE | A positive integer for number of genes to simulate | 
| nPATH | A positive integer for number of pathways to simulate | 
| SCEN | An integer taking values 1, 2, 3, or 4 | 
| ncores | A positive integer specifying the number of cores/threads to use for optimal transport calculations | 
Value
Nothing. Rds files are created within the simulation ROKET directory.
kOT_sim_REG
Description
kOT_sim_REG
Usage
kOT_sim_REG(work_dir, NN, nGENE, nPATH, SCEN, rr)
Arguments
| work_dir | A full path to create "sim_ROKET" and subdirectories | 
| NN | A positive integer for sample size | 
| nGENE | A positive integer for number of genes to simulate | 
| nPATH | A positive integer for number of pathways to simulate | 
| SCEN | An integer taking values 1, 2, 3, or 4 | 
| rr | A positive integer indexing a replicate | 
Value
Nothing. A rds file is created within the simulation ROKET directory.
kOT_sim_make
Description
Generates simulation files
Usage
kOT_sim_make(work_dir, NN = 200, nGENE = 500, nPATH = 25, RR = 200)
Arguments
| work_dir | A full path to create "sim_ROKET" and subdirectories | 
| NN | A positive integer for sample size | 
| nGENE | A positive integer for number of genes to simulate | 
| nPATH | A positive integer for number of pathways to simulate | 
| RR | A positive integer for number of replicates to simulate | 
Value
Nothing. Rds files are created within the simulation ROKET directory.
kernTEST
Description
kernTEST
Usage
kernTEST(
  RESI = NULL,
  KK,
  YY = NULL,
  XX = NULL,
  OMNI,
  nPERMS = 1e+05,
  ncores = 1
)
Arguments
| RESI | A numeric vector of null model residuals
 | 
| KK | An array containing double-centered positive semi-definite
kernel matrices. Refer to  | 
| YY | A numeric vector of continuous outcomes to be fitted in a linear model. Defaults to NULL for survival model. | 
| XX | A numeric data matrix with first column for intercept, a column of ones. | 
| OMNI | A matrix of zeros and ones. Each column corresponds to a
distance matrix while each row corresponds to an omnibus test. Set
 | 
| nPERMS | A positive integer to specify the number of permutation-based p-value calculation | 
| ncores | A positive integer for the number of cores/threads to reduce computational runtime when running for loops | 
Value
A R list of p-values and omnibus p-values.
run_myOT
Description
Runs balanced or unbalanced optimal transport on two input vectors
Usage
run_myOT(
  XX,
  YY,
  COST,
  EPS,
  LAMBDA1,
  LAMBDA2 = NULL,
  balance = FALSE,
  conv = 1e-05,
  max_iter = 3000,
  verbose = TRUE,
  show_iter = 50
)
Arguments
| XX | A numeric vector of positive masses | 
| YY | A numeric vector of positive masses | 
| COST | A numeric matrix of non-negative values
representing the costs to transport masses between
features of  | 
| EPS | A positive numeric value representing the tuning parameter for entropic regularization. | 
| LAMBDA1 | A non-negative numeric value representing
the tuning parameter penalizing the distance between  | 
| LAMBDA2 | A non-negative numeric value representing
the tuning parameter penalizing the distance between  | 
| balance | Boolean set to  | 
| conv | A positive numeric value to determine 
algorithmic convergence. The default value is  | 
| max_iter | A positive integer denoting the maximum iterations to run the algorithm. | 
| verbose | Boolean value to display verbose function output. | 
| show_iter | A positive integer to display iteration details
at multiples of  | 
Value
A R list containing the optimal transport matrix and associated distance metric.
run_myOTs
Description
run_myOTs
Usage
run_myOTs(
  ZZ,
  COST,
  EPS,
  LAMBDA1,
  LAMBDA2 = NULL,
  balance,
  conv = 1e-05,
  max_iter = 3000,
  ncores = 1,
  verbose = TRUE,
  show_iter = 50
)
Arguments
| ZZ | A numeric matrix of non-negative mass to transport. Rows correspond to features (e.g. genes) and columns correspond to samples or individuals. Each column must have strictly positive mass | 
| COST | A numeric square matrix of non-negative values representing the non-negative costs to transport masses between pairs of features | 
| EPS | A positive numeric value representing the tuning parameter for entropic regularization. | 
| LAMBDA1 | A non-negative numeric value representing
the tuning parameter penalizing the distance between  | 
| LAMBDA2 | A non-negative numeric value representing
the tuning parameter penalizing the distance between  | 
| balance | Boolean set to  | 
| conv | A positive numeric value to determine 
algorithmic convergence. The default value is  | 
| max_iter | A positive integer denoting the maximum iterations to run the algorithm. | 
| ncores | A positive integer for the number of cores/threads to reduce computational runtime when running for loops | 
| verbose | Boolean value to display verbose function output. | 
| show_iter | A positive integer to display iteration details
at multiples of  | 
Value
A R numeric matrix of pairwise distances.