The hardware and bandwidth for this mirror is donated by dogado GmbH, the Webhosting and Full Service-Cloud Provider. Check out our Wordpress Tutorial.
If you wish to report a bug, or if you are interested in having us mirror your free-software or open-source project, please feel free to contact us at mirror[@]dogado.de.

Type: Package
Title: Two-Sample Test in High Dimensions using Random Projection
Version: 0.1.4
Description: Performs the random projection test (Lopes et al., (2011) <doi:10.48550/arXiv.1108.2401>) for the one-sample and two-sample hypothesis testing problem for equality of means in the high dimensional setting. We are interested in detecting the mean vector in the one-sample problem or the difference between mean vectors in the two-sample problem.
License: GPL-3
Encoding: UTF-8
Imports: MASS, stats, glue
RoxygenNote: 7.3.1
NeedsCompilation: no
Packaged: 2024-06-03 09:32:33 UTC; jortialo
Author: Juan Ortiz Author [aut, cre, cph, rev]
Maintainer: Juan Ortiz Author <juan.ortiz1alonso@gmail.com>
Repository: CRAN
Date/Publication: 2024-06-04 09:44:51 UTC

Two-Sample Test in High Dimensions using Random Projection

Description

This function performs the random projection test (Lopes et al., (2011) <arXiv:1108.2401>) for the one-sample and two-sample hypothesis testing problem for equality of means in the high dimensional setting. We are interested in detecting the mean vector in the one-sample problem or the difference between mean vectors in the two-sample problem.

Usage

random_projection_test(X, Y = NULL, mu0 = NULL, proj_dimension = NULL)

Arguments

X

The n1-by-p observation matrix with numeric column variables.

Y

An optional n2-by-p observation matrix with numeric column variables. If NULL, one-sample test is conducted on X; otherwise, a two-sample test is conducted on X and Y.

mu0

The null hypothesis vector to be tested. If NULL, the default value is the 0 vector of lenght p.

proj_dimension

Dimension where to project the given samples. If NULL, the default value is floor(n/2), where n=n1 if Y=NULL or n=n1+n2-2 if not, as in Lopes et al.

Details

Since the matrix used to project the data into a lower-dimension subset is a random matrix, obtaining the exactly same p-values in two repetitions is not likely. However, power function has been proved to perform adequately in the vast majority of settings.

Value

statistic

Value of the test's statistic T_k^2.

p_value

The p-value of the test.

degrees_freedom

The degrees of freedom used for the F distribution, returns list(k, n-k+1).

null_value

Returns mu0.

method

Brief description of the test that has been carried out.

Author(s)

Juan Ortiz, <juan.ortiz1alonso@gmail.com>

References

Lopes, M. E., Jacob, L. J. & Wainwright, M. J. (2011). A More Powerful Two-Sample Test in High Dimensions using Random Projection. <arXiv:1108.2401>.

Examples

set.seed(10086)
# One-sample test
n1=30; p=120
X = matrix(rnorm(n1*p), nrow = n1, ncol = p)
res1 = random_projection_test(X)

# Two-sample test
n2=65
Y = matrix(rnorm(n2*p), nrow = n2, ncol = p)
res2 = random_projection_test(X, Y)

# Specify a null hypothesis vector
res3 = random_projection_test(X, Y, mu0 = rep(0.1, p))

# Choose a projection dimension manually, will work worse than previous example
res4 = random_projection_test(X, Y, mu0 = rep(0.1, p), proj_dimension = 4)

These binaries (installable software) and packages are in development.
They may not be fully stable and should be used with caution. We make no claims about them.
Health stats visible at Monitor.