The hardware and bandwidth for this mirror is donated by dogado GmbH, the Webhosting and Full Service-Cloud Provider. Check out our Wordpress Tutorial.
If you wish to report a bug, or if you are interested in having us mirror your free-software or open-source project, please feel free to contact us at mirror[@]dogado.de.
m_causal=1 bug (used to die, now works
correctly).sigmaSq to sigma_sq
(parameter of sim_trait and cov_trait).sim_trait return list has more descriptive names:
trait (old y)causal_indexes (old i)causal_coeffs (old beta)popkin 1.2.0.9000
changes.README changes in parallel with
bnpsd 1.1.0.9000 changes.allele_freqs, which handles
BEDMatrix objects correctly. Rest of sim_trait
doesn’t handle BEDMatrix objects yet.sim_trait, and consequently the whole package,
supports BEDMatrix objects. This is ideal for simulating
traits from real and potentially very large genotype data.allele_freqs to load genotype chunks from a
BEDMatrix object more efficiently, using as much available
memory as possible. Requires an updated popkin package
(>= 1.2.6.9000).class usage now that matrices
return a two-element array in R-develman/figures/sim_trait changed default maf_cut
from 0.05 to NA, and in that case code no longer computes
marginal allele frequencies unless they are needed because
p_anc is missing (and kinship is provided).
This is much faster and memory efficient in extremely large simulations,
and just makes more sense to me generally (the MAF threshold was a
quirky feature, still available but non-default now).sim_trait now delays calculating allele
frequencies if maf_cut = NA and p_anc is
missing (and kinship is provided), so that these
frequencies are only calculated on the small subset of loci selected to
be causal (rather than the whole genome, which was the original
behavior). This is expected to speed-up trait simulations from real
genotypes.sim_trait_mvn, which draws traits from
the multivariate normal distribution with covariance structure matching
that of genetic traits (also called “infinitesimal” model). These traits
are useful for testing heritability estimation. A visual validation of
these simulated traits is available in the vignette.alelle_freq was modified to reduce memory
usage for the BEDMatrix case.
sim_trait, the
BEDMatrix-specific options mem_factor and
mem_lim were replaced by the option
m_chunk_max.popkin is no longer a
dependency of this simtrait package. However,
popkin is still recommended for estimating the kinship
matrices required by some of the functions of this simtrait
package.simtrait:
rmsd: General root-mean-square deviation (RMSD)
functionpval_srmsd: Signed RMSD between observed and expected
(uniform) p-valuespval_aucpr: Area under the curve (AUC) of
precision-recall (PR) curve
PRROC package dependencypval_infl (classic inflation factor, but
from p-values) to complement yesterday’s new functionsallele_freqs added option fold,
which if TRUE returns folded i.e. minor
allele frequencies. Default is FALSE, to return allele
frequencies for the alternative allele (the allele counted as it given
in the genotype matrix, whether it is the minor or major allele).sim_trait added option
const_herit_loci, which when TRUE constructs
causal coefficients as inversely proportional to the square root of
p*(1-p), where p is the ancestral allele
frequency. This ensures equal per-causal-locus contribution to trait
variance. Default draws causal coefficients randomly from a standard
normal distribution, rescaled to result in the desired heritability, for
unequal per-locus contribution to trait variance.herit_loci, which calculates per-locus
heritabilities based on variance formula (in terms of allele
frequencies, coefficients, and overall trait variance factor).
sim_trait in unit tests.sim_trait with option
const_herit_loci = TRUE now adds random signs (+/-) to the
causal coefficients.
sim_trait now requires that p_anc
have the same length as the number of loci in X (stops with
an error otherwise). Previously this was not checked and could return
traits that were NA for all individuals without clear
indications that anything was wrong.pval_srmsd now accepts
causal_indexes = NULL to handle cases where all p-values
are null (before a NULL input would cause an error).sim_trait renamed option
const_herit_loci to fes (fixed effect sizes),
the language used in the paper.NEWS.md slightly to improve its
automatic parsing.LazyData: true (to avoid a new “NOTE” on
CRAN).URL and
BugReports)par() in vignette examples.sim_trait fixed an important bug resulting in
misspecified heritability!
rnorm the desired variance
(1 - herit) * sigma_sq where the standard deviation (its
square root) was required!herit value by
herit / ( herit + (1-herit)^2 * sigma_sq ). If
sigma_sq = 1 (default), the effective heritability was
always larger than desired!sim_trait bug and its fix.inbr_diag in plot_popkin calls,
which in this case made diagonal values larger (as they were larger than
1), among other minor adjustments.sim_trait, sim_trait_mvn, and
cov_trait: added parameters labs and
labs_sigma_sq to simulate/model non-genetic group effects.
sim_trait also now returns
group_effects as one of the named elements of the return
list.sim_trait parameters p_anc and
kinship now have default NULL values (used to
not have default values), to facilitate scripting in cases that can be
either simulated or real genotypes.pval_type_1_err and
pval_power_calib for calculating type I error rates and
calibrated power, respectively.pval_type_1_err and
pval_power_calib:
alpha can be a vector, and return value is a
vector of estimates, one for each alpha.1/alpha.pval_type_1_err and
pval_power_calib:
pvals is NULL and
alpha had more than one value, these functions incorrectly
returned a scalar NA, now they return a vector with
NAs the length of alpha.pval_infl if input pvals = NULL
now returns NA (used to return
numeric(0)).sim_trait, sim_trait_mvn,
cov_trait:
check_labs renamed to
check_herit_labs and:
herit is now firstherit checks that used to be in upstream
functions into hereThese binaries (installable software) and packages are in development.
They may not be fully stable and should be used with caution. We make no claims about them.
Health stats visible at Monitor.