S7 class for bulk RNA-seq phylotranscriptomic expression data. This class handles expression data with biological replicates and provides bootstrapping functionality for statistical analysis.
Usage
BulkPhyloExpressionSet(
strata = stop("@strata is required"),
strata_values = stop("@strata_values is required"),
expression = stop("@expression is required"),
groups = stop("@groups is required"),
name = "Phylo Expression Set",
species = character(0),
index_type = "TXI",
identities_label = "Identities",
null_conservation_sample_size = 5000L,
.null_conservation_txis = NULL,
.bootstrapped_txis = NULL
)
Arguments
- strata
Factor vector of phylostratum assignments for each gene
- strata_values
Numeric vector of phylostratum values used in TXI calculations
- expression
Matrix of expression counts with genes as rows and samples as columns
- groups
Factor vector indicating which identity each sample belongs to
- name
Character string naming the dataset (default: "Phylo Expression Set")
- species
Character string specifying the species (default: NULL)
- index_type
Character string specifying the transcriptomic index type (default: "TXI")
- identities_label
Character string labeling the identities (default: "Stages")
- null_conservation_sample_size
Numeric value for null conservation sample size (default: 5000)
- .null_conservation_txis
Precomputed null conservation TXI values (default: NULL)
- .bootstrapped_txis
Precomputed bootstrapped TXI values (default: NULL)
Details
The BulkPhyloExpressionSet class is designed for bulk RNA-seq data with biological replicates. It extends the base PhyloExpressionSetBase class with bulk-specific functionality.
Replicate Handling: Expression data across biological replicates is collapsed by taking row means within each experimental condition or developmental stage.
Computed Properties: In addition to inherited computed properties from the base class, this class provides:
expression_collapsed
- Matrix of expression data collapsed across replicates (genes x identities)bootstrapped_txis
- Matrix of bootstrapped TXI values for statistical inference (500 bootstrap samples x identities)
Inherited computed properties from PhyloExpressionSetBase include:
gene_ids
- Character vector of gene identifiersidentities
- Character vector of identity labelssample_names
- Character vector of sample namesnum_identities
- Integer count of unique identitiesnum_samples
- Integer count of total samplesnum_genes
- Integer count of genesnum_strata
- Integer count of phylostrataindex_full_name
- Full name of the transcriptomic index typegroup_map
- List mapping identity names to sample namesTXI
- Numeric vector of TXI values for each identityTXI_sample
- Numeric vector of TXI values for each samplenull_conservation_txis
- Matrix of null conservation TXI values for statistical testing
Statistical Analysis: The class supports confidence interval estimation and standard deviation calculation through bootstrapped TXI values, enabling robust statistical analysis of developmental or experimental patterns.