Skip to contents

Select genes with the highest variance across samples.

Usage

genes_top_variance(phyex_set, top_p = 0.99, top_k = NULL)

Arguments

phyex_set

A PhyloExpressionSet object

top_p

Quantile threshold for gene selection (default: 0.99). Ignored if top_k is specified.

top_k

Absolute number of top genes to select (default: NULL). Takes precedence over top_p.

Value

Character vector of gene IDs with variance >= top_p quantile or top top_k genes

Details

This function identifies genes with the highest variance across samples, which are often the most informative for downstream analyses.

Examples

# Select top 1% most variable genes
high_var_genes <- genes_top_variance(example_phyex_set, top_p = 0.99)

# Select top 500 most variable genes
top_500_var_genes <- genes_top_variance(example_phyex_set, top_k = 500)