This function takes an ExpressionSet object and removes genes from the gene expression matrix that
have an expression level below, above, or below AND above a defined cut.off
value. Hence, this function allows to remove
genes that have been defined as not expressed or outliers and returns an ExpressionSet
retaining only expressed genes.
Arguments
- ExpressionSet
a standard PhyloExpressionSet or DivergenceExpressionSet object.
- cut.off
a numeric value specifying the expression cut-off to define genes as not expressed (
comparison = "below"
) , outliers (comparison = "above"
), or both (comparison = "both"
). Seecomparison
for details. In casecomparison = "both"
, thecut.off
argument must be a two dimensional vector defining the lowercut.off
value at the first position and the uppercut.off
value at the second position.- method
a method defining how to treat gene expression values in multiple stages. The corresponding method that is chosen allows to control the stage-wise fulfillment of the threshold criteria. Options are
"const"
,"min-set"
, and"n-set"
.- comparison
a character string specifying whether genes having expression levels below, above, or below AND above (both) the
cut.off
value should be excluded from the dataset. In casecomparison = "both"
is chosen, thecut.off
argument must be a two dimensional vector defining the lowercut.off
value at the first position and the uppercut.off
value at the second position.- n
a numeric value for
method = "n-set"
.
Details
This filter function allows users to remove genes from the ExpressionSet
object that undercut or exceed a certain expression level cut.off
.
Following extraction criteria are implemented in this function:
const
: all genes that have at least one stage that undercuts or exceeds the expressioncut.off
will be excluded from theExpressionSet
. Hence, for a 7 stageExpressionSet
genes passing the expression levelcut.off
in 6 stages will be retained in theExpressionSet
.min-set
: genes passing the expression levelcut.off
inceiling(n/2)
stages will be retained in theExpressionSet
, where n is the number of stages in theExpressionSet
.n-set
: genes passing the expression levelcut.off
inn
stages will be retained in theExpressionSet
. Here, the argumentn
needs to be specified.
Examples
data(PhyloExpressionSetExample)
# remove genes that have an expression level below 8000
# in at least one developmental stage
FilterConst <- Expressed(ExpressionSet = PhyloExpressionSetExample,
cut.off = 8000,
method = "const",
comparison = "below")
dim(FilterConst) # check number of retained genes
#> [1] 449 9
# remove genes that have an expression level below 8000
# in at least 3 developmental stages
# (in this case: ceiling(7/2) = 4 stages fulfilling the cut-off criteria)
FilterMinSet <- Expressed(ExpressionSet = PhyloExpressionSetExample,
cut.off = 8000,
method = "min-set",
comparison = "below")
dim(FilterMinSet) # check number of retained genes
#> [1] 830 9
# remove genes that have an expression level below 8000
# in at least 5 developmental stages (in this case: n = 2 stages fulfilling the criteria)
FilterNSet <- Expressed(ExpressionSet = PhyloExpressionSetExample,
cut.off = 8000,
method = "n-set",
comparison = "below",
n = 2)
dim(FilterMinSet) # check number of retained genes
#> [1] 830 9
# remove expression levels that exceed the cut.off criteria
FilterMinSet <- Expressed(ExpressionSet = PhyloExpressionSetExample,
cut.off = 12000,
method = "min-set",
comparison = "above")
dim(FilterMinSet) # check number of retained genes
#> [1] 24456 9
# remove expression levels that undercut AND exceed the cut.off criteria
FilterMinSet <- Expressed(ExpressionSet = PhyloExpressionSetExample,
cut.off = c(8000,12000),
method = "min-set",
comparison = "both")
dim(FilterMinSet) # check number of retained genes
#> [1] 111 9