R/extract_promotor_seqs_from_genome.R
extract_promotor_seqs_from_genome.Rd
Given a genome assembly file and an corresponding annotation file users can retrieve all upstream promotor sequences of all genes from a genome.
extract_promotor_seqs_from_genome(
annotation_file,
genome_file,
promotor_length = 500,
annotation_format = "gtf",
file_name = NULL,
path = NULL,
update = TRUE
)
file path to the annotation file of the genome assembly
in gtf
or gff
format.
file path to the genome assembly file.
width of upstream promotors. This is -promotor_width
bp from the transcription start site (TSS) of the gene.
format of the annotation file. Options are:
annotation_format = "gtf"
annotation_format = "gff"
file path to the output file storing the promotor sequences.
a file path to an output folder storing the promotor sequences.
shall previously generated promotor sequences be overwritten when again generated for the same genome assembly?