import_cds.Rd
This function reads organism-specific coding sequences (CDS) stored in a defined file format.
import_cds(file, format, delete_corrupt_cds = TRUE, ...)
file | a character string specifying the path to the file storing the CDS. |
---|---|
format | a character string specifying the file format used to store the CDS, e.g. "fasta", "fastq". |
delete_corrupt_cds | a logical value indicating whether sequences with corrupt base triplets should be removed from the input. |
... | additional arguments that are used by the |
A data.table storing the gene ID in the first column and the corresponding sequence as a string in the second column.
The import_cds function takes a string specifying the path to the CDS file of interest as its first argument.
It is possible to read in different CDS file standards such as fasta or fastq.
CDS stored in fasta files can be downloaded from http://www.ensembl.org/info/data/ftp/index.html.
Hajk-Georg Drost
if (FALSE) {
# reading a cds file stored in fasta format
Ath_cds <- import_cds(system.file('seqs/ortho_thal_cds.fasta', package = 'homologr'), format = "fasta")

# look at results
Ath_cds
}
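The following is a minimal sketch of the kind of filtering that delete_corrupt_cds = TRUE describes, written in base R so it runs without the package. It is not the package's implementation. It assumes that a "corrupt" CDS is one whose length is not a multiple of three, and the column names geneids and seqs, like the toy sequences, are hypothetical. A plain data.frame stands in for the data.table that import_cds returns.

```r
# Toy two-column table: gene ID first, sequence string second.
# Column names and sequences are hypothetical, chosen for illustration.
cds <- data.frame(
  geneids = c("gene1", "gene2", "gene3"),
  seqs    = c("ATGGCCTAA", "ATGGC", "ATGAAATTTTAG"),
  stringsAsFactors = FALSE
)

# Assumption: a sequence has corrupt base triplets when its length
# cannot be split evenly into codons of three bases.
is_complete <- nchar(cds$seqs) %% 3 == 0

# Keep only the sequences made of complete triplets.
clean_cds <- cds[is_complete, ]
clean_cds
```

Here gene2 has five bases and is dropped, while gene1 and gene3 are kept. The actual check performed by import_cds may differ from this length-based rule.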