This function reads an organism-specific proteome stored in a defined file format.

import_proteome(file, format, ...)

Arguments

file

a character string specifying the path to the file storing the proteome.

format

a character string specifying the file format used to store the proteome, e.g. "fasta", "fastq".

...

additional arguments that are passed on to the readAAStringSet function of the Biostrings package.
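
As an illustration of what can be supplied through ..., the sketch below calls readAAStringSet directly with two of its documented arguments, nrec (maximum number of records to read) and skip (number of records to skip first). The temporary file and its three toy sequences are made up for this example; whether import_proteome forwards these arguments unchanged is assumed from the description above, not verified against the package source.

```r
library(Biostrings)  # provides readAAStringSet()

# Toy FASTA file with three made-up protein records (illustration only)
tmp <- tempfile(fileext = ".fasta")
writeLines(c(">gene1", "MKTAYIAKQR",
             ">gene2", "MSDNELLK",
             ">gene3", "MAAGT"), tmp)

# Read at most the first two records
first_two <- readAAStringSet(tmp, format = "fasta", nrec = 2L)

# Skip the first record and read the rest
after_first <- readAAStringSet(tmp, format = "fasta", skip = 1L)

names(first_two)
names(after_first)
```

Under that assumption, a call such as import_proteome(file, format = "fasta", nrec = 2L) would restrict the import to the first two records in the same way.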

Value

A data.table storing the gene ID in the first column and the corresponding amino acid sequence as a character string in the second column.
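
To make the shape of the return value concrete, here is a minimal sketch of how a table of this kind could be built from readAAStringSet output. It is not the package's implementation: the helper name sketch_import_proteome, the column names geneids and seqs, the trimming of headers at the first whitespace, and the toy sequences are all assumptions chosen for illustration.

```r
library(Biostrings)   # provides readAAStringSet()
library(data.table)

# Hypothetical helper (not the package's own code): read a proteome file
# and return a two-column data.table of identifiers and sequences.
sketch_import_proteome <- function(file, format = "fasta", ...) {
  aa <- Biostrings::readAAStringSet(filepath = file, format = format, ...)
  # Illustrative choice: keep only the header text before the first whitespace
  ids <- sub("\\s.*$", "", names(aa))
  data.table::data.table(geneids = ids, seqs = unname(as.character(aa)))
}

# Toy FASTA file with two made-up records
tmp <- tempfile(fileext = ".fasta")
writeLines(c(">gene1 example protein", "MKTAYIAKQR",
             ">gene2", "MSDNELLK"), tmp)

proteome <- sketch_import_proteome(tmp, format = "fasta")
print(proteome)
```

Each row corresponds to one record in the input file, so the number of rows equals the number of sequences read.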

Details

The import_proteome function takes a string specifying the path to the proteome file of interest as its first argument.

It is possible to read in different proteome file standards such as fasta or fastq.

Proteomes stored in fasta files can be downloaded from http://www.ebi.ac.uk/reference_proteomes.

Author

Hajk-Georg Drost

Examples

if (FALSE) {
# reading a proteome stored in a fasta file
Ath_proteome <- import_proteome(system.file('seqs/ortho_thal_aa.fasta', package = 'homologr'), format = "fasta")

# look at results
Ath_proteome
}