orthologr version 0.4.3
New Function
-
deepclust()enablesdiamond deepclustfunctionalities for clustering protein sequences. Multiple input.fastafiles or.fasta.gzare supported. -
deepclust_realign()enablesdiamond deepclustfunctionalities for realigning protein sequences from the output ofdeepclust(). Multiple input.fastafiles or.fasta.gzare supported. -
deepclust_annotate()enablesdiamond deepclustfunctionalities for annotating protein sequences from the output ofdeepclust()based on user provided names for each proteome file (or the name of the proteome file if not provided).deepclust_annotate()can also be used to annotate the output ofdeepclust_realign(). Multiple input.fastafiles or.fasta.gzare supported.
Miscellaneous
- Unit tests for
deepclust(),deepclust_realign()anddeepclust_annotate()have been added to ensure the correct functionality of the new features. - Unit tests for
dNdS(),divergence_stratigraphy()anddivergence_map()functions have been created.
orthologr version 0.4.2
New Function
-
diamond(),set_diamond(),diamond_best()anddiamond_rec()enable a massive speed-up in pairwise sequence alignment functionalities.
New Features
- a logo is now set for
orthologr. - DIAMOND2 is now used by default in
dNdS()anddivergence_stratigraphy()unless blast is specified (aligner = "blast").
orthologr version 0.4.1
New Features
- the
divergence_stratigraphy()anddivergence_map()functions now include the parametern_quantile, which enables users to choose the number of quantiles to generate for thedivergence map. This could allow users to get a higher-resolutiondivergence mapifn_quantileis greater than 10. Alternatively, this can resolve the issue of empty divergence strata when deciling the dNdS values for closely related organisms with dNdS = 0 for over 10% of the genes.
orthologr version 0.4.0
New Functions
- new function
check_annotation()helps to detect corrupt GFF or GTF annotation files and removes such outlier lines
New Features
the
generate_ortholog_tables()andretrieve_longest_isoforms()now include the newcheck_annotation()function to capture corrupt GFF or GTF files and fix themadding a new argument
of_pathtoorthofinder2()to allow users to specify their own path to their locally installedorthofinderexecutableadding new argument
tasktomap_generator_lnc()andorthologs_lnc()to allow users to use the fullblastnrange provided by blast_nucleotide_to_nucleotide()adding new argument
pathtomap_generator_lnc()to allow users to specify their local installation path of BLAST
orthologr version 0.3.0
New Functions
new function
plot_pairwise_orthologs()allows users to plot pairwise orthologs tables for multiple pairwise comparisonsnew function
retrieve_core_orthologs()allows users to retrieve a core set of orthologous gene loci from several pairwise ortholog tablesnew functions
generate_ortholog_tables()andgenerate_ortholog_tables_all()allow users to generate ortholog tables by gene locus and splice varaint for a set of speciesnew function
retrieve_longest_isoforms_all()allows users to specify folders and retrieve the longest splice variants for all proteomes stored in a foldernew functions
translate_cds_to_protein()andtranslate_cds_to_protein_all()which translate coding sequences into amino acid sequences for single or multiple files
New Features
in
orthologs()the default value ofdelete_corrupt_cdschanged fromdelete_corrupt_cds = TRUEtodelete_corrupt_cds = FALSEto be consistent withdNdS()anddivergence_stratigraphy()the
divergence_stratigraphy()function received a new argumentdnds_est.methodwhich now allows users to select different dNdS estimation methods when runningdivergence_stratigraphy()(suggested by Momir Futo)the
divergence_stratigraphy()function now allows to change theevalargument which wasn’t passed down to thedNdS()call within the function (Many thanks to Momir Futo)the function
map.generator()was renamed tomap_generator_dnds()to be more consistent with the notation of other functionsthe function
map.generator.lnc()was renamed tomap_generator_lnc()to be more consistent with the notation of other functionsthe function
DivergenceMap()was renamed todivergence_map()to be more consistent with the notation of other functionsthe function
DivergenceMap()was renamed todivergence_map()to be more consistent with the notation of other functionsthe function
orthologs.lncwas renamed toorthologs_lncto be more consistent with the notation of other functionsthe function
OF2CoreOrthologs()was renamed toorthofinder2_retrieve_core_orthologs()
Removed Functions
the function
advanced_blast()is not supported anymore and thus is not available to users anymore (please consult the metablastr package in case you need this functionality)the function
advanced_makedb()is not supported anymore and thus is not available to users anymore (please consult the metablastr package in case you need this functionality)the function
blast.nr()is not supported anymore and thus is not available to users anymore (please consult the metablastr package in case you need this functionality)the function
delta.blast()is not supported anymore and thus is not available to users anymore (please consult the metablastr package in case you need this functionality)the function
ProteinOrtho()is not supported anymore and thus is not available to users anymore
orthologr version 0.2.0
New Features
- new function
retrieve_longest_isoforms()which enables retrieval of the longest isoforms from a proteome file and save results as fasta file for downstream analyses - new function
OF2CoreOrthologs()to retrieve core orthologs across multiple species from Orthofinder2 output - new function
extract_features(): Helper function to extract gene loci and splice variant IDs from GFF files - new function
filter_best_hits(): Helper function to select best BLAST hit based on minimum evalue - new function
generate_ortholog_tables(): Generate ortholog tables by gene locus and splice varaint
orthologr version 0.1.0
New Features
-
read.cds()now trimms corrupted CDS (= CDS not divisible by 3) whendelete_corrupt_cds = FALSEis specified - the new default value of argument
delete_corrupt_cdsindNdS()is nowFALSE. Thus, given the new trimming feature inread.cds(), corrupted CDS equences will be trimmed before being translated.
orthologr version 0.0.5
Bug fixes
- The default setting of the
BLASTargumentmax_target_seqs 1was removed fromblast_best()andblast_rec()due to the misunderstood functionality of theBLASTargument (See details here and #9 ; Many thanks to @armish)
orthologr version 0.0.4
New Features
- Users can now control via the new
delete_corrupt_cdsargument indNdS()and related downstream functions whether or not corrupted input coding sequences shall be removed prior to dN/dS inference. In case corrupted CDS exist, thedNdS()function will now generate a separate fasta file which stores all corrupted CDS so that they can be investigated. See issue #8 for details.
Function updates
dNdS()receives new argumentdelete_corrupt_cdsto remove corrupted input coding sequences (delete_corrupt_cdsis set toTRUEas default)read.cds()receives new argumentdelete_corrupt_cdsto remove corrupted input coding sequences (delete_corrupt_cdsis set toTRUEas default)cds2aa()receives new argumentdelete_corrupt_cdsto remove corrupted input coding sequences (delete_corrupt_cdsis set toTRUEas default)set_blast()receives new argumentdelete_corrupt_cdsto remove corrupted input coding sequences (delete_corrupt_cdsis set toTRUEas default)blast()receives new argumentdelete_corrupt_cdsto remove corrupted input coding sequences (delete_corrupt_cdsis set toTRUEas default)blast_best()receives new argumentdelete_corrupt_cdsto remove corrupted input coding sequences (delete_corrupt_cdsis set toTRUEas default)blast_rec()receives new argumentdelete_corrupt_cdsto remove corrupted input coding sequences (delete_corrupt_cdsis set toTRUEas default)
orthologr version 0.0.3
- Fixing internal path bug that caused that wrong pal2nal paths were generated when using multiple sequence aligners -> see issue https://github.com/drostlab/orthologr/issues/5 (Many thanks to Dr. Mario López-Pérez)
