DIOPT Documentation

URL

http://www.flyrnai.org/diopt

About DIOPT

The identification of orthologs is commonly used for bioinformatics activities such as data mining and establishing models for human diseases. Moreover, our group notes that researchers analyzing the results of screens performed at the Drosophila RNAi Screening Center (DRSC) frequently wish to identify mammalian orthologs of the fly genes that were "hits" (positive results) in their screens.

In helping DRSC screeners to identify orthologs using existing tools and algorithms, we recognized a need for a user-friendly approach to viewing and comparing ortholog predictions obtained using different tools and algorithms. This was our motivation in developing DIOPT. To facilitate identification of orthologs specifically of human disease-associated genes, we further developed DIOPT-DIST. Information about our approaches to development of both tools is summarized below.

The DIOPT Approach

Many tools have emerged to meet the need to identify orthologs. However, low coverage and heterogeneity of these tools present an obstacle to scientists who want to identify a one or a few highest-confidence orthologs for a given gene of interest or conversely, want to cast a wide net and follow up on all possible orthologs of a gene.

Our goal is to provide an easy-to-use resource that facilitates summary, comparison and access to various sources of ortholog predictions. DIOPT integrates human, mouse, fly, worm, zebrafish and yeast ortholog predictions made by Ensembl Compara, HomoloGene, Inparanoid, Isobase, OMA, orthoMCL, Phylome, RoundUp, and TreeFam. DIOPT lets users find ortholog pairs for a specified gene or genes identified by one, many or all of these published approaches. This provides a streamlined method for integration, comparison and access to orthology predictions originating from algorithms based on sequence homology, phylogenetic trees, and functional similarity. DIOPT calculates a simple score indicating the number of tools that support a given orthologous gene-pair relationship, as well as a weighted score based on functional assessment using high quality GO molecular function annotation of all fly-human orthologous pairs predicted by each tool. Differences in the algorithms used by each tool to predict orthologous relationship is one source of difference in the set of predictions made by one tool versus another. However, we also note that some of these differences might be attributable to use of different genome annotation releases used by some tools versus others, and that not all tools cover all of the species that we include in the DIOPT tool (see Tables 1,2 and 3).

DIOPT also displays protein and domain alignments, including percent amino acid identity, for predicted ortholog pairs. These should help you to identify the most appropriate matches among multiple possible orthologs.

The following summary figures and tables help to explain our approach and summarize the tools and algorithms included in DIOPT.

DIOPT integration schemaFigure 1: Summary of the DIOPT approach to integration of results from multiple ortholog prediction tools and algorithms. In green, tools based on sequence alignment. In purple, tools based on evolutionary relationships. In orange, a tool that incorporates protein-protein interaction network data into ortholog predictions.

Table 1: Summary Information and Publications for the Tools Integrated in DIOPT

Prediction Method Source Prediction Algorithm Coverage DIOPT Weight* PMID
Compara Ensembl Phylogenetic approach 112 species (vs 91) 0.93 19029536
Homologene NCBI Combination of BBH*, tree and synteny 21 species (vs. 68) 1 11125071
Inparanoid Stockholm University, Sweden BBH* approach to identify orthologs and in-paralogs 273 species (vs. 8) 1.05 11743721
25429972
Isobase MIT Sequence and PPI* network alignments 5 species (vs.2, Nov. 2014) 0.95 21177658
OMA CBRG, ETH Zurich BBH*, global sequence alignments 2103 species (Jan 2018) 1.01 17545180
OrthoDB University of Geneva Phylogenetic approach >5000 species (vs.9.1) 1.01 20972218
25428351
orthoMCL University of Pennsylvania Markov Cluster algorithm 150 species (vs. 5) 0.9 12952885
Phylome Centre for Genomic Regulation (CRG), Spain Reconstruction of evolutionary histories of all genes in a genome, also known as phylome. 1059 species,120 Phylomes (vs. 4) QfO 0.91 17962297
24275491
RoundUp Harvard Medical School RSD*, modified BBH* 2044 species (Apr 2013) 1.03 16777906
TreeFam Wellcome Trust Sanger Institute Manually curated based on trees 109 species (vs. 9) 0.96 16381935
24194607
Panther University of Southern California Phylogenetic approach 112 species (vs. 13.1) 1.1 26578592
HGNC European Bioinformatics Institute (EMBL-EBI) Manually curated 3 species (Jan 2018) 1.5  
ZFIN Zebrafish Model Organism Database Sequence similarity analysis and manual curation 4 species (Jan 2018) 1.5  
eggNOG Embl, Germany Graph-based algorithms 2031 species (vs. 4.5.1) 0.9 26582926
University of Oxford
Graph-based algorithms
 
1
 
Institut de Génétique et de Biologie Moléculaire et Cellulaire
A novel orthogroup inference algorithm that solves a previously undetected gene length bias in orthogroup inference, resulting in significant improvements in accuracy.
259 species (vs2.21)
1
Stockholm Bioinformatics Center
The OrthoInspector algorithm is divided into three main steps. First, the results of a Blast all-versus-all (proteomes are blasted against each other) is provided by the user and is parsed to find all the Blast best hits for each protein and to create the groups of inparalogs. Second, the inparalog groups for each organism are compared in a pairwise fashion to define potential orthologs and/or in-paralogs. Third, best hits that contradict the potential orthology between entities are detected.
66 species (vs. 2)
1

* DIOPT weights are based on the mean semantic similarity of high quality GO molecular function annotation of all fly-human orthologous pairs predicted by each tool.
   BBH, Best Blast Hits
   RSD, Reciprocal Smallest Distance
   PPI, Protein-Protein Interactions

Table 2A: Genome Release Information for the Tools Integrated in DIOPT

 

  Worm Fish Fly Human Mouse Yeast Fission Yeast Frog Rat Thale cress
Compara WBcel235 GRCz10 BDGP6 GRCh38.p3 GRCm38.p4 R64-1-1 NA JGI 4.2 Rnor_6.0 NA
Homologene WS195 Zv9 FlyBase r5.48 GRCh38 GRCm38.p2 R64-1-1 ASM294v2 Xtropicalis_v7 Rnor_5.0 TAIR10
OMA Ensembl 86; WBcel235; 14-SEP-2016 Ensembl 90; GRCz10 Ensembl 90; BDGP6 Ensembl 86; GRCh38; 13-SEP-2016 Ensembl 86; GRCm38; 13-SEP-2016 Ensembl 73; EF4; 23-AUG-2013 Ensembl Fungi 22; ASM294v2; 17-MAR-2014 Ensembl 73; JGI_4.2; 23-AUG-2013 Ensembl 83; Rnor_6.0; 28-NOV-2015 Ensembl Plants 20; TAIR10; 2-SEP-2013
Inparanoid UniProt Nov 2013 UniProt Nov 2013 UniProt Nov 2013 UniProt Nov 2013 UniProt Nov 2013 UniProt Nov 2013 UniProt Nov 2013 UniProt Nov 2013 UniProt Nov 2013 UniProt Nov 2013
Isobase Ensembl v59 NA Ensembl v59 Ensembl v59 Ensembl v59 Ensembl v59 NA NA NA NA
orthoMCL WS206 Zv8.56 BDGP5.13.56 GRCh37.56 NCBI v37.56 FungiDB GenBank NA Ensembl v53 GenBank
orthoDB UniProt UniProt UniProt UniProt UniProt UniProt UniProt UniProt UniProt UniProt
RoundUp UniProt Apr 2013 UniProt Apr 2013 UniProt Apr 2013 UniProt Apr 2013 UniProt Apr 2013 UniProt Apr 2013 UniProt Apr 2013 UniProt Apr 2013 NA NA
TreeFam Ensembl v69 Ensembl v69 Ensembl v69 Ensembl v69 Ensembl v69 Ensembl v69 Ensembl v69 Ensembl v69 Ensembl v69 NA
Panther WormBase Oct 2017 Ensembl Oct 2017 FlyBase Oct 2017 HGNC Oct 2017 MGI Oct 2017 SGD Oct 2017 PomBase Oct 2017 Zenbase Oct 2017 RGD Oct 2017 TAIR Oct 2017
Phylome UniProt UniProt UniProt UniProt UniProt UniProt UniProt UniProt UniProt UniProt
HGNC NA NA NA HGNC Jan 2018 HGNC Jan 2018 NA NA NA HGNC Jan 2018 NA
ZFIN NA ZFIN Jan 2018 ZFIN Jan 2018 ZFIN Jan 2018 ZFIN Jan 2018 NA NA NA NA NA
eggNOG Ensembl Ensembl Ensembl Ensembl Ensembl Ensembl NA Ensembl Ensembl NA
OrthoFinder RefSeq78 RefSeq78 RefSeq78 RefSeq78 RefSeq78 RefSeq78 RefSeq78 RefSeq78 RefSeq78 NA
OrthoInspector UniProt UniProt UniProt UniProt UniProt UniProt UniProt UniProt UniProt UniProt
Hieranoid UniProt (2016_07) UniProt (2016_07) UniProt (2016_07) UniProt (2016_07) UniProt (2016_07) UniProt (2016_07) UniProt (2016_07) UniProt (2016_07) UniProt (2016_07) UniProt (2016_07)

 

 

Table 2B: Additional Information About Genome Releases

Other Resource Version
WormBase release263
FlyBase release6.19
RefSeq release85
EntrezGene 1-Jan-18


 

Table 3. Maximum DIOPT score for each orthologous relationship

 

Orthologous Relationship Max score Type Relevant Tools
baker's yeast-baker's yeast 10 paralog OrthoDB;Compara;Panther;orthoMCL;OrthoFinder;Isobase;Inparanoid;eggNOG;RoundUp;Homologene;
baker's yeast-fish 14 ortholog OrthoFinder;RoundUp;Treefam;Phylome;Panther;orthoMCL;OrthoInspector;OMA;Inparanoid;Homologene;Hieranoid;eggNOG;Compara;OrthoDB;
baker's yeast-fission yeast 12 ortholog OrthoInspector;RoundUp;Phylome;Treefam;orthoMCL;OrthoDB;OMA;Inparanoid;Homologene;Hieranoid;Panther;OrthoFinder;
baker's yeast-fly 15 ortholog OMA;Treefam;RoundUp;Phylome;Panther;orthoMCL;OrthoInspector;OrthoDB;Isobase;Inparanoid;Homologene;Hieranoid;eggNOG;Compara;OrthoFinder;
baker's yeast-frog 12 ortholog OMA;Compara;Treefam;RoundUp;Phylome;OrthoInspector;OrthoDB;Inparanoid;Homologene;Hieranoid;eggNOG;OrthoFinder;
baker's yeast-human 15 ortholog OrthoFinder;eggNOG;RoundUp;Phylome;Treefam;Panther;orthoMCL;OrthoInspector;OrthoDB;OMA;Isobase;Inparanoid;Hieranoid;Compara;Homologene;
baker's yeast-mouse 15 ortholog Isobase;Treefam;RoundUp;Phylome;Panther;orthoMCL;OrthoInspector;OrthoFinder;OrthoDB;Inparanoid;Homologene;Hieranoid;eggNOG;Compara;OMA;
baker's yeast-rat 13 ortholog orthoMCL;Treefam;Panther;OrthoInspector;OrthoFinder;OrthoDB;Inparanoid;Homologene;Compara;Hieranoid;eggNOG;OMA;Phylome;
baker's yeast-Thale cress 9 ortholog OMA;Hieranoid;Phylome;Panther;orthoMCL;OrthoDB;Inparanoid;Homologene;OrthoInspector;
baker's yeast-worm 15 ortholog OrthoFinder;RoundUp;Phylome;Panther;Homologene;orthoMCL;Treefam;OrthoInspector;OMA;Inparanoid;Hieranoid;eggNOG;Compara;Isobase;OrthoDB;
fish-fish 9 paralog OrthoDB;Compara;eggNOG;Homologene;OrthoFinder;orthoMCL;Panther;RoundUp;Inparanoid;
fish-fission yeast 12 ortholog Treefam;Hieranoid;Homologene;Inparanoid;OMA;OrthoDB;OrthoFinder;OrthoInspector;orthoMCL;Panther;RoundUp;Phylome;
fish-fly 15 ortholog OrthoInspector;OMA;ZFIN;Treefam;RoundUp;Phylome;Panther;orthoMCL;OrthoDB;Compara;Inparanoid;Homologene;Hieranoid;eggNOG;OrthoFinder;
fish-frog 12 ortholog OrthoFinder;Compara;Treefam;RoundUp;Phylome;OrthoInspector;OrthoDB;OMA;Inparanoid;Homologene;eggNOG;Hieranoid;
fish-human 15 ortholog OrthoFinder;Phylome;ZFIN;RoundUp;Panther;orthoMCL;OrthoInspector;OMA;Inparanoid;Homologene;Compara;Hieranoid;Treefam;eggNOG;OrthoDB;
fish-mouse 15 ortholog Panther;ZFIN;Treefam;Phylome;orthoMCL;OrthoInspector;OrthoFinder;OMA;Inparanoid;Compara;Homologene;Hieranoid;eggNOG;OrthoDB;RoundUp;
fish-rat 13 ortholog Homologene;Treefam;Phylome;Panther;orthoMCL;OrthoInspector;OrthoFinder;OrthoDB;Inparanoid;Hieranoid;eggNOG;Compara;OMA;
fish-Thale cress 9 ortholog OrthoInspector;Phylome;Hieranoid;orthoMCL;OrthoDB;OMA;Inparanoid;Homologene;Panther;
fish-worm 14 ortholog Hieranoid;OrthoInspector;Treefam;RoundUp;Phylome;Panther;orthoMCL;OrthoFinder;OrthoDB;OMA;Homologene;eggNOG;Compara;Inparanoid;
fission yeast-fission yeast 7 paralog OrthoDB;RoundUp;Panther;OrthoFinder;Inparanoid;Homologene;orthoMCL;
fission yeast-fly 12 ortholog RoundUp;Treefam;Phylome;Panther;orthoMCL;OrthoInspector;OrthoDB;OMA;Inparanoid;Homologene;Hieranoid;OrthoFinder;
fission yeast-frog 10 ortholog Inparanoid;Treefam;RoundUp;Phylome;OrthoInspector;OMA;OrthoDB;Homologene;Hieranoid;OrthoFinder;
fission yeast-human 12 ortholog OrthoFinder;RoundUp;Phylome;Panther;OrthoInspector;Treefam;OrthoDB;OMA;Inparanoid;Homologene;Hieranoid;orthoMCL;
fission yeast-mouse 12 ortholog OrthoInspector;Treefam;RoundUp;Phylome;Panther;orthoMCL;OrthoFinder;OrthoDB;OMA;Inparanoid;Hieranoid;Homologene;
fission yeast-rat 11 ortholog OrthoInspector;Phylome;Treefam;Panther;orthoMCL;OrthoDB;OMA;Inparanoid;Hieranoid;Homologene;OrthoFinder;
fission yeast-Thale cress 9 ortholog OMA;Phylome;Panther;orthoMCL;OrthoDB;Inparanoid;Homologene;Hieranoid;OrthoInspector;
fission yeast-worm 12 ortholog OMA;RoundUp;Phylome;Panther;Treefam;orthoMCL;OrthoDB;Inparanoid;Homologene;Hieranoid;OrthoFinder;OrthoInspector;
fly-fly 10 paralog Inparanoid;RoundUp;Panther;orthoMCL;OrthoFinder;Isobase;Homologene;eggNOG;Compara;OrthoDB;
fly-frog 12 ortholog Compara;OrthoFinder;RoundUp;Treefam;Phylome;OrthoInspector;OMA;Inparanoid;Homologene;eggNOG;Hieranoid;OrthoDB;
fly-human 15 ortholog OrthoDB;Treefam;RoundUp;Phylome;Panther;orthoMCL;OrthoFinder;OMA;Isobase;Inparanoid;Homologene;Hieranoid;eggNOG;Compara;OrthoInspector;
fly-mouse 15 ortholog Phylome;OrthoDB;RoundUp;Panther;orthoMCL;OrthoInspector;OrthoFinder;Isobase;Inparanoid;Homologene;Hieranoid;eggNOG;Compara;Treefam;OMA;
fly-rat 13 ortholog Phylome;Treefam;Panther;orthoMCL;OrthoInspector;OrthoFinder;OMA;Inparanoid;Homologene;Hieranoid;eggNOG;Compara;OrthoDB;
fly-Thale cress 9 ortholog OrthoInspector;Panther;Phylome;orthoMCL;OMA;Inparanoid;Hieranoid;Homologene;OrthoDB;
fly-worm 15 ortholog OrthoInspector;OMA;Treefam;RoundUp;Phylome;Panther;orthoMCL;Compara;OrthoFinder;Isobase;Inparanoid;Homologene;Hieranoid;eggNOG;OrthoDB;
frog-frog 7 paralog Homologene;RoundUp;OrthoFinder;Inparanoid;eggNOG;Compara;OrthoDB;
frog-human 12 ortholog OrthoDB;RoundUp;Phylome;OrthoFinder;OMA;Inparanoid;Homologene;Hieranoid;eggNOG;Compara;Treefam;OrthoInspector;
frog-mouse 12 ortholog Treefam;RoundUp;OrthoDB;Phylome;OrthoInspector;OrthoFinder;Inparanoid;Homologene;Hieranoid;Compara;eggNOG;OMA;
frog-rat 11 ortholog Phylome;Treefam;OrthoInspector;OrthoFinder;OrthoDB;Inparanoid;Homologene;Hieranoid;eggNOG;Compara;OMA;
frog-Thale cress 7 ortholog OMA;OrthoInspector;OrthoDB;Homologene;Hieranoid;Inparanoid;Phylome;
frog-worm 12 ortholog Inparanoid;Treefam;RoundUp;Phylome;OrthoInspector;OrthoFinder;OMA;Homologene;Hieranoid;eggNOG;Compara;OrthoDB;
human-human 11 paralog OrthoDB;Panther;RoundUp;orthoMCL;OrthoFinder;Isobase;Inparanoid;Homologene;eggNOG;Compara;OMA;
human-mouse 16 ortholog HGNC;OrthoFinder;Treefam;RoundUp;Phylome;Panther;orthoMCL;OrthoInspector;OrthoDB;OMA;Isobase;Inparanoid;Hieranoid;eggNOG;Compara;Homologene;
human-rat 14 ortholog OrthoFinder;Homologene;Treefam;Phylome;Panther;orthoMCL;OrthoInspector;Compara;Inparanoid;OMA;Hieranoid;HGNC;eggNOG;OrthoDB;
human-Thale cress 9 ortholog OMA;Phylome;Panther;orthoMCL;Hieranoid;OrthoDB;Inparanoid;Homologene;OrthoInspector;
human-worm 15 ortholog OrthoDB;RoundUp;Phylome;Panther;orthoMCL;OrthoInspector;OrthoFinder;OMA;Isobase;Inparanoid;Homologene;Hieranoid;Compara;Treefam;eggNOG;
mouse-mouse 13 paralog OrthoDB;RoundUp;Phylome;Panther;Compara;OrthoFinder;Isobase;Inparanoid;Homologene;Hieranoid;eggNOG;OMA;orthoMCL;
mouse-rat 13 ortholog Inparanoid;Treefam;Phylome;Panther;orthoMCL;OrthoInspector;OrthoFinder;OMA;Homologene;Hieranoid;eggNOG;Compara;OrthoDB;
mouse-Thale cress 9 ortholog OMA;Phylome;Panther;orthoMCL;OrthoDB;Inparanoid;Homologene;Hieranoid;OrthoInspector;
mouse-worm 15 ortholog OMA;Treefam;RoundUp;Phylome;Panther;orthoMCL;OrthoInspector;OrthoDB;Isobase;Inparanoid;Homologene;Hieranoid;eggNOG;Compara;OrthoFinder;
rat-rat 9 paralog Compara;orthoMCL;OrthoFinder;OrthoDB;OMA;Inparanoid;eggNOG;Panther;Homologene;
rat-Thale cress 9 ortholog OrthoInspector;Homologene;Phylome;Panther;orthoMCL;Hieranoid;Inparanoid;OMA;OrthoDB;
rat-worm 13 ortholog OrthoInspector;Phylome;OrthoDB;orthoMCL;Treefam;OrthoFinder;Inparanoid;Homologene;Hieranoid;eggNOG;Compara;OMA;Panther;
Thale cress-Thale cress 5 paralog Inparanoid;OrthoDB;orthoMCL;Panther;Homologene;
Thale cress-worm 9 ortholog OMA;Phylome;Panther;orthoMCL;OrthoDB;Inparanoid;Homologene;Hieranoid;OrthoInspector;
worm-worm 10 paralog OrthoDB;RoundUp;Panther;orthoMCL;OrthoFinder;Inparanoid;Homologene;Compara;Isobase;eggNOG;

SCORE DISTRIBUTIONS

diopt ortholog distribution

diopt paralog distribution

Version information

 

6.0- Dec 2016 -

  • Updated Data Sources
  • added eggNOG as a source
  • Added paralogs

5.5- Oct 2016 - Added multi-sequence alignment from target "All" heatmap

5.4- Sept 2016 - Added target species "All" and new filter

 

5.3- May 2016 - Added more prediction tools (Panther, HGNC and ZFIN)
5.2.1- April 2016 - Added orthologous rank
High: best score both ways AND DIOPT score >=2
Moderate:
(best score forward or reverse) AND DIOPT score >=2
DIOPT score >=4
Low: all others
5.2- April 2016 - Added New Spcecies (Rattus norvegicus)
5.1.1 - December 2015 - Added Best forward and reverse columns
5.1 - November 2015 - Upgraded gene matching algorithm
5.0 - November 2015 - Upgraded data sources to version 5