The accumulation of biological and biomedical literature outpaces the ability of most researchers and clinicians to stay abreast of their own immediate fields, let alone a broader range of topics. Although available search tools support identification of relevant literature, finding relevant and key publications is not always straightforward. For example, important publications might be missed in searches with an official gene name due to gene synonyms. Moreover, ambiguity of gene names can result in retrieval of a large number of irrelevant publications. To address these issues and help researchers and physicians quickly identify relevant publications, we developed BioLitMine, an advanced literature mining tool that takes advantage of the medical subject heading (MeSH) index and gene-to-publication annotations already available for PubMed literature. Using BioLitMine, a user can identify what MeSH terms are represented in the set of publications associated with a given gene of the interest, or start with a term and identify relevant publications. Users can also use the tool to find co-cited genes and a build a literature co-citation network. In addition, BioLitMine can help users build a gene list relevant to a MeSH terms, such as a list of genes relevant to "stem cells" or "breast neoplasms." Users can also start with a gene or pathway of interest and identify authors associated with that gene or pathway, a feature that makes it easier to identify experts who might serve as collaborators or reviewers. Altogether, BioLitMine extends the value of PubMed-indexed literature and its existing expert curation by providing a robust and gene-centric approach to retrieval of relevant information.
Inactivation of the tumor suppressor gene is the signature initiating event in clear cell renal cell carcinoma (ccRCC), the most common form of kidney cancer, and causes the accumulation of hypoxia-inducible factor 2α (HIF-2α). HIF-2α inhibitors are effective in some ccRCC cases, but both de novo and acquired resistance have been observed in the laboratory and in the clinic. Here, we identified synthetic lethality between decreased activity of cyclin-dependent kinases 4 and 6 (CDK4/6) and inactivation in two species (human and ) and across diverse human ccRCC cell lines in culture and xenografts. Although HIF-2α transcriptionally induced the CDK4/6 partner cyclin D1, HIF-2α was not required for the increased CDK4/6 requirement of ccRCC cells. Accordingly, the antiproliferative effects of CDK4/6 inhibition were synergistic with HIF-2α inhibition in HIF-2α-dependent ccRCC cells and not antagonistic with HIF-2α inhibition in HIF-2α-independent cells. These findings support testing CDK4/6 inhibitors as treatments for ccRCC, alone and in combination with HIF-2α inhibitors.
One of the most powerful ways to develop hypotheses regarding biological functions of conserved genes in a given species, such as in humans, is to first look at what is known about function in another species. Model organism databases (MODs) and other resources are rich with functional information but difficult to mine. Gene2Function (G2F) addresses a broad need by integrating information about conserved genes in a single online resource.
One major challenge encountered with interpreting human genetic variants is the limited understanding of the functional impact of genetic alterations on biological processes. Furthermore, there remains an unmet demand for an efficient survey of the wealth of information on human homologs in model organisms across numerous databases. To efficiently assess the large volume of publically available information, it is important to provide a concise summary of the most relevant information in a rapid user-friendly format. To this end, we created MARRVEL (model organism aggregated resources for rare variant exploration). MARRVEL is a publicly available website that integrates information from six human genetic databases and seven model organism databases. For any given variant or gene, MARRVEL displays information from OMIM, ExAC, ClinVar, Geno2MP, DGV, and DECIPHER. Importantly, it curates model organism-specific databases to concurrently display a concise summary regarding the human gene homologs in budding and fission yeast, worm, fly, fish, mouse, and rat on a single webpage. Experiment-based information on tissue expression, protein subcellular localization, biological process, and molecular function for the human gene and homologs in the seven model organisms are arranged into a concise output. Hence, rather than visiting multiple separate databases for variant and gene analysis, users can obtain important information by searching once through MARRVEL. Altogether, MARRVEL dramatically improves efficiency and accessibility to data collection and facilitates analysis of human genes and variants by cross-disciplinary integration of 18 million records available in public databases to facilitate clinical diagnosis and basic research.
The FlyRNAi database of the Drosophila RNAi Screening Center (DRSC) and Transgenic RNAi Project (TRiP) at Harvard Medical School and associated DRSC/TRiP Functional Genomics Resources website (http://fgr.hms.harvard.edu) serve as a reagent production tracking system, screen data repository, and portal to the community. Through this portal, we make available protocols, online tools, and other resources useful to researchers at all stages of high-throughput functional genomics screening, from assay design and reagent identification to data analysis and interpretation. In this update, we describe recent changes and additions to our website, database and suite of online tools. Recent changes reflect a shift in our focus from a single technology (RNAi) and model species (Drosophila) to the application of additional technologies (e.g. CRISPR) and support of integrated, cross-species approaches to uncovering gene function using functional genomics and other approaches.
The tuberous sclerosis complex (TSC) family of tumor suppressors, TSC1 and TSC2, function together in an evolutionarily conserved protein complex that is a point of convergence for major cell signaling pathways that regulate mTOR complex 1 (mTORC1). Mutation or aberrant inhibition of the TSC complex is common in various human tumor syndromes and cancers. The discovery of novel therapeutic strategies to selectively target cells with functional loss of this complex is therefore of clinical relevance to patients with nonmalignant TSC and those with sporadic cancers. We developed a CRISPR-based method to generate homogeneous mutant Drosophila cell lines. By combining TSC1 or TSC2 mutant cell lines with RNAi screens against all kinases and phosphatases, we identified synthetic interactions with TSC1 and TSC2. Individual knockdown of three candidate genes (mRNA-cap, Pitslre, and CycT; orthologs of RNGTT, CDK11, and CCNT1 in humans) reduced the population growth rate of Drosophila cells lacking either TSC1 or TSC2 but not that of wild-type cells. Moreover, individual knockdown of these three genes had similar growth-inhibiting effects in mammalian TSC2-deficient cell lines, including human tumor-derived cells, illustrating the power of this cross-species screening strategy to identify potential drug targets.
Using a Drosophila model of Alzheimer's disease (AD), we systematically evaluated 67 candidate genes based on AD-associated genomic loci (P < 10(-4)) from published human genome-wide association studies (GWAS). Genetic manipulation of 87 homologous fly genes was tested for modulation of neurotoxicity caused by human Tau, which forms neurofibrillary tangle pathology in AD. RNA interference (RNAi) targeting 9 genes enhanced Tau neurotoxicity, and in most cases reciprocal activation of gene expression suppressed Tau toxicity. Our screen implicates cindr, the fly ortholog of the human CD2AP AD susceptibility gene, as a modulator of Tau-mediated disease mechanisms. Importantly, we also identify the fly orthologs of FERMT2 and CELF1 as Tau modifiers, and these loci have been independently validated as AD susceptibility loci in the latest GWAS meta-analysis. Both CD2AP and FERMT2 have been previously implicated with roles in cell adhesion, and our screen additionally identifies a fly homolog of the human integrin adhesion receptors, ITGAM and ITGA9, as a modifier of Tau neurotoxicity. Our results highlight cell adhesion pathways as important in Tau toxicity and AD susceptibility and demonstrate the power of model organism genetic screens for the functional follow-up of human GWAS.
The androgen receptor (AR) is a mediator of both androgen-dependent and castration-resistant prostate cancers. Identification of cellular factors affecting AR transcriptional activity could in principle yield new targets that reduce AR activity and combat prostate cancer, yet a comprehensive analysis of the genes required for AR-dependent transcriptional activity has not been determined. Using an unbiased genetic approach that takes advantage of the evolutionary conservation of AR signaling, we have conducted a genome-wide RNAi screen in Drosophila cells for genes required for AR transcriptional activity and applied the results to human prostate cancer cells. We identified 45 AR-regulators, which include known pathway components and genes with functions not previously linked to AR regulation, such as HIPK2 (a protein kinase) and MED19 (a subunit of the Mediator complex). Depletion of HIPK2 and MED19 in human prostate cancer cells decreased AR target gene expression and, importantly, reduced the proliferation of androgen-dependent and castration-resistant prostate cancer cells. We also systematically analyzed additional Mediator subunits and uncovered a small subset of Mediator subunits that interpret AR signaling and affect AR-dependent transcription and prostate cancer cell proliferation. Importantly, targeting of HIPK2 by an FDA-approved kinase inhibitor phenocopied the effect of depletion by RNAi and reduced the growth of AR-positive, but not AR-negative, treatment-resistant prostate cancer cells. Thus, our screen has yielded new AR regulators including drugable targets that reduce the proliferation of castration-resistant prostate cancer cells.
To identify Huntington's Disease therapeutics, we conducted high-content small molecule and RNAi suppressor screens using a Drosophila primary neural culture Huntingtin model. Drosophila primary neurons offer a sensitive readout for neurotoxicty, as their neurites develop dysmorphic features in the presence of mutant polyglutamine-expanded Huntingtin compared to nonpathogenic Huntingtin. By tracking the subcellular distribution of mRFP-tagged pathogenic Huntingtin and assaying neurite branch morphology via live-imaging, we identified suppressors that could reduce Huntingtin aggregation and/or prevent the formation of dystrophic neurites. The custom algorithms we used to quantify neurite morphologies in complex cultures provide a useful tool for future high-content screening approaches focused on neurodegenerative disease models. Compounds previously found to be effective aggregation inhibitors in mammalian systems were also effective in Drosophila primary cultures, suggesting translational capacity between these models. However, we did not observe a direct correlation between the ability of a compound or gene knockdown to suppress aggregate formation and its ability to rescue dysmorphic neurites. Only a subset of aggregation inhibitors could revert dysmorphic cellular profiles. We identified lkb1, an upstream kinase in the mTOR/Insulin pathway, and four novel drugs, Camptothecin, OH-Camptothecin, 18β-Glycyrrhetinic acid, and Carbenoxolone, that were strong suppressors of mutant Huntingtin-induced neurotoxicity. Huntingtin neurotoxicity suppressors identified through our screen also restored viability in an in vivo Drosophila Huntington's Disease model, making them attractive candidates for further therapeutic evaluation.
BACKGROUND: Mapping of orthologous genes among species serves an important role in functional genomics by allowing researchers to develop hypotheses about gene function in one species based on what is known about the functions of orthologs in other species. Several tools for predicting orthologous gene relationships are available. However, these tools can give different results and identification of predicted orthologs is not always straightforward. RESULTS: We report a simple but effective tool, the Drosophila RNAi Screening Center Integrative Ortholog Prediction Tool (DIOPT; http://www.flyrnai.org/diopt), for rapid identification of orthologs. DIOPT integrates existing approaches, facilitating rapid identification of orthologs among human, mouse, zebrafish, C. elegans, Drosophila, and S. cerevisiae. As compared to individual tools, DIOPT shows increased sensitivity with only a modest decrease in specificity. Moreover, the flexibility built into the DIOPT graphical user interface allows researchers with different goals to appropriately 'cast a wide net' or limit results to highest confidence predictions. DIOPT also displays protein and domain alignments, including percent amino acid identity, for predicted ortholog pairs. This helps users identify the most appropriate matches among multiple possible orthologs. To facilitate using model organisms for functional analysis of human disease-associated genes, we used DIOPT to predict high-confidence orthologs of disease genes in Online Mendelian Inheritance in Man (OMIM) and genes in genome-wide association study (GWAS) data sets. The results are accessible through the DIOPT diseases and traits query tool (DIOPT-DIST; http://www.flyrnai.org/diopt-dist). CONCLUSIONS: DIOPT and DIOPT-DIST are useful resources for researchers working with model organisms, especially those who are interested in exploiting model organisms such as Drosophila to study the functions of human disease genes.
Protein aggregates are a common pathological feature of most neurodegenerative diseases (NDs). Understanding their formation and regulation will help clarify their controversial roles in disease pathogenesis. To date, there have been few systematic studies of aggregates formation in Drosophila, a model organism that has been applied extensively in modeling NDs and screening for toxicity modifiers. We generated transgenic fly lines that express enhanced-GFP-tagged mutant Huntingtin (Htt) fragments with different lengths of polyglutamine (polyQ) tract and showed that these Htt mutants develop protein aggregates in a polyQ-length- and age-dependent manner in Drosophila. To identify central regulators of protein aggregation, we further generated stable Drosophila cell lines expressing these Htt mutants and also established a cell-based quantitative assay that allows automated measurement of aggregates within cells. We then performed a genomewide RNA interference screen for regulators of mutant Htt aggregation and isolated 126 genes involved in diverse cellular processes. Interestingly, although our screen focused only on mutant Htt aggregation, several of the identified candidates were known previously as toxicity modifiers of NDs. Moreover, modulating the in vivo activity of hsp110 (CG6603) or tra1, two hits from the screen, affects neurodegeneration in a dose-dependent manner in a Drosophila model of Huntington's disease. Thus, other aggregates regulators isolated in our screen may identify additional genes involved in the protein-folding pathway and neurotoxicity.
To facilitate the genetic analysis of muscle assembly and maintenance, we have developed a method for efficient RNA interference (RNAi) in Drosophila primary cells using double-stranded RNAs (dsRNAs). First, using molecular markers, we confirm and extend the observation that myogenesis in primary cultures derived from Drosophila embryonic cells follows the same developmental course as that seen in vivo. Second, we apply this approach to analyze 28 Drosophila homologs of human muscle disease genes and find that 19 of them, when disrupted, lead to abnormal muscle phenotypes in primary culture. Third, from an RNAi screen of 1140 genes chosen at random, we identify 49 involved in late muscle differentiation. We validate our approach with the in vivo analyses of three genes. We find that Fermitin 1 and Fermitin 2, which are involved in integrin-containing adhesion structures, act in a partially redundant manner to maintain muscle integrity. In addition, we characterize CG2165, which encodes a plasma membrane Ca2+-ATPase, and show that it plays an important role in maintaining muscle integrity. Finally, we discuss how Drosophila primary cells can be manipulated to develop cell-based assays to model human diseases for RNAi and small-molecule screens.
Antigen stimulation of immune cells triggers Ca2+ entry through Ca2+ release-activated Ca2+ (CRAC) channels, promoting the immune response to pathogens by activating the transcription factor NFAT. We have previously shown that cells from patients with one form of hereditary severe combined immune deficiency (SCID) syndrome are defective in store-operated Ca2+ entry and CRAC channel function. Here we identify the genetic defect in these patients, using a combination of two unbiased genome-wide approaches: a modified linkage analysis with single-nucleotide polymorphism arrays, and a Drosophila RNA interference screen designed to identify regulators of store-operated Ca2+ entry and NFAT nuclear import. Both approaches converged on a novel protein that we call Orai1, which contains four putative transmembrane segments. The SCID patients are homozygous for a single missense mutation in ORAI1, and expression of wild-type Orai1 in SCID T cells restores store-operated Ca2+ influx and the CRAC current (I(CRAC)). We propose that Orai1 is an essential component or regulator of the CRAC channel complex.