With the advent of single-cell RNA sequencing (scRNA-seq) technologies, there has been a spike in studies involving scRNA-seq of several tissues across diverse species, including Drosophila. Although a few databases exist for users to query genes of interest within the scRNA-seq studies, search tools that enable users to find orthologous genes and their cell type-specific expression patterns across species are limited. Here, we built a new search database, called DRscDB (https://www.flyrnai.org/tools/single_cell/web/), to address this need. DRscDB serves as a comprehensive repository for published scRNA-seq datasets for Drosophila and the relevant datasets from human and other model organisms. DRscDB is based on manual curation of Drosophila scRNA-seq studies of various tissue types and their corresponding analogous tissues in vertebrates including zebrafish, mouse, and human. Of note, our search database provides most of the literature-derived marker genes, thus preserving the original analysis of the published scRNA-seq datasets. DRscDB serves as a web-based user interface that allows users to mine, utilize, and compare gene expression data pertaining to scRNA-seq datasets from the published literature.
The accumulation of biological and biomedical literature outpaces the ability of most researchers and clinicians to stay abreast of their own immediate fields, let alone a broader range of topics. Although available search tools support identification of relevant literature, finding relevant and key publications is not always straightforward. For example, important publications might be missed in searches with an official gene name due to gene synonyms. Moreover, ambiguity of gene names can result in retrieval of a large number of irrelevant publications. To address these issues and help researchers and physicians quickly identify relevant publications, we developed BioLitMine, an advanced literature mining tool that takes advantage of the medical subject heading (MeSH) index and gene-to-publication annotations already available for PubMed literature. Using BioLitMine, a user can identify what MeSH terms are represented in the set of publications associated with a given gene of interest, or start with a term and identify relevant publications. Users can also use the tool to find co-cited genes and build a literature co-citation network. In addition, BioLitMine can help users build a gene list relevant to a MeSH term, such as a list of genes relevant to "stem cells" or "breast neoplasms." Users can also start with a gene or pathway of interest and identify authors associated with that gene or pathway, a feature that makes it easier to identify experts who might serve as collaborators or reviewers. Altogether, BioLitMine extends the value of PubMed-indexed literature and its existing expert curation by providing a robust and gene-centric approach to retrieval of relevant information.
Understanding human gene function is fundamental to understanding and treating diseases. Research using the model organism Drosophila melanogaster benefits from a wealth of molecular genetic resources and information useful for efficient in vivo experimentation. Moreover, Drosophila offers a balance as a relatively simple organism that nonetheless exhibits complex multicellular activities. Recent examples demonstrate the power and continued promise of Drosophila research to further our understanding of conserved gene functions.
Inactivation of the VHL tumor suppressor gene is the signature initiating event in clear cell renal cell carcinoma (ccRCC), the most common form of kidney cancer, and causes the accumulation of hypoxia-inducible factor 2α (HIF-2α). HIF-2α inhibitors are effective in some ccRCC cases, but both de novo and acquired resistance have been observed in the laboratory and in the clinic. Here, we identified synthetic lethality between decreased activity of cyclin-dependent kinases 4 and 6 (CDK4/6) and VHL inactivation in two species (human and Drosophila) and across diverse human ccRCC cell lines in culture and xenografts. Although HIF-2α transcriptionally induced the CDK4/6 partner cyclin D1, HIF-2α was not required for the increased CDK4/6 requirement of ccRCC cells. Accordingly, the antiproliferative effects of CDK4/6 inhibition were synergistic with HIF-2α inhibition in HIF-2α-dependent ccRCC cells and not antagonistic with HIF-2α inhibition in HIF-2α-independent cells. These findings support testing CDK4/6 inhibitors as treatments for ccRCC, alone and in combination with HIF-2α inhibitors.
Post-translational modification (PTM) serves as a regulatory mechanism for protein function, influencing protein stability, interactions, activity and localization, and is critical in many signaling pathways. The best characterized PTM is phosphorylation, whereby a phosphate is added to an acceptor residue, most commonly serine, threonine or tyrosine in metazoans. As proteins are often phosphorylated at multiple sites, identifying those sites that are important for function is a challenging problem. Considering that any given phosphorylation site might be non-functional, prioritizing evolutionarily conserved phosphosites provides a general strategy to identify the putative functional sites. To facilitate the identification of conserved phosphosites, we generated a large-scale phosphoproteomics dataset from embryos collected from six closely-related Drosophila species. We built iProteinDB (https://www.flyrnai.org/tools/iproteindb/), a resource integrating these data with other high-throughput PTM datasets, including vertebrates, and manually curated information for Drosophila. At iProteinDB, scientists can view the PTM landscape for any protein and identify predicted functional phosphosites based on a comparative analysis of data from closely-related species. Further, iProteinDB enables comparison of PTM data from Drosophila to that of orthologous proteins from other model organisms, including human, mouse, and rat, among others.
Methionine restriction (MetR) extends lifespan across different species and exerts beneficial effects on metabolic health and inflammatory responses. In contrast, certain cancer cells exhibit methionine auxotrophy that can be exploited for therapeutic treatment, as decreasing dietary methionine selectively suppresses tumor growth. Thus, MetR represents an intervention that can extend lifespan with a complementary effect of delaying tumor growth. Beyond its function in protein synthesis, methionine feeds into complex metabolic pathways including the methionine cycle, the transsulfuration pathway, and polyamine biosynthesis. Manipulation of each of these branches extends lifespan; however, the interplay between MetR and these branches during regulation of lifespan is not well understood. In addition, a potential mechanism linking the activity of methionine metabolism and lifespan is regulation of production of the methyl donor S-adenosylmethionine, which, after transferring its methyl group, is converted to S-adenosylhomocysteine. Methylation regulates a wide range of processes, including those thought to be responsible for lifespan extension by MetR. Although the exact mechanisms of lifespan extension by MetR or methionine metabolism reprogramming are unknown, these interventions may act by reducing the rate of translation, modifying gene expression, inducing a hormetic response, modulating autophagy, or inducing mitochondrial function, antioxidant defense, or other metabolic processes. Here, we review the mechanisms of lifespan extension by MetR and different branches of methionine metabolism in different species and the potential for exploiting the regulation of methyltransferases to delay aging.
Single-gene knockout experiments can fail to reveal function in the context of redundancy, which is frequently observed among duplicated genes (paralogs) with overlapping functions. We discuss the complexity associated with studying paralogs and outline how recent advances in CRISPR will help address the "phenotype gap" and impact biomedical research.
One of the most powerful ways to develop hypotheses regarding biological functions of conserved genes in a given species, such as in humans, is to first look at what is known about function in another species. Model organism databases (MODs) and other resources are rich with functional information but difficult to mine. Gene2Function (G2F) addresses a broad need by integrating information about conserved genes in a single online resource.
One major challenge encountered with interpreting human genetic variants is the limited understanding of the functional impact of genetic alterations on biological processes. Furthermore, there remains an unmet demand for an efficient survey of the wealth of information on human homologs in model organisms across numerous databases. To efficiently assess the large volume of publicly available information, it is important to provide a concise summary of the most relevant information in a rapid user-friendly format. To this end, we created MARRVEL (model organism aggregated resources for rare variant exploration). MARRVEL is a publicly available website that integrates information from six human genetic databases and seven model organism databases. For any given variant or gene, MARRVEL displays information from OMIM, ExAC, ClinVar, Geno2MP, DGV, and DECIPHER. Importantly, it curates model organism-specific databases to concurrently display a concise summary regarding the human gene homologs in budding and fission yeast, worm, fly, fish, mouse, and rat on a single webpage. Experiment-based information on tissue expression, protein subcellular localization, biological process, and molecular function for the human gene and homologs in the seven model organisms is arranged into a concise output. Hence, rather than visiting multiple separate databases for variant and gene analysis, users can obtain important information by searching once through MARRVEL. Altogether, MARRVEL dramatically improves the efficiency and accessibility of data collection and supports analysis of human genes and variants through cross-disciplinary integration of 18 million records available in public databases, facilitating clinical diagnosis and basic research.
Our understanding of the genetic mechanisms that underlie biological processes has relied extensively on loss-of-function (LOF) analyses. LOF methods target DNA, RNA or protein to reduce or to ablate gene function. By analysing the phenotypes that are caused by these perturbations, the wild-type function of genes can be elucidated. Although all LOF methods reduce gene activity, the choice of approach (for example, mutagenesis, CRISPR-based gene editing, RNA interference, morpholinos or pharmacological inhibition) can have a major effect on phenotypic outcomes. Interpretation of the LOF phenotype must take into account the biological process that is targeted by each method. The practicality and efficiency of LOF methods also vary considerably between model systems. We describe parameters for choosing the optimal combination of method and system, and for interpreting phenotypes within the constraints of each method.
The FlyRNAi database of the Drosophila RNAi Screening Center (DRSC) and Transgenic RNAi Project (TRiP) at Harvard Medical School and associated DRSC/TRiP Functional Genomics Resources website (http://fgr.hms.harvard.edu) serve as a reagent production tracking system, screen data repository, and portal to the community. Through this portal, we make available protocols, online tools, and other resources useful to researchers at all stages of high-throughput functional genomics screening, from assay design and reagent identification to data analysis and interpretation. In this update, we describe recent changes and additions to our website, database and suite of online tools. Recent changes reflect a shift in our focus from a single technology (RNAi) and model species (Drosophila) to the application of additional technologies (e.g. CRISPR) and support of integrated, cross-species approaches to uncovering gene function using functional genomics and other approaches.
The protein-protein interaction (PPI) network is crucial for cellular information processing and decision-making. With suitable inputs, PPI networks drive the cells to diverse functional outcomes such as cell proliferation or cell death. Here, we characterize the structural controllability of a large directed human PPI network comprising 6,339 proteins and 34,813 interactions. This network allows us to classify proteins as "indispensable," "neutral," or "dispensable," according to whether removing the protein increases, does not change, or decreases the number of driver nodes in the network. We find that 21% of the proteins in the PPI network are indispensable. Interestingly, these indispensable proteins are the primary targets of disease-causing mutations, human viruses, and drugs, suggesting that altering a network's control property is critical for the transition between healthy and disease states. Furthermore, analyzing copy number alterations data from 1,547 cancer patients reveals that 56 genes that are frequently amplified or deleted in nine different cancers are indispensable. Of these 56 genes, 46 have not been previously associated with cancer. This suggests that controllability analysis is useful for identifying novel disease genes and potential drug targets.
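The node classification described above can be illustrated on a toy network. The following is a minimal sketch, assuming the standard structural-controllability result that the minimum number of driver nodes equals the number of nodes left unmatched by a maximum matching of the directed graph (with at least one driver always required). The function names and the four-node example are hypothetical and are not taken from the study.

```python
def max_matching_size(nodes, edges):
    """Size of a maximum matching, treating each directed edge u -> v as a
    link between an 'out' copy of u and an 'in' copy of v."""
    succ = {u: [] for u in nodes}
    for u, v in edges:
        if u in succ and v in succ:
            succ[u].append(v)
    matched_to = {}  # in-copy v -> out-copy u currently matched to it

    def augment(u, seen):
        # Kuhn's augmenting-path search starting from out-copy u.
        for v in succ[u]:
            if v in seen:
                continue
            seen.add(v)
            if v not in matched_to or augment(matched_to[v], seen):
                matched_to[v] = u
                return True
        return False

    return sum(augment(u, set()) for u in nodes)


def driver_count(nodes, edges):
    """Minimum number of driver nodes: unmatched nodes, but never fewer than one."""
    return max(len(nodes) - max_matching_size(nodes, edges), 1)


def classify(nodes, edges):
    """Label each node by how its removal changes the driver-node count."""
    base = driver_count(nodes, edges)
    labels = {}
    for n in nodes:
        rest = [x for x in nodes if x != n]
        kept = [(u, v) for u, v in edges if n not in (u, v)]
        nd = driver_count(rest, kept)
        if nd > base:
            labels[n] = "indispensable"
        elif nd < base:
            labels[n] = "dispensable"
        else:
            labels[n] = "neutral"
    return labels


# Hypothetical toy network: A -> B, B -> C, B -> D.
NODES = ["A", "B", "C", "D"]
EDGES = [("A", "B"), ("B", "C"), ("B", "D")]
print(driver_count(NODES, EDGES))
print(classify(NODES, EDGES))
```

In this toy case, removing B fragments the network and raises the driver-node count, whereas removing either of its two targets lowers it. The simple augmenting-path search is adequate for small examples; a network of thousands of proteins would call for a faster matching algorithm such as Hopcroft-Karp.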
Alkylating agents are a key component of cancer chemotherapy. Several cellular mechanisms are known to be important for cell survival after alkylation damage, particularly DNA repair and xenobiotic detoxification, yet genomic screens indicate that additional cellular components may be involved. Elucidating these components has value in identifying key processes that either can be modulated to improve chemotherapeutic efficacy or may be altered in some cancers to confer chemoresistance. We therefore set out to reevaluate our prior Drosophila RNAi screening data by comparison to gene expression arrays in order to determine if we could identify any novel processes in alkylation damage survival. We noted a consistent conservation of alkylation survival pathways across platforms and species when the analysis was conducted on a pathway/process level rather than at an individual gene level. Better results were obtained when combining gene lists from two datasets (RNAi screen plus microarray) prior to analysis. In addition to previously identified DNA damage responses (p53 signaling and Nucleotide Excision Repair), DNA-mRNA-protein metabolism (transcription/translation) and proteasome machinery, we also noted a highly conserved cross-species requirement for NRF2, glutathione (GSH)-mediated drug detoxification and Endoplasmic Reticulum stress (ER stress)/Unfolded Protein Responses (UPR) in cells exposed to alkylation. The requirement for GSH, NRF2 and UPR in alkylation survival was validated by metabolomics, protein studies and functional cell assays. From this we conclude that RNAi/gene expression fusion is a valid strategy to rapidly identify key processes that may be extendable to other contexts beyond damage survival.
The rapid rise of CRISPR as a technology for genome engineering and related research applications has created a need for algorithms and associated online tools that facilitate design of on-target and effective guide RNAs (gRNAs). Here, we review the state-of-the-art in CRISPR gRNA design for research applications of the CRISPR-Cas9 system, including knockout, activation and inhibition. Notably, achieving good gRNA design is not solely dependent on innovations in CRISPR technology. Good design and design tools also rely on availability of high-quality genome sequence and gene annotations, as well as on availability of accumulated data regarding off-targets and effectiveness metrics.
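As a simple illustration of the first step in gRNA design, the sketch below enumerates candidate target sites on one strand of a sequence. It assumes the commonly used Streptococcus pyogenes Cas9, which recognizes a 20-nucleotide protospacer followed by an NGG PAM. The function name and example sequence are hypothetical, and off-target and efficiency scoring, which the design tools reviewed here address, are not attempted.

```python
import re

# A lookahead is used so that overlapping candidate sites are all reported.
# Assumption: SpCas9-style targets, i.e. 20 nt protospacer + NGG PAM.
PROTOSPACER_PAM = re.compile(r"(?=([ACGT]{20})([ACGT]GG))")


def find_candidates(seq):
    """Return (start, protospacer, PAM) for every forward-strand candidate."""
    seq = seq.upper()
    return [(m.start(), m.group(1), m.group(2))
            for m in PROTOSPACER_PAM.finditer(seq)]


# Hypothetical example sequence containing a single NGG PAM.
example = "CC" + "ACGTACGTACGTACGTACGT" + "AGG" + "TT"
for start, protospacer, pam in find_candidates(example):
    print(start, protospacer, pam)
```

A fuller design workflow would also scan the reverse complement and then filter candidates using genome-wide off-target searches and empirical efficiency metrics.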
Huajin Wang, Michel Becuwe, Benjamin E Housden, Chandramohan Chitraju, Ashley J Porras, Morven M Graham, Xinran N Liu, Abdou Rachid Thiam, David B Savage, Anil K Agarwal, Abhimanyu Garg, Maria-Jesus Olarte, Qingqing Lin, Florian Fröhlich, Hans Kristian Hannibal-Bach, Srigokul Upadhyayula, Norbert Perrimon, Tomas Kirchhausen, Christer S Ejsing, Tobias C Walther, and Robert V Farese. 2016. “Seipin is required for converting nascent to mature lipid droplets.” Elife, 5.
How proteins control the biogenesis of cellular lipid droplets (LDs) is poorly understood. Using Drosophila and human cells, we show here that seipin, an ER protein implicated in LD biology, mediates a discrete step in LD formation: the conversion of small, nascent LDs to larger, mature LDs. Seipin forms discrete and dynamic foci in the ER that interact with nascent LDs to enable their growth. In the absence of seipin, numerous small, nascent LDs accumulate near the ER and most often fail to grow. Those that do grow prematurely acquire lipid synthesis enzymes and undergo expansion, eventually leading to the giant LDs characteristic of seipin deficiency. Our studies identify a discrete step of LD formation, namely the conversion of nascent LDs to mature LDs, and define a molecular role for seipin in this process, most likely by acting at ER-LD contact sites to enable lipid transfer to nascent LDs.
The tuberous sclerosis complex (TSC) family of tumor suppressors, TSC1 and TSC2, function together in an evolutionarily conserved protein complex that is a point of convergence for major cell signaling pathways that regulate mTOR complex 1 (mTORC1). Mutation or aberrant inhibition of the TSC complex is common in various human tumor syndromes and cancers. The discovery of novel therapeutic strategies to selectively target cells with functional loss of this complex is therefore of clinical relevance to patients with nonmalignant TSC and those with sporadic cancers. We developed a CRISPR-based method to generate homogeneous mutant Drosophila cell lines. By combining TSC1 or TSC2 mutant cell lines with RNAi screens against all kinases and phosphatases, we identified synthetic interactions with TSC1 and TSC2. Individual knockdown of three candidate genes (mRNA-cap, Pitslre, and CycT; orthologs of RNGTT, CDK11, and CCNT1 in humans) reduced the population growth rate of Drosophila cells lacking either TSC1 or TSC2 but not that of wild-type cells. Moreover, individual knockdown of these three genes had similar growth-inhibiting effects in mammalian TSC2-deficient cell lines, including human tumor-derived cells, illustrating the power of this cross-species screening strategy to identify potential drug targets.
Using a Drosophila model of Alzheimer's disease (AD), we systematically evaluated 67 candidate genes based on AD-associated genomic loci (P < 10⁻⁴) from published human genome-wide association studies (GWAS). Genetic manipulation of 87 homologous fly genes was tested for modulation of neurotoxicity caused by human Tau, which forms neurofibrillary tangle pathology in AD. RNA interference (RNAi) targeting 9 genes enhanced Tau neurotoxicity, and in most cases reciprocal activation of gene expression suppressed Tau toxicity. Our screen implicates cindr, the fly ortholog of the human CD2AP AD susceptibility gene, as a modulator of Tau-mediated disease mechanisms. Importantly, we also identify the fly orthologs of FERMT2 and CELF1 as Tau modifiers, and these loci have been independently validated as AD susceptibility loci in the latest GWAS meta-analysis. Both CD2AP and FERMT2 have been previously implicated in cell adhesion, and our screen additionally identifies a fly homolog of the human integrin adhesion receptors, ITGAM and ITGA9, as a modifier of Tau neurotoxicity. Our results highlight cell adhesion pathways as important in Tau toxicity and AD susceptibility and demonstrate the power of model organism genetic screens for the functional follow-up of human GWAS.
BACKGROUND: RNA interference (RNAi) is an effective and important tool used to study gene function. For large-scale screens, RNAi is used to systematically down-regulate genes of interest and analyze their roles in a biological process. However, RNAi is associated with off-target effects (OTEs), including microRNA (miRNA)-like OTEs. The contribution of reagent-specific OTEs to RNAi screen data sets can be significant. In addition, the post-screen validation process is time and labor intensive. Thus, the availability of robust approaches to identify candidate off-targeted transcripts would be beneficial. RESULTS: Significant efforts have been made to eliminate false positive results attributable to sequence-specific OTEs associated with RNAi. These approaches have included improved algorithms for RNAi reagent design, incorporation of chemical modifications into siRNAs, and the use of various bioinformatics strategies to identify possible OTEs in screen results. Genome-wide Enrichment of Seed Sequence matches (GESS) was developed to identify potential off-targeted transcripts in large-scale screen data by seed-region analysis. Here, we introduce a user-friendly web application that provides researchers a relatively quick and easy way to perform GESS analysis on data from human or mouse cell-based screens using short interfering RNAs (siRNAs) or short hairpin RNAs (shRNAs), as well as for Drosophila screens using shRNAs. Online GESS relies on up-to-date transcript sequence annotations for human and mouse genes extracted from NCBI Reference Sequence (RefSeq) and Drosophila genes from FlyBase. The tool also accommodates analysis with user-provided reference sequence files. CONCLUSION: Online GESS provides a straightforward user interface for genome-wide seed region analysis for human, mouse and Drosophila RNAi screen data. With the tool, users can either use a built-in database or provide a database of transcripts for analysis. This makes it possible to analyze RNAi data from any organism for which the user can provide transcript sequences.
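The idea behind seed-region analysis can be sketched in a few lines. This is an illustrative example only, not the Online GESS implementation: it assumes the seed is nucleotides 2-8 of the guide strand, written 5' to 3' in the DNA alphabet, and it simply tallies how many active and inactive reagents have a seed-complementary site in a given transcript. All function names and sequences are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def seed_site(guide, start=1, end=8):
    """Transcript site complementary to the guide's seed.

    Assumption: the seed is guide positions 2-8 (0-based slice 1:8); the
    matching site in the transcript is its reverse complement.
    """
    seed = guide[start:end]
    return seed.translate(COMPLEMENT)[::-1]


def seed_match_table(active, inactive, transcript):
    """2x2 counts: (active with match, active without,
    inactive with match, inactive without)."""
    def hits(guides):
        return sum(seed_site(g) in transcript for g in guides)

    a = hits(active)
    c = hits(inactive)
    return (a, len(active) - a, c, len(inactive) - c)


# Hypothetical toy data: two active and two inactive reagents, one transcript.
transcript = "GGGCCCATAAGCTGGGCCC"
active = ["TAGCTTATCAGACTGATGTTGA", "CAGCTTATGGGGGGGGGGGGG"]
inactive = ["TTTTTTTTTTTTTTTTTTTTT", "TACACACACACACACACACAC"]
print(seed_site(active[0]))
print(seed_match_table(active, inactive, transcript))
```

A transcript whose seed matches are over-represented among active reagents relative to inactive ones would be flagged as a candidate off-target. In practice, the counts for each transcript would feed a statistical enrichment test with correction for multiple comparisons.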
Gene silencing through sequence-specific targeting of mRNAs by RNAi has enabled genome-wide functional screens in cultured cells and in vivo in model organisms. These screens have resulted in the identification of new cellular pathways and potential drug targets. Considerable progress has been made to improve the quality of RNAi screen data through the development of new experimental and bioinformatics approaches. The recent availability of genome-editing strategies, such as the CRISPR (clustered regularly interspaced short palindromic repeats)-Cas9 system, when combined with RNAi, could lead to further improvements in screen data quality and follow-up experiments, thus promoting our understanding of gene function and gene regulatory networks.