One of the most powerful ways to develop hypotheses regarding biological functions of conserved genes in a given species, such as in humans, is to first look at what is known about function in another species. Model organism databases (MODs) and other resources are rich with functional information but difficult to mine. Gene2Function (G2F) addresses a broad need by integrating information about conserved genes in a single online resource.
BACKGROUND: Next-generation sequencing technologies have greatly increased our ability to identify gene expression levels, including at specific developmental stages and in specific tissues. Gene expression data can help researchers understand the diverse functions of genes and gene networks, as well as help in the design of specific and efficient functional studies, such as by helping researchers choose the most appropriate tissue for a study of a group of genes, or conversely, by limiting a long list of gene candidates to the subset that are normally expressed at a given stage or in a given tissue.
RESULTS: We report DGET, a Drosophila Gene Expression Tool ( www.flyrnai.org/tools/dget/web/ ), which stores and facilitates search of RNA-Seq based expression profiles available from the modENCODE consortium and other public data sets. Using DGET, researchers are able to look up gene expression profiles, filter results based on threshold expression values, and compare expression data across different developmental stages, tissues and treatments. In addition, at DGET a researcher can analyze tissue or stage-specific enrichment for an inputted list of genes (e.g., 'hits' from a screen) and search for additional genes with similar expression patterns. We performed a number of analyses to demonstrate the quality and robustness of the resource. In particular, we show that evolutionary conserved genes expressed at high or moderate levels in both fly and human tend to be expressed in similar tissues. Using DGET, we compared whole tissue profile and sub-region/cell-type specific datasets and estimated a potential source of false positives in one dataset. We also demonstrated the usefulness of DGET for synexpression studies by querying genes with expression profile similar to the mesodermal master regulator Twist.
CONCLUSION: Altogether, DGET provides a flexible tool for expression data retrieval and analysis with short or long lists of Drosophila genes, which can help scientists to design stage- or tissue-specific in vivo studies and do other subsequent analyses.
Our understanding of the genetic mechanisms that underlie biological processes has relied extensively on loss-of-function (LOF) analyses. LOF methods target DNA, RNA or protein to reduce or to ablate gene function. By analysing the phenotypes that are caused by these perturbations the wild-type function of genes can be elucidated. Although all LOF methods reduce gene activity, the choice of approach (for example, mutagenesis, CRISPR-based gene editing, RNA interference, morpholinos or pharmacological inhibition) can have a major effect on phenotypic outcomes. Interpretation of the LOF phenotype must take into account the biological process that is targeted by each method. The practicality and efficiency of LOF methods also vary considerably between model systems. We describe parameters for choosing the optimal combination of method and system, and for interpreting phenotypes within the constraints of each method.
The protein-protein interaction (PPI) network is crucial for cellular information processing and decision-making. With suitable inputs, PPI networks drive the cells to diverse functional outcomes such as cell proliferation or cell death. Here, we characterize the structural controllability of a large directed human PPI network comprising 6,339 proteins and 34,813 interactions. This network allows us to classify proteins as "indispensable," "neutral," or "dispensable," which correlates to increasing, no effect, or decreasing the number of driver nodes in the network upon removal of that protein. We find that 21% of the proteins in the PPI network are indispensable. Interestingly, these indispensable proteins are the primary targets of disease-causing mutations, human viruses, and drugs, suggesting that altering a network's control property is critical for the transition between healthy and disease states. Furthermore, analyzing copy number alterations data from 1,547 cancer patients reveals that 56 genes that are frequently amplified or deleted in nine different cancers are indispensable. Among the 56 genes, 46 of them have not been previously associated with cancer. This suggests that controllability analysis is very useful in identifying novel disease genes and potential drug targets.
The rapid rise of CRISPR as a technology for genome engineering and related research applications has created a need for algorithms and associated online tools that facilitate design of on-target and effective guide RNAs (gRNAs). Here, we review the state-of-the-art in CRISPR gRNA design for research applications of the CRISPR-Cas9 system, including knockout, activation and inhibition. Notably, achieving good gRNA design is not solely dependent on innovations in CRISPR technology. Good design and design tools also rely on availability of high-quality genome sequence and gene annotations, as well as on availability of accumulated data regarding off-targets and effectiveness metrics. This article is protected by copyright. All rights reserved.
Hypercapnia, elevated partial pressure of CO2 in blood and tissue, develops in many patients with chronic severe obstructive pulmonary disease and other advanced lung disorders. Patients with advanced disease frequently develop bacterial lung infections, and hypercapnia is a risk factor for mortality in such individuals. We previously demonstrated that hypercapnia suppresses induction of NF-κB-regulated innate immune response genes required for host defense in human, mouse, and Drosophila cells, and it increases mortality from bacterial infections in both mice and Drosophila. However, the molecular mediators of hypercapnic immune suppression are undefined. In this study, we report a genome-wide RNA interference screen in Drosophila S2* cells stimulated with bacterial peptidoglycan. The screen identified 16 genes with human orthologs whose knockdown reduced hypercapnic suppression of the gene encoding the antimicrobial peptide Diptericin (Dipt), but did not increase Dipt mRNA levels in air. In vivo tests of one of the strongest screen hits, zinc finger homeodomain 2 (Zfh2; mammalian orthologs ZFHX3/ATBF1 and ZFHX4), demonstrate that reducing zfh2 function using a mutation or RNA interference improves survival of flies exposed to elevated CO2 and infected with Staphylococcus aureus. Tissue-specific knockdown of zfh2 in the fat body, the major immune and metabolic organ of the fly, mitigates hypercapnia-induced reductions in Dipt and other antimicrobial peptides and improves resistance of CO2-exposed flies to infection. Zfh2 mutations also partially rescue hypercapnia-induced delays in egg hatching, suggesting that Zfh2's role in mediating responses to hypercapnia extends beyond the immune system. Taken together, to our knowledge, these results identify Zfh2 as the first in vivo mediator of hypercapnic immune suppression.
Huajin Wang, Michel Becuwe, Benjamin E Housden, Chandramohan Chitraju, Ashley J Porras, Morven M Graham, Xinran N Liu, Abdou Rachid Thiam, David B Savage, Anil K Agarwal, Abhimanyu Garg, Maria-Jesus Olarte, Qingqing Lin, Florian Fröhlich, Hans Kristian Hannibal-Bach, Srigokul Upadhyayula, Norbert Perrimon, Tomas Kirchhausen, Christer S Ejsing, Tobias C Walther, and Robert V Farese. 2016. “Seipin is required for converting nascent to mature lipid droplets..” Elife, 5.Abstract
How proteins control the biogenesis of cellular lipid droplets (LDs) is poorly understood. Using Drosophila and human cells, we show here that seipin, an ER protein implicated in LD biology, mediates a discrete step in LD formation-the conversion of small, nascent LDs to larger, mature LDs. Seipin forms discrete and dynamic foci in the ER that interact with nascent LDs to enable their growth. In the absence of seipin, numerous small, nascent LDs accumulate near the ER and most often fail to grow. Those that do grow prematurely acquire lipid synthesis enzymes and undergo expansion, eventually leading to the giant LDs characteristic of seipin deficiency. Our studies identify a discrete step of LD formation, namely the conversion of nascent LDs to mature LDs, and define a molecular role for seipin in this process, most likely by acting at ER-LD contact sites to enable lipid transfer to nascent LDs.
Insulin regulates an essential conserved signaling pathway affecting growth, proliferation, and meta- bolism. To expand our understanding of the insulin pathway, we combine biochemical, genetic, and computational approaches to build a comprehensive Drosophila InR/PI3K/Akt network. First, we map the dynamic protein-protein interaction network sur- rounding the insulin core pathway using bait-prey interactions connecting 566 proteins. Combining RNAi screening and phospho-specific antibodies, we find that 47% of interacting proteins affect pathway activity, and, using quantitative phospho- proteomics, we demonstrate that $10% of interact- ing proteins are regulated by insulin stimulation at the level of phosphorylation. Next, we integrate these orthogonal datasets to characterize the structure and dynamics of the insulin network at the level of protein complexes and validate our method by iden- tifying regulatory roles for the Protein Phosphatase 2A (PP2A) and Reptin-Pontin chromatin-remodeling complexes as negative and positive regulators of ribosome biogenesis, respectively. Altogether, our study represents a comprehensive resource for the study of the evolutionary conserved insulin network.
Heterochromatin is enriched for specific epigenetic factors including Heterochromatin Protein 1a (HP1a), and is essential for many organismal functions. To elucidate heterochromatin organization and regulation, we purified Drosophila melanogaster HP1a interactors, and performed a genome-wide RNAi screen to identify genes that impact HP1a levels or localization. The majority of the over four hundred putative HP1a interactors and regulators identified were previously unknown. We found that 13 of 16 tested candidates (83%) are required for gene silencing, providing a substantial increase in the number of identified components that impact heterochromatin properties. Surprisingly, image analysis revealed that although some HP1a interactors and regulators are broadly distributed within the heterochromatin domain, most localize to discrete subdomains that display dynamic localization patterns during the cell cycle. We conclude that heterochromatin composition and architecture is more spatially complex and dynamic than previously suggested, and propose that a network of subdomains regulates diverse heterochromatin functions.
Alkylating agents are a key component of cancer chemotherapy. Several cellular mechanisms are known to be important for its survival, particularly DNA repair and xenobiotic detoxification, yet genomic screens indicate that additional cellular components may be involved. Elucidating these components has value in either identifying key processes that can be modulated to improve chemotherapeutic efficacy or may be altered in some cancers to confer chemoresistance. We therefore set out to reevaluate our prior Drosophila RNAi screening data by comparison to gene expression arrays in order to determine if we could identify any novel processes in alkylation damage survival. We noted a consistent conservation of alkylation survival pathways across platforms and species when the analysis was conducted on a pathway/process level rather than at an individual gene level. Better results were obtained when combining gene lists from two datasets (RNAi screen plus microarray) prior to analysis. In addition to previously identified DNA damage responses (p53 signaling and Nucleotide Excision Repair), DNA-mRNA-protein metabolism (transcription/translation) and proteasome machinery, we also noted a highly conserved cross-species requirement for NRF2, glutathione (GSH)-mediated drug detoxification and Endoplasmic Reticulum stress (ER stress)/Unfolded Protein Responses (UPR) in cells exposed to alkylation. The requirement for GSH, NRF2 and UPR in alkylation survival was validated by metabolomics, protein studies and functional cell assays. From this we conclude that RNAi/gene expression fusion is a valid strategy to rapidly identify key processes that may be extendable to other contexts beyond damage survival.
The FlyRNAi database of the Drosophila RNAi Screening Center (DRSC) and Transgenic RNAi Project (TRiP) at Harvard Medical School and associated DRSC/TRiP Functional Genomics Resources website (http://fgr.hms.harvard.edu) serve as a reagent production tracking system, screen data repository, and portal to the community. Through this portal, we make available protocols, online tools, and other resources useful to researchers at all stages of high-throughput functional genomics screening, from assay design and reagent identification to data analysis and interpretation. In this update, we describe recent changes and additions to our website, database and suite of online tools. Recent changes reflect a shift in our focus from a single technology (RNAi) and model species (Drosophila) to the application of additional technologies (e.g. CRISPR) and support of integrated, cross-species approaches to uncovering gene function using functional genomics and other approaches.
The transforming growth factor-β (TGF-β) family of cytokines figures prominently in regulation of embryonic development and adult tissue homeostasis from Drosophila to mammals. Genetic defects affecting TGF-β signaling underlie developmental disorders and diseases such as cancer in human. Therefore, delineating the molecular mechanism by which TGF-β regulates cell biology is critical for understanding normal biology and disease mechanisms. Forward genetic screens in model organisms and biochemical approaches in mammalian tissue culture were instrumental in initial characterization of the TGF-β signal transduction pathway. With complete sequence information of the genomes and the advent of RNA interference (RNAi) technology, genome-wide RNAi screening emerged as a powerful functional genomics approach to systematically delineate molecular components of signal transduction pathways. Here, we describe a protocol for image-based whole-genome RNAi screening aimed at identifying molecules required for TGF-β signaling into the nucleus. Using this protocol we examined >90 % of annotated Drosophila open reading frames (ORF) individually and successfully uncovered several novel factors serving critical roles in the TGF-β pathway. Thus cell-based high-throughput functional genomics can uncover new mechanistic insights on signaling pathways beyond what the classical genetics had revealed.
The tuberous sclerosis complex (TSC) family of tumor suppressors, TSC1 and TSC2, function together in an evolutionarily conserved protein complex that is a point of convergence for major cell signaling pathways that regulate mTOR complex 1 (mTORC1). Mutation or aberrant inhibition of the TSC complex is common in various human tumor syndromes and cancers. The discovery of novel therapeutic strategies to selectively target cells with functional loss of this complex is therefore of clinical relevance to patients with nonmalignant TSC and those with sporadic cancers. We developed a CRISPR-based method to generate homogeneous mutant Drosophila cell lines. By combining TSC1 or TSC2 mutant cell lines with RNAi screens against all kinases and phosphatases, we identified synthetic interactions with TSC1 and TSC2. Individual knockdown of three candidate genes (mRNA-cap, Pitslre, and CycT; orthologs of RNGTT, CDK11, and CCNT1 in humans) reduced the population growth rate of Drosophila cells lacking either TSC1 or TSC2 but not that of wild-type cells. Moreover, individual knockdown of these three genes had similar growth-inhibiting effects in mammalian TSC2-deficient cell lines, including human tumor-derived cells, illustrating the power of this cross-species screening strategy to identify potential drug targets.
A wide variety of RNAs encode small open-reading-frame (smORF/sORF) peptides, but their functions are largely unknown. Here, we show that Drosophila polished-rice (pri) sORF peptides trigger proteasome-mediated protein processing, converting the Shavenbaby (Svb) transcription repressor into a shorter activator. A genome-wide RNA interference screen identifies an E2-E3 ubiquitin-conjugating complex, UbcD6-Ubr3, which targets Svb to the proteasome in a pri-dependent manner. Upon interaction with Ubr3, Pri peptides promote the binding of Ubr3 to Svb. Ubr3 can then ubiquitinate the Svb N terminus, which is degraded by the proteasome. The C-terminal domains protect Svb from complete degradation and ensure appropriate processing. Our data show that Pri peptides control selectivity of Ubr3 binding, which suggests that the family of sORF peptides may contain an extended repertoire of protein regulators.
RNA binding proteins (RBPs) are involved in many cellular functions. To facilitate functional characterization of RBPs, we generated an RNA interference (RNAi) library for Drosophila cell-based screens comprising reagents targeting known or putative RBPs. To test the quality of the library and provide a baseline analysis of the effects of the RNAi reagents on viability, we screened the library using a total ATP assay and high-throughput imaging in Drosophila S2R+ cultured cells. The results are consistent with production of a high-quality library that will be useful for functional genomics studies using other assays. Altogether, we provide resources in the form of an initial curated list of Drosophila RBPs; an RNAi screening library we expect to be used with additional assays that address more specific biological questions; and total ATP and image data useful for comparison of those additional assay results with fundamental information such as effects of a given reagent in the library on cell viability. Importantly, we make the baseline data, including more than 200,000 images, easily accessible online.
A number of approaches for Cas9-mediated transcriptional activation have recently been developed, allowing target genes to be overexpressed from their endogenous genomic loci. However, these approaches have thus far been limited to cell culture, and this technique has not been demonstrated in vivo in any animal. The technique involving the fewest separate components, and therefore the most amenable to in vivo applications, is the dCas9-VPR system, where a nuclease-dead Cas9 is fused to a highly active chimeric activator domain. In this study, we characterize the dCas9-VPR system in Drosophila cells and in vivo. We show that this system can be used in cell culture to upregulate a range of target genes, singly and in multiplex, and that a single guide RNA upstream of the transcription start site can activate high levels of target transcription. We observe marked heterogeneity in guide RNA efficacy for any given gene, and we confirm that transcription is inhibited by guide RNAs binding downstream of the transcription start site. To demonstrate one application of this technique in cells, we used dCas9-VPR to identify target genes for Twist and Snail, two highly conserved transcription factors that cooperate during Drosophila mesoderm development. In addition, we simultaneously activated both Twist and Snail to identify synergistic responses to this physiologically relevant combination. Finally, we show that dCas9-VPR can activate target genes and cause dominant phenotypes in vivo, providing the first demonstration of dCas9 activation in a multicellular animal. Transcriptional activation using dCas9-VPR thus offers a simple and broadly applicable technique for a variety of overexpression studies.
Lizabeth A Perkins, Laura Holderbaum, Rong Tao, Yanhui Hu, Richelle Sopko, Kim McCall, Donghui Yang-Zhou, Ian Flockhart, Richard Binari, Hye-Seok Shim, Audrey Miller, Amy Housden, Marianna Foos, Sakara Randkelv, Colleen Kelley, Pema Namgyal, Christians Villalta, Lu-Ping Liu, Xia Jiang, Qiao Huan-Huan, Xia Wang, Asao Fujiyama, Atsushi Toyoda, Kathleen Ayers, Allison Blum, Benjamin Czech, Ralph Neumuller, Dong Yan, Amanda Cavallaro, Karen Hibbard, Don Hall, Lynn Cooley, Gregory J Hannon, Ruth Lehmann, Annette Parks, Stephanie E Mohr, Ryu Ueda, Shu Kondo, Jian-Quan Ni, and Norbert Perrimon. 2015. “The Transgenic RNAi Project at Harvard Medical School: Resources and Validation..” Genetics, 201, 3, Pp. 843-52.Abstract
To facilitate large-scale functional studies in Drosophila, the Drosophila Transgenic RNAi Project (TRiP) at Harvard Medical School (HMS) was established along with several goals: developing efficient vectors for RNAi that work in all tissues, generating a genome-scale collection of RNAi stocks with input from the community, distributing the lines as they are generated through existing stock centers, validating as many lines as possible using RT-qPCR and phenotypic analyses, and developing tools and web resources for identifying RNAi lines and retrieving existing information on their quality. With these goals in mind, here we describe in detail the various tools we developed and the status of the collection, which is currently composed of 11,491 lines and covering 71% of Drosophila genes. Data on the characterization of the lines either by RT-qPCR or phenotype is available on a dedicated website, the RNAi Stock Validation and Phenotypes Project (RSVP, http://www.flyrnai.org/RSVP.html), and stocks are available from three stock centers, the Bloomington Drosophila Stock Center (United States), National Institute of Genetics (Japan), and TsingHua Fly Center (China).