The accumulation of biological and biomedical literature outpaces the ability of most researchers and clinicians to stay abreast of their own immediate fields, let alone a broader range of topics. Although available search tools support identification of relevant literature, finding relevant and key publications is not always straightforward. For example, important publications might be missed in searches with an official gene name due to gene synonyms. Moreover, ambiguity of gene names can result in retrieval of a large number of irrelevant publications. To address these issues and help researchers and physicians quickly identify relevant publications, we developed BioLitMine, an advanced literature mining tool that takes advantage of the medical subject heading (MeSH) index and gene-to-publication annotations already available for PubMed literature. Using BioLitMine, a user can identify what MeSH terms are represented in the set of publications associated with a given gene of the interest, or start with a term and identify relevant publications. Users can also use the tool to find co-cited genes and a build a literature co-citation network. In addition, BioLitMine can help users build a gene list relevant to a MeSH terms, such as a list of genes relevant to "stem cells" or "breast neoplasms." Users can also start with a gene or pathway of interest and identify authors associated with that gene or pathway, a feature that makes it easier to identify experts who might serve as collaborators or reviewers. Altogether, BioLitMine extends the value of PubMed-indexed literature and its existing expert curation by providing a robust and gene-centric approach to retrieval of relevant information.
The FlyRNAi database of the Drosophila RNAi Screening Center (DRSC) and Transgenic RNAi Project (TRiP) at Harvard Medical School and associated DRSC/TRiP Functional Genomics Resources website (http://fgr.hms.harvard.edu) serve as a reagent production tracking system, screen data repository, and portal to the community. Through this portal, we make available protocols, online tools, and other resources useful to researchers at all stages of high-throughput functional genomics screening, from assay design and reagent identification to data analysis and interpretation. In this update, we describe recent changes and additions to our website, database and suite of online tools. Recent changes reflect a shift in our focus from a single technology (RNAi) and model species (Drosophila) to the application of additional technologies (e.g. CRISPR) and support of integrated, cross-species approaches to uncovering gene function using functional genomics and other approaches.
Connecting phosphorylation events to kinases and phosphatases is key to understanding the molecular organization and signaling dynamics of networks. We have generated a validated set of transgenic RNA-interference reagents for knockdown and characterization of all protein kinases and phosphatases present during early Drosophila melanogaster development. These genetic tools enable collection of sufficient quantities of embryos depleted of single gene products for proteomics. As a demonstration of an application of the collection, we have used multiplexed isobaric labeling for quantitative proteomics to derive global phosphorylation signatures associated with kinase-depleted embryos to systematically link phosphosites with relevant kinases. We demonstrate how this strategy uncovers kinase consensus motifs and prioritizes phosphoproteins for kinase target validation. We validate this approach by providing auxiliary evidence for Wee kinase-directed regulation of the chromatin regulator Stonewall. Further, we show how correlative phosphorylation at the site level can indicate function, as exemplified by Sterile20-like kinase-dependent regulation of Stat92E.
A major objective of systems biology is to organize molecular interactions as networks and to characterize information flow within networks. We describe a computational framework to integrate protein-protein interaction (PPI) networks and genetic screens to predict the 'signs' of interactions (i.e., activation-inhibition relationships). We constructed a Drosophila melanogaster signed PPI network consisting of 6,125 signed PPIs connecting 3,352 proteins that can be used to identify positive and negative regulators of signaling pathways and protein complexes. We identified an unexpected role for the metabolic enzymes enolase and aldo-keto reductase as positive and negative regulators of proteolysis, respectively. Characterization of the activation-inhibition relationships between physically interacting proteins within signaling pathways will affect our understanding of many biological functions, including signal transduction and mechanisms of disease.
BACKGROUND: RNA interference (RNAi) is an effective and important tool used to study gene function. For large-scale screens, RNAi is used to systematically down-regulate genes of interest and analyze their roles in a biological process. However, RNAi is associated with off-target effects (OTEs), including microRNA (miRNA)-like OTEs. The contribution of reagent-specific OTEs to RNAi screen data sets can be significant. In addition, the post-screen validation process is time and labor intensive. Thus, the availability of robust approaches to identify candidate off-targeted transcripts would be beneficial. RESULTS: Significant efforts have been made to eliminate false positive results attributable to sequence-specific OTEs associated with RNAi. These approaches have included improved algorithms for RNAi reagent design, incorporation of chemical modifications into siRNAs, and the use of various bioinformatics strategies to identify possible OTEs in screen results. Genome-wide Enrichment of Seed Sequence matches (GESS) was developed to identify potential off-targeted transcripts in large-scale screen data by seed-region analysis. Here, we introduce a user-friendly web application that provides researchers a relatively quick and easy way to perform GESS analysis on data from human or mouse cell-based screens using short interfering RNAs (siRNAs) or short hairpin RNAs (shRNAs), as well as for Drosophila screens using shRNAs. Online GESS relies on up-to-date transcript sequence annotations for human and mouse genes extracted from NCBI Reference Sequence (RefSeq) and Drosophila genes from FlyBase. The tool also accommodates analysis with user-provided reference sequence files. CONCLUSION: Online GESS provides a straightforward user interface for genome-wide seed region analysis for human, mouse and Drosophila RNAi screen data. With the tool, users can either use a built-in database or provide a database of transcripts for analysis. This makes it possible to analyze RNAi data from any organism for which the user can provide transcript sequences.
Drosophila melanogaster has become a system of choice for functional genomic studies. Many resources, including online databases and software tools, are now available to support design or identification of relevant fly stocks and reagents or analysis and mining of existing functional genomic, transcriptomic, proteomic, etc. datasets. These include large community collections of fly stocks and plasmid clones, "meta" information sites like FlyBase and FlyMine, and an increasing number of more specialized reagents, databases, and online tools. Here, we introduce key resources useful to plan large-scale functional genomics studies in Drosophila and to analyze, integrate, and mine the results of those studies in ways that facilitate identification of highest-confidence results and generation of new hypotheses. We also discuss ways in which existing resources can be used and might be improved and suggest a few areas of future development that would further support large- and small-scale studies in Drosophila and facilitate use of Drosophila information by the research community more generally.
Gene silencing through sequence-specific targeting of mRNAs by RNAi has enabled genome-wide functional screens in cultured cells and in vivo in model organisms. These screens have resulted in the identification of new cellular pathways and potential drug targets. Considerable progress has been made to improve the quality of RNAi screen data through the development of new experimental and bioinformatics approaches. The recent availability of genome-editing strategies, such as the CRISPR (clustered regularly interspaced short palindromic repeats)-Cas9 system, when combined with RNAi, could lead to further improvements in screen data quality and follow-up experiments, thus promoting our understanding of gene function and gene regulatory networks.
Here, I discuss how RNAi screening can be used effectively to uncover gene function. Specifically, I discuss the types of high-throughput assays that can be done in Drosophila cells and in vivo, RNAi reagent design and available reagent collections, automated screen pipelines, analysis of screen results, and approaches to RNAi results verification.
Reactive Oxygen Species (ROS) are a natural by-product of cellular growth and proliferation, and are required for fundamental processes such as protein-folding and signal transduction. However, ROS accumulation, and the onset of oxidative stress, can negatively impact cellular and genomic integrity. Signalling networks have evolved to respond to oxidative stress by engaging diverse enzymatic and non-enzymatic antioxidant mechanisms to restore redox homeostasis. The architecture of oxidative stress response networks during periods of normal growth, and how increased ROS levels dynamically reconfigure these networks are largely unknown. In order to gain insight into the structure of signalling networks that promote redox homeostasis we first performed genome-scale RNAi screens to identify novel suppressors of superoxide accumulation. We then infer relationships between redox regulators by hierarchical clustering of phenotypic signatures describing how gene inhibition affects superoxide levels, cellular viability, and morphology across different genetic backgrounds. Genes that cluster together are likely to act in the same signalling pathway/complex and thus make "functional interactions". Moreover we also calculate differential phenotypic signatures describing the difference in cellular phenotypes following RNAi between untreated cells and cells submitted to oxidative stress. Using both phenotypic signatures and differential signatures we construct a network model of functional interactions that occur between components of the redox homeostasis network, and how such interactions become rewired in the presence of oxidative stress. This network model predicts a functional interaction between the transcription factor Jun and the IRE1 kinase, which we validate in an orthogonal assay. We thus demonstrate the ability of systems-biology approaches to identify novel signalling events.
Characterizing the extent and logic of signaling networks is essential to understanding specificity in such physiological and pathophysiological contexts as cell fate decisions and mechanisms of oncogenesis and resistance to chemotherapy. Cell-based RNA interference (RNAi) screens enable the inference of large numbers of genes that regulate signaling pathways, but these screens cannot provide network structure directly. We describe an integrated network around the canonical receptor tyrosine kinase (RTK)-Ras-extracellular signal-regulated kinase (ERK) signaling pathway, generated by combining parallel genome-wide RNAi screens with protein-protein interaction (PPI) mapping by tandem affinity purification-mass spectrometry. We found that only a small fraction of the total number of PPI or RNAi screen hits was isolated under all conditions tested and that most of these represented the known canonical pathway components, suggesting that much of the core canonical ERK pathway is known. Because most of the newly identified regulators are likely cell type- and RTK-specific, our analysis provides a resource for understanding how output through this clinically relevant pathway is regulated in different contexts. We report in vivo roles for several of the previously unknown regulators, including CG10289 and PpV, the Drosophila orthologs of two components of the serine/threonine-protein phosphatase 6 complex; the Drosophila ortholog of TepIV, a glycophosphatidylinositol-linked protein mutated in human cancers; CG6453, a noncatalytic subunit of glucosidase II; and Rtf1, a histone methyltransferase.
Systems biology aims to describe the complex interplays between cellular building blocks which, in their concurrence, give rise to the emergent properties observed in cellular behaviors and responses. This approach tries to determine the molecular players and the architectural principles of their interactions within the genetic networks that control certain biological processes. Large-scale loss-of-function screens, applicable in various different model systems, have begun to systematically interrogate entire genomes to identify the genes that contribute to a certain cellular response. In particular, RNA interference (RNAi)-based high-throughput screens have been instrumental in determining the composition of regulatory systems and paired with integrative data analyses have begun to delineate the genetic networks that control cell biological and developmental processes. Through the creation of tools for both, in vitro and in vivo genome-wide RNAi screens, Drosophila melanogaster has emerged as one of the key model organisms in systems biology research and over the last years has massively contributed to and hence shaped this discipline. WIREs Syst Biol Med 2011 3 471-478 DOI: 10.1002/wsbm.127
BACKGROUND: Insights into how the Frizzled/LRP6 receptor complex receives, transduces and terminates Wnt signals will enhance our understanding of the control of the Wnt/ss-catenin pathway. METHODOLOGY/PRINCIPAL FINDINGS: In pursuit of such insights, we performed a genome-wide RNAi screen in Drosophila cells expressing an activated form of LRP6 and a beta-catenin-responsive reporter. This screen resulted in the identification of Bili, a Band4.1-domain containing protein, as a negative regulator of Wnt/beta-catenin signaling. We found that the expression of Bili in Drosophila embryos and larval imaginal discs significantly overlaps with the expression of Wingless (Wg), the Drosophila Wnt ortholog, which is consistent with a potential function for Bili in the Wg pathway. We then tested the functions of Bili in both invertebrate and vertebrate animal model systems. Loss-of-function studies in Drosophila and zebrafish embryos, as well as human cultured cells, demonstrate that Bili is an evolutionarily conserved antagonist of Wnt/beta-catenin signaling. Mechanistically, we found that Bili exerts its antagonistic effects by inhibiting the recruitment of AXIN to LRP6 required during pathway activation. CONCLUSIONS: These studies identify Bili as an evolutionarily conserved negative regulator of the Wnt/beta-catenin pathway.
Lipid droplets are ubiquitous triglyceride and sterol ester storage organelles required for energy storage homeostasis and biosynthesis. Although little is known about lipid droplet formation and regulation, it is clear that members of the PAT (perilipin, adipocyte differentiation related protein, tail interacting protein of 47 kDa) protein family coat the droplet surface and mediate interactions with lipases that remobilize the stored lipids. We identified key Drosophila candidate genes for lipid droplet regulation by RNA interference (RNAi) screening with an image segmentation-based optical read-out system, and show that these regulatory functions are conserved in the mouse. Those include the vesicle-mediated Coat Protein Complex I (COPI) transport complex, which is required for limiting lipid storage. We found that COPI components regulate the PAT protein composition at the lipid droplet surface, and promote the association of adipocyte triglyceride lipase (ATGL) with the lipid droplet surface to mediate lipolysis. Two compounds known to inhibit COPI function, Exo1 and Brefeldin A, phenocopy COPI knockdowns. Furthermore, RNAi inhibition of ATGL and simultaneous drug treatment indicate that COPI and ATGL function in the same pathway. These data indicate that the COPI complex is an evolutionarily conserved regulator of lipid homeostasis, and highlight an interaction between vesicle transport systems and lipid droplets.
Multiple centrosomes in tumor cells create the potential for multipolar divisions that can lead to aneuploidy and cell death. Nevertheless, many cancer cells successfully divide because of mechanisms that suppress multipolar mitoses. A genome-wide RNAi screen in Drosophila S2 cells and a secondary analysis in cancer cells defined mechanisms that suppress multipolar mitoses. In addition to proteins that organize microtubules at the spindle poles, we identified novel roles for the spindle assembly checkpoint, cortical actin cytoskeleton, and cell adhesion. Using live cell imaging and fibronectin micropatterns, we found that interphase cell shape and adhesion pattern can determine the success of the subsequent mitosis in cells with extra centrosomes. These findings may identify cancer-selective therapeutic targets: HSET, a normally nonessential kinesin motor, was essential for the viability of certain extra centrosome-containing cancer cells. Thus, morphological features of cancer cells can be linked to unique genetic requirements for survival.
Cellular signaling networks have evolved to enable swift and accurate responses, even in the face of genetic or environmental perturbation. Thus, genetic screens may not identify all the genes that regulate different biological processes. Moreover, although classical screening approaches have succeeded in providing parts lists of the essential components of signaling networks, they typically do not provide much insight into the hierarchical and functional relations that exist among these components. We describe a high-throughput screen in which we used RNA interference to systematically inhibit two genes simultaneously in 17,724 combinations to identify regulators of Drosophila JUN NH(2)-terminal kinase (JNK). Using both genetic and phosphoproteomics data, we then implemented an integrative network algorithm to construct a JNK phosphorylation network, which provides structural and mechanistic insights into the systems architecture of JNK signaling.
Off-target effects have been demonstrated to be a major source of false-positives in RNA interference (RNAi) high-throughput screens. In this study, we re-assess the previously published transcriptional reporter-based whole-genome RNAi screens for the Wingless and Hedgehog signaling pathways using second generation double-stranded RNA libraries. Furthermore, we investigate other factors that may influence the outcome of such screens, including cell-type specificity, robustness of reporters, and assay normalization, which determine the efficacy of RNAi-knockdown of target genes.
Metazoan replication-dependent histone mRNAs are not polyadenylated and instead end in a conserved stem loop that is the cis element responsible for coordinate posttranscriptional regulation of these mRNAs. Using biochemical approaches, only a limited number of factors required for cleavage of histone pre-mRNA have been identified. We therefore performed a genome-wide RNA interference screen in Drosophila cells using a GFP reporter that is expressed only when histone pre-mRNA processing is disrupted. Four of the 24 genes identified encode proteins also necessary for cleavage/polyadenylation, indicating mechanistic conservation in formation of different mRNA 3' ends. We also unexpectedly identified the histone variants H2Av and H3.3A/B. In H2Av mutant cells, U7 snRNP remains active but fails to accumulate at the histone locus, suggesting there is a regulatory pathway that coordinates the production of variant and canonical histones that acts via localization of essential histone pre-mRNA processing factors.
RNA interference has re-energized the field of functional genomics by enabling genome-scale loss-of-function screens in cultured cells. Looking back on the lessons that have been learned from the first wave of technology developments and applications in this exciting field, we provide both a user's guide for newcomers to the field and a detailed examination of some more complex issues, particularly concerning optimization and quality control, for more advanced users. From a discussion of cell lines, screening paradigms, reagent types and read-out methodologies, we explore in particular the complexities of designing optimal controls and normalization strategies for these challenging but extremely powerful studies.
The cytokine-activated Janus kinase (JAK)/signal transducer and activator of transcription (STAT) pathway plays an important role in the control of a wide variety of biological processes. When misregulated, JAK/STAT signaling is associated with various human diseases, such as immune disorders and tumorigenesis. To gain insights into the mechanisms by which JAK/STAT signaling participates in these diverse biological responses, we carried out a genome-wide RNA interference (RNAi) screen in cultured Drosophila cells. We identified 121 genes whose double-stranded RNA (dsRNA)-mediated knockdowns affected STAT92E activity. Of the 29 positive regulators, 13 are required for the tyrosine phosphorylation of STAT92E. Furthermore, we found that the Drosophila homologs of RanBP3 and RanBP10 are negative regulators of JAK/STAT signaling through their control of nucleocytoplasmic transport of STAT92E. In addition, we identified a key negative regulator of Drosophila JAK/STAT signaling, protein tyrosine phosphatase PTP61F, and showed that it is a transcriptional target of JAK/STAT signaling, thus revealing a novel negative feedback loop. Our study has uncovered many uncharacterized genes required for different steps of the JAK/STAT signaling pathway.
The widespread class of RNA viruses that utilize internal ribosome entry sites (IRESs) for translation include poliovirus and Hepatitis C virus. To identify host factors required for IRES-dependent translation and viral replication, we performed a genome-wide RNAi screen in Drosophila cells infected with Drosophila C virus (DCV). We identified 66 ribosomal proteins that, when depleted, specifically inhibit DCV growth, but not a non-IRES-containing RNA virus. Moreover, treatment of flies with a translation inhibitor is protective in vivo. Finally, this increased sensitivity to ribosome levels also holds true for poliovirus infection of human cells, demonstrating the generality of these findings.