RNA interference (RNAi) leads to sequence-specific knockdown of gene function. The approach can be used in large-scale screens to interrogate function in various model organisms and an increasing number of other species. Genome-scale RNAi screens are routinely performed in cultured or primary cells or in vivo in organisms such as C. elegans. High-throughput RNAi screening is benefitting from the development of sophisticated new instrumentation and software tools for collecting and analyzing data, including high-content image data. The results of large-scale RNAi screens have already proved useful, leading to new understandings of gene function relevant to topics such as infection, cancer, obesity, and aging. Nevertheless, important caveats apply and should be taken into consideration when developing or interpreting RNAi screens. Some level of false discovery is inherent to high-throughput approaches and specific to RNAi screens, false discovery due to off-target effects (OTEs) of RNAi reagents remains a problem. The need to improve our ability to use RNAi to elucidate gene function at large scale and in additional systems continues to be addressed through improved RNAi library design, development of innovative computational and analysis tools and other approaches.
The Drosophila MSL complex mediates dosage compensation by increasing transcription of the single X chromosome in males approximately two-fold. This is accomplished through recognition of the X chromosome and subsequent acetylation of histone H4K16 on X-linked genes. Initial binding to the X is thought to occur at "entry sites" that contain a consensus sequence motif ("MSL recognition element" or MRE). However, this motif is only ∼2 fold enriched on X, and only a fraction of the motifs on X are initially targeted. Here we ask whether chromatin context could distinguish between utilized and non-utilized copies of the motif, by comparing their relative enrichment for histone modifications and chromosomal proteins mapped in the modENCODE project. Through a comparative analysis of the chromatin features in male S2 cells (which contain MSL complex) and female Kc cells (which lack the complex), we find that the presence of active chromatin modifications, together with an elevated local GC content in the surrounding sequences, has strong predictive value for functional MSL entry sites, independent of MSL binding. We tested these sites for function in Kc cells by RNAi knockdown of Sxl, resulting in induction of MSL complex. We show that ectopic MSL expression in Kc cells leads to H4K16 acetylation around these sites and a relative increase in X chromosome transcription. Collectively, our results support a model in which a pre-existing active chromatin environment, coincident with H3K36me3, contributes to MSL entry site selection. The consequences of MSL targeting of the male X chromosome include increase in nucleosome lability, enrichment for H4K16 acetylation and JIL-1 kinase, and depletion of linker histone H1 on active X-linked genes. Our analysis can serve as a model for identifying chromatin and local sequence features that may contribute to selection of functional protein binding sites in the genome.
While the 26S proteasome is a key proteolytic complex, little is known about how proteasome levels are maintained in higher eukaryotic cells. Here we describe an RNA interference (RNAi) screen of Drosophila melanogaster that was used to identify transcription factors that may play a role in maintaining levels of the 26S proteasome. We used an RNAi library against 993 Drosophila transcription factor genes to identify genes whose suppression in Schneider 2 cells stabilized a ubiquitin-green fluorescent protein reporter protein. This screen identified Cnc (cap 'n' collar [CNC]; basic region leucine zipper) as a candidate transcriptional regulator of proteasome component expression. In fact, 20S proteasome activity was reduced in cells depleted of cnc. Immunoblot assays against proteasome components revealed a general decline in both 19S regulatory complex and 20S proteasome subunits after RNAi depletion of this transcription factor. Transcript-specific silencing revealed that the longest of the seven transcripts for the cnc gene, cnc-C, was needed for proteasome and p97 ATPase production. Quantitative reverse transcription-PCR confirmed the role of Cnc-C in activation of transcription of genes encoding proteasome components. Expression of a V5-His-tagged form of Cnc-C revealed that the transcription factor is itself a proteasome substrate that is stabilized when the proteasome is inhibited. We propose that this single cnc gene in Drosophila resembles the ancestral gene family of mammalian nuclear factor erythroid-derived 2-related transcription factors, which are essential in regulating oxidative stress and proteolysis.
Although a large number of actin-binding proteins and their regulators have been identified through classical approaches, gaps in our knowledge remain. Here, we used genome-wide RNA interference as a systematic method to define metazoan actin regulators based on visual phenotype. Using comparative screens in cultured Drosophila and human cells, we generated phenotypic profiles for annotated actin regulators together with proteins bearing predicted actin-binding domains. These phenotypic clusters for the known metazoan "actinome" were used to identify putative new core actin regulators, together with a number of genes with conserved but poorly studied roles in the regulation of the actin cytoskeleton, several of which we studied in detail. This work suggests that although our search for new components of the core actin machinery is nearing saturation, regulation at the level of nuclear actin export, RNA splicing, ubiquitination, and other upstream processes remains an important but unexplored frontier of actin biology.
BACKGROUND: High-throughput screening using RNAi is a powerful gene discovery method but is often complicated by false positive and false negative results. Whereas false positive results associated with RNAi reagents has been a matter of extensive study, the issue of false negatives has received less attention. RESULTS: We performed a meta-analysis of several genome-wide, cell-based Drosophila RNAi screens, together with a more focused RNAi screen, and conclude that the rate of false negative results is at least 8%. Further, we demonstrate how knowledge of the cell transcriptome can be used to resolve ambiguous results and how the number of false negative results can be reduced by using multiple, independently-tested RNAi reagents per gene. CONCLUSIONS: RNAi reagents that target the same gene do not always yield consistent results due to false positives and weak or ineffective reagents. False positive results can be partially minimized by filtering with transcriptome data. RNAi libraries with multiple reagents per gene also reduce false positive and false negative outcomes when inconsistent results are disambiguated carefully.
Wnt proteins are secreted, lipid-modified glycoproteins that control animal development and adult tissue homeostasis. Secretion of Wnt proteins is at least partly regulated by a dedicated machinery. Here, we report a genome-wide RNA interference screen for genes involved in the secretion of Wingless (Wg), a Drosophila Wnt. We identify three new genes required for Wg secretion. Of these, Emp24 and Eclair are required for proper export of Wg from the endoplasmic reticulum (ER). We propose that Emp24 and Eca act as specific cargo receptors for Wg to concentrate it in forming vesicles at sites of ER export.
The DNA damage checkpoint, the first pathway known to be activated in response to DNA damage, is a mechanism by which the cell cycle is temporarily arrested to allow DNA repair. The checkpoint pathway transmits signals from the sites of DNA damage to the cell cycle machinery through the evolutionarily conserved ATM (ataxia telangiectasia mutated) and ATR (ATM- and Rad3-related) kinase cascades. We conducted a genome-wide RNAi (RNA interference) screen in Drosophila cells to identify previously unknown genes and pathways required for the G₂-M checkpoint induced by DNA double-strand breaks (DSBs). Our large-scale analysis provided a systems-level view of the G₂-M checkpoint and revealed the coordinated actions of particular classes of proteins, which include those involved in DNA repair, DNA replication, cell cycle control, chromatin regulation, and RNA processing. Further, from the screen and in vivo analysis, we identified previously unrecognized roles of two DNA damage response genes, mus101 and mus312. Our results suggest that the DNA replication preinitiation complex, which includes MUS101, and the MUS312-containing nuclease complexes, which are important for DSB repair, also function in the G₂-M checkpoint. Our results provide insight into the diverse mechanisms that link DNA damage and the checkpoint signaling pathway.
Characterizing the extent and logic of signaling networks is essential to understanding specificity in such physiological and pathophysiological contexts as cell fate decisions and mechanisms of oncogenesis and resistance to chemotherapy. Cell-based RNA interference (RNAi) screens enable the inference of large numbers of genes that regulate signaling pathways, but these screens cannot provide network structure directly. We describe an integrated network around the canonical receptor tyrosine kinase (RTK)-Ras-extracellular signal-regulated kinase (ERK) signaling pathway, generated by combining parallel genome-wide RNAi screens with protein-protein interaction (PPI) mapping by tandem affinity purification-mass spectrometry. We found that only a small fraction of the total number of PPI or RNAi screen hits was isolated under all conditions tested and that most of these represented the known canonical pathway components, suggesting that much of the core canonical ERK pathway is known. Because most of the newly identified regulators are likely cell type- and RTK-specific, our analysis provides a resource for understanding how output through this clinically relevant pathway is regulated in different contexts. We report in vivo roles for several of the previously unknown regulators, including CG10289 and PpV, the Drosophila orthologs of two components of the serine/threonine-protein phosphatase 6 complex; the Drosophila ortholog of TepIV, a glycophosphatidylinositol-linked protein mutated in human cancers; CG6453, a noncatalytic subunit of glucosidase II; and Rtf1, a histone methyltransferase.
Systems biology aims to describe the complex interplays between cellular building blocks which, in their concurrence, give rise to the emergent properties observed in cellular behaviors and responses. This approach tries to determine the molecular players and the architectural principles of their interactions within the genetic networks that control certain biological processes. Large-scale loss-of-function screens, applicable in various different model systems, have begun to systematically interrogate entire genomes to identify the genes that contribute to a certain cellular response. In particular, RNA interference (RNAi)-based high-throughput screens have been instrumental in determining the composition of regulatory systems and paired with integrative data analyses have begun to delineate the genetic networks that control cell biological and developmental processes. Through the creation of tools for both, in vitro and in vivo genome-wide RNAi screens, Drosophila melanogaster has emerged as one of the key model organisms in systems biology research and over the last years has massively contributed to and hence shaped this discipline. WIREs Syst Biol Med 2011 3 471-478 DOI: 10.1002/wsbm.127
Cell-based high content screening (HCS) is becoming an important and increasingly favored approach in therapeutic drug discovery and functional genomics. In HCS, changes in cellular morphology and biomarker distributions provide an information-rich profile of cellular responses to experimental treatments such as small molecules or gene knockdown probes. One obstacle that currently exists with such cell-based assays is the availability of image processing algorithms that are capable of reliably and automatically analyzing large HCS image sets. HCS images of primary neuronal cell cultures are particularly challenging to analyze due to complex cellular morphology. Here we present a robust method for quantifying and statistically analyzing the morphology of neuronal cells in HCS images. The major advantages of our method over existing software lie in its capability to correct non-uniform illumination using the contrast-limited adaptive histogram equalization method; segment neuromeres using Gabor-wavelet texture analysis; and detect faint neurites by a novel phase-based neurite extraction algorithm that is invariant to changes in illumination and contrast and can accurately localize neurites. Our method was successfully applied to analyze a large HCS image set generated in a morphology screen for polyglutamine-mediated neuronal toxicity using primary neuronal cell cultures derived from embryos of a Drosophila Huntington's Disease (HD) model.
BACKGROUND: A genomic catalogue of protein-protein interactions is a rich source of information, particularly for exploring the relationships between proteins. Numerous systems-wide and small-scale experiments have been conducted to identify interactions; however, our knowledge of all interactions for any one species is incomplete, and alternative means to expand these network maps is needed. We therefore took a comparative biology approach to predict protein-protein interactions across five species (human, mouse, fly, worm, and yeast) and developed InterologFinder for research biologists to easily navigate this data. We also developed a confidence score for interactions based on available experimental evidence and conservation across species. RESULTS: The connectivity of the resultant networks was determined to have scale-free distribution, small-world properties, and increased local modularity, indicating that the added interactions do not disrupt our current understanding of protein network structures. We show examples of how these improved interactomes can be used to analyze a genome-scale dataset (RNAi screen) and to assign new function to proteins. Predicted interactions within this dataset were tested by co-immunoprecipitation, resulting in a high rate of validation, suggesting the high quality of networks produced. CONCLUSIONS: Protein-protein interactions were predicted in five species, based on orthology. An InteroScore, a score accounting for homology, number of orthologues with evidence of interactions, and number of unique observations of interactions, is given to each known and predicted interaction. Our website http://www.interologfinder.org provides research biologists intuitive access to this data.
Hypoxia-inducible factors (HIFs) are a family of evolutionary conserved alpha-beta heterodimeric transcription factors that induce a wide range of genes in response to low oxygen tension. Molecular mechanisms that mediate oxygen-dependent HIF regulation operate at the level of the alpha subunit, controlling protein stability, subcellular localization, and transcriptional coactivator recruitment. We have conducted an unbiased genome-wide RNA interference (RNAi) screen in Drosophila cells aimed to the identification of genes required for HIF activity. After 3 rounds of selection, 30 genes emerged as critical HIF regulators in hypoxia, most of which had not been previously associated with HIF biology. The list of genes includes components of chromatin remodeling complexes, transcription elongation factors, and translational regulators. One remarkable hit was the argonaute 1 (ago1) gene, a central element of the microRNA (miRNA) translational silencing machinery. Further studies confirmed the physiological role of the miRNA machinery in HIF-dependent transcription. This study reveals the occurrence of novel mechanisms of HIF regulation, which might contribute to developing novel strategies for therapeutic intervention of HIF-related pathologies, including heart attack, cancer, and stroke.
Akt represents a nodal point between the Insulin receptor and TOR signaling, and its activation by phosphorylation controls cell proliferation, cell size, and metabolism. The activity of Akt must be carefully balanced, as increased Akt signaling is frequently associated with cancer and as insufficient Akt signaling is linked to metabolic disease and diabetes mellitus. Using a genome-wide RNAi screen in Drosophila cells in culture, and in vivo analyses in the third instar wing imaginal disc, we studied the regulatory circuitries that define dAkt activation. We provide evidence that negative feedback regulation of dAkt occurs during normal Drosophila development in vivo. Whereas in cell culture dAkt is regulated by S6 Kinase (S6K)-dependent negative feedback, this feedback inhibition only plays a minor role in vivo. In contrast, dAkt activation under wild-type conditions is defined by feedback inhibition that depends on TOR Complex 1 (TORC1), but is S6K-independent. This feedback inhibition is switched from TORC1 to S6K only in the context of enhanced TORC1 activity, as triggered by mutations in tsc2. These results illustrate how the Akt-TOR pathway dynamically adapts the routing of negative feedback in response to the activity load of its signaling circuit in vivo.
Protein aggregates are a common pathological feature of most neurodegenerative diseases (NDs). Understanding their formation and regulation will help clarify their controversial roles in disease pathogenesis. To date, there have been few systematic studies of aggregates formation in Drosophila, a model organism that has been applied extensively in modeling NDs and screening for toxicity modifiers. We generated transgenic fly lines that express enhanced-GFP-tagged mutant Huntingtin (Htt) fragments with different lengths of polyglutamine (polyQ) tract and showed that these Htt mutants develop protein aggregates in a polyQ-length- and age-dependent manner in Drosophila. To identify central regulators of protein aggregation, we further generated stable Drosophila cell lines expressing these Htt mutants and also established a cell-based quantitative assay that allows automated measurement of aggregates within cells. We then performed a genomewide RNA interference screen for regulators of mutant Htt aggregation and isolated 126 genes involved in diverse cellular processes. Interestingly, although our screen focused only on mutant Htt aggregation, several of the identified candidates were known previously as toxicity modifiers of NDs. Moreover, modulating the in vivo activity of hsp110 (CG6603) or tra1, two hits from the screen, affects neurodegeneration in a dose-dependent manner in a Drosophila model of Huntington's disease. Thus, other aggregates regulators isolated in our screen may identify additional genes involved in the protein-folding pathway and neurotoxicity.
Genetic screens in the yeast Saccharomyces cerevisiae have identified many proteins involved in the secretory pathway, most of which have orthologues in higher eukaryotes. To investigate whether there are additional proteins that are required for secretion in metazoans but are absent from yeast, we used genome-wide RNA interference (RNAi) to look for genes required for secretion of recombinant luciferase from Drosophila S2 cells. This identified two novel components of the secretory pathway that are conserved from humans to plants. Gryzun is distantly related to, but distinct from, the Trs130 subunit of the TRAPP complex but is absent from S. cerevisiae. RNAi of human Gryzun (C4orf41) blocks Golgi exit. Kish is a small membrane protein with a previously uncharacterised orthologue in yeast. The screen also identified Drosophila orthologues of almost 60% of the yeast genes essential for secretion. Given this coverage, the small number of novel components suggests that contrary to previous indications the number of essential core components of the secretory pathway is not much greater in metazoans than in yeasts.
RNA interference (RNAi) is an effective tool for genome-scale, high-throughput analysis of gene function. In the past five years, a number of genome-scale RNAi high-throughput screens (HTSs) have been done in both Drosophila and mammalian cultured cells to study diverse biological processes, including signal transduction, cancer biology, and host cell responses to infection. Results from these screens have led to the identification of new components of these processes and, importantly, have also provided insights into the complexity of biological systems, forcing new and innovative approaches to understanding functional networks in cells. Here, we review the main findings that have emerged from RNAi HTS and discuss technical issues that remain to be improved, in particular the verification of RNAi results and validation of their biological relevance. Furthermore, we discuss the importance of multiplexed and integrated experimental data analysis pipelines to RNAi HTS.
Francisella tularensis is a highly infectious facultative intracellular bacterium that can be transmitted between mammals by arthropod vectors. Similar to many other intracellular bacteria that replicate within the cytosol, such as Listeria, Shigella, Burkholderia, and Rickettsia, the virulence of F. tularensis depends on its ability to modulate biogenesis of its phagosome and to escape into the host cell cytosol where it proliferates. Recent studies have identified the F. tularensis genes required for modulation of phagosome biogenesis and escape into the host cell cytosol within human and arthropod-derived cells. However, the arthropod and mammalian host factors required for intracellular proliferation of F. tularensis are not known. We have utilized a forward genetic approach employing genome-wide RNAi screen in Drosophila melanogaster-derived cells. Screening a library of approximately 21,300 RNAi, we have identified at least 186 host factors required for intracellular bacterial proliferation. We silenced twelve mammalian homologues by RNAi in HEK293T cells and identified three conserved factors, the PI4 kinase PI4KCA, the ubiquitin hydrolase USP22, and the ubiquitin ligase CDC27, which are also required for replication in human cells. The PI4KCA and USP22 mammalian factors are not required for modulation of phagosome biogenesis or phagosomal escape but are required for proliferation within the cytosol. In contrast, the CDC27 ubiquitin ligase is required for evading lysosomal fusion and for phagosomal escape into the cytosol. Although F. tularensis interacts with the autophagy pathway during late stages of proliferation in mouse macrophages, this does not occur in human cells. Our data suggest that F. tularensis utilizes host ubiquitin turnover in distinct mechanisms during the phagosomal and cytosolic phases and phosphoinositide metabolism is essential for cytosolic proliferation of F. tularensis. Our data will facilitate deciphering molecular ecology, patho-adaptation of F. tularensis to the arthropod vector and its role in bacterial ecology and patho-evolution to infect mammals.
Biological networks are highly complex systems, consisting largely of enzymes that act as molecular switches to activate/inhibit downstream targets via post-translational modification. Computational techniques have been developed to perform signaling network inference using some high-throughput data sources, such as those generated from transcriptional and proteomic studies, but comparable methods have not been developed to use high-content morphological data, which are emerging principally from large-scale RNAi screens, to these ends. Here, we describe a systematic computational framework based on a classification model for identifying genetic interactions using high-dimensional single-cell morphological data from genetic screens, apply it to RhoGAP/GTPase regulation in Drosophila, and evaluate its efficacy. Augmented by knowledge of the basic structure of RhoGAP/GTPase signaling, namely, that GAPs act directly upstream of GTPases, we apply our framework for identifying genetic interactions to predict signaling relationships between these proteins. We find that our method makes mediocre predictions using only RhoGAP single-knockdown morphological data, yet achieves vastly improved accuracy by including original data from a double-knockdown RhoGAP genetic screen, which likely reflects the redundant network structure of RhoGAP/GTPase signaling. We consider other possible methods for inference and show that our primary model outperforms the alternatives. This work demonstrates the fundamental fact that high-throughput morphological data can be used in a systematic, successful fashion to identify genetic interactions and, using additional elementary knowledge of network structure, to infer signaling relations.
Poxviruses include medically important human pathogens, yet little is known about the specific cellular factors essential for their replication. To identify genes essential for poxvirus infection, we used high-throughput RNA interference to screen the Drosophila kinome for factors required for vaccinia infection. We identified seven genes including the three subunits of AMPK as promoting vaccinia infection. AMPK not only facilitated infection in insect cells, but also in mammalian cells. Moreover, we found that AMPK is required for macropinocytosis, a major endocytic entry pathway for vaccinia. Furthermore, we show that AMPK contributes to other virus-independent actin-dependent processes including lamellipodia formation and wound healing, independent of the known AMPK activators LKB1 and CaMKK. Therefore, AMPK plays a highly conserved role in poxvirus infection and actin dynamics independent of its role as an energy regulator.
BACKGROUND: The Notch signaling pathway regulates a diverse array of developmental processes, and aberrant Notch signaling can lead to diseases, including cancer. To obtain a more comprehensive understanding of the genetic network that integrates into Notch signaling, we performed a genome-wide RNAi screen in Drosophila cell culture to identify genes that modify Notch-dependent transcription. RESULTS: Employing complementary data analyses, we found 399 putative modifiers: 189 promoting and 210 antagonizing Notch activated transcription. These modifiers included several known Notch interactors, validating the robustness of the assay. Many novel modifiers were also identified, covering a range of cellular localizations from the extracellular matrix to the nucleus, as well as a large number of proteins with unknown function. Chromatin-modifying proteins represent a major class of genes identified, including histone deacetylase and demethylase complex components and other chromatin modifying, remodeling and replacement factors. A protein-protein interaction map of the Notch-dependent transcription modifiers revealed that a large number of the identified proteins interact physically with these core chromatin components. CONCLUSIONS: The genome-wide RNAi screen identified many genes that can modulate Notch transcriptional output. A protein interaction map of the identified genes highlighted a network of chromatin-modifying enzymes and remodelers that regulate Notch transcription. Our results open new avenues to explore the mechanisms of Notch signal regulation and the integration of this pathway into diverse cellular processes.