High-throughput data analysis

Oaz Nir, Chris Bakal, Norbert Perrimon, and Bonnie Berger. 2010. “Inference of RhoGAP/GTPase regulation using single-cell morphological data from a combinatorial RNAi screen.” Genome Res, 20, 3, Pp. 372-80.Abstract

Biological networks are highly complex systems, consisting largely of enzymes that act as molecular switches to activate/inhibit downstream targets via post-translational modification. Computational techniques have been developed to perform signaling network inference using some high-throughput data sources, such as those generated from transcriptional and proteomic studies, but comparable methods have not been developed to use high-content morphological data, which are emerging principally from large-scale RNAi screens, to these ends. Here, we describe a systematic computational framework based on a classification model for identifying genetic interactions using high-dimensional single-cell morphological data from genetic screens, apply it to RhoGAP/GTPase regulation in Drosophila, and evaluate its efficacy. Augmented by knowledge of the basic structure of RhoGAP/GTPase signaling, namely, that GAPs act directly upstream of GTPases, we apply our framework for identifying genetic interactions to predict signaling relationships between these proteins. We find that our method makes mediocre predictions using only RhoGAP single-knockdown morphological data, yet achieves vastly improved accuracy by including original data from a double-knockdown RhoGAP genetic screen, which likely reflects the redundant network structure of RhoGAP/GTPase signaling. We consider other possible methods for inference and show that our primary model outperforms the alternatives. This work demonstrates the fundamental fact that high-throughput morphological data can be used in a systematic, successful fashion to identify genetic interactions and, using additional elementary knowledge of network structure, to infer signaling relations.

Arunachalam Vinayagam, Yanhui Hu, Meghana Kulkarni, Charles Roesel, Richelle Sopko, Stephanie E Mohr, and Norbert Perrimon. 2013. “Protein complex-based analysis framework for high-throughput data sets.” Sci Signal, 6, 264, Pp. rs5.Abstract

Analysis of high-throughput data increasingly relies on pathway annotation and functional information derived from Gene Ontology. This approach has limitations, in particular for the analysis of network dynamics over time or under different experimental conditions, in which modules within a network rather than complete pathways might respond and change. We report an analysis framework based on protein complexes, which are at the core of network reorganization. We generated a protein complex resource for human, Drosophila, and yeast from the literature and databases of protein-protein interaction networks, with each species having thousands of complexes. We developed COMPLEAT (http://www.flyrnai.org/compleat), a tool for data mining and visualization for complex-based analysis of high-throughput data sets, as well as analysis and integration of heterogeneous proteomics and gene expression data sets. With COMPLEAT, we identified dynamically regulated protein complexes among genome-wide RNA interference data sets that used the abundance of phosphorylated extracellular signal-regulated kinase in cells stimulated with either insulin or epidermal growth factor as the output. The analysis predicted that the Brahma complex participated in the insulin response.

Frederic Bard, Laetitia Casano, Arrate Mallabiabarrena, Erin Wallace, Kota Saito, Hitoshi Kitayama, Gianni Guizzunti, Yue Hu, Franz Wendler, Ramanuj DasGupta, Norbert Perrimon, and Vivek Malhotra. 2006. “Functional genomics reveals genes involved in protein secretion and Golgi organization.” Nature, 439, 7076, Pp. 604-7.Abstract

Yeast genetics and in vitro biochemical analysis have identified numerous genes involved in protein secretion. As compared with yeast, however, the metazoan secretory pathway is more complex and many mechanisms that regulate organization of the Golgi apparatus remain poorly characterized. We performed a genome-wide RNA-mediated interference screen in a Drosophila cell line to identify genes required for constitutive protein secretion. We then classified the genes on the basis of the effect of their depletion on organization of the Golgi membranes. Here we show that depletion of class A genes redistributes Golgi membranes into the endoplasmic reticulum, depletion of class B genes leads to Golgi fragmentation, depletion of class C genes leads to aggregation of Golgi membranes, and depletion of class D genes causes no obvious change. Of the 20 new gene products characterized so far, several localize to the Golgi membranes and the endoplasmic reticulum.

Chris Bakal, Rune Linding, Flora Llense, Elleard Heffern, Enrique Martin-Blanco, Tony Pawson, and Norbert Perrimon. 2008. “Phosphorylation networks regulating JNK activity in diverse genetic backgrounds.” Science, 322, 5900, Pp. 453-6.Abstract

Cellular signaling networks have evolved to enable swift and accurate responses, even in the face of genetic or environmental perturbation. Thus, genetic screens may not identify all the genes that regulate different biological processes. Moreover, although classical screening approaches have succeeded in providing parts lists of the essential components of signaling networks, they typically do not provide much insight into the hierarchical and functional relations that exist among these components. We describe a high-throughput screen in which we used RNA interference to systematically inhibit two genes simultaneously in 17,724 combinations to identify regulators of Drosophila JUN NH(2)-terminal kinase (JNK). Using both genetic and phosphoproteomics data, we then implemented an integrative network algorithm to construct a JNK phosphorylation network, which provides structural and mechanistic insights into the systems architecture of JNK signaling.

Shu Kondo and Norbert Perrimon. 2011. “A genome-wide RNAi screen identifies core components of the G₂-M DNA damage checkpoint.” Sci Signal, 4, 154, Pp. rs1.Abstract

The DNA damage checkpoint, the first pathway known to be activated in response to DNA damage, is a mechanism by which the cell cycle is temporarily arrested to allow DNA repair. The checkpoint pathway transmits signals from the sites of DNA damage to the cell cycle machinery through the evolutionarily conserved ATM (ataxia telangiectasia mutated) and ATR (ATM- and Rad3-related) kinase cascades. We conducted a genome-wide RNAi (RNA interference) screen in Drosophila cells to identify previously unknown genes and pathways required for the G₂-M checkpoint induced by DNA double-strand breaks (DSBs). Our large-scale analysis provided a systems-level view of the G₂-M checkpoint and revealed the coordinated actions of particular classes of proteins, which include those involved in DNA repair, DNA replication, cell cycle control, chromatin regulation, and RNA processing. Further, from the screen and in vivo analysis, we identified previously unrecognized roles of two DNA damage response genes, mus101 and mus312. Our results suggest that the DNA replication preinitiation complex, which includes MUS101, and the MUS312-containing nuclease complexes, which are important for DSB repair, also function in the G₂-M checkpoint. Our results provide insight into the diverse mechanisms that link DNA damage and the checkpoint signaling pathway.

Ulrike S Eggert, Amy A Kiger, Constance Richter, Zachary E Perlman, Norbert Perrimon, Timothy J Mitchison, and Christine M Field. 2004. “Parallel chemical genetic and genome-wide RNAi screens identify cytokinesis inhibitors and targets.” PLoS Biol, 2, 12, Pp. e379.Abstract

Cytokinesis involves temporally and spatially coordinated action of the cell cycle and cytoskeletal and membrane systems to achieve separation of daughter cells. To dissect cytokinesis mechanisms it would be useful to have a complete catalog of the proteins involved, and small molecule tools for specifically inhibiting them with tight temporal control. Finding active small molecules by cell-based screening entails the difficult step of identifying their targets. We performed parallel chemical genetic and genome-wide RNA interference screens in Drosophila cells, identifying 50 small molecule inhibitors of cytokinesis and 214 genes important for cytokinesis, including a new protein in the Aurora B pathway (Borr). By comparing small molecule and RNAi phenotypes, we identified a small molecule that inhibits the Aurora B kinase pathway. Our protein list provides a starting point for systematic dissection of cytokinesis, a direction that will be greatly facilitated by also having diverse small molecule inhibitors, which we have identified. Dissection of the Aurora B pathway, where we found a new gene and a specific small molecule inhibitor, should benefit particularly. Our study shows that parallel RNA interference and small molecule screening is a generally useful approach to identifying active small molecules and their target pathways.

Caroline H Yi, Dodzie K Sogah, Michael Boyce, Alexei Degterev, Dana E Christofferson, and Junying Yuan. 2007. “A genome-wide RNAi screen reveals multiple regulators of caspase activation.” J Cell Biol, 179, 4, Pp. 619-26.Abstract

Apoptosis is an evolutionally conserved cellular suicide mechanism that can be activated in response to a variety of stressful stimuli. Increasing evidence suggests that apoptotic regulation relies on specialized cell death signaling pathways and also integrates diverse signals from additional regulatory circuits, including those of cellular homeostasis. We present a genome-wide RNA interference screen to systematically identify regulators of apoptosis induced by DNA damage in Drosophila melanogaster cells. We identify 47 double- stranded RNA that target a functionally diverse set of genes, including several with a known function in promoting cell death. Further characterization uncovers 10 genes that influence caspase activation upon the removal of Drosophila inhibitor of apoptosis 1. This set includes the Drosophila initiator caspase Dronc and, surprisingly, several metabolic regulators, a candidate tumor suppressor, Charlatan, and an N-acetyltransferase, ARD1. Importantly, several of these genes show functional conservation in regulating apoptosis in mammalian cells. Our data suggest a previously unappreciated fundamental connection between various cellular processes and caspase-dependent cell death.

Stephanie Mohr, Chris Bakal, and Norbert Perrimon. 2010. “Genomic screening with RNAi: results and challenges.” Annu Rev Biochem, 79, Pp. 37-64.Abstract

RNA interference (RNAi) is an effective tool for genome-scale, high-throughput analysis of gene function. In the past five years, a number of genome-scale RNAi high-throughput screens (HTSs) have been done in both Drosophila and mammalian cultured cells to study diverse biological processes, including signal transduction, cancer biology, and host cell responses to infection. Results from these screens have led to the identification of new components of these processes and, importantly, have also provided insights into the complexity of biological systems, forcing new and innovative approaches to understanding functional networks in cells. Here, we review the main findings that have emerged from RNAi HTS and discuss technical issues that remain to be improved, in particular the verification of RNAi results and validation of their biological relevance. Furthermore, we discuss the importance of multiplexed and integrated experimental data analysis pipelines to RNAi HTS.

Ralph A Neumüller, Thomas Gross, Anastasia A Samsonova, Arunachalam Vinayagam, Michael Buckner, Karen Founk, Yanhui Hu, Sara Sharifpoor, Adam P Rosebrock, Brenda Andrews, Fred Winston, and Norbert Perrimon. 2013. “Conserved regulators of nucleolar size revealed by global phenotypic analyses.” Sci Signal, 6, 289, Pp. ra70.Abstract

Regulation of cell growth is a fundamental process in development and disease that integrates a vast array of extra- and intracellular information. A central player in this process is RNA polymerase I (Pol I), which transcribes ribosomal RNA (rRNA) genes in the nucleolus. Rapidly growing cancer cells are characterized by increased Pol I-mediated transcription and, consequently, nucleolar hypertrophy. To map the genetic network underlying the regulation of nucleolar size and of Pol I-mediated transcription, we performed comparative, genome-wide loss-of-function analyses of nucleolar size in Saccharomyces cerevisiae and Drosophila melanogaster coupled with mass spectrometry-based analyses of the ribosomal DNA (rDNA) promoter. With this approach, we identified a set of conserved and nonconserved molecular complexes that control nucleolar size. Furthermore, we characterized a direct role of the histone information regulator (HIR) complex in repressing rRNA transcription in yeast. Our study provides a full-genome, cross-species analysis of a nuclear subcompartment and shows that this approach can identify conserved molecular modules.

Christophe J Echeverri and Norbert Perrimon. 2006. “High-throughput RNAi screening in cultured cells: a user's guide.” Nat Rev Genet, 7, 5, Pp. 373-84.Abstract

RNA interference has re-energized the field of functional genomics by enabling genome-scale loss-of-function screens in cultured cells. Looking back on the lessons that have been learned from the first wave of technology developments and applications in this exciting field, we provide both a user's guide for newcomers to the field and a detailed examination of some more complex issues, particularly concerning optimization and quality control, for more advanced users. From a discussion of cell lines, screening paradigms, reagent types and read-out methodologies, we explore in particular the complexities of designing optimal controls and normalization strategies for these challenging but extremely powerful studies.

Mathias Beller, Carole Sztalryd, Noel Southall, Ming Bell, Herbert Jäckle, Douglas S Auld, and Brian Oliver. 2008. “COPI complex is a regulator of lipid homeostasis.” PLoS Biol, 6, 11, Pp. e292.Abstract

Lipid droplets are ubiquitous triglyceride and sterol ester storage organelles required for energy storage homeostasis and biosynthesis. Although little is known about lipid droplet formation and regulation, it is clear that members of the PAT (perilipin, adipocyte differentiation related protein, tail interacting protein of 47 kDa) protein family coat the droplet surface and mediate interactions with lipases that remobilize the stored lipids. We identified key Drosophila candidate genes for lipid droplet regulation by RNA interference (RNAi) screening with an image segmentation-based optical read-out system, and show that these regulatory functions are conserved in the mouse. Those include the vesicle-mediated Coat Protein Complex I (COPI) transport complex, which is required for limiting lipid storage. We found that COPI components regulate the PAT protein composition at the lipid droplet surface, and promote the association of adipocyte triglyceride lipase (ATGL) with the lipid droplet surface to mediate lipolysis. Two compounds known to inhibit COPI function, Exo1 and Brefeldin A, phenocopy COPI knockdowns. Furthermore, RNAi inhibition of ATGL and simultaneous drug treatment indicate that COPI and ATGL function in the same pathway. These data indicate that the COPI complex is an evolutionarily conserved regulator of lipid homeostasis, and highlight an interaction between vesicle transport systems and lipid droplets.

Stephanie E Mohr and Norbert Perrimon. 2012. “RNAi screening: new approaches, understandings, and organisms.” Wiley Interdiscip Rev RNA, 3, 2, Pp. 145-58.Abstract

RNA interference (RNAi) leads to sequence-specific knockdown of gene function. The approach can be used in large-scale screens to interrogate function in various model organisms and an increasing number of other species. Genome-scale RNAi screens are routinely performed in cultured or primary cells or in vivo in organisms such as C. elegans. High-throughput RNAi screening is benefitting from the development of sophisticated new instrumentation and software tools for collecting and analyzing data, including high-content image data. The results of large-scale RNAi screens have already proved useful, leading to new understandings of gene function relevant to topics such as infection, cancer, obesity, and aging. Nevertheless, important caveats apply and should be taken into consideration when developing or interpreting RNAi screens. Some level of false discovery is inherent to high-throughput approaches and specific to RNAi screens, false discovery due to off-target effects (OTEs) of RNAi reagents remains a problem. The need to improve our ability to use RNAi to elucidate gene function at large scale and in additional systems continues to be addressed through improved RNAi library design, development of innovative computational and analysis tools and other approaches.

Sara Cherry, Tammy Doukas, Susan Armknecht, Sean Whelan, Hui Wang, Peter Sarnow, and Norbert Perrimon. 2005. “Genome-wide RNAi screen reveals a specific sensitivity of IRES-containing RNA viruses to host translation inhibition.” Genes Dev, 19, 4, Pp. 445-52.Abstract

The widespread class of RNA viruses that utilize internal ribosome entry sites (IRESs) for translation include poliovirus and Hepatitis C virus. To identify host factors required for IRES-dependent translation and viral replication, we performed a genome-wide RNAi screen in Drosophila cells infected with Drosophila C virus (DCV). We identified 66 ribosomal proteins that, when depleted, specifically inhibit DCV growth, but not a non-IRES-containing RNA virus. Moreover, treatment of flies with a translation inhibitor is protective in vivo. Finally, this increased sensitivity to ribosome levels also holds true for poliovirus infection of human cells, demonstrating the generality of these findings.

Natalie G Farny, Jessica A Hurt, and Pamela A Silver. 2008. “Definition of global and transcript-specific mRNA export pathways in metazoans.” Genes Dev, 22, 1, Pp. 66-78.Abstract

Eukaryotic gene expression requires export of messenger RNAs (mRNAs) from their site of transcription in the nucleus to the cytoplasm where they are translated. While mRNA export has been studied in yeast, the complexity of gene structure and cellular function in metazoan cells has likely led to increased diversification of these organisms' export pathways. Here we report the results of a genome-wide RNAi screen in which we identify 72 factors required for polyadenylated [poly-(A(+))] mRNA export from the nucleus in Drosophila cells. Using structural and functional conservation analysis of yeast and Drosophila mRNA export factors, we expose the evolutionary divergence of eukaryotic mRNA export pathways. Additionally, we demonstrate the differential export requirements of two endogenous heat-inducible transcripts--intronless heat-shock protein 70 (HSP70) and intron-containing HSP83--and identify novel export factors that participate in HSP83 mRNA splicing. We characterize several novel factors and demonstrate their participation in interactions with known components of the Drosophila export machinery. One of these factors, Drosophila melanogaster PCI domain-containing protein 2 (dmPCID2), associates with polysomes and may bridge the transition between exported messenger ribonucleoprotein particles (mRNPs) and polysomes. Our results define the global network of factors involved in Drosophila mRNA export, reveal specificity in the export requirements of different transcripts, and expose new avenues for future work in mRNA export.

Christine Akimana, Souhaila Al-Khodor, and Yousef Abu Kwaik. 2010. “Host factors required for modulation of phagosome biogenesis and proliferation of Francisella tularensis within the cytosol.” PLoS One, 5, 6, Pp. e11025.Abstract

Francisella tularensis is a highly infectious facultative intracellular bacterium that can be transmitted between mammals by arthropod vectors. Similar to many other intracellular bacteria that replicate within the cytosol, such as Listeria, Shigella, Burkholderia, and Rickettsia, the virulence of F. tularensis depends on its ability to modulate biogenesis of its phagosome and to escape into the host cell cytosol where it proliferates. Recent studies have identified the F. tularensis genes required for modulation of phagosome biogenesis and escape into the host cell cytosol within human and arthropod-derived cells. However, the arthropod and mammalian host factors required for intracellular proliferation of F. tularensis are not known. We have utilized a forward genetic approach employing genome-wide RNAi screen in Drosophila melanogaster-derived cells. Screening a library of approximately 21,300 RNAi, we have identified at least 186 host factors required for intracellular bacterial proliferation. We silenced twelve mammalian homologues by RNAi in HEK293T cells and identified three conserved factors, the PI4 kinase PI4KCA, the ubiquitin hydrolase USP22, and the ubiquitin ligase CDC27, which are also required for replication in human cells. The PI4KCA and USP22 mammalian factors are not required for modulation of phagosome biogenesis or phagosomal escape but are required for proliferation within the cytosol. In contrast, the CDC27 ubiquitin ligase is required for evading lysosomal fusion and for phagosomal escape into the cytosol. Although F. tularensis interacts with the autophagy pathway during late stages of proliferation in mouse macrophages, this does not occur in human cells. Our data suggest that F. tularensis utilizes host ubiquitin turnover in distinct mechanisms during the phagosomal and cytosolic phases and phosphoinositide metabolism is essential for cytosolic proliferation of F. tularensis. Our data will facilitate deciphering molecular ecology, patho-adaptation of F. tularensis to the arthropod vector and its role in bacterial ecology and patho-evolution to infect mammals.