High-throughput data analysis

Oaz Nir, Chris Bakal, Norbert Perrimon, and Bonnie Berger. 2010. “Inference of RhoGAP/GTPase regulation using single-cell morphological data from a combinatorial RNAi screen.” Genome Res, 20, 3, Pp. 372-80.Abstract

Biological networks are highly complex systems, consisting largely of enzymes that act as molecular switches to activate/inhibit downstream targets via post-translational modification. Computational techniques have been developed to perform signaling network inference using some high-throughput data sources, such as those generated from transcriptional and proteomic studies, but comparable methods have not been developed to use high-content morphological data, which are emerging principally from large-scale RNAi screens, to these ends. Here, we describe a systematic computational framework based on a classification model for identifying genetic interactions using high-dimensional single-cell morphological data from genetic screens, apply it to RhoGAP/GTPase regulation in Drosophila, and evaluate its efficacy. Augmented by knowledge of the basic structure of RhoGAP/GTPase signaling, namely, that GAPs act directly upstream of GTPases, we apply our framework for identifying genetic interactions to predict signaling relationships between these proteins. We find that our method makes mediocre predictions using only RhoGAP single-knockdown morphological data, yet achieves vastly improved accuracy by including original data from a double-knockdown RhoGAP genetic screen, which likely reflects the redundant network structure of RhoGAP/GTPase signaling. We consider other possible methods for inference and show that our primary model outperforms the alternatives. This work demonstrates the fundamental fact that high-throughput morphological data can be used in a systematic, successful fashion to identify genetic interactions and, using additional elementary knowledge of network structure, to infer signaling relations.

Arunachalam Vinayagam, Yanhui Hu, Meghana Kulkarni, Charles Roesel, Richelle Sopko, Stephanie E Mohr, and Norbert Perrimon. 2013. “Protein complex-based analysis framework for high-throughput data sets.” Sci Signal, 6, 264, Pp. rs5.Abstract

Analysis of high-throughput data increasingly relies on pathway annotation and functional information derived from Gene Ontology. This approach has limitations, in particular for the analysis of network dynamics over time or under different experimental conditions, in which modules within a network rather than complete pathways might respond and change. We report an analysis framework based on protein complexes, which are at the core of network reorganization. We generated a protein complex resource for human, Drosophila, and yeast from the literature and databases of protein-protein interaction networks, with each species having thousands of complexes. We developed COMPLEAT (http://www.flyrnai.org/compleat), a tool for data mining and visualization for complex-based analysis of high-throughput data sets, as well as analysis and integration of heterogeneous proteomics and gene expression data sets. With COMPLEAT, we identified dynamically regulated protein complexes among genome-wide RNA interference data sets that used the abundance of phosphorylated extracellular signal-regulated kinase in cells stimulated with either insulin or epidermal growth factor as the output. The analysis predicted that the Brahma complex participated in the insulin response.

Frederic Bard, Laetitia Casano, Arrate Mallabiabarrena, Erin Wallace, Kota Saito, Hitoshi Kitayama, Gianni Guizzunti, Yue Hu, Franz Wendler, Ramanuj DasGupta, Norbert Perrimon, and Vivek Malhotra. 2006. “Functional genomics reveals genes involved in protein secretion and Golgi organization.” Nature, 439, 7076, Pp. 604-7.Abstract

Yeast genetics and in vitro biochemical analysis have identified numerous genes involved in protein secretion. As compared with yeast, however, the metazoan secretory pathway is more complex and many mechanisms that regulate organization of the Golgi apparatus remain poorly characterized. We performed a genome-wide RNA-mediated interference screen in a Drosophila cell line to identify genes required for constitutive protein secretion. We then classified the genes on the basis of the effect of their depletion on organization of the Golgi membranes. Here we show that depletion of class A genes redistributes Golgi membranes into the endoplasmic reticulum, depletion of class B genes leads to Golgi fragmentation, depletion of class C genes leads to aggregation of Golgi membranes, and depletion of class D genes causes no obvious change. Of the 20 new gene products characterized so far, several localize to the Golgi membranes and the endoplasmic reticulum.

Chris Bakal, Rune Linding, Flora Llense, Elleard Heffern, Enrique Martin-Blanco, Tony Pawson, and Norbert Perrimon. 2008. “Phosphorylation networks regulating JNK activity in diverse genetic backgrounds.” Science, 322, 5900, Pp. 453-6.Abstract

Cellular signaling networks have evolved to enable swift and accurate responses, even in the face of genetic or environmental perturbation. Thus, genetic screens may not identify all the genes that regulate different biological processes. Moreover, although classical screening approaches have succeeded in providing parts lists of the essential components of signaling networks, they typically do not provide much insight into the hierarchical and functional relations that exist among these components. We describe a high-throughput screen in which we used RNA interference to systematically inhibit two genes simultaneously in 17,724 combinations to identify regulators of Drosophila JUN NH(2)-terminal kinase (JNK). Using both genetic and phosphoproteomics data, we then implemented an integrative network algorithm to construct a JNK phosphorylation network, which provides structural and mechanistic insights into the systems architecture of JNK signaling.

Shu Kondo and Norbert Perrimon. 2011. “A genome-wide RNAi screen identifies core components of the G₂-M DNA damage checkpoint.” Sci Signal, 4, 154, Pp. rs1.Abstract

The DNA damage checkpoint, the first pathway known to be activated in response to DNA damage, is a mechanism by which the cell cycle is temporarily arrested to allow DNA repair. The checkpoint pathway transmits signals from the sites of DNA damage to the cell cycle machinery through the evolutionarily conserved ATM (ataxia telangiectasia mutated) and ATR (ATM- and Rad3-related) kinase cascades. We conducted a genome-wide RNAi (RNA interference) screen in Drosophila cells to identify previously unknown genes and pathways required for the G₂-M checkpoint induced by DNA double-strand breaks (DSBs). Our large-scale analysis provided a systems-level view of the G₂-M checkpoint and revealed the coordinated actions of particular classes of proteins, which include those involved in DNA repair, DNA replication, cell cycle control, chromatin regulation, and RNA processing. Further, from the screen and in vivo analysis, we identified previously unrecognized roles of two DNA damage response genes, mus101 and mus312. Our results suggest that the DNA replication preinitiation complex, which includes MUS101, and the MUS312-containing nuclease complexes, which are important for DSB repair, also function in the G₂-M checkpoint. Our results provide insight into the diverse mechanisms that link DNA damage and the checkpoint signaling pathway.

Ulrike S Eggert, Amy A Kiger, Constance Richter, Zachary E Perlman, Norbert Perrimon, Timothy J Mitchison, and Christine M Field. 2004. “Parallel chemical genetic and genome-wide RNAi screens identify cytokinesis inhibitors and targets.” PLoS Biol, 2, 12, Pp. e379.Abstract

Cytokinesis involves temporally and spatially coordinated action of the cell cycle and cytoskeletal and membrane systems to achieve separation of daughter cells. To dissect cytokinesis mechanisms it would be useful to have a complete catalog of the proteins involved, and small molecule tools for specifically inhibiting them with tight temporal control. Finding active small molecules by cell-based screening entails the difficult step of identifying their targets. We performed parallel chemical genetic and genome-wide RNA interference screens in Drosophila cells, identifying 50 small molecule inhibitors of cytokinesis and 214 genes important for cytokinesis, including a new protein in the Aurora B pathway (Borr). By comparing small molecule and RNAi phenotypes, we identified a small molecule that inhibits the Aurora B kinase pathway. Our protein list provides a starting point for systematic dissection of cytokinesis, a direction that will be greatly facilitated by also having diverse small molecule inhibitors, which we have identified. Dissection of the Aurora B pathway, where we found a new gene and a specific small molecule inhibitor, should benefit particularly. Our study shows that parallel RNA interference and small molecule screening is a generally useful approach to identifying active small molecules and their target pathways.

Caroline H Yi, Dodzie K Sogah, Michael Boyce, Alexei Degterev, Dana E Christofferson, and Junying Yuan. 2007. “A genome-wide RNAi screen reveals multiple regulators of caspase activation.” J Cell Biol, 179, 4, Pp. 619-26.Abstract

Apoptosis is an evolutionally conserved cellular suicide mechanism that can be activated in response to a variety of stressful stimuli. Increasing evidence suggests that apoptotic regulation relies on specialized cell death signaling pathways and also integrates diverse signals from additional regulatory circuits, including those of cellular homeostasis. We present a genome-wide RNA interference screen to systematically identify regulators of apoptosis induced by DNA damage in Drosophila melanogaster cells. We identify 47 double- stranded RNA that target a functionally diverse set of genes, including several with a known function in promoting cell death. Further characterization uncovers 10 genes that influence caspase activation upon the removal of Drosophila inhibitor of apoptosis 1. This set includes the Drosophila initiator caspase Dronc and, surprisingly, several metabolic regulators, a candidate tumor suppressor, Charlatan, and an N-acetyltransferase, ARD1. Importantly, several of these genes show functional conservation in regulating apoptosis in mammalian cells. Our data suggest a previously unappreciated fundamental connection between various cellular processes and caspase-dependent cell death.

Stephanie Mohr, Chris Bakal, and Norbert Perrimon. 2010. “Genomic screening with RNAi: results and challenges.” Annu Rev Biochem, 79, Pp. 37-64.Abstract

RNA interference (RNAi) is an effective tool for genome-scale, high-throughput analysis of gene function. In the past five years, a number of genome-scale RNAi high-throughput screens (HTSs) have been done in both Drosophila and mammalian cultured cells to study diverse biological processes, including signal transduction, cancer biology, and host cell responses to infection. Results from these screens have led to the identification of new components of these processes and, importantly, have also provided insights into the complexity of biological systems, forcing new and innovative approaches to understanding functional networks in cells. Here, we review the main findings that have emerged from RNAi HTS and discuss technical issues that remain to be improved, in particular the verification of RNAi results and validation of their biological relevance. Furthermore, we discuss the importance of multiplexed and integrated experimental data analysis pipelines to RNAi HTS.

Ralph A Neumüller, Thomas Gross, Anastasia A Samsonova, Arunachalam Vinayagam, Michael Buckner, Karen Founk, Yanhui Hu, Sara Sharifpoor, Adam P Rosebrock, Brenda Andrews, Fred Winston, and Norbert Perrimon. 2013. “Conserved regulators of nucleolar size revealed by global phenotypic analyses.” Sci Signal, 6, 289, Pp. ra70.Abstract

Regulation of cell growth is a fundamental process in development and disease that integrates a vast array of extra- and intracellular information. A central player in this process is RNA polymerase I (Pol I), which transcribes ribosomal RNA (rRNA) genes in the nucleolus. Rapidly growing cancer cells are characterized by increased Pol I-mediated transcription and, consequently, nucleolar hypertrophy. To map the genetic network underlying the regulation of nucleolar size and of Pol I-mediated transcription, we performed comparative, genome-wide loss-of-function analyses of nucleolar size in Saccharomyces cerevisiae and Drosophila melanogaster coupled with mass spectrometry-based analyses of the ribosomal DNA (rDNA) promoter. With this approach, we identified a set of conserved and nonconserved molecular complexes that control nucleolar size. Furthermore, we characterized a direct role of the histone information regulator (HIR) complex in repressing rRNA transcription in yeast. Our study provides a full-genome, cross-species analysis of a nuclear subcompartment and shows that this approach can identify conserved molecular modules.

Pages