Uncovering how transcription factors (TFs) regulate their targets at the DNA, RNA and protein levels over time is critical to define gene regulatory networks (GRNs) in normal and diseased states. RNA-seq has become a standard method to measure gene regulation using an established set of analysis steps. However, none of the currently available pipeline methods for interpreting ordered genomic data (in time or space) use time series models to assign cause and effect relationships within GRNs, are adaptive to diverse experimental designs, or enable user interpretation through a web-based platform. Furthermore, methods which integrate ordered RNA-seq data with transcription factor binding data are urgently needed. Here, we present TIMEOR (Trajectory Inference and Mechanism Exploration with Omics data in R), the first web-based and adaptive time series multi-omics pipeline method which infers the relationship between gene regulatory events across time. TIMEOR addresses the critical need for methods to predict causal regulatory mechanism networks between TFs from time series multi-omics data. We used TIMEOR to identify a new link between insulin stimulation and the circadian rhythm cycle. TIMEOR is available at https://github.com/ashleymaeconard/TIMEOR.git.
Manipulation of gene expression is one of the best approaches for studying gene function in vivo. CRISPR-Cas13 has the potential to be a powerful technique for manipulating RNA expression in diverse animal systems in vivo, including Drosophila melanogaster. Studies using Cas13 in mammalian cell lines for gene knockdown showed increased on-target efficiency and decreased off-targeting relative to RNAi. Moreover, catalytically inactive Cas13 fusions can be used to image RNA molecules, install precise changes to the epitranscriptome, or alter splicing. However, recent studies have suggested that there may be limitations to the deployment of these tools in Drosophila, so further optimization of the system is required. Here, we report a new set of PspCas13b and RfxCas13d expression constructs and use these reagents to successfully knockdown both reporter and endogenous transcripts in Drosophila cells. As toxicity issues have been reported with high level of Cas13, we effectively decreased PspCas13b expression without impairing its function by tuning down translation. Furthermore, we altered the spatial activity of both PspCas13b and RfxCas13d by introducing Nuclear Exportation Sequences (NES) and Nuclear Localization Sequences (NLS) while maintaining activity. Finally, we generated a stable cell line expressing RfxCas13d under the inducible metallothionein promoter, establishing a useful tool for high-throughput genetic screening. Thus, we report new reagents for performing RNA CRISPR-Cas13 experiments in Drosophila, providing additional Cas13 expression constructs that retain activity.
With the advent of single-cell RNA sequencing (scRNA-seq) technologies, there has been a spike in studies involving scRNA-seq of several tissues across diverse species including Drosophila. Although a few databases exist for users to query genes of interest within the scRNA-seq studies, search tools that enable users to find orthologous genes and their cell type-specific expression patterns across species are limited. Here, we built a new search database, called DRscDB (https://www.flyrnai.org/tools/single_cell/web/) to address this need. DRscDB serves as a comprehensive repository for published scRNA-seq datasets for Drosophila and the relevant datasets from human and other model organisms. DRscDB is based on manual curation of Drosophila scRNA-seq studies of various tissue types and their corresponding analogous tissues in vertebrates including zebrafish, mouse, and human. Of note, our search database provides most of the literature-derived marker genes, thus preserving the original analysis of the published scRNA-seq datasets. DRscDB serves as a web-based user interface that allows users to mine, utilize and compare gene expression data pertaining to scRNA-seq datasets from the published literature.
Precise genome editing is a valuable tool to study gene function in model organisms. Prime editing, a precise editing system developed in mammalian cells, does not require double-strand breaks or donor DNA and has low off-target effects. Here, we applied prime editing for the model organism Drosophila melanogaster and developed conditions for optimal editing. By expressing prime editing components in cultured cells or somatic cells of transgenic flies, we precisely introduce premature stop codons in three classical visible marker genes, ebony, white, and forked Furthermore, by restricting editing to germ cells, we demonstrate efficient germ-line transmission of a precise edit in ebony to 36% of progeny. Our results suggest that prime editing is a useful system in Drosophila to study gene function, such as engineering precise point mutations, deletions, or epitope tags.
The FlyRNAi database at the Drosophila RNAi Screening Center and Transgenic RNAi Project (DRSC/TRiP) provides a suite of online resources that facilitate functional genomics studies with a special emphasis on Drosophila melanogaster. Currently, the database provides: gene-centric resources that facilitate ortholog mapping and mining of information about orthologs in common genetic model species; reagent-centric resources that help researchers identify RNAi and CRISPR sgRNA reagents or designs; and data-centric resources that facilitate visualization and mining of transcriptomics data, protein modification data, protein interactions, and more. Here, we discuss updated and new features that help biological and biomedical researchers efficiently identify, visualize, analyze, and integrate information and data for Drosophila and other species. Together, these resources facilitate multiple steps in functional genomics workflows, from building gene and reagent lists to management, analysis, and integration of data.
The accumulation of biological and biomedical literature outpaces the ability of most researchers and clinicians to stay abreast of their own immediate fields, let alone a broader range of topics. Although available search tools support identification of relevant literature, finding relevant and key publications is not always straightforward. For example, important publications might be missed in searches with an official gene name due to gene synonyms. Moreover, ambiguity of gene names can result in retrieval of a large number of irrelevant publications. To address these issues and help researchers and physicians quickly identify relevant publications, we developed BioLitMine, an advanced literature mining tool that takes advantage of the medical subject heading (MeSH) index and gene-to-publication annotations already available for PubMed literature. Using BioLitMine, a user can identify what MeSH terms are represented in the set of publications associated with a given gene of the interest, or start with a term and identify relevant publications. Users can also use the tool to find co-cited genes and a build a literature co-citation network. In addition, BioLitMine can help users build a gene list relevant to a MeSH terms, such as a list of genes relevant to "stem cells" or "breast neoplasms." Users can also start with a gene or pathway of interest and identify authors associated with that gene or pathway, a feature that makes it easier to identify experts who might serve as collaborators or reviewers. Altogether, BioLitMine extends the value of PubMed-indexed literature and its existing expert curation by providing a robust and gene-centric approach to retrieval of relevant information.
Precise and efficient genome modifications provide powerful tools for biological studies. Previous CRISPR gene knockout methods in cell lines have relied on frameshifts caused by stochastic insertion/deletion in all alleles. However, this method is inefficient for genes with high copy number due to polyploidy or gene amplification because frameshifts in all alleles can be difficult to generate and detect. Here we describe a homology-directed insertion method to knockout genes in the polyploid Drosophila S2R+ cell line. This protocol allows generation of homozygous mutant cell lines using an insertion cassette which autocatalytically generates insertion mutations in all alleles. Knockout cells generated using this method can be directly identified by PCR without a need for DNA sequencing. This protocol takes 2-3 months and can be applied to other polyploid cell lines or high-copy-number genes.
Jonathan Zirin, Yanhui Hu, Luping Liu, Donghui Yang-Zhou, Ryan Colbeth, Dong Yan, Ben Ewen-Campen, Rong Tao, Eric Vogt, Sara VanNest, Cooper Cavers, Christians Villalta, Aram Comjean, Jin Sun, Xia Wang, Yu Jia, Ruibao Zhu, Ping Peng, Jinchao Yu, Da Shen, Yuhao Qiu, Limmond Ayisi, Henna Ragoowansi, Ethan Fenton, Senait Efrem, Annette Parks, Kuniaki Saito, Shu Kondo, Liz Perkins, Stephanie E Mohr, Jianquan Ni, and Norbert Perrimon. 2020. “Large-Scale Transgenic Resource Collections for Loss- and Gain-of-Function Studies.” Genetics.Abstract
The Transgenic RNAi Project (TRiP), a functional genomics platform at Harvard Medical School, was initiated in 2008 to generate and distribute a genome-scale collection of RNAi fly stocks. To date, the TRiP has generated >15,000 RNAi fly stocks. As this covers most genes, we have largely transitioned to development of new resources based on CRISPR technology. Here, we present an update on our libraries of publicly available RNAi and CRISPR fly stocks, and focus on the TRiP-CRISPR overexpression (TRiP-OE) and TRiP-CRISPR knockout (TRiP-KO) collections. TRiP-OE stocks express sgRNAs targeting upstream of a gene transcription start site. Gene activation is triggered by co-expression of catalytically dead Cas9 (dCas9) fused to an activator domain, either VP64-p65-Rta (VPR) or Synergistic Activation Mediator (SAM). TRiP-KO stocks express one or two sgRNAs targeting the coding sequence of a gene or genes. Cutting is triggered by co-expression of Cas9, allowing for generation of indels in both germline and somatic tissue. To date, we have generated more than 5,000 CRISPR-OE or -KO stocks for the community. These resources provide versatile, transformative tools for gene activation, gene repression, and genome engineering.
CRISPR-Cas9 is a powerful genome editing technology in which a short guide RNA (sgRNA) confers target site specificity to achieve Cas9-mediated genome editing. Numerous sgRNA design tools have been developed based on reference genomes for humans and model organisms. However, existing resources are not optimal as genetic mutations or single nucleotide polymorphisms (SNPs) within the targeting region affect the efficiency of CRISPR-based approaches by interfering with guide-target complementarity. To facilitate identification of sgRNAs (1) in non-reference genomes, (2) across varying genetic backgrounds, or (3) for specific targeting of SNP-containing alleles, for example, disease relevant mutations, we developed a web tool, SNP-CRISPR (https://www.flyrnai.org/tools/snp_crispr/). SNP-CRISPR can be used to design sgRNAs based on public variant data sets or user-identified variants. In addition, the tool computes efficiency and specificity scores for sgRNA designs targeting both the variant and the reference. Moreover, SNP-CRISPR provides the option to upload multiple SNPs and target single or multiple nearby base changes simultaneously with a single sgRNA design. Given these capabilities, SNP-CRISPR has a wide range of potential research applications in model systems and potential applications for design of sgRNAs for disease-associated mutant correction.
Raghuvir Viswanatha, Roderick Brathwaite, Yanhui Hu, Zhongchi Li, Jonathan Rodiger, Pierre Merckaert, Verena Chung, Stephanie E Mohr, and Norbert Perrimon. 2019. “Pooled CRISPR Screens in Drosophila Cells.” Curr Protoc Mol Biol, 129, 1, Pp. e111.Abstract
Cell division and tissue growth must be coordinated with development. Defects in these processes are the basis for a number of diseases, including developmental malformations and cancer. We have conducted an unbiased RNAi screen for genes that are required for growth in the wing, using GAL4-inducible short hairpin RNA (shRNA) fly strains made by the Drosophila RNAi Screening Center. shRNA expression down the center of the larval wing disc using , and the central region of the adult wing was then scored for tissue growth and wing hair morphology. Out of 4,753 shRNA crosses that survived to adulthood, 18 had impaired wing growth. FlyBase and the new Alliance of Genome Resources knowledgebases were used to determine the known or predicted functions of these genes and the association of their human orthologs with disease. The function of eight of the genes identified has not been previously defined in The genes identified included those with known or predicted functions in cell cycle, chromosome segregation, morphogenesis, metabolism, steroid processing, transcription, and translation. All but one of the genes are similar to those in humans, and many are associated with disease. Knockdown of , a subunit of the Myb-MuvB transcription factor, or β, a gene involved in protein folding and trafficking, resulted in a switch from cell proliferation to an endoreplication growth program through which wing tissue grew by an increase in cell size (hypertrophy). It is anticipated that further analysis of the genes that we have identified will reveal new mechanisms that regulate tissue growth during development.
We previously reported a CRISPR-mediated knock-in strategy into introns of genes, generating an - transgenic library for multiple uses (Lee et al., 2018b). The method relied on double stranded DNA (dsDNA) homology donors with ~1 kb homology arms. Here, we describe three new simpler ways to edit genes in flies. We create single stranded DNA (ssDNA) donors using PCR and add 100 nt of homology on each side of an integration cassette, followed by enzymatic removal of one strand. Using this method, we generated GFP-tagged proteins that mark organelles in S2 cells. We then describe two dsDNA methods using cheap synthesized donors flanked by 100 nt homology arms and gRNA target sites cloned into a plasmid. Upon injection, donor DNA (1 to 5 kb) is released from the plasmid by Cas9. The cassette integrates efficiently and precisely . The approach is fast, cheap, and scalable.
Inactivation of the tumor suppressor gene is the signature initiating event in clear cell renal cell carcinoma (ccRCC), the most common form of kidney cancer, and causes the accumulation of hypoxia-inducible factor 2α (HIF-2α). HIF-2α inhibitors are effective in some ccRCC cases, but both de novo and acquired resistance have been observed in the laboratory and in the clinic. Here, we identified synthetic lethality between decreased activity of cyclin-dependent kinases 4 and 6 (CDK4/6) and inactivation in two species (human and ) and across diverse human ccRCC cell lines in culture and xenografts. Although HIF-2α transcriptionally induced the CDK4/6 partner cyclin D1, HIF-2α was not required for the increased CDK4/6 requirement of ccRCC cells. Accordingly, the antiproliferative effects of CDK4/6 inhibition were synergistic with HIF-2α inhibition in HIF-2α-dependent ccRCC cells and not antagonistic with HIF-2α inhibition in HIF-2α-independent cells. These findings support testing CDK4/6 inhibitors as treatments for ccRCC, alone and in combination with HIF-2α inhibitors.
Understanding human gene function is fundamental to understanding and treating diseases. Research using the model organism Drosophila melanogaster benefits from a wealth of molecular genetic resources and information useful for efficient invivo experimentation. Moreover, Drosophila offers a balance as a relatively simple organism that nonetheless exhibits complex multicellular activities. Recent examples demonstrate the power and continued promise of Drosophila research to further our understanding of conserved gene functions.
Methionine restriction (MetR) extends lifespan across different species and exerts beneficial effects on metabolic health and inflammatory responses. In contrast, certain cancer cells exhibit methionine auxotrophy that can be exploited for therapeutic treatment, as decreasing dietary methionine selectively suppresses tumor growth. Thus, MetR represents an intervention that can extend lifespan with a complementary effect of delaying tumor growth. Beyond its function in protein synthesis, methionine feeds into complex metabolic pathways including the methionine cycle, the transsulfuration pathway, and polyamine biosynthesis. Manipulation of each of these branches extends lifespan; however, the interplay between MetR and these branches during regulation of lifespan is not well understood. In addition, a potential mechanism linking the activity of methionine metabolism and lifespan is regulation of production of the methyl donor S-adenosylmethionine, which, after transferring its methyl group, is converted to S-adenosylhomocysteine. Methylation regulates a wide range of processes, including those thought to be responsible for lifespan extension by MetR. Although the exact mechanisms of lifespan extension by MetR or methionine metabolism reprogramming are unknown, it may act via reducing the rate of translation, modifying gene expression, inducing a hormetic response, modulating autophagy, or inducing mitochondrial function, antioxidant defense, or other metabolic processes. Here, we review the mechanisms of lifespan extension by MetR and different branches of methionine metabolism in different species and the potential for exploiting the regulation of methyltransferases to delay aging.
Post-translational modification (PTM) serves as a regulatory mechanism for protein function, influencing their stability, interactions, activity and localization, and is critical in many signaling pathways. The best characterized PTM is phosphorylation, whereby a phosphate is added to an acceptor residue, most commonly serine, threonine and tyrosine in metazoans. As proteins are often phosphorylated at multiple sites, identifying those sites that are important for function is a challenging problem. Considering that any given phosphorylation site might be non-functional, prioritizing evolutionarily conserved phosphosites provides a general strategy to identify the putative functional sites. To facilitate the identification of conserved phosphosites, we generated a large-scale phosphoproteomics dataset from embryos collected from six closely-related species. We built iProteinDB (https://www.flyrnai.org/tools/iproteindb/), a resource integrating these data with other high-throughput PTM datasets, including vertebrates, and manually curated information for At iProteinDB, scientists can view the PTM landscape for any protein and identify predicted functional phosphosites based on a comparative analysis of data from closely-related species. Further, iProteinDB enables comparison of PTM data from to that of orthologous proteins from other model organisms, including human, mouse, rat, , , and .