Paramecium

14,000,000 Leading Edge Experts on the ideXlab platform

Scan Science and Technology

Contact Leading Edge Experts & Companies

The Experts below are selected from a list of 261 Experts worldwide ranked by ideXlab platform

Linda Sperling - One of the best experts on this subject based on the ideXlab platform.

  • ParameciumDB 2019: integrating genomic data across the genus for functional and evolutionary biology
    Nucleic Acids Research, 2019
    Co-Authors: Olivier Arnaiz, Eric Meyer, Linda Sperling
    Abstract:

    ParameciumDB (https://Paramecium.i2bc.paris-saclay.fr) is a community model organism database for the genome and genetics of the ciliate Paramecium. ParameciumDB development relies on the GMOD (www.gmod.org) toolkit. The ParameciumDB web site has been publicly available since 2006 when the P. tetraurelia somatic genome sequence was released, revealing that a series of whole genome duplications punctuated the evolutionary history of the species. The genome is linked to available genetic data and stocks. ParameciumDB has undergone major changes in its content and website since the last update published in 2011. Genomes from multiple Paramecium species, especially from the P. aurelia complex, are now included in ParameciumDB. A new modern web interface accompanies this transition to a database for the whole Paramecium genus. Gene pages have been enriched with orthology relationships, among the Paramecium species and with a panel of model organisms across the eukaryotic tree. This update also presents expert curation of Paramecium mitochondrial genomes.

  • Improved methods and resources for Paramecium genomics: transcription units, gene annotation and gene expression
    BMC Genomics, 2017
    Co-Authors: Olivier Arnaiz, Erwin Van Dijk, Mireille Bétermier, Maoussi Lhuillier-Akakpo, Augustin De Vanssay, Sandra Duharcourt, Erika Sallet, Jérôme Gouzy, Linda Sperling
    Abstract:

    Background: The 15 sibling species of the Paramecium aurelia cryptic species complex emerged after a whole genome duplication that occurred tens of millions of years ago. Given extensive knowledge of the genetics and epigenetics of Paramecium acquired over the last century, this species complex offers a uniquely powerful system to investigate the consequences of whole genome duplication in a unicellular eukaryote as well as the genetic and epigenetic mechanisms that drive speciation. High quality Paramecium gene models are important for research using this system. The major aim of the work reported here was to build an improved gene annotation pipeline for the Paramecium lineage. Results: We generated oriented RNA-Seq transcriptome data across the sexual process of autogamy for the model species Paramecium tetraurelia. We determined, for the first time in a ciliate, candidate P. tetraurelia transcription start sites using an adapted Cap-Seq protocol. We developed TrUC, multi-threaded Perl software that, in conjunction with TopHat mapping of RNA-Seq data to a reference genome, predicts transcription units for the annotation pipeline. We used EuGene software to combine annotation evidence. The high quality gene structural annotations obtained for P. tetraurelia were used as evidence to improve published annotations for 3 other Paramecium species. The RNA-Seq data were also used for differential gene expression analysis, providing a gene expression atlas that is more sensitive than the previously established microarray resource. Conclusions: We have developed a gene annotation pipeline tailored for the compact genomes and tiny introns of Paramecium species. A novel component of this pipeline, TrUC, predicts transcription units using Cap-Seq and oriented RNA-Seq data. TrUC could prove useful beyond Paramecium, especially in the case of high gene density. Accurate predictions of 3′ and 5′ UTR will be particularly valuable for studies of gene expression (e.g. nucleosome positioning, identification of cis regulatory motifs). The P. tetraurelia improved transcriptome resource, gene annotations for P. tetraurelia, P. biaurelia, P. sexaurelia and P. caudatum, and Paramecium-trained EuGene configuration are available through ParameciumDB (http://Paramecium.i2bc.paris-saclay.fr). TrUC software is freely distributed under a GNU GPL v3 licence (https://github.com/oarnaiz/TrUC).

  • Remembrance of things past retrieved from the Paramecium genome.
    Research in Microbiology, 2011
    Co-Authors: Linda Sperling
    Abstract:

    Paramecium and other ciliates are the only unicellular eukaryotes that separate germinal and somatic functions. A germline micronucleus transmits the genetic information to sexual progeny, while a somatic macronucleus expresses the genetic information during vegetative growth to determine the phenotype. At each sexual generation, a new macronucleus develops from the zygotic nucleus through programmed rearrangements of the germline genome. Paramecium tetraurelia somatic genome sequencing, reviewed here, has provided insight into the organization and evolution of the genome. A series of at least 3 whole genome duplications was detected in the Paramecium lineage and selective pressures that determine the fate of the gene duplicates analyzed. Variability in the somatic DNA was characterized and could be attributed to the genome rearrangement processes. Since, in Paramecium, alternative genome rearrangement patterns can be inherited across sexual generations by homology-dependent epigenetic mechanisms and can affect phenotype, I discuss the possibility that ciliate nuclear dimorphism buffers genetic variation hidden in the germline.

  • Random sequencing of Paramecium somatic DNA.
    Eukaryotic Cell, 2002
    Co-Authors: Linda Sperling, Philippe Dessen, Marek Zagulski, Ron E. Pearlman, Andrzej Migdalski, Robert Gromadka, Marine Froissard, Anne-Marie Keller, Jean Cohen
    Abstract:

    We report a random survey of 1 to 2% of the somatic genome of the free-living ciliate Paramecium tetraurelia by single-run sequencing of the ends of plasmid inserts. As in all ciliates, the germ line genome of Paramecium (100 to 200 Mb) is reproducibly rearranged at each sexual cycle to produce a somatic genome of expressed or potentially expressed genes, stripped of repeated sequences, transposons, and AT-rich unique sequence elements limited to the germ line. We found the somatic genome to be compact (>68% coding, estimated from the sequence of several complete library inserts) and to feature uniformly small introns (18 to 35 nucleotides). This facilitated gene discovery: 722 open reading frames (ORFs) were identified by similarity with known proteins, and 119 novel ORFs were tentatively identified by internal comparison of the data set. We determined the phylogenetic position of Paramecium with respect to eukaryotes whose genomes have been sequenced by the distance matrix neighbor-joining method by using random combined protein data from the project. The unrooted tree obtained is very robust and in excellent agreement with accepted topology, providing strong support for the quality and consistency of the data set. Our study demonstrates that a random survey of the somatic genome of Paramecium is a good strategy for gene discovery in this organism.

Olivier Arnaiz - One of the best experts on this subject based on the ideXlab platform.

  • Universal trends of post-duplication evolution revealed by the genomes of 13 Paramecium species sharing an ancestral whole-genome duplication
    bioRxiv, 2019
    Co-Authors: Olivier Arnaiz, Simran Bhullar, Jeanfrancois Pierre Gout, Parul Johri, Thomas G Doak, Arnaud Couloux, Frederic Guerin
    Abstract:

    Whole-Genome Duplications (WGDs) have shaped the gene repertoire of many eukaryotic lineages. The redundancy created by WGDs typically results in a phase of massive gene loss. However, some WGD-derived paralogs are maintained over long evolutionary periods and the relative contributions of different selective pressures to their maintenance are still debated. Previous studies have revealed a history of three successive WGDs in the lineage of the ciliate Paramecium tetraurelia and two of its sister species from the P. aurelia complex. Here, we report the genome sequence and analysis of 10 additional P. aurelia species and one additional outgroup, allowing us to track post-WGD evolution in 13 species that share a common ancestral WGD. We found similar biases in gene retention compatible with dosage constraints playing a major role opposing post-WGD gene loss across all 13 species. Interestingly, we found that post-WGD gene loss was slower in Paramecium than in other species having experienced genome duplication, suggesting that the selective pressures against post-WGD gene loss are especially strong in Paramecium. We also report a lack of recent segmental duplications in Paramecium, which we interpret as additional evidence for strong selective pressures against individual gene dosage changes. Finally, we hope that this exceptional dataset of 13 species sharing an ancestral WGD and two closely related outgroup species will be a useful resource for future studies and will help establish Paramecium as a major model organism in the study of post-WGD evolution.

  • ParameciumDB 2019: integrating genomic data across the genus for functional and evolutionary biology
    Nucleic Acids Research, 2019
    Co-Authors: Olivier Arnaiz, Eric Meyer, Linda Sperling
    Abstract:

    ParameciumDB (https://Paramecium.i2bc.paris-saclay.fr) is a community model organism database for the genome and genetics of the ciliate Paramecium. ParameciumDB development relies on the GMOD (www.gmod.org) toolkit. The ParameciumDB web site has been publicly available since 2006 when the P. tetraurelia somatic genome sequence was released, revealing that a series of whole genome duplications punctuated the evolutionary history of the species. The genome is linked to available genetic data and stocks. ParameciumDB has undergone major changes in its content and website since the last update published in 2011. Genomes from multiple Paramecium species, especially from the P. aurelia complex, are now included in ParameciumDB. A new modern web interface accompanies this transition to a database for the whole Paramecium genus. Gene pages have been enriched with orthology relationships, among the Paramecium species and with a panel of model organisms across the eukaryotic tree. This update also presents expert curation of Paramecium mitochondrial genomes.

  • Improved methods and resources for Paramecium genomics: transcription units, gene annotation and gene expression
    BMC Genomics, 2017
    Co-Authors: Olivier Arnaiz, Erwin Van Dijk, Mireille Bétermier, Maoussi Lhuillier-Akakpo, Augustin De Vanssay, Sandra Duharcourt, Erika Sallet, Jérôme Gouzy, Linda Sperling
    Abstract:

    Background: The 15 sibling species of the Paramecium aurelia cryptic species complex emerged after a whole genome duplication that occurred tens of millions of years ago. Given extensive knowledge of the genetics and epigenetics of Paramecium acquired over the last century, this species complex offers a uniquely powerful system to investigate the consequences of whole genome duplication in a unicellular eukaryote as well as the genetic and epigenetic mechanisms that drive speciation. High quality Paramecium gene models are important for research using this system. The major aim of the work reported here was to build an improved gene annotation pipeline for the Paramecium lineage. Results: We generated oriented RNA-Seq transcriptome data across the sexual process of autogamy for the model species Paramecium tetraurelia. We determined, for the first time in a ciliate, candidate P. tetraurelia transcription start sites using an adapted Cap-Seq protocol. We developed TrUC, multi-threaded Perl software that, in conjunction with TopHat mapping of RNA-Seq data to a reference genome, predicts transcription units for the annotation pipeline. We used EuGene software to combine annotation evidence. The high quality gene structural annotations obtained for P. tetraurelia were used as evidence to improve published annotations for 3 other Paramecium species. The RNA-Seq data were also used for differential gene expression analysis, providing a gene expression atlas that is more sensitive than the previously established microarray resource. Conclusions: We have developed a gene annotation pipeline tailored for the compact genomes and tiny introns of Paramecium species. A novel component of this pipeline, TrUC, predicts transcription units using Cap-Seq and oriented RNA-Seq data. TrUC could prove useful beyond Paramecium, especially in the case of high gene density. Accurate predictions of 3′ and 5′ UTR will be particularly valuable for studies of gene expression (e.g. nucleosome positioning, identification of cis regulatory motifs). The P. tetraurelia improved transcriptome resource, gene annotations for P. tetraurelia, P. biaurelia, P. sexaurelia and P. caudatum, and Paramecium-trained EuGene configuration are available through ParameciumDB (http://Paramecium.i2bc.paris-saclay.fr). TrUC software is freely distributed under a GNU GPL v3 licence (https://github.com/oarnaiz/TrUC).

  • The Ciliary Protein IFT57 in the Macronucleus of Paramecium.
    Journal of Eukaryotic Microbiology, 2017
    Co-Authors: Lei Shi, Olivier Arnaiz, Jean Cohen
    Abstract:

    The intraflagellar transport IFT57 protein is essential for ciliary growth and maintenance. Also known as HIPPI, human IFT57 can be translocated to the nucleus via a molecular partner of Huntingtin, Hip1, inducing gene expression changes. In Paramecium tetraurelia, we identified four IFT57 genes forming two subfamilies, IFT57A/B and IFT57C/D, arising from whole genome duplications. The depletion of proteins of the two subfamilies induced ciliary defects, and IFT57A and IFT57C localized in basal bodies and cilia. We observed that IFT57A, but not IFT57C, is also present in the macronucleus and able to traffic toward the developing anlage during autogamy. Analysis of chimeric IFT57A-IFT57C-GFP-tagged proteins allowed us to identify a region of IFT57A necessary for nuclear localization. We studied the localization of the unique IFT57 protein of Paramecium caudatum, a species that diverged from Paramecium tetraurelia before the whole genome duplications. The Paramecium caudatum IFT57C protein was excluded from the nucleus. We also analyzed whether the overexpression of IFT57A in Paramecium could affect gene transcription as the human protein does in HeLa cells. The expression of some genes was indeed affected by overexpression of IFT57A, but the set of affected genes poorly overlaps the set of genes affected in human cells.

Mireille Bétermier - One of the best experts on this subject based on the ideXlab platform.

  • Improved methods and resources for Paramecium genomics: transcription units, gene annotation and gene expression
    BMC Genomics, 2017
    Co-Authors: Olivier Arnaiz, Erwin Van Dijk, Mireille Bétermier, Maoussi Lhuillier-Akakpo, Augustin De Vanssay, Sandra Duharcourt, Erika Sallet, Jérôme Gouzy, Linda Sperling
    Abstract:

    Background: The 15 sibling species of the Paramecium aurelia cryptic species complex emerged after a whole genome duplication that occurred tens of millions of years ago. Given extensive knowledge of the genetics and epigenetics of Paramecium acquired over the last century, this species complex offers a uniquely powerful system to investigate the consequences of whole genome duplication in a unicellular eukaryote as well as the genetic and epigenetic mechanisms that drive speciation. High quality Paramecium gene models are important for research using this system. The major aim of the work reported here was to build an improved gene annotation pipeline for the Paramecium lineage. Results: We generated oriented RNA-Seq transcriptome data across the sexual process of autogamy for the model species Paramecium tetraurelia. We determined, for the first time in a ciliate, candidate P. tetraurelia transcription start sites using an adapted Cap-Seq protocol. We developed TrUC, multi-threaded Perl software that, in conjunction with TopHat mapping of RNA-Seq data to a reference genome, predicts transcription units for the annotation pipeline. We used EuGene software to combine annotation evidence. The high quality gene structural annotations obtained for P. tetraurelia were used as evidence to improve published annotations for 3 other Paramecium species. The RNA-Seq data were also used for differential gene expression analysis, providing a gene expression atlas that is more sensitive than the previously established microarray resource. Conclusions: We have developed a gene annotation pipeline tailored for the compact genomes and tiny introns of Paramecium species. A novel component of this pipeline, TrUC, predicts transcription units using Cap-Seq and oriented RNA-Seq data. TrUC could prove useful beyond Paramecium, especially in the case of high gene density. Accurate predictions of 3′ and 5′ UTR will be particularly valuable for studies of gene expression (e.g. nucleosome positioning, identification of cis regulatory motifs). The P. tetraurelia improved transcriptome resource, gene annotations for P. tetraurelia, P. biaurelia, P. sexaurelia and P. caudatum, and Paramecium-trained EuGene configuration are available through ParameciumDB (http://Paramecium.i2bc.paris-saclay.fr). TrUC software is freely distributed under a GNU GPL v3 licence (https://github.com/oarnaiz/TrUC).

Augustin De Vanssay - One of the best experts on this subject based on the ideXlab platform.

  • Improved methods and resources for Paramecium genomics: transcription units, gene annotation and gene expression
    BMC Genomics, 2017
    Co-Authors: Olivier Arnaiz, Erwin Van Dijk, Mireille Bétermier, Maoussi Lhuillier-Akakpo, Augustin De Vanssay, Sandra Duharcourt, Erika Sallet, Jérôme Gouzy, Linda Sperling
    Abstract:

    Background: The 15 sibling species of the Paramecium aurelia cryptic species complex emerged after a whole genome duplication that occurred tens of millions of years ago. Given extensive knowledge of the genetics and epigenetics of Paramecium acquired over the last century, this species complex offers a uniquely powerful system to investigate the consequences of whole genome duplication in a unicellular eukaryote as well as the genetic and epigenetic mechanisms that drive speciation. High quality Paramecium gene models are important for research using this system. The major aim of the work reported here was to build an improved gene annotation pipeline for the Paramecium lineage. Results: We generated oriented RNA-Seq transcriptome data across the sexual process of autogamy for the model species Paramecium tetraurelia. We determined, for the first time in a ciliate, candidate P. tetraurelia transcription start sites using an adapted Cap-Seq protocol. We developed TrUC, multi-threaded Perl software that, in conjunction with TopHat mapping of RNA-Seq data to a reference genome, predicts transcription units for the annotation pipeline. We used EuGene software to combine annotation evidence. The high quality gene structural annotations obtained for P. tetraurelia were used as evidence to improve published annotations for 3 other Paramecium species. The RNA-Seq data were also used for differential gene expression analysis, providing a gene expression atlas that is more sensitive than the previously established microarray resource. Conclusions: We have developed a gene annotation pipeline tailored for the compact genomes and tiny introns of Paramecium species. A novel component of this pipeline, TrUC, predicts transcription units using Cap-Seq and oriented RNA-Seq data. TrUC could prove useful beyond Paramecium, especially in the case of high gene density. Accurate predictions of 3′ and 5′ UTR will be particularly valuable for studies of gene expression (e.g. nucleosome positioning, identification of cis regulatory motifs). The P. tetraurelia improved transcriptome resource, gene annotations for P. tetraurelia, P. biaurelia, P. sexaurelia and P. caudatum, and Paramecium-trained EuGene configuration are available through ParameciumDB (http://Paramecium.i2bc.paris-saclay.fr). TrUC software is freely distributed under a GNU GPL v3 licence (https://github.com/oarnaiz/TrUC).

Sandra Duharcourt - One of the best experts on this subject based on the ideXlab platform.

  • Improved methods and resources for Paramecium genomics: transcription units, gene annotation and gene expression
    BMC Genomics, 2017
    Co-Authors: Olivier Arnaiz, Erwin Van Dijk, Mireille Bétermier, Maoussi Lhuillier-Akakpo, Augustin De Vanssay, Sandra Duharcourt, Erika Sallet, Jérôme Gouzy, Linda Sperling
    Abstract:

    Background: The 15 sibling species of the Paramecium aurelia cryptic species complex emerged after a whole genome duplication that occurred tens of millions of years ago. Given extensive knowledge of the genetics and epigenetics of Paramecium acquired over the last century, this species complex offers a uniquely powerful system to investigate the consequences of whole genome duplication in a unicellular eukaryote as well as the genetic and epigenetic mechanisms that drive speciation. High quality Paramecium gene models are important for research using this system. The major aim of the work reported here was to build an improved gene annotation pipeline for the Paramecium lineage. Results: We generated oriented RNA-Seq transcriptome data across the sexual process of autogamy for the model species Paramecium tetraurelia. We determined, for the first time in a ciliate, candidate P. tetraurelia transcription start sites using an adapted Cap-Seq protocol. We developed TrUC, multi-threaded Perl software that, in conjunction with TopHat mapping of RNA-Seq data to a reference genome, predicts transcription units for the annotation pipeline. We used EuGene software to combine annotation evidence. The high quality gene structural annotations obtained for P. tetraurelia were used as evidence to improve published annotations for 3 other Paramecium species. The RNA-Seq data were also used for differential gene expression analysis, providing a gene expression atlas that is more sensitive than the previously established microarray resource. Conclusions: We have developed a gene annotation pipeline tailored for the compact genomes and tiny introns of Paramecium species. A novel component of this pipeline, TrUC, predicts transcription units using Cap-Seq and oriented RNA-Seq data. TrUC could prove useful beyond Paramecium, especially in the case of high gene density. Accurate predictions of 3′ and 5′ UTR will be particularly valuable for studies of gene expression (e.g. nucleosome positioning, identification of cis regulatory motifs). The P. tetraurelia improved transcriptome resource, gene annotations for P. tetraurelia, P. biaurelia, P. sexaurelia and P. caudatum, and Paramecium-trained EuGene configuration are available through ParameciumDB (http://Paramecium.i2bc.paris-saclay.fr). TrUC software is freely distributed under a GNU GPL v3 licence (https://github.com/oarnaiz/TrUC).
