Rat Genome Assemblies, Annotation, and Variant Repository

Methods Mol Biol. 2019:2018:43-70. doi: 10.1007/978-1-4939-9581-3_2.

Abstract

The first and only published version of the rat reference genome sequence was RGSC3.1, accomplished by the Rat Genome Sequencing Project Consortium. Here we present the history of the community effort in the correction of sequence errors and filling missing gaps in the process of refining and providing researchers with a high-quality rat reference sequence. The genome assembly improvements, addition of different evidence resources over time, such as RNA-Seq data, and software development methodologies had a positive impact on the gene model annotations. Over the years we observed a great increase in the numbers of genes, protein coding sequences, predicted transcripts and transcript features. Before the sequencing of the rat genome was possible, first biochemical and next genomic markers like RAPD, AFLP, RFLP, and SSLP were fundamental in research studies involving cross-breeding between different rat strains, in finding the level of polymorphism, linkage mapping, and phylogeny. Linkage maps provide information on recombination rates, give insight into intra- and interspecies gene rearrangements, and help to identify Mendelian loci and Quantitative Trait Loci (QTL). In the 1990s many reports were published on the construction of rat linkage maps that incorporated increasing numbers of markers and facilitated the localization of disease loci. Current genetic monitoring and linkage mapping relies on single nucleotide polymorphisms (SNPs). The Rat Genome Database collects information on genetic variation from the worldwide community of rat researchers and provides tools for searching and retrieving these data. As of today we show details about almost 605 million variants coming from many studies in our Variant Visualizer tool.

Keywords: Gene model annotations; Genomic markers; Reference genome; SNPs; Variants.

MeSH terms

  • Animals
  • Breeding
  • Chromosome Mapping / veterinary
  • Databases, Genetic*
  • Models, Animal*
  • Molecular Sequence Annotation
  • Phylogeny
  • Polymorphism, Single Nucleotide*
  • Quantitative Trait Loci
  • Rats
  • Sequence Analysis, RNA / veterinary
  • Whole Genome Sequencing / veterinary*