GenBank

Nucleic Acids Res. 2018 Jan 4;46(D1):D41-D47. doi: 10.1093/nar/gkx1094.

Abstract

GenBank® (www.ncbi.nlm.nih.gov/genbank/) is a comprehensive database that contains publicly available nucleotide sequences for 400 000 formally described species. These sequences are obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects, including whole genome shotgun and environmental sampling projects. Most submissions are made using BankIt, the National Center for Biotechnology Information (NCBI) Submission Portal, or the tool tbl2asn. GenBank staff assign accession numbers upon data receipt. Daily data exchange with the European Nucleotide Archive and the DNA Data Bank of Japan ensures worldwide coverage. GenBank is accessible through the NCBI Nucleotide database, which links to related information such as taxonomy, genomes, protein sequences and structures, and biomedical journal literature in PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP. Recent updates include changes to sequence identifiers, submission wizards for 16S and Influenza sequences, and an Identical Protein Groups resource.

Publication types

  • Research Support, N.I.H., Intramural

MeSH terms

  • Animals
  • Computational Biology
  • Databases, Nucleic Acid* / statistics & numerical data
  • Databases, Nucleic Acid* / trends
  • Europe
  • Genomics
  • Humans
  • Information Dissemination
  • Information Storage and Retrieval
  • Internet
  • Japan
  • National Library of Medicine (U.S.)
  • Orthomyxoviridae / genetics
  • Proteomics
  • RNA, Ribosomal / genetics
  • Sequence Alignment
  • United States

Substances

  • RNA, Ribosomal