The Protein database is a collection of sequences from several sources, including translations from annotated coding regions in GenBank, RefSeq and TPA, as well as records from SwissProt, PIR, PRF, and PDB.

Preferred Prefix
Missing Contact
Pattern for Local Unique Identifiers

Local identifiers in NCBI Protein should match this regular expression:

Example Local Unique Identifier
CAA71118.1   Resolve
Pattern for CURIES

Compact URIs (CURIEs) constructed from NCBI Protein should match this regular expression:

Example CURIE
Canonical For (2)
insdc.cds refseq
Metaregistry NCBI Protein

The metaregistry provides mappings between the Bioregistry and other registries. There are 6 mappings to external registries for ncbiprotein with 4 unique external prefixes.

Registry Name Metaprefix External Prefix
BioContext biocontext NCBIProtein
Gene Ontology Registry go NCBI_NP logo miriam ncbiprotein
Name-to-Thing Name-to-Thing logo n2t ncbiprotein
Prefix Commons prefixcommons ncbi.protein
Registry of Research Data Repositories re3data r3d100010776

Providers are various services that resolve CURIEs to URLs. The example CURIE ncbiprotein:CAA71118.1 is used to demonstrate the provides available for ncbiprotein. Generation of OLS and BioPortal URLs requires additional programmatic logic beyond string formatting.

Name Metaprefix URI
NCBI Protein ncbiprotein
Bioregistry bioregistry miriam
Name-to-Thing n2t