CENPB centromere protein B [ Homo sapiens (human) ]

Gene ID: 1059, updated on 5-Mar-2024

Summary

Official Symbol
CENPB (provided by HGNC)
Official Full Name
centromere protein B (provided by HGNC)
Primary source
HGNC:HGNC:1852
See related
Ensembl:ENSG00000125817; MIM:117140; AllianceGenome:HGNC:1852
Gene type
protein coding
RefSeq status
REVIEWED
Organism
Homo sapiens
Lineage
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo
Summary
This gene product is a highly conserved protein that facilitates centromere formation. It is a DNA-binding protein that is derived from transposases of the pogo DNA transposon family. It contains a helix-loop-helix DNA binding motif at the N-terminus, and a dimerization domain at the C-terminus. The DNA binding domain recognizes and binds a 17-bp sequence (CENP-B box) in the centromeric alpha satellite DNA. This protein is proposed to play an important role in the assembly of specific centromere structures in interphase nuclei and on mitotic chromosomes. It is also considered a major centromere autoantigen recognized by sera from patients with anti-centromere antibodies. [provided by RefSeq, Jul 2008]

Genomic context

Location:
20p13
Exon count:
1
Annotation release  Status             Assembly                         Chr  Location
RS_2023_10          current            GRCh38.p14 (GCF_000001405.40)    20   NC_000020.11 (3783851..3786740, complement)
RS_2023_10          current            T2T-CHM13v2.0 (GCF_009914755.1)  20   NC_060944.1 (3814791..3817680, complement)
105.20220307        previous assembly  GRCh37.p13 (GCF_000001405.25)    20   NC_000020.10 (3764498..3767387, complement)
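
The three locations above can be compared programmatically. The following is a minimal Python sketch, assuming the ranges are 1-based and inclusive (as in GenBank-style locations) and follow the `accession (start..end, complement)` layout shown in the table. The names `Location`, `parse_location`, and `span_length` are illustrative and not part of any NCBI tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    accession: str
    start: int
    end: int
    strand: str  # "-" for complement, "+" otherwise


# Hypothetical pattern for strings such as
# "NC_000020.11 (3783851..3786740, complement)".
_LOCATION_RE = re.compile(r"(\S+)\s*\((\d+)\.\.(\d+)(?:,\s*(complement))?\)")


def parse_location(text: str) -> Location:
    """Split a location string into accession, start, end, and strand."""
    match = _LOCATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    accession, start, end, complement = match.groups()
    return Location(accession, int(start), int(end), "-" if complement else "+")


def span_length(loc: Location) -> int:
    """Length in bp of a 1-based, inclusive range."""
    return loc.end - loc.start + 1


# Location strings copied from the table above.
CENPB_LOCATIONS = {
    "GRCh38.p14": "NC_000020.11 (3783851..3786740, complement)",
    "T2T-CHM13v2.0": "NC_060944.1 (3814791..3817680, complement)",
    "GRCh37.p13": "NC_000020.10 (3764498..3767387, complement)",
}

for assembly, text in CENPB_LOCATIONS.items():
    loc = parse_location(text)
    print(assembly, loc.accession, loc.strand, span_length(loc))
```

Under these assumptions, the gene spans 2890 bp on the minus strand in all three assemblies; only the coordinate offsets differ.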

Chromosome 20 - NC_000020.11

Neighboring genes:
    adipose secreted signaling protein
    H3K4me1 hESC enhancer GRCh37_chr20:3747545-3748309
    H3K27ac-H3K4me1 hESC enhancer GRCh37_chr20:3748437-3749023
    ATAC-STARR-seq lymphoblastoid active region 17482
    ATAC-STARR-seq lymphoblastoid silent region 12627
    H3K4me1 hESC enhancer GRCh37_chr20:3759113-3759730
    Sharpr-MPRA regulatory region 2213
    sperm flagellar 1
    H3K27ac-H3K4me1 hESC enhancer GRCh37_chr20:3766083-3766793
    H3K27ac-H3K4me1 hESC enhancer GRCh37_chr20:3766794-3767503
    cell division cycle 25B
    H3K27ac-H3K4me1 hESC enhancer GRCh37_chr20:3775953-3776684
    ATAC-STARR-seq lymphoblastoid silent region 12629
    ATAC-STARR-seq lymphoblastoid silent region 12630
    ReSE screen-validated silencer GRCh37_chr20:3777341-3777548
    Sharpr-MPRA regulatory region 3723
    Sharpr-MPRA regulatory region 8683
    P300/CBP strongly-dependent group 1 enhancer GRCh37_chr20:3782484-3783683
    NANOG-H3K27ac-H3K4me1 hESC enhancer GRCh37_chr20:3787799-3788390
    NANOG-H3K27ac-H3K4me1 hESC enhancer GRCh37_chr20:3788391-3788981
    long intergenic non-protein coding RNA 1730
    P300/CBP strongly-dependent group 1 enhancer GRCh37_chr20:3791796-3792995

Genomic regions, transcripts, and products

Bibliography

GeneRIFs: Gene References Into Functions

Pathways from PubChem

Interactions

General gene information

Markers

Gene Ontology (provided by GOA)

Function                                             Evidence code                                      Pubs
enables DNA binding                                  IBA (Inferred from Biological aspect of Ancestor)
enables centromeric DNA binding                      IC (Inferred by Curator)                           PubMed
enables chromatin binding                            NAS (Non-traceable Author Statement)               PubMed
enables satellite DNA binding                        NAS (Non-traceable Author Statement)               PubMed
enables sequence-specific DNA binding                TAS (Traceable Author Statement)                   PubMed

Component                                            Evidence code                                      Pubs
located_in chromosome                                IDA (Inferred from Direct Assay)                   PubMed
located_in chromosome, centromeric region            IDA (Inferred from Direct Assay)                   PubMed
located_in condensed chromosome, centromeric region  IEA (Inferred from Electronic Annotation)
located_in nuclear body                              IDA (Inferred from Direct Assay)
located_in nucleoplasm                               IDA (Inferred from Direct Assay)
is_active_in nucleus                                 IBA (Inferred from Biological aspect of Ancestor)
part_of pericentric heterochromatin                  IEA (Inferred from Electronic Annotation)

General protein information

Preferred Names
major centromere autoantigen B
Names
CENP-B
centromere autoantigen B
centromere protein B, 80kDa

NCBI Reference Sequences (RefSeq)

RefSeqs maintained independently of Annotated Genomes

These reference sequences exist independently of genome builds.

These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

mRNA and Protein(s)

  1. NM_001810.6 → NP_001801.1  major centromere autoantigen B

    Status: REVIEWED

    Source sequence(s)
    AL109804, BC003649, BC053847
    Consensus CDS
    CCDS13064.1
    UniProtKB/Swiss-Prot
    P07199, Q96EI4
    Related
    ENSP00000369075.4, ENST00000379751.5
    Conserved Domains (4)
    smart00674
    Location: 74..135
    CENPB; Putative DNA-binding domain in centromere protein B, mouse jerky and transposases
    pfam03184
    Location: 222..384
    DDE_1; DDE superfamily endonuclease
    pfam04218
    Location: 2..56
    CENP-B_N; CENP-B N-terminal DNA-binding domain
    pfam09026
    Location: 539..598
    CENP-B_dimeris; Centromere protein B dimerization domain
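
    The domain layout can be checked against the summary's description of an N-terminal DNA-binding region and a C-terminal dimerization domain. The following is a minimal Python sketch, assuming each location is a 1-based, inclusive start..end residue range on NP_001801.1 (74..135, 222..384, 2..56, 539..598). The `Domain` class and `overlaps` helper are illustrative only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Domain:
    accession: str
    name: str
    start: int  # first residue, 1-based (assumed)
    end: int    # last residue, inclusive (assumed)

    @property
    def length(self) -> int:
        return self.end - self.start + 1


# Conserved domains as listed above, in the order given in the record.
DOMAINS = [
    Domain("smart00674", "CENPB", 74, 135),
    Domain("pfam03184", "DDE_1", 222, 384),
    Domain("pfam04218", "CENP-B_N", 2, 56),
    Domain("pfam09026", "CENP-B_dimeris", 539, 598),
]


def overlaps(a: Domain, b: Domain) -> bool:
    """True if two inclusive residue ranges share at least one position."""
    return a.start <= b.end and b.start <= a.end


# Order domains from N-terminus to C-terminus.
ordered = sorted(DOMAINS, key=lambda d: d.start)

for domain in ordered:
    print(f"{domain.start:>4}..{domain.end:<4} {domain.length:>4} aa  "
          f"{domain.name} ({domain.accession})")

# Report any adjacent pair whose ranges overlap.
for left, right in zip(ordered, ordered[1:]):
    if overlaps(left, right):
        print(f"overlap: {left.name} / {right.name}")
```

    Sorted this way, the two DNA-binding domains (CENP-B_N, CENPB) come first, followed by DDE_1, with the dimerization domain last; none of the ranges overlap.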

RefSeqs of Annotated Genomes: GCF_000001405.40-RS_2023_10

The following sections contain reference sequences that belong to a specific genome build.

Reference GRCh38.p14 Primary Assembly

Genomic

  1. NC_000020.11 Reference GRCh38.p14 Primary Assembly

    Range
    3783851..3786740 complement

Alternate T2T-CHM13v2.0

Genomic

  1. NC_060944.1 Alternate T2T-CHM13v2.0

    Range
    3814791..3817680 complement