Fusion Gene Studies
in Kim Lab

Center for Computational Systems Medicine

Fusion gene: LARGE-CBX7 (FusionGDB2 ID: HG9215TG23492)

Fusion Gene Summary for LARGE-CBX7

Fusion gene summary
Fusion gene information
Fusion gene name: LARGE-CBX7
Fusion gene ID: hg9215tg23492

Gene symbol
  Hgene: LARGE
  Tgene: CBX7
Gene ID
  Hgene: 9215
  Tgene: 23492
Gene name
  Hgene: LARGE xylosyl- and glucuronyltransferase 1
  Tgene: chromobox 7
Synonyms
  Hgene: LARGE|MDC1D|MDDGA6|MDDGB6
  Tgene: -
Cytomap
  Hgene: ('LARGE1','LARGE') 22q12.3
  Tgene: ('CBX7','CBX7') 22q13.1
Type of gene
  Hgene: protein-coding
  Tgene: protein-coding
Description
  Hgene: LARGE xylosyl- and glucuronyltransferase 1; acetylglucosaminyltransferase-like 1A; acetylglucosaminyltransferase-like protein; glycosyltransferase-like protein LARGE1
  Tgene: chromobox protein homolog 7
Modification date
  Hgene: 20200328
  Tgene: 20200313
UniProtAcc
  Hgene: .
  Tgene: O95931
Ensembl transcripts involved in fusion gene
  ENST00000437602, ENST00000337431, ENST00000354992, ENST00000397394, ENST00000402320, ENST00000421232, ENST00000452586
Fusion gene scores
* DoF score
  Hgene: 10 X 9 X 6 = 540
  Tgene: 2 X 2 X 2 = 8
# samples
  Hgene: 13
  Tgene: 3
** MAII score
  Hgene: log2(13/540*10) = -2.05444778402238 (possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs); DoF>8 and MAII<0)
  Tgene: log2(3/8*10) = 1.90689059560852
Context

PubMed: LARGE [Title/Abstract] AND CBX7 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpoint
LARGE(34157357)-CBX7(39545837), # samples: 1
LARGE(34157358)-CBX7(39545837), # samples: 1
LARGE1(34157358)-CBX7(39545837), # samples: 1
Anticipated loss of major functional domain due to fusion event
LARGE-CBX7 seems to have lost the major protein functional domain in the Hgene partner, which is a CGC gene, by not retaining the major functional domain in the partially deleted in-frame ORF.
LARGE-CBX7 seems to have lost the major protein functional domain in the Hgene partner, which is an essential gene, by not retaining the major functional domain in the partially deleted in-frame ORF.
LARGE-CBX7 seems to have lost the major protein functional domain in the Hgene partner, which is an essential gene, due to the frame-shifted ORF.
LARGE-CBX7 seems to have lost the major protein functional domain in the Tgene partner, which is an epigenetic factor, due to the frame-shifted ORF.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)
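
The two score definitions above can be checked against the values reported for this fusion pair. The following is a minimal Python sketch, not FusionGDB2's own code: the function names (`dof_score`, `maii_score`, `is_peginpcfg`) are hypothetical, and the peGinPCFG test only encodes the "DoF>8 and MAII<0" rule quoted in the summary.

```python
import math


def dof_score(n_partners: int, n_breakpoints: int, n_cancer_types: int) -> int:
    """Degree of Frequency = # partners X # break points X # cancer types."""
    return n_partners * n_breakpoints * n_cancer_types


def maii_score(n_samples: int, dof: int) -> float:
    """Major Active Isofusion Index = log2(# samples / DoF score * 10)."""
    return math.log2(n_samples / dof * 10)


def is_peginpcfg(dof: int, maii: float) -> bool:
    """Rule quoted on this page: DoF > 8 and MAII < 0."""
    return dof > 8 and maii < 0


# Values taken from the summary table of this entry.
hgene_dof = dof_score(10, 9, 6)        # LARGE
tgene_dof = dof_score(2, 2, 2)         # CBX7
hgene_maii = maii_score(13, hgene_dof)
tgene_maii = maii_score(3, tgene_dof)

print(hgene_dof, hgene_maii, is_peginpcfg(hgene_dof, hgene_maii))
print(tgene_dof, tgene_maii, is_peginpcfg(tgene_dof, tgene_maii))
```

With the numbers from this entry, only the Hgene partner (LARGE) satisfies the rule, which matches the peGinPCFGs note shown next to its MAII score.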

Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
Partner | Gene | GO ID | GO term | PubMed ID
Hgene | LARGE | GO:0035269 | protein O-linked mannosylation | 22223806, 25138275, 25279697, 25279699


Fusion gene breakpoints across LARGE (5'-gene)
[Image: all structure]
Fusion gene breakpoints across CBX7 (3'-gene)
[Image: all structure]

Fusion gene information
* All genome coordinates were lifted over to hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
Source | Disease | Sample | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
ChimerDB4 | OV | TCGA-61-2101-01A | LARGE1 | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | -
ChimerDB4 | OV | TCGA-61-2101-01A | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | -
ChimerDB4 | OV | TCGA-61-2101 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | -



Fusion Gene ORF analysis for LARGE-CBX7

Open reading frame (ORF) analysis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORF | Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
Frame-shift | ENST00000437602 | ENST00000216133 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | -
Frame-shift | ENST00000437602 | ENST00000216133 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | -
Frame-shift | ENST00000437602 | ENST00000401405 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | -
Frame-shift | ENST00000437602 | ENST00000401405 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | -
Frame-shift | ENST00000437602 | ENST00000475962 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | -
Frame-shift | ENST00000437602 | ENST00000475962 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | -
In-frame | ENST00000337431 | ENST00000216133 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | -
In-frame | ENST00000337431 | ENST00000216133 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | -
In-frame | ENST00000337431 | ENST00000401405 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | -
In-frame | ENST00000337431 | ENST00000401405 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | -
In-frame | ENST00000337431 | ENST00000475962 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | -
In-frame | ENST00000337431 | ENST00000475962 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | -
In-frame | ENST00000354992 | ENST00000216133 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | -
In-frame | ENST00000354992 | ENST00000216133 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | -
In-frame | ENST00000354992 | ENST00000401405 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | -
In-frame | ENST00000354992 | ENST00000401405 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | -
In-frame | ENST00000354992 | ENST00000475962 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | -
In-frame | ENST00000354992 | ENST00000475962 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | -
In-frame | ENST00000397394 | ENST00000216133 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | -
In-frame | ENST00000397394 | ENST00000216133 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | -
In-frame | ENST00000397394 | ENST00000401405 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | -
In-frame | ENST00000397394 | ENST00000401405 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | -
In-frame | ENST00000397394 | ENST00000475962 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | -
In-frame | ENST00000397394 | ENST00000475962 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | -
In-frame | ENST00000402320 | ENST00000216133 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | -
In-frame | ENST00000402320 | ENST00000216133 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | -
In-frame | ENST00000402320 | ENST00000401405 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | -
In-frame | ENST00000402320 | ENST00000401405 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | -
In-frame | ENST00000402320 | ENST00000475962 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | -
In-frame | ENST00000402320 | ENST00000475962 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | -
intron-3CDS | ENST00000421232 | ENST00000216133 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | -
intron-3CDS | ENST00000421232 | ENST00000216133 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | -
intron-3CDS | ENST00000421232 | ENST00000401405 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | -
intron-3CDS | ENST00000421232 | ENST00000401405 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | -
intron-3CDS | ENST00000421232 | ENST00000475962 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | -
intron-3CDS | ENST00000421232 | ENST00000475962 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | -
intron-3CDS | ENST00000452586 | ENST00000216133 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | -
intron-3CDS | ENST00000452586 | ENST00000216133 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | -
intron-3CDS | ENST00000452586 | ENST00000401405 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | -
intron-3CDS | ENST00000452586 | ENST00000401405 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | -
intron-3CDS | ENST00000452586 | ENST00000475962 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | -
intron-3CDS | ENST00000452586 | ENST00000475962 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | -

ORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
Henst | Hgene | Hchr | Hbp | Hstrand | Tenst | Tgene | Tchr | Tbp | Tstrand | Seq length (transcript) | BP loci (transcript) | Predicted start (transcript) | Predicted stop (transcript) | Seq length (amino acids)
ENST00000337431 | LARGE | chr22 | 34157357 | - | ENST00000475962 | CBX7 | chr22 | 39545837 | - | 1046 | 595 | 342 | 1 | 114
ENST00000337431 | LARGE | chr22 | 34157357 | - | ENST00000216133 | CBX7 | chr22 | 39545837 | - | 4401 | 595 | 580 | 1281 | 233
ENST00000337431 | LARGE | chr22 | 34157357 | - | ENST00000401405 | CBX7 | chr22 | 39545837 | - | 1310 | 595 | 580 | 1002 | 140
ENST00000354992 | LARGE | chr22 | 34157357 | - | ENST00000475962 | CBX7 | chr22 | 39545837 | - | 1129 | 678 | 425 | 0 | 142
ENST00000354992 | LARGE | chr22 | 34157357 | - | ENST00000216133 | CBX7 | chr22 | 39545837 | - | 4484 | 678 | 663 | 1364 | 233
ENST00000354992 | LARGE | chr22 | 34157357 | - | ENST00000401405 | CBX7 | chr22 | 39545837 | - | 1393 | 678 | 425 | 0 | 142
ENST00000397394 | LARGE | chr22 | 34157357 | - | ENST00000475962 | CBX7 | chr22 | 39545837 | - | 1073 | 622 | 582 | 1 | 194
ENST00000397394 | LARGE | chr22 | 34157357 | - | ENST00000216133 | CBX7 | chr22 | 39545837 | - | 4428 | 622 | 607 | 1308 | 233
ENST00000397394 | LARGE | chr22 | 34157357 | - | ENST00000401405 | CBX7 | chr22 | 39545837 | - | 1337 | 622 | 582 | 1 | 194
ENST00000402320 | LARGE | chr22 | 34157357 | - | ENST00000475962 | CBX7 | chr22 | 39545837 | - | 1059 | 608 | 568 | 2 | 189
ENST00000402320 | LARGE | chr22 | 34157357 | - | ENST00000216133 | CBX7 | chr22 | 39545837 | - | 4414 | 608 | 593 | 1294 | 233
ENST00000402320 | LARGE | chr22 | 34157357 | - | ENST00000401405 | CBX7 | chr22 | 39545837 | - | 1323 | 608 | 568 | 2 | 189
ENST00000337431 | LARGE | chr22 | 34157358 | - | ENST00000475962 | CBX7 | chr22 | 39545837 | - | 1046 | 595 | 342 | 1 | 114
ENST00000337431 | LARGE | chr22 | 34157358 | - | ENST00000216133 | CBX7 | chr22 | 39545837 | - | 4401 | 595 | 580 | 1281 | 233
ENST00000337431 | LARGE | chr22 | 34157358 | - | ENST00000401405 | CBX7 | chr22 | 39545837 | - | 1310 | 595 | 580 | 1002 | 140
ENST00000354992 | LARGE | chr22 | 34157358 | - | ENST00000475962 | CBX7 | chr22 | 39545837 | - | 1129 | 678 | 425 | 0 | 142
ENST00000354992 | LARGE | chr22 | 34157358 | - | ENST00000216133 | CBX7 | chr22 | 39545837 | - | 4484 | 678 | 663 | 1364 | 233
ENST00000354992 | LARGE | chr22 | 34157358 | - | ENST00000401405 | CBX7 | chr22 | 39545837 | - | 1393 | 678 | 425 | 0 | 142
ENST00000397394 | LARGE | chr22 | 34157358 | - | ENST00000475962 | CBX7 | chr22 | 39545837 | - | 1073 | 622 | 582 | 1 | 194
ENST00000397394 | LARGE | chr22 | 34157358 | - | ENST00000216133 | CBX7 | chr22 | 39545837 | - | 4428 | 622 | 607 | 1308 | 233
ENST00000397394 | LARGE | chr22 | 34157358 | - | ENST00000401405 | CBX7 | chr22 | 39545837 | - | 1337 | 622 | 582 | 1 | 194
ENST00000402320 | LARGE | chr22 | 34157358 | - | ENST00000475962 | CBX7 | chr22 | 39545837 | - | 1059 | 608 | 568 | 2 | 189
ENST00000402320 | LARGE | chr22 | 34157358 | - | ENST00000216133 | CBX7 | chr22 | 39545837 | - | 4414 | 608 | 593 | 1294 | 233
ENST00000402320 | LARGE | chr22 | 34157358 | - | ENST00000401405 | CBX7 | chr22 | 39545837 | - | 1323 | 608 | 568 | 2 | 189

DeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on a convolutional neural network and built by comparison with real Ribo-seq data. If the no-coding score is < 0.5 and the coding score is > 0.5, the in-frame fusion transcript is predicted as likely to be translated.
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | No-coding score | Coding score
ENST00000337431 | ENST00000475962 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | - | 0.26870126 | 0.73129874
ENST00000337431 | ENST00000216133 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | - | 0.04498494 | 0.95501506
ENST00000337431 | ENST00000401405 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | - | 0.07642878 | 0.9235712
ENST00000354992 | ENST00000475962 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | - | 0.17523985 | 0.82476014
ENST00000354992 | ENST00000216133 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | - | 0.0468283 | 0.95317173
ENST00000354992 | ENST00000401405 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | - | 0.101339705 | 0.89866024
ENST00000397394 | ENST00000475962 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | - | 0.20099293 | 0.7990071
ENST00000397394 | ENST00000216133 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | - | 0.046700604 | 0.95329946
ENST00000397394 | ENST00000401405 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | - | 0.057212967 | 0.94278705
ENST00000402320 | ENST00000475962 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | - | 0.26041394 | 0.7395861
ENST00000402320 | ENST00000216133 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | - | 0.047238443 | 0.95276153
ENST00000402320 | ENST00000401405 | LARGE | chr22 | 34157357 | - | CBX7 | chr22 | 39545837 | - | 0.07279104 | 0.92720896
ENST00000337431 | ENST00000475962 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | - | 0.26870126 | 0.73129874
ENST00000337431 | ENST00000216133 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | - | 0.04498494 | 0.95501506
ENST00000337431 | ENST00000401405 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | - | 0.07642878 | 0.9235712
ENST00000354992 | ENST00000475962 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | - | 0.17523985 | 0.82476014
ENST00000354992 | ENST00000216133 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | - | 0.0468283 | 0.95317173
ENST00000354992 | ENST00000401405 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | - | 0.101339705 | 0.89866024
ENST00000397394 | ENST00000475962 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | - | 0.20099293 | 0.7990071
ENST00000397394 | ENST00000216133 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | - | 0.046700604 | 0.95329946
ENST00000397394 | ENST00000401405 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | - | 0.057212967 | 0.94278705
ENST00000402320 | ENST00000475962 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | - | 0.26041394 | 0.7395861
ENST00000402320 | ENST00000216133 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | - | 0.047238443 | 0.95276153
ENST00000402320 | ENST00000401405 | LARGE | chr22 | 34157358 | - | CBX7 | chr22 | 39545837 | - | 0.07279104 | 0.92720896
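
The DeepORF decision rule stated above (no-coding score < 0.5 and coding score > 0.5) can be applied to the rows of this table. The following is a minimal Python sketch; the function name `likely_translated` and the `threshold` parameter are hypothetical, and only the 0.5 cutoff comes from the page.

```python
def likely_translated(no_coding_score: float, coding_score: float,
                      threshold: float = 0.5) -> bool:
    """Apply the page's rule: no-coding score < 0.5 and coding score > 0.5."""
    return no_coding_score < threshold and coding_score > threshold


# (Henst, Tenst, no-coding score, coding score) copied from three table rows.
rows = [
    ("ENST00000337431", "ENST00000475962", 0.26870126, 0.73129874),
    ("ENST00000337431", "ENST00000216133", 0.04498494, 0.95501506),
    ("ENST00000354992", "ENST00000401405", 0.101339705, 0.89866024),
]

calls = {(h, t): likely_translated(nc, c) for h, t, nc, c in rows}
for (h, t), call in calls.items():
    print(h, t, "likely translated" if call else "not predicted as translated")
```

Every in-frame transcript listed for LARGE-CBX7 has a coding score above 0.7, so all of them pass this rule.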


Fusion Genomic Features for LARGE-CBX7


FusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5 kb for each partner gene, 20 kb of sequence in total). FusionAI is a fusion gene breakpoint classifier based on a convolutional neural network that compares the fusion-positive and fusion-negative sequence context of ~20K fusion gene data. It gives the relative potential of each position in the 20 kb genomic sequence to be used as a gene fusion breakpoint.
Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | 1-p | p (fusion gene breakpoint)

Distribution of 44 human genomic feature loci across the 20 kb fusion breakpoint regions. We integrated 44 types of human genomic feature loci in five broad categories: virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are on the help page.
[Image: genomic feature]


Fusion Protein Features for LARGE-CBX7


Four levels of functional features of fusion genes
Go to the FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr22:34157357/chr22:39545837)
- FGviewer provides online visualization of the retention of protein functional features across the DNA, RNA, protein, and pathological levels.
- How to search
1. Enter your fusion gene symbol.
2. Press the Tab key until the breakpoint information is filled in.
3. Scroll down and press the 'Search' tab twice.
4. Scroll down to the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.

Main function of each fusion partner protein (from UniProt)
Hgene | Tgene
. | CBX7 O95931

FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.

FUNCTION: Component of a Polycomb group (PcG) multiprotein PRC1-like complex, a complex class required to maintain the transcriptionally repressive state of many genes, including Hox genes, throughout development. PcG PRC1 complex acts via chromatin remodeling and modification of histones; it mediates monoubiquitination of histone H2A 'Lys-119', rendering chromatin heritably changed in its expressibility. Promotes histone H3 trimethylation at 'Lys-9' (H3K9me3). Binds to trimethylated lysine residues in histones, and possibly also other proteins. Regulator of cellular lifespan by maintaining the repression of CDKN2A, but not by inducing telomerase activity. {ECO:0000269|PubMed:19636380, ECO:0000269|PubMed:21047797, ECO:0000269|PubMed:21060834, ECO:0000269|PubMed:21282530}.

Retention analysis result of each fusion partner protein across 39 protein features of UniProt: six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and three secondary structure features. Because of limited viewing space, only the retention information for the 13 regional features is shown here. All retention annotation results can be downloaded from the download page.

* A negative BPloci value means that the break point is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Hgene | LARGE | chr22:34157357 | chr22:39545837 | ENST00000337431 | - | 3 | 15 | 1_10 | 35 | 705.0 | Topological domain | Cytoplasmic
Hgene | LARGE | chr22:34157357 | chr22:39545837 | ENST00000354992 | - | 3 | 16 | 1_10 | 35 | 757.0 | Topological domain | Cytoplasmic
Hgene | LARGE | chr22:34157357 | chr22:39545837 | ENST00000397394 | - | 2 | 15 | 1_10 | 35 | 757.0 | Topological domain | Cytoplasmic
Hgene | LARGE | chr22:34157357 | chr22:39545837 | ENST00000402320 | - | 2 | 14 | 1_10 | 35 | 705.0 | Topological domain | Cytoplasmic
Hgene | LARGE | chr22:34157358 | chr22:39545837 | ENST00000337431 | - | 3 | 15 | 1_10 | 35 | 705.0 | Topological domain | Cytoplasmic
Hgene | LARGE | chr22:34157358 | chr22:39545837 | ENST00000354992 | - | 3 | 16 | 1_10 | 35 | 757.0 | Topological domain | Cytoplasmic
Hgene | LARGE | chr22:34157358 | chr22:39545837 | ENST00000397394 | - | 2 | 15 | 1_10 | 35 | 757.0 | Topological domain | Cytoplasmic
Hgene | LARGE | chr22:34157358 | chr22:39545837 | ENST00000402320 | - | 2 | 14 | 1_10 | 35 | 705.0 | Topological domain | Cytoplasmic
Hgene | LARGE | chr22:34157357 | chr22:39545837 | ENST00000337431 | - | 3 | 15 | 11_31 | 35 | 705.0 | Transmembrane | Helical; Signal-anchor for type II membrane protein
Hgene | LARGE | chr22:34157357 | chr22:39545837 | ENST00000354992 | - | 3 | 16 | 11_31 | 35 | 757.0 | Transmembrane | Helical; Signal-anchor for type II membrane protein
Hgene | LARGE | chr22:34157357 | chr22:39545837 | ENST00000397394 | - | 2 | 15 | 11_31 | 35 | 757.0 | Transmembrane | Helical; Signal-anchor for type II membrane protein
Hgene | LARGE | chr22:34157357 | chr22:39545837 | ENST00000402320 | - | 2 | 14 | 11_31 | 35 | 705.0 | Transmembrane | Helical; Signal-anchor for type II membrane protein
Hgene | LARGE | chr22:34157358 | chr22:39545837 | ENST00000337431 | - | 3 | 15 | 11_31 | 35 | 705.0 | Transmembrane | Helical; Signal-anchor for type II membrane protein
Hgene | LARGE | chr22:34157358 | chr22:39545837 | ENST00000354992 | - | 3 | 16 | 11_31 | 35 | 757.0 | Transmembrane | Helical; Signal-anchor for type II membrane protein
Hgene | LARGE | chr22:34157358 | chr22:39545837 | ENST00000397394 | - | 2 | 15 | 11_31 | 35 | 757.0 | Transmembrane | Helical; Signal-anchor for type II membrane protein
Hgene | LARGE | chr22:34157358 | chr22:39545837 | ENST00000402320 | - | 2 | 14 | 11_31 | 35 | 705.0 | Transmembrane | Helical; Signal-anchor for type II membrane protein
Tgene | CBX7 | chr22:34157357 | chr22:39545837 | ENST00000216133 |  | 0 | 6 | 223_236 | 23 | 252.0 | Region | Note=Required for cellular lifespan extension
Tgene | CBX7 | chr22:34157358 | chr22:39545837 | ENST00000216133 |  | 0 | 6 | 223_236 | 23 | 252.0 | Region | Note=Required for cellular lifespan extension

- In-frame and not-retained protein feature among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Hgene | LARGE | chr22:34157357 | chr22:39545837 | ENST00000337431 | - | 3 | 15 | 53_95 | 35 | 705.0 | Coiled coil | Ontology_term=ECO:0000255
Hgene | LARGE | chr22:34157357 | chr22:39545837 | ENST00000354992 | - | 3 | 16 | 53_95 | 35 | 757.0 | Coiled coil | Ontology_term=ECO:0000255
Hgene | LARGE | chr22:34157357 | chr22:39545837 | ENST00000397394 | - | 2 | 15 | 53_95 | 35 | 757.0 | Coiled coil | Ontology_term=ECO:0000255
Hgene | LARGE | chr22:34157357 | chr22:39545837 | ENST00000402320 | - | 2 | 14 | 53_95 | 35 | 705.0 | Coiled coil | Ontology_term=ECO:0000255
Hgene | LARGE | chr22:34157358 | chr22:39545837 | ENST00000337431 | - | 3 | 15 | 53_95 | 35 | 705.0 | Coiled coil | Ontology_term=ECO:0000255
Hgene | LARGE | chr22:34157358 | chr22:39545837 | ENST00000354992 | - | 3 | 16 | 53_95 | 35 | 757.0 | Coiled coil | Ontology_term=ECO:0000255
Hgene | LARGE | chr22:34157358 | chr22:39545837 | ENST00000397394 | - | 2 | 15 | 53_95 | 35 | 757.0 | Coiled coil | Ontology_term=ECO:0000255
Hgene | LARGE | chr22:34157358 | chr22:39545837 | ENST00000402320 | - | 2 | 14 | 53_95 | 35 | 705.0 | Coiled coil | Ontology_term=ECO:0000255
Hgene | LARGE | chr22:34157357 | chr22:39545837 | ENST00000337431 | - | 3 | 15 | 138_413 | 35 | 705.0 | Region | Xylosyltransferase activity
Hgene | LARGE | chr22:34157357 | chr22:39545837 | ENST00000337431 | - | 3 | 15 | 414_756 | 35 | 705.0 | Region | Glucuronyltransferase activity
Hgene | LARGE | chr22:34157357 | chr22:39545837 | ENST00000354992 | - | 3 | 16 | 138_413 | 35 | 757.0 | Region | Xylosyltransferase activity
Hgene | LARGE | chr22:34157357 | chr22:39545837 | ENST00000354992 | - | 3 | 16 | 414_756 | 35 | 757.0 | Region | Glucuronyltransferase activity
Hgene | LARGE | chr22:34157357 | chr22:39545837 | ENST00000397394 | - | 2 | 15 | 138_413 | 35 | 757.0 | Region | Xylosyltransferase activity
Hgene | LARGE | chr22:34157357 | chr22:39545837 | ENST00000397394 | - | 2 | 15 | 414_756 | 35 | 757.0 | Region | Glucuronyltransferase activity
Hgene | LARGE | chr22:34157357 | chr22:39545837 | ENST00000402320 | - | 2 | 14 | 138_413 | 35 | 705.0 | Region | Xylosyltransferase activity
Hgene | LARGE | chr22:34157357 | chr22:39545837 | ENST00000402320 | - | 2 | 14 | 414_756 | 35 | 705.0 | Region | Glucuronyltransferase activity
Hgene | LARGE | chr22:34157358 | chr22:39545837 | ENST00000337431 | - | 3 | 15 | 138_413 | 35 | 705.0 | Region | Xylosyltransferase activity
Hgene | LARGE | chr22:34157358 | chr22:39545837 | ENST00000337431 | - | 3 | 15 | 414_756 | 35 | 705.0 | Region | Glucuronyltransferase activity
Hgene | LARGE | chr22:34157358 | chr22:39545837 | ENST00000354992 | - | 3 | 16 | 138_413 | 35 | 757.0 | Region | Xylosyltransferase activity
Hgene | LARGE | chr22:34157358 | chr22:39545837 | ENST00000354992 | - | 3 | 16 | 414_756 | 35 | 757.0 | Region | Glucuronyltransferase activity
Hgene | LARGE | chr22:34157358 | chr22:39545837 | ENST00000397394 | - | 2 | 15 | 138_413 | 35 | 757.0 | Region | Xylosyltransferase activity
Hgene | LARGE | chr22:34157358 | chr22:39545837 | ENST00000397394 | - | 2 | 15 | 414_756 | 35 | 757.0 | Region | Glucuronyltransferase activity
Hgene | LARGE | chr22:34157358 | chr22:39545837 | ENST00000402320 | - | 2 | 14 | 138_413 | 35 | 705.0 | Region | Xylosyltransferase activity
Hgene | LARGE | chr22:34157358 | chr22:39545837 | ENST00000402320 | - | 2 | 14 | 414_756 | 35 | 705.0 | Region | Glucuronyltransferase activity
Hgene | LARGE | chr22:34157357 | chr22:39545837 | ENST00000337431 | - | 3 | 15 | 32_756 | 35 | 705.0 | Topological domain | Lumenal
Hgene | LARGE | chr22:34157357 | chr22:39545837 | ENST00000354992 | - | 3 | 16 | 32_756 | 35 | 757.0 | Topological domain | Lumenal
Hgene | LARGE | chr22:34157357 | chr22:39545837 | ENST00000397394 | - | 2 | 15 | 32_756 | 35 | 757.0 | Topological domain | Lumenal
Hgene | LARGE | chr22:34157357 | chr22:39545837 | ENST00000402320 | - | 2 | 14 | 32_756 | 35 | 705.0 | Topological domain | Lumenal
Hgene | LARGE | chr22:34157358 | chr22:39545837 | ENST00000337431 | - | 3 | 15 | 32_756 | 35 | 705.0 | Topological domain | Lumenal
Hgene | LARGE | chr22:34157358 | chr22:39545837 | ENST00000354992 | - | 3 | 16 | 32_756 | 35 | 757.0 | Topological domain | Lumenal
Hgene | LARGE | chr22:34157358 | chr22:39545837 | ENST00000397394 | - | 2 | 15 | 32_756 | 35 | 757.0 | Topological domain | Lumenal
Hgene | LARGE | chr22:34157358 | chr22:39545837 | ENST00000402320 | - | 2 | 14 | 32_756 | 35 | 705.0 | Topological domain | Lumenal
Tgene | CBX7 | chr22:34157357 | chr22:39545837 | ENST00000216133 |  | 0 | 6 | 11_69 | 23 | 252.0 | Domain | Chromo
Tgene | CBX7 | chr22:34157358 | chr22:39545837 | ENST00000216133 |  | 0 | 6 | 11_69 | 23 | 252.0 | Domain | Chromo
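
The retained and not-retained calls in the two tables above are consistent with a simple positional rule: a 5' partner (Hgene) feature is kept when it ends at or before the breakpoint locus, and a 3' partner (Tgene) feature is kept when it starts at or after it. The Python sketch below encodes that rule. It is an assumption inferred from the rows on this page, not FusionGDB2's documented method, and the names `parse_loci` and `is_retained` are hypothetical; in particular, the handling of a feature that touches the breakpoint exactly is a guess.

```python
def parse_loci(loci: str) -> tuple[int, int]:
    """Split a 'start_end' protein feature locus such as '11_31'."""
    start, end = loci.split("_")
    return int(start), int(end)


def is_retained(partner: str, feature_loci: str, bp_loci: int) -> bool:
    """Assumed rule: a feature is retained only if it lies entirely on the kept side.

    Hgene (5' partner) keeps residues before the breakpoint;
    Tgene (3' partner) keeps residues after it.
    """
    start, end = parse_loci(feature_loci)
    if partner == "Hgene":
        return end <= bp_loci
    if partner == "Tgene":
        return start >= bp_loci
    raise ValueError(f"unknown partner: {partner!r}")


# (partner, feature loci, BPloci, feature) taken from the two tables above.
examples = [
    ("Hgene", "1_10", 35, "Topological domain (Cytoplasmic)"),
    ("Hgene", "11_31", 35, "Transmembrane"),
    ("Hgene", "53_95", 35, "Coiled coil"),
    ("Hgene", "32_756", 35, "Topological domain (Lumenal)"),
    ("Tgene", "223_236", 23, "Region (cellular lifespan extension)"),
    ("Tgene", "11_69", 23, "Domain (Chromo)"),
]

for partner, loci, bp, name in examples:
    print(partner, loci, name, "retained" if is_retained(partner, loci, bp) else "not retained")
```

Under this rule, a feature that spans the breakpoint, such as the LARGE lumenal domain (32_756) or the CBX7 chromo domain (11_69), is counted as not retained, which agrees with the tables.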



Fusion Gene Sequence for LARGE-CBX7


For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To obtain the fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among all predicted ones.
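
The "longest ORF" selection described above can be illustrated with a simplified stand-in for ORFfinder. The Python sketch below is not ORFfinder itself and will not necessarily reproduce its output: it assumes ATG-only start codons, requires an in-frame stop codon, and scans all three frames of both strands. The function names and the toy sequence are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def orfs_in_strand(seq: str):
    """Yield (start, end) half-open spans of ATG...stop ORFs in the three frames of one strand."""
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i                      # first ATG opens the ORF
            elif start is not None and codon in STOP_CODONS:
                yield start, i + 3             # include the stop codon
                start = None


def longest_orf(seq: str) -> str:
    """Return the longest ATG...stop ORF found on either strand ('' if none)."""
    seq = seq.upper()
    best = ""
    for strand in (seq, reverse_complement(seq)):
        for start, end in orfs_in_strand(strand):
            if end - start > len(best):
                best = strand[start:end]
    return best


# Toy sequence with a 9-nt ORF and a 15-nt ORF in different frames.
toy = "CCATGAAATAGCCATGAAACCCGGGTAACC"
print(longest_orf(toy))
```

Scanning both strands matters for entries like this one: some predicted ORFs in the ORFfinder table have a start coordinate larger than the stop coordinate, which is what a reverse-strand ORF would look like.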
>44102_44102_1_LARGE-CBX7_LARGE_chr22_34157357_ENST00000337431_CBX7_chr22_39545837_ENST00000216133_length(transcript)=4401nt_BP=595nt
CGGCCCCGGCGGTCGCTCCCTGGACCCAACCCGAGGCTGACCCAGGCCCCTGCCCATGCGGGGCGCCCCTGGCTCGGAAGAGTCCCCCGG
GCCGGGAGCAGCTCCAGGCAGCGGCCCCGGAGGAAGAGGAAGAAGGGACAGTGCTCAGCTTGGGGGACCCGGACCCTCGCCGCGGCATTT
GGAGCCGGGGGCAGTCCCGAACTCTGTGCTTGGCACCGCCGCTCCGAGTAGGGCAGCGCCTGCCGGGACCCTGACCCGGACCCCCTGCGC
CTCGTAGGCGGCGGCGCCGCCGCGCCACCCTGTTCTTCCGTGTCTCCCTCTGCCTGGCGGCAGTCACGGCCAAGAACCTACATTAAGAAA
CGCATTCAGATGCATCAGCCGTTTGTAAAGCACCTGTCGTGAGCCAGAGAGTATTATGAGGGAGGCCGAGGACTTCATGCTCCGGACAGA
GAAACGGCGCTGGGATTAGGGATTGCCACTTCTGAGAGGATGCTGGGAATCTGCAGGGGGAGACGGAAATTCTTGGCTGCCTCGTTGAGT
CTTCTCTGCATCCCAGCCATCACCTGGATTTACCTGTTTTCTGGGAGCTTCGAAGGGTAAAGTCGAGTATCTGGTGAAGTGGAAAGGATG
GCCCCCAAAGTACAGCACGTGGGAGCCAGAAGAGCACATCTTGGACCCCCGCCTCGTCATGGCCTACGAGGAGAAGGAGGAGAGAGACCG
AGCATCGGGGTATAGGAAGAGAGGTCCGAAACCCAAGCGGCTTCTGCTGCAGCGGCTGTACAGCATGGACCTGCGGAGCTCCCACAAGGC
CAAGGGCAAGGAGAAGCTCTGCTTCTCCCTGACGTGCCCACTCGGCAGCGGGAGCCCTGAGGGGGTGGTCAAGGCGGGGGCACCTGAGCT
GGTGGACAAGGGCCCCTTGGTGCCCACCCTGCCCTTCCCGCTCCGCAAGCCCCGAAAGGCCCACAAGTACCTGCGGCTCTCGCGCAAGAA
GTTCCCGCCCCGCGGGCCCAACCTGGAGAGCCACAGCCATCGACGGGAGCTCTTCCTGCAGGAGCCACCGGCCCCAGACGTCCTGCAGGC
GGCTGGCGAGTGGGAGCCTGCTGCGCAGCCCCCTGAAGAGGAGGCAGATGCCGACCTGGCCGAGGGGCCCCCTCCCTGGACACCTGCGCT
CCCCTCAAGTGAGGTGACCGTGACCGACATCACCGCCAACTCCATCACCGTCACCTTCCGCGAGGCCCAGGCAGCTGAGGGCTTCTTCCG
AGACCGCAGTGGGAAGTTCTGAATCACCGTTTTTACTCTTCTTAAACTGTTTTCTTTTGGGCTTGGGGTGGGACTTCCAGAGATAGGGAT
GGGTTGGGGGCGGGGTAATTATTTTATTTAAAAAAATACCGAGCAGCAAAAGGGGAGAAGATCCCACTACTCTCCCACCACCTGCCCTTT
CTCTGAGGGACGTTTACCACGAGGCCTCAGGCTGGGGATGGAGAGAGTTGCTCTGGGAGTTGGGGTACCACCCCCAGGGCAGGATGGGGA
CAGGATCACCTGCCCGGGACACCACCATTATCATTCTCCTCTAGTGACGCAGCAGCTGGTTCTGGGAGTTAAAGGAGCATTGGAAGGCCC
AAACCCTCTCCCTTGAGTGGCCACCCCAGCCTGGTTGGCTGGTTTTCCCCTTTTCTCTTGTTTCAATTGGGTCTTTACCTTGAACTCTCC
TCTCTGGCTTTGCGGTGGGCTGTGGAGGCTGGTTTTGACCAAAAGTGAGTGGGGCGGGAGGAAGGGGCAGGAGGAAGGGTTGAGGTTACT
TGGGGCGAGTCCCTTCCCCTTCAGAGAGGCTTCTATCCTTCCCAGGGAGGAGGCGCCGCTGAGACCCTTCTGCTGAGAGCTCTGCCCTCC
CCTCATCACCTGGCCTGTGCAGAAACGCTCATGCACACCTGGCTGCACAGGTGTGCACGCATTACCCTTCGCGTGTACGTTCCCATGTGC
CCCGTGAAAGCATGTGTGGCTGCAGACGTGTCCACATGGGCCTTGCGAACCTGGGTTAGAAACCCTGGCCAGGCGAACGTGGGGTGATTC
ACAGCACAAAAGACCTCACCACCACACCTGCACTCACCCCACCTTGCATGCACCTTGCTACCTGCTTGCGGCTTTCAGTGGAGGGCAGGG
GTCTGGCACAGGTGCGATGGCACCCCATGCTCCAGGCATACAGATGTGGTTTCTCGGCTGCACCGGGCCAGGCTGCGGGTGTGCAGGCGT
CTGCTAAGTTGTGTGATGTATCAGCACAGGCTTTGAGACGTCTGGACCCTGTCCTTCCTCCCGTGAGGGGTTCTTGTTCTTTCTGACTCA
GGTGACTTTTCAGCCCTTCCAATTCCCCTCTTTTTCTGCCCTCCCCTCCAACTCAGCCAACCCAGGTGTGGGCAGTCAGGGAGGGAGGGA
GTGTCCCACCACGTTCTCAGGGCAGCCCTTGACTCCTAAGCCCCTTCCTCCTTCCATTCTGCATCCCCTCCCCATCCAACCTAAATGCCA
CAGCTGGGGCTGAGCTGTATTCCTGTGGAGGGACCTCTGCCGTGCCTCTCTGAGGTCAGGCTGTGCTGTGTGATGGGCAGGCTTTGCCCC
AGCCCACCCCTGGCAAGGTGCACTTGTTTTCTGGTTTGTACAAGGTGTCCTGGGGGCCCGTGGCTTCCCTGCCAGTGAGGAGTGACTTCT
CCCTCTCTTCCAGTCCTGTAGGGGAGACAAAACCAGATTGGGGGGCCCAAGGGGAGCATGGAAAAGGCCGGCTCCCCTGTCTTTCCTTGG
CTGTCAGAGTCAGGGTAACACACACCAAGAGTGGAGTGCGGCCAGCAAGTTTGAGACCTGCCCGCCCTCCTCGCAGCTCTGCTCTGTGTC
CTCAGGAAGTCACAGAGTCTACTGAGGCAAGGAGAGGGTGATTCTTTCCCCAAATCCCTTCTTCCCTGGTTCCCAAACCAAAGACAGCCT
GCAGCCCTTTCTGCATGGGGTGCTCTGTTGACAGGCTTCCCAGATCCCTGAGTCTCTCTTTCCTTCCTCCTCGATCTTTAGTTGTCCACG
GTCAATTCAGTGCTTCCATTGGGGGACAGTCCCCTCCGGGATGACCTGATTCACCTCCAGCCCAGGGAATGGAATCTAGAGGAATACGTG
GGGTGGGTCTGGACAAGGAGCGGCAGGAATCACCACCCATCTCCAGCTGTGGAGCCCTGTGGAGGGGAAGGGGAAGCTTGGGGTTCAGAG
GGGACTCTTCCAGGAGAGGGGTGCCCAGCGGAGGTAAAGATGATAGAGGGTTGTGGGGGGTCTCTAGTTGAATGTTTTGGCCCATGACTT
TGGAACATGGCTGGCAGCTTCCAGCAGAAGTCACGCTCCCCATCCCCCAGGGGACATAGGACCTTTTTCCTGCTTCCTGGTCACTTTCAA
AGAACTATTTGCGCAATCTGTGGGTCTGTGGATTCACGGGGCTTTCTGTGTGGGTGCTGCAGTTGCTTTTGTCTGCAGCAGCAGGACACA
TCTTTCCTCTTACTCAGCCCTTTATGGCCCATGGGGAACTCCGTGGCTCAGGGAGAGCTGAACTCCAGGGGTGTGACCTGGGACGGGTGG
GCCTGAGGTGCCCAGCTCAGGGCAGCCAGGTGGCTCATGGGCTGTAGTGAGCCAGCTCCCTGGGGGAAAAGGCTGTGGGCCGTTAGGACC
ATCCTCCAGGACAGGTGACCTCTATGAGGTCACCTACGGCTGTGGCCGTGCAGGCCTCCTTCCAGCCCAGAGTGGCCCAGTAGAGCAAGG
CAGACAGTGACCTCCACCCCCGCAGCCCTCTTAAAAGGCCAGTACTCTTGGGGGTGGGGGGAGGGTTTAGAAAGCATTTGCCCATCTGCC
TTTCTTTCCCCCAGCCCCCACCCGCTTTGAATGTAGAGACCCGTGGGCACTTTTCCTTTTGTGGTGGGGGGTGCGGAGGAGGTACCCCCA
CCCCTGGCACAGCCGCCTGGAATGCAGGACTGTCACTGCTGTTCGGGTGATGACCTCGTTGCCAAGCTCCTCCTGTCCCCTTGTTCTGGG
GGCAGGCGCTGTGCTTCTGTGAGGTGGTTTAGCTTTTGCTTTCGAAGTGGCCAGCTGCGGCCACCAGGTCTCAGCACAAGAGCGCTTCCT
TTGCACAGAATGAGCTTCGAGCTTTGTTCAGACTAAATGAATGTATCTGGGAGGGGTCGGGGGCACGAGTTGATTCCAAGCACATGCCTT
TGCTGAGTGTGTGTGTGCTGGGAGAGTCAGAGTGGATGTAGAGCGCGGTTTTATTTTTGTACTGACATTGGTAAGAGACTGTATAGCATC

>44102_44102_1_LARGE-CBX7_LARGE_chr22_34157357_ENST00000337431_CBX7_chr22_39545837_ENST00000216133_length(amino acids)=233AA_BP=3
MGASKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEEKEERDRASGYRKRGPKPKRLLLQRLYSMDLRSSHKAKGKEKLCFSLTCP
LGSGSPEGVVKAGAPELVDKGPLVPTLPFPLRKPRKAHKYLRLSRKKFPPRGPNLESHSHRRELFLQEPPAPDVLQAAGEWEPAAQPPEE

--------------------------------------------------------------
>44102_44102_2_LARGE-CBX7_LARGE_chr22_34157357_ENST00000337431_CBX7_chr22_39545837_ENST00000401405_length(transcript)=1310nt_BP=595nt
CGGCCCCGGCGGTCGCTCCCTGGACCCAACCCGAGGCTGACCCAGGCCCCTGCCCATGCGGGGCGCCCCTGGCTCGGAAGAGTCCCCCGG
GCCGGGAGCAGCTCCAGGCAGCGGCCCCGGAGGAAGAGGAAGAAGGGACAGTGCTCAGCTTGGGGGACCCGGACCCTCGCCGCGGCATTT
GGAGCCGGGGGCAGTCCCGAACTCTGTGCTTGGCACCGCCGCTCCGAGTAGGGCAGCGCCTGCCGGGACCCTGACCCGGACCCCCTGCGC
CTCGTAGGCGGCGGCGCCGCCGCGCCACCCTGTTCTTCCGTGTCTCCCTCTGCCTGGCGGCAGTCACGGCCAAGAACCTACATTAAGAAA
CGCATTCAGATGCATCAGCCGTTTGTAAAGCACCTGTCGTGAGCCAGAGAGTATTATGAGGGAGGCCGAGGACTTCATGCTCCGGACAGA
GAAACGGCGCTGGGATTAGGGATTGCCACTTCTGAGAGGATGCTGGGAATCTGCAGGGGGAGACGGAAATTCTTGGCTGCCTCGTTGAGT
CTTCTCTGCATCCCAGCCATCACCTGGATTTACCTGTTTTCTGGGAGCTTCGAAGGGTAAAGTCGAGTATCTGGTGAAGTGGAAAGGATG
GCCCCCAAAGTACAGCACGTGGGAGCCAGAAGAGCACATCTTGGACCCCCGCCTCGTCATGGCCTACGAGGAGAAGGAGGAGAGAGACCG
AGCATCGGGGTATAGGAAGAGAGGTCCGAAACCCAAGCGGCTTCTGCTGCAGGAGCCACCGGCCCCAGACGTCCTGCAGGCGGCTGGCGA
GTGGGAGCCTGCTGCGCAGCCCCCTGAAGAGGAGGCAGATGCCGACCTGGCCGAGGGGCCCCCTCCCTGGACACCTGCGCTCCCCTCAAG
TGAGGTGACCGTGACCGACATCACCGCCAACTCCATCACCGTCACCTTCCGCGAGGCCCAGGCAGCTGAGGGCTTCTTCCGAGACCGCAG
TGGGAAGTTCTGAATCACCGTTTTTACTCTTCTTAAACTGTTTTCTTTTGGGCTTGGGGTGGGACTTCCAGAGATAGGGATGGGTTGGGG
GCGGGGTAATTATTTTATTTAAAAAAATACCGAGCAGCAAAAGGGGAGAAGATCCCACTACTCTCCCACCACCTGCCCTTTCTCTGAGGG
ACGTTTACCACGAGGCCTCAGGCTGGGGATGGAGAGAGTTGCTCTGGGAGTTGGGGTACCACCCCCAGGGCAGGATGGGGACAGGATCAC

>44102_44102_2_LARGE-CBX7_LARGE_chr22_34157357_ENST00000337431_CBX7_chr22_39545837_ENST00000401405_length(amino acids)=140AA_BP=3
MGASKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEEKEERDRASGYRKRGPKPKRLLLQEPPAPDVLQAAGEWEPAAQPPEEEAD

--------------------------------------------------------------
>44102_44102_3_LARGE-CBX7_LARGE_chr22_34157357_ENST00000337431_CBX7_chr22_39545837_ENST00000475962_length(transcript)=1046nt_BP=595nt
CGGCCCCGGCGGTCGCTCCCTGGACCCAACCCGAGGCTGACCCAGGCCCCTGCCCATGCGGGGCGCCCCTGGCTCGGAAGAGTCCCCCGG
GCCGGGAGCAGCTCCAGGCAGCGGCCCCGGAGGAAGAGGAAGAAGGGACAGTGCTCAGCTTGGGGGACCCGGACCCTCGCCGCGGCATTT
GGAGCCGGGGGCAGTCCCGAACTCTGTGCTTGGCACCGCCGCTCCGAGTAGGGCAGCGCCTGCCGGGACCCTGACCCGGACCCCCTGCGC
CTCGTAGGCGGCGGCGCCGCCGCGCCACCCTGTTCTTCCGTGTCTCCCTCTGCCTGGCGGCAGTCACGGCCAAGAACCTACATTAAGAAA
CGCATTCAGATGCATCAGCCGTTTGTAAAGCACCTGTCGTGAGCCAGAGAGTATTATGAGGGAGGCCGAGGACTTCATGCTCCGGACAGA
GAAACGGCGCTGGGATTAGGGATTGCCACTTCTGAGAGGATGCTGGGAATCTGCAGGGGGAGACGGAAATTCTTGGCTGCCTCGTTGAGT
CTTCTCTGCATCCCAGCCATCACCTGGATTTACCTGTTTTCTGGGAGCTTCGAAGGGTAAAGTCGAGTATCTGGTGAAGTGGAAAGGATG
GCCCCCAAACATCGCACTACTGATGACAAGCAGGCAACTGCGTTGGAGAGGGAGGTCATCATGGCTGCAAGGAAGGGACCGGACCCATAT
AATATACTACCCCTAAAGGCAGCTTCAGGCACCAAAGAAGACCCTAATTTAGTCCTCTCCATCACCAACAAGAGAAGAGTGGGCTGCATC
TGCGTAGAGGACAACAGTACCATCACCTGCTTTGGGCTGCACAAAGGCGAGACCCAGTGATGCCCCAGCTATGGAACCCATTACAAGCTG
GTGCCCCGCCAGCTGACGTACTGAGCACCTGCACCAAGTTACCCAAAATGTGCTGCAAAGTTTCTTCTTTCCAATAAAGAGTAGCCATTG

>44102_44102_3_LARGE-CBX7_LARGE_chr22_34157357_ENST00000337431_CBX7_chr22_39545837_ENST00000475962_length(amino acids)=114AA_BP=
MAVTAARQRETRKNRVARRRRRLRGAGGPGQGPGRRCPTRSGGAKHRVRDCPRLQMPRRGSGSPKLSTVPSSSSSGAAAWSCSRPGGLFR

--------------------------------------------------------------
>44102_44102_4_LARGE-CBX7_LARGE_chr22_34157357_ENST00000354992_CBX7_chr22_39545837_ENST00000216133_length(transcript)=4484nt_BP=678nt
CGGCCGCGCTCGTCTCGCCGGGCTGTTCGCGGGCAGGCCCTGCCCTGAAGGGACGAATCGGCTTGGAGCGCGGGAGGTGGAGTCGGCCCC
GGCGGTCGCTCCCTGGACCCAACCCGAGGCTGACCCAGGCCCCTGCCCATGCGGGGCGCCCCTGGCTCGGAAGAGTCCCCCGGGCCGGGA
GCAGCTCCAGGCAGCGGCCCCGGAGGAAGAGGAAGAAGGGACAGTGCTCAGCTTGGGGGACCCGGACCCTCGCCGCGGCATTTGGAGCCG
GGGGCAGTCCCGAACTCTGTGCTTGGCACCGCCGCTCCGAGTAGGGCAGCGCCTGCCGGGACCCTGACCCGGACCCCCTGCGCCTCGTAG
GCGGCGGCGCCGCCGCGCCACCCTGTTCTTCCGTGTCTCCCTCTGCCTGGCGGCAGTCACGGCCAAGAACCTACATTAAGAAACGCATTC
AGATGCATCAGCCGTTTGTAAAGCACCTGTCGTGAGCCAGAGAGTATTATGAGGGAGGCCGAGGACTTCATGCTCCGGACAGAGAAACGG
CGCTGGGATTAGGGATTGCCACTTCTGAGAGGATGCTGGGAATCTGCAGGGGGAGACGGAAATTCTTGGCTGCCTCGTTGAGTCTTCTCT
GCATCCCAGCCATCACCTGGATTTACCTGTTTTCTGGGAGCTTCGAAGGGTAAAGTCGAGTATCTGGTGAAGTGGAAAGGATGGCCCCCA
AAGTACAGCACGTGGGAGCCAGAAGAGCACATCTTGGACCCCCGCCTCGTCATGGCCTACGAGGAGAAGGAGGAGAGAGACCGAGCATCG
GGGTATAGGAAGAGAGGTCCGAAACCCAAGCGGCTTCTGCTGCAGCGGCTGTACAGCATGGACCTGCGGAGCTCCCACAAGGCCAAGGGC
AAGGAGAAGCTCTGCTTCTCCCTGACGTGCCCACTCGGCAGCGGGAGCCCTGAGGGGGTGGTCAAGGCGGGGGCACCTGAGCTGGTGGAC
AAGGGCCCCTTGGTGCCCACCCTGCCCTTCCCGCTCCGCAAGCCCCGAAAGGCCCACAAGTACCTGCGGCTCTCGCGCAAGAAGTTCCCG
CCCCGCGGGCCCAACCTGGAGAGCCACAGCCATCGACGGGAGCTCTTCCTGCAGGAGCCACCGGCCCCAGACGTCCTGCAGGCGGCTGGC
GAGTGGGAGCCTGCTGCGCAGCCCCCTGAAGAGGAGGCAGATGCCGACCTGGCCGAGGGGCCCCCTCCCTGGACACCTGCGCTCCCCTCA
AGTGAGGTGACCGTGACCGACATCACCGCCAACTCCATCACCGTCACCTTCCGCGAGGCCCAGGCAGCTGAGGGCTTCTTCCGAGACCGC
AGTGGGAAGTTCTGAATCACCGTTTTTACTCTTCTTAAACTGTTTTCTTTTGGGCTTGGGGTGGGACTTCCAGAGATAGGGATGGGTTGG
GGGCGGGGTAATTATTTTATTTAAAAAAATACCGAGCAGCAAAAGGGGAGAAGATCCCACTACTCTCCCACCACCTGCCCTTTCTCTGAG
GGACGTTTACCACGAGGCCTCAGGCTGGGGATGGAGAGAGTTGCTCTGGGAGTTGGGGTACCACCCCCAGGGCAGGATGGGGACAGGATC
ACCTGCCCGGGACACCACCATTATCATTCTCCTCTAGTGACGCAGCAGCTGGTTCTGGGAGTTAAAGGAGCATTGGAAGGCCCAAACCCT
CTCCCTTGAGTGGCCACCCCAGCCTGGTTGGCTGGTTTTCCCCTTTTCTCTTGTTTCAATTGGGTCTTTACCTTGAACTCTCCTCTCTGG
CTTTGCGGTGGGCTGTGGAGGCTGGTTTTGACCAAAAGTGAGTGGGGCGGGAGGAAGGGGCAGGAGGAAGGGTTGAGGTTACTTGGGGCG
AGTCCCTTCCCCTTCAGAGAGGCTTCTATCCTTCCCAGGGAGGAGGCGCCGCTGAGACCCTTCTGCTGAGAGCTCTGCCCTCCCCTCATC
ACCTGGCCTGTGCAGAAACGCTCATGCACACCTGGCTGCACAGGTGTGCACGCATTACCCTTCGCGTGTACGTTCCCATGTGCCCCGTGA
AAGCATGTGTGGCTGCAGACGTGTCCACATGGGCCTTGCGAACCTGGGTTAGAAACCCTGGCCAGGCGAACGTGGGGTGATTCACAGCAC
AAAAGACCTCACCACCACACCTGCACTCACCCCACCTTGCATGCACCTTGCTACCTGCTTGCGGCTTTCAGTGGAGGGCAGGGGTCTGGC
ACAGGTGCGATGGCACCCCATGCTCCAGGCATACAGATGTGGTTTCTCGGCTGCACCGGGCCAGGCTGCGGGTGTGCAGGCGTCTGCTAA
GTTGTGTGATGTATCAGCACAGGCTTTGAGACGTCTGGACCCTGTCCTTCCTCCCGTGAGGGGTTCTTGTTCTTTCTGACTCAGGTGACT
TTTCAGCCCTTCCAATTCCCCTCTTTTTCTGCCCTCCCCTCCAACTCAGCCAACCCAGGTGTGGGCAGTCAGGGAGGGAGGGAGTGTCCC
ACCACGTTCTCAGGGCAGCCCTTGACTCCTAAGCCCCTTCCTCCTTCCATTCTGCATCCCCTCCCCATCCAACCTAAATGCCACAGCTGG
GGCTGAGCTGTATTCCTGTGGAGGGACCTCTGCCGTGCCTCTCTGAGGTCAGGCTGTGCTGTGTGATGGGCAGGCTTTGCCCCAGCCCAC
CCCTGGCAAGGTGCACTTGTTTTCTGGTTTGTACAAGGTGTCCTGGGGGCCCGTGGCTTCCCTGCCAGTGAGGAGTGACTTCTCCCTCTC
TTCCAGTCCTGTAGGGGAGACAAAACCAGATTGGGGGGCCCAAGGGGAGCATGGAAAAGGCCGGCTCCCCTGTCTTTCCTTGGCTGTCAG
AGTCAGGGTAACACACACCAAGAGTGGAGTGCGGCCAGCAAGTTTGAGACCTGCCCGCCCTCCTCGCAGCTCTGCTCTGTGTCCTCAGGA
AGTCACAGAGTCTACTGAGGCAAGGAGAGGGTGATTCTTTCCCCAAATCCCTTCTTCCCTGGTTCCCAAACCAAAGACAGCCTGCAGCCC
TTTCTGCATGGGGTGCTCTGTTGACAGGCTTCCCAGATCCCTGAGTCTCTCTTTCCTTCCTCCTCGATCTTTAGTTGTCCACGGTCAATT
CAGTGCTTCCATTGGGGGACAGTCCCCTCCGGGATGACCTGATTCACCTCCAGCCCAGGGAATGGAATCTAGAGGAATACGTGGGGTGGG
TCTGGACAAGGAGCGGCAGGAATCACCACCCATCTCCAGCTGTGGAGCCCTGTGGAGGGGAAGGGGAAGCTTGGGGTTCAGAGGGGACTC
TTCCAGGAGAGGGGTGCCCAGCGGAGGTAAAGATGATAGAGGGTTGTGGGGGGTCTCTAGTTGAATGTTTTGGCCCATGACTTTGGAACA
TGGCTGGCAGCTTCCAGCAGAAGTCACGCTCCCCATCCCCCAGGGGACATAGGACCTTTTTCCTGCTTCCTGGTCACTTTCAAAGAACTA
TTTGCGCAATCTGTGGGTCTGTGGATTCACGGGGCTTTCTGTGTGGGTGCTGCAGTTGCTTTTGTCTGCAGCAGCAGGACACATCTTTCC
TCTTACTCAGCCCTTTATGGCCCATGGGGAACTCCGTGGCTCAGGGAGAGCTGAACTCCAGGGGTGTGACCTGGGACGGGTGGGCCTGAG
GTGCCCAGCTCAGGGCAGCCAGGTGGCTCATGGGCTGTAGTGAGCCAGCTCCCTGGGGGAAAAGGCTGTGGGCCGTTAGGACCATCCTCC
AGGACAGGTGACCTCTATGAGGTCACCTACGGCTGTGGCCGTGCAGGCCTCCTTCCAGCCCAGAGTGGCCCAGTAGAGCAAGGCAGACAG
TGACCTCCACCCCCGCAGCCCTCTTAAAAGGCCAGTACTCTTGGGGGTGGGGGGAGGGTTTAGAAAGCATTTGCCCATCTGCCTTTCTTT
CCCCCAGCCCCCACCCGCTTTGAATGTAGAGACCCGTGGGCACTTTTCCTTTTGTGGTGGGGGGTGCGGAGGAGGTACCCCCACCCCTGG
CACAGCCGCCTGGAATGCAGGACTGTCACTGCTGTTCGGGTGATGACCTCGTTGCCAAGCTCCTCCTGTCCCCTTGTTCTGGGGGCAGGC
GCTGTGCTTCTGTGAGGTGGTTTAGCTTTTGCTTTCGAAGTGGCCAGCTGCGGCCACCAGGTCTCAGCACAAGAGCGCTTCCTTTGCACA
GAATGAGCTTCGAGCTTTGTTCAGACTAAATGAATGTATCTGGGAGGGGTCGGGGGCACGAGTTGATTCCAAGCACATGCCTTTGCTGAG
TGTGTGTGTGCTGGGAGAGTCAGAGTGGATGTAGAGCGCGGTTTTATTTTTGTACTGACATTGGTAAGAGACTGTATAGCATCTATTTAT

>44102_44102_4_LARGE-CBX7_LARGE_chr22_34157357_ENST00000354992_CBX7_chr22_39545837_ENST00000216133_length(amino acids)=233AA_BP=3
MGASKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEEKEERDRASGYRKRGPKPKRLLLQRLYSMDLRSSHKAKGKEKLCFSLTCP
LGSGSPEGVVKAGAPELVDKGPLVPTLPFPLRKPRKAHKYLRLSRKKFPPRGPNLESHSHRRELFLQEPPAPDVLQAAGEWEPAAQPPEE

--------------------------------------------------------------
>44102_44102_5_LARGE-CBX7_LARGE_chr22_34157357_ENST00000354992_CBX7_chr22_39545837_ENST00000401405_length(transcript)=1393nt_BP=678nt
CGGCCGCGCTCGTCTCGCCGGGCTGTTCGCGGGCAGGCCCTGCCCTGAAGGGACGAATCGGCTTGGAGCGCGGGAGGTGGAGTCGGCCCC
GGCGGTCGCTCCCTGGACCCAACCCGAGGCTGACCCAGGCCCCTGCCCATGCGGGGCGCCCCTGGCTCGGAAGAGTCCCCCGGGCCGGGA
GCAGCTCCAGGCAGCGGCCCCGGAGGAAGAGGAAGAAGGGACAGTGCTCAGCTTGGGGGACCCGGACCCTCGCCGCGGCATTTGGAGCCG
GGGGCAGTCCCGAACTCTGTGCTTGGCACCGCCGCTCCGAGTAGGGCAGCGCCTGCCGGGACCCTGACCCGGACCCCCTGCGCCTCGTAG
GCGGCGGCGCCGCCGCGCCACCCTGTTCTTCCGTGTCTCCCTCTGCCTGGCGGCAGTCACGGCCAAGAACCTACATTAAGAAACGCATTC
AGATGCATCAGCCGTTTGTAAAGCACCTGTCGTGAGCCAGAGAGTATTATGAGGGAGGCCGAGGACTTCATGCTCCGGACAGAGAAACGG
CGCTGGGATTAGGGATTGCCACTTCTGAGAGGATGCTGGGAATCTGCAGGGGGAGACGGAAATTCTTGGCTGCCTCGTTGAGTCTTCTCT
GCATCCCAGCCATCACCTGGATTTACCTGTTTTCTGGGAGCTTCGAAGGGTAAAGTCGAGTATCTGGTGAAGTGGAAAGGATGGCCCCCA
AAGTACAGCACGTGGGAGCCAGAAGAGCACATCTTGGACCCCCGCCTCGTCATGGCCTACGAGGAGAAGGAGGAGAGAGACCGAGCATCG
GGGTATAGGAAGAGAGGTCCGAAACCCAAGCGGCTTCTGCTGCAGGAGCCACCGGCCCCAGACGTCCTGCAGGCGGCTGGCGAGTGGGAG
CCTGCTGCGCAGCCCCCTGAAGAGGAGGCAGATGCCGACCTGGCCGAGGGGCCCCCTCCCTGGACACCTGCGCTCCCCTCAAGTGAGGTG
ACCGTGACCGACATCACCGCCAACTCCATCACCGTCACCTTCCGCGAGGCCCAGGCAGCTGAGGGCTTCTTCCGAGACCGCAGTGGGAAG
TTCTGAATCACCGTTTTTACTCTTCTTAAACTGTTTTCTTTTGGGCTTGGGGTGGGACTTCCAGAGATAGGGATGGGTTGGGGGCGGGGT
AATTATTTTATTTAAAAAAATACCGAGCAGCAAAAGGGGAGAAGATCCCACTACTCTCCCACCACCTGCCCTTTCTCTGAGGGACGTTTA
CCACGAGGCCTCAGGCTGGGGATGGAGAGAGTTGCTCTGGGAGTTGGGGTACCACCCCCAGGGCAGGATGGGGACAGGATCACCTGCCCG

>44102_44102_5_LARGE-CBX7_LARGE_chr22_34157357_ENST00000354992_CBX7_chr22_39545837_ENST00000401405_length(amino acids)=142AA_BP=
MAVTAARQRETRKNRVARRRRRLRGAGGPGQGPGRRCPTRSGGAKHRVRDCPRLQMPRRGSGSPKLSTVPSSSSSGAAAWSCSRPGGLFR

--------------------------------------------------------------
>44102_44102_6_LARGE-CBX7_LARGE_chr22_34157357_ENST00000354992_CBX7_chr22_39545837_ENST00000475962_length(transcript)=1129nt_BP=678nt
CGGCCGCGCTCGTCTCGCCGGGCTGTTCGCGGGCAGGCCCTGCCCTGAAGGGACGAATCGGCTTGGAGCGCGGGAGGTGGAGTCGGCCCC
GGCGGTCGCTCCCTGGACCCAACCCGAGGCTGACCCAGGCCCCTGCCCATGCGGGGCGCCCCTGGCTCGGAAGAGTCCCCCGGGCCGGGA
GCAGCTCCAGGCAGCGGCCCCGGAGGAAGAGGAAGAAGGGACAGTGCTCAGCTTGGGGGACCCGGACCCTCGCCGCGGCATTTGGAGCCG
GGGGCAGTCCCGAACTCTGTGCTTGGCACCGCCGCTCCGAGTAGGGCAGCGCCTGCCGGGACCCTGACCCGGACCCCCTGCGCCTCGTAG
GCGGCGGCGCCGCCGCGCCACCCTGTTCTTCCGTGTCTCCCTCTGCCTGGCGGCAGTCACGGCCAAGAACCTACATTAAGAAACGCATTC
AGATGCATCAGCCGTTTGTAAAGCACCTGTCGTGAGCCAGAGAGTATTATGAGGGAGGCCGAGGACTTCATGCTCCGGACAGAGAAACGG
CGCTGGGATTAGGGATTGCCACTTCTGAGAGGATGCTGGGAATCTGCAGGGGGAGACGGAAATTCTTGGCTGCCTCGTTGAGTCTTCTCT
GCATCCCAGCCATCACCTGGATTTACCTGTTTTCTGGGAGCTTCGAAGGGTAAAGTCGAGTATCTGGTGAAGTGGAAAGGATGGCCCCCA
AACATCGCACTACTGATGACAAGCAGGCAACTGCGTTGGAGAGGGAGGTCATCATGGCTGCAAGGAAGGGACCGGACCCATATAATATAC
TACCCCTAAAGGCAGCTTCAGGCACCAAAGAAGACCCTAATTTAGTCCTCTCCATCACCAACAAGAGAAGAGTGGGCTGCATCTGCGTAG
AGGACAACAGTACCATCACCTGCTTTGGGCTGCACAAAGGCGAGACCCAGTGATGCCCCAGCTATGGAACCCATTACAAGCTGGTGCCCC
GCCAGCTGACGTACTGAGCACCTGCACCAAGTTACCCAAAATGTGCTGCAAAGTTTCTTCTTTCCAATAAAGAGTAGCCATTGCATTGGC

>44102_44102_6_LARGE-CBX7_LARGE_chr22_34157357_ENST00000354992_CBX7_chr22_39545837_ENST00000475962_length(amino acids)=142AA_BP=
MAVTAARQRETRKNRVARRRRRLRGAGGPGQGPGRRCPTRSGGAKHRVRDCPRLQMPRRGSGSPKLSTVPSSSSSGAAAWSCSRPGGLFR

--------------------------------------------------------------
>44102_44102_7_LARGE-CBX7_LARGE_chr22_34157357_ENST00000397394_CBX7_chr22_39545837_ENST00000216133_length(transcript)=4428nt_BP=622nt
GGGGGCGCGGCCGCGCTCGTCTCGCCGGGCTGTTCGCGGGCAGGCCCTGCCCTGAAGGGACGAATCGGCTTGGAGCGCGGGAGGTGGAGT
CGGCCCCGGCGGTCGCTCCCTGGACCCAACCCGAGGCTGACCCAGGCCCCTGCCCATGCGGGGCGCCCCTGGCTCGGAAGAGTCCCCCGG
GCCGGGAGCAGCTCCAGGCAGCGGCCCCGGAGGAAGAGGAAGAAGGGACAGTGCTCAGCTTGGGGGACCCGGACCCTCGCCGCGGCATTT
GGAGCCGGGGGCAGTCCCGAACTCTGTGCTTGGCACCGCCGCTCCGAGTAGGGCAGCGCCTGCCGGGACCCTGACCCGGACCCCCTGCGC
CTCGTAGGCGGCGGCGCCGCCGCGCCACCCTGTTCTTCCGTGTCTCCCTCTGCCTGGCGGCAGTCACGGCCAAGAGAGTATTATGAGGGA
GGCCGAGGACTTCATGCTCCGGACAGAGAAACGGCGCTGGGATTAGGGATTGCCACTTCTGAGAGGATGCTGGGAATCTGCAGGGGGAGA
CGGAAATTCTTGGCTGCCTCGTTGAGTCTTCTCTGCATCCCAGCCATCACCTGGATTTACCTGTTTTCTGGGAGCTTCGAAGGGTAAAGT
CGAGTATCTGGTGAAGTGGAAAGGATGGCCCCCAAAGTACAGCACGTGGGAGCCAGAAGAGCACATCTTGGACCCCCGCCTCGTCATGGC
CTACGAGGAGAAGGAGGAGAGAGACCGAGCATCGGGGTATAGGAAGAGAGGTCCGAAACCCAAGCGGCTTCTGCTGCAGCGGCTGTACAG
CATGGACCTGCGGAGCTCCCACAAGGCCAAGGGCAAGGAGAAGCTCTGCTTCTCCCTGACGTGCCCACTCGGCAGCGGGAGCCCTGAGGG
GGTGGTCAAGGCGGGGGCACCTGAGCTGGTGGACAAGGGCCCCTTGGTGCCCACCCTGCCCTTCCCGCTCCGCAAGCCCCGAAAGGCCCA
CAAGTACCTGCGGCTCTCGCGCAAGAAGTTCCCGCCCCGCGGGCCCAACCTGGAGAGCCACAGCCATCGACGGGAGCTCTTCCTGCAGGA
GCCACCGGCCCCAGACGTCCTGCAGGCGGCTGGCGAGTGGGAGCCTGCTGCGCAGCCCCCTGAAGAGGAGGCAGATGCCGACCTGGCCGA
GGGGCCCCCTCCCTGGACACCTGCGCTCCCCTCAAGTGAGGTGACCGTGACCGACATCACCGCCAACTCCATCACCGTCACCTTCCGCGA
GGCCCAGGCAGCTGAGGGCTTCTTCCGAGACCGCAGTGGGAAGTTCTGAATCACCGTTTTTACTCTTCTTAAACTGTTTTCTTTTGGGCT
TGGGGTGGGACTTCCAGAGATAGGGATGGGTTGGGGGCGGGGTAATTATTTTATTTAAAAAAATACCGAGCAGCAAAAGGGGAGAAGATC
CCACTACTCTCCCACCACCTGCCCTTTCTCTGAGGGACGTTTACCACGAGGCCTCAGGCTGGGGATGGAGAGAGTTGCTCTGGGAGTTGG
GGTACCACCCCCAGGGCAGGATGGGGACAGGATCACCTGCCCGGGACACCACCATTATCATTCTCCTCTAGTGACGCAGCAGCTGGTTCT
GGGAGTTAAAGGAGCATTGGAAGGCCCAAACCCTCTCCCTTGAGTGGCCACCCCAGCCTGGTTGGCTGGTTTTCCCCTTTTCTCTTGTTT
CAATTGGGTCTTTACCTTGAACTCTCCTCTCTGGCTTTGCGGTGGGCTGTGGAGGCTGGTTTTGACCAAAAGTGAGTGGGGCGGGAGGAA
GGGGCAGGAGGAAGGGTTGAGGTTACTTGGGGCGAGTCCCTTCCCCTTCAGAGAGGCTTCTATCCTTCCCAGGGAGGAGGCGCCGCTGAG
ACCCTTCTGCTGAGAGCTCTGCCCTCCCCTCATCACCTGGCCTGTGCAGAAACGCTCATGCACACCTGGCTGCACAGGTGTGCACGCATT
ACCCTTCGCGTGTACGTTCCCATGTGCCCCGTGAAAGCATGTGTGGCTGCAGACGTGTCCACATGGGCCTTGCGAACCTGGGTTAGAAAC
CCTGGCCAGGCGAACGTGGGGTGATTCACAGCACAAAAGACCTCACCACCACACCTGCACTCACCCCACCTTGCATGCACCTTGCTACCT
GCTTGCGGCTTTCAGTGGAGGGCAGGGGTCTGGCACAGGTGCGATGGCACCCCATGCTCCAGGCATACAGATGTGGTTTCTCGGCTGCAC
CGGGCCAGGCTGCGGGTGTGCAGGCGTCTGCTAAGTTGTGTGATGTATCAGCACAGGCTTTGAGACGTCTGGACCCTGTCCTTCCTCCCG
TGAGGGGTTCTTGTTCTTTCTGACTCAGGTGACTTTTCAGCCCTTCCAATTCCCCTCTTTTTCTGCCCTCCCCTCCAACTCAGCCAACCC
AGGTGTGGGCAGTCAGGGAGGGAGGGAGTGTCCCACCACGTTCTCAGGGCAGCCCTTGACTCCTAAGCCCCTTCCTCCTTCCATTCTGCA
TCCCCTCCCCATCCAACCTAAATGCCACAGCTGGGGCTGAGCTGTATTCCTGTGGAGGGACCTCTGCCGTGCCTCTCTGAGGTCAGGCTG
TGCTGTGTGATGGGCAGGCTTTGCCCCAGCCCACCCCTGGCAAGGTGCACTTGTTTTCTGGTTTGTACAAGGTGTCCTGGGGGCCCGTGG
CTTCCCTGCCAGTGAGGAGTGACTTCTCCCTCTCTTCCAGTCCTGTAGGGGAGACAAAACCAGATTGGGGGGCCCAAGGGGAGCATGGAA
AAGGCCGGCTCCCCTGTCTTTCCTTGGCTGTCAGAGTCAGGGTAACACACACCAAGAGTGGAGTGCGGCCAGCAAGTTTGAGACCTGCCC
GCCCTCCTCGCAGCTCTGCTCTGTGTCCTCAGGAAGTCACAGAGTCTACTGAGGCAAGGAGAGGGTGATTCTTTCCCCAAATCCCTTCTT
CCCTGGTTCCCAAACCAAAGACAGCCTGCAGCCCTTTCTGCATGGGGTGCTCTGTTGACAGGCTTCCCAGATCCCTGAGTCTCTCTTTCC
TTCCTCCTCGATCTTTAGTTGTCCACGGTCAATTCAGTGCTTCCATTGGGGGACAGTCCCCTCCGGGATGACCTGATTCACCTCCAGCCC
AGGGAATGGAATCTAGAGGAATACGTGGGGTGGGTCTGGACAAGGAGCGGCAGGAATCACCACCCATCTCCAGCTGTGGAGCCCTGTGGA
GGGGAAGGGGAAGCTTGGGGTTCAGAGGGGACTCTTCCAGGAGAGGGGTGCCCAGCGGAGGTAAAGATGATAGAGGGTTGTGGGGGGTCT
CTAGTTGAATGTTTTGGCCCATGACTTTGGAACATGGCTGGCAGCTTCCAGCAGAAGTCACGCTCCCCATCCCCCAGGGGACATAGGACC
TTTTTCCTGCTTCCTGGTCACTTTCAAAGAACTATTTGCGCAATCTGTGGGTCTGTGGATTCACGGGGCTTTCTGTGTGGGTGCTGCAGT
TGCTTTTGTCTGCAGCAGCAGGACACATCTTTCCTCTTACTCAGCCCTTTATGGCCCATGGGGAACTCCGTGGCTCAGGGAGAGCTGAAC
TCCAGGGGTGTGACCTGGGACGGGTGGGCCTGAGGTGCCCAGCTCAGGGCAGCCAGGTGGCTCATGGGCTGTAGTGAGCCAGCTCCCTGG
GGGAAAAGGCTGTGGGCCGTTAGGACCATCCTCCAGGACAGGTGACCTCTATGAGGTCACCTACGGCTGTGGCCGTGCAGGCCTCCTTCC
AGCCCAGAGTGGCCCAGTAGAGCAAGGCAGACAGTGACCTCCACCCCCGCAGCCCTCTTAAAAGGCCAGTACTCTTGGGGGTGGGGGGAG
GGTTTAGAAAGCATTTGCCCATCTGCCTTTCTTTCCCCCAGCCCCCACCCGCTTTGAATGTAGAGACCCGTGGGCACTTTTCCTTTTGTG
GTGGGGGGTGCGGAGGAGGTACCCCCACCCCTGGCACAGCCGCCTGGAATGCAGGACTGTCACTGCTGTTCGGGTGATGACCTCGTTGCC
AAGCTCCTCCTGTCCCCTTGTTCTGGGGGCAGGCGCTGTGCTTCTGTGAGGTGGTTTAGCTTTTGCTTTCGAAGTGGCCAGCTGCGGCCA
CCAGGTCTCAGCACAAGAGCGCTTCCTTTGCACAGAATGAGCTTCGAGCTTTGTTCAGACTAAATGAATGTATCTGGGAGGGGTCGGGGG
CACGAGTTGATTCCAAGCACATGCCTTTGCTGAGTGTGTGTGTGCTGGGAGAGTCAGAGTGGATGTAGAGCGCGGTTTTATTTTTGTACT
GACATTGGTAAGAGACTGTATAGCATCTATTTATTTAGATGATTTATCTGGTAAATGAGGCAAAAAAATTATTAAAAATACATTAAAGAT

>44102_44102_7_LARGE-CBX7_LARGE_chr22_34157357_ENST00000397394_CBX7_chr22_39545837_ENST00000216133_length(amino acids)=233AA_BP=3
MGASKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEEKEERDRASGYRKRGPKPKRLLLQRLYSMDLRSSHKAKGKEKLCFSLTCP
LGSGSPEGVVKAGAPELVDKGPLVPTLPFPLRKPRKAHKYLRLSRKKFPPRGPNLESHSHRRELFLQEPPAPDVLQAAGEWEPAAQPPEE

--------------------------------------------------------------
>44102_44102_8_LARGE-CBX7_LARGE_chr22_34157357_ENST00000397394_CBX7_chr22_39545837_ENST00000401405_length(transcript)=1337nt_BP=622nt
GGGGGCGCGGCCGCGCTCGTCTCGCCGGGCTGTTCGCGGGCAGGCCCTGCCCTGAAGGGACGAATCGGCTTGGAGCGCGGGAGGTGGAGT
CGGCCCCGGCGGTCGCTCCCTGGACCCAACCCGAGGCTGACCCAGGCCCCTGCCCATGCGGGGCGCCCCTGGCTCGGAAGAGTCCCCCGG
GCCGGGAGCAGCTCCAGGCAGCGGCCCCGGAGGAAGAGGAAGAAGGGACAGTGCTCAGCTTGGGGGACCCGGACCCTCGCCGCGGCATTT
GGAGCCGGGGGCAGTCCCGAACTCTGTGCTTGGCACCGCCGCTCCGAGTAGGGCAGCGCCTGCCGGGACCCTGACCCGGACCCCCTGCGC
CTCGTAGGCGGCGGCGCCGCCGCGCCACCCTGTTCTTCCGTGTCTCCCTCTGCCTGGCGGCAGTCACGGCCAAGAGAGTATTATGAGGGA
GGCCGAGGACTTCATGCTCCGGACAGAGAAACGGCGCTGGGATTAGGGATTGCCACTTCTGAGAGGATGCTGGGAATCTGCAGGGGGAGA
CGGAAATTCTTGGCTGCCTCGTTGAGTCTTCTCTGCATCCCAGCCATCACCTGGATTTACCTGTTTTCTGGGAGCTTCGAAGGGTAAAGT
CGAGTATCTGGTGAAGTGGAAAGGATGGCCCCCAAAGTACAGCACGTGGGAGCCAGAAGAGCACATCTTGGACCCCCGCCTCGTCATGGC
CTACGAGGAGAAGGAGGAGAGAGACCGAGCATCGGGGTATAGGAAGAGAGGTCCGAAACCCAAGCGGCTTCTGCTGCAGGAGCCACCGGC
CCCAGACGTCCTGCAGGCGGCTGGCGAGTGGGAGCCTGCTGCGCAGCCCCCTGAAGAGGAGGCAGATGCCGACCTGGCCGAGGGGCCCCC
TCCCTGGACACCTGCGCTCCCCTCAAGTGAGGTGACCGTGACCGACATCACCGCCAACTCCATCACCGTCACCTTCCGCGAGGCCCAGGC
AGCTGAGGGCTTCTTCCGAGACCGCAGTGGGAAGTTCTGAATCACCGTTTTTACTCTTCTTAAACTGTTTTCTTTTGGGCTTGGGGTGGG
ACTTCCAGAGATAGGGATGGGTTGGGGGCGGGGTAATTATTTTATTTAAAAAAATACCGAGCAGCAAAAGGGGAGAAGATCCCACTACTC
TCCCACCACCTGCCCTTTCTCTGAGGGACGTTTACCACGAGGCCTCAGGCTGGGGATGGAGAGAGTTGCTCTGGGAGTTGGGGTACCACC

>44102_44102_8_LARGE-CBX7_LARGE_chr22_34157357_ENST00000397394_CBX7_chr22_39545837_ENST00000401405_length(amino acids)=194AA_BP=
MGCREDSTRQPRISVSPCRFPASSQKWQSLIPAPFLCPEHEVLGLPHNTLLAVTAARQRETRKNRVARRRRRLRGAGGPGQGPGRRCPTR
SGGAKHRVRDCPRLQMPRRGSGSPKLSTVPSSSSSGAAAWSCSRPGGLFRARGAPHGQGPGSASGWVQGATAGADSTSRAPSRFVPSGQG

--------------------------------------------------------------
>44102_44102_9_LARGE-CBX7_LARGE_chr22_34157357_ENST00000397394_CBX7_chr22_39545837_ENST00000475962_length(transcript)=1073nt_BP=622nt
GGGGGCGCGGCCGCGCTCGTCTCGCCGGGCTGTTCGCGGGCAGGCCCTGCCCTGAAGGGACGAATCGGCTTGGAGCGCGGGAGGTGGAGT
CGGCCCCGGCGGTCGCTCCCTGGACCCAACCCGAGGCTGACCCAGGCCCCTGCCCATGCGGGGCGCCCCTGGCTCGGAAGAGTCCCCCGG
GCCGGGAGCAGCTCCAGGCAGCGGCCCCGGAGGAAGAGGAAGAAGGGACAGTGCTCAGCTTGGGGGACCCGGACCCTCGCCGCGGCATTT
GGAGCCGGGGGCAGTCCCGAACTCTGTGCTTGGCACCGCCGCTCCGAGTAGGGCAGCGCCTGCCGGGACCCTGACCCGGACCCCCTGCGC
CTCGTAGGCGGCGGCGCCGCCGCGCCACCCTGTTCTTCCGTGTCTCCCTCTGCCTGGCGGCAGTCACGGCCAAGAGAGTATTATGAGGGA
GGCCGAGGACTTCATGCTCCGGACAGAGAAACGGCGCTGGGATTAGGGATTGCCACTTCTGAGAGGATGCTGGGAATCTGCAGGGGGAGA
CGGAAATTCTTGGCTGCCTCGTTGAGTCTTCTCTGCATCCCAGCCATCACCTGGATTTACCTGTTTTCTGGGAGCTTCGAAGGGTAAAGT
CGAGTATCTGGTGAAGTGGAAAGGATGGCCCCCAAACATCGCACTACTGATGACAAGCAGGCAACTGCGTTGGAGAGGGAGGTCATCATG
GCTGCAAGGAAGGGACCGGACCCATATAATATACTACCCCTAAAGGCAGCTTCAGGCACCAAAGAAGACCCTAATTTAGTCCTCTCCATC
ACCAACAAGAGAAGAGTGGGCTGCATCTGCGTAGAGGACAACAGTACCATCACCTGCTTTGGGCTGCACAAAGGCGAGACCCAGTGATGC
CCCAGCTATGGAACCCATTACAAGCTGGTGCCCCGCCAGCTGACGTACTGAGCACCTGCACCAAGTTACCCAAAATGTGCTGCAAAGTTT

>44102_44102_9_LARGE-CBX7_LARGE_chr22_34157357_ENST00000397394_CBX7_chr22_39545837_ENST00000475962_length(amino acids)=194AA_BP=
MGCREDSTRQPRISVSPCRFPASSQKWQSLIPAPFLCPEHEVLGLPHNTLLAVTAARQRETRKNRVARRRRRLRGAGGPGQGPGRRCPTR
SGGAKHRVRDCPRLQMPRRGSGSPKLSTVPSSSSSGAAAWSCSRPGGLFRARGAPHGQGPGSASGWVQGATAGADSTSRAPSRFVPSGQG

--------------------------------------------------------------
>44102_44102_10_LARGE-CBX7_LARGE_chr22_34157357_ENST00000402320_CBX7_chr22_39545837_ENST00000216133_length(transcript)=4414nt_BP=608nt
GCTCGTCTCGCCGGGCTGTTCGCGGGCAGGCCCTGCCCTGAAGGGACGAATCGGCTTGGAGCGCGGGAGGTGGAGTCGGCCCCGGCGGTC
GCTCCCTGGACCCAACCCGAGGCTGACCCAGGCCCCTGCCCATGCGGGGCGCCCCTGGCTCGGAAGAGTCCCCCGGGCCGGGAGCAGCTC
CAGGCAGCGGCCCCGGAGGAAGAGGAAGAAGGGACAGTGCTCAGCTTGGGGGACCCGGACCCTCGCCGCGGCATTTGGAGCCGGGGGCAG
TCCCGAACTCTGTGCTTGGCACCGCCGCTCCGAGTAGGGCAGCGCCTGCCGGGACCCTGACCCGGACCCCCTGCGCCTCGTAGGCGGCGG
CGCCGCCGCGCCACCCTGTTCTTCCGTGTCTCCCTCTGCCTGGCGGCAGTCACGGCCAAGAGAGTATTATGAGGGAGGCCGAGGACTTCA
TGCTCCGGACAGAGAAACGGCGCTGGGATTAGGGATTGCCACTTCTGAGAGGATGCTGGGAATCTGCAGGGGGAGACGGAAATTCTTGGC
TGCCTCGTTGAGTCTTCTCTGCATCCCAGCCATCACCTGGATTTACCTGTTTTCTGGGAGCTTCGAAGGGTAAAGTCGAGTATCTGGTGA
AGTGGAAAGGATGGCCCCCAAAGTACAGCACGTGGGAGCCAGAAGAGCACATCTTGGACCCCCGCCTCGTCATGGCCTACGAGGAGAAGG
AGGAGAGAGACCGAGCATCGGGGTATAGGAAGAGAGGTCCGAAACCCAAGCGGCTTCTGCTGCAGCGGCTGTACAGCATGGACCTGCGGA
GCTCCCACAAGGCCAAGGGCAAGGAGAAGCTCTGCTTCTCCCTGACGTGCCCACTCGGCAGCGGGAGCCCTGAGGGGGTGGTCAAGGCGG
GGGCACCTGAGCTGGTGGACAAGGGCCCCTTGGTGCCCACCCTGCCCTTCCCGCTCCGCAAGCCCCGAAAGGCCCACAAGTACCTGCGGC
TCTCGCGCAAGAAGTTCCCGCCCCGCGGGCCCAACCTGGAGAGCCACAGCCATCGACGGGAGCTCTTCCTGCAGGAGCCACCGGCCCCAG
ACGTCCTGCAGGCGGCTGGCGAGTGGGAGCCTGCTGCGCAGCCCCCTGAAGAGGAGGCAGATGCCGACCTGGCCGAGGGGCCCCCTCCCT
GGACACCTGCGCTCCCCTCAAGTGAGGTGACCGTGACCGACATCACCGCCAACTCCATCACCGTCACCTTCCGCGAGGCCCAGGCAGCTG
AGGGCTTCTTCCGAGACCGCAGTGGGAAGTTCTGAATCACCGTTTTTACTCTTCTTAAACTGTTTTCTTTTGGGCTTGGGGTGGGACTTC
CAGAGATAGGGATGGGTTGGGGGCGGGGTAATTATTTTATTTAAAAAAATACCGAGCAGCAAAAGGGGAGAAGATCCCACTACTCTCCCA
CCACCTGCCCTTTCTCTGAGGGACGTTTACCACGAGGCCTCAGGCTGGGGATGGAGAGAGTTGCTCTGGGAGTTGGGGTACCACCCCCAG
GGCAGGATGGGGACAGGATCACCTGCCCGGGACACCACCATTATCATTCTCCTCTAGTGACGCAGCAGCTGGTTCTGGGAGTTAAAGGAG
CATTGGAAGGCCCAAACCCTCTCCCTTGAGTGGCCACCCCAGCCTGGTTGGCTGGTTTTCCCCTTTTCTCTTGTTTCAATTGGGTCTTTA
CCTTGAACTCTCCTCTCTGGCTTTGCGGTGGGCTGTGGAGGCTGGTTTTGACCAAAAGTGAGTGGGGCGGGAGGAAGGGGCAGGAGGAAG
GGTTGAGGTTACTTGGGGCGAGTCCCTTCCCCTTCAGAGAGGCTTCTATCCTTCCCAGGGAGGAGGCGCCGCTGAGACCCTTCTGCTGAG
AGCTCTGCCCTCCCCTCATCACCTGGCCTGTGCAGAAACGCTCATGCACACCTGGCTGCACAGGTGTGCACGCATTACCCTTCGCGTGTA
CGTTCCCATGTGCCCCGTGAAAGCATGTGTGGCTGCAGACGTGTCCACATGGGCCTTGCGAACCTGGGTTAGAAACCCTGGCCAGGCGAA
CGTGGGGTGATTCACAGCACAAAAGACCTCACCACCACACCTGCACTCACCCCACCTTGCATGCACCTTGCTACCTGCTTGCGGCTTTCA
GTGGAGGGCAGGGGTCTGGCACAGGTGCGATGGCACCCCATGCTCCAGGCATACAGATGTGGTTTCTCGGCTGCACCGGGCCAGGCTGCG
GGTGTGCAGGCGTCTGCTAAGTTGTGTGATGTATCAGCACAGGCTTTGAGACGTCTGGACCCTGTCCTTCCTCCCGTGAGGGGTTCTTGT
TCTTTCTGACTCAGGTGACTTTTCAGCCCTTCCAATTCCCCTCTTTTTCTGCCCTCCCCTCCAACTCAGCCAACCCAGGTGTGGGCAGTC
AGGGAGGGAGGGAGTGTCCCACCACGTTCTCAGGGCAGCCCTTGACTCCTAAGCCCCTTCCTCCTTCCATTCTGCATCCCCTCCCCATCC
AACCTAAATGCCACAGCTGGGGCTGAGCTGTATTCCTGTGGAGGGACCTCTGCCGTGCCTCTCTGAGGTCAGGCTGTGCTGTGTGATGGG
CAGGCTTTGCCCCAGCCCACCCCTGGCAAGGTGCACTTGTTTTCTGGTTTGTACAAGGTGTCCTGGGGGCCCGTGGCTTCCCTGCCAGTG
AGGAGTGACTTCTCCCTCTCTTCCAGTCCTGTAGGGGAGACAAAACCAGATTGGGGGGCCCAAGGGGAGCATGGAAAAGGCCGGCTCCCC
TGTCTTTCCTTGGCTGTCAGAGTCAGGGTAACACACACCAAGAGTGGAGTGCGGCCAGCAAGTTTGAGACCTGCCCGCCCTCCTCGCAGC
TCTGCTCTGTGTCCTCAGGAAGTCACAGAGTCTACTGAGGCAAGGAGAGGGTGATTCTTTCCCCAAATCCCTTCTTCCCTGGTTCCCAAA
CCAAAGACAGCCTGCAGCCCTTTCTGCATGGGGTGCTCTGTTGACAGGCTTCCCAGATCCCTGAGTCTCTCTTTCCTTCCTCCTCGATCT
TTAGTTGTCCACGGTCAATTCAGTGCTTCCATTGGGGGACAGTCCCCTCCGGGATGACCTGATTCACCTCCAGCCCAGGGAATGGAATCT
AGAGGAATACGTGGGGTGGGTCTGGACAAGGAGCGGCAGGAATCACCACCCATCTCCAGCTGTGGAGCCCTGTGGAGGGGAAGGGGAAGC
TTGGGGTTCAGAGGGGACTCTTCCAGGAGAGGGGTGCCCAGCGGAGGTAAAGATGATAGAGGGTTGTGGGGGGTCTCTAGTTGAATGTTT
TGGCCCATGACTTTGGAACATGGCTGGCAGCTTCCAGCAGAAGTCACGCTCCCCATCCCCCAGGGGACATAGGACCTTTTTCCTGCTTCC
TGGTCACTTTCAAAGAACTATTTGCGCAATCTGTGGGTCTGTGGATTCACGGGGCTTTCTGTGTGGGTGCTGCAGTTGCTTTTGTCTGCA
GCAGCAGGACACATCTTTCCTCTTACTCAGCCCTTTATGGCCCATGGGGAACTCCGTGGCTCAGGGAGAGCTGAACTCCAGGGGTGTGAC
CTGGGACGGGTGGGCCTGAGGTGCCCAGCTCAGGGCAGCCAGGTGGCTCATGGGCTGTAGTGAGCCAGCTCCCTGGGGGAAAAGGCTGTG
GGCCGTTAGGACCATCCTCCAGGACAGGTGACCTCTATGAGGTCACCTACGGCTGTGGCCGTGCAGGCCTCCTTCCAGCCCAGAGTGGCC
CAGTAGAGCAAGGCAGACAGTGACCTCCACCCCCGCAGCCCTCTTAAAAGGCCAGTACTCTTGGGGGTGGGGGGAGGGTTTAGAAAGCAT
TTGCCCATCTGCCTTTCTTTCCCCCAGCCCCCACCCGCTTTGAATGTAGAGACCCGTGGGCACTTTTCCTTTTGTGGTGGGGGGTGCGGA
GGAGGTACCCCCACCCCTGGCACAGCCGCCTGGAATGCAGGACTGTCACTGCTGTTCGGGTGATGACCTCGTTGCCAAGCTCCTCCTGTC
CCCTTGTTCTGGGGGCAGGCGCTGTGCTTCTGTGAGGTGGTTTAGCTTTTGCTTTCGAAGTGGCCAGCTGCGGCCACCAGGTCTCAGCAC
AAGAGCGCTTCCTTTGCACAGAATGAGCTTCGAGCTTTGTTCAGACTAAATGAATGTATCTGGGAGGGGTCGGGGGCACGAGTTGATTCC
AAGCACATGCCTTTGCTGAGTGTGTGTGTGCTGGGAGAGTCAGAGTGGATGTAGAGCGCGGTTTTATTTTTGTACTGACATTGGTAAGAG
ACTGTATAGCATCTATTTATTTAGATGATTTATCTGGTAAATGAGGCAAAAAAATTATTAAAAATACATTAAAGATGATTTAAAAAAAAG

>44102_44102_10_LARGE-CBX7_LARGE_chr22_34157357_ENST00000402320_CBX7_chr22_39545837_ENST00000216133_length(amino acids)=233AA_BP=3
MGASKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEEKEERDRASGYRKRGPKPKRLLLQRLYSMDLRSSHKAKGKEKLCFSLTCP
LGSGSPEGVVKAGAPELVDKGPLVPTLPFPLRKPRKAHKYLRLSRKKFPPRGPNLESHSHRRELFLQEPPAPDVLQAAGEWEPAAQPPEE

--------------------------------------------------------------
>44102_44102_11_LARGE-CBX7_LARGE_chr22_34157357_ENST00000402320_CBX7_chr22_39545837_ENST00000401405_length(transcript)=1323nt_BP=608nt
GCTCGTCTCGCCGGGCTGTTCGCGGGCAGGCCCTGCCCTGAAGGGACGAATCGGCTTGGAGCGCGGGAGGTGGAGTCGGCCCCGGCGGTC
GCTCCCTGGACCCAACCCGAGGCTGACCCAGGCCCCTGCCCATGCGGGGCGCCCCTGGCTCGGAAGAGTCCCCCGGGCCGGGAGCAGCTC
CAGGCAGCGGCCCCGGAGGAAGAGGAAGAAGGGACAGTGCTCAGCTTGGGGGACCCGGACCCTCGCCGCGGCATTTGGAGCCGGGGGCAG
TCCCGAACTCTGTGCTTGGCACCGCCGCTCCGAGTAGGGCAGCGCCTGCCGGGACCCTGACCCGGACCCCCTGCGCCTCGTAGGCGGCGG
CGCCGCCGCGCCACCCTGTTCTTCCGTGTCTCCCTCTGCCTGGCGGCAGTCACGGCCAAGAGAGTATTATGAGGGAGGCCGAGGACTTCA
TGCTCCGGACAGAGAAACGGCGCTGGGATTAGGGATTGCCACTTCTGAGAGGATGCTGGGAATCTGCAGGGGGAGACGGAAATTCTTGGC
TGCCTCGTTGAGTCTTCTCTGCATCCCAGCCATCACCTGGATTTACCTGTTTTCTGGGAGCTTCGAAGGGTAAAGTCGAGTATCTGGTGA
AGTGGAAAGGATGGCCCCCAAAGTACAGCACGTGGGAGCCAGAAGAGCACATCTTGGACCCCCGCCTCGTCATGGCCTACGAGGAGAAGG
AGGAGAGAGACCGAGCATCGGGGTATAGGAAGAGAGGTCCGAAACCCAAGCGGCTTCTGCTGCAGGAGCCACCGGCCCCAGACGTCCTGC
AGGCGGCTGGCGAGTGGGAGCCTGCTGCGCAGCCCCCTGAAGAGGAGGCAGATGCCGACCTGGCCGAGGGGCCCCCTCCCTGGACACCTG
CGCTCCCCTCAAGTGAGGTGACCGTGACCGACATCACCGCCAACTCCATCACCGTCACCTTCCGCGAGGCCCAGGCAGCTGAGGGCTTCT
TCCGAGACCGCAGTGGGAAGTTCTGAATCACCGTTTTTACTCTTCTTAAACTGTTTTCTTTTGGGCTTGGGGTGGGACTTCCAGAGATAG
GGATGGGTTGGGGGCGGGGTAATTATTTTATTTAAAAAAATACCGAGCAGCAAAAGGGGAGAAGATCCCACTACTCTCCCACCACCTGCC
CTTTCTCTGAGGGACGTTTACCACGAGGCCTCAGGCTGGGGATGGAGAGAGTTGCTCTGGGAGTTGGGGTACCACCCCCAGGGCAGGATG

>44102_44102_11_LARGE-CBX7_LARGE_chr22_34157357_ENST00000402320_CBX7_chr22_39545837_ENST00000401405_length(amino acids)=189AA_BP=
MGCREDSTRQPRISVSPCRFPASSQKWQSLIPAPFLCPEHEVLGLPHNTLLAVTAARQRETRKNRVARRRRRLRGAGGPGQGPGRRCPTR
SGGAKHRVRDCPRLQMPRRGSGSPKLSTVPSSSSSGAAAWSCSRPGGLFRARGAPHGQGPGSASGWVQGATAGADSTSRAPSRFVPSGQG

--------------------------------------------------------------
>44102_44102_12_LARGE-CBX7_LARGE_chr22_34157357_ENST00000402320_CBX7_chr22_39545837_ENST00000475962_length(transcript)=1059nt_BP=608nt
GCTCGTCTCGCCGGGCTGTTCGCGGGCAGGCCCTGCCCTGAAGGGACGAATCGGCTTGGAGCGCGGGAGGTGGAGTCGGCCCCGGCGGTC
GCTCCCTGGACCCAACCCGAGGCTGACCCAGGCCCCTGCCCATGCGGGGCGCCCCTGGCTCGGAAGAGTCCCCCGGGCCGGGAGCAGCTC
CAGGCAGCGGCCCCGGAGGAAGAGGAAGAAGGGACAGTGCTCAGCTTGGGGGACCCGGACCCTCGCCGCGGCATTTGGAGCCGGGGGCAG
TCCCGAACTCTGTGCTTGGCACCGCCGCTCCGAGTAGGGCAGCGCCTGCCGGGACCCTGACCCGGACCCCCTGCGCCTCGTAGGCGGCGG
CGCCGCCGCGCCACCCTGTTCTTCCGTGTCTCCCTCTGCCTGGCGGCAGTCACGGCCAAGAGAGTATTATGAGGGAGGCCGAGGACTTCA
TGCTCCGGACAGAGAAACGGCGCTGGGATTAGGGATTGCCACTTCTGAGAGGATGCTGGGAATCTGCAGGGGGAGACGGAAATTCTTGGC
TGCCTCGTTGAGTCTTCTCTGCATCCCAGCCATCACCTGGATTTACCTGTTTTCTGGGAGCTTCGAAGGGTAAAGTCGAGTATCTGGTGA
AGTGGAAAGGATGGCCCCCAAACATCGCACTACTGATGACAAGCAGGCAACTGCGTTGGAGAGGGAGGTCATCATGGCTGCAAGGAAGGG
ACCGGACCCATATAATATACTACCCCTAAAGGCAGCTTCAGGCACCAAAGAAGACCCTAATTTAGTCCTCTCCATCACCAACAAGAGAAG
AGTGGGCTGCATCTGCGTAGAGGACAACAGTACCATCACCTGCTTTGGGCTGCACAAAGGCGAGACCCAGTGATGCCCCAGCTATGGAAC
CCATTACAAGCTGGTGCCCCGCCAGCTGACGTACTGAGCACCTGCACCAAGTTACCCAAAATGTGCTGCAAAGTTTCTTCTTTCCAATAA

>44102_44102_12_LARGE-CBX7_LARGE_chr22_34157357_ENST00000402320_CBX7_chr22_39545837_ENST00000475962_length(amino acids)=189AA_BP=
MGCREDSTRQPRISVSPCRFPASSQKWQSLIPAPFLCPEHEVLGLPHNTLLAVTAARQRETRKNRVARRRRRLRGAGGPGQGPGRRCPTR
SGGAKHRVRDCPRLQMPRRGSGSPKLSTVPSSSSSGAAAWSCSRPGGLFRARGAPHGQGPGSASGWVQGATAGADSTSRAPSRFVPSGQG

--------------------------------------------------------------
>44102_44102_13_LARGE-CBX7_LARGE_chr22_34157358_ENST00000337431_CBX7_chr22_39545837_ENST00000216133_length(transcript)=4401nt_BP=595nt
CGGCCCCGGCGGTCGCTCCCTGGACCCAACCCGAGGCTGACCCAGGCCCCTGCCCATGCGGGGCGCCCCTGGCTCGGAAGAGTCCCCCGG
GCCGGGAGCAGCTCCAGGCAGCGGCCCCGGAGGAAGAGGAAGAAGGGACAGTGCTCAGCTTGGGGGACCCGGACCCTCGCCGCGGCATTT
GGAGCCGGGGGCAGTCCCGAACTCTGTGCTTGGCACCGCCGCTCCGAGTAGGGCAGCGCCTGCCGGGACCCTGACCCGGACCCCCTGCGC
CTCGTAGGCGGCGGCGCCGCCGCGCCACCCTGTTCTTCCGTGTCTCCCTCTGCCTGGCGGCAGTCACGGCCAAGAACCTACATTAAGAAA
CGCATTCAGATGCATCAGCCGTTTGTAAAGCACCTGTCGTGAGCCAGAGAGTATTATGAGGGAGGCCGAGGACTTCATGCTCCGGACAGA
GAAACGGCGCTGGGATTAGGGATTGCCACTTCTGAGAGGATGCTGGGAATCTGCAGGGGGAGACGGAAATTCTTGGCTGCCTCGTTGAGT
CTTCTCTGCATCCCAGCCATCACCTGGATTTACCTGTTTTCTGGGAGCTTCGAAGGGTAAAGTCGAGTATCTGGTGAAGTGGAAAGGATG
GCCCCCAAAGTACAGCACGTGGGAGCCAGAAGAGCACATCTTGGACCCCCGCCTCGTCATGGCCTACGAGGAGAAGGAGGAGAGAGACCG
AGCATCGGGGTATAGGAAGAGAGGTCCGAAACCCAAGCGGCTTCTGCTGCAGCGGCTGTACAGCATGGACCTGCGGAGCTCCCACAAGGC
CAAGGGCAAGGAGAAGCTCTGCTTCTCCCTGACGTGCCCACTCGGCAGCGGGAGCCCTGAGGGGGTGGTCAAGGCGGGGGCACCTGAGCT
GGTGGACAAGGGCCCCTTGGTGCCCACCCTGCCCTTCCCGCTCCGCAAGCCCCGAAAGGCCCACAAGTACCTGCGGCTCTCGCGCAAGAA
GTTCCCGCCCCGCGGGCCCAACCTGGAGAGCCACAGCCATCGACGGGAGCTCTTCCTGCAGGAGCCACCGGCCCCAGACGTCCTGCAGGC
GGCTGGCGAGTGGGAGCCTGCTGCGCAGCCCCCTGAAGAGGAGGCAGATGCCGACCTGGCCGAGGGGCCCCCTCCCTGGACACCTGCGCT
CCCCTCAAGTGAGGTGACCGTGACCGACATCACCGCCAACTCCATCACCGTCACCTTCCGCGAGGCCCAGGCAGCTGAGGGCTTCTTCCG
AGACCGCAGTGGGAAGTTCTGAATCACCGTTTTTACTCTTCTTAAACTGTTTTCTTTTGGGCTTGGGGTGGGACTTCCAGAGATAGGGAT
GGGTTGGGGGCGGGGTAATTATTTTATTTAAAAAAATACCGAGCAGCAAAAGGGGAGAAGATCCCACTACTCTCCCACCACCTGCCCTTT
CTCTGAGGGACGTTTACCACGAGGCCTCAGGCTGGGGATGGAGAGAGTTGCTCTGGGAGTTGGGGTACCACCCCCAGGGCAGGATGGGGA
CAGGATCACCTGCCCGGGACACCACCATTATCATTCTCCTCTAGTGACGCAGCAGCTGGTTCTGGGAGTTAAAGGAGCATTGGAAGGCCC
AAACCCTCTCCCTTGAGTGGCCACCCCAGCCTGGTTGGCTGGTTTTCCCCTTTTCTCTTGTTTCAATTGGGTCTTTACCTTGAACTCTCC
TCTCTGGCTTTGCGGTGGGCTGTGGAGGCTGGTTTTGACCAAAAGTGAGTGGGGCGGGAGGAAGGGGCAGGAGGAAGGGTTGAGGTTACT
TGGGGCGAGTCCCTTCCCCTTCAGAGAGGCTTCTATCCTTCCCAGGGAGGAGGCGCCGCTGAGACCCTTCTGCTGAGAGCTCTGCCCTCC
CCTCATCACCTGGCCTGTGCAGAAACGCTCATGCACACCTGGCTGCACAGGTGTGCACGCATTACCCTTCGCGTGTACGTTCCCATGTGC
CCCGTGAAAGCATGTGTGGCTGCAGACGTGTCCACATGGGCCTTGCGAACCTGGGTTAGAAACCCTGGCCAGGCGAACGTGGGGTGATTC
ACAGCACAAAAGACCTCACCACCACACCTGCACTCACCCCACCTTGCATGCACCTTGCTACCTGCTTGCGGCTTTCAGTGGAGGGCAGGG
GTCTGGCACAGGTGCGATGGCACCCCATGCTCCAGGCATACAGATGTGGTTTCTCGGCTGCACCGGGCCAGGCTGCGGGTGTGCAGGCGT
CTGCTAAGTTGTGTGATGTATCAGCACAGGCTTTGAGACGTCTGGACCCTGTCCTTCCTCCCGTGAGGGGTTCTTGTTCTTTCTGACTCA
GGTGACTTTTCAGCCCTTCCAATTCCCCTCTTTTTCTGCCCTCCCCTCCAACTCAGCCAACCCAGGTGTGGGCAGTCAGGGAGGGAGGGA
GTGTCCCACCACGTTCTCAGGGCAGCCCTTGACTCCTAAGCCCCTTCCTCCTTCCATTCTGCATCCCCTCCCCATCCAACCTAAATGCCA
CAGCTGGGGCTGAGCTGTATTCCTGTGGAGGGACCTCTGCCGTGCCTCTCTGAGGTCAGGCTGTGCTGTGTGATGGGCAGGCTTTGCCCC
AGCCCACCCCTGGCAAGGTGCACTTGTTTTCTGGTTTGTACAAGGTGTCCTGGGGGCCCGTGGCTTCCCTGCCAGTGAGGAGTGACTTCT
CCCTCTCTTCCAGTCCTGTAGGGGAGACAAAACCAGATTGGGGGGCCCAAGGGGAGCATGGAAAAGGCCGGCTCCCCTGTCTTTCCTTGG
CTGTCAGAGTCAGGGTAACACACACCAAGAGTGGAGTGCGGCCAGCAAGTTTGAGACCTGCCCGCCCTCCTCGCAGCTCTGCTCTGTGTC
CTCAGGAAGTCACAGAGTCTACTGAGGCAAGGAGAGGGTGATTCTTTCCCCAAATCCCTTCTTCCCTGGTTCCCAAACCAAAGACAGCCT
GCAGCCCTTTCTGCATGGGGTGCTCTGTTGACAGGCTTCCCAGATCCCTGAGTCTCTCTTTCCTTCCTCCTCGATCTTTAGTTGTCCACG
GTCAATTCAGTGCTTCCATTGGGGGACAGTCCCCTCCGGGATGACCTGATTCACCTCCAGCCCAGGGAATGGAATCTAGAGGAATACGTG
GGGTGGGTCTGGACAAGGAGCGGCAGGAATCACCACCCATCTCCAGCTGTGGAGCCCTGTGGAGGGGAAGGGGAAGCTTGGGGTTCAGAG
GGGACTCTTCCAGGAGAGGGGTGCCCAGCGGAGGTAAAGATGATAGAGGGTTGTGGGGGGTCTCTAGTTGAATGTTTTGGCCCATGACTT
TGGAACATGGCTGGCAGCTTCCAGCAGAAGTCACGCTCCCCATCCCCCAGGGGACATAGGACCTTTTTCCTGCTTCCTGGTCACTTTCAA
AGAACTATTTGCGCAATCTGTGGGTCTGTGGATTCACGGGGCTTTCTGTGTGGGTGCTGCAGTTGCTTTTGTCTGCAGCAGCAGGACACA
TCTTTCCTCTTACTCAGCCCTTTATGGCCCATGGGGAACTCCGTGGCTCAGGGAGAGCTGAACTCCAGGGGTGTGACCTGGGACGGGTGG
GCCTGAGGTGCCCAGCTCAGGGCAGCCAGGTGGCTCATGGGCTGTAGTGAGCCAGCTCCCTGGGGGAAAAGGCTGTGGGCCGTTAGGACC
ATCCTCCAGGACAGGTGACCTCTATGAGGTCACCTACGGCTGTGGCCGTGCAGGCCTCCTTCCAGCCCAGAGTGGCCCAGTAGAGCAAGG
CAGACAGTGACCTCCACCCCCGCAGCCCTCTTAAAAGGCCAGTACTCTTGGGGGTGGGGGGAGGGTTTAGAAAGCATTTGCCCATCTGCC
TTTCTTTCCCCCAGCCCCCACCCGCTTTGAATGTAGAGACCCGTGGGCACTTTTCCTTTTGTGGTGGGGGGTGCGGAGGAGGTACCCCCA
CCCCTGGCACAGCCGCCTGGAATGCAGGACTGTCACTGCTGTTCGGGTGATGACCTCGTTGCCAAGCTCCTCCTGTCCCCTTGTTCTGGG
GGCAGGCGCTGTGCTTCTGTGAGGTGGTTTAGCTTTTGCTTTCGAAGTGGCCAGCTGCGGCCACCAGGTCTCAGCACAAGAGCGCTTCCT
TTGCACAGAATGAGCTTCGAGCTTTGTTCAGACTAAATGAATGTATCTGGGAGGGGTCGGGGGCACGAGTTGATTCCAAGCACATGCCTT
TGCTGAGTGTGTGTGTGCTGGGAGAGTCAGAGTGGATGTAGAGCGCGGTTTTATTTTTGTACTGACATTGGTAAGAGACTGTATAGCATC

>44102_44102_13_LARGE-CBX7_LARGE_chr22_34157358_ENST00000337431_CBX7_chr22_39545837_ENST00000216133_length(amino acids)=233AA_BP=3
MGASKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEEKEERDRASGYRKRGPKPKRLLLQRLYSMDLRSSHKAKGKEKLCFSLTCP
LGSGSPEGVVKAGAPELVDKGPLVPTLPFPLRKPRKAHKYLRLSRKKFPPRGPNLESHSHRRELFLQEPPAPDVLQAAGEWEPAAQPPEE

--------------------------------------------------------------
>44102_44102_14_LARGE-CBX7_LARGE_chr22_34157358_ENST00000337431_CBX7_chr22_39545837_ENST00000401405_length(transcript)=1310nt_BP=595nt
CGGCCCCGGCGGTCGCTCCCTGGACCCAACCCGAGGCTGACCCAGGCCCCTGCCCATGCGGGGCGCCCCTGGCTCGGAAGAGTCCCCCGG
GCCGGGAGCAGCTCCAGGCAGCGGCCCCGGAGGAAGAGGAAGAAGGGACAGTGCTCAGCTTGGGGGACCCGGACCCTCGCCGCGGCATTT
GGAGCCGGGGGCAGTCCCGAACTCTGTGCTTGGCACCGCCGCTCCGAGTAGGGCAGCGCCTGCCGGGACCCTGACCCGGACCCCCTGCGC
CTCGTAGGCGGCGGCGCCGCCGCGCCACCCTGTTCTTCCGTGTCTCCCTCTGCCTGGCGGCAGTCACGGCCAAGAACCTACATTAAGAAA
CGCATTCAGATGCATCAGCCGTTTGTAAAGCACCTGTCGTGAGCCAGAGAGTATTATGAGGGAGGCCGAGGACTTCATGCTCCGGACAGA
GAAACGGCGCTGGGATTAGGGATTGCCACTTCTGAGAGGATGCTGGGAATCTGCAGGGGGAGACGGAAATTCTTGGCTGCCTCGTTGAGT
CTTCTCTGCATCCCAGCCATCACCTGGATTTACCTGTTTTCTGGGAGCTTCGAAGGGTAAAGTCGAGTATCTGGTGAAGTGGAAAGGATG
GCCCCCAAAGTACAGCACGTGGGAGCCAGAAGAGCACATCTTGGACCCCCGCCTCGTCATGGCCTACGAGGAGAAGGAGGAGAGAGACCG
AGCATCGGGGTATAGGAAGAGAGGTCCGAAACCCAAGCGGCTTCTGCTGCAGGAGCCACCGGCCCCAGACGTCCTGCAGGCGGCTGGCGA
GTGGGAGCCTGCTGCGCAGCCCCCTGAAGAGGAGGCAGATGCCGACCTGGCCGAGGGGCCCCCTCCCTGGACACCTGCGCTCCCCTCAAG
TGAGGTGACCGTGACCGACATCACCGCCAACTCCATCACCGTCACCTTCCGCGAGGCCCAGGCAGCTGAGGGCTTCTTCCGAGACCGCAG
TGGGAAGTTCTGAATCACCGTTTTTACTCTTCTTAAACTGTTTTCTTTTGGGCTTGGGGTGGGACTTCCAGAGATAGGGATGGGTTGGGG
GCGGGGTAATTATTTTATTTAAAAAAATACCGAGCAGCAAAAGGGGAGAAGATCCCACTACTCTCCCACCACCTGCCCTTTCTCTGAGGG
ACGTTTACCACGAGGCCTCAGGCTGGGGATGGAGAGAGTTGCTCTGGGAGTTGGGGTACCACCCCCAGGGCAGGATGGGGACAGGATCAC

>44102_44102_14_LARGE-CBX7_LARGE_chr22_34157358_ENST00000337431_CBX7_chr22_39545837_ENST00000401405_length(amino acids)=140AA_BP=3
MGASKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEEKEERDRASGYRKRGPKPKRLLLQEPPAPDVLQAAGEWEPAAQPPEEEAD

--------------------------------------------------------------
>44102_44102_15_LARGE-CBX7_LARGE_chr22_34157358_ENST00000337431_CBX7_chr22_39545837_ENST00000475962_length(transcript)=1046nt_BP=595nt
CGGCCCCGGCGGTCGCTCCCTGGACCCAACCCGAGGCTGACCCAGGCCCCTGCCCATGCGGGGCGCCCCTGGCTCGGAAGAGTCCCCCGG
GCCGGGAGCAGCTCCAGGCAGCGGCCCCGGAGGAAGAGGAAGAAGGGACAGTGCTCAGCTTGGGGGACCCGGACCCTCGCCGCGGCATTT
GGAGCCGGGGGCAGTCCCGAACTCTGTGCTTGGCACCGCCGCTCCGAGTAGGGCAGCGCCTGCCGGGACCCTGACCCGGACCCCCTGCGC
CTCGTAGGCGGCGGCGCCGCCGCGCCACCCTGTTCTTCCGTGTCTCCCTCTGCCTGGCGGCAGTCACGGCCAAGAACCTACATTAAGAAA
CGCATTCAGATGCATCAGCCGTTTGTAAAGCACCTGTCGTGAGCCAGAGAGTATTATGAGGGAGGCCGAGGACTTCATGCTCCGGACAGA
GAAACGGCGCTGGGATTAGGGATTGCCACTTCTGAGAGGATGCTGGGAATCTGCAGGGGGAGACGGAAATTCTTGGCTGCCTCGTTGAGT
CTTCTCTGCATCCCAGCCATCACCTGGATTTACCTGTTTTCTGGGAGCTTCGAAGGGTAAAGTCGAGTATCTGGTGAAGTGGAAAGGATG
GCCCCCAAACATCGCACTACTGATGACAAGCAGGCAACTGCGTTGGAGAGGGAGGTCATCATGGCTGCAAGGAAGGGACCGGACCCATAT
AATATACTACCCCTAAAGGCAGCTTCAGGCACCAAAGAAGACCCTAATTTAGTCCTCTCCATCACCAACAAGAGAAGAGTGGGCTGCATC
TGCGTAGAGGACAACAGTACCATCACCTGCTTTGGGCTGCACAAAGGCGAGACCCAGTGATGCCCCAGCTATGGAACCCATTACAAGCTG
GTGCCCCGCCAGCTGACGTACTGAGCACCTGCACCAAGTTACCCAAAATGTGCTGCAAAGTTTCTTCTTTCCAATAAAGAGTAGCCATTG

>44102_44102_15_LARGE-CBX7_LARGE_chr22_34157358_ENST00000337431_CBX7_chr22_39545837_ENST00000475962_length(amino acids)=114AA_BP=
MAVTAARQRETRKNRVARRRRRLRGAGGPGQGPGRRCPTRSGGAKHRVRDCPRLQMPRRGSGSPKLSTVPSSSSSGAAAWSCSRPGGLFR

--------------------------------------------------------------
>44102_44102_16_LARGE-CBX7_LARGE_chr22_34157358_ENST00000354992_CBX7_chr22_39545837_ENST00000216133_length(transcript)=4484nt_BP=678nt
CGGCCGCGCTCGTCTCGCCGGGCTGTTCGCGGGCAGGCCCTGCCCTGAAGGGACGAATCGGCTTGGAGCGCGGGAGGTGGAGTCGGCCCC
GGCGGTCGCTCCCTGGACCCAACCCGAGGCTGACCCAGGCCCCTGCCCATGCGGGGCGCCCCTGGCTCGGAAGAGTCCCCCGGGCCGGGA
GCAGCTCCAGGCAGCGGCCCCGGAGGAAGAGGAAGAAGGGACAGTGCTCAGCTTGGGGGACCCGGACCCTCGCCGCGGCATTTGGAGCCG
GGGGCAGTCCCGAACTCTGTGCTTGGCACCGCCGCTCCGAGTAGGGCAGCGCCTGCCGGGACCCTGACCCGGACCCCCTGCGCCTCGTAG
GCGGCGGCGCCGCCGCGCCACCCTGTTCTTCCGTGTCTCCCTCTGCCTGGCGGCAGTCACGGCCAAGAACCTACATTAAGAAACGCATTC
AGATGCATCAGCCGTTTGTAAAGCACCTGTCGTGAGCCAGAGAGTATTATGAGGGAGGCCGAGGACTTCATGCTCCGGACAGAGAAACGG
CGCTGGGATTAGGGATTGCCACTTCTGAGAGGATGCTGGGAATCTGCAGGGGGAGACGGAAATTCTTGGCTGCCTCGTTGAGTCTTCTCT
GCATCCCAGCCATCACCTGGATTTACCTGTTTTCTGGGAGCTTCGAAGGGTAAAGTCGAGTATCTGGTGAAGTGGAAAGGATGGCCCCCA
AAGTACAGCACGTGGGAGCCAGAAGAGCACATCTTGGACCCCCGCCTCGTCATGGCCTACGAGGAGAAGGAGGAGAGAGACCGAGCATCG
GGGTATAGGAAGAGAGGTCCGAAACCCAAGCGGCTTCTGCTGCAGCGGCTGTACAGCATGGACCTGCGGAGCTCCCACAAGGCCAAGGGC
AAGGAGAAGCTCTGCTTCTCCCTGACGTGCCCACTCGGCAGCGGGAGCCCTGAGGGGGTGGTCAAGGCGGGGGCACCTGAGCTGGTGGAC
AAGGGCCCCTTGGTGCCCACCCTGCCCTTCCCGCTCCGCAAGCCCCGAAAGGCCCACAAGTACCTGCGGCTCTCGCGCAAGAAGTTCCCG
CCCCGCGGGCCCAACCTGGAGAGCCACAGCCATCGACGGGAGCTCTTCCTGCAGGAGCCACCGGCCCCAGACGTCCTGCAGGCGGCTGGC
GAGTGGGAGCCTGCTGCGCAGCCCCCTGAAGAGGAGGCAGATGCCGACCTGGCCGAGGGGCCCCCTCCCTGGACACCTGCGCTCCCCTCA
AGTGAGGTGACCGTGACCGACATCACCGCCAACTCCATCACCGTCACCTTCCGCGAGGCCCAGGCAGCTGAGGGCTTCTTCCGAGACCGC
AGTGGGAAGTTCTGAATCACCGTTTTTACTCTTCTTAAACTGTTTTCTTTTGGGCTTGGGGTGGGACTTCCAGAGATAGGGATGGGTTGG
GGGCGGGGTAATTATTTTATTTAAAAAAATACCGAGCAGCAAAAGGGGAGAAGATCCCACTACTCTCCCACCACCTGCCCTTTCTCTGAG
GGACGTTTACCACGAGGCCTCAGGCTGGGGATGGAGAGAGTTGCTCTGGGAGTTGGGGTACCACCCCCAGGGCAGGATGGGGACAGGATC
ACCTGCCCGGGACACCACCATTATCATTCTCCTCTAGTGACGCAGCAGCTGGTTCTGGGAGTTAAAGGAGCATTGGAAGGCCCAAACCCT
CTCCCTTGAGTGGCCACCCCAGCCTGGTTGGCTGGTTTTCCCCTTTTCTCTTGTTTCAATTGGGTCTTTACCTTGAACTCTCCTCTCTGG
CTTTGCGGTGGGCTGTGGAGGCTGGTTTTGACCAAAAGTGAGTGGGGCGGGAGGAAGGGGCAGGAGGAAGGGTTGAGGTTACTTGGGGCG
AGTCCCTTCCCCTTCAGAGAGGCTTCTATCCTTCCCAGGGAGGAGGCGCCGCTGAGACCCTTCTGCTGAGAGCTCTGCCCTCCCCTCATC
ACCTGGCCTGTGCAGAAACGCTCATGCACACCTGGCTGCACAGGTGTGCACGCATTACCCTTCGCGTGTACGTTCCCATGTGCCCCGTGA
AAGCATGTGTGGCTGCAGACGTGTCCACATGGGCCTTGCGAACCTGGGTTAGAAACCCTGGCCAGGCGAACGTGGGGTGATTCACAGCAC
AAAAGACCTCACCACCACACCTGCACTCACCCCACCTTGCATGCACCTTGCTACCTGCTTGCGGCTTTCAGTGGAGGGCAGGGGTCTGGC
ACAGGTGCGATGGCACCCCATGCTCCAGGCATACAGATGTGGTTTCTCGGCTGCACCGGGCCAGGCTGCGGGTGTGCAGGCGTCTGCTAA
GTTGTGTGATGTATCAGCACAGGCTTTGAGACGTCTGGACCCTGTCCTTCCTCCCGTGAGGGGTTCTTGTTCTTTCTGACTCAGGTGACT
TTTCAGCCCTTCCAATTCCCCTCTTTTTCTGCCCTCCCCTCCAACTCAGCCAACCCAGGTGTGGGCAGTCAGGGAGGGAGGGAGTGTCCC
ACCACGTTCTCAGGGCAGCCCTTGACTCCTAAGCCCCTTCCTCCTTCCATTCTGCATCCCCTCCCCATCCAACCTAAATGCCACAGCTGG
GGCTGAGCTGTATTCCTGTGGAGGGACCTCTGCCGTGCCTCTCTGAGGTCAGGCTGTGCTGTGTGATGGGCAGGCTTTGCCCCAGCCCAC
CCCTGGCAAGGTGCACTTGTTTTCTGGTTTGTACAAGGTGTCCTGGGGGCCCGTGGCTTCCCTGCCAGTGAGGAGTGACTTCTCCCTCTC
TTCCAGTCCTGTAGGGGAGACAAAACCAGATTGGGGGGCCCAAGGGGAGCATGGAAAAGGCCGGCTCCCCTGTCTTTCCTTGGCTGTCAG
AGTCAGGGTAACACACACCAAGAGTGGAGTGCGGCCAGCAAGTTTGAGACCTGCCCGCCCTCCTCGCAGCTCTGCTCTGTGTCCTCAGGA
AGTCACAGAGTCTACTGAGGCAAGGAGAGGGTGATTCTTTCCCCAAATCCCTTCTTCCCTGGTTCCCAAACCAAAGACAGCCTGCAGCCC
TTTCTGCATGGGGTGCTCTGTTGACAGGCTTCCCAGATCCCTGAGTCTCTCTTTCCTTCCTCCTCGATCTTTAGTTGTCCACGGTCAATT
CAGTGCTTCCATTGGGGGACAGTCCCCTCCGGGATGACCTGATTCACCTCCAGCCCAGGGAATGGAATCTAGAGGAATACGTGGGGTGGG
TCTGGACAAGGAGCGGCAGGAATCACCACCCATCTCCAGCTGTGGAGCCCTGTGGAGGGGAAGGGGAAGCTTGGGGTTCAGAGGGGACTC
TTCCAGGAGAGGGGTGCCCAGCGGAGGTAAAGATGATAGAGGGTTGTGGGGGGTCTCTAGTTGAATGTTTTGGCCCATGACTTTGGAACA
TGGCTGGCAGCTTCCAGCAGAAGTCACGCTCCCCATCCCCCAGGGGACATAGGACCTTTTTCCTGCTTCCTGGTCACTTTCAAAGAACTA
TTTGCGCAATCTGTGGGTCTGTGGATTCACGGGGCTTTCTGTGTGGGTGCTGCAGTTGCTTTTGTCTGCAGCAGCAGGACACATCTTTCC
TCTTACTCAGCCCTTTATGGCCCATGGGGAACTCCGTGGCTCAGGGAGAGCTGAACTCCAGGGGTGTGACCTGGGACGGGTGGGCCTGAG
GTGCCCAGCTCAGGGCAGCCAGGTGGCTCATGGGCTGTAGTGAGCCAGCTCCCTGGGGGAAAAGGCTGTGGGCCGTTAGGACCATCCTCC
AGGACAGGTGACCTCTATGAGGTCACCTACGGCTGTGGCCGTGCAGGCCTCCTTCCAGCCCAGAGTGGCCCAGTAGAGCAAGGCAGACAG
TGACCTCCACCCCCGCAGCCCTCTTAAAAGGCCAGTACTCTTGGGGGTGGGGGGAGGGTTTAGAAAGCATTTGCCCATCTGCCTTTCTTT
CCCCCAGCCCCCACCCGCTTTGAATGTAGAGACCCGTGGGCACTTTTCCTTTTGTGGTGGGGGGTGCGGAGGAGGTACCCCCACCCCTGG
CACAGCCGCCTGGAATGCAGGACTGTCACTGCTGTTCGGGTGATGACCTCGTTGCCAAGCTCCTCCTGTCCCCTTGTTCTGGGGGCAGGC
GCTGTGCTTCTGTGAGGTGGTTTAGCTTTTGCTTTCGAAGTGGCCAGCTGCGGCCACCAGGTCTCAGCACAAGAGCGCTTCCTTTGCACA
GAATGAGCTTCGAGCTTTGTTCAGACTAAATGAATGTATCTGGGAGGGGTCGGGGGCACGAGTTGATTCCAAGCACATGCCTTTGCTGAG
TGTGTGTGTGCTGGGAGAGTCAGAGTGGATGTAGAGCGCGGTTTTATTTTTGTACTGACATTGGTAAGAGACTGTATAGCATCTATTTAT

>44102_44102_16_LARGE-CBX7_LARGE_chr22_34157358_ENST00000354992_CBX7_chr22_39545837_ENST00000216133_length(amino acids)=233AA_BP=3
MGASKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEEKEERDRASGYRKRGPKPKRLLLQRLYSMDLRSSHKAKGKEKLCFSLTCP
LGSGSPEGVVKAGAPELVDKGPLVPTLPFPLRKPRKAHKYLRLSRKKFPPRGPNLESHSHRRELFLQEPPAPDVLQAAGEWEPAAQPPEE

--------------------------------------------------------------
>44102_44102_17_LARGE-CBX7_LARGE_chr22_34157358_ENST00000354992_CBX7_chr22_39545837_ENST00000401405_length(transcript)=1393nt_BP=678nt
CGGCCGCGCTCGTCTCGCCGGGCTGTTCGCGGGCAGGCCCTGCCCTGAAGGGACGAATCGGCTTGGAGCGCGGGAGGTGGAGTCGGCCCC
GGCGGTCGCTCCCTGGACCCAACCCGAGGCTGACCCAGGCCCCTGCCCATGCGGGGCGCCCCTGGCTCGGAAGAGTCCCCCGGGCCGGGA
GCAGCTCCAGGCAGCGGCCCCGGAGGAAGAGGAAGAAGGGACAGTGCTCAGCTTGGGGGACCCGGACCCTCGCCGCGGCATTTGGAGCCG
GGGGCAGTCCCGAACTCTGTGCTTGGCACCGCCGCTCCGAGTAGGGCAGCGCCTGCCGGGACCCTGACCCGGACCCCCTGCGCCTCGTAG
GCGGCGGCGCCGCCGCGCCACCCTGTTCTTCCGTGTCTCCCTCTGCCTGGCGGCAGTCACGGCCAAGAACCTACATTAAGAAACGCATTC
AGATGCATCAGCCGTTTGTAAAGCACCTGTCGTGAGCCAGAGAGTATTATGAGGGAGGCCGAGGACTTCATGCTCCGGACAGAGAAACGG
CGCTGGGATTAGGGATTGCCACTTCTGAGAGGATGCTGGGAATCTGCAGGGGGAGACGGAAATTCTTGGCTGCCTCGTTGAGTCTTCTCT
GCATCCCAGCCATCACCTGGATTTACCTGTTTTCTGGGAGCTTCGAAGGGTAAAGTCGAGTATCTGGTGAAGTGGAAAGGATGGCCCCCA
AAGTACAGCACGTGGGAGCCAGAAGAGCACATCTTGGACCCCCGCCTCGTCATGGCCTACGAGGAGAAGGAGGAGAGAGACCGAGCATCG
GGGTATAGGAAGAGAGGTCCGAAACCCAAGCGGCTTCTGCTGCAGGAGCCACCGGCCCCAGACGTCCTGCAGGCGGCTGGCGAGTGGGAG
CCTGCTGCGCAGCCCCCTGAAGAGGAGGCAGATGCCGACCTGGCCGAGGGGCCCCCTCCCTGGACACCTGCGCTCCCCTCAAGTGAGGTG
ACCGTGACCGACATCACCGCCAACTCCATCACCGTCACCTTCCGCGAGGCCCAGGCAGCTGAGGGCTTCTTCCGAGACCGCAGTGGGAAG
TTCTGAATCACCGTTTTTACTCTTCTTAAACTGTTTTCTTTTGGGCTTGGGGTGGGACTTCCAGAGATAGGGATGGGTTGGGGGCGGGGT
AATTATTTTATTTAAAAAAATACCGAGCAGCAAAAGGGGAGAAGATCCCACTACTCTCCCACCACCTGCCCTTTCTCTGAGGGACGTTTA
CCACGAGGCCTCAGGCTGGGGATGGAGAGAGTTGCTCTGGGAGTTGGGGTACCACCCCCAGGGCAGGATGGGGACAGGATCACCTGCCCG

>44102_44102_17_LARGE-CBX7_LARGE_chr22_34157358_ENST00000354992_CBX7_chr22_39545837_ENST00000401405_length(amino acids)=142AA_BP=
MAVTAARQRETRKNRVARRRRRLRGAGGPGQGPGRRCPTRSGGAKHRVRDCPRLQMPRRGSGSPKLSTVPSSSSSGAAAWSCSRPGGLFR

--------------------------------------------------------------
>44102_44102_18_LARGE-CBX7_LARGE_chr22_34157358_ENST00000354992_CBX7_chr22_39545837_ENST00000475962_length(transcript)=1129nt_BP=678nt
CGGCCGCGCTCGTCTCGCCGGGCTGTTCGCGGGCAGGCCCTGCCCTGAAGGGACGAATCGGCTTGGAGCGCGGGAGGTGGAGTCGGCCCC
GGCGGTCGCTCCCTGGACCCAACCCGAGGCTGACCCAGGCCCCTGCCCATGCGGGGCGCCCCTGGCTCGGAAGAGTCCCCCGGGCCGGGA
GCAGCTCCAGGCAGCGGCCCCGGAGGAAGAGGAAGAAGGGACAGTGCTCAGCTTGGGGGACCCGGACCCTCGCCGCGGCATTTGGAGCCG
GGGGCAGTCCCGAACTCTGTGCTTGGCACCGCCGCTCCGAGTAGGGCAGCGCCTGCCGGGACCCTGACCCGGACCCCCTGCGCCTCGTAG
GCGGCGGCGCCGCCGCGCCACCCTGTTCTTCCGTGTCTCCCTCTGCCTGGCGGCAGTCACGGCCAAGAACCTACATTAAGAAACGCATTC
AGATGCATCAGCCGTTTGTAAAGCACCTGTCGTGAGCCAGAGAGTATTATGAGGGAGGCCGAGGACTTCATGCTCCGGACAGAGAAACGG
CGCTGGGATTAGGGATTGCCACTTCTGAGAGGATGCTGGGAATCTGCAGGGGGAGACGGAAATTCTTGGCTGCCTCGTTGAGTCTTCTCT
GCATCCCAGCCATCACCTGGATTTACCTGTTTTCTGGGAGCTTCGAAGGGTAAAGTCGAGTATCTGGTGAAGTGGAAAGGATGGCCCCCA
AACATCGCACTACTGATGACAAGCAGGCAACTGCGTTGGAGAGGGAGGTCATCATGGCTGCAAGGAAGGGACCGGACCCATATAATATAC
TACCCCTAAAGGCAGCTTCAGGCACCAAAGAAGACCCTAATTTAGTCCTCTCCATCACCAACAAGAGAAGAGTGGGCTGCATCTGCGTAG
AGGACAACAGTACCATCACCTGCTTTGGGCTGCACAAAGGCGAGACCCAGTGATGCCCCAGCTATGGAACCCATTACAAGCTGGTGCCCC
GCCAGCTGACGTACTGAGCACCTGCACCAAGTTACCCAAAATGTGCTGCAAAGTTTCTTCTTTCCAATAAAGAGTAGCCATTGCATTGGC

>44102_44102_18_LARGE-CBX7_LARGE_chr22_34157358_ENST00000354992_CBX7_chr22_39545837_ENST00000475962_length(amino acids)=142AA_BP=
MAVTAARQRETRKNRVARRRRRLRGAGGPGQGPGRRCPTRSGGAKHRVRDCPRLQMPRRGSGSPKLSTVPSSSSSGAAAWSCSRPGGLFR

--------------------------------------------------------------
>44102_44102_19_LARGE-CBX7_LARGE_chr22_34157358_ENST00000397394_CBX7_chr22_39545837_ENST00000216133_length(transcript)=4428nt_BP=622nt
GGGGGCGCGGCCGCGCTCGTCTCGCCGGGCTGTTCGCGGGCAGGCCCTGCCCTGAAGGGACGAATCGGCTTGGAGCGCGGGAGGTGGAGT
CGGCCCCGGCGGTCGCTCCCTGGACCCAACCCGAGGCTGACCCAGGCCCCTGCCCATGCGGGGCGCCCCTGGCTCGGAAGAGTCCCCCGG
GCCGGGAGCAGCTCCAGGCAGCGGCCCCGGAGGAAGAGGAAGAAGGGACAGTGCTCAGCTTGGGGGACCCGGACCCTCGCCGCGGCATTT
GGAGCCGGGGGCAGTCCCGAACTCTGTGCTTGGCACCGCCGCTCCGAGTAGGGCAGCGCCTGCCGGGACCCTGACCCGGACCCCCTGCGC
CTCGTAGGCGGCGGCGCCGCCGCGCCACCCTGTTCTTCCGTGTCTCCCTCTGCCTGGCGGCAGTCACGGCCAAGAGAGTATTATGAGGGA
GGCCGAGGACTTCATGCTCCGGACAGAGAAACGGCGCTGGGATTAGGGATTGCCACTTCTGAGAGGATGCTGGGAATCTGCAGGGGGAGA
CGGAAATTCTTGGCTGCCTCGTTGAGTCTTCTCTGCATCCCAGCCATCACCTGGATTTACCTGTTTTCTGGGAGCTTCGAAGGGTAAAGT
CGAGTATCTGGTGAAGTGGAAAGGATGGCCCCCAAAGTACAGCACGTGGGAGCCAGAAGAGCACATCTTGGACCCCCGCCTCGTCATGGC
CTACGAGGAGAAGGAGGAGAGAGACCGAGCATCGGGGTATAGGAAGAGAGGTCCGAAACCCAAGCGGCTTCTGCTGCAGCGGCTGTACAG
CATGGACCTGCGGAGCTCCCACAAGGCCAAGGGCAAGGAGAAGCTCTGCTTCTCCCTGACGTGCCCACTCGGCAGCGGGAGCCCTGAGGG
GGTGGTCAAGGCGGGGGCACCTGAGCTGGTGGACAAGGGCCCCTTGGTGCCCACCCTGCCCTTCCCGCTCCGCAAGCCCCGAAAGGCCCA
CAAGTACCTGCGGCTCTCGCGCAAGAAGTTCCCGCCCCGCGGGCCCAACCTGGAGAGCCACAGCCATCGACGGGAGCTCTTCCTGCAGGA
GCCACCGGCCCCAGACGTCCTGCAGGCGGCTGGCGAGTGGGAGCCTGCTGCGCAGCCCCCTGAAGAGGAGGCAGATGCCGACCTGGCCGA
GGGGCCCCCTCCCTGGACACCTGCGCTCCCCTCAAGTGAGGTGACCGTGACCGACATCACCGCCAACTCCATCACCGTCACCTTCCGCGA
GGCCCAGGCAGCTGAGGGCTTCTTCCGAGACCGCAGTGGGAAGTTCTGAATCACCGTTTTTACTCTTCTTAAACTGTTTTCTTTTGGGCT
TGGGGTGGGACTTCCAGAGATAGGGATGGGTTGGGGGCGGGGTAATTATTTTATTTAAAAAAATACCGAGCAGCAAAAGGGGAGAAGATC
CCACTACTCTCCCACCACCTGCCCTTTCTCTGAGGGACGTTTACCACGAGGCCTCAGGCTGGGGATGGAGAGAGTTGCTCTGGGAGTTGG
GGTACCACCCCCAGGGCAGGATGGGGACAGGATCACCTGCCCGGGACACCACCATTATCATTCTCCTCTAGTGACGCAGCAGCTGGTTCT
GGGAGTTAAAGGAGCATTGGAAGGCCCAAACCCTCTCCCTTGAGTGGCCACCCCAGCCTGGTTGGCTGGTTTTCCCCTTTTCTCTTGTTT
CAATTGGGTCTTTACCTTGAACTCTCCTCTCTGGCTTTGCGGTGGGCTGTGGAGGCTGGTTTTGACCAAAAGTGAGTGGGGCGGGAGGAA
GGGGCAGGAGGAAGGGTTGAGGTTACTTGGGGCGAGTCCCTTCCCCTTCAGAGAGGCTTCTATCCTTCCCAGGGAGGAGGCGCCGCTGAG
ACCCTTCTGCTGAGAGCTCTGCCCTCCCCTCATCACCTGGCCTGTGCAGAAACGCTCATGCACACCTGGCTGCACAGGTGTGCACGCATT
ACCCTTCGCGTGTACGTTCCCATGTGCCCCGTGAAAGCATGTGTGGCTGCAGACGTGTCCACATGGGCCTTGCGAACCTGGGTTAGAAAC
CCTGGCCAGGCGAACGTGGGGTGATTCACAGCACAAAAGACCTCACCACCACACCTGCACTCACCCCACCTTGCATGCACCTTGCTACCT
GCTTGCGGCTTTCAGTGGAGGGCAGGGGTCTGGCACAGGTGCGATGGCACCCCATGCTCCAGGCATACAGATGTGGTTTCTCGGCTGCAC
CGGGCCAGGCTGCGGGTGTGCAGGCGTCTGCTAAGTTGTGTGATGTATCAGCACAGGCTTTGAGACGTCTGGACCCTGTCCTTCCTCCCG
TGAGGGGTTCTTGTTCTTTCTGACTCAGGTGACTTTTCAGCCCTTCCAATTCCCCTCTTTTTCTGCCCTCCCCTCCAACTCAGCCAACCC
AGGTGTGGGCAGTCAGGGAGGGAGGGAGTGTCCCACCACGTTCTCAGGGCAGCCCTTGACTCCTAAGCCCCTTCCTCCTTCCATTCTGCA
TCCCCTCCCCATCCAACCTAAATGCCACAGCTGGGGCTGAGCTGTATTCCTGTGGAGGGACCTCTGCCGTGCCTCTCTGAGGTCAGGCTG
TGCTGTGTGATGGGCAGGCTTTGCCCCAGCCCACCCCTGGCAAGGTGCACTTGTTTTCTGGTTTGTACAAGGTGTCCTGGGGGCCCGTGG
CTTCCCTGCCAGTGAGGAGTGACTTCTCCCTCTCTTCCAGTCCTGTAGGGGAGACAAAACCAGATTGGGGGGCCCAAGGGGAGCATGGAA
AAGGCCGGCTCCCCTGTCTTTCCTTGGCTGTCAGAGTCAGGGTAACACACACCAAGAGTGGAGTGCGGCCAGCAAGTTTGAGACCTGCCC
GCCCTCCTCGCAGCTCTGCTCTGTGTCCTCAGGAAGTCACAGAGTCTACTGAGGCAAGGAGAGGGTGATTCTTTCCCCAAATCCCTTCTT
CCCTGGTTCCCAAACCAAAGACAGCCTGCAGCCCTTTCTGCATGGGGTGCTCTGTTGACAGGCTTCCCAGATCCCTGAGTCTCTCTTTCC
TTCCTCCTCGATCTTTAGTTGTCCACGGTCAATTCAGTGCTTCCATTGGGGGACAGTCCCCTCCGGGATGACCTGATTCACCTCCAGCCC
AGGGAATGGAATCTAGAGGAATACGTGGGGTGGGTCTGGACAAGGAGCGGCAGGAATCACCACCCATCTCCAGCTGTGGAGCCCTGTGGA
GGGGAAGGGGAAGCTTGGGGTTCAGAGGGGACTCTTCCAGGAGAGGGGTGCCCAGCGGAGGTAAAGATGATAGAGGGTTGTGGGGGGTCT
CTAGTTGAATGTTTTGGCCCATGACTTTGGAACATGGCTGGCAGCTTCCAGCAGAAGTCACGCTCCCCATCCCCCAGGGGACATAGGACC
TTTTTCCTGCTTCCTGGTCACTTTCAAAGAACTATTTGCGCAATCTGTGGGTCTGTGGATTCACGGGGCTTTCTGTGTGGGTGCTGCAGT
TGCTTTTGTCTGCAGCAGCAGGACACATCTTTCCTCTTACTCAGCCCTTTATGGCCCATGGGGAACTCCGTGGCTCAGGGAGAGCTGAAC
TCCAGGGGTGTGACCTGGGACGGGTGGGCCTGAGGTGCCCAGCTCAGGGCAGCCAGGTGGCTCATGGGCTGTAGTGAGCCAGCTCCCTGG
GGGAAAAGGCTGTGGGCCGTTAGGACCATCCTCCAGGACAGGTGACCTCTATGAGGTCACCTACGGCTGTGGCCGTGCAGGCCTCCTTCC
AGCCCAGAGTGGCCCAGTAGAGCAAGGCAGACAGTGACCTCCACCCCCGCAGCCCTCTTAAAAGGCCAGTACTCTTGGGGGTGGGGGGAG
GGTTTAGAAAGCATTTGCCCATCTGCCTTTCTTTCCCCCAGCCCCCACCCGCTTTGAATGTAGAGACCCGTGGGCACTTTTCCTTTTGTG
GTGGGGGGTGCGGAGGAGGTACCCCCACCCCTGGCACAGCCGCCTGGAATGCAGGACTGTCACTGCTGTTCGGGTGATGACCTCGTTGCC
AAGCTCCTCCTGTCCCCTTGTTCTGGGGGCAGGCGCTGTGCTTCTGTGAGGTGGTTTAGCTTTTGCTTTCGAAGTGGCCAGCTGCGGCCA
CCAGGTCTCAGCACAAGAGCGCTTCCTTTGCACAGAATGAGCTTCGAGCTTTGTTCAGACTAAATGAATGTATCTGGGAGGGGTCGGGGG
CACGAGTTGATTCCAAGCACATGCCTTTGCTGAGTGTGTGTGTGCTGGGAGAGTCAGAGTGGATGTAGAGCGCGGTTTTATTTTTGTACT
GACATTGGTAAGAGACTGTATAGCATCTATTTATTTAGATGATTTATCTGGTAAATGAGGCAAAAAAATTATTAAAAATACATTAAAGAT

>44102_44102_19_LARGE-CBX7_LARGE_chr22_34157358_ENST00000397394_CBX7_chr22_39545837_ENST00000216133_length(amino acids)=233AA_BP=3
MGASKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEEKEERDRASGYRKRGPKPKRLLLQRLYSMDLRSSHKAKGKEKLCFSLTCP
LGSGSPEGVVKAGAPELVDKGPLVPTLPFPLRKPRKAHKYLRLSRKKFPPRGPNLESHSHRRELFLQEPPAPDVLQAAGEWEPAAQPPEE

--------------------------------------------------------------
>44102_44102_20_LARGE-CBX7_LARGE_chr22_34157358_ENST00000397394_CBX7_chr22_39545837_ENST00000401405_length(transcript)=1337nt_BP=622nt
GGGGGCGCGGCCGCGCTCGTCTCGCCGGGCTGTTCGCGGGCAGGCCCTGCCCTGAAGGGACGAATCGGCTTGGAGCGCGGGAGGTGGAGT
CGGCCCCGGCGGTCGCTCCCTGGACCCAACCCGAGGCTGACCCAGGCCCCTGCCCATGCGGGGCGCCCCTGGCTCGGAAGAGTCCCCCGG
GCCGGGAGCAGCTCCAGGCAGCGGCCCCGGAGGAAGAGGAAGAAGGGACAGTGCTCAGCTTGGGGGACCCGGACCCTCGCCGCGGCATTT
GGAGCCGGGGGCAGTCCCGAACTCTGTGCTTGGCACCGCCGCTCCGAGTAGGGCAGCGCCTGCCGGGACCCTGACCCGGACCCCCTGCGC
CTCGTAGGCGGCGGCGCCGCCGCGCCACCCTGTTCTTCCGTGTCTCCCTCTGCCTGGCGGCAGTCACGGCCAAGAGAGTATTATGAGGGA
GGCCGAGGACTTCATGCTCCGGACAGAGAAACGGCGCTGGGATTAGGGATTGCCACTTCTGAGAGGATGCTGGGAATCTGCAGGGGGAGA
CGGAAATTCTTGGCTGCCTCGTTGAGTCTTCTCTGCATCCCAGCCATCACCTGGATTTACCTGTTTTCTGGGAGCTTCGAAGGGTAAAGT
CGAGTATCTGGTGAAGTGGAAAGGATGGCCCCCAAAGTACAGCACGTGGGAGCCAGAAGAGCACATCTTGGACCCCCGCCTCGTCATGGC
CTACGAGGAGAAGGAGGAGAGAGACCGAGCATCGGGGTATAGGAAGAGAGGTCCGAAACCCAAGCGGCTTCTGCTGCAGGAGCCACCGGC
CCCAGACGTCCTGCAGGCGGCTGGCGAGTGGGAGCCTGCTGCGCAGCCCCCTGAAGAGGAGGCAGATGCCGACCTGGCCGAGGGGCCCCC
TCCCTGGACACCTGCGCTCCCCTCAAGTGAGGTGACCGTGACCGACATCACCGCCAACTCCATCACCGTCACCTTCCGCGAGGCCCAGGC
AGCTGAGGGCTTCTTCCGAGACCGCAGTGGGAAGTTCTGAATCACCGTTTTTACTCTTCTTAAACTGTTTTCTTTTGGGCTTGGGGTGGG
ACTTCCAGAGATAGGGATGGGTTGGGGGCGGGGTAATTATTTTATTTAAAAAAATACCGAGCAGCAAAAGGGGAGAAGATCCCACTACTC
TCCCACCACCTGCCCTTTCTCTGAGGGACGTTTACCACGAGGCCTCAGGCTGGGGATGGAGAGAGTTGCTCTGGGAGTTGGGGTACCACC

>44102_44102_20_LARGE-CBX7_LARGE_chr22_34157358_ENST00000397394_CBX7_chr22_39545837_ENST00000401405_length(amino acids)=194AA_BP=
MGCREDSTRQPRISVSPCRFPASSQKWQSLIPAPFLCPEHEVLGLPHNTLLAVTAARQRETRKNRVARRRRRLRGAGGPGQGPGRRCPTR
SGGAKHRVRDCPRLQMPRRGSGSPKLSTVPSSSSSGAAAWSCSRPGGLFRARGAPHGQGPGSASGWVQGATAGADSTSRAPSRFVPSGQG

--------------------------------------------------------------
>44102_44102_21_LARGE-CBX7_LARGE_chr22_34157358_ENST00000397394_CBX7_chr22_39545837_ENST00000475962_length(transcript)=1073nt_BP=622nt
GGGGGCGCGGCCGCGCTCGTCTCGCCGGGCTGTTCGCGGGCAGGCCCTGCCCTGAAGGGACGAATCGGCTTGGAGCGCGGGAGGTGGAGT
CGGCCCCGGCGGTCGCTCCCTGGACCCAACCCGAGGCTGACCCAGGCCCCTGCCCATGCGGGGCGCCCCTGGCTCGGAAGAGTCCCCCGG
GCCGGGAGCAGCTCCAGGCAGCGGCCCCGGAGGAAGAGGAAGAAGGGACAGTGCTCAGCTTGGGGGACCCGGACCCTCGCCGCGGCATTT
GGAGCCGGGGGCAGTCCCGAACTCTGTGCTTGGCACCGCCGCTCCGAGTAGGGCAGCGCCTGCCGGGACCCTGACCCGGACCCCCTGCGC
CTCGTAGGCGGCGGCGCCGCCGCGCCACCCTGTTCTTCCGTGTCTCCCTCTGCCTGGCGGCAGTCACGGCCAAGAGAGTATTATGAGGGA
GGCCGAGGACTTCATGCTCCGGACAGAGAAACGGCGCTGGGATTAGGGATTGCCACTTCTGAGAGGATGCTGGGAATCTGCAGGGGGAGA
CGGAAATTCTTGGCTGCCTCGTTGAGTCTTCTCTGCATCCCAGCCATCACCTGGATTTACCTGTTTTCTGGGAGCTTCGAAGGGTAAAGT
CGAGTATCTGGTGAAGTGGAAAGGATGGCCCCCAAACATCGCACTACTGATGACAAGCAGGCAACTGCGTTGGAGAGGGAGGTCATCATG
GCTGCAAGGAAGGGACCGGACCCATATAATATACTACCCCTAAAGGCAGCTTCAGGCACCAAAGAAGACCCTAATTTAGTCCTCTCCATC
ACCAACAAGAGAAGAGTGGGCTGCATCTGCGTAGAGGACAACAGTACCATCACCTGCTTTGGGCTGCACAAAGGCGAGACCCAGTGATGC
CCCAGCTATGGAACCCATTACAAGCTGGTGCCCCGCCAGCTGACGTACTGAGCACCTGCACCAAGTTACCCAAAATGTGCTGCAAAGTTT

>44102_44102_21_LARGE-CBX7_LARGE_chr22_34157358_ENST00000397394_CBX7_chr22_39545837_ENST00000475962_length(amino acids)=194AA_BP=
MGCREDSTRQPRISVSPCRFPASSQKWQSLIPAPFLCPEHEVLGLPHNTLLAVTAARQRETRKNRVARRRRRLRGAGGPGQGPGRRCPTR
SGGAKHRVRDCPRLQMPRRGSGSPKLSTVPSSSSSGAAAWSCSRPGGLFRARGAPHGQGPGSASGWVQGATAGADSTSRAPSRFVPSGQG

--------------------------------------------------------------
>44102_44102_22_LARGE-CBX7_LARGE_chr22_34157358_ENST00000402320_CBX7_chr22_39545837_ENST00000216133_length(transcript)=4414nt_BP=608nt
GCTCGTCTCGCCGGGCTGTTCGCGGGCAGGCCCTGCCCTGAAGGGACGAATCGGCTTGGAGCGCGGGAGGTGGAGTCGGCCCCGGCGGTC
GCTCCCTGGACCCAACCCGAGGCTGACCCAGGCCCCTGCCCATGCGGGGCGCCCCTGGCTCGGAAGAGTCCCCCGGGCCGGGAGCAGCTC
CAGGCAGCGGCCCCGGAGGAAGAGGAAGAAGGGACAGTGCTCAGCTTGGGGGACCCGGACCCTCGCCGCGGCATTTGGAGCCGGGGGCAG
TCCCGAACTCTGTGCTTGGCACCGCCGCTCCGAGTAGGGCAGCGCCTGCCGGGACCCTGACCCGGACCCCCTGCGCCTCGTAGGCGGCGG
CGCCGCCGCGCCACCCTGTTCTTCCGTGTCTCCCTCTGCCTGGCGGCAGTCACGGCCAAGAGAGTATTATGAGGGAGGCCGAGGACTTCA
TGCTCCGGACAGAGAAACGGCGCTGGGATTAGGGATTGCCACTTCTGAGAGGATGCTGGGAATCTGCAGGGGGAGACGGAAATTCTTGGC
TGCCTCGTTGAGTCTTCTCTGCATCCCAGCCATCACCTGGATTTACCTGTTTTCTGGGAGCTTCGAAGGGTAAAGTCGAGTATCTGGTGA
AGTGGAAAGGATGGCCCCCAAAGTACAGCACGTGGGAGCCAGAAGAGCACATCTTGGACCCCCGCCTCGTCATGGCCTACGAGGAGAAGG
AGGAGAGAGACCGAGCATCGGGGTATAGGAAGAGAGGTCCGAAACCCAAGCGGCTTCTGCTGCAGCGGCTGTACAGCATGGACCTGCGGA
GCTCCCACAAGGCCAAGGGCAAGGAGAAGCTCTGCTTCTCCCTGACGTGCCCACTCGGCAGCGGGAGCCCTGAGGGGGTGGTCAAGGCGG
GGGCACCTGAGCTGGTGGACAAGGGCCCCTTGGTGCCCACCCTGCCCTTCCCGCTCCGCAAGCCCCGAAAGGCCCACAAGTACCTGCGGC
TCTCGCGCAAGAAGTTCCCGCCCCGCGGGCCCAACCTGGAGAGCCACAGCCATCGACGGGAGCTCTTCCTGCAGGAGCCACCGGCCCCAG
ACGTCCTGCAGGCGGCTGGCGAGTGGGAGCCTGCTGCGCAGCCCCCTGAAGAGGAGGCAGATGCCGACCTGGCCGAGGGGCCCCCTCCCT
GGACACCTGCGCTCCCCTCAAGTGAGGTGACCGTGACCGACATCACCGCCAACTCCATCACCGTCACCTTCCGCGAGGCCCAGGCAGCTG
AGGGCTTCTTCCGAGACCGCAGTGGGAAGTTCTGAATCACCGTTTTTACTCTTCTTAAACTGTTTTCTTTTGGGCTTGGGGTGGGACTTC
CAGAGATAGGGATGGGTTGGGGGCGGGGTAATTATTTTATTTAAAAAAATACCGAGCAGCAAAAGGGGAGAAGATCCCACTACTCTCCCA
CCACCTGCCCTTTCTCTGAGGGACGTTTACCACGAGGCCTCAGGCTGGGGATGGAGAGAGTTGCTCTGGGAGTTGGGGTACCACCCCCAG
GGCAGGATGGGGACAGGATCACCTGCCCGGGACACCACCATTATCATTCTCCTCTAGTGACGCAGCAGCTGGTTCTGGGAGTTAAAGGAG
CATTGGAAGGCCCAAACCCTCTCCCTTGAGTGGCCACCCCAGCCTGGTTGGCTGGTTTTCCCCTTTTCTCTTGTTTCAATTGGGTCTTTA
CCTTGAACTCTCCTCTCTGGCTTTGCGGTGGGCTGTGGAGGCTGGTTTTGACCAAAAGTGAGTGGGGCGGGAGGAAGGGGCAGGAGGAAG
GGTTGAGGTTACTTGGGGCGAGTCCCTTCCCCTTCAGAGAGGCTTCTATCCTTCCCAGGGAGGAGGCGCCGCTGAGACCCTTCTGCTGAG
AGCTCTGCCCTCCCCTCATCACCTGGCCTGTGCAGAAACGCTCATGCACACCTGGCTGCACAGGTGTGCACGCATTACCCTTCGCGTGTA
CGTTCCCATGTGCCCCGTGAAAGCATGTGTGGCTGCAGACGTGTCCACATGGGCCTTGCGAACCTGGGTTAGAAACCCTGGCCAGGCGAA
CGTGGGGTGATTCACAGCACAAAAGACCTCACCACCACACCTGCACTCACCCCACCTTGCATGCACCTTGCTACCTGCTTGCGGCTTTCA
GTGGAGGGCAGGGGTCTGGCACAGGTGCGATGGCACCCCATGCTCCAGGCATACAGATGTGGTTTCTCGGCTGCACCGGGCCAGGCTGCG
GGTGTGCAGGCGTCTGCTAAGTTGTGTGATGTATCAGCACAGGCTTTGAGACGTCTGGACCCTGTCCTTCCTCCCGTGAGGGGTTCTTGT
TCTTTCTGACTCAGGTGACTTTTCAGCCCTTCCAATTCCCCTCTTTTTCTGCCCTCCCCTCCAACTCAGCCAACCCAGGTGTGGGCAGTC
AGGGAGGGAGGGAGTGTCCCACCACGTTCTCAGGGCAGCCCTTGACTCCTAAGCCCCTTCCTCCTTCCATTCTGCATCCCCTCCCCATCC
AACCTAAATGCCACAGCTGGGGCTGAGCTGTATTCCTGTGGAGGGACCTCTGCCGTGCCTCTCTGAGGTCAGGCTGTGCTGTGTGATGGG
CAGGCTTTGCCCCAGCCCACCCCTGGCAAGGTGCACTTGTTTTCTGGTTTGTACAAGGTGTCCTGGGGGCCCGTGGCTTCCCTGCCAGTG
AGGAGTGACTTCTCCCTCTCTTCCAGTCCTGTAGGGGAGACAAAACCAGATTGGGGGGCCCAAGGGGAGCATGGAAAAGGCCGGCTCCCC
TGTCTTTCCTTGGCTGTCAGAGTCAGGGTAACACACACCAAGAGTGGAGTGCGGCCAGCAAGTTTGAGACCTGCCCGCCCTCCTCGCAGC
TCTGCTCTGTGTCCTCAGGAAGTCACAGAGTCTACTGAGGCAAGGAGAGGGTGATTCTTTCCCCAAATCCCTTCTTCCCTGGTTCCCAAA
CCAAAGACAGCCTGCAGCCCTTTCTGCATGGGGTGCTCTGTTGACAGGCTTCCCAGATCCCTGAGTCTCTCTTTCCTTCCTCCTCGATCT
TTAGTTGTCCACGGTCAATTCAGTGCTTCCATTGGGGGACAGTCCCCTCCGGGATGACCTGATTCACCTCCAGCCCAGGGAATGGAATCT
AGAGGAATACGTGGGGTGGGTCTGGACAAGGAGCGGCAGGAATCACCACCCATCTCCAGCTGTGGAGCCCTGTGGAGGGGAAGGGGAAGC
TTGGGGTTCAGAGGGGACTCTTCCAGGAGAGGGGTGCCCAGCGGAGGTAAAGATGATAGAGGGTTGTGGGGGGTCTCTAGTTGAATGTTT
TGGCCCATGACTTTGGAACATGGCTGGCAGCTTCCAGCAGAAGTCACGCTCCCCATCCCCCAGGGGACATAGGACCTTTTTCCTGCTTCC
TGGTCACTTTCAAAGAACTATTTGCGCAATCTGTGGGTCTGTGGATTCACGGGGCTTTCTGTGTGGGTGCTGCAGTTGCTTTTGTCTGCA
GCAGCAGGACACATCTTTCCTCTTACTCAGCCCTTTATGGCCCATGGGGAACTCCGTGGCTCAGGGAGAGCTGAACTCCAGGGGTGTGAC
CTGGGACGGGTGGGCCTGAGGTGCCCAGCTCAGGGCAGCCAGGTGGCTCATGGGCTGTAGTGAGCCAGCTCCCTGGGGGAAAAGGCTGTG
GGCCGTTAGGACCATCCTCCAGGACAGGTGACCTCTATGAGGTCACCTACGGCTGTGGCCGTGCAGGCCTCCTTCCAGCCCAGAGTGGCC
CAGTAGAGCAAGGCAGACAGTGACCTCCACCCCCGCAGCCCTCTTAAAAGGCCAGTACTCTTGGGGGTGGGGGGAGGGTTTAGAAAGCAT
TTGCCCATCTGCCTTTCTTTCCCCCAGCCCCCACCCGCTTTGAATGTAGAGACCCGTGGGCACTTTTCCTTTTGTGGTGGGGGGTGCGGA
GGAGGTACCCCCACCCCTGGCACAGCCGCCTGGAATGCAGGACTGTCACTGCTGTTCGGGTGATGACCTCGTTGCCAAGCTCCTCCTGTC
CCCTTGTTCTGGGGGCAGGCGCTGTGCTTCTGTGAGGTGGTTTAGCTTTTGCTTTCGAAGTGGCCAGCTGCGGCCACCAGGTCTCAGCAC
AAGAGCGCTTCCTTTGCACAGAATGAGCTTCGAGCTTTGTTCAGACTAAATGAATGTATCTGGGAGGGGTCGGGGGCACGAGTTGATTCC
AAGCACATGCCTTTGCTGAGTGTGTGTGTGCTGGGAGAGTCAGAGTGGATGTAGAGCGCGGTTTTATTTTTGTACTGACATTGGTAAGAG
ACTGTATAGCATCTATTTATTTAGATGATTTATCTGGTAAATGAGGCAAAAAAATTATTAAAAATACATTAAAGATGATTTAAAAAAAAG

>44102_44102_22_LARGE-CBX7_LARGE_chr22_34157358_ENST00000402320_CBX7_chr22_39545837_ENST00000216133_length(amino acids)=233AA_BP=3
MGASKGKVEYLVKWKGWPPKYSTWEPEEHILDPRLVMAYEEKEERDRASGYRKRGPKPKRLLLQRLYSMDLRSSHKAKGKEKLCFSLTCP
LGSGSPEGVVKAGAPELVDKGPLVPTLPFPLRKPRKAHKYLRLSRKKFPPRGPNLESHSHRRELFLQEPPAPDVLQAAGEWEPAAQPPEE

--------------------------------------------------------------
>44102_44102_23_LARGE-CBX7_LARGE_chr22_34157358_ENST00000402320_CBX7_chr22_39545837_ENST00000401405_length(transcript)=1323nt_BP=608nt
GCTCGTCTCGCCGGGCTGTTCGCGGGCAGGCCCTGCCCTGAAGGGACGAATCGGCTTGGAGCGCGGGAGGTGGAGTCGGCCCCGGCGGTC
GCTCCCTGGACCCAACCCGAGGCTGACCCAGGCCCCTGCCCATGCGGGGCGCCCCTGGCTCGGAAGAGTCCCCCGGGCCGGGAGCAGCTC
CAGGCAGCGGCCCCGGAGGAAGAGGAAGAAGGGACAGTGCTCAGCTTGGGGGACCCGGACCCTCGCCGCGGCATTTGGAGCCGGGGGCAG
TCCCGAACTCTGTGCTTGGCACCGCCGCTCCGAGTAGGGCAGCGCCTGCCGGGACCCTGACCCGGACCCCCTGCGCCTCGTAGGCGGCGG
CGCCGCCGCGCCACCCTGTTCTTCCGTGTCTCCCTCTGCCTGGCGGCAGTCACGGCCAAGAGAGTATTATGAGGGAGGCCGAGGACTTCA
TGCTCCGGACAGAGAAACGGCGCTGGGATTAGGGATTGCCACTTCTGAGAGGATGCTGGGAATCTGCAGGGGGAGACGGAAATTCTTGGC
TGCCTCGTTGAGTCTTCTCTGCATCCCAGCCATCACCTGGATTTACCTGTTTTCTGGGAGCTTCGAAGGGTAAAGTCGAGTATCTGGTGA
AGTGGAAAGGATGGCCCCCAAAGTACAGCACGTGGGAGCCAGAAGAGCACATCTTGGACCCCCGCCTCGTCATGGCCTACGAGGAGAAGG
AGGAGAGAGACCGAGCATCGGGGTATAGGAAGAGAGGTCCGAAACCCAAGCGGCTTCTGCTGCAGGAGCCACCGGCCCCAGACGTCCTGC
AGGCGGCTGGCGAGTGGGAGCCTGCTGCGCAGCCCCCTGAAGAGGAGGCAGATGCCGACCTGGCCGAGGGGCCCCCTCCCTGGACACCTG
CGCTCCCCTCAAGTGAGGTGACCGTGACCGACATCACCGCCAACTCCATCACCGTCACCTTCCGCGAGGCCCAGGCAGCTGAGGGCTTCT
TCCGAGACCGCAGTGGGAAGTTCTGAATCACCGTTTTTACTCTTCTTAAACTGTTTTCTTTTGGGCTTGGGGTGGGACTTCCAGAGATAG
GGATGGGTTGGGGGCGGGGTAATTATTTTATTTAAAAAAATACCGAGCAGCAAAAGGGGAGAAGATCCCACTACTCTCCCACCACCTGCC
CTTTCTCTGAGGGACGTTTACCACGAGGCCTCAGGCTGGGGATGGAGAGAGTTGCTCTGGGAGTTGGGGTACCACCCCCAGGGCAGGATG

>44102_44102_23_LARGE-CBX7_LARGE_chr22_34157358_ENST00000402320_CBX7_chr22_39545837_ENST00000401405_length(amino acids)=189AA_BP=
MGCREDSTRQPRISVSPCRFPASSQKWQSLIPAPFLCPEHEVLGLPHNTLLAVTAARQRETRKNRVARRRRRLRGAGGPGQGPGRRCPTR
SGGAKHRVRDCPRLQMPRRGSGSPKLSTVPSSSSSGAAAWSCSRPGGLFRARGAPHGQGPGSASGWVQGATAGADSTSRAPSRFVPSGQG

--------------------------------------------------------------
>44102_44102_24_LARGE-CBX7_LARGE_chr22_34157358_ENST00000402320_CBX7_chr22_39545837_ENST00000475962_length(transcript)=1059nt_BP=608nt
GCTCGTCTCGCCGGGCTGTTCGCGGGCAGGCCCTGCCCTGAAGGGACGAATCGGCTTGGAGCGCGGGAGGTGGAGTCGGCCCCGGCGGTC
GCTCCCTGGACCCAACCCGAGGCTGACCCAGGCCCCTGCCCATGCGGGGCGCCCCTGGCTCGGAAGAGTCCCCCGGGCCGGGAGCAGCTC
CAGGCAGCGGCCCCGGAGGAAGAGGAAGAAGGGACAGTGCTCAGCTTGGGGGACCCGGACCCTCGCCGCGGCATTTGGAGCCGGGGGCAG
TCCCGAACTCTGTGCTTGGCACCGCCGCTCCGAGTAGGGCAGCGCCTGCCGGGACCCTGACCCGGACCCCCTGCGCCTCGTAGGCGGCGG
CGCCGCCGCGCCACCCTGTTCTTCCGTGTCTCCCTCTGCCTGGCGGCAGTCACGGCCAAGAGAGTATTATGAGGGAGGCCGAGGACTTCA
TGCTCCGGACAGAGAAACGGCGCTGGGATTAGGGATTGCCACTTCTGAGAGGATGCTGGGAATCTGCAGGGGGAGACGGAAATTCTTGGC
TGCCTCGTTGAGTCTTCTCTGCATCCCAGCCATCACCTGGATTTACCTGTTTTCTGGGAGCTTCGAAGGGTAAAGTCGAGTATCTGGTGA
AGTGGAAAGGATGGCCCCCAAACATCGCACTACTGATGACAAGCAGGCAACTGCGTTGGAGAGGGAGGTCATCATGGCTGCAAGGAAGGG
ACCGGACCCATATAATATACTACCCCTAAAGGCAGCTTCAGGCACCAAAGAAGACCCTAATTTAGTCCTCTCCATCACCAACAAGAGAAG
AGTGGGCTGCATCTGCGTAGAGGACAACAGTACCATCACCTGCTTTGGGCTGCACAAAGGCGAGACCCAGTGATGCCCCAGCTATGGAAC
CCATTACAAGCTGGTGCCCCGCCAGCTGACGTACTGAGCACCTGCACCAAGTTACCCAAAATGTGCTGCAAAGTTTCTTCTTTCCAATAA

>44102_44102_24_LARGE-CBX7_LARGE_chr22_34157358_ENST00000402320_CBX7_chr22_39545837_ENST00000475962_length(amino acids)=189AA_BP=
MGCREDSTRQPRISVSPCRFPASSQKWQSLIPAPFLCPEHEVLGLPHNTLLAVTAARQRETRKNRVARRRRRLRGAGGPGQGPGRRCPTR
SGGAKHRVRDCPRLQMPRRGSGSPKLSTVPSSSSSGAAAWSCSRPGGLFRARGAPHGQGPGSASGWVQGATAGADSTSRAPSRFVPSGQG

--------------------------------------------------------------
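
The sequence headers above pack several fields into one underscore-delimited line: two numeric IDs, a record index, the fusion name, then gene, chromosome, breakpoint and Ensembl transcript for each partner, followed by the sequence length and a BP value. The following is a minimal sketch of how such a header could be split into named fields. The regular expression and the field names (`id1`, `id2`, `index`, `hbp`, `tbp`, and so on) are assumptions inferred from the headers shown on this page, not an official FusionGDB2 format specification, and the pattern would not handle gene symbols that contain underscores.

```python
import re

# Hypothetical pattern inferred from the headers on this page (an assumption,
# not a documented FusionGDB2 format). The BP value may be empty.
HEADER_RE = re.compile(
    r"^>(?P<id1>\d+)_(?P<id2>\d+)_(?P<index>\d+)_"
    r"(?P<fusion>[^_]+)_"
    r"(?P<hgene>[^_]+)_(?P<hchr>chr[^_]+)_(?P<hbp>\d+)_(?P<henst>ENST\d+)_"
    r"(?P<tgene>[^_]+)_(?P<tchr>chr[^_]+)_(?P<tbp>\d+)_(?P<tenst>ENST\d+)_"
    r"length\((?P<kind>transcript|amino acids)\)=(?P<length>\d+)(?:nt|AA)"
    r"_BP=(?P<bp>\d*)(?:nt)?$"
)


def parse_header(line):
    """Return the header fields as a dict, or None if the line does not match."""
    m = HEADER_RE.match(line.strip())
    if m is None:
        return None
    fields = m.groupdict()
    # Convert the numeric fields; an empty BP value becomes None.
    for key in ("index", "hbp", "tbp", "length"):
        fields[key] = int(fields[key])
    fields["bp"] = int(fields["bp"]) if fields["bp"] else None
    return fields


example = (
    ">44102_44102_16_LARGE-CBX7_LARGE_chr22_34157358_ENST00000354992_"
    "CBX7_chr22_39545837_ENST00000216133_length(transcript)=4484nt_BP=678nt"
)
record = parse_header(example)
print(record["hgene"], record["hbp"], record["tgene"], record["tbp"], record["length"])
```

Returning `None` for non-matching lines lets the same function be used to skip sequence and separator lines when scanning the whole listing.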


Fusion Gene PPI Analysis for LARGE-CBX7


Go to ChiPPI (Chimeric Protein-Protein Interactions) to see the chimeric PPIs on the ChiPPI page.


Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
Hgene | Hgene's interactors | Tgene | Tgene's interactors


- Retained PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Still interaction with


- Lost PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with


- Retained PPIs, but lost function due to frame-shift fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with



Related Drugs for LARGE-CBX7


Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
Partner | Gene | UniProtAcc | DrugBank ID | Drug name | Drug activity | Drug type | Drug status


Related Diseases for LARGE-CBX7


Diseases associated with fusion partners.
(DisGeNet 4.0)
Partner | Gene | Disease ID | Disease name | # pubmeds | Source
Hgene | LARGE1 | C3150414 | MUSCULAR DYSTROPHY-DYSTROGLYCANOPATHY (CONGENITAL WITH BRAIN AND EYE ANOMALIES), TYPE A, 6 | 7 | GENOMICS_ENGLAND;UNIPROT
Hgene | LARGE1 | C1837229 | Muscular Dystrophy, Congenital, Type 1D | 5 | CTD_human;GENOMICS_ENGLAND;UNIPROT
Hgene | LARGE1 | C0265221 | Walker-Warburg congenital muscular dystrophy | 3 | CTD_human;GENOMICS_ENGLAND;ORPHANET
Hgene | LARGE1 | C0457133 | Muscle eye brain disease | 2 | CTD_human;GENOMICS_ENGLAND;ORPHANET
Hgene | LARGE1 | C0023903 | Liver neoplasms | 1 | CTD_human
Hgene | LARGE1 | C0043094 | Weight Gain | 1 | CTD_human
Hgene | LARGE1 | C0236733 | Amphetamine-Related Disorders | 1 | CTD_human
Hgene | LARGE1 | C0236804 | Amphetamine Addiction | 1 | CTD_human
Hgene | LARGE1 | C0236807 | Amphetamine Abuse | 1 | CTD_human
Hgene | LARGE1 | C0345904 | Malignant neoplasm of liver | 1 | CTD_human
Hgene | LARGE1 | C3714756 | Intellectual Disability | 1 | GENOMICS_ENGLAND
Tgene | | C0026764 | Multiple Myeloma | 1 | CTD_human