FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:CELSR1-CD52 (FusionGDB2 ID:15677)

Fusion Gene Summary for CELSR1-CD52

check button Fusion gene summary
Fusion gene informationFusion gene name: CELSR1-CD52
Fusion gene ID: 15677
HgeneTgene
Gene symbol

CELSR1

CD52

Gene ID

9620

1043

Gene namecadherin EGF LAG seven-pass G-type receptor 1CD52 molecule
SynonymsADGRC1|CDHF9|FMI2|HFMI2|ME2CDW52|EDDM5|HE5
Cytomap

22q13.31

1p36.11

Type of geneprotein-codingprotein-coding
Descriptioncadherin EGF LAG seven-pass G-type receptor 1adhesion G protein-coupled receptor C1cadherin family member 9cadherin, EGF LAG seven-pass G-type receptor 1 (flamingo homolog, Drosophila)flamingo homolog 2protocadherin flamingo 2CAMPATH-1 antigenCD52 antigen (CAMPATH-1 antigen)CDW52 antigen (CAMPATH-1 antigen)HEL-S-171mPcambridge pathology 1 antigenepididymal secretory protein E5epididymis secretory sperm binding protein Li 171mPhuman epididymis-specific protein 5
Modification date2020032220200313
UniProtAcc

Q9NYQ6

.
Ensembl transtripts involved in fusion geneENST00000262738, ENST00000395964, 
ENST00000497509, 
ENST00000374213, 
ENST00000492808, 
Fusion gene scores* DoF score23 X 19 X 11=48075 X 3 X 5=75
# samples 286
** MAII scorelog2(28/4807*10)=-4.10163807119293
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(6/75*10)=-0.321928094887362
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: CELSR1 [Title/Abstract] AND CD52 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointCELSR1(46859604)-CD52(26646662), # samples:3
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
TgeneCD52

GO:0007204

positive regulation of cytosolic calcium ion concentration

8223854


check buttonFusion gene breakpoints across CELSR1 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.

check buttonFusion gene breakpoints across CD52 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4LUADTCGA-62-8394-01ACELSR1chr22

46859604

-CD52chr1

26646662

+
ChimerDB4LUADTCGA-62-8394CELSR1chr22

46859603

-CD52chr1

26646661

+
ChimerDB4LUADTCGA-62-8394-01ACELSR1chr22

46859604

-CD52chr1

26646662

+
ChimerDB4LUADTCGA-62-8394-01ACELSR1chr22

46859604

-CD52chr1

26646662

+
ChimerDB4LUADTCGA-62-8394-01ACELSR1chr22

46859603

-CD52chr1

26646661

+


Top

Fusion Gene ORF analysis for CELSR1-CD52

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
In-frameENST00000262738ENST00000374213CELSR1chr22

46859604

-CD52chr1

26646662

+
5CDS-intronENST00000262738ENST00000492808CELSR1chr22

46859604

-CD52chr1

26646662

+
In-frameENST00000395964ENST00000374213CELSR1chr22

46859604

-CD52chr1

26646662

+
5CDS-intronENST00000395964ENST00000492808CELSR1chr22

46859604

-CD52chr1

26646662

+
intron-3CDSENST00000497509ENST00000374213CELSR1chr22

46859604

-CD52chr1

26646662

+
intron-intronENST00000497509ENST00000492808CELSR1chr22

46859604

-CD52chr1

26646662

+
In-frameENST00000262738ENST00000374213CELSR1chr22

46859603

-CD52chr1

26646661

+
5CDS-intronENST00000262738ENST00000492808CELSR1chr22

46859603

-CD52chr1

26646661

+
In-frameENST00000395964ENST00000374213CELSR1chr22

46859603

-CD52chr1

26646661

+
5CDS-intronENST00000395964ENST00000492808CELSR1chr22

46859603

-CD52chr1

26646661

+
intron-3CDSENST00000497509ENST00000374213CELSR1chr22

46859603

-CD52chr1

26646661

+
intron-intronENST00000497509ENST00000492808CELSR1chr22

46859603

-CD52chr1

26646661

+

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000262738CELSR1chr2246859604-ENST00000374213CD52chr126646662+45364183043191439
ENST00000395964CELSR1chr2246859604-ENST00000374213CD52chr126646662+45364183043191439
ENST00000262738CELSR1chr2246859603-ENST00000374213CD52chr126646661+45364183043191439
ENST00000395964CELSR1chr2246859603-ENST00000374213CD52chr126646661+45364183043191439

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000262738ENST00000374213CELSR1chr2246859604-CD52chr126646662+0.0021499070.99785006
ENST00000395964ENST00000374213CELSR1chr2246859604-CD52chr126646662+0.0021499070.99785006
ENST00000262738ENST00000374213CELSR1chr2246859603-CD52chr126646661+0.0021499070.99785006
ENST00000395964ENST00000374213CELSR1chr2246859603-CD52chr126646661+0.0021499070.99785006

Top

Fusion Genomic Features for CELSR1-CD52


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)
CELSR1chr2246859603-CD52chr126646661+1.05E-050.9999895
CELSR1chr2246859603-CD52chr126646661+1.05E-050.9999895
CELSR1chr2246859603-CD52chr126646661+1.05E-050.9999895
CELSR1chr2246859603-CD52chr126646661+1.05E-050.9999895

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.

Top

Fusion Protein Features for CELSR1-CD52


check button Go to

FGviewer for the breakpoints of chr22:46859604-chr1:26646662

.
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
CELSR1

Q9NYQ6

.
FUNCTION: Receptor that may have an important role in cell/cell signaling during nervous system formation.FUNCTION: Might normally function as a transcriptional repressor. EWS-fusion-proteins (EFPS) may play a role in the tumorigenic process. They may disturb gene expression by mimicking, or interfering with the normal function of CTD-POLII within the transcription initiation complex. They may also contribute to an aberrant activation of the fusion protein target genes.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2351000_11011394.33333333333333015.0DomainCadherin 8
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2351106_12241394.33333333333333015.0DomainCadherin 9
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2351303_13611394.33333333333333015.0DomainEGF-like 1%3B calcium-binding
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-235246_3531394.33333333333333015.0DomainCadherin 1
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-235354_4591394.33333333333333015.0DomainCadherin 2
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-235460_5651394.33333333333333015.0DomainCadherin 3
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-235566_6871394.33333333333333015.0DomainCadherin 4
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-235688_7891394.33333333333333015.0DomainCadherin 5
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-235790_8921394.33333333333333015.0DomainCadherin 6
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-235893_9991394.33333333333333015.0DomainCadherin 7
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2351000_11011394.33333333333333015.0DomainCadherin 8
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2351106_12241394.33333333333333015.0DomainCadherin 9
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2351303_13611394.33333333333333015.0DomainEGF-like 1%3B calcium-binding
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-235246_3531394.33333333333333015.0DomainCadherin 1
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-235354_4591394.33333333333333015.0DomainCadherin 2
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-235460_5651394.33333333333333015.0DomainCadherin 3
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-235566_6871394.33333333333333015.0DomainCadherin 4
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-235688_7891394.33333333333333015.0DomainCadherin 5
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-235790_8921394.33333333333333015.0DomainCadherin 6
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-235893_9991394.33333333333333015.0DomainCadherin 7

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2352659_26631394.33333333333333015.0Compositional biasNote=Poly-Leu
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2352659_26631394.33333333333333015.0Compositional biasNote=Poly-Leu
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2351363_13991394.33333333333333015.0DomainEGF-like 2%3B calcium-binding
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2351403_14411394.33333333333333015.0DomainEGF-like 3%3B calcium-binding
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2351442_16461394.33333333333333015.0DomainLaminin G-like 1
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2351649_16851394.33333333333333015.0DomainEGF-like 4%3B calcium-binding
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2351689_18701394.33333333333333015.0DomainLaminin G-like 2
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2351872_19071394.33333333333333015.0DomainEGF-like 5%3B calcium-binding
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2351908_19461394.33333333333333015.0DomainEGF-like 6%3B calcium-binding
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2351947_19791394.33333333333333015.0DomainEGF-like 7%3B calcium-binding
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2351981_20161394.33333333333333015.0DomainEGF-like 8%3B calcium-binding
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2352003_20501394.33333333333333015.0DomainLaminin EGF-like
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2352408_24601394.33333333333333015.0DomainGPS
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2351363_13991394.33333333333333015.0DomainEGF-like 2%3B calcium-binding
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2351403_14411394.33333333333333015.0DomainEGF-like 3%3B calcium-binding
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2351442_16461394.33333333333333015.0DomainLaminin G-like 1
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2351649_16851394.33333333333333015.0DomainEGF-like 4%3B calcium-binding
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2351689_18701394.33333333333333015.0DomainLaminin G-like 2
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2351872_19071394.33333333333333015.0DomainEGF-like 5%3B calcium-binding
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2351908_19461394.33333333333333015.0DomainEGF-like 6%3B calcium-binding
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2351947_19791394.33333333333333015.0DomainEGF-like 7%3B calcium-binding
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2351981_20161394.33333333333333015.0DomainEGF-like 8%3B calcium-binding
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2352003_20501394.33333333333333015.0DomainLaminin EGF-like
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2352408_24601394.33333333333333015.0DomainGPS
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-23522_24691394.33333333333333015.0Topological domainExtracellular
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2352491_25011394.33333333333333015.0Topological domainCytoplasmic
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2352523_25271394.33333333333333015.0Topological domainExtracellular
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2352549_25721394.33333333333333015.0Topological domainCytoplasmic
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2352594_26111394.33333333333333015.0Topological domainExtracellular
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2352633_26551394.33333333333333015.0Topological domainCytoplasmic
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2352677_26831394.33333333333333015.0Topological domainExtracellular
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2352705_30141394.33333333333333015.0Topological domainCytoplasmic
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-23522_24691394.33333333333333015.0Topological domainExtracellular
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2352491_25011394.33333333333333015.0Topological domainCytoplasmic
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2352523_25271394.33333333333333015.0Topological domainExtracellular
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2352549_25721394.33333333333333015.0Topological domainCytoplasmic
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2352594_26111394.33333333333333015.0Topological domainExtracellular
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2352633_26551394.33333333333333015.0Topological domainCytoplasmic
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2352677_26831394.33333333333333015.0Topological domainExtracellular
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2352705_30141394.33333333333333015.0Topological domainCytoplasmic
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2352470_24901394.33333333333333015.0TransmembraneHelical%3B Name%3D1
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2352502_25221394.33333333333333015.0TransmembraneHelical%3B Name%3D2
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2352528_25481394.33333333333333015.0TransmembraneHelical%3B Name%3D3
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2352573_25931394.33333333333333015.0TransmembraneHelical%3B Name%3D4
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2352612_26321394.33333333333333015.0TransmembraneHelical%3B Name%3D5
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2352656_26761394.33333333333333015.0TransmembraneHelical%3B Name%3D6
HgeneCELSR1chr22:46859603chr1:26646661ENST00000262738-2352684_27041394.33333333333333015.0TransmembraneHelical%3B Name%3D7
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2352470_24901394.33333333333333015.0TransmembraneHelical%3B Name%3D1
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2352502_25221394.33333333333333015.0TransmembraneHelical%3B Name%3D2
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2352528_25481394.33333333333333015.0TransmembraneHelical%3B Name%3D3
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2352573_25931394.33333333333333015.0TransmembraneHelical%3B Name%3D4
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2352612_26321394.33333333333333015.0TransmembraneHelical%3B Name%3D5
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2352656_26761394.33333333333333015.0TransmembraneHelical%3B Name%3D6
HgeneCELSR1chr22:46859604chr1:26646662ENST00000262738-2352684_27041394.33333333333333015.0TransmembraneHelical%3B Name%3D7


Top

Fusion Gene Sequence for CELSR1-CD52


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>In-frame_ENST00000262738_ENST00000374213_TCGA-62-8394-01A_CELSR1_chr22_46859604_-_CD52_chr1_26646662_length(transcript)=4536nt_BP=4183nt
ATGGCGCCGCCGCCGCCGCCCGTGCTGCCCGTGCTGCTGCTCCTGGCCGCCGCCGCCGCCCTGCCGGCGATGGGGCTGCGAGCGGCCGCC
TGGGAGCCGCGCGTACCCGGCGGGACCCGCGCCTTCGCCCTCCGGCCCGGCTGTACCTACGCGGTGGGCGCCGCTTGCACGCCCCGGGCG
CCGCGGGAGCTGCTGGACGTGGGCCGCGATGGGCGGCTGGCAGGACGTCGGCGCGTCTCGGGCGCGGGGCGCCCGCTGCCGCTGCAAGTC
CGCTTGGTGGCCCGCAGTGCCCCGACGGCGCTGAGCCGCCGCCTGCGGGCGCGCACGCACCTTCCCGGCTGCGGAGCCCGTGCCCGGCTC
TGCGGAACCGGTGCCCGGCTCTGCGGGGCGCTCTGCTTCCCCGTCCCCGGCGGCTGCGCGGCCGCGCAGCATTCGGCGCTCGCAGCTCCG
ACCACCTTACCCGCCTGCCGCTGCCCGCCGCGCCCCAGGCCCCGCTGTCCCGGCCGTCCCATCTGCCTGCCGCCGGGCGGCTCGGTCCGC
CTGCGTCTGCTGTGCGCCCTGCGGCGCGCGGCTGGCGCCGTCCGGGTGGGACTGGCGCTGGAGGCCGCCACCGCGGGGACGCCCTCCGCG
TCGCCATCCCCATCGCCGCCCCTGCCGCCGAACTTGCCCGAAGCCCGGGCGGGGCCGGCGCGACGGGCCCGGCGGGGCACGAGCGGCAGA
GGGAGCCTGAAGTTTCCGATGCCCAACTACCAGGTGGCGTTGTTTGAGAACGAACCGGCGGGCACCCTCATCCTCCAGCTGCACGCGCAC
TACACCATCGAGGGCGAGGAGGAGCGCGTGAGCTATTACATGGAGGGGCTGTTCGACGAGCGCTCCCGGGGCTACTTCCGAATCGACTCT
GCCACGGGCGCCGTGAGCACGGACAGCGTACTGGACCGCGAGACCAAGGAGACGCACGTCCTCAGGGTGAAAGCCGTGGACTACAGTACG
CCGCCGCGCTCGGCCACCACCTACATCACTGTCTTGGTCAAAGACACCAACGACCACAGCCCGGTCTTCGAGCAGTCGGAGTACCGCGAG
CGCGTGCGGGAGAACCTGGAGGTGGGCTACGAGGTGCTGACCATCCGCGCCAGCGACCGCGACTCGCCCATCAACGCCAACTTGCGTTAC
CGCGTGTTGGGGGGCGCGTGGGACGTCTTCCAGCTCAACGAGAGCTCTGGCGTGGTGAGCACACGGGCGGTGCTGGACCGGGAGGAGGCG
GCCGAGTACCAGCTCCTGGTGGAGGCCAACGACCAGGGGCGCAATCCGGGCCCGCTCAGTGCCACGGCCACCGTGTACATCGAGGTGGAG
GACGAGAACGACAACTACCCCCAGTTCAGCGAGCAGAACTACGTGGTCCAGGTGCCCGAGGACGTGGGGCTCAACACGGCTGTGCTGCGA
GTGCAGGCCACGGACCGGGACCAGGGCCAGAACGCGGCCATTCACTACAGCATCCTCAGCGGGAACGTGGCCGGCCAGTTCTACCTGCAC
TCGCTGAGCGGGATCCTGGATGTGATCAACCCCTTGGATTTCGAGGATGTCCAGAAATACTCGCTGAGCATTAAGGCCCAGGATGGGGGC
CGGCCCCCGCTCATCAATTCTTCAGGGGTGGTGTCTGTGCAGGTGCTGGATGTCAACGACAACGAGCCTATCTTTGTGAGCAGCCCCTTC
CAGGCCACGGTGCTGGAGAATGTGCCCCTGGGCTACCCCGTGGTGCACATTCAGGCGGTGGACGCGGACTCTGGAGAGAACGCCCGGCTG
CACTATCGCCTGGTGGACACGGCCTCCACCTTTCTGGGGGGCGGCAGCGCTGGGCCTAAGAATCCTGCCCCCACCCCTGACTTCCCCTTC
CAGATCCACAACAGCTCCGGTTGGATCACAGTGTGTGCCGAGCTGGACCGCGAGGAGGTGGAGCACTACAGCTTCGGGGTGGAGGCGGTG
GACCACGGCTCGCCCCCCATGAGCTCCTCCACCAGCGTGTCCATCACGGTGCTGGACGTGAATGACAACGACCCGGTGTTCACGCAGCCC
ACCTACGAGCTTCGTCTGAATGAGGATGCGGCCGTGGGGAGCAGCGTGCTGACCCTGCAGGCCCGCGACCGTGACGCCAACAGTGTGATT
ACCTACCAGCTCACAGGCGGCAACACCCGGAACCGCTTTGCACTCAGCAGCCAGAGAGGGGGCGGCCTCATCACCCTGGCGCTACCTCTG
GACTACAAGCAGGAGCAGCAGTACGTGCTGGCGGTGACAGCATCCGACGGCACACGGTCGCACACTGCGCATGTCCTAATCAACGTCACT
GATGCCAACACCCACAGGCCTGTCTTTCAGAGCTCCCATTACACAGTGAGTGTCAGTGAGGACAGGCCTGTGGGCACCTCCATTGCTACC
CTCAGTGCCAACGATGAGGACACAGGAGAGAATGCCCGCATCACCTACGTGATTCAGGACCCCGTGCCGCAGTTCCGCATTGACCCCGAC
AGTGGCACCATGTACACCATGATGGAGCTGGACTATGAGAACCAGGTCGCCTACACGCTGACCATCATGGCCCAGGACAACGGCATCCCG
CAGAAATCAGACACCACCACCCTAGAGATCCTCATCCTCGATGCCAATGACAATGCACCCCAGTTCCTGTGGGATTTCTACCAGGGTTCC
ATCTTTGAGGATGCTCCACCCTCGACCAGCATCCTCCAGGTCTCTGCCACGGACCGGGACTCAGGTCCCAATGGGCGTCTGCTGTACACC
TTCCAGGGTGGGGACGACGGCGATGGGGACTTCTACATCGAGCCCACGTCCGGTGTGATTCGCACCCAGCGCCGGCTGGACCGGGAGAAT
GTGGCCGTGTACAACCTTTGGGCTCTGGCTGTGGATCGGGGCAGTCCCACTCCCCTTAGCGCCTCGGTAGAAATCCAGGTGACCATCTTG
GACATTAATGACAATGCCCCCATGTTTGAGAAGGACGAACTGGAGCTGTTTGTTGAGGAGAACAACCCAGTGGGGTCGGTGGTGGCAAAG
ATTCGTGCTAACGACCCTGATGAAGGCCCTAATGCCCAGATCATGTATCAGATTGTGGAAGGGGACATGCGGCATTTCTTCCAGCTGGAC
CTGCTCAACGGGGACCTGCGTGCCATGGTGGAGCTGGACTTTGAGGTCCGGCGGGAGTATGTGCTGGTGGTGCAGGCCACGTCGGCTCCG
CTGGTGAGCCGAGCCACGGTGCACATCCTTCTCGTGGACCAGAATGACAACCCGCCTGTGCTGCCCGACTTCCAGATCCTCTTCAACAAC
TATGTCACCAACAAGTCCAACAGTTTCCCCACCGGCGTGATCGGCTGCATCCCGGCCCATGACCCCGACGTGTCAGACAGCCTCAACTAC
ACCTTCGTGCAGGGCAACGAGCTGCGCCTGTTGCTGCTGGACCCCGCCACGGGCGAACTGCAGCTCAGCCGCGACCTGGACAACAACCGG
CCGCTGGAGGCGCTCATGGAGGTGTCTGTGTCTGATGGCATCCACAGCGTCACGGCCTTCTGCACCCTGCGTGTCACCATCATCACGGAC
GACATGCTGACCAACAGCATCACTGTCCGCCTGGAGAACATGTCCCAGGAGAAGTTCCTGTCCCCGCTGCTGGCCCTCTTCGTGGAGGGG
GTGGCCGCCGTGCTGTCCACCACCAAGGACGACGTCTTCGTCTTCAACGTCCAGAACGACACCGACGTCAGCTCCAACATCCTGAACGTG
ACCTTCTCGGCGCTGCTGCCTGGCGGCGTCCGCGGCCAGTTCTTCCCGTCGGAGGACCTGCAGGAGCAGATCTACCTGAATCGGACGCTG
CTGACCACCATCTCCACGCAGCGCGTGCTGCCCTTCGACGACAACATCTGCCTGCGCGAGCCCTGCGAGAACTACATGAAGTGCGTGTCC
GTTCTGCGATTCGACAGCTCCGCGCCCTTCCTCAGCTCCACCACCGTGCTCTTCCGGCCCATCCACCCCATCAACGGCCTGCGCTGCCGC
TGCCCGCCCGGCTTCACCGGCGACTACTGCGAGACGGAGATCGACCTCTGCTACTCCGACCCGTGCGGCGCCAACGGCCGCTGCCGCAGC
CGCGAGGGCGGCTACACCTGCGAGTGCTTCGAGGACTTCACTGATACAAACTGGACTCTCAGGACAAAACGACACCAGCCAAACCAGCAG
CCCCTCAGCATCCAGCAACATAAGCGGAGGCATTTTCCTTTTCTTCGTGGCCAATGCCATAATCCACCTCTTCTGCTTCAGTTGAGGTGA
CACGTCTCAGCCTTAGCCCTGTGCCCCCTGAAACAGCTGCCACCATCACTCGCAAGAGAATCCCCTCCATCTTTGGGAGGGGTTGATGCC
AGACATCACCAGGTTGTAGAAGTTGACAGGCAGTGCCATGGGGGCAACAGCCAAAATAGGGGGGTAATGATGTAGGGGCCAAGCAGTGCC

>In-frame_ENST00000262738_ENST00000374213_TCGA-62-8394-01A_CELSR1_chr22_46859604_-_CD52_chr1_26646662_length(amino acids)=1439AA_start in transcript=0_stop in transcript=4319
MAPPPPPVLPVLLLLAAAAALPAMGLRAAAWEPRVPGGTRAFALRPGCTYAVGAACTPRAPRELLDVGRDGRLAGRRRVSGAGRPLPLQV
RLVARSAPTALSRRLRARTHLPGCGARARLCGTGARLCGALCFPVPGGCAAAQHSALAAPTTLPACRCPPRPRPRCPGRPICLPPGGSVR
LRLLCALRRAAGAVRVGLALEAATAGTPSASPSPSPPLPPNLPEARAGPARRARRGTSGRGSLKFPMPNYQVALFENEPAGTLILQLHAH
YTIEGEEERVSYYMEGLFDERSRGYFRIDSATGAVSTDSVLDRETKETHVLRVKAVDYSTPPRSATTYITVLVKDTNDHSPVFEQSEYRE
RVRENLEVGYEVLTIRASDRDSPINANLRYRVLGGAWDVFQLNESSGVVSTRAVLDREEAAEYQLLVEANDQGRNPGPLSATATVYIEVE
DENDNYPQFSEQNYVVQVPEDVGLNTAVLRVQATDRDQGQNAAIHYSILSGNVAGQFYLHSLSGILDVINPLDFEDVQKYSLSIKAQDGG
RPPLINSSGVVSVQVLDVNDNEPIFVSSPFQATVLENVPLGYPVVHIQAVDADSGENARLHYRLVDTASTFLGGGSAGPKNPAPTPDFPF
QIHNSSGWITVCAELDREEVEHYSFGVEAVDHGSPPMSSSTSVSITVLDVNDNDPVFTQPTYELRLNEDAAVGSSVLTLQARDRDANSVI
TYQLTGGNTRNRFALSSQRGGGLITLALPLDYKQEQQYVLAVTASDGTRSHTAHVLINVTDANTHRPVFQSSHYTVSVSEDRPVGTSIAT
LSANDEDTGENARITYVIQDPVPQFRIDPDSGTMYTMMELDYENQVAYTLTIMAQDNGIPQKSDTTTLEILILDANDNAPQFLWDFYQGS
IFEDAPPSTSILQVSATDRDSGPNGRLLYTFQGGDDGDGDFYIEPTSGVIRTQRRLDRENVAVYNLWALAVDRGSPTPLSASVEIQVTIL
DINDNAPMFEKDELELFVEENNPVGSVVAKIRANDPDEGPNAQIMYQIVEGDMRHFFQLDLLNGDLRAMVELDFEVRREYVLVVQATSAP
LVSRATVHILLVDQNDNPPVLPDFQILFNNYVTNKSNSFPTGVIGCIPAHDPDVSDSLNYTFVQGNELRLLLLDPATGELQLSRDLDNNR
PLEALMEVSVSDGIHSVTAFCTLRVTIITDDMLTNSITVRLENMSQEKFLSPLLALFVEGVAAVLSTTKDDVFVFNVQNDTDVSSNILNV
TFSALLPGGVRGQFFPSEDLQEQIYLNRTLLTTISTQRVLPFDDNICLREPCENYMKCVSVLRFDSSAPFLSSTTVLFRPIHPINGLRCR

--------------------------------------------------------------
>In-frame_ENST00000395964_ENST00000374213_TCGA-62-8394-01A_CELSR1_chr22_46859604_-_CD52_chr1_26646662_length(transcript)=4536nt_BP=4183nt
ATGGCGCCGCCGCCGCCGCCCGTGCTGCCCGTGCTGCTGCTCCTGGCCGCCGCCGCCGCCCTGCCGGCGATGGGGCTGCGAGCGGCCGCC
TGGGAGCCGCGCGTACCCGGCGGGACCCGCGCCTTCGCCCTCCGGCCCGGCTGTACCTACGCGGTGGGCGCCGCTTGCACGCCCCGGGCG
CCGCGGGAGCTGCTGGACGTGGGCCGCGATGGGCGGCTGGCAGGACGTCGGCGCGTCTCGGGCGCGGGGCGCCCGCTGCCGCTGCAAGTC
CGCTTGGTGGCCCGCAGTGCCCCGACGGCGCTGAGCCGCCGCCTGCGGGCGCGCACGCACCTTCCCGGCTGCGGAGCCCGTGCCCGGCTC
TGCGGAACCGGTGCCCGGCTCTGCGGGGCGCTCTGCTTCCCCGTCCCCGGCGGCTGCGCGGCCGCGCAGCATTCGGCGCTCGCAGCTCCG
ACCACCTTACCCGCCTGCCGCTGCCCGCCGCGCCCCAGGCCCCGCTGTCCCGGCCGTCCCATCTGCCTGCCGCCGGGCGGCTCGGTCCGC
CTGCGTCTGCTGTGCGCCCTGCGGCGCGCGGCTGGCGCCGTCCGGGTGGGACTGGCGCTGGAGGCCGCCACCGCGGGGACGCCCTCCGCG
TCGCCATCCCCATCGCCGCCCCTGCCGCCGAACTTGCCCGAAGCCCGGGCGGGGCCGGCGCGACGGGCCCGGCGGGGCACGAGCGGCAGA
GGGAGCCTGAAGTTTCCGATGCCCAACTACCAGGTGGCGTTGTTTGAGAACGAACCGGCGGGCACCCTCATCCTCCAGCTGCACGCGCAC
TACACCATCGAGGGCGAGGAGGAGCGCGTGAGCTATTACATGGAGGGGCTGTTCGACGAGCGCTCCCGGGGCTACTTCCGAATCGACTCT
GCCACGGGCGCCGTGAGCACGGACAGCGTACTGGACCGCGAGACCAAGGAGACGCACGTCCTCAGGGTGAAAGCCGTGGACTACAGTACG
CCGCCGCGCTCGGCCACCACCTACATCACTGTCTTGGTCAAAGACACCAACGACCACAGCCCGGTCTTCGAGCAGTCGGAGTACCGCGAG
CGCGTGCGGGAGAACCTGGAGGTGGGCTACGAGGTGCTGACCATCCGCGCCAGCGACCGCGACTCGCCCATCAACGCCAACTTGCGTTAC
CGCGTGTTGGGGGGCGCGTGGGACGTCTTCCAGCTCAACGAGAGCTCTGGCGTGGTGAGCACACGGGCGGTGCTGGACCGGGAGGAGGCG
GCCGAGTACCAGCTCCTGGTGGAGGCCAACGACCAGGGGCGCAATCCGGGCCCGCTCAGTGCCACGGCCACCGTGTACATCGAGGTGGAG
GACGAGAACGACAACTACCCCCAGTTCAGCGAGCAGAACTACGTGGTCCAGGTGCCCGAGGACGTGGGGCTCAACACGGCTGTGCTGCGA
GTGCAGGCCACGGACCGGGACCAGGGCCAGAACGCGGCCATTCACTACAGCATCCTCAGCGGGAACGTGGCCGGCCAGTTCTACCTGCAC
TCGCTGAGCGGGATCCTGGATGTGATCAACCCCTTGGATTTCGAGGATGTCCAGAAATACTCGCTGAGCATTAAGGCCCAGGATGGGGGC
CGGCCCCCGCTCATCAATTCTTCAGGGGTGGTGTCTGTGCAGGTGCTGGATGTCAACGACAACGAGCCTATCTTTGTGAGCAGCCCCTTC
CAGGCCACGGTGCTGGAGAATGTGCCCCTGGGCTACCCCGTGGTGCACATTCAGGCGGTGGACGCGGACTCTGGAGAGAACGCCCGGCTG
CACTATCGCCTGGTGGACACGGCCTCCACCTTTCTGGGGGGCGGCAGCGCTGGGCCTAAGAATCCTGCCCCCACCCCTGACTTCCCCTTC
CAGATCCACAACAGCTCCGGTTGGATCACAGTGTGTGCCGAGCTGGACCGCGAGGAGGTGGAGCACTACAGCTTCGGGGTGGAGGCGGTG
GACCACGGCTCGCCCCCCATGAGCTCCTCCACCAGCGTGTCCATCACGGTGCTGGACGTGAATGACAACGACCCGGTGTTCACGCAGCCC
ACCTACGAGCTTCGTCTGAATGAGGATGCGGCCGTGGGGAGCAGCGTGCTGACCCTGCAGGCCCGCGACCGTGACGCCAACAGTGTGATT
ACCTACCAGCTCACAGGCGGCAACACCCGGAACCGCTTTGCACTCAGCAGCCAGAGAGGGGGCGGCCTCATCACCCTGGCGCTACCTCTG
GACTACAAGCAGGAGCAGCAGTACGTGCTGGCGGTGACAGCATCCGACGGCACACGGTCGCACACTGCGCATGTCCTAATCAACGTCACT
GATGCCAACACCCACAGGCCTGTCTTTCAGAGCTCCCATTACACAGTGAGTGTCAGTGAGGACAGGCCTGTGGGCACCTCCATTGCTACC
CTCAGTGCCAACGATGAGGACACAGGAGAGAATGCCCGCATCACCTACGTGATTCAGGACCCCGTGCCGCAGTTCCGCATTGACCCCGAC
AGTGGCACCATGTACACCATGATGGAGCTGGACTATGAGAACCAGGTCGCCTACACGCTGACCATCATGGCCCAGGACAACGGCATCCCG
CAGAAATCAGACACCACCACCCTAGAGATCCTCATCCTCGATGCCAATGACAATGCACCCCAGTTCCTGTGGGATTTCTACCAGGGTTCC
ATCTTTGAGGATGCTCCACCCTCGACCAGCATCCTCCAGGTCTCTGCCACGGACCGGGACTCAGGTCCCAATGGGCGTCTGCTGTACACC
TTCCAGGGTGGGGACGACGGCGATGGGGACTTCTACATCGAGCCCACGTCCGGTGTGATTCGCACCCAGCGCCGGCTGGACCGGGAGAAT
GTGGCCGTGTACAACCTTTGGGCTCTGGCTGTGGATCGGGGCAGTCCCACTCCCCTTAGCGCCTCGGTAGAAATCCAGGTGACCATCTTG
GACATTAATGACAATGCCCCCATGTTTGAGAAGGACGAACTGGAGCTGTTTGTTGAGGAGAACAACCCAGTGGGGTCGGTGGTGGCAAAG
ATTCGTGCTAACGACCCTGATGAAGGCCCTAATGCCCAGATCATGTATCAGATTGTGGAAGGGGACATGCGGCATTTCTTCCAGCTGGAC
CTGCTCAACGGGGACCTGCGTGCCATGGTGGAGCTGGACTTTGAGGTCCGGCGGGAGTATGTGCTGGTGGTGCAGGCCACGTCGGCTCCG
CTGGTGAGCCGAGCCACGGTGCACATCCTTCTCGTGGACCAGAATGACAACCCGCCTGTGCTGCCCGACTTCCAGATCCTCTTCAACAAC
TATGTCACCAACAAGTCCAACAGTTTCCCCACCGGCGTGATCGGCTGCATCCCGGCCCATGACCCCGACGTGTCAGACAGCCTCAACTAC
ACCTTCGTGCAGGGCAACGAGCTGCGCCTGTTGCTGCTGGACCCCGCCACGGGCGAACTGCAGCTCAGCCGCGACCTGGACAACAACCGG
CCGCTGGAGGCGCTCATGGAGGTGTCTGTGTCTGATGGCATCCACAGCGTCACGGCCTTCTGCACCCTGCGTGTCACCATCATCACGGAC
GACATGCTGACCAACAGCATCACTGTCCGCCTGGAGAACATGTCCCAGGAGAAGTTCCTGTCCCCGCTGCTGGCCCTCTTCGTGGAGGGG
GTGGCCGCCGTGCTGTCCACCACCAAGGACGACGTCTTCGTCTTCAACGTCCAGAACGACACCGACGTCAGCTCCAACATCCTGAACGTG
ACCTTCTCGGCGCTGCTGCCTGGCGGCGTCCGCGGCCAGTTCTTCCCGTCGGAGGACCTGCAGGAGCAGATCTACCTGAATCGGACGCTG
CTGACCACCATCTCCACGCAGCGCGTGCTGCCCTTCGACGACAACATCTGCCTGCGCGAGCCCTGCGAGAACTACATGAAGTGCGTGTCC
GTTCTGCGATTCGACAGCTCCGCGCCCTTCCTCAGCTCCACCACCGTGCTCTTCCGGCCCATCCACCCCATCAACGGCCTGCGCTGCCGC
TGCCCGCCCGGCTTCACCGGCGACTACTGCGAGACGGAGATCGACCTCTGCTACTCCGACCCGTGCGGCGCCAACGGCCGCTGCCGCAGC
CGCGAGGGCGGCTACACCTGCGAGTGCTTCGAGGACTTCACTGATACAAACTGGACTCTCAGGACAAAACGACACCAGCCAAACCAGCAG
CCCCTCAGCATCCAGCAACATAAGCGGAGGCATTTTCCTTTTCTTCGTGGCCAATGCCATAATCCACCTCTTCTGCTTCAGTTGAGGTGA
CACGTCTCAGCCTTAGCCCTGTGCCCCCTGAAACAGCTGCCACCATCACTCGCAAGAGAATCCCCTCCATCTTTGGGAGGGGTTGATGCC
AGACATCACCAGGTTGTAGAAGTTGACAGGCAGTGCCATGGGGGCAACAGCCAAAATAGGGGGGTAATGATGTAGGGGCCAAGCAGTGCC

>In-frame_ENST00000395964_ENST00000374213_TCGA-62-8394-01A_CELSR1_chr22_46859604_-_CD52_chr1_26646662_length(amino acids)=1439AA_start in transcript=0_stop in transcript=4319
MAPPPPPVLPVLLLLAAAAALPAMGLRAAAWEPRVPGGTRAFALRPGCTYAVGAACTPRAPRELLDVGRDGRLAGRRRVSGAGRPLPLQV
RLVARSAPTALSRRLRARTHLPGCGARARLCGTGARLCGALCFPVPGGCAAAQHSALAAPTTLPACRCPPRPRPRCPGRPICLPPGGSVR
LRLLCALRRAAGAVRVGLALEAATAGTPSASPSPSPPLPPNLPEARAGPARRARRGTSGRGSLKFPMPNYQVALFENEPAGTLILQLHAH
YTIEGEEERVSYYMEGLFDERSRGYFRIDSATGAVSTDSVLDRETKETHVLRVKAVDYSTPPRSATTYITVLVKDTNDHSPVFEQSEYRE
RVRENLEVGYEVLTIRASDRDSPINANLRYRVLGGAWDVFQLNESSGVVSTRAVLDREEAAEYQLLVEANDQGRNPGPLSATATVYIEVE
DENDNYPQFSEQNYVVQVPEDVGLNTAVLRVQATDRDQGQNAAIHYSILSGNVAGQFYLHSLSGILDVINPLDFEDVQKYSLSIKAQDGG
RPPLINSSGVVSVQVLDVNDNEPIFVSSPFQATVLENVPLGYPVVHIQAVDADSGENARLHYRLVDTASTFLGGGSAGPKNPAPTPDFPF
QIHNSSGWITVCAELDREEVEHYSFGVEAVDHGSPPMSSSTSVSITVLDVNDNDPVFTQPTYELRLNEDAAVGSSVLTLQARDRDANSVI
TYQLTGGNTRNRFALSSQRGGGLITLALPLDYKQEQQYVLAVTASDGTRSHTAHVLINVTDANTHRPVFQSSHYTVSVSEDRPVGTSIAT
LSANDEDTGENARITYVIQDPVPQFRIDPDSGTMYTMMELDYENQVAYTLTIMAQDNGIPQKSDTTTLEILILDANDNAPQFLWDFYQGS
IFEDAPPSTSILQVSATDRDSGPNGRLLYTFQGGDDGDGDFYIEPTSGVIRTQRRLDRENVAVYNLWALAVDRGSPTPLSASVEIQVTIL
DINDNAPMFEKDELELFVEENNPVGSVVAKIRANDPDEGPNAQIMYQIVEGDMRHFFQLDLLNGDLRAMVELDFEVRREYVLVVQATSAP
LVSRATVHILLVDQNDNPPVLPDFQILFNNYVTNKSNSFPTGVIGCIPAHDPDVSDSLNYTFVQGNELRLLLLDPATGELQLSRDLDNNR
PLEALMEVSVSDGIHSVTAFCTLRVTIITDDMLTNSITVRLENMSQEKFLSPLLALFVEGVAAVLSTTKDDVFVFNVQNDTDVSSNILNV
TFSALLPGGVRGQFFPSEDLQEQIYLNRTLLTTISTQRVLPFDDNICLREPCENYMKCVSVLRFDSSAPFLSSTTVLFRPIHPINGLRCR

--------------------------------------------------------------
>In-frame_ENST00000262738_ENST00000374213_TCGA-62-8394_CELSR1_chr22_46859603_-_CD52_chr1_26646661_length(transcript)=4536nt_BP=4183nt
ATGGCGCCGCCGCCGCCGCCCGTGCTGCCCGTGCTGCTGCTCCTGGCCGCCGCCGCCGCCCTGCCGGCGATGGGGCTGCGAGCGGCCGCC
TGGGAGCCGCGCGTACCCGGCGGGACCCGCGCCTTCGCCCTCCGGCCCGGCTGTACCTACGCGGTGGGCGCCGCTTGCACGCCCCGGGCG
CCGCGGGAGCTGCTGGACGTGGGCCGCGATGGGCGGCTGGCAGGACGTCGGCGCGTCTCGGGCGCGGGGCGCCCGCTGCCGCTGCAAGTC
CGCTTGGTGGCCCGCAGTGCCCCGACGGCGCTGAGCCGCCGCCTGCGGGCGCGCACGCACCTTCCCGGCTGCGGAGCCCGTGCCCGGCTC
TGCGGAACCGGTGCCCGGCTCTGCGGGGCGCTCTGCTTCCCCGTCCCCGGCGGCTGCGCGGCCGCGCAGCATTCGGCGCTCGCAGCTCCG
ACCACCTTACCCGCCTGCCGCTGCCCGCCGCGCCCCAGGCCCCGCTGTCCCGGCCGTCCCATCTGCCTGCCGCCGGGCGGCTCGGTCCGC
CTGCGTCTGCTGTGCGCCCTGCGGCGCGCGGCTGGCGCCGTCCGGGTGGGACTGGCGCTGGAGGCCGCCACCGCGGGGACGCCCTCCGCG
TCGCCATCCCCATCGCCGCCCCTGCCGCCGAACTTGCCCGAAGCCCGGGCGGGGCCGGCGCGACGGGCCCGGCGGGGCACGAGCGGCAGA
GGGAGCCTGAAGTTTCCGATGCCCAACTACCAGGTGGCGTTGTTTGAGAACGAACCGGCGGGCACCCTCATCCTCCAGCTGCACGCGCAC
TACACCATCGAGGGCGAGGAGGAGCGCGTGAGCTATTACATGGAGGGGCTGTTCGACGAGCGCTCCCGGGGCTACTTCCGAATCGACTCT
GCCACGGGCGCCGTGAGCACGGACAGCGTACTGGACCGCGAGACCAAGGAGACGCACGTCCTCAGGGTGAAAGCCGTGGACTACAGTACG
CCGCCGCGCTCGGCCACCACCTACATCACTGTCTTGGTCAAAGACACCAACGACCACAGCCCGGTCTTCGAGCAGTCGGAGTACCGCGAG
CGCGTGCGGGAGAACCTGGAGGTGGGCTACGAGGTGCTGACCATCCGCGCCAGCGACCGCGACTCGCCCATCAACGCCAACTTGCGTTAC
CGCGTGTTGGGGGGCGCGTGGGACGTCTTCCAGCTCAACGAGAGCTCTGGCGTGGTGAGCACACGGGCGGTGCTGGACCGGGAGGAGGCG
GCCGAGTACCAGCTCCTGGTGGAGGCCAACGACCAGGGGCGCAATCCGGGCCCGCTCAGTGCCACGGCCACCGTGTACATCGAGGTGGAG
GACGAGAACGACAACTACCCCCAGTTCAGCGAGCAGAACTACGTGGTCCAGGTGCCCGAGGACGTGGGGCTCAACACGGCTGTGCTGCGA
GTGCAGGCCACGGACCGGGACCAGGGCCAGAACGCGGCCATTCACTACAGCATCCTCAGCGGGAACGTGGCCGGCCAGTTCTACCTGCAC
TCGCTGAGCGGGATCCTGGATGTGATCAACCCCTTGGATTTCGAGGATGTCCAGAAATACTCGCTGAGCATTAAGGCCCAGGATGGGGGC
CGGCCCCCGCTCATCAATTCTTCAGGGGTGGTGTCTGTGCAGGTGCTGGATGTCAACGACAACGAGCCTATCTTTGTGAGCAGCCCCTTC
CAGGCCACGGTGCTGGAGAATGTGCCCCTGGGCTACCCCGTGGTGCACATTCAGGCGGTGGACGCGGACTCTGGAGAGAACGCCCGGCTG
CACTATCGCCTGGTGGACACGGCCTCCACCTTTCTGGGGGGCGGCAGCGCTGGGCCTAAGAATCCTGCCCCCACCCCTGACTTCCCCTTC
CAGATCCACAACAGCTCCGGTTGGATCACAGTGTGTGCCGAGCTGGACCGCGAGGAGGTGGAGCACTACAGCTTCGGGGTGGAGGCGGTG
GACCACGGCTCGCCCCCCATGAGCTCCTCCACCAGCGTGTCCATCACGGTGCTGGACGTGAATGACAACGACCCGGTGTTCACGCAGCCC
ACCTACGAGCTTCGTCTGAATGAGGATGCGGCCGTGGGGAGCAGCGTGCTGACCCTGCAGGCCCGCGACCGTGACGCCAACAGTGTGATT
ACCTACCAGCTCACAGGCGGCAACACCCGGAACCGCTTTGCACTCAGCAGCCAGAGAGGGGGCGGCCTCATCACCCTGGCGCTACCTCTG
GACTACAAGCAGGAGCAGCAGTACGTGCTGGCGGTGACAGCATCCGACGGCACACGGTCGCACACTGCGCATGTCCTAATCAACGTCACT
GATGCCAACACCCACAGGCCTGTCTTTCAGAGCTCCCATTACACAGTGAGTGTCAGTGAGGACAGGCCTGTGGGCACCTCCATTGCTACC
CTCAGTGCCAACGATGAGGACACAGGAGAGAATGCCCGCATCACCTACGTGATTCAGGACCCCGTGCCGCAGTTCCGCATTGACCCCGAC
AGTGGCACCATGTACACCATGATGGAGCTGGACTATGAGAACCAGGTCGCCTACACGCTGACCATCATGGCCCAGGACAACGGCATCCCG
CAGAAATCAGACACCACCACCCTAGAGATCCTCATCCTCGATGCCAATGACAATGCACCCCAGTTCCTGTGGGATTTCTACCAGGGTTCC
ATCTTTGAGGATGCTCCACCCTCGACCAGCATCCTCCAGGTCTCTGCCACGGACCGGGACTCAGGTCCCAATGGGCGTCTGCTGTACACC
TTCCAGGGTGGGGACGACGGCGATGGGGACTTCTACATCGAGCCCACGTCCGGTGTGATTCGCACCCAGCGCCGGCTGGACCGGGAGAAT
GTGGCCGTGTACAACCTTTGGGCTCTGGCTGTGGATCGGGGCAGTCCCACTCCCCTTAGCGCCTCGGTAGAAATCCAGGTGACCATCTTG
GACATTAATGACAATGCCCCCATGTTTGAGAAGGACGAACTGGAGCTGTTTGTTGAGGAGAACAACCCAGTGGGGTCGGTGGTGGCAAAG
ATTCGTGCTAACGACCCTGATGAAGGCCCTAATGCCCAGATCATGTATCAGATTGTGGAAGGGGACATGCGGCATTTCTTCCAGCTGGAC
CTGCTCAACGGGGACCTGCGTGCCATGGTGGAGCTGGACTTTGAGGTCCGGCGGGAGTATGTGCTGGTGGTGCAGGCCACGTCGGCTCCG
CTGGTGAGCCGAGCCACGGTGCACATCCTTCTCGTGGACCAGAATGACAACCCGCCTGTGCTGCCCGACTTCCAGATCCTCTTCAACAAC
TATGTCACCAACAAGTCCAACAGTTTCCCCACCGGCGTGATCGGCTGCATCCCGGCCCATGACCCCGACGTGTCAGACAGCCTCAACTAC
ACCTTCGTGCAGGGCAACGAGCTGCGCCTGTTGCTGCTGGACCCCGCCACGGGCGAACTGCAGCTCAGCCGCGACCTGGACAACAACCGG
CCGCTGGAGGCGCTCATGGAGGTGTCTGTGTCTGATGGCATCCACAGCGTCACGGCCTTCTGCACCCTGCGTGTCACCATCATCACGGAC
GACATGCTGACCAACAGCATCACTGTCCGCCTGGAGAACATGTCCCAGGAGAAGTTCCTGTCCCCGCTGCTGGCCCTCTTCGTGGAGGGG
GTGGCCGCCGTGCTGTCCACCACCAAGGACGACGTCTTCGTCTTCAACGTCCAGAACGACACCGACGTCAGCTCCAACATCCTGAACGTG
ACCTTCTCGGCGCTGCTGCCTGGCGGCGTCCGCGGCCAGTTCTTCCCGTCGGAGGACCTGCAGGAGCAGATCTACCTGAATCGGACGCTG
CTGACCACCATCTCCACGCAGCGCGTGCTGCCCTTCGACGACAACATCTGCCTGCGCGAGCCCTGCGAGAACTACATGAAGTGCGTGTCC
GTTCTGCGATTCGACAGCTCCGCGCCCTTCCTCAGCTCCACCACCGTGCTCTTCCGGCCCATCCACCCCATCAACGGCCTGCGCTGCCGC
TGCCCGCCCGGCTTCACCGGCGACTACTGCGAGACGGAGATCGACCTCTGCTACTCCGACCCGTGCGGCGCCAACGGCCGCTGCCGCAGC
CGCGAGGGCGGCTACACCTGCGAGTGCTTCGAGGACTTCACTGATACAAACTGGACTCTCAGGACAAAACGACACCAGCCAAACCAGCAG
CCCCTCAGCATCCAGCAACATAAGCGGAGGCATTTTCCTTTTCTTCGTGGCCAATGCCATAATCCACCTCTTCTGCTTCAGTTGAGGTGA
CACGTCTCAGCCTTAGCCCTGTGCCCCCTGAAACAGCTGCCACCATCACTCGCAAGAGAATCCCCTCCATCTTTGGGAGGGGTTGATGCC
AGACATCACCAGGTTGTAGAAGTTGACAGGCAGTGCCATGGGGGCAACAGCCAAAATAGGGGGGTAATGATGTAGGGGCCAAGCAGTGCC

>In-frame_ENST00000262738_ENST00000374213_TCGA-62-8394_CELSR1_chr22_46859603_-_CD52_chr1_26646661_length(amino acids)=1439AA_start in transcript=0_stop in transcript=4319
MAPPPPPVLPVLLLLAAAAALPAMGLRAAAWEPRVPGGTRAFALRPGCTYAVGAACTPRAPRELLDVGRDGRLAGRRRVSGAGRPLPLQV
RLVARSAPTALSRRLRARTHLPGCGARARLCGTGARLCGALCFPVPGGCAAAQHSALAAPTTLPACRCPPRPRPRCPGRPICLPPGGSVR
LRLLCALRRAAGAVRVGLALEAATAGTPSASPSPSPPLPPNLPEARAGPARRARRGTSGRGSLKFPMPNYQVALFENEPAGTLILQLHAH
YTIEGEEERVSYYMEGLFDERSRGYFRIDSATGAVSTDSVLDRETKETHVLRVKAVDYSTPPRSATTYITVLVKDTNDHSPVFEQSEYRE
RVRENLEVGYEVLTIRASDRDSPINANLRYRVLGGAWDVFQLNESSGVVSTRAVLDREEAAEYQLLVEANDQGRNPGPLSATATVYIEVE
DENDNYPQFSEQNYVVQVPEDVGLNTAVLRVQATDRDQGQNAAIHYSILSGNVAGQFYLHSLSGILDVINPLDFEDVQKYSLSIKAQDGG
RPPLINSSGVVSVQVLDVNDNEPIFVSSPFQATVLENVPLGYPVVHIQAVDADSGENARLHYRLVDTASTFLGGGSAGPKNPAPTPDFPF
QIHNSSGWITVCAELDREEVEHYSFGVEAVDHGSPPMSSSTSVSITVLDVNDNDPVFTQPTYELRLNEDAAVGSSVLTLQARDRDANSVI
TYQLTGGNTRNRFALSSQRGGGLITLALPLDYKQEQQYVLAVTASDGTRSHTAHVLINVTDANTHRPVFQSSHYTVSVSEDRPVGTSIAT
LSANDEDTGENARITYVIQDPVPQFRIDPDSGTMYTMMELDYENQVAYTLTIMAQDNGIPQKSDTTTLEILILDANDNAPQFLWDFYQGS
IFEDAPPSTSILQVSATDRDSGPNGRLLYTFQGGDDGDGDFYIEPTSGVIRTQRRLDRENVAVYNLWALAVDRGSPTPLSASVEIQVTIL
DINDNAPMFEKDELELFVEENNPVGSVVAKIRANDPDEGPNAQIMYQIVEGDMRHFFQLDLLNGDLRAMVELDFEVRREYVLVVQATSAP
LVSRATVHILLVDQNDNPPVLPDFQILFNNYVTNKSNSFPTGVIGCIPAHDPDVSDSLNYTFVQGNELRLLLLDPATGELQLSRDLDNNR
PLEALMEVSVSDGIHSVTAFCTLRVTIITDDMLTNSITVRLENMSQEKFLSPLLALFVEGVAAVLSTTKDDVFVFNVQNDTDVSSNILNV
TFSALLPGGVRGQFFPSEDLQEQIYLNRTLLTTISTQRVLPFDDNICLREPCENYMKCVSVLRFDSSAPFLSSTTVLFRPIHPINGLRCR

--------------------------------------------------------------
>In-frame_ENST00000395964_ENST00000374213_TCGA-62-8394_CELSR1_chr22_46859603_-_CD52_chr1_26646661_length(transcript)=4536nt_BP=4183nt
ATGGCGCCGCCGCCGCCGCCCGTGCTGCCCGTGCTGCTGCTCCTGGCCGCCGCCGCCGCCCTGCCGGCGATGGGGCTGCGAGCGGCCGCC
TGGGAGCCGCGCGTACCCGGCGGGACCCGCGCCTTCGCCCTCCGGCCCGGCTGTACCTACGCGGTGGGCGCCGCTTGCACGCCCCGGGCG
CCGCGGGAGCTGCTGGACGTGGGCCGCGATGGGCGGCTGGCAGGACGTCGGCGCGTCTCGGGCGCGGGGCGCCCGCTGCCGCTGCAAGTC
CGCTTGGTGGCCCGCAGTGCCCCGACGGCGCTGAGCCGCCGCCTGCGGGCGCGCACGCACCTTCCCGGCTGCGGAGCCCGTGCCCGGCTC
TGCGGAACCGGTGCCCGGCTCTGCGGGGCGCTCTGCTTCCCCGTCCCCGGCGGCTGCGCGGCCGCGCAGCATTCGGCGCTCGCAGCTCCG
ACCACCTTACCCGCCTGCCGCTGCCCGCCGCGCCCCAGGCCCCGCTGTCCCGGCCGTCCCATCTGCCTGCCGCCGGGCGGCTCGGTCCGC
CTGCGTCTGCTGTGCGCCCTGCGGCGCGCGGCTGGCGCCGTCCGGGTGGGACTGGCGCTGGAGGCCGCCACCGCGGGGACGCCCTCCGCG
TCGCCATCCCCATCGCCGCCCCTGCCGCCGAACTTGCCCGAAGCCCGGGCGGGGCCGGCGCGACGGGCCCGGCGGGGCACGAGCGGCAGA
GGGAGCCTGAAGTTTCCGATGCCCAACTACCAGGTGGCGTTGTTTGAGAACGAACCGGCGGGCACCCTCATCCTCCAGCTGCACGCGCAC
TACACCATCGAGGGCGAGGAGGAGCGCGTGAGCTATTACATGGAGGGGCTGTTCGACGAGCGCTCCCGGGGCTACTTCCGAATCGACTCT
GCCACGGGCGCCGTGAGCACGGACAGCGTACTGGACCGCGAGACCAAGGAGACGCACGTCCTCAGGGTGAAAGCCGTGGACTACAGTACG
CCGCCGCGCTCGGCCACCACCTACATCACTGTCTTGGTCAAAGACACCAACGACCACAGCCCGGTCTTCGAGCAGTCGGAGTACCGCGAG
CGCGTGCGGGAGAACCTGGAGGTGGGCTACGAGGTGCTGACCATCCGCGCCAGCGACCGCGACTCGCCCATCAACGCCAACTTGCGTTAC
CGCGTGTTGGGGGGCGCGTGGGACGTCTTCCAGCTCAACGAGAGCTCTGGCGTGGTGAGCACACGGGCGGTGCTGGACCGGGAGGAGGCG
GCCGAGTACCAGCTCCTGGTGGAGGCCAACGACCAGGGGCGCAATCCGGGCCCGCTCAGTGCCACGGCCACCGTGTACATCGAGGTGGAG
GACGAGAACGACAACTACCCCCAGTTCAGCGAGCAGAACTACGTGGTCCAGGTGCCCGAGGACGTGGGGCTCAACACGGCTGTGCTGCGA
GTGCAGGCCACGGACCGGGACCAGGGCCAGAACGCGGCCATTCACTACAGCATCCTCAGCGGGAACGTGGCCGGCCAGTTCTACCTGCAC
TCGCTGAGCGGGATCCTGGATGTGATCAACCCCTTGGATTTCGAGGATGTCCAGAAATACTCGCTGAGCATTAAGGCCCAGGATGGGGGC
CGGCCCCCGCTCATCAATTCTTCAGGGGTGGTGTCTGTGCAGGTGCTGGATGTCAACGACAACGAGCCTATCTTTGTGAGCAGCCCCTTC
CAGGCCACGGTGCTGGAGAATGTGCCCCTGGGCTACCCCGTGGTGCACATTCAGGCGGTGGACGCGGACTCTGGAGAGAACGCCCGGCTG
CACTATCGCCTGGTGGACACGGCCTCCACCTTTCTGGGGGGCGGCAGCGCTGGGCCTAAGAATCCTGCCCCCACCCCTGACTTCCCCTTC
CAGATCCACAACAGCTCCGGTTGGATCACAGTGTGTGCCGAGCTGGACCGCGAGGAGGTGGAGCACTACAGCTTCGGGGTGGAGGCGGTG
GACCACGGCTCGCCCCCCATGAGCTCCTCCACCAGCGTGTCCATCACGGTGCTGGACGTGAATGACAACGACCCGGTGTTCACGCAGCCC
ACCTACGAGCTTCGTCTGAATGAGGATGCGGCCGTGGGGAGCAGCGTGCTGACCCTGCAGGCCCGCGACCGTGACGCCAACAGTGTGATT
ACCTACCAGCTCACAGGCGGCAACACCCGGAACCGCTTTGCACTCAGCAGCCAGAGAGGGGGCGGCCTCATCACCCTGGCGCTACCTCTG
GACTACAAGCAGGAGCAGCAGTACGTGCTGGCGGTGACAGCATCCGACGGCACACGGTCGCACACTGCGCATGTCCTAATCAACGTCACT
GATGCCAACACCCACAGGCCTGTCTTTCAGAGCTCCCATTACACAGTGAGTGTCAGTGAGGACAGGCCTGTGGGCACCTCCATTGCTACC
CTCAGTGCCAACGATGAGGACACAGGAGAGAATGCCCGCATCACCTACGTGATTCAGGACCCCGTGCCGCAGTTCCGCATTGACCCCGAC
AGTGGCACCATGTACACCATGATGGAGCTGGACTATGAGAACCAGGTCGCCTACACGCTGACCATCATGGCCCAGGACAACGGCATCCCG
CAGAAATCAGACACCACCACCCTAGAGATCCTCATCCTCGATGCCAATGACAATGCACCCCAGTTCCTGTGGGATTTCTACCAGGGTTCC
ATCTTTGAGGATGCTCCACCCTCGACCAGCATCCTCCAGGTCTCTGCCACGGACCGGGACTCAGGTCCCAATGGGCGTCTGCTGTACACC
TTCCAGGGTGGGGACGACGGCGATGGGGACTTCTACATCGAGCCCACGTCCGGTGTGATTCGCACCCAGCGCCGGCTGGACCGGGAGAAT
GTGGCCGTGTACAACCTTTGGGCTCTGGCTGTGGATCGGGGCAGTCCCACTCCCCTTAGCGCCTCGGTAGAAATCCAGGTGACCATCTTG
GACATTAATGACAATGCCCCCATGTTTGAGAAGGACGAACTGGAGCTGTTTGTTGAGGAGAACAACCCAGTGGGGTCGGTGGTGGCAAAG
ATTCGTGCTAACGACCCTGATGAAGGCCCTAATGCCCAGATCATGTATCAGATTGTGGAAGGGGACATGCGGCATTTCTTCCAGCTGGAC
CTGCTCAACGGGGACCTGCGTGCCATGGTGGAGCTGGACTTTGAGGTCCGGCGGGAGTATGTGCTGGTGGTGCAGGCCACGTCGGCTCCG
CTGGTGAGCCGAGCCACGGTGCACATCCTTCTCGTGGACCAGAATGACAACCCGCCTGTGCTGCCCGACTTCCAGATCCTCTTCAACAAC
TATGTCACCAACAAGTCCAACAGTTTCCCCACCGGCGTGATCGGCTGCATCCCGGCCCATGACCCCGACGTGTCAGACAGCCTCAACTAC
ACCTTCGTGCAGGGCAACGAGCTGCGCCTGTTGCTGCTGGACCCCGCCACGGGCGAACTGCAGCTCAGCCGCGACCTGGACAACAACCGG
CCGCTGGAGGCGCTCATGGAGGTGTCTGTGTCTGATGGCATCCACAGCGTCACGGCCTTCTGCACCCTGCGTGTCACCATCATCACGGAC
GACATGCTGACCAACAGCATCACTGTCCGCCTGGAGAACATGTCCCAGGAGAAGTTCCTGTCCCCGCTGCTGGCCCTCTTCGTGGAGGGG
GTGGCCGCCGTGCTGTCCACCACCAAGGACGACGTCTTCGTCTTCAACGTCCAGAACGACACCGACGTCAGCTCCAACATCCTGAACGTG
ACCTTCTCGGCGCTGCTGCCTGGCGGCGTCCGCGGCCAGTTCTTCCCGTCGGAGGACCTGCAGGAGCAGATCTACCTGAATCGGACGCTG
CTGACCACCATCTCCACGCAGCGCGTGCTGCCCTTCGACGACAACATCTGCCTGCGCGAGCCCTGCGAGAACTACATGAAGTGCGTGTCC
GTTCTGCGATTCGACAGCTCCGCGCCCTTCCTCAGCTCCACCACCGTGCTCTTCCGGCCCATCCACCCCATCAACGGCCTGCGCTGCCGC
TGCCCGCCCGGCTTCACCGGCGACTACTGCGAGACGGAGATCGACCTCTGCTACTCCGACCCGTGCGGCGCCAACGGCCGCTGCCGCAGC
CGCGAGGGCGGCTACACCTGCGAGTGCTTCGAGGACTTCACTGATACAAACTGGACTCTCAGGACAAAACGACACCAGCCAAACCAGCAG
CCCCTCAGCATCCAGCAACATAAGCGGAGGCATTTTCCTTTTCTTCGTGGCCAATGCCATAATCCACCTCTTCTGCTTCAGTTGAGGTGA
CACGTCTCAGCCTTAGCCCTGTGCCCCCTGAAACAGCTGCCACCATCACTCGCAAGAGAATCCCCTCCATCTTTGGGAGGGGTTGATGCC
AGACATCACCAGGTTGTAGAAGTTGACAGGCAGTGCCATGGGGGCAACAGCCAAAATAGGGGGGTAATGATGTAGGGGCCAAGCAGTGCC

>In-frame_ENST00000395964_ENST00000374213_TCGA-62-8394_CELSR1_chr22_46859603_-_CD52_chr1_26646661_length(amino acids)=1439AA_start in transcript=0_stop in transcript=4319
MAPPPPPVLPVLLLLAAAAALPAMGLRAAAWEPRVPGGTRAFALRPGCTYAVGAACTPRAPRELLDVGRDGRLAGRRRVSGAGRPLPLQV
RLVARSAPTALSRRLRARTHLPGCGARARLCGTGARLCGALCFPVPGGCAAAQHSALAAPTTLPACRCPPRPRPRCPGRPICLPPGGSVR
LRLLCALRRAAGAVRVGLALEAATAGTPSASPSPSPPLPPNLPEARAGPARRARRGTSGRGSLKFPMPNYQVALFENEPAGTLILQLHAH
YTIEGEEERVSYYMEGLFDERSRGYFRIDSATGAVSTDSVLDRETKETHVLRVKAVDYSTPPRSATTYITVLVKDTNDHSPVFEQSEYRE
RVRENLEVGYEVLTIRASDRDSPINANLRYRVLGGAWDVFQLNESSGVVSTRAVLDREEAAEYQLLVEANDQGRNPGPLSATATVYIEVE
DENDNYPQFSEQNYVVQVPEDVGLNTAVLRVQATDRDQGQNAAIHYSILSGNVAGQFYLHSLSGILDVINPLDFEDVQKYSLSIKAQDGG
RPPLINSSGVVSVQVLDVNDNEPIFVSSPFQATVLENVPLGYPVVHIQAVDADSGENARLHYRLVDTASTFLGGGSAGPKNPAPTPDFPF
QIHNSSGWITVCAELDREEVEHYSFGVEAVDHGSPPMSSSTSVSITVLDVNDNDPVFTQPTYELRLNEDAAVGSSVLTLQARDRDANSVI
TYQLTGGNTRNRFALSSQRGGGLITLALPLDYKQEQQYVLAVTASDGTRSHTAHVLINVTDANTHRPVFQSSHYTVSVSEDRPVGTSIAT
LSANDEDTGENARITYVIQDPVPQFRIDPDSGTMYTMMELDYENQVAYTLTIMAQDNGIPQKSDTTTLEILILDANDNAPQFLWDFYQGS
IFEDAPPSTSILQVSATDRDSGPNGRLLYTFQGGDDGDGDFYIEPTSGVIRTQRRLDRENVAVYNLWALAVDRGSPTPLSASVEIQVTIL
DINDNAPMFEKDELELFVEENNPVGSVVAKIRANDPDEGPNAQIMYQIVEGDMRHFFQLDLLNGDLRAMVELDFEVRREYVLVVQATSAP
LVSRATVHILLVDQNDNPPVLPDFQILFNNYVTNKSNSFPTGVIGCIPAHDPDVSDSLNYTFVQGNELRLLLLDPATGELQLSRDLDNNR
PLEALMEVSVSDGIHSVTAFCTLRVTIITDDMLTNSITVRLENMSQEKFLSPLLALFVEGVAAVLSTTKDDVFVFNVQNDTDVSSNILNV
TFSALLPGGVRGQFFPSEDLQEQIYLNRTLLTTISTQRVLPFDDNICLREPCENYMKCVSVLRFDSSAPFLSSTTVLFRPIHPINGLRCR

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for CELSR1-CD52


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for CELSR1-CD52


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for CELSR1-CD52


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource
HgeneCELSR1C1704423Milroy Disease1GENOMICS_ENGLAND
HgeneCELSR1C3891448NEURAL TUBE DEFECTS, SUSCEPTIBILITY TO1UNIPROT