FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:CELSR1-CERK (FusionGDB2 ID:15643)

Fusion Gene Summary for CELSR1-CERK

check button Fusion gene summary
Fusion gene informationFusion gene name: CELSR1-CERK
Fusion gene ID: 15643
HgeneTgene
Gene symbol

CELSR1

CERK

Gene ID

9620

64781

Gene namecadherin EGF LAG seven-pass G-type receptor 1ceramide kinase
SynonymsADGRC1|CDHF9|FMI2|HFMI2|ME2LK4|dA59H18.2|dA59H18.3|hCERK
Cytomap

22q13.31

22q13.31

Type of geneprotein-codingprotein-coding
Descriptioncadherin EGF LAG seven-pass G-type receptor 1adhesion G protein-coupled receptor C1cadherin family member 9cadherin, EGF LAG seven-pass G-type receptor 1 (flamingo homolog, Drosophila)flamingo homolog 2protocadherin flamingo 2ceramide kinaseacylsphingosine kinaselipid kinase 4lipid kinase LK4
Modification date2020032220200313
UniProtAcc

Q9NYQ6

Q49MI3

Ensembl transtripts involved in fusion geneENST00000262738, ENST00000395964, 
ENST00000497509, 
ENST00000216264, 
ENST00000471929, ENST00000541677, 
Fusion gene scores* DoF score23 X 19 X 11=480716 X 13 X 6=1248
# samples 2817
** MAII scorelog2(28/4807*10)=-4.10163807119293
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(17/1248*10)=-2.87601128272455
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: CELSR1 [Title/Abstract] AND CERK [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointCERK(47133904)-CELSR1(46835308), # samples:1
CELSR1(46929524)-CERK(47083103), # samples:3
Anticipated loss of major functional domain due to fusion event.CELSR1-CERK seems lost the major protein functional domain in Hgene partner, which is a essential gene due to the frame-shifted ORF.
CELSR1-CERK seems lost the major protein functional domain in Hgene partner, which is a IUPHAR drug target due to the frame-shifted ORF.
CELSR1-CERK seems lost the major protein functional domain in Tgene partner, which is a cell metabolism gene due to the frame-shifted ORF.
CELSR1-CERK seems lost the major protein functional domain in Tgene partner, which is a essential gene due to the frame-shifted ORF.
CELSR1-CERK seems lost the major protein functional domain in Tgene partner, which is a IUPHAR drug target due to the frame-shifted ORF.
CERK-CELSR1 seems lost the major protein functional domain in Hgene partner, which is a cell metabolism gene due to the frame-shifted ORF.
CERK-CELSR1 seems lost the major protein functional domain in Hgene partner, which is a essential gene due to the frame-shifted ORF.
CERK-CELSR1 seems lost the major protein functional domain in Hgene partner, which is a IUPHAR drug target due to the frame-shifted ORF.
CERK-CELSR1 seems lost the major protein functional domain in Tgene partner, which is a essential gene due to the frame-shifted ORF.
CERK-CELSR1 seems lost the major protein functional domain in Tgene partner, which is a IUPHAR drug target due to the frame-shifted ORF.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
TgeneCERK

GO:0006672

ceramide metabolic process

19501188


check buttonFusion gene breakpoints across CELSR1 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across CERK (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4BRCATCGA-E2-A14P-01ACELSR1chr22

46929523

-CERKchr22

47086096

-
ChimerDB4BRCATCGA-E2-A14P-01ACELSR1chr22

46929524

-CERKchr22

47083103

-
ChimerDB4BRCATCGA-E2-A14P-01ACELSR1chr22

46929524

-CERKchr22

47086097

-
ChimerDB4STADTCGA-CD-A4MH-01ACELSR1chr22

46829290

-CERKchr22

47089385

-
ChimerDB4UCECTCGA-B5-A1N2-01ACELSR1chr22

46829290

-CERKchr22

47082536

-


Top

Fusion Gene ORF analysis for CELSR1-CERK

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
5CDS-3UTRENST00000262738ENST00000216264CELSR1chr22

46829290

-CERKchr22

47082536

-
5CDS-5UTRENST00000262738ENST00000471929CELSR1chr22

46929523

-CERKchr22

47086096

-
5CDS-5UTRENST00000262738ENST00000471929CELSR1chr22

46929524

-CERKchr22

47086097

-
5CDS-5UTRENST00000262738ENST00000471929CELSR1chr22

46829290

-CERKchr22

47089385

-
5CDS-5UTRENST00000395964ENST00000471929CELSR1chr22

46929523

-CERKchr22

47086096

-
5CDS-5UTRENST00000395964ENST00000471929CELSR1chr22

46929524

-CERKchr22

47086097

-
5CDS-intronENST00000262738ENST00000471929CELSR1chr22

46929524

-CERKchr22

47083103

-
5CDS-intronENST00000262738ENST00000471929CELSR1chr22

46829290

-CERKchr22

47082536

-
5CDS-intronENST00000262738ENST00000541677CELSR1chr22

46829290

-CERKchr22

47082536

-
5CDS-intronENST00000395964ENST00000471929CELSR1chr22

46929524

-CERKchr22

47083103

-
5UTR-3CDSENST00000497509ENST00000216264CELSR1chr22

46929523

-CERKchr22

47086096

-
5UTR-3CDSENST00000497509ENST00000216264CELSR1chr22

46929524

-CERKchr22

47083103

-
5UTR-3CDSENST00000497509ENST00000216264CELSR1chr22

46929524

-CERKchr22

47086097

-
5UTR-3CDSENST00000497509ENST00000541677CELSR1chr22

46929523

-CERKchr22

47086096

-
5UTR-3CDSENST00000497509ENST00000541677CELSR1chr22

46929524

-CERKchr22

47083103

-
5UTR-3CDSENST00000497509ENST00000541677CELSR1chr22

46929524

-CERKchr22

47086097

-
5UTR-5UTRENST00000497509ENST00000471929CELSR1chr22

46929523

-CERKchr22

47086096

-
5UTR-5UTRENST00000497509ENST00000471929CELSR1chr22

46929524

-CERKchr22

47086097

-
5UTR-intronENST00000497509ENST00000471929CELSR1chr22

46929524

-CERKchr22

47083103

-
Frame-shiftENST00000262738ENST00000216264CELSR1chr22

46929524

-CERKchr22

47083103

-
Frame-shiftENST00000262738ENST00000216264CELSR1chr22

46829290

-CERKchr22

47089385

-
Frame-shiftENST00000262738ENST00000541677CELSR1chr22

46929524

-CERKchr22

47083103

-
Frame-shiftENST00000262738ENST00000541677CELSR1chr22

46829290

-CERKchr22

47089385

-
Frame-shiftENST00000395964ENST00000216264CELSR1chr22

46929524

-CERKchr22

47083103

-
Frame-shiftENST00000395964ENST00000541677CELSR1chr22

46929524

-CERKchr22

47083103

-
In-frameENST00000262738ENST00000216264CELSR1chr22

46929523

-CERKchr22

47086096

-
In-frameENST00000262738ENST00000216264CELSR1chr22

46929524

-CERKchr22

47086097

-
In-frameENST00000262738ENST00000541677CELSR1chr22

46929523

-CERKchr22

47086096

-
In-frameENST00000262738ENST00000541677CELSR1chr22

46929524

-CERKchr22

47086097

-
In-frameENST00000395964ENST00000216264CELSR1chr22

46929523

-CERKchr22

47086096

-
In-frameENST00000395964ENST00000216264CELSR1chr22

46929524

-CERKchr22

47086097

-
In-frameENST00000395964ENST00000541677CELSR1chr22

46929523

-CERKchr22

47086096

-
In-frameENST00000395964ENST00000541677CELSR1chr22

46929524

-CERKchr22

47086097

-
intron-3CDSENST00000395964ENST00000216264CELSR1chr22

46829290

-CERKchr22

47089385

-
intron-3CDSENST00000395964ENST00000541677CELSR1chr22

46829290

-CERKchr22

47089385

-
intron-3CDSENST00000497509ENST00000216264CELSR1chr22

46829290

-CERKchr22

47089385

-
intron-3CDSENST00000497509ENST00000541677CELSR1chr22

46829290

-CERKchr22

47089385

-
intron-3UTRENST00000395964ENST00000216264CELSR1chr22

46829290

-CERKchr22

47082536

-
intron-3UTRENST00000497509ENST00000216264CELSR1chr22

46829290

-CERKchr22

47082536

-
intron-5UTRENST00000395964ENST00000471929CELSR1chr22

46829290

-CERKchr22

47089385

-
intron-5UTRENST00000497509ENST00000471929CELSR1chr22

46829290

-CERKchr22

47089385

-
intron-intronENST00000395964ENST00000471929CELSR1chr22

46829290

-CERKchr22

47082536

-
intron-intronENST00000395964ENST00000541677CELSR1chr22

46829290

-CERKchr22

47082536

-
intron-intronENST00000497509ENST00000471929CELSR1chr22

46829290

-CERKchr22

47082536

-
intron-intronENST00000497509ENST00000541677CELSR1chr22

46829290

-CERKchr22

47082536

-

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000262738CELSR1chr2246929524-ENST00000216264CERKchr2247086097-65493544035481182
ENST00000262738CELSR1chr2246929524-ENST00000541677CERKchr2247086097-39723544035481182
ENST00000395964CELSR1chr2246929524-ENST00000216264CERKchr2247086097-65493544035481182
ENST00000395964CELSR1chr2246929524-ENST00000541677CERKchr2247086097-39723544035481182
ENST00000262738CELSR1chr2246929523-ENST00000216264CERKchr2247086096-65493544035481182
ENST00000262738CELSR1chr2246929523-ENST00000541677CERKchr2247086096-39723544035481182
ENST00000395964CELSR1chr2246929523-ENST00000216264CERKchr2247086096-65493544035481182
ENST00000395964CELSR1chr2246929523-ENST00000541677CERKchr2247086096-39723544035481182

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000262738ENST00000216264CELSR1chr2246929524-CERKchr2247086097-0.0004725030.99952745
ENST00000262738ENST00000541677CELSR1chr2246929524-CERKchr2247086097-0.0010897620.9989102
ENST00000395964ENST00000216264CELSR1chr2246929524-CERKchr2247086097-0.0004725030.99952745
ENST00000395964ENST00000541677CELSR1chr2246929524-CERKchr2247086097-0.0010897620.9989102
ENST00000262738ENST00000216264CELSR1chr2246929523-CERKchr2247086096-0.0004725030.99952745
ENST00000262738ENST00000541677CELSR1chr2246929523-CERKchr2247086096-0.0010897620.9989102
ENST00000395964ENST00000216264CELSR1chr2246929523-CERKchr2247086096-0.0004725030.99952745
ENST00000395964ENST00000541677CELSR1chr2246929523-CERKchr2247086096-0.0010897620.9989102

Top

Fusion Genomic Features for CELSR1-CERK


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.

Top

Fusion Protein Features for CELSR1-CERK


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr22:47133904/chr22:46835308)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
CELSR1

Q9NYQ6

CERK

Q49MI3

FUNCTION: Receptor that may have an important role in cell/cell signaling during nervous system formation.FUNCTION: Has no detectable ceramide-kinase activity. Overexpression of CERKL protects cells from apoptosis in oxidative stress conditions. {ECO:0000269|PubMed:15708351, ECO:0000269|PubMed:19158957}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1351000_110111813015.0DomainCadherin 8
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-135246_35311813015.0DomainCadherin 1
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-135354_45911813015.0DomainCadherin 2
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-135460_56511813015.0DomainCadherin 3
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-135566_68711813015.0DomainCadherin 4
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-135688_78911813015.0DomainCadherin 5
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-135790_89211813015.0DomainCadherin 6
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-135893_99911813015.0DomainCadherin 7
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1351000_110111813015.0DomainCadherin 8
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-135246_35311813015.0DomainCadherin 1
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-135354_45911813015.0DomainCadherin 2
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-135460_56511813015.0DomainCadherin 3
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-135566_68711813015.0DomainCadherin 4
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-135688_78911813015.0DomainCadherin 5
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-135790_89211813015.0DomainCadherin 6
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-135893_99911813015.0DomainCadherin 7
TgeneCERKchr22:46929523chr22:47086096ENST000002162641013502_504444538.0Nucleotide bindingATP
TgeneCERKchr22:46929524chr22:47086097ENST000002162641013502_504444538.0Nucleotide bindingATP

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1352659_266311813015.0Compositional biasNote=Poly-Leu
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1352659_266311813015.0Compositional biasNote=Poly-Leu
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1351106_122411813015.0DomainCadherin 9
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1351303_136111813015.0DomainEGF-like 1%3B calcium-binding
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1351363_139911813015.0DomainEGF-like 2%3B calcium-binding
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1351403_144111813015.0DomainEGF-like 3%3B calcium-binding
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1351442_164611813015.0DomainLaminin G-like 1
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1351649_168511813015.0DomainEGF-like 4%3B calcium-binding
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1351689_187011813015.0DomainLaminin G-like 2
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1351872_190711813015.0DomainEGF-like 5%3B calcium-binding
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1351908_194611813015.0DomainEGF-like 6%3B calcium-binding
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1351947_197911813015.0DomainEGF-like 7%3B calcium-binding
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1351981_201611813015.0DomainEGF-like 8%3B calcium-binding
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1352003_205011813015.0DomainLaminin EGF-like
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1352408_246011813015.0DomainGPS
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1351106_122411813015.0DomainCadherin 9
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1351303_136111813015.0DomainEGF-like 1%3B calcium-binding
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1351363_139911813015.0DomainEGF-like 2%3B calcium-binding
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1351403_144111813015.0DomainEGF-like 3%3B calcium-binding
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1351442_164611813015.0DomainLaminin G-like 1
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1351649_168511813015.0DomainEGF-like 4%3B calcium-binding
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1351689_187011813015.0DomainLaminin G-like 2
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1351872_190711813015.0DomainEGF-like 5%3B calcium-binding
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1351908_194611813015.0DomainEGF-like 6%3B calcium-binding
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1351947_197911813015.0DomainEGF-like 7%3B calcium-binding
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1351981_201611813015.0DomainEGF-like 8%3B calcium-binding
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1352003_205011813015.0DomainLaminin EGF-like
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1352408_246011813015.0DomainGPS
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-13522_246911813015.0Topological domainExtracellular
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1352491_250111813015.0Topological domainCytoplasmic
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1352523_252711813015.0Topological domainExtracellular
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1352549_257211813015.0Topological domainCytoplasmic
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1352594_261111813015.0Topological domainExtracellular
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1352633_265511813015.0Topological domainCytoplasmic
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1352677_268311813015.0Topological domainExtracellular
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1352705_301411813015.0Topological domainCytoplasmic
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-13522_246911813015.0Topological domainExtracellular
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1352491_250111813015.0Topological domainCytoplasmic
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1352523_252711813015.0Topological domainExtracellular
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1352549_257211813015.0Topological domainCytoplasmic
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1352594_261111813015.0Topological domainExtracellular
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1352633_265511813015.0Topological domainCytoplasmic
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1352677_268311813015.0Topological domainExtracellular
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1352705_301411813015.0Topological domainCytoplasmic
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1352470_249011813015.0TransmembraneHelical%3B Name%3D1
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1352502_252211813015.0TransmembraneHelical%3B Name%3D2
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1352528_254811813015.0TransmembraneHelical%3B Name%3D3
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1352573_259311813015.0TransmembraneHelical%3B Name%3D4
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1352612_263211813015.0TransmembraneHelical%3B Name%3D5
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1352656_267611813015.0TransmembraneHelical%3B Name%3D6
HgeneCELSR1chr22:46929523chr22:47086096ENST00000262738-1352684_270411813015.0TransmembraneHelical%3B Name%3D7
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1352470_249011813015.0TransmembraneHelical%3B Name%3D1
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1352502_252211813015.0TransmembraneHelical%3B Name%3D2
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1352528_254811813015.0TransmembraneHelical%3B Name%3D3
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1352573_259311813015.0TransmembraneHelical%3B Name%3D4
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1352612_263211813015.0TransmembraneHelical%3B Name%3D5
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1352656_267611813015.0TransmembraneHelical%3B Name%3D6
HgeneCELSR1chr22:46929524chr22:47086097ENST00000262738-1352684_270411813015.0TransmembraneHelical%3B Name%3D7
TgeneCERKchr22:46929523chr22:47086096ENST000002162641013128_278444538.0DomainDAGKc
TgeneCERKchr22:46929524chr22:47086097ENST000002162641013128_278444538.0DomainDAGKc
TgeneCERKchr22:46929523chr22:47086096ENST000002162641013138_140444538.0Nucleotide bindingATP
TgeneCERKchr22:46929523chr22:47086096ENST000002162641013170_174444538.0Nucleotide bindingATP
TgeneCERKchr22:46929523chr22:47086096ENST000002162641013239_241444538.0Nucleotide bindingATP
TgeneCERKchr22:46929524chr22:47086097ENST000002162641013138_140444538.0Nucleotide bindingATP
TgeneCERKchr22:46929524chr22:47086097ENST000002162641013170_174444538.0Nucleotide bindingATP
TgeneCERKchr22:46929524chr22:47086097ENST000002162641013239_241444538.0Nucleotide bindingATP
TgeneCERKchr22:46929523chr22:47086096ENST000002162641013195_198444538.0RegionSubstrate binding
TgeneCERKchr22:46929523chr22:47086096ENST0000021626410131_115444538.0RegionEssential for enzyme activity
TgeneCERKchr22:46929523chr22:47086096ENST0000021626410131_125444538.0RegionRequired for binding to sulfatide and phosphoinositides
TgeneCERKchr22:46929524chr22:47086097ENST000002162641013195_198444538.0RegionSubstrate binding
TgeneCERKchr22:46929524chr22:47086097ENST0000021626410131_115444538.0RegionEssential for enzyme activity
TgeneCERKchr22:46929524chr22:47086097ENST0000021626410131_125444538.0RegionRequired for binding to sulfatide and phosphoinositides


Top

Fusion Gene Sequence for CELSR1-CERK


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>15643_15643_1_CELSR1-CERK_CELSR1_chr22_46929523_ENST00000262738_CERK_chr22_47086096_ENST00000216264_length(transcript)=6549nt_BP=3544nt
ATGGCGCCGCCGCCGCCGCCCGTGCTGCCCGTGCTGCTGCTCCTGGCCGCCGCCGCCGCCCTGCCGGCGATGGGGCTGCGAGCGGCCGCC
TGGGAGCCGCGCGTACCCGGCGGGACCCGCGCCTTCGCCCTCCGGCCCGGCTGTACCTACGCGGTGGGCGCCGCTTGCACGCCCCGGGCG
CCGCGGGAGCTGCTGGACGTGGGCCGCGATGGGCGGCTGGCAGGACGTCGGCGCGTCTCGGGCGCGGGGCGCCCGCTGCCGCTGCAAGTC
CGCTTGGTGGCCCGCAGTGCCCCGACGGCGCTGAGCCGCCGCCTGCGGGCGCGCACGCACCTTCCCGGCTGCGGAGCCCGTGCCCGGCTC
TGCGGAACCGGTGCCCGGCTCTGCGGGGCGCTCTGCTTCCCCGTCCCCGGCGGCTGCGCGGCCGCGCAGCATTCGGCGCTCGCAGCTCCG
ACCACCTTACCCGCCTGCCGCTGCCCGCCGCGCCCCAGGCCCCGCTGTCCCGGCCGTCCCATCTGCCTGCCGCCGGGCGGCTCGGTCCGC
CTGCGTCTGCTGTGCGCCCTGCGGCGCGCGGCTGGCGCCGTCCGGGTGGGACTGGCGCTGGAGGCCGCCACCGCGGGGACGCCCTCCGCG
TCGCCATCCCCATCGCCGCCCCTGCCGCCGAACTTGCCCGAAGCCCGGGCGGGGCCGGCGCGACGGGCCCGGCGGGGCACGAGCGGCAGA
GGGAGCCTGAAGTTTCCGATGCCCAACTACCAGGTGGCGTTGTTTGAGAACGAACCGGCGGGCACCCTCATCCTCCAGCTGCACGCGCAC
TACACCATCGAGGGCGAGGAGGAGCGCGTGAGCTATTACATGGAGGGGCTGTTCGACGAGCGCTCCCGGGGCTACTTCCGAATCGACTCT
GCCACGGGCGCCGTGAGCACGGACAGCGTACTGGACCGCGAGACCAAGGAGACGCACGTCCTCAGGGTGAAAGCCGTGGACTACAGTACG
CCGCCGCGCTCGGCCACCACCTACATCACTGTCTTGGTCAAAGACACCAACGACCACAGCCCGGTCTTCGAGCAGTCGGAGTACCGCGAG
CGCGTGCGGGAGAACCTGGAGGTGGGCTACGAGGTGCTGACCATCCGCGCCAGCGACCGCGACTCGCCCATCAACGCCAACTTGCGTTAC
CGCGTGTTGGGGGGCGCGTGGGACGTCTTCCAGCTCAACGAGAGCTCTGGCGTGGTGAGCACACGGGCGGTGCTGGACCGGGAGGAGGCG
GCCGAGTACCAGCTCCTGGTGGAGGCCAACGACCAGGGGCGCAATCCGGGCCCGCTCAGTGCCACGGCCACCGTGTACATCGAGGTGGAG
GACGAGAACGACAACTACCCCCAGTTCAGCGAGCAGAACTACGTGGTCCAGGTGCCCGAGGACGTGGGGCTCAACACGGCTGTGCTGCGA
GTGCAGGCCACGGACCGGGACCAGGGCCAGAACGCGGCCATTCACTACAGCATCCTCAGCGGGAACGTGGCCGGCCAGTTCTACCTGCAC
TCGCTGAGCGGGATCCTGGATGTGATCAACCCCTTGGATTTCGAGGATGTCCAGAAATACTCGCTGAGCATTAAGGCCCAGGATGGGGGC
CGGCCCCCGCTCATCAATTCTTCAGGGGTGGTGTCTGTGCAGGTGCTGGATGTCAACGACAACGAGCCTATCTTTGTGAGCAGCCCCTTC
CAGGCCACGGTGCTGGAGAATGTGCCCCTGGGCTACCCCGTGGTGCACATTCAGGCGGTGGACGCGGACTCTGGAGAGAACGCCCGGCTG
CACTATCGCCTGGTGGACACGGCCTCCACCTTTCTGGGGGGCGGCAGCGCTGGGCCTAAGAATCCTGCCCCCACCCCTGACTTCCCCTTC
CAGATCCACAACAGCTCCGGTTGGATCACAGTGTGTGCCGAGCTGGACCGCGAGGAGGTGGAGCACTACAGCTTCGGGGTGGAGGCGGTG
GACCACGGCTCGCCCCCCATGAGCTCCTCCACCAGCGTGTCCATCACGGTGCTGGACGTGAATGACAACGACCCGGTGTTCACGCAGCCC
ACCTACGAGCTTCGTCTGAATGAGGATGCGGCCGTGGGGAGCAGCGTGCTGACCCTGCAGGCCCGCGACCGTGACGCCAACAGTGTGATT
ACCTACCAGCTCACAGGCGGCAACACCCGGAACCGCTTTGCACTCAGCAGCCAGAGAGGGGGCGGCCTCATCACCCTGGCGCTACCTCTG
GACTACAAGCAGGAGCAGCAGTACGTGCTGGCGGTGACAGCATCCGACGGCACACGGTCGCACACTGCGCATGTCCTAATCAACGTCACT
GATGCCAACACCCACAGGCCTGTCTTTCAGAGCTCCCATTACACAGTGAGTGTCAGTGAGGACAGGCCTGTGGGCACCTCCATTGCTACC
CTCAGTGCCAACGATGAGGACACAGGAGAGAATGCCCGCATCACCTACGTGATTCAGGACCCCGTGCCGCAGTTCCGCATTGACCCCGAC
AGTGGCACCATGTACACCATGATGGAGCTGGACTATGAGAACCAGGTCGCCTACACGCTGACCATCATGGCCCAGGACAACGGCATCCCG
CAGAAATCAGACACCACCACCCTAGAGATCCTCATCCTCGATGCCAATGACAATGCACCCCAGTTCCTGTGGGATTTCTACCAGGGTTCC
ATCTTTGAGGATGCTCCACCCTCGACCAGCATCCTCCAGGTCTCTGCCACGGACCGGGACTCAGGTCCCAATGGGCGTCTGCTGTACACC
TTCCAGGGTGGGGACGACGGCGATGGGGACTTCTACATCGAGCCCACGTCCGGTGTGATTCGCACCCAGCGCCGGCTGGACCGGGAGAAT
GTGGCCGTGTACAACCTTTGGGCTCTGGCTGTGGATCGGGGCAGTCCCACTCCCCTTAGCGCCTCGGTAGAAATCCAGGTGACCATCTTG
GACATTAATGACAATGCCCCCATGTTTGAGAAGGACGAACTGGAGCTGTTTGTTGAGGAGAACAACCCAGTGGGGTCGGTGGTGGCAAAG
ATTCGTGCTAACGACCCTGATGAAGGCCCTAATGCCCAGATCATGTATCAGATTGTGGAAGGGGACATGCGGCATTTCTTCCAGCTGGAC
CTGCTCAACGGGGACCTGCGTGCCATGGTGGAGCTGGACTTTGAGGTCCGGCGGGAGTATGTGCTGGTGGTGCAGGCCACGTCGGCTCCG
CTGGTGAGCCGAGCCACGGTGCACATCCTTCTCGTGGACCAGAATGACAACCCGCCTGTGCTGCCCGACTTCCAGATCCTCTTCAACAAC
TATGTCACCAACAAGTCCAACAGTTTCCCCACCGGCGTGATCGGCTGCATCCCGGCCCATGACCCCGACGTGTCAGACAGCCTCAACTAC
ACCTTCGTGCAGGGCAACGAGCTGCGCCTGTTGCTGCTGGACCCCGCCACGGGCGAACTGCAGCTCAGCCGCGACCTGGACAACAACCGG
CCGCTGGAGGCGCTCATGGAGGTGTCTGTGTCTGTTTGACTTCACTTTTGTTGAAGTTTATCGCGTCAAGAAATTCCAGTTTACGTCGAA
GCACATGGAGGATGAGGACAGCGACCTCAAGGAGGGGGGGAAGAAGCGCTTTGGGCACATTTGCAGCAGCCACCCCTCCTGCTGCTGCAC
CGTCTCCAACAGCTCCTGGAACTGCGACGGGGAGGTCCTGCACAGCCCTGCCATCGAGGTCAGAGTCCACTGCCAGCTGGTTCGACTCTT
TGCACGAGGAATTGAAGAGAATCCGAAGCCAGACTCACACAGCTGAGAAGCCGGCGTCCTGCTCACAAACTGGGAAAGTGTGAAAACTAT
TTAAGATAATTATTACAGACCAATTATGTTGATATATACATTTAAATGTAGAAATTTATTTTTGATAGTTAAATCTTGATTTTAGAAGAA
AACCCTTTTGTCAACAATTTTGTGTACATATTTGGCATTTTCAGTTCTGTACGCATCTGCGGGTTGCAGCCCACGCCGCTTACTCTCAGC
GGATGCAGCTGCTCACTTGGGGGCACTGGCCTCTTAGGTTTTAACGATGTCAACAGTGTAGTTTAGAAAATGGCCCGTTAGTGGCTCTAT
TGCAATAATGTTAGGGACATTATATGATTTCCACGCAGGTCACACCATCTGGGCCTGAGGTAGCAGTGGGTCACTTTGATCCACTTTGCA
GGACTTATTCTGTAACGGTTTGTGGCCAAGTTTTGGGAAGTGGTTGATTCTCTTTGCCTTCATTTCACCTTCCTCTTCGTTTACGGTTAG
GACATCGCTGCTTGATCCTTACAATACTGTGCAACTGCAATGCAACGTGGCCCTGCTTCAGGTGATCCGCGGGAGGGGCCTCCACGCCAG
CGCCGGGAAGGCTGCTGGGGCCTCCACACCTGCCTCATCACGGCGGCGAGGCTACGACAATCCGGCTGGGAGCATGACCTTGGCGTCTGT
TCTGGGAGCACGGATGATAAGCTCTGGAAGCTGGCAGTGTGTAAAGCACTGGCAAGTTTGTTACTGTTAAAATGTCAAATACCAATGCTT
TATATCGACGCGAAGTGCTTAACACAGCCGGGCTTGGGGGCAGTCAGGAGGAAGCTGGCCATCCGTGGAGGAGGGGCCGGTCCTGGACTC
CCGCAGGACTCCTCTGAGGCAGGGCCTGAAGTCTGTACACGTGGTCCAGATTTGTCCTTGTCTTTTCTTCACACTGAGTTCTCTATATTT
ATTGAACATCTTGTCCTTTTAAGCCAGAGTAGTGTAAACTGCGTCTCGGATGTCTGTCTTTTGCCTCGAAGCCACGATGGATCGCTGGTT
TCCTCTGCAGCGCGAGGGCTCCGGCGACCAGAGGATTCTTCCCGGAAGGCATTCCTGCCGCGCTCCCCGGGGCACCCCTCAATTGTGTAC
TACGTCCTTGTTTAGTGTGTATCCGTGCCCACGTAGATGATGTCTGTAACGTAGTTTTGTTTGAAATATGAGAATATGCGGCTTAAACTT
TGATCTGTAAGGAGCGGGGCCGTGGCCGTTTGGAGCACGCTGTAGACACCGTTCCTCATGCTGCCGGGTGGGTTTTGCAGAAGCTCCCTT
AGTGATTTCATGTTTAACAGGCAGCATCCATTTTCAGAATTTCCTGGCATTGATTTATATTTTGAAGCATACAGGAAACTTCTCGTTTCC
TCGTTTAGCCCCACCCAGATCAGGTGAAAGGGCAGCTTTAATGGTGGTTTTTATGGACCACATTATCAGAGAGCACTGTGCAAGCCAAAT
GGTTCAATAATGAATGAAAATTCTGGGTGTAAAGAGTAAATATGCCCTGGCTCTTTCTACCAATGTTTGCTCCTGGTTGGAAAGAAACCA
AAGATTTAAGACGGGCTGCTCTTCCAGACTGGCTGTGCCTGCCTGTGCCCAGCAACCTGTGCAGCCGGCAGTGTGCCTGGTGTCACGCCA
GGAGGCTGTGGCTGCTGTGGGCCCTCTGGAATTGTGCTCCTCACAAAGTTTCCCCAAAAGGTTCTTCTAAGCCTTTATTGTCCCTGGTAA
ATGTTTCCCGGCTGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACCTAAGGTCAGGAGTTTG
AGATCAGCCTGCCCAACATGGTGAAACCTCGTCTCTACTAAAAATACACAACTTAGCCAGTCTTGTTGGCGCACGCCTGTAATCTCAGCT
ACTAGGGATGCTGAGGCAGGAGAATCGCTTGAACCCAAGAAAGAGGTGGAGGTTGCGGTGAGCCAAGATTGCGCCACTGCACTCCAGCCT
GGGCAAACAGAGGGAGACTCCATCGCCCCCCCCAACAAAAAAAAAAGTTTCCCATACACTGGCCTGCCCCAAAACCCACTAACAATTTTA
GCAAAACAGTCCAGGCCAAAGAGGAAGCATTTCATGTTCAATAAGAAACCCAGCCATTCCGCATGGCTGGTTCCTGAGTGGCTCTGGTGA
TACTCTCCAGCCACCTGCTGACATTCAGAATCTCAGACCTCGGGACTGCTGTTGCGGTACCGTGTGTCTGACACCTGCCAGCAGCCCTTT
GCTATCTGCGCGCAGGATGGGGGTGACTGCCCAGACATTCCCGCTAGATAGGCTCTGATTTCCGGGGCAGCCTTTCAGATGCGGCAGACA
TACAACACCTGTACTTTAGAGTTTTAAGGGAAAAAAAATCAGAAGTGCTGGTTAGATAGTAAAAACTTAGGATAACTTAGAAAGGCTAGT
TTTAGCTTCCTTTGTGGCTCCCTGGTGCAAAACAATTAGCAGTTATGCAATGGACCTGATTCTAGTTTATTCTAATTAAGAAGTGAGGCC
GAGTTTGACTTCGTTCCTGAATACAATCTTGAGTAACTGGGAAAGTCTGAGTGAAAGGATGGCCTCATTCTCTTTCTAATCTTGCTGGTT

>15643_15643_1_CELSR1-CERK_CELSR1_chr22_46929523_ENST00000262738_CERK_chr22_47086096_ENST00000216264_length(amino acids)=1182AA_BP=
MAPPPPPVLPVLLLLAAAAALPAMGLRAAAWEPRVPGGTRAFALRPGCTYAVGAACTPRAPRELLDVGRDGRLAGRRRVSGAGRPLPLQV
RLVARSAPTALSRRLRARTHLPGCGARARLCGTGARLCGALCFPVPGGCAAAQHSALAAPTTLPACRCPPRPRPRCPGRPICLPPGGSVR
LRLLCALRRAAGAVRVGLALEAATAGTPSASPSPSPPLPPNLPEARAGPARRARRGTSGRGSLKFPMPNYQVALFENEPAGTLILQLHAH
YTIEGEEERVSYYMEGLFDERSRGYFRIDSATGAVSTDSVLDRETKETHVLRVKAVDYSTPPRSATTYITVLVKDTNDHSPVFEQSEYRE
RVRENLEVGYEVLTIRASDRDSPINANLRYRVLGGAWDVFQLNESSGVVSTRAVLDREEAAEYQLLVEANDQGRNPGPLSATATVYIEVE
DENDNYPQFSEQNYVVQVPEDVGLNTAVLRVQATDRDQGQNAAIHYSILSGNVAGQFYLHSLSGILDVINPLDFEDVQKYSLSIKAQDGG
RPPLINSSGVVSVQVLDVNDNEPIFVSSPFQATVLENVPLGYPVVHIQAVDADSGENARLHYRLVDTASTFLGGGSAGPKNPAPTPDFPF
QIHNSSGWITVCAELDREEVEHYSFGVEAVDHGSPPMSSSTSVSITVLDVNDNDPVFTQPTYELRLNEDAAVGSSVLTLQARDRDANSVI
TYQLTGGNTRNRFALSSQRGGGLITLALPLDYKQEQQYVLAVTASDGTRSHTAHVLINVTDANTHRPVFQSSHYTVSVSEDRPVGTSIAT
LSANDEDTGENARITYVIQDPVPQFRIDPDSGTMYTMMELDYENQVAYTLTIMAQDNGIPQKSDTTTLEILILDANDNAPQFLWDFYQGS
IFEDAPPSTSILQVSATDRDSGPNGRLLYTFQGGDDGDGDFYIEPTSGVIRTQRRLDRENVAVYNLWALAVDRGSPTPLSASVEIQVTIL
DINDNAPMFEKDELELFVEENNPVGSVVAKIRANDPDEGPNAQIMYQIVEGDMRHFFQLDLLNGDLRAMVELDFEVRREYVLVVQATSAP
LVSRATVHILLVDQNDNPPVLPDFQILFNNYVTNKSNSFPTGVIGCIPAHDPDVSDSLNYTFVQGNELRLLLLDPATGELQLSRDLDNNR

--------------------------------------------------------------
>15643_15643_2_CELSR1-CERK_CELSR1_chr22_46929523_ENST00000262738_CERK_chr22_47086096_ENST00000541677_length(transcript)=3972nt_BP=3544nt
ATGGCGCCGCCGCCGCCGCCCGTGCTGCCCGTGCTGCTGCTCCTGGCCGCCGCCGCCGCCCTGCCGGCGATGGGGCTGCGAGCGGCCGCC
TGGGAGCCGCGCGTACCCGGCGGGACCCGCGCCTTCGCCCTCCGGCCCGGCTGTACCTACGCGGTGGGCGCCGCTTGCACGCCCCGGGCG
CCGCGGGAGCTGCTGGACGTGGGCCGCGATGGGCGGCTGGCAGGACGTCGGCGCGTCTCGGGCGCGGGGCGCCCGCTGCCGCTGCAAGTC
CGCTTGGTGGCCCGCAGTGCCCCGACGGCGCTGAGCCGCCGCCTGCGGGCGCGCACGCACCTTCCCGGCTGCGGAGCCCGTGCCCGGCTC
TGCGGAACCGGTGCCCGGCTCTGCGGGGCGCTCTGCTTCCCCGTCCCCGGCGGCTGCGCGGCCGCGCAGCATTCGGCGCTCGCAGCTCCG
ACCACCTTACCCGCCTGCCGCTGCCCGCCGCGCCCCAGGCCCCGCTGTCCCGGCCGTCCCATCTGCCTGCCGCCGGGCGGCTCGGTCCGC
CTGCGTCTGCTGTGCGCCCTGCGGCGCGCGGCTGGCGCCGTCCGGGTGGGACTGGCGCTGGAGGCCGCCACCGCGGGGACGCCCTCCGCG
TCGCCATCCCCATCGCCGCCCCTGCCGCCGAACTTGCCCGAAGCCCGGGCGGGGCCGGCGCGACGGGCCCGGCGGGGCACGAGCGGCAGA
GGGAGCCTGAAGTTTCCGATGCCCAACTACCAGGTGGCGTTGTTTGAGAACGAACCGGCGGGCACCCTCATCCTCCAGCTGCACGCGCAC
TACACCATCGAGGGCGAGGAGGAGCGCGTGAGCTATTACATGGAGGGGCTGTTCGACGAGCGCTCCCGGGGCTACTTCCGAATCGACTCT
GCCACGGGCGCCGTGAGCACGGACAGCGTACTGGACCGCGAGACCAAGGAGACGCACGTCCTCAGGGTGAAAGCCGTGGACTACAGTACG
CCGCCGCGCTCGGCCACCACCTACATCACTGTCTTGGTCAAAGACACCAACGACCACAGCCCGGTCTTCGAGCAGTCGGAGTACCGCGAG
CGCGTGCGGGAGAACCTGGAGGTGGGCTACGAGGTGCTGACCATCCGCGCCAGCGACCGCGACTCGCCCATCAACGCCAACTTGCGTTAC
CGCGTGTTGGGGGGCGCGTGGGACGTCTTCCAGCTCAACGAGAGCTCTGGCGTGGTGAGCACACGGGCGGTGCTGGACCGGGAGGAGGCG
GCCGAGTACCAGCTCCTGGTGGAGGCCAACGACCAGGGGCGCAATCCGGGCCCGCTCAGTGCCACGGCCACCGTGTACATCGAGGTGGAG
GACGAGAACGACAACTACCCCCAGTTCAGCGAGCAGAACTACGTGGTCCAGGTGCCCGAGGACGTGGGGCTCAACACGGCTGTGCTGCGA
GTGCAGGCCACGGACCGGGACCAGGGCCAGAACGCGGCCATTCACTACAGCATCCTCAGCGGGAACGTGGCCGGCCAGTTCTACCTGCAC
TCGCTGAGCGGGATCCTGGATGTGATCAACCCCTTGGATTTCGAGGATGTCCAGAAATACTCGCTGAGCATTAAGGCCCAGGATGGGGGC
CGGCCCCCGCTCATCAATTCTTCAGGGGTGGTGTCTGTGCAGGTGCTGGATGTCAACGACAACGAGCCTATCTTTGTGAGCAGCCCCTTC
CAGGCCACGGTGCTGGAGAATGTGCCCCTGGGCTACCCCGTGGTGCACATTCAGGCGGTGGACGCGGACTCTGGAGAGAACGCCCGGCTG
CACTATCGCCTGGTGGACACGGCCTCCACCTTTCTGGGGGGCGGCAGCGCTGGGCCTAAGAATCCTGCCCCCACCCCTGACTTCCCCTTC
CAGATCCACAACAGCTCCGGTTGGATCACAGTGTGTGCCGAGCTGGACCGCGAGGAGGTGGAGCACTACAGCTTCGGGGTGGAGGCGGTG
GACCACGGCTCGCCCCCCATGAGCTCCTCCACCAGCGTGTCCATCACGGTGCTGGACGTGAATGACAACGACCCGGTGTTCACGCAGCCC
ACCTACGAGCTTCGTCTGAATGAGGATGCGGCCGTGGGGAGCAGCGTGCTGACCCTGCAGGCCCGCGACCGTGACGCCAACAGTGTGATT
ACCTACCAGCTCACAGGCGGCAACACCCGGAACCGCTTTGCACTCAGCAGCCAGAGAGGGGGCGGCCTCATCACCCTGGCGCTACCTCTG
GACTACAAGCAGGAGCAGCAGTACGTGCTGGCGGTGACAGCATCCGACGGCACACGGTCGCACACTGCGCATGTCCTAATCAACGTCACT
GATGCCAACACCCACAGGCCTGTCTTTCAGAGCTCCCATTACACAGTGAGTGTCAGTGAGGACAGGCCTGTGGGCACCTCCATTGCTACC
CTCAGTGCCAACGATGAGGACACAGGAGAGAATGCCCGCATCACCTACGTGATTCAGGACCCCGTGCCGCAGTTCCGCATTGACCCCGAC
AGTGGCACCATGTACACCATGATGGAGCTGGACTATGAGAACCAGGTCGCCTACACGCTGACCATCATGGCCCAGGACAACGGCATCCCG
CAGAAATCAGACACCACCACCCTAGAGATCCTCATCCTCGATGCCAATGACAATGCACCCCAGTTCCTGTGGGATTTCTACCAGGGTTCC
ATCTTTGAGGATGCTCCACCCTCGACCAGCATCCTCCAGGTCTCTGCCACGGACCGGGACTCAGGTCCCAATGGGCGTCTGCTGTACACC
TTCCAGGGTGGGGACGACGGCGATGGGGACTTCTACATCGAGCCCACGTCCGGTGTGATTCGCACCCAGCGCCGGCTGGACCGGGAGAAT
GTGGCCGTGTACAACCTTTGGGCTCTGGCTGTGGATCGGGGCAGTCCCACTCCCCTTAGCGCCTCGGTAGAAATCCAGGTGACCATCTTG
GACATTAATGACAATGCCCCCATGTTTGAGAAGGACGAACTGGAGCTGTTTGTTGAGGAGAACAACCCAGTGGGGTCGGTGGTGGCAAAG
ATTCGTGCTAACGACCCTGATGAAGGCCCTAATGCCCAGATCATGTATCAGATTGTGGAAGGGGACATGCGGCATTTCTTCCAGCTGGAC
CTGCTCAACGGGGACCTGCGTGCCATGGTGGAGCTGGACTTTGAGGTCCGGCGGGAGTATGTGCTGGTGGTGCAGGCCACGTCGGCTCCG
CTGGTGAGCCGAGCCACGGTGCACATCCTTCTCGTGGACCAGAATGACAACCCGCCTGTGCTGCCCGACTTCCAGATCCTCTTCAACAAC
TATGTCACCAACAAGTCCAACAGTTTCCCCACCGGCGTGATCGGCTGCATCCCGGCCCATGACCCCGACGTGTCAGACAGCCTCAACTAC
ACCTTCGTGCAGGGCAACGAGCTGCGCCTGTTGCTGCTGGACCCCGCCACGGGCGAACTGCAGCTCAGCCGCGACCTGGACAACAACCGG
CCGCTGGAGGCGCTCATGGAGGTGTCTGTGTCTGTTTGACTTCACTTTTGTTGAAGTTTATCGCGTCAAGAAATTCCAGTTTACGTCGAA
GCACATGGAGGATGAGGACAGCGACCTCAAGGAGGGGGGGAAGAAGCGCTTTGGGCACATTTGCAGCAGCCACCCCTCCTGCTGCTGCAC
CGTCTCCAACAGCTCCTGGAACTGCGACGGGGAGGTCCTGCACAGCCCTGCCATCGAGGTCAGAGTCCACTGCCAGCTGGTTCGACTCTT
TGCACGAGGAATTGAAGAGAATCCGAAGCCAGACTCACACAGCTGAGAAGCCGGCGTCCTGCTCACAAACTGGGAAAGTGTGAAAACTAT
TTAAGATAATTATTACAGACCAATTATGTTGATATATACATTTAAATGTAGAAATTTATTTTTGATAGTTAAATCTTGATTTTAGAAGAA

>15643_15643_2_CELSR1-CERK_CELSR1_chr22_46929523_ENST00000262738_CERK_chr22_47086096_ENST00000541677_length(amino acids)=1182AA_BP=
MAPPPPPVLPVLLLLAAAAALPAMGLRAAAWEPRVPGGTRAFALRPGCTYAVGAACTPRAPRELLDVGRDGRLAGRRRVSGAGRPLPLQV
RLVARSAPTALSRRLRARTHLPGCGARARLCGTGARLCGALCFPVPGGCAAAQHSALAAPTTLPACRCPPRPRPRCPGRPICLPPGGSVR
LRLLCALRRAAGAVRVGLALEAATAGTPSASPSPSPPLPPNLPEARAGPARRARRGTSGRGSLKFPMPNYQVALFENEPAGTLILQLHAH
YTIEGEEERVSYYMEGLFDERSRGYFRIDSATGAVSTDSVLDRETKETHVLRVKAVDYSTPPRSATTYITVLVKDTNDHSPVFEQSEYRE
RVRENLEVGYEVLTIRASDRDSPINANLRYRVLGGAWDVFQLNESSGVVSTRAVLDREEAAEYQLLVEANDQGRNPGPLSATATVYIEVE
DENDNYPQFSEQNYVVQVPEDVGLNTAVLRVQATDRDQGQNAAIHYSILSGNVAGQFYLHSLSGILDVINPLDFEDVQKYSLSIKAQDGG
RPPLINSSGVVSVQVLDVNDNEPIFVSSPFQATVLENVPLGYPVVHIQAVDADSGENARLHYRLVDTASTFLGGGSAGPKNPAPTPDFPF
QIHNSSGWITVCAELDREEVEHYSFGVEAVDHGSPPMSSSTSVSITVLDVNDNDPVFTQPTYELRLNEDAAVGSSVLTLQARDRDANSVI
TYQLTGGNTRNRFALSSQRGGGLITLALPLDYKQEQQYVLAVTASDGTRSHTAHVLINVTDANTHRPVFQSSHYTVSVSEDRPVGTSIAT
LSANDEDTGENARITYVIQDPVPQFRIDPDSGTMYTMMELDYENQVAYTLTIMAQDNGIPQKSDTTTLEILILDANDNAPQFLWDFYQGS
IFEDAPPSTSILQVSATDRDSGPNGRLLYTFQGGDDGDGDFYIEPTSGVIRTQRRLDRENVAVYNLWALAVDRGSPTPLSASVEIQVTIL
DINDNAPMFEKDELELFVEENNPVGSVVAKIRANDPDEGPNAQIMYQIVEGDMRHFFQLDLLNGDLRAMVELDFEVRREYVLVVQATSAP
LVSRATVHILLVDQNDNPPVLPDFQILFNNYVTNKSNSFPTGVIGCIPAHDPDVSDSLNYTFVQGNELRLLLLDPATGELQLSRDLDNNR

--------------------------------------------------------------
>15643_15643_3_CELSR1-CERK_CELSR1_chr22_46929523_ENST00000395964_CERK_chr22_47086096_ENST00000216264_length(transcript)=6549nt_BP=3544nt
ATGGCGCCGCCGCCGCCGCCCGTGCTGCCCGTGCTGCTGCTCCTGGCCGCCGCCGCCGCCCTGCCGGCGATGGGGCTGCGAGCGGCCGCC
TGGGAGCCGCGCGTACCCGGCGGGACCCGCGCCTTCGCCCTCCGGCCCGGCTGTACCTACGCGGTGGGCGCCGCTTGCACGCCCCGGGCG
CCGCGGGAGCTGCTGGACGTGGGCCGCGATGGGCGGCTGGCAGGACGTCGGCGCGTCTCGGGCGCGGGGCGCCCGCTGCCGCTGCAAGTC
CGCTTGGTGGCCCGCAGTGCCCCGACGGCGCTGAGCCGCCGCCTGCGGGCGCGCACGCACCTTCCCGGCTGCGGAGCCCGTGCCCGGCTC
TGCGGAACCGGTGCCCGGCTCTGCGGGGCGCTCTGCTTCCCCGTCCCCGGCGGCTGCGCGGCCGCGCAGCATTCGGCGCTCGCAGCTCCG
ACCACCTTACCCGCCTGCCGCTGCCCGCCGCGCCCCAGGCCCCGCTGTCCCGGCCGTCCCATCTGCCTGCCGCCGGGCGGCTCGGTCCGC
CTGCGTCTGCTGTGCGCCCTGCGGCGCGCGGCTGGCGCCGTCCGGGTGGGACTGGCGCTGGAGGCCGCCACCGCGGGGACGCCCTCCGCG
TCGCCATCCCCATCGCCGCCCCTGCCGCCGAACTTGCCCGAAGCCCGGGCGGGGCCGGCGCGACGGGCCCGGCGGGGCACGAGCGGCAGA
GGGAGCCTGAAGTTTCCGATGCCCAACTACCAGGTGGCGTTGTTTGAGAACGAACCGGCGGGCACCCTCATCCTCCAGCTGCACGCGCAC
TACACCATCGAGGGCGAGGAGGAGCGCGTGAGCTATTACATGGAGGGGCTGTTCGACGAGCGCTCCCGGGGCTACTTCCGAATCGACTCT
GCCACGGGCGCCGTGAGCACGGACAGCGTACTGGACCGCGAGACCAAGGAGACGCACGTCCTCAGGGTGAAAGCCGTGGACTACAGTACG
CCGCCGCGCTCGGCCACCACCTACATCACTGTCTTGGTCAAAGACACCAACGACCACAGCCCGGTCTTCGAGCAGTCGGAGTACCGCGAG
CGCGTGCGGGAGAACCTGGAGGTGGGCTACGAGGTGCTGACCATCCGCGCCAGCGACCGCGACTCGCCCATCAACGCCAACTTGCGTTAC
CGCGTGTTGGGGGGCGCGTGGGACGTCTTCCAGCTCAACGAGAGCTCTGGCGTGGTGAGCACACGGGCGGTGCTGGACCGGGAGGAGGCG
GCCGAGTACCAGCTCCTGGTGGAGGCCAACGACCAGGGGCGCAATCCGGGCCCGCTCAGTGCCACGGCCACCGTGTACATCGAGGTGGAG
GACGAGAACGACAACTACCCCCAGTTCAGCGAGCAGAACTACGTGGTCCAGGTGCCCGAGGACGTGGGGCTCAACACGGCTGTGCTGCGA
GTGCAGGCCACGGACCGGGACCAGGGCCAGAACGCGGCCATTCACTACAGCATCCTCAGCGGGAACGTGGCCGGCCAGTTCTACCTGCAC
TCGCTGAGCGGGATCCTGGATGTGATCAACCCCTTGGATTTCGAGGATGTCCAGAAATACTCGCTGAGCATTAAGGCCCAGGATGGGGGC
CGGCCCCCGCTCATCAATTCTTCAGGGGTGGTGTCTGTGCAGGTGCTGGATGTCAACGACAACGAGCCTATCTTTGTGAGCAGCCCCTTC
CAGGCCACGGTGCTGGAGAATGTGCCCCTGGGCTACCCCGTGGTGCACATTCAGGCGGTGGACGCGGACTCTGGAGAGAACGCCCGGCTG
CACTATCGCCTGGTGGACACGGCCTCCACCTTTCTGGGGGGCGGCAGCGCTGGGCCTAAGAATCCTGCCCCCACCCCTGACTTCCCCTTC
CAGATCCACAACAGCTCCGGTTGGATCACAGTGTGTGCCGAGCTGGACCGCGAGGAGGTGGAGCACTACAGCTTCGGGGTGGAGGCGGTG
GACCACGGCTCGCCCCCCATGAGCTCCTCCACCAGCGTGTCCATCACGGTGCTGGACGTGAATGACAACGACCCGGTGTTCACGCAGCCC
ACCTACGAGCTTCGTCTGAATGAGGATGCGGCCGTGGGGAGCAGCGTGCTGACCCTGCAGGCCCGCGACCGTGACGCCAACAGTGTGATT
ACCTACCAGCTCACAGGCGGCAACACCCGGAACCGCTTTGCACTCAGCAGCCAGAGAGGGGGCGGCCTCATCACCCTGGCGCTACCTCTG
GACTACAAGCAGGAGCAGCAGTACGTGCTGGCGGTGACAGCATCCGACGGCACACGGTCGCACACTGCGCATGTCCTAATCAACGTCACT
GATGCCAACACCCACAGGCCTGTCTTTCAGAGCTCCCATTACACAGTGAGTGTCAGTGAGGACAGGCCTGTGGGCACCTCCATTGCTACC
CTCAGTGCCAACGATGAGGACACAGGAGAGAATGCCCGCATCACCTACGTGATTCAGGACCCCGTGCCGCAGTTCCGCATTGACCCCGAC
AGTGGCACCATGTACACCATGATGGAGCTGGACTATGAGAACCAGGTCGCCTACACGCTGACCATCATGGCCCAGGACAACGGCATCCCG
CAGAAATCAGACACCACCACCCTAGAGATCCTCATCCTCGATGCCAATGACAATGCACCCCAGTTCCTGTGGGATTTCTACCAGGGTTCC
ATCTTTGAGGATGCTCCACCCTCGACCAGCATCCTCCAGGTCTCTGCCACGGACCGGGACTCAGGTCCCAATGGGCGTCTGCTGTACACC
TTCCAGGGTGGGGACGACGGCGATGGGGACTTCTACATCGAGCCCACGTCCGGTGTGATTCGCACCCAGCGCCGGCTGGACCGGGAGAAT
GTGGCCGTGTACAACCTTTGGGCTCTGGCTGTGGATCGGGGCAGTCCCACTCCCCTTAGCGCCTCGGTAGAAATCCAGGTGACCATCTTG
GACATTAATGACAATGCCCCCATGTTTGAGAAGGACGAACTGGAGCTGTTTGTTGAGGAGAACAACCCAGTGGGGTCGGTGGTGGCAAAG
ATTCGTGCTAACGACCCTGATGAAGGCCCTAATGCCCAGATCATGTATCAGATTGTGGAAGGGGACATGCGGCATTTCTTCCAGCTGGAC
CTGCTCAACGGGGACCTGCGTGCCATGGTGGAGCTGGACTTTGAGGTCCGGCGGGAGTATGTGCTGGTGGTGCAGGCCACGTCGGCTCCG
CTGGTGAGCCGAGCCACGGTGCACATCCTTCTCGTGGACCAGAATGACAACCCGCCTGTGCTGCCCGACTTCCAGATCCTCTTCAACAAC
TATGTCACCAACAAGTCCAACAGTTTCCCCACCGGCGTGATCGGCTGCATCCCGGCCCATGACCCCGACGTGTCAGACAGCCTCAACTAC
ACCTTCGTGCAGGGCAACGAGCTGCGCCTGTTGCTGCTGGACCCCGCCACGGGCGAACTGCAGCTCAGCCGCGACCTGGACAACAACCGG
CCGCTGGAGGCGCTCATGGAGGTGTCTGTGTCTGTTTGACTTCACTTTTGTTGAAGTTTATCGCGTCAAGAAATTCCAGTTTACGTCGAA
GCACATGGAGGATGAGGACAGCGACCTCAAGGAGGGGGGGAAGAAGCGCTTTGGGCACATTTGCAGCAGCCACCCCTCCTGCTGCTGCAC
CGTCTCCAACAGCTCCTGGAACTGCGACGGGGAGGTCCTGCACAGCCCTGCCATCGAGGTCAGAGTCCACTGCCAGCTGGTTCGACTCTT
TGCACGAGGAATTGAAGAGAATCCGAAGCCAGACTCACACAGCTGAGAAGCCGGCGTCCTGCTCACAAACTGGGAAAGTGTGAAAACTAT
TTAAGATAATTATTACAGACCAATTATGTTGATATATACATTTAAATGTAGAAATTTATTTTTGATAGTTAAATCTTGATTTTAGAAGAA
AACCCTTTTGTCAACAATTTTGTGTACATATTTGGCATTTTCAGTTCTGTACGCATCTGCGGGTTGCAGCCCACGCCGCTTACTCTCAGC
GGATGCAGCTGCTCACTTGGGGGCACTGGCCTCTTAGGTTTTAACGATGTCAACAGTGTAGTTTAGAAAATGGCCCGTTAGTGGCTCTAT
TGCAATAATGTTAGGGACATTATATGATTTCCACGCAGGTCACACCATCTGGGCCTGAGGTAGCAGTGGGTCACTTTGATCCACTTTGCA
GGACTTATTCTGTAACGGTTTGTGGCCAAGTTTTGGGAAGTGGTTGATTCTCTTTGCCTTCATTTCACCTTCCTCTTCGTTTACGGTTAG
GACATCGCTGCTTGATCCTTACAATACTGTGCAACTGCAATGCAACGTGGCCCTGCTTCAGGTGATCCGCGGGAGGGGCCTCCACGCCAG
CGCCGGGAAGGCTGCTGGGGCCTCCACACCTGCCTCATCACGGCGGCGAGGCTACGACAATCCGGCTGGGAGCATGACCTTGGCGTCTGT
TCTGGGAGCACGGATGATAAGCTCTGGAAGCTGGCAGTGTGTAAAGCACTGGCAAGTTTGTTACTGTTAAAATGTCAAATACCAATGCTT
TATATCGACGCGAAGTGCTTAACACAGCCGGGCTTGGGGGCAGTCAGGAGGAAGCTGGCCATCCGTGGAGGAGGGGCCGGTCCTGGACTC
CCGCAGGACTCCTCTGAGGCAGGGCCTGAAGTCTGTACACGTGGTCCAGATTTGTCCTTGTCTTTTCTTCACACTGAGTTCTCTATATTT
ATTGAACATCTTGTCCTTTTAAGCCAGAGTAGTGTAAACTGCGTCTCGGATGTCTGTCTTTTGCCTCGAAGCCACGATGGATCGCTGGTT
TCCTCTGCAGCGCGAGGGCTCCGGCGACCAGAGGATTCTTCCCGGAAGGCATTCCTGCCGCGCTCCCCGGGGCACCCCTCAATTGTGTAC
TACGTCCTTGTTTAGTGTGTATCCGTGCCCACGTAGATGATGTCTGTAACGTAGTTTTGTTTGAAATATGAGAATATGCGGCTTAAACTT
TGATCTGTAAGGAGCGGGGCCGTGGCCGTTTGGAGCACGCTGTAGACACCGTTCCTCATGCTGCCGGGTGGGTTTTGCAGAAGCTCCCTT
AGTGATTTCATGTTTAACAGGCAGCATCCATTTTCAGAATTTCCTGGCATTGATTTATATTTTGAAGCATACAGGAAACTTCTCGTTTCC
TCGTTTAGCCCCACCCAGATCAGGTGAAAGGGCAGCTTTAATGGTGGTTTTTATGGACCACATTATCAGAGAGCACTGTGCAAGCCAAAT
GGTTCAATAATGAATGAAAATTCTGGGTGTAAAGAGTAAATATGCCCTGGCTCTTTCTACCAATGTTTGCTCCTGGTTGGAAAGAAACCA
AAGATTTAAGACGGGCTGCTCTTCCAGACTGGCTGTGCCTGCCTGTGCCCAGCAACCTGTGCAGCCGGCAGTGTGCCTGGTGTCACGCCA
GGAGGCTGTGGCTGCTGTGGGCCCTCTGGAATTGTGCTCCTCACAAAGTTTCCCCAAAAGGTTCTTCTAAGCCTTTATTGTCCCTGGTAA
ATGTTTCCCGGCTGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACCTAAGGTCAGGAGTTTG
AGATCAGCCTGCCCAACATGGTGAAACCTCGTCTCTACTAAAAATACACAACTTAGCCAGTCTTGTTGGCGCACGCCTGTAATCTCAGCT
ACTAGGGATGCTGAGGCAGGAGAATCGCTTGAACCCAAGAAAGAGGTGGAGGTTGCGGTGAGCCAAGATTGCGCCACTGCACTCCAGCCT
GGGCAAACAGAGGGAGACTCCATCGCCCCCCCCAACAAAAAAAAAAGTTTCCCATACACTGGCCTGCCCCAAAACCCACTAACAATTTTA
GCAAAACAGTCCAGGCCAAAGAGGAAGCATTTCATGTTCAATAAGAAACCCAGCCATTCCGCATGGCTGGTTCCTGAGTGGCTCTGGTGA
TACTCTCCAGCCACCTGCTGACATTCAGAATCTCAGACCTCGGGACTGCTGTTGCGGTACCGTGTGTCTGACACCTGCCAGCAGCCCTTT
GCTATCTGCGCGCAGGATGGGGGTGACTGCCCAGACATTCCCGCTAGATAGGCTCTGATTTCCGGGGCAGCCTTTCAGATGCGGCAGACA
TACAACACCTGTACTTTAGAGTTTTAAGGGAAAAAAAATCAGAAGTGCTGGTTAGATAGTAAAAACTTAGGATAACTTAGAAAGGCTAGT
TTTAGCTTCCTTTGTGGCTCCCTGGTGCAAAACAATTAGCAGTTATGCAATGGACCTGATTCTAGTTTATTCTAATTAAGAAGTGAGGCC
GAGTTTGACTTCGTTCCTGAATACAATCTTGAGTAACTGGGAAAGTCTGAGTGAAAGGATGGCCTCATTCTCTTTCTAATCTTGCTGGTT

>15643_15643_3_CELSR1-CERK_CELSR1_chr22_46929523_ENST00000395964_CERK_chr22_47086096_ENST00000216264_length(amino acids)=1182AA_BP=
MAPPPPPVLPVLLLLAAAAALPAMGLRAAAWEPRVPGGTRAFALRPGCTYAVGAACTPRAPRELLDVGRDGRLAGRRRVSGAGRPLPLQV
RLVARSAPTALSRRLRARTHLPGCGARARLCGTGARLCGALCFPVPGGCAAAQHSALAAPTTLPACRCPPRPRPRCPGRPICLPPGGSVR
LRLLCALRRAAGAVRVGLALEAATAGTPSASPSPSPPLPPNLPEARAGPARRARRGTSGRGSLKFPMPNYQVALFENEPAGTLILQLHAH
YTIEGEEERVSYYMEGLFDERSRGYFRIDSATGAVSTDSVLDRETKETHVLRVKAVDYSTPPRSATTYITVLVKDTNDHSPVFEQSEYRE
RVRENLEVGYEVLTIRASDRDSPINANLRYRVLGGAWDVFQLNESSGVVSTRAVLDREEAAEYQLLVEANDQGRNPGPLSATATVYIEVE
DENDNYPQFSEQNYVVQVPEDVGLNTAVLRVQATDRDQGQNAAIHYSILSGNVAGQFYLHSLSGILDVINPLDFEDVQKYSLSIKAQDGG
RPPLINSSGVVSVQVLDVNDNEPIFVSSPFQATVLENVPLGYPVVHIQAVDADSGENARLHYRLVDTASTFLGGGSAGPKNPAPTPDFPF
QIHNSSGWITVCAELDREEVEHYSFGVEAVDHGSPPMSSSTSVSITVLDVNDNDPVFTQPTYELRLNEDAAVGSSVLTLQARDRDANSVI
TYQLTGGNTRNRFALSSQRGGGLITLALPLDYKQEQQYVLAVTASDGTRSHTAHVLINVTDANTHRPVFQSSHYTVSVSEDRPVGTSIAT
LSANDEDTGENARITYVIQDPVPQFRIDPDSGTMYTMMELDYENQVAYTLTIMAQDNGIPQKSDTTTLEILILDANDNAPQFLWDFYQGS
IFEDAPPSTSILQVSATDRDSGPNGRLLYTFQGGDDGDGDFYIEPTSGVIRTQRRLDRENVAVYNLWALAVDRGSPTPLSASVEIQVTIL
DINDNAPMFEKDELELFVEENNPVGSVVAKIRANDPDEGPNAQIMYQIVEGDMRHFFQLDLLNGDLRAMVELDFEVRREYVLVVQATSAP
LVSRATVHILLVDQNDNPPVLPDFQILFNNYVTNKSNSFPTGVIGCIPAHDPDVSDSLNYTFVQGNELRLLLLDPATGELQLSRDLDNNR

--------------------------------------------------------------
>15643_15643_4_CELSR1-CERK_CELSR1_chr22_46929523_ENST00000395964_CERK_chr22_47086096_ENST00000541677_length(transcript)=3972nt_BP=3544nt
ATGGCGCCGCCGCCGCCGCCCGTGCTGCCCGTGCTGCTGCTCCTGGCCGCCGCCGCCGCCCTGCCGGCGATGGGGCTGCGAGCGGCCGCC
TGGGAGCCGCGCGTACCCGGCGGGACCCGCGCCTTCGCCCTCCGGCCCGGCTGTACCTACGCGGTGGGCGCCGCTTGCACGCCCCGGGCG
CCGCGGGAGCTGCTGGACGTGGGCCGCGATGGGCGGCTGGCAGGACGTCGGCGCGTCTCGGGCGCGGGGCGCCCGCTGCCGCTGCAAGTC
CGCTTGGTGGCCCGCAGTGCCCCGACGGCGCTGAGCCGCCGCCTGCGGGCGCGCACGCACCTTCCCGGCTGCGGAGCCCGTGCCCGGCTC
TGCGGAACCGGTGCCCGGCTCTGCGGGGCGCTCTGCTTCCCCGTCCCCGGCGGCTGCGCGGCCGCGCAGCATTCGGCGCTCGCAGCTCCG
ACCACCTTACCCGCCTGCCGCTGCCCGCCGCGCCCCAGGCCCCGCTGTCCCGGCCGTCCCATCTGCCTGCCGCCGGGCGGCTCGGTCCGC
CTGCGTCTGCTGTGCGCCCTGCGGCGCGCGGCTGGCGCCGTCCGGGTGGGACTGGCGCTGGAGGCCGCCACCGCGGGGACGCCCTCCGCG
TCGCCATCCCCATCGCCGCCCCTGCCGCCGAACTTGCCCGAAGCCCGGGCGGGGCCGGCGCGACGGGCCCGGCGGGGCACGAGCGGCAGA
GGGAGCCTGAAGTTTCCGATGCCCAACTACCAGGTGGCGTTGTTTGAGAACGAACCGGCGGGCACCCTCATCCTCCAGCTGCACGCGCAC
TACACCATCGAGGGCGAGGAGGAGCGCGTGAGCTATTACATGGAGGGGCTGTTCGACGAGCGCTCCCGGGGCTACTTCCGAATCGACTCT
GCCACGGGCGCCGTGAGCACGGACAGCGTACTGGACCGCGAGACCAAGGAGACGCACGTCCTCAGGGTGAAAGCCGTGGACTACAGTACG
CCGCCGCGCTCGGCCACCACCTACATCACTGTCTTGGTCAAAGACACCAACGACCACAGCCCGGTCTTCGAGCAGTCGGAGTACCGCGAG
CGCGTGCGGGAGAACCTGGAGGTGGGCTACGAGGTGCTGACCATCCGCGCCAGCGACCGCGACTCGCCCATCAACGCCAACTTGCGTTAC
CGCGTGTTGGGGGGCGCGTGGGACGTCTTCCAGCTCAACGAGAGCTCTGGCGTGGTGAGCACACGGGCGGTGCTGGACCGGGAGGAGGCG
GCCGAGTACCAGCTCCTGGTGGAGGCCAACGACCAGGGGCGCAATCCGGGCCCGCTCAGTGCCACGGCCACCGTGTACATCGAGGTGGAG
GACGAGAACGACAACTACCCCCAGTTCAGCGAGCAGAACTACGTGGTCCAGGTGCCCGAGGACGTGGGGCTCAACACGGCTGTGCTGCGA
GTGCAGGCCACGGACCGGGACCAGGGCCAGAACGCGGCCATTCACTACAGCATCCTCAGCGGGAACGTGGCCGGCCAGTTCTACCTGCAC
TCGCTGAGCGGGATCCTGGATGTGATCAACCCCTTGGATTTCGAGGATGTCCAGAAATACTCGCTGAGCATTAAGGCCCAGGATGGGGGC
CGGCCCCCGCTCATCAATTCTTCAGGGGTGGTGTCTGTGCAGGTGCTGGATGTCAACGACAACGAGCCTATCTTTGTGAGCAGCCCCTTC
CAGGCCACGGTGCTGGAGAATGTGCCCCTGGGCTACCCCGTGGTGCACATTCAGGCGGTGGACGCGGACTCTGGAGAGAACGCCCGGCTG
CACTATCGCCTGGTGGACACGGCCTCCACCTTTCTGGGGGGCGGCAGCGCTGGGCCTAAGAATCCTGCCCCCACCCCTGACTTCCCCTTC
CAGATCCACAACAGCTCCGGTTGGATCACAGTGTGTGCCGAGCTGGACCGCGAGGAGGTGGAGCACTACAGCTTCGGGGTGGAGGCGGTG
GACCACGGCTCGCCCCCCATGAGCTCCTCCACCAGCGTGTCCATCACGGTGCTGGACGTGAATGACAACGACCCGGTGTTCACGCAGCCC
ACCTACGAGCTTCGTCTGAATGAGGATGCGGCCGTGGGGAGCAGCGTGCTGACCCTGCAGGCCCGCGACCGTGACGCCAACAGTGTGATT
ACCTACCAGCTCACAGGCGGCAACACCCGGAACCGCTTTGCACTCAGCAGCCAGAGAGGGGGCGGCCTCATCACCCTGGCGCTACCTCTG
GACTACAAGCAGGAGCAGCAGTACGTGCTGGCGGTGACAGCATCCGACGGCACACGGTCGCACACTGCGCATGTCCTAATCAACGTCACT
GATGCCAACACCCACAGGCCTGTCTTTCAGAGCTCCCATTACACAGTGAGTGTCAGTGAGGACAGGCCTGTGGGCACCTCCATTGCTACC
CTCAGTGCCAACGATGAGGACACAGGAGAGAATGCCCGCATCACCTACGTGATTCAGGACCCCGTGCCGCAGTTCCGCATTGACCCCGAC
AGTGGCACCATGTACACCATGATGGAGCTGGACTATGAGAACCAGGTCGCCTACACGCTGACCATCATGGCCCAGGACAACGGCATCCCG
CAGAAATCAGACACCACCACCCTAGAGATCCTCATCCTCGATGCCAATGACAATGCACCCCAGTTCCTGTGGGATTTCTACCAGGGTTCC
ATCTTTGAGGATGCTCCACCCTCGACCAGCATCCTCCAGGTCTCTGCCACGGACCGGGACTCAGGTCCCAATGGGCGTCTGCTGTACACC
TTCCAGGGTGGGGACGACGGCGATGGGGACTTCTACATCGAGCCCACGTCCGGTGTGATTCGCACCCAGCGCCGGCTGGACCGGGAGAAT
GTGGCCGTGTACAACCTTTGGGCTCTGGCTGTGGATCGGGGCAGTCCCACTCCCCTTAGCGCCTCGGTAGAAATCCAGGTGACCATCTTG
GACATTAATGACAATGCCCCCATGTTTGAGAAGGACGAACTGGAGCTGTTTGTTGAGGAGAACAACCCAGTGGGGTCGGTGGTGGCAAAG
ATTCGTGCTAACGACCCTGATGAAGGCCCTAATGCCCAGATCATGTATCAGATTGTGGAAGGGGACATGCGGCATTTCTTCCAGCTGGAC
CTGCTCAACGGGGACCTGCGTGCCATGGTGGAGCTGGACTTTGAGGTCCGGCGGGAGTATGTGCTGGTGGTGCAGGCCACGTCGGCTCCG
CTGGTGAGCCGAGCCACGGTGCACATCCTTCTCGTGGACCAGAATGACAACCCGCCTGTGCTGCCCGACTTCCAGATCCTCTTCAACAAC
TATGTCACCAACAAGTCCAACAGTTTCCCCACCGGCGTGATCGGCTGCATCCCGGCCCATGACCCCGACGTGTCAGACAGCCTCAACTAC
ACCTTCGTGCAGGGCAACGAGCTGCGCCTGTTGCTGCTGGACCCCGCCACGGGCGAACTGCAGCTCAGCCGCGACCTGGACAACAACCGG
CCGCTGGAGGCGCTCATGGAGGTGTCTGTGTCTGTTTGACTTCACTTTTGTTGAAGTTTATCGCGTCAAGAAATTCCAGTTTACGTCGAA
GCACATGGAGGATGAGGACAGCGACCTCAAGGAGGGGGGGAAGAAGCGCTTTGGGCACATTTGCAGCAGCCACCCCTCCTGCTGCTGCAC
CGTCTCCAACAGCTCCTGGAACTGCGACGGGGAGGTCCTGCACAGCCCTGCCATCGAGGTCAGAGTCCACTGCCAGCTGGTTCGACTCTT
TGCACGAGGAATTGAAGAGAATCCGAAGCCAGACTCACACAGCTGAGAAGCCGGCGTCCTGCTCACAAACTGGGAAAGTGTGAAAACTAT
TTAAGATAATTATTACAGACCAATTATGTTGATATATACATTTAAATGTAGAAATTTATTTTTGATAGTTAAATCTTGATTTTAGAAGAA

>15643_15643_4_CELSR1-CERK_CELSR1_chr22_46929523_ENST00000395964_CERK_chr22_47086096_ENST00000541677_length(amino acids)=1182AA_BP=
MAPPPPPVLPVLLLLAAAAALPAMGLRAAAWEPRVPGGTRAFALRPGCTYAVGAACTPRAPRELLDVGRDGRLAGRRRVSGAGRPLPLQV
RLVARSAPTALSRRLRARTHLPGCGARARLCGTGARLCGALCFPVPGGCAAAQHSALAAPTTLPACRCPPRPRPRCPGRPICLPPGGSVR
LRLLCALRRAAGAVRVGLALEAATAGTPSASPSPSPPLPPNLPEARAGPARRARRGTSGRGSLKFPMPNYQVALFENEPAGTLILQLHAH
YTIEGEEERVSYYMEGLFDERSRGYFRIDSATGAVSTDSVLDRETKETHVLRVKAVDYSTPPRSATTYITVLVKDTNDHSPVFEQSEYRE
RVRENLEVGYEVLTIRASDRDSPINANLRYRVLGGAWDVFQLNESSGVVSTRAVLDREEAAEYQLLVEANDQGRNPGPLSATATVYIEVE
DENDNYPQFSEQNYVVQVPEDVGLNTAVLRVQATDRDQGQNAAIHYSILSGNVAGQFYLHSLSGILDVINPLDFEDVQKYSLSIKAQDGG
RPPLINSSGVVSVQVLDVNDNEPIFVSSPFQATVLENVPLGYPVVHIQAVDADSGENARLHYRLVDTASTFLGGGSAGPKNPAPTPDFPF
QIHNSSGWITVCAELDREEVEHYSFGVEAVDHGSPPMSSSTSVSITVLDVNDNDPVFTQPTYELRLNEDAAVGSSVLTLQARDRDANSVI
TYQLTGGNTRNRFALSSQRGGGLITLALPLDYKQEQQYVLAVTASDGTRSHTAHVLINVTDANTHRPVFQSSHYTVSVSEDRPVGTSIAT
LSANDEDTGENARITYVIQDPVPQFRIDPDSGTMYTMMELDYENQVAYTLTIMAQDNGIPQKSDTTTLEILILDANDNAPQFLWDFYQGS
IFEDAPPSTSILQVSATDRDSGPNGRLLYTFQGGDDGDGDFYIEPTSGVIRTQRRLDRENVAVYNLWALAVDRGSPTPLSASVEIQVTIL
DINDNAPMFEKDELELFVEENNPVGSVVAKIRANDPDEGPNAQIMYQIVEGDMRHFFQLDLLNGDLRAMVELDFEVRREYVLVVQATSAP
LVSRATVHILLVDQNDNPPVLPDFQILFNNYVTNKSNSFPTGVIGCIPAHDPDVSDSLNYTFVQGNELRLLLLDPATGELQLSRDLDNNR

--------------------------------------------------------------
>15643_15643_5_CELSR1-CERK_CELSR1_chr22_46929524_ENST00000262738_CERK_chr22_47086097_ENST00000216264_length(transcript)=6549nt_BP=3544nt
ATGGCGCCGCCGCCGCCGCCCGTGCTGCCCGTGCTGCTGCTCCTGGCCGCCGCCGCCGCCCTGCCGGCGATGGGGCTGCGAGCGGCCGCC
TGGGAGCCGCGCGTACCCGGCGGGACCCGCGCCTTCGCCCTCCGGCCCGGCTGTACCTACGCGGTGGGCGCCGCTTGCACGCCCCGGGCG
CCGCGGGAGCTGCTGGACGTGGGCCGCGATGGGCGGCTGGCAGGACGTCGGCGCGTCTCGGGCGCGGGGCGCCCGCTGCCGCTGCAAGTC
CGCTTGGTGGCCCGCAGTGCCCCGACGGCGCTGAGCCGCCGCCTGCGGGCGCGCACGCACCTTCCCGGCTGCGGAGCCCGTGCCCGGCTC
TGCGGAACCGGTGCCCGGCTCTGCGGGGCGCTCTGCTTCCCCGTCCCCGGCGGCTGCGCGGCCGCGCAGCATTCGGCGCTCGCAGCTCCG
ACCACCTTACCCGCCTGCCGCTGCCCGCCGCGCCCCAGGCCCCGCTGTCCCGGCCGTCCCATCTGCCTGCCGCCGGGCGGCTCGGTCCGC
CTGCGTCTGCTGTGCGCCCTGCGGCGCGCGGCTGGCGCCGTCCGGGTGGGACTGGCGCTGGAGGCCGCCACCGCGGGGACGCCCTCCGCG
TCGCCATCCCCATCGCCGCCCCTGCCGCCGAACTTGCCCGAAGCCCGGGCGGGGCCGGCGCGACGGGCCCGGCGGGGCACGAGCGGCAGA
GGGAGCCTGAAGTTTCCGATGCCCAACTACCAGGTGGCGTTGTTTGAGAACGAACCGGCGGGCACCCTCATCCTCCAGCTGCACGCGCAC
TACACCATCGAGGGCGAGGAGGAGCGCGTGAGCTATTACATGGAGGGGCTGTTCGACGAGCGCTCCCGGGGCTACTTCCGAATCGACTCT
GCCACGGGCGCCGTGAGCACGGACAGCGTACTGGACCGCGAGACCAAGGAGACGCACGTCCTCAGGGTGAAAGCCGTGGACTACAGTACG
CCGCCGCGCTCGGCCACCACCTACATCACTGTCTTGGTCAAAGACACCAACGACCACAGCCCGGTCTTCGAGCAGTCGGAGTACCGCGAG
CGCGTGCGGGAGAACCTGGAGGTGGGCTACGAGGTGCTGACCATCCGCGCCAGCGACCGCGACTCGCCCATCAACGCCAACTTGCGTTAC
CGCGTGTTGGGGGGCGCGTGGGACGTCTTCCAGCTCAACGAGAGCTCTGGCGTGGTGAGCACACGGGCGGTGCTGGACCGGGAGGAGGCG
GCCGAGTACCAGCTCCTGGTGGAGGCCAACGACCAGGGGCGCAATCCGGGCCCGCTCAGTGCCACGGCCACCGTGTACATCGAGGTGGAG
GACGAGAACGACAACTACCCCCAGTTCAGCGAGCAGAACTACGTGGTCCAGGTGCCCGAGGACGTGGGGCTCAACACGGCTGTGCTGCGA
GTGCAGGCCACGGACCGGGACCAGGGCCAGAACGCGGCCATTCACTACAGCATCCTCAGCGGGAACGTGGCCGGCCAGTTCTACCTGCAC
TCGCTGAGCGGGATCCTGGATGTGATCAACCCCTTGGATTTCGAGGATGTCCAGAAATACTCGCTGAGCATTAAGGCCCAGGATGGGGGC
CGGCCCCCGCTCATCAATTCTTCAGGGGTGGTGTCTGTGCAGGTGCTGGATGTCAACGACAACGAGCCTATCTTTGTGAGCAGCCCCTTC
CAGGCCACGGTGCTGGAGAATGTGCCCCTGGGCTACCCCGTGGTGCACATTCAGGCGGTGGACGCGGACTCTGGAGAGAACGCCCGGCTG
CACTATCGCCTGGTGGACACGGCCTCCACCTTTCTGGGGGGCGGCAGCGCTGGGCCTAAGAATCCTGCCCCCACCCCTGACTTCCCCTTC
CAGATCCACAACAGCTCCGGTTGGATCACAGTGTGTGCCGAGCTGGACCGCGAGGAGGTGGAGCACTACAGCTTCGGGGTGGAGGCGGTG
GACCACGGCTCGCCCCCCATGAGCTCCTCCACCAGCGTGTCCATCACGGTGCTGGACGTGAATGACAACGACCCGGTGTTCACGCAGCCC
ACCTACGAGCTTCGTCTGAATGAGGATGCGGCCGTGGGGAGCAGCGTGCTGACCCTGCAGGCCCGCGACCGTGACGCCAACAGTGTGATT
ACCTACCAGCTCACAGGCGGCAACACCCGGAACCGCTTTGCACTCAGCAGCCAGAGAGGGGGCGGCCTCATCACCCTGGCGCTACCTCTG
GACTACAAGCAGGAGCAGCAGTACGTGCTGGCGGTGACAGCATCCGACGGCACACGGTCGCACACTGCGCATGTCCTAATCAACGTCACT
GATGCCAACACCCACAGGCCTGTCTTTCAGAGCTCCCATTACACAGTGAGTGTCAGTGAGGACAGGCCTGTGGGCACCTCCATTGCTACC
CTCAGTGCCAACGATGAGGACACAGGAGAGAATGCCCGCATCACCTACGTGATTCAGGACCCCGTGCCGCAGTTCCGCATTGACCCCGAC
AGTGGCACCATGTACACCATGATGGAGCTGGACTATGAGAACCAGGTCGCCTACACGCTGACCATCATGGCCCAGGACAACGGCATCCCG
CAGAAATCAGACACCACCACCCTAGAGATCCTCATCCTCGATGCCAATGACAATGCACCCCAGTTCCTGTGGGATTTCTACCAGGGTTCC
ATCTTTGAGGATGCTCCACCCTCGACCAGCATCCTCCAGGTCTCTGCCACGGACCGGGACTCAGGTCCCAATGGGCGTCTGCTGTACACC
TTCCAGGGTGGGGACGACGGCGATGGGGACTTCTACATCGAGCCCACGTCCGGTGTGATTCGCACCCAGCGCCGGCTGGACCGGGAGAAT
GTGGCCGTGTACAACCTTTGGGCTCTGGCTGTGGATCGGGGCAGTCCCACTCCCCTTAGCGCCTCGGTAGAAATCCAGGTGACCATCTTG
GACATTAATGACAATGCCCCCATGTTTGAGAAGGACGAACTGGAGCTGTTTGTTGAGGAGAACAACCCAGTGGGGTCGGTGGTGGCAAAG
ATTCGTGCTAACGACCCTGATGAAGGCCCTAATGCCCAGATCATGTATCAGATTGTGGAAGGGGACATGCGGCATTTCTTCCAGCTGGAC
CTGCTCAACGGGGACCTGCGTGCCATGGTGGAGCTGGACTTTGAGGTCCGGCGGGAGTATGTGCTGGTGGTGCAGGCCACGTCGGCTCCG
CTGGTGAGCCGAGCCACGGTGCACATCCTTCTCGTGGACCAGAATGACAACCCGCCTGTGCTGCCCGACTTCCAGATCCTCTTCAACAAC
TATGTCACCAACAAGTCCAACAGTTTCCCCACCGGCGTGATCGGCTGCATCCCGGCCCATGACCCCGACGTGTCAGACAGCCTCAACTAC
ACCTTCGTGCAGGGCAACGAGCTGCGCCTGTTGCTGCTGGACCCCGCCACGGGCGAACTGCAGCTCAGCCGCGACCTGGACAACAACCGG
CCGCTGGAGGCGCTCATGGAGGTGTCTGTGTCTGTTTGACTTCACTTTTGTTGAAGTTTATCGCGTCAAGAAATTCCAGTTTACGTCGAA
GCACATGGAGGATGAGGACAGCGACCTCAAGGAGGGGGGGAAGAAGCGCTTTGGGCACATTTGCAGCAGCCACCCCTCCTGCTGCTGCAC
CGTCTCCAACAGCTCCTGGAACTGCGACGGGGAGGTCCTGCACAGCCCTGCCATCGAGGTCAGAGTCCACTGCCAGCTGGTTCGACTCTT
TGCACGAGGAATTGAAGAGAATCCGAAGCCAGACTCACACAGCTGAGAAGCCGGCGTCCTGCTCACAAACTGGGAAAGTGTGAAAACTAT
TTAAGATAATTATTACAGACCAATTATGTTGATATATACATTTAAATGTAGAAATTTATTTTTGATAGTTAAATCTTGATTTTAGAAGAA
AACCCTTTTGTCAACAATTTTGTGTACATATTTGGCATTTTCAGTTCTGTACGCATCTGCGGGTTGCAGCCCACGCCGCTTACTCTCAGC
GGATGCAGCTGCTCACTTGGGGGCACTGGCCTCTTAGGTTTTAACGATGTCAACAGTGTAGTTTAGAAAATGGCCCGTTAGTGGCTCTAT
TGCAATAATGTTAGGGACATTATATGATTTCCACGCAGGTCACACCATCTGGGCCTGAGGTAGCAGTGGGTCACTTTGATCCACTTTGCA
GGACTTATTCTGTAACGGTTTGTGGCCAAGTTTTGGGAAGTGGTTGATTCTCTTTGCCTTCATTTCACCTTCCTCTTCGTTTACGGTTAG
GACATCGCTGCTTGATCCTTACAATACTGTGCAACTGCAATGCAACGTGGCCCTGCTTCAGGTGATCCGCGGGAGGGGCCTCCACGCCAG
CGCCGGGAAGGCTGCTGGGGCCTCCACACCTGCCTCATCACGGCGGCGAGGCTACGACAATCCGGCTGGGAGCATGACCTTGGCGTCTGT
TCTGGGAGCACGGATGATAAGCTCTGGAAGCTGGCAGTGTGTAAAGCACTGGCAAGTTTGTTACTGTTAAAATGTCAAATACCAATGCTT
TATATCGACGCGAAGTGCTTAACACAGCCGGGCTTGGGGGCAGTCAGGAGGAAGCTGGCCATCCGTGGAGGAGGGGCCGGTCCTGGACTC
CCGCAGGACTCCTCTGAGGCAGGGCCTGAAGTCTGTACACGTGGTCCAGATTTGTCCTTGTCTTTTCTTCACACTGAGTTCTCTATATTT
ATTGAACATCTTGTCCTTTTAAGCCAGAGTAGTGTAAACTGCGTCTCGGATGTCTGTCTTTTGCCTCGAAGCCACGATGGATCGCTGGTT
TCCTCTGCAGCGCGAGGGCTCCGGCGACCAGAGGATTCTTCCCGGAAGGCATTCCTGCCGCGCTCCCCGGGGCACCCCTCAATTGTGTAC
TACGTCCTTGTTTAGTGTGTATCCGTGCCCACGTAGATGATGTCTGTAACGTAGTTTTGTTTGAAATATGAGAATATGCGGCTTAAACTT
TGATCTGTAAGGAGCGGGGCCGTGGCCGTTTGGAGCACGCTGTAGACACCGTTCCTCATGCTGCCGGGTGGGTTTTGCAGAAGCTCCCTT
AGTGATTTCATGTTTAACAGGCAGCATCCATTTTCAGAATTTCCTGGCATTGATTTATATTTTGAAGCATACAGGAAACTTCTCGTTTCC
TCGTTTAGCCCCACCCAGATCAGGTGAAAGGGCAGCTTTAATGGTGGTTTTTATGGACCACATTATCAGAGAGCACTGTGCAAGCCAAAT
GGTTCAATAATGAATGAAAATTCTGGGTGTAAAGAGTAAATATGCCCTGGCTCTTTCTACCAATGTTTGCTCCTGGTTGGAAAGAAACCA
AAGATTTAAGACGGGCTGCTCTTCCAGACTGGCTGTGCCTGCCTGTGCCCAGCAACCTGTGCAGCCGGCAGTGTGCCTGGTGTCACGCCA
GGAGGCTGTGGCTGCTGTGGGCCCTCTGGAATTGTGCTCCTCACAAAGTTTCCCCAAAAGGTTCTTCTAAGCCTTTATTGTCCCTGGTAA
ATGTTTCCCGGCTGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACCTAAGGTCAGGAGTTTG
AGATCAGCCTGCCCAACATGGTGAAACCTCGTCTCTACTAAAAATACACAACTTAGCCAGTCTTGTTGGCGCACGCCTGTAATCTCAGCT
ACTAGGGATGCTGAGGCAGGAGAATCGCTTGAACCCAAGAAAGAGGTGGAGGTTGCGGTGAGCCAAGATTGCGCCACTGCACTCCAGCCT
GGGCAAACAGAGGGAGACTCCATCGCCCCCCCCAACAAAAAAAAAAGTTTCCCATACACTGGCCTGCCCCAAAACCCACTAACAATTTTA
GCAAAACAGTCCAGGCCAAAGAGGAAGCATTTCATGTTCAATAAGAAACCCAGCCATTCCGCATGGCTGGTTCCTGAGTGGCTCTGGTGA
TACTCTCCAGCCACCTGCTGACATTCAGAATCTCAGACCTCGGGACTGCTGTTGCGGTACCGTGTGTCTGACACCTGCCAGCAGCCCTTT
GCTATCTGCGCGCAGGATGGGGGTGACTGCCCAGACATTCCCGCTAGATAGGCTCTGATTTCCGGGGCAGCCTTTCAGATGCGGCAGACA
TACAACACCTGTACTTTAGAGTTTTAAGGGAAAAAAAATCAGAAGTGCTGGTTAGATAGTAAAAACTTAGGATAACTTAGAAAGGCTAGT
TTTAGCTTCCTTTGTGGCTCCCTGGTGCAAAACAATTAGCAGTTATGCAATGGACCTGATTCTAGTTTATTCTAATTAAGAAGTGAGGCC
GAGTTTGACTTCGTTCCTGAATACAATCTTGAGTAACTGGGAAAGTCTGAGTGAAAGGATGGCCTCATTCTCTTTCTAATCTTGCTGGTT

>15643_15643_5_CELSR1-CERK_CELSR1_chr22_46929524_ENST00000262738_CERK_chr22_47086097_ENST00000216264_length(amino acids)=1182AA_BP=
MAPPPPPVLPVLLLLAAAAALPAMGLRAAAWEPRVPGGTRAFALRPGCTYAVGAACTPRAPRELLDVGRDGRLAGRRRVSGAGRPLPLQV
RLVARSAPTALSRRLRARTHLPGCGARARLCGTGARLCGALCFPVPGGCAAAQHSALAAPTTLPACRCPPRPRPRCPGRPICLPPGGSVR
LRLLCALRRAAGAVRVGLALEAATAGTPSASPSPSPPLPPNLPEARAGPARRARRGTSGRGSLKFPMPNYQVALFENEPAGTLILQLHAH
YTIEGEEERVSYYMEGLFDERSRGYFRIDSATGAVSTDSVLDRETKETHVLRVKAVDYSTPPRSATTYITVLVKDTNDHSPVFEQSEYRE
RVRENLEVGYEVLTIRASDRDSPINANLRYRVLGGAWDVFQLNESSGVVSTRAVLDREEAAEYQLLVEANDQGRNPGPLSATATVYIEVE
DENDNYPQFSEQNYVVQVPEDVGLNTAVLRVQATDRDQGQNAAIHYSILSGNVAGQFYLHSLSGILDVINPLDFEDVQKYSLSIKAQDGG
RPPLINSSGVVSVQVLDVNDNEPIFVSSPFQATVLENVPLGYPVVHIQAVDADSGENARLHYRLVDTASTFLGGGSAGPKNPAPTPDFPF
QIHNSSGWITVCAELDREEVEHYSFGVEAVDHGSPPMSSSTSVSITVLDVNDNDPVFTQPTYELRLNEDAAVGSSVLTLQARDRDANSVI
TYQLTGGNTRNRFALSSQRGGGLITLALPLDYKQEQQYVLAVTASDGTRSHTAHVLINVTDANTHRPVFQSSHYTVSVSEDRPVGTSIAT
LSANDEDTGENARITYVIQDPVPQFRIDPDSGTMYTMMELDYENQVAYTLTIMAQDNGIPQKSDTTTLEILILDANDNAPQFLWDFYQGS
IFEDAPPSTSILQVSATDRDSGPNGRLLYTFQGGDDGDGDFYIEPTSGVIRTQRRLDRENVAVYNLWALAVDRGSPTPLSASVEIQVTIL
DINDNAPMFEKDELELFVEENNPVGSVVAKIRANDPDEGPNAQIMYQIVEGDMRHFFQLDLLNGDLRAMVELDFEVRREYVLVVQATSAP
LVSRATVHILLVDQNDNPPVLPDFQILFNNYVTNKSNSFPTGVIGCIPAHDPDVSDSLNYTFVQGNELRLLLLDPATGELQLSRDLDNNR

--------------------------------------------------------------
>15643_15643_6_CELSR1-CERK_CELSR1_chr22_46929524_ENST00000262738_CERK_chr22_47086097_ENST00000541677_length(transcript)=3972nt_BP=3544nt
ATGGCGCCGCCGCCGCCGCCCGTGCTGCCCGTGCTGCTGCTCCTGGCCGCCGCCGCCGCCCTGCCGGCGATGGGGCTGCGAGCGGCCGCC
TGGGAGCCGCGCGTACCCGGCGGGACCCGCGCCTTCGCCCTCCGGCCCGGCTGTACCTACGCGGTGGGCGCCGCTTGCACGCCCCGGGCG
CCGCGGGAGCTGCTGGACGTGGGCCGCGATGGGCGGCTGGCAGGACGTCGGCGCGTCTCGGGCGCGGGGCGCCCGCTGCCGCTGCAAGTC
CGCTTGGTGGCCCGCAGTGCCCCGACGGCGCTGAGCCGCCGCCTGCGGGCGCGCACGCACCTTCCCGGCTGCGGAGCCCGTGCCCGGCTC
TGCGGAACCGGTGCCCGGCTCTGCGGGGCGCTCTGCTTCCCCGTCCCCGGCGGCTGCGCGGCCGCGCAGCATTCGGCGCTCGCAGCTCCG
ACCACCTTACCCGCCTGCCGCTGCCCGCCGCGCCCCAGGCCCCGCTGTCCCGGCCGTCCCATCTGCCTGCCGCCGGGCGGCTCGGTCCGC
CTGCGTCTGCTGTGCGCCCTGCGGCGCGCGGCTGGCGCCGTCCGGGTGGGACTGGCGCTGGAGGCCGCCACCGCGGGGACGCCCTCCGCG
TCGCCATCCCCATCGCCGCCCCTGCCGCCGAACTTGCCCGAAGCCCGGGCGGGGCCGGCGCGACGGGCCCGGCGGGGCACGAGCGGCAGA
GGGAGCCTGAAGTTTCCGATGCCCAACTACCAGGTGGCGTTGTTTGAGAACGAACCGGCGGGCACCCTCATCCTCCAGCTGCACGCGCAC
TACACCATCGAGGGCGAGGAGGAGCGCGTGAGCTATTACATGGAGGGGCTGTTCGACGAGCGCTCCCGGGGCTACTTCCGAATCGACTCT
GCCACGGGCGCCGTGAGCACGGACAGCGTACTGGACCGCGAGACCAAGGAGACGCACGTCCTCAGGGTGAAAGCCGTGGACTACAGTACG
CCGCCGCGCTCGGCCACCACCTACATCACTGTCTTGGTCAAAGACACCAACGACCACAGCCCGGTCTTCGAGCAGTCGGAGTACCGCGAG
CGCGTGCGGGAGAACCTGGAGGTGGGCTACGAGGTGCTGACCATCCGCGCCAGCGACCGCGACTCGCCCATCAACGCCAACTTGCGTTAC
CGCGTGTTGGGGGGCGCGTGGGACGTCTTCCAGCTCAACGAGAGCTCTGGCGTGGTGAGCACACGGGCGGTGCTGGACCGGGAGGAGGCG
GCCGAGTACCAGCTCCTGGTGGAGGCCAACGACCAGGGGCGCAATCCGGGCCCGCTCAGTGCCACGGCCACCGTGTACATCGAGGTGGAG
GACGAGAACGACAACTACCCCCAGTTCAGCGAGCAGAACTACGTGGTCCAGGTGCCCGAGGACGTGGGGCTCAACACGGCTGTGCTGCGA
GTGCAGGCCACGGACCGGGACCAGGGCCAGAACGCGGCCATTCACTACAGCATCCTCAGCGGGAACGTGGCCGGCCAGTTCTACCTGCAC
TCGCTGAGCGGGATCCTGGATGTGATCAACCCCTTGGATTTCGAGGATGTCCAGAAATACTCGCTGAGCATTAAGGCCCAGGATGGGGGC
CGGCCCCCGCTCATCAATTCTTCAGGGGTGGTGTCTGTGCAGGTGCTGGATGTCAACGACAACGAGCCTATCTTTGTGAGCAGCCCCTTC
CAGGCCACGGTGCTGGAGAATGTGCCCCTGGGCTACCCCGTGGTGCACATTCAGGCGGTGGACGCGGACTCTGGAGAGAACGCCCGGCTG
CACTATCGCCTGGTGGACACGGCCTCCACCTTTCTGGGGGGCGGCAGCGCTGGGCCTAAGAATCCTGCCCCCACCCCTGACTTCCCCTTC
CAGATCCACAACAGCTCCGGTTGGATCACAGTGTGTGCCGAGCTGGACCGCGAGGAGGTGGAGCACTACAGCTTCGGGGTGGAGGCGGTG
GACCACGGCTCGCCCCCCATGAGCTCCTCCACCAGCGTGTCCATCACGGTGCTGGACGTGAATGACAACGACCCGGTGTTCACGCAGCCC
ACCTACGAGCTTCGTCTGAATGAGGATGCGGCCGTGGGGAGCAGCGTGCTGACCCTGCAGGCCCGCGACCGTGACGCCAACAGTGTGATT
ACCTACCAGCTCACAGGCGGCAACACCCGGAACCGCTTTGCACTCAGCAGCCAGAGAGGGGGCGGCCTCATCACCCTGGCGCTACCTCTG
GACTACAAGCAGGAGCAGCAGTACGTGCTGGCGGTGACAGCATCCGACGGCACACGGTCGCACACTGCGCATGTCCTAATCAACGTCACT
GATGCCAACACCCACAGGCCTGTCTTTCAGAGCTCCCATTACACAGTGAGTGTCAGTGAGGACAGGCCTGTGGGCACCTCCATTGCTACC
CTCAGTGCCAACGATGAGGACACAGGAGAGAATGCCCGCATCACCTACGTGATTCAGGACCCCGTGCCGCAGTTCCGCATTGACCCCGAC
AGTGGCACCATGTACACCATGATGGAGCTGGACTATGAGAACCAGGTCGCCTACACGCTGACCATCATGGCCCAGGACAACGGCATCCCG
CAGAAATCAGACACCACCACCCTAGAGATCCTCATCCTCGATGCCAATGACAATGCACCCCAGTTCCTGTGGGATTTCTACCAGGGTTCC
ATCTTTGAGGATGCTCCACCCTCGACCAGCATCCTCCAGGTCTCTGCCACGGACCGGGACTCAGGTCCCAATGGGCGTCTGCTGTACACC
TTCCAGGGTGGGGACGACGGCGATGGGGACTTCTACATCGAGCCCACGTCCGGTGTGATTCGCACCCAGCGCCGGCTGGACCGGGAGAAT
GTGGCCGTGTACAACCTTTGGGCTCTGGCTGTGGATCGGGGCAGTCCCACTCCCCTTAGCGCCTCGGTAGAAATCCAGGTGACCATCTTG
GACATTAATGACAATGCCCCCATGTTTGAGAAGGACGAACTGGAGCTGTTTGTTGAGGAGAACAACCCAGTGGGGTCGGTGGTGGCAAAG
ATTCGTGCTAACGACCCTGATGAAGGCCCTAATGCCCAGATCATGTATCAGATTGTGGAAGGGGACATGCGGCATTTCTTCCAGCTGGAC
CTGCTCAACGGGGACCTGCGTGCCATGGTGGAGCTGGACTTTGAGGTCCGGCGGGAGTATGTGCTGGTGGTGCAGGCCACGTCGGCTCCG
CTGGTGAGCCGAGCCACGGTGCACATCCTTCTCGTGGACCAGAATGACAACCCGCCTGTGCTGCCCGACTTCCAGATCCTCTTCAACAAC
TATGTCACCAACAAGTCCAACAGTTTCCCCACCGGCGTGATCGGCTGCATCCCGGCCCATGACCCCGACGTGTCAGACAGCCTCAACTAC
ACCTTCGTGCAGGGCAACGAGCTGCGCCTGTTGCTGCTGGACCCCGCCACGGGCGAACTGCAGCTCAGCCGCGACCTGGACAACAACCGG
CCGCTGGAGGCGCTCATGGAGGTGTCTGTGTCTGTTTGACTTCACTTTTGTTGAAGTTTATCGCGTCAAGAAATTCCAGTTTACGTCGAA
GCACATGGAGGATGAGGACAGCGACCTCAAGGAGGGGGGGAAGAAGCGCTTTGGGCACATTTGCAGCAGCCACCCCTCCTGCTGCTGCAC
CGTCTCCAACAGCTCCTGGAACTGCGACGGGGAGGTCCTGCACAGCCCTGCCATCGAGGTCAGAGTCCACTGCCAGCTGGTTCGACTCTT
TGCACGAGGAATTGAAGAGAATCCGAAGCCAGACTCACACAGCTGAGAAGCCGGCGTCCTGCTCACAAACTGGGAAAGTGTGAAAACTAT
TTAAGATAATTATTACAGACCAATTATGTTGATATATACATTTAAATGTAGAAATTTATTTTTGATAGTTAAATCTTGATTTTAGAAGAA

>15643_15643_6_CELSR1-CERK_CELSR1_chr22_46929524_ENST00000262738_CERK_chr22_47086097_ENST00000541677_length(amino acids)=1182AA_BP=
MAPPPPPVLPVLLLLAAAAALPAMGLRAAAWEPRVPGGTRAFALRPGCTYAVGAACTPRAPRELLDVGRDGRLAGRRRVSGAGRPLPLQV
RLVARSAPTALSRRLRARTHLPGCGARARLCGTGARLCGALCFPVPGGCAAAQHSALAAPTTLPACRCPPRPRPRCPGRPICLPPGGSVR
LRLLCALRRAAGAVRVGLALEAATAGTPSASPSPSPPLPPNLPEARAGPARRARRGTSGRGSLKFPMPNYQVALFENEPAGTLILQLHAH
YTIEGEEERVSYYMEGLFDERSRGYFRIDSATGAVSTDSVLDRETKETHVLRVKAVDYSTPPRSATTYITVLVKDTNDHSPVFEQSEYRE
RVRENLEVGYEVLTIRASDRDSPINANLRYRVLGGAWDVFQLNESSGVVSTRAVLDREEAAEYQLLVEANDQGRNPGPLSATATVYIEVE
DENDNYPQFSEQNYVVQVPEDVGLNTAVLRVQATDRDQGQNAAIHYSILSGNVAGQFYLHSLSGILDVINPLDFEDVQKYSLSIKAQDGG
RPPLINSSGVVSVQVLDVNDNEPIFVSSPFQATVLENVPLGYPVVHIQAVDADSGENARLHYRLVDTASTFLGGGSAGPKNPAPTPDFPF
QIHNSSGWITVCAELDREEVEHYSFGVEAVDHGSPPMSSSTSVSITVLDVNDNDPVFTQPTYELRLNEDAAVGSSVLTLQARDRDANSVI
TYQLTGGNTRNRFALSSQRGGGLITLALPLDYKQEQQYVLAVTASDGTRSHTAHVLINVTDANTHRPVFQSSHYTVSVSEDRPVGTSIAT
LSANDEDTGENARITYVIQDPVPQFRIDPDSGTMYTMMELDYENQVAYTLTIMAQDNGIPQKSDTTTLEILILDANDNAPQFLWDFYQGS
IFEDAPPSTSILQVSATDRDSGPNGRLLYTFQGGDDGDGDFYIEPTSGVIRTQRRLDRENVAVYNLWALAVDRGSPTPLSASVEIQVTIL
DINDNAPMFEKDELELFVEENNPVGSVVAKIRANDPDEGPNAQIMYQIVEGDMRHFFQLDLLNGDLRAMVELDFEVRREYVLVVQATSAP
LVSRATVHILLVDQNDNPPVLPDFQILFNNYVTNKSNSFPTGVIGCIPAHDPDVSDSLNYTFVQGNELRLLLLDPATGELQLSRDLDNNR

--------------------------------------------------------------
>15643_15643_7_CELSR1-CERK_CELSR1_chr22_46929524_ENST00000395964_CERK_chr22_47086097_ENST00000216264_length(transcript)=6549nt_BP=3544nt
ATGGCGCCGCCGCCGCCGCCCGTGCTGCCCGTGCTGCTGCTCCTGGCCGCCGCCGCCGCCCTGCCGGCGATGGGGCTGCGAGCGGCCGCC
TGGGAGCCGCGCGTACCCGGCGGGACCCGCGCCTTCGCCCTCCGGCCCGGCTGTACCTACGCGGTGGGCGCCGCTTGCACGCCCCGGGCG
CCGCGGGAGCTGCTGGACGTGGGCCGCGATGGGCGGCTGGCAGGACGTCGGCGCGTCTCGGGCGCGGGGCGCCCGCTGCCGCTGCAAGTC
CGCTTGGTGGCCCGCAGTGCCCCGACGGCGCTGAGCCGCCGCCTGCGGGCGCGCACGCACCTTCCCGGCTGCGGAGCCCGTGCCCGGCTC
TGCGGAACCGGTGCCCGGCTCTGCGGGGCGCTCTGCTTCCCCGTCCCCGGCGGCTGCGCGGCCGCGCAGCATTCGGCGCTCGCAGCTCCG
ACCACCTTACCCGCCTGCCGCTGCCCGCCGCGCCCCAGGCCCCGCTGTCCCGGCCGTCCCATCTGCCTGCCGCCGGGCGGCTCGGTCCGC
CTGCGTCTGCTGTGCGCCCTGCGGCGCGCGGCTGGCGCCGTCCGGGTGGGACTGGCGCTGGAGGCCGCCACCGCGGGGACGCCCTCCGCG
TCGCCATCCCCATCGCCGCCCCTGCCGCCGAACTTGCCCGAAGCCCGGGCGGGGCCGGCGCGACGGGCCCGGCGGGGCACGAGCGGCAGA
GGGAGCCTGAAGTTTCCGATGCCCAACTACCAGGTGGCGTTGTTTGAGAACGAACCGGCGGGCACCCTCATCCTCCAGCTGCACGCGCAC
TACACCATCGAGGGCGAGGAGGAGCGCGTGAGCTATTACATGGAGGGGCTGTTCGACGAGCGCTCCCGGGGCTACTTCCGAATCGACTCT
GCCACGGGCGCCGTGAGCACGGACAGCGTACTGGACCGCGAGACCAAGGAGACGCACGTCCTCAGGGTGAAAGCCGTGGACTACAGTACG
CCGCCGCGCTCGGCCACCACCTACATCACTGTCTTGGTCAAAGACACCAACGACCACAGCCCGGTCTTCGAGCAGTCGGAGTACCGCGAG
CGCGTGCGGGAGAACCTGGAGGTGGGCTACGAGGTGCTGACCATCCGCGCCAGCGACCGCGACTCGCCCATCAACGCCAACTTGCGTTAC
CGCGTGTTGGGGGGCGCGTGGGACGTCTTCCAGCTCAACGAGAGCTCTGGCGTGGTGAGCACACGGGCGGTGCTGGACCGGGAGGAGGCG
GCCGAGTACCAGCTCCTGGTGGAGGCCAACGACCAGGGGCGCAATCCGGGCCCGCTCAGTGCCACGGCCACCGTGTACATCGAGGTGGAG
GACGAGAACGACAACTACCCCCAGTTCAGCGAGCAGAACTACGTGGTCCAGGTGCCCGAGGACGTGGGGCTCAACACGGCTGTGCTGCGA
GTGCAGGCCACGGACCGGGACCAGGGCCAGAACGCGGCCATTCACTACAGCATCCTCAGCGGGAACGTGGCCGGCCAGTTCTACCTGCAC
TCGCTGAGCGGGATCCTGGATGTGATCAACCCCTTGGATTTCGAGGATGTCCAGAAATACTCGCTGAGCATTAAGGCCCAGGATGGGGGC
CGGCCCCCGCTCATCAATTCTTCAGGGGTGGTGTCTGTGCAGGTGCTGGATGTCAACGACAACGAGCCTATCTTTGTGAGCAGCCCCTTC
CAGGCCACGGTGCTGGAGAATGTGCCCCTGGGCTACCCCGTGGTGCACATTCAGGCGGTGGACGCGGACTCTGGAGAGAACGCCCGGCTG
CACTATCGCCTGGTGGACACGGCCTCCACCTTTCTGGGGGGCGGCAGCGCTGGGCCTAAGAATCCTGCCCCCACCCCTGACTTCCCCTTC
CAGATCCACAACAGCTCCGGTTGGATCACAGTGTGTGCCGAGCTGGACCGCGAGGAGGTGGAGCACTACAGCTTCGGGGTGGAGGCGGTG
GACCACGGCTCGCCCCCCATGAGCTCCTCCACCAGCGTGTCCATCACGGTGCTGGACGTGAATGACAACGACCCGGTGTTCACGCAGCCC
ACCTACGAGCTTCGTCTGAATGAGGATGCGGCCGTGGGGAGCAGCGTGCTGACCCTGCAGGCCCGCGACCGTGACGCCAACAGTGTGATT
ACCTACCAGCTCACAGGCGGCAACACCCGGAACCGCTTTGCACTCAGCAGCCAGAGAGGGGGCGGCCTCATCACCCTGGCGCTACCTCTG
GACTACAAGCAGGAGCAGCAGTACGTGCTGGCGGTGACAGCATCCGACGGCACACGGTCGCACACTGCGCATGTCCTAATCAACGTCACT
GATGCCAACACCCACAGGCCTGTCTTTCAGAGCTCCCATTACACAGTGAGTGTCAGTGAGGACAGGCCTGTGGGCACCTCCATTGCTACC
CTCAGTGCCAACGATGAGGACACAGGAGAGAATGCCCGCATCACCTACGTGATTCAGGACCCCGTGCCGCAGTTCCGCATTGACCCCGAC
AGTGGCACCATGTACACCATGATGGAGCTGGACTATGAGAACCAGGTCGCCTACACGCTGACCATCATGGCCCAGGACAACGGCATCCCG
CAGAAATCAGACACCACCACCCTAGAGATCCTCATCCTCGATGCCAATGACAATGCACCCCAGTTCCTGTGGGATTTCTACCAGGGTTCC
ATCTTTGAGGATGCTCCACCCTCGACCAGCATCCTCCAGGTCTCTGCCACGGACCGGGACTCAGGTCCCAATGGGCGTCTGCTGTACACC
TTCCAGGGTGGGGACGACGGCGATGGGGACTTCTACATCGAGCCCACGTCCGGTGTGATTCGCACCCAGCGCCGGCTGGACCGGGAGAAT
GTGGCCGTGTACAACCTTTGGGCTCTGGCTGTGGATCGGGGCAGTCCCACTCCCCTTAGCGCCTCGGTAGAAATCCAGGTGACCATCTTG
GACATTAATGACAATGCCCCCATGTTTGAGAAGGACGAACTGGAGCTGTTTGTTGAGGAGAACAACCCAGTGGGGTCGGTGGTGGCAAAG
ATTCGTGCTAACGACCCTGATGAAGGCCCTAATGCCCAGATCATGTATCAGATTGTGGAAGGGGACATGCGGCATTTCTTCCAGCTGGAC
CTGCTCAACGGGGACCTGCGTGCCATGGTGGAGCTGGACTTTGAGGTCCGGCGGGAGTATGTGCTGGTGGTGCAGGCCACGTCGGCTCCG
CTGGTGAGCCGAGCCACGGTGCACATCCTTCTCGTGGACCAGAATGACAACCCGCCTGTGCTGCCCGACTTCCAGATCCTCTTCAACAAC
TATGTCACCAACAAGTCCAACAGTTTCCCCACCGGCGTGATCGGCTGCATCCCGGCCCATGACCCCGACGTGTCAGACAGCCTCAACTAC
ACCTTCGTGCAGGGCAACGAGCTGCGCCTGTTGCTGCTGGACCCCGCCACGGGCGAACTGCAGCTCAGCCGCGACCTGGACAACAACCGG
CCGCTGGAGGCGCTCATGGAGGTGTCTGTGTCTGTTTGACTTCACTTTTGTTGAAGTTTATCGCGTCAAGAAATTCCAGTTTACGTCGAA
GCACATGGAGGATGAGGACAGCGACCTCAAGGAGGGGGGGAAGAAGCGCTTTGGGCACATTTGCAGCAGCCACCCCTCCTGCTGCTGCAC
CGTCTCCAACAGCTCCTGGAACTGCGACGGGGAGGTCCTGCACAGCCCTGCCATCGAGGTCAGAGTCCACTGCCAGCTGGTTCGACTCTT
TGCACGAGGAATTGAAGAGAATCCGAAGCCAGACTCACACAGCTGAGAAGCCGGCGTCCTGCTCACAAACTGGGAAAGTGTGAAAACTAT
TTAAGATAATTATTACAGACCAATTATGTTGATATATACATTTAAATGTAGAAATTTATTTTTGATAGTTAAATCTTGATTTTAGAAGAA
AACCCTTTTGTCAACAATTTTGTGTACATATTTGGCATTTTCAGTTCTGTACGCATCTGCGGGTTGCAGCCCACGCCGCTTACTCTCAGC
GGATGCAGCTGCTCACTTGGGGGCACTGGCCTCTTAGGTTTTAACGATGTCAACAGTGTAGTTTAGAAAATGGCCCGTTAGTGGCTCTAT
TGCAATAATGTTAGGGACATTATATGATTTCCACGCAGGTCACACCATCTGGGCCTGAGGTAGCAGTGGGTCACTTTGATCCACTTTGCA
GGACTTATTCTGTAACGGTTTGTGGCCAAGTTTTGGGAAGTGGTTGATTCTCTTTGCCTTCATTTCACCTTCCTCTTCGTTTACGGTTAG
GACATCGCTGCTTGATCCTTACAATACTGTGCAACTGCAATGCAACGTGGCCCTGCTTCAGGTGATCCGCGGGAGGGGCCTCCACGCCAG
CGCCGGGAAGGCTGCTGGGGCCTCCACACCTGCCTCATCACGGCGGCGAGGCTACGACAATCCGGCTGGGAGCATGACCTTGGCGTCTGT
TCTGGGAGCACGGATGATAAGCTCTGGAAGCTGGCAGTGTGTAAAGCACTGGCAAGTTTGTTACTGTTAAAATGTCAAATACCAATGCTT
TATATCGACGCGAAGTGCTTAACACAGCCGGGCTTGGGGGCAGTCAGGAGGAAGCTGGCCATCCGTGGAGGAGGGGCCGGTCCTGGACTC
CCGCAGGACTCCTCTGAGGCAGGGCCTGAAGTCTGTACACGTGGTCCAGATTTGTCCTTGTCTTTTCTTCACACTGAGTTCTCTATATTT
ATTGAACATCTTGTCCTTTTAAGCCAGAGTAGTGTAAACTGCGTCTCGGATGTCTGTCTTTTGCCTCGAAGCCACGATGGATCGCTGGTT
TCCTCTGCAGCGCGAGGGCTCCGGCGACCAGAGGATTCTTCCCGGAAGGCATTCCTGCCGCGCTCCCCGGGGCACCCCTCAATTGTGTAC
TACGTCCTTGTTTAGTGTGTATCCGTGCCCACGTAGATGATGTCTGTAACGTAGTTTTGTTTGAAATATGAGAATATGCGGCTTAAACTT
TGATCTGTAAGGAGCGGGGCCGTGGCCGTTTGGAGCACGCTGTAGACACCGTTCCTCATGCTGCCGGGTGGGTTTTGCAGAAGCTCCCTT
AGTGATTTCATGTTTAACAGGCAGCATCCATTTTCAGAATTTCCTGGCATTGATTTATATTTTGAAGCATACAGGAAACTTCTCGTTTCC
TCGTTTAGCCCCACCCAGATCAGGTGAAAGGGCAGCTTTAATGGTGGTTTTTATGGACCACATTATCAGAGAGCACTGTGCAAGCCAAAT
GGTTCAATAATGAATGAAAATTCTGGGTGTAAAGAGTAAATATGCCCTGGCTCTTTCTACCAATGTTTGCTCCTGGTTGGAAAGAAACCA
AAGATTTAAGACGGGCTGCTCTTCCAGACTGGCTGTGCCTGCCTGTGCCCAGCAACCTGTGCAGCCGGCAGTGTGCCTGGTGTCACGCCA
GGAGGCTGTGGCTGCTGTGGGCCCTCTGGAATTGTGCTCCTCACAAAGTTTCCCCAAAAGGTTCTTCTAAGCCTTTATTGTCCCTGGTAA
ATGTTTCCCGGCTGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACCTAAGGTCAGGAGTTTG
AGATCAGCCTGCCCAACATGGTGAAACCTCGTCTCTACTAAAAATACACAACTTAGCCAGTCTTGTTGGCGCACGCCTGTAATCTCAGCT
ACTAGGGATGCTGAGGCAGGAGAATCGCTTGAACCCAAGAAAGAGGTGGAGGTTGCGGTGAGCCAAGATTGCGCCACTGCACTCCAGCCT
GGGCAAACAGAGGGAGACTCCATCGCCCCCCCCAACAAAAAAAAAAGTTTCCCATACACTGGCCTGCCCCAAAACCCACTAACAATTTTA
GCAAAACAGTCCAGGCCAAAGAGGAAGCATTTCATGTTCAATAAGAAACCCAGCCATTCCGCATGGCTGGTTCCTGAGTGGCTCTGGTGA
TACTCTCCAGCCACCTGCTGACATTCAGAATCTCAGACCTCGGGACTGCTGTTGCGGTACCGTGTGTCTGACACCTGCCAGCAGCCCTTT
GCTATCTGCGCGCAGGATGGGGGTGACTGCCCAGACATTCCCGCTAGATAGGCTCTGATTTCCGGGGCAGCCTTTCAGATGCGGCAGACA
TACAACACCTGTACTTTAGAGTTTTAAGGGAAAAAAAATCAGAAGTGCTGGTTAGATAGTAAAAACTTAGGATAACTTAGAAAGGCTAGT
TTTAGCTTCCTTTGTGGCTCCCTGGTGCAAAACAATTAGCAGTTATGCAATGGACCTGATTCTAGTTTATTCTAATTAAGAAGTGAGGCC
GAGTTTGACTTCGTTCCTGAATACAATCTTGAGTAACTGGGAAAGTCTGAGTGAAAGGATGGCCTCATTCTCTTTCTAATCTTGCTGGTT

>15643_15643_7_CELSR1-CERK_CELSR1_chr22_46929524_ENST00000395964_CERK_chr22_47086097_ENST00000216264_length(amino acids)=1182AA_BP=
MAPPPPPVLPVLLLLAAAAALPAMGLRAAAWEPRVPGGTRAFALRPGCTYAVGAACTPRAPRELLDVGRDGRLAGRRRVSGAGRPLPLQV
RLVARSAPTALSRRLRARTHLPGCGARARLCGTGARLCGALCFPVPGGCAAAQHSALAAPTTLPACRCPPRPRPRCPGRPICLPPGGSVR
LRLLCALRRAAGAVRVGLALEAATAGTPSASPSPSPPLPPNLPEARAGPARRARRGTSGRGSLKFPMPNYQVALFENEPAGTLILQLHAH
YTIEGEEERVSYYMEGLFDERSRGYFRIDSATGAVSTDSVLDRETKETHVLRVKAVDYSTPPRSATTYITVLVKDTNDHSPVFEQSEYRE
RVRENLEVGYEVLTIRASDRDSPINANLRYRVLGGAWDVFQLNESSGVVSTRAVLDREEAAEYQLLVEANDQGRNPGPLSATATVYIEVE
DENDNYPQFSEQNYVVQVPEDVGLNTAVLRVQATDRDQGQNAAIHYSILSGNVAGQFYLHSLSGILDVINPLDFEDVQKYSLSIKAQDGG
RPPLINSSGVVSVQVLDVNDNEPIFVSSPFQATVLENVPLGYPVVHIQAVDADSGENARLHYRLVDTASTFLGGGSAGPKNPAPTPDFPF
QIHNSSGWITVCAELDREEVEHYSFGVEAVDHGSPPMSSSTSVSITVLDVNDNDPVFTQPTYELRLNEDAAVGSSVLTLQARDRDANSVI
TYQLTGGNTRNRFALSSQRGGGLITLALPLDYKQEQQYVLAVTASDGTRSHTAHVLINVTDANTHRPVFQSSHYTVSVSEDRPVGTSIAT
LSANDEDTGENARITYVIQDPVPQFRIDPDSGTMYTMMELDYENQVAYTLTIMAQDNGIPQKSDTTTLEILILDANDNAPQFLWDFYQGS
IFEDAPPSTSILQVSATDRDSGPNGRLLYTFQGGDDGDGDFYIEPTSGVIRTQRRLDRENVAVYNLWALAVDRGSPTPLSASVEIQVTIL
DINDNAPMFEKDELELFVEENNPVGSVVAKIRANDPDEGPNAQIMYQIVEGDMRHFFQLDLLNGDLRAMVELDFEVRREYVLVVQATSAP
LVSRATVHILLVDQNDNPPVLPDFQILFNNYVTNKSNSFPTGVIGCIPAHDPDVSDSLNYTFVQGNELRLLLLDPATGELQLSRDLDNNR

--------------------------------------------------------------
>15643_15643_8_CELSR1-CERK_CELSR1_chr22_46929524_ENST00000395964_CERK_chr22_47086097_ENST00000541677_length(transcript)=3972nt_BP=3544nt
ATGGCGCCGCCGCCGCCGCCCGTGCTGCCCGTGCTGCTGCTCCTGGCCGCCGCCGCCGCCCTGCCGGCGATGGGGCTGCGAGCGGCCGCC
TGGGAGCCGCGCGTACCCGGCGGGACCCGCGCCTTCGCCCTCCGGCCCGGCTGTACCTACGCGGTGGGCGCCGCTTGCACGCCCCGGGCG
CCGCGGGAGCTGCTGGACGTGGGCCGCGATGGGCGGCTGGCAGGACGTCGGCGCGTCTCGGGCGCGGGGCGCCCGCTGCCGCTGCAAGTC
CGCTTGGTGGCCCGCAGTGCCCCGACGGCGCTGAGCCGCCGCCTGCGGGCGCGCACGCACCTTCCCGGCTGCGGAGCCCGTGCCCGGCTC
TGCGGAACCGGTGCCCGGCTCTGCGGGGCGCTCTGCTTCCCCGTCCCCGGCGGCTGCGCGGCCGCGCAGCATTCGGCGCTCGCAGCTCCG
ACCACCTTACCCGCCTGCCGCTGCCCGCCGCGCCCCAGGCCCCGCTGTCCCGGCCGTCCCATCTGCCTGCCGCCGGGCGGCTCGGTCCGC
CTGCGTCTGCTGTGCGCCCTGCGGCGCGCGGCTGGCGCCGTCCGGGTGGGACTGGCGCTGGAGGCCGCCACCGCGGGGACGCCCTCCGCG
TCGCCATCCCCATCGCCGCCCCTGCCGCCGAACTTGCCCGAAGCCCGGGCGGGGCCGGCGCGACGGGCCCGGCGGGGCACGAGCGGCAGA
GGGAGCCTGAAGTTTCCGATGCCCAACTACCAGGTGGCGTTGTTTGAGAACGAACCGGCGGGCACCCTCATCCTCCAGCTGCACGCGCAC
TACACCATCGAGGGCGAGGAGGAGCGCGTGAGCTATTACATGGAGGGGCTGTTCGACGAGCGCTCCCGGGGCTACTTCCGAATCGACTCT
GCCACGGGCGCCGTGAGCACGGACAGCGTACTGGACCGCGAGACCAAGGAGACGCACGTCCTCAGGGTGAAAGCCGTGGACTACAGTACG
CCGCCGCGCTCGGCCACCACCTACATCACTGTCTTGGTCAAAGACACCAACGACCACAGCCCGGTCTTCGAGCAGTCGGAGTACCGCGAG
CGCGTGCGGGAGAACCTGGAGGTGGGCTACGAGGTGCTGACCATCCGCGCCAGCGACCGCGACTCGCCCATCAACGCCAACTTGCGTTAC
CGCGTGTTGGGGGGCGCGTGGGACGTCTTCCAGCTCAACGAGAGCTCTGGCGTGGTGAGCACACGGGCGGTGCTGGACCGGGAGGAGGCG
GCCGAGTACCAGCTCCTGGTGGAGGCCAACGACCAGGGGCGCAATCCGGGCCCGCTCAGTGCCACGGCCACCGTGTACATCGAGGTGGAG
GACGAGAACGACAACTACCCCCAGTTCAGCGAGCAGAACTACGTGGTCCAGGTGCCCGAGGACGTGGGGCTCAACACGGCTGTGCTGCGA
GTGCAGGCCACGGACCGGGACCAGGGCCAGAACGCGGCCATTCACTACAGCATCCTCAGCGGGAACGTGGCCGGCCAGTTCTACCTGCAC
TCGCTGAGCGGGATCCTGGATGTGATCAACCCCTTGGATTTCGAGGATGTCCAGAAATACTCGCTGAGCATTAAGGCCCAGGATGGGGGC
CGGCCCCCGCTCATCAATTCTTCAGGGGTGGTGTCTGTGCAGGTGCTGGATGTCAACGACAACGAGCCTATCTTTGTGAGCAGCCCCTTC
CAGGCCACGGTGCTGGAGAATGTGCCCCTGGGCTACCCCGTGGTGCACATTCAGGCGGTGGACGCGGACTCTGGAGAGAACGCCCGGCTG
CACTATCGCCTGGTGGACACGGCCTCCACCTTTCTGGGGGGCGGCAGCGCTGGGCCTAAGAATCCTGCCCCCACCCCTGACTTCCCCTTC
CAGATCCACAACAGCTCCGGTTGGATCACAGTGTGTGCCGAGCTGGACCGCGAGGAGGTGGAGCACTACAGCTTCGGGGTGGAGGCGGTG
GACCACGGCTCGCCCCCCATGAGCTCCTCCACCAGCGTGTCCATCACGGTGCTGGACGTGAATGACAACGACCCGGTGTTCACGCAGCCC
ACCTACGAGCTTCGTCTGAATGAGGATGCGGCCGTGGGGAGCAGCGTGCTGACCCTGCAGGCCCGCGACCGTGACGCCAACAGTGTGATT
ACCTACCAGCTCACAGGCGGCAACACCCGGAACCGCTTTGCACTCAGCAGCCAGAGAGGGGGCGGCCTCATCACCCTGGCGCTACCTCTG
GACTACAAGCAGGAGCAGCAGTACGTGCTGGCGGTGACAGCATCCGACGGCACACGGTCGCACACTGCGCATGTCCTAATCAACGTCACT
GATGCCAACACCCACAGGCCTGTCTTTCAGAGCTCCCATTACACAGTGAGTGTCAGTGAGGACAGGCCTGTGGGCACCTCCATTGCTACC
CTCAGTGCCAACGATGAGGACACAGGAGAGAATGCCCGCATCACCTACGTGATTCAGGACCCCGTGCCGCAGTTCCGCATTGACCCCGAC
AGTGGCACCATGTACACCATGATGGAGCTGGACTATGAGAACCAGGTCGCCTACACGCTGACCATCATGGCCCAGGACAACGGCATCCCG
CAGAAATCAGACACCACCACCCTAGAGATCCTCATCCTCGATGCCAATGACAATGCACCCCAGTTCCTGTGGGATTTCTACCAGGGTTCC
ATCTTTGAGGATGCTCCACCCTCGACCAGCATCCTCCAGGTCTCTGCCACGGACCGGGACTCAGGTCCCAATGGGCGTCTGCTGTACACC
TTCCAGGGTGGGGACGACGGCGATGGGGACTTCTACATCGAGCCCACGTCCGGTGTGATTCGCACCCAGCGCCGGCTGGACCGGGAGAAT
GTGGCCGTGTACAACCTTTGGGCTCTGGCTGTGGATCGGGGCAGTCCCACTCCCCTTAGCGCCTCGGTAGAAATCCAGGTGACCATCTTG
GACATTAATGACAATGCCCCCATGTTTGAGAAGGACGAACTGGAGCTGTTTGTTGAGGAGAACAACCCAGTGGGGTCGGTGGTGGCAAAG
ATTCGTGCTAACGACCCTGATGAAGGCCCTAATGCCCAGATCATGTATCAGATTGTGGAAGGGGACATGCGGCATTTCTTCCAGCTGGAC
CTGCTCAACGGGGACCTGCGTGCCATGGTGGAGCTGGACTTTGAGGTCCGGCGGGAGTATGTGCTGGTGGTGCAGGCCACGTCGGCTCCG
CTGGTGAGCCGAGCCACGGTGCACATCCTTCTCGTGGACCAGAATGACAACCCGCCTGTGCTGCCCGACTTCCAGATCCTCTTCAACAAC
TATGTCACCAACAAGTCCAACAGTTTCCCCACCGGCGTGATCGGCTGCATCCCGGCCCATGACCCCGACGTGTCAGACAGCCTCAACTAC
ACCTTCGTGCAGGGCAACGAGCTGCGCCTGTTGCTGCTGGACCCCGCCACGGGCGAACTGCAGCTCAGCCGCGACCTGGACAACAACCGG
CCGCTGGAGGCGCTCATGGAGGTGTCTGTGTCTGTTTGACTTCACTTTTGTTGAAGTTTATCGCGTCAAGAAATTCCAGTTTACGTCGAA
GCACATGGAGGATGAGGACAGCGACCTCAAGGAGGGGGGGAAGAAGCGCTTTGGGCACATTTGCAGCAGCCACCCCTCCTGCTGCTGCAC
CGTCTCCAACAGCTCCTGGAACTGCGACGGGGAGGTCCTGCACAGCCCTGCCATCGAGGTCAGAGTCCACTGCCAGCTGGTTCGACTCTT
TGCACGAGGAATTGAAGAGAATCCGAAGCCAGACTCACACAGCTGAGAAGCCGGCGTCCTGCTCACAAACTGGGAAAGTGTGAAAACTAT
TTAAGATAATTATTACAGACCAATTATGTTGATATATACATTTAAATGTAGAAATTTATTTTTGATAGTTAAATCTTGATTTTAGAAGAA

>15643_15643_8_CELSR1-CERK_CELSR1_chr22_46929524_ENST00000395964_CERK_chr22_47086097_ENST00000541677_length(amino acids)=1182AA_BP=
MAPPPPPVLPVLLLLAAAAALPAMGLRAAAWEPRVPGGTRAFALRPGCTYAVGAACTPRAPRELLDVGRDGRLAGRRRVSGAGRPLPLQV
RLVARSAPTALSRRLRARTHLPGCGARARLCGTGARLCGALCFPVPGGCAAAQHSALAAPTTLPACRCPPRPRPRCPGRPICLPPGGSVR
LRLLCALRRAAGAVRVGLALEAATAGTPSASPSPSPPLPPNLPEARAGPARRARRGTSGRGSLKFPMPNYQVALFENEPAGTLILQLHAH
YTIEGEEERVSYYMEGLFDERSRGYFRIDSATGAVSTDSVLDRETKETHVLRVKAVDYSTPPRSATTYITVLVKDTNDHSPVFEQSEYRE
RVRENLEVGYEVLTIRASDRDSPINANLRYRVLGGAWDVFQLNESSGVVSTRAVLDREEAAEYQLLVEANDQGRNPGPLSATATVYIEVE
DENDNYPQFSEQNYVVQVPEDVGLNTAVLRVQATDRDQGQNAAIHYSILSGNVAGQFYLHSLSGILDVINPLDFEDVQKYSLSIKAQDGG
RPPLINSSGVVSVQVLDVNDNEPIFVSSPFQATVLENVPLGYPVVHIQAVDADSGENARLHYRLVDTASTFLGGGSAGPKNPAPTPDFPF
QIHNSSGWITVCAELDREEVEHYSFGVEAVDHGSPPMSSSTSVSITVLDVNDNDPVFTQPTYELRLNEDAAVGSSVLTLQARDRDANSVI
TYQLTGGNTRNRFALSSQRGGGLITLALPLDYKQEQQYVLAVTASDGTRSHTAHVLINVTDANTHRPVFQSSHYTVSVSEDRPVGTSIAT
LSANDEDTGENARITYVIQDPVPQFRIDPDSGTMYTMMELDYENQVAYTLTIMAQDNGIPQKSDTTTLEILILDANDNAPQFLWDFYQGS
IFEDAPPSTSILQVSATDRDSGPNGRLLYTFQGGDDGDGDFYIEPTSGVIRTQRRLDRENVAVYNLWALAVDRGSPTPLSASVEIQVTIL
DINDNAPMFEKDELELFVEENNPVGSVVAKIRANDPDEGPNAQIMYQIVEGDMRHFFQLDLLNGDLRAMVELDFEVRREYVLVVQATSAP
LVSRATVHILLVDQNDNPPVLPDFQILFNNYVTNKSNSFPTGVIGCIPAHDPDVSDSLNYTFVQGNELRLLLLDPATGELQLSRDLDNNR

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for CELSR1-CERK


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for CELSR1-CERK


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for CELSR1-CERK


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource