Fusion Gene Studies
in Kim Lab

FusionBase FusionGDB FusionGDB2 FusionPDB FusionNeoAntigen FusionAI FusionNW FGviewer Publication Contact
FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:CAD-ALK (FusionGDB2 ID:HG790TG238)

Fusion Gene Summary for CAD-ALK

check button Fusion gene summary
Fusion gene informationFusion gene name: CAD-ALK
Fusion gene ID: hg790tg238
HgeneTgene
Gene symbol

CAD

ALK

Gene ID

790

238

Gene namecarbamoyl-phosphate synthetase 2, aspartate transcarbamylase, and dihydroorotaseALK receptor tyrosine kinase
SynonymsCDG1Z|EIEE50|GATD4CD246|NBLST3
Cytomap('CAD')('ALK')

2p23.3

2p23.2-p23.1

Type of geneprotein-codingprotein-coding
DescriptionCAD proteinCAD trifunctional proteinmultifunctional protein CADALK tyrosine kinase receptorCD246 antigenanaplastic lymphoma receptor tyrosine kinasemutant anaplastic lymphoma kinase
Modification date2020031320200329
UniProtAcc

P27708

Q9UM73

Ensembl transtripts involved in fusion geneENST00000264705, ENST00000403525, 
ENST00000464159, 
Fusion gene scores* DoF score3 X 3 X 2=1856 X 74 X 20=82880
# samples 357
** MAII scorelog2(3/18*10)=0.736965594166206
effective Gene in Pan-Cancer Fusion Genes (eGinPCFGs).
DoF>8 and MAII>0
log2(57/82880*10)=-7.18391827352181
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: CAD [Title/Abstract] AND ALK [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpoint
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
HgeneCAD

GO:0006207

'de novo' pyrimidine nucleobase biosynthetic process

24332717

TgeneALK

GO:0016310

phosphorylation

9174053

TgeneALK

GO:0046777

protein autophosphorylation

9174053



check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerKB4..CADchr2

27463229

+ALKchr2

29446394

-


Top

Fusion Gene ORF analysis for CAD-ALK

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
5CDS-intronENST00000264705ENST00000431873CADchr2

27463229

+ALKchr2

29446394

-
5CDS-intronENST00000264705ENST00000498037CADchr2

27463229

+ALKchr2

29446394

-
5CDS-intronENST00000403525ENST00000431873CADchr2

27463229

+ALKchr2

29446394

-
5CDS-intronENST00000403525ENST00000498037CADchr2

27463229

+ALKchr2

29446394

-
In-frameENST00000264705ENST00000389048CADchr2

27463229

+ALKchr2

29446394

-
In-frameENST00000403525ENST00000389048CADchr2

27463229

+ALKchr2

29446394

-
intron-3CDSENST00000464159ENST00000389048CADchr2

27463229

+ALKchr2

29446394

-
intron-intronENST00000464159ENST00000431873CADchr2

27463229

+ALKchr2

29446394

-
intron-intronENST00000464159ENST00000498037CADchr2

27463229

+ALKchr2

29446394

-

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000264705CADchr227463229+ENST00000389048ALKchr229446394-7896575514774452432
ENST00000403525CADchr227463229+ENST00000389048ALKchr229446394-7689554812972382369

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score

Top

Fusion Genomic Features for CAD-ALK


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)


Top

Fusion Protein Features for CAD-ALK


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr2:/chr2:)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
CAD

P27708

ALK

Q9UM73

FUNCTION: This protein is a 'fusion' protein encoding four enzymatic activities of the pyrimidine pathway (GATase, CPSase, ATCase and DHOase). {ECO:0000269|PubMed:24332717}.FUNCTION: Neuronal receptor tyrosine kinase that is essentially and transiently expressed in specific regions of the central and peripheral nervous systems and plays an important role in the genesis and differentiation of the nervous system. Transduces signals from ligands at the cell surface, through specific activation of the mitogen-activated protein kinase (MAPK) pathway. Phosphorylates almost exclusively at the first tyrosine of the Y-x-x-x-Y-Y motif. Following activation by ligand, ALK induces tyrosine phosphorylation of CBL, FRS2, IRS1 and SHC1, as well as of the MAP kinases MAPK1/ERK2 and MAPK3/ERK1. Acts as a receptor for ligands pleiotrophin (PTN), a secreted growth factor, and midkine (MDK), a PTN-related factor, thus participating in PTN and MDK signal transduction. PTN-binding induces MAPK pathway activation, which is important for the anti-apoptotic signaling of PTN and regulation of cell proliferation. MDK-binding induces phosphorylation of the ALK target insulin receptor substrate (IRS1), activates mitogen-activated protein kinases (MAPKs) and PI3-kinase, resulting also in cell proliferation induction. Drives NF-kappa-B activation, probably through IRS1 and the activation of the AKT serine/threonine kinase. Recruitment of IRS1 to activated ALK and the activation of NF-kappa-B are essential for the autocrine growth and survival signaling of MDK. Thinness gene involved in the resistance to weight gain: in hypothalamic neurons, controls energy expenditure acting as a negative regulator of white adipose tissue lipolysis and sympathetic tone to fine-tune energy homeostasis (By similarity). {ECO:0000250|UniProtKB:P97793, ECO:0000269|PubMed:11121404, ECO:0000269|PubMed:11278720, ECO:0000269|PubMed:11387242, ECO:0000269|PubMed:11809760, ECO:0000269|PubMed:12107166, ECO:0000269|PubMed:12122009, ECO:0000269|PubMed:15226403, ECO:0000269|PubMed:15908427, ECO:0000269|PubMed:16317043, ECO:0000269|PubMed:16878150, ECO:0000269|PubMed:17274988}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note


Top

Fusion Gene Sequence for CAD-ALK


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>12437_12437_1_CAD-ALK_CAD_chr2_27463229_ENST00000264705_ALK_chr2_29446394_ENST00000389048_length(transcript)=7896nt_BP=5755nt
GCGCGCCCGAGGCTCCTACGCTGCCGCGCCCGGCTTCTCTCCAGCGCCCCGCGCCGTTAGCCACGTGGACCGACTCCGGCGCGCCGTCCT
CACGTGGTTCCAGTGGAGTTTGCAGTCCTTCCCGCTTCTCCGTACTCGCCCCCGCCTCTGAGCTCCCTTCCCATGGCGGCCCTAGTGTTG
GAGGACGGGTCGGTCCTGCGGGGCCAGCCCTTTGGGGCCGCCGTGTCGACTGCCGGGGAAGTGGTGTTTCAAACCGGCATGGTCGGCTAC
CCCGAGGCCCTCACCGATCCCTCCTACAAGGCACAGATCTTAGTGCTCACCTATCCTCTGATCGGCAACTATGGCATCCCCCCAGATGAA
ATGGATGAGTTCGGTCTCTGCAAGTGGTTTGAATCCTCGGGCATCCACGTAGCAGCACTGGTAGTGGGAGAGTGCTGTCCTACTCCCAGC
CACTGGAGTGCCACCCGCACCCTGCATGAGTGGCTGCAGCAGCATGGCATCCCTGGCTTGCAAGGAGTAGACACTCGGGAGCTGACCAAG
AAGTTGCGGGAACAGGGGTCTCTGCTGGGGAAGCTGGTCCAGAATGGAACAGAACCTTCATCCCTGCCATTCTTGGACCCCAATGCCCGC
CCCCTGGTACCAGAGGTCTCCATTAAGACTCCACGGGTATTCAATACAGGGGGTGCCCCTCGGATCCTTGCTTTGGACTGTGGCCTCAAG
TATAATCAGATCCGATGCCTCTGCCAGCGTGGGGCTGAGGTCACTGTGGTACCCTGGGACCATGCACTAGACAGCCAAGAGTATGAGGGT
CTCTTCTTAAGTAATGGGCCTGGTGACCCTGCCTCCTATCCCAGTGTCGTATCCACACTGAGCCGTGTTTTATCTGAGCCTAATCCCCGA
CCTGTCTTTGGGATCTGCCTGGGACACCAGCTATTGGCCTTAGCCATTGGGGCCAAGACTTACAAGATGAGATATGGGAACCGAGGCCAT
AACCAGCCCTGCTTGTTGGTGGGCTCTGGGCGCTGCTTTCTGACATCCCAGAACCATGGGTTTGCTGTGGAGACAGACTCACTGCCAGCA
GACTGGGCTCCTCTCTTCACCAACGCCAATGATGGTTCCAATGAAGGCATTGTGCACAACAGCTTGCCTTTCTTCAGTGTCCAGTTTCAC
CCAGAGCACCAAGCTGGCCCTTCAGATATGGAACTGCTTTTCGATATCTTTCTGGAAACTGTGAAAGAGGCCACAGCTGGGAACCCTGGG
GGCCAGACAGTTAGAGAGCGGCTGACTGAGCGCCTCTGTCCCCCTGGGATTCCCACTCCCGGCTCTGGACTTCCACCACCACGAAAGGTT
CTGATCCTGGGCTCAGGGGGCCTCTCCATTGGCCAAGCTGGAGAATTTGACTACTCGGGCTCTCAGGCAATTAAGGCCCTGAAGGAGGAA
AACATCCAGACGTTGCTGATCAACCCCAATATTGCCACAGTGCAGACCTCCCAGGGGCTGGCCGACAAGGTCTATTTTCTTCCCATAACA
CCTCATTATGTAACCCAGGTGATACGTAATGAACGCCCCGATGGTGTGTTACTGACTTTTGGGGGCCAGACTGCTCTGAACTGTGGTGTG
GAGCTGACCAAGGCCGGGGTGCTGGCTCGGTATGGGGTCCGGGTCCTGGGCACACCAGTGGAGACCATTGAGCTGACCGAGGATCGACGG
GCCTTTGCTGCCAGAATGGCAGAGATCGGAGAGCATGTGGCCCCGAGCGAGGCAGCAAATTCTCTTGAACAGGCCCAGGCAGCCGCTGAA
CGGCTGGGGTACCCTGTGCTAGTGCGTGCAGCCTTTGCCCTGGGTGGCCTGGGCTCTGGCTTTGCCTCTAACAGGGAGGAGCTCTCTGCT
CTCGTGGCCCCAGCTTTTGCCCATACCAGCCAAGTGCTAGTAGACAAGTCTCTGAAGGGATGGAAGGAGATTGAGTACGAGGTGGTGAGA
GACGCCTATGGCAACTGTGTCACGGTGTGTAACATGGAGAACTTGGACCCACTGGGCATCCACACTGGTGAGTCCATAGTGGTGGCCCCT
AGCCAGACACTGAATGACAGGGAGTATCAGCTCCTGAGGCAGACAGCTATCAAGGTGACCCAGCACCTGGGAATTGTTGGGGAGTGCAAT
GTGCAGTATGCCTTGAACCCTGAGTCTGAGCAGTATTACATCATTGAAGTGAATGCCAGGCTCTCTCGCAGCTCTGCCCTGGCCAGTAAG
GCCACAGGTTATCCACTGGCTTATGTGGCAGCCAAGCTAGCATTGGGCATCCCTTTGCCTGAGCTCAGGAACTCTGTGACAGGGGGTACA
GCAGCCTTTGAACCCAGCGTGGATTATTGTGTGGTGAAGATTCCTCGATGGGACCTTAGCAAGTTCCTGCGAGTCAGCACAAAGATTGGG
AGCTGCATGAAGAGCGTTGGTGAAGTCATGGGCATTGGGCGTTCATTTGAGGAGGCCTTCCAGAAGGCCCTGCGCATGGTGGATGAGAAC
TGTGTGGGCTTTGATCACACAGTGAAACCAGTCAGCGATATGGAGTTGGAGACTCCAACAGATAAGCGGATTTTTGTGGTGGCAGCTGCT
TTGTGGGCTGGTTATTCAGTGGACCGCCTGTATGAGCTCACACGCATCGACCGCTGGTTCCTGCACCGAATGAAGCGTATCATCGCACAT
GCCCAGCTGCTAGAACAACACCGTGGACAGCCTTTGCCGCCAGACCTGCTGCAACAGGCCAAGTGTCTTGGCTTCTCAGACAAACAGATT
GCCCTTGCAGTTCTGAGCACAGAGCTGGCTGTTCGCAAGCTGCGTCAGGAACTGGGGATCTGTCCAGCAGTGAAACAGATTGACACAGTT
GCAGCTGAGTGGCCAGCCCAGACAAATTACCTATACCTAACGTATTGGGGCACCACCCATGACCTCACCTTTCGAACACCTCATGTCCTA
GTCCTTGGCTCTGGCGTCTACCGTATTGGCTCTAGCGTTGAATTTGACTGGTGTGCTGTAGGCTGCATCCAGCAGCTCCGAAAGATGGGA
TATAAGACCATCATGGTGAACTATAACCCAGAGACAGTCAGCACCGACTATGACATGTGTGATCGACTCTACTTTGATGAGATCTCTTTT
GAGGTGGTGATGGACATCTATGAGCTCGAGAACCCTGAAGGTGTGATCCTATCCATGGGTGGACAGCTGCCCAACAACATGGCCATGGCG
TTGCATCGGCAGCAGTGCCGGGTGCTGGGCACCTCCCCTGAAGCCATTGACTCGGCTGAGAACCGTTTCAAGTTTTCCCGGCTCCTTGAC
ACCATTGGTATCAGCCAGCCTCAGTGGAGGGAGCTCAGTGACCTCGAGTCTGCTCGCCAATTCTGCCAGACCGTGGGGTACCCCTGTGTG
GTGCGCCCCTCCTATGTGCTGAGCGGTGCTGCTATGAATGTGGCCTACACGGATGGAGACCTGGAGCGCTTCCTGAGCAGCGCAGCAGCC
GTCTCCAAAGAGCATCCCGTGGTCATCTCCAAGTTCATCCAGGAGGCTAAGGAGATTGACGTGGATGCCGTGGCCTCTGATGGTGTGGTG
GCAGCCATCGCCATCTCTGAGCATGTGGAGAATGCAGGTGTGCATTCAGGTGATGCGACGCTGGTGACCCCCCCACAAGATATCACTGCC
AAAACCCTGGAGCGGATCAAAGCCATTGTGCATGCTGTGGGCCAGGAGCTACAGGTCACAGGACCCTTCAATCTGCAGCTCATTGCCAAG
GATGACCAGCTGAAAGTTATTGAATGCAACGTACGTGTCTCTCGCTCCTTCCCCTTCGTTTCCAAGACACTGGGTGTGGACCTAGTAGCC
TTGGCCACGCGGGTCATCATGGGGGAAGAAGTGGAACCTGTGGGGCTAATGACTGGTTCTGGAGTCGTGGGAGTAAAGGTGCCTCAGTTC
TCCTTCTCCCGCTTGGCGGGTGCTGACGTGGTGTTGGGTGTGGAAATGACCAGTACTGGGGAGGTGGCCGGCTTTGGGGAGAGCCGCTGT
GAGGCATACCTCAAGGCCATGCTAAGCACTGGCTTTAAGATCCCCAAGAAGAATATCCTGCTGACCATTGGCAGCTATAAGAACAAAAGC
GAGCTGCTCCCAACTGTGCGGCTACTGGAGAGCCTGGGCTACAGCCTCTATGCCAGTCTCGGCACAGCTGACTTCTACACTGAGCATGGC
GTCAAGGTAACAGCTGTGGACTGGCACTTTGAGGAGGCTGTGGATGGTGAGTGCCCACCACAGCGGAGCATCCTGGAGCAGCTAGCTGAG
AAAAACTTTGAGCTGGTGATTAACCTGTCAATGCGTGGAGCTGGGGGCCGGCGTCTCTCTTCCTTTGTCACCAAGGGCTACCGCACCCGA
CGCTTGGCCGCTGACTTCTCCGTGCCCCTAATCATCGATATCAAGTGCACCAAACTCTTTGTGGAGGCCCTAGGCCAGATCGGGCCAGCC
CCTCCTTTGAAGGTGCATGTTGACTGTATGACCTCCCAAAAGCTTGTGCGACTGCCGGGATTGATTGATGTCCATGTGCACCTGCGGGAA
CCAGGTGGGACACATAAGGAGGACTTTGCTTCAGGCACAGCCGCTGCCCTGGCTGGGGGTATCACCATGGTGTGTGCCATGCCTAATACC
CGGCCCCCCATCATTGACGCCCCTGCTCTGGCCCTGGCCCAGAAGCTGGCAGAGGCTGGCGCCCGGTGCGACTTTGCGCTATTCCTTGGG
GCCTCGTCTGAAAATGCAGGAACCTTGGGCACCGTGGCCGGGTCTGCAGCCGGGCTGAAGCTTTACCTCAATGAGACCTTCTCTGAGCTG
CGGCTGGACAGCGTGGTCCAGTGGATGGAGCATTTCGAGACATGGCCCTCCCACCTCCCCATTGTGGCTCACGCAGAGCAGCAAACCGTG
GCTGCTGTCCTCATGGTGGCTCAGCTCACTCAGCGCTCAGTGCACATATGTCACGTGGCACGGAAGGAGGAGATCCTGCTAATTAAAGCT
GCAAAGGCACGGGGCTTGCCAGTGACCTGCGAGGTGGCTCCCCACCACCTGTTCCTAAGCCATGATGACCTGGAGCGCCTGGGGCCTGGG
AAGGGGGAGGTCCGGCCTGAGCTTGGCTCCCGCCAGGATGTGGAAGCCCTGTGGGAGAACATGGCTGTCATCGACTGCTTTGCCTCAGAC
CATGCTCCCCATACCTTGGAGGAGAAGTGTGGGTCCAGGCCCCCACCTGGGTTCCCAGGGTTAGAGACCATGCTGCCACTACTCCTGACG
GCTGTAAGCGAGGGCCGGCTCAGCCTGGACGACCTGCTGCAGCGATTGCACCACAATCCTCGGCGCATCTTTCACCTGCCCCCGCAGGAG
GACACCTATGTGGAGGTGGATCTGGAGCATGAGTGGACAATTCCCAGCCACATGCCCTTCTCCAAGGCCCACTGGACACCTTTTGAAGGG
CAGAAAGTGAAGGGCACCGTCCGCCGTGTGGTCCTGCGAGGGGAGGTTGCCTATATCGATGGGCAGGTTCTGGTACCCCCGGGCTATGGA
CAGGATGTACGGAAGTGGCCACAGGGGGCTGTTCCTCAGCTCCCACCCTCAGCCCCTGCCACTAGTGAGATGACCACGACACCTGAAAGA
CCCCGCCGTGGCATCCCAGGGCTTCCTGATGGCCGCTTCCATCTGCCGCCCCGAATCCATCGAGCCTCCGACCCAGGTTTGCCAGTGTAC
CGCCGGAAGCACCAGGAGCTGCAAGCCATGCAGATGGAGCTGCAGAGCCCTGAGTACAAGCTGAGCAAGCTCCGCACCTCGACCATCATG
ACCGACTACAACCCCAACTACTGCTTTGCTGGCAAGACCTCCTCCATCAGTGACCTGAAGGAGGTGCCGCGGAAAAACATCACCCTCATT
CGGGGTCTGGGCCATGGCGCCTTTGGGGAGGTGTATGAAGGCCAGGTGTCCGGAATGCCCAACGACCCAAGCCCCCTGCAAGTGGCTGTG
AAGACGCTGCCTGAAGTGTGCTCTGAACAGGACGAACTGGATTTCCTCATGGAAGCCCTGATCATCAGCAAATTCAACCACCAGAACATT
GTTCGCTGCATTGGGGTGAGCCTGCAATCCCTGCCCCGGTTCATCCTGCTGGAGCTCATGGCGGGGGGAGACCTCAAGTCCTTCCTCCGA
GAGACCCGCCCTCGCCCGAGCCAGCCCTCCTCCCTGGCCATGCTGGACCTTCTGCACGTGGCTCGGGACATTGCCTGTGGCTGTCAGTAT
TTGGAGGAAAACCACTTCATCCACCGAGACATTGCTGCCAGAAACTGCCTCTTGACCTGTCCAGGCCCTGGAAGAGTGGCCAAGATTGGA
GACTTCGGGATGGCCCGAGACATCTACAGGGCGAGCTACTATAGAAAGGGAGGCTGTGCCATGCTGCCAGTTAAGTGGATGCCCCCAGAG
GCCTTCATGGAAGGAATATTCACTTCTAAAACAGACACATGGTCCTTTGGAGTGCTGCTATGGGAAATCTTTTCTCTTGGATATATGCCA
TACCCCAGCAAAAGCAACCAGGAAGTTCTGGAGTTTGTCACCAGTGGAGGCCGGATGGACCCACCCAAGAACTGCCCTGGGCCTGTATAC
CGGATAATGACTCAGTGCTGGCAACATCAGCCTGAAGACAGGCCCAACTTTGCCATCATTTTGGAGAGGATTGAATACTGCACCCAGGAC
CCGGATGTAATCAACACCGCTTTGCCGATAGAATATGGTCCACTTGTGGAAGAGGAAGAGAAAGTGCCTGTGAGGCCCAAGGACCCTGAG
GGGGTTCCTCCTCTCCTGGTCTCTCAACAGGCAAAACGGGAGGAGGAGCGCAGCCCAGCTGCCCCACCACCTCTGCCTACCACCTCCTCT
GGCAAGGCTGCAAAGAAACCCACAGCTGCAGAGATCTCTGTTCGAGTCCCTAGAGGGCCGGCCGTGGAAGGGGGACACGTGAATATGGCA
TTCTCTCAGTCCAACCCTCCTTCGGAGTTGCACAAGGTCCACGGATCCAGAAACAAGCCCACCAGCTTGTGGAACCCAACGTACGGCTCC
TGGTTTACAGAGAAACCCACCAAAAAGAATAATCCTATAGCAAAGAAGGAGCCACACGACAGGGGTAACCTGGGGCTGGAGGGAAGCTGT
ACTGTCCCACCTAACGTTGCAACTGGGAGACTTCCGGGGGCCTCACTGCTCCTAGAGCCCTCTTCGCTGACTGCCAATATGAAGGAGGTA
CCTCTGTTCAGGCTACGTCACTTCCCTTGTGGGAATGTCAATTACGGCTACCAGCAACAGGGCTTGCCCTTAGAAGCCGCTACTGCCCCT
GGAGCTGGTCATTACGAGGATACCATTCTGAAAAGCAAGAATAGCATGAACCAGCCTGGGCCCTGAGCTCGGTCGCACACTCACTTCTCT
TCCTTGGGATCCCTAAGACCGTGGAGGAGAGAGAGGCAATGGCTCCTTCACAAACCAGAGACCAAATGTCACGTTTTGTTTTGTGCCAAC
CTATTTTGAAGTACCACCAAAAAAGCTGTATTTTGAAAATGCTTTAGAAAGGTTTTGAGCATGGGTTCATCCTATTCTTTCGAAAGAAGA
AAATATCATAAAAATGAGTGATAAATACAAGGCCCAGATGTGGTTGCATAAGGTTTTTATGCATGTTTGTTGTATACTTCCTTATGCTTC
TTTCAAATTGTGTGTGCTCTGCTTCAATGTAGTCAGAATTAGCTGCTTCTATGTTTCATAGTTGGGGTCATAGATGTTTCCTTGCCTTGT

>12437_12437_1_CAD-ALK_CAD_chr2_27463229_ENST00000264705_ALK_chr2_29446394_ENST00000389048_length(amino acids)=2432AA_BP=1869
MSSLPMAALVLEDGSVLRGQPFGAAVSTAGEVVFQTGMVGYPEALTDPSYKAQILVLTYPLIGNYGIPPDEMDEFGLCKWFESSGIHVAA
LVVGECCPTPSHWSATRTLHEWLQQHGIPGLQGVDTRELTKKLREQGSLLGKLVQNGTEPSSLPFLDPNARPLVPEVSIKTPRVFNTGGA
PRILALDCGLKYNQIRCLCQRGAEVTVVPWDHALDSQEYEGLFLSNGPGDPASYPSVVSTLSRVLSEPNPRPVFGICLGHQLLALAIGAK
TYKMRYGNRGHNQPCLLVGSGRCFLTSQNHGFAVETDSLPADWAPLFTNANDGSNEGIVHNSLPFFSVQFHPEHQAGPSDMELLFDIFLE
TVKEATAGNPGGQTVRERLTERLCPPGIPTPGSGLPPPRKVLILGSGGLSIGQAGEFDYSGSQAIKALKEENIQTLLINPNIATVQTSQG
LADKVYFLPITPHYVTQVIRNERPDGVLLTFGGQTALNCGVELTKAGVLARYGVRVLGTPVETIELTEDRRAFAARMAEIGEHVAPSEAA
NSLEQAQAAAERLGYPVLVRAAFALGGLGSGFASNREELSALVAPAFAHTSQVLVDKSLKGWKEIEYEVVRDAYGNCVTVCNMENLDPLG
IHTGESIVVAPSQTLNDREYQLLRQTAIKVTQHLGIVGECNVQYALNPESEQYYIIEVNARLSRSSALASKATGYPLAYVAAKLALGIPL
PELRNSVTGGTAAFEPSVDYCVVKIPRWDLSKFLRVSTKIGSCMKSVGEVMGIGRSFEEAFQKALRMVDENCVGFDHTVKPVSDMELETP
TDKRIFVVAAALWAGYSVDRLYELTRIDRWFLHRMKRIIAHAQLLEQHRGQPLPPDLLQQAKCLGFSDKQIALAVLSTELAVRKLRQELG
ICPAVKQIDTVAAEWPAQTNYLYLTYWGTTHDLTFRTPHVLVLGSGVYRIGSSVEFDWCAVGCIQQLRKMGYKTIMVNYNPETVSTDYDM
CDRLYFDEISFEVVMDIYELENPEGVILSMGGQLPNNMAMALHRQQCRVLGTSPEAIDSAENRFKFSRLLDTIGISQPQWRELSDLESAR
QFCQTVGYPCVVRPSYVLSGAAMNVAYTDGDLERFLSSAAAVSKEHPVVISKFIQEAKEIDVDAVASDGVVAAIAISEHVENAGVHSGDA
TLVTPPQDITAKTLERIKAIVHAVGQELQVTGPFNLQLIAKDDQLKVIECNVRVSRSFPFVSKTLGVDLVALATRVIMGEEVEPVGLMTG
SGVVGVKVPQFSFSRLAGADVVLGVEMTSTGEVAGFGESRCEAYLKAMLSTGFKIPKKNILLTIGSYKNKSELLPTVRLLESLGYSLYAS
LGTADFYTEHGVKVTAVDWHFEEAVDGECPPQRSILEQLAEKNFELVINLSMRGAGGRRLSSFVTKGYRTRRLAADFSVPLIIDIKCTKL
FVEALGQIGPAPPLKVHVDCMTSQKLVRLPGLIDVHVHLREPGGTHKEDFASGTAAALAGGITMVCAMPNTRPPIIDAPALALAQKLAEA
GARCDFALFLGASSENAGTLGTVAGSAAGLKLYLNETFSELRLDSVVQWMEHFETWPSHLPIVAHAEQQTVAAVLMVAQLTQRSVHICHV
ARKEEILLIKAAKARGLPVTCEVAPHHLFLSHDDLERLGPGKGEVRPELGSRQDVEALWENMAVIDCFASDHAPHTLEEKCGSRPPPGFP
GLETMLPLLLTAVSEGRLSLDDLLQRLHHNPRRIFHLPPQEDTYVEVDLEHEWTIPSHMPFSKAHWTPFEGQKVKGTVRRVVLRGEVAYI
DGQVLVPPGYGQDVRKWPQGAVPQLPPSAPATSEMTTTPERPRRGIPGLPDGRFHLPPRIHRASDPGLPVYRRKHQELQAMQMELQSPEY
KLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDELDFLMEA
LIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARNCLLT
CPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTSGGRM
DPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREEERSP
AAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAKKEPH
DRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTANMKEVPLFRLRHFPCGNVNYGYQQQGLPLEAATAPGAGHYEDTILKSKNSMNQP

--------------------------------------------------------------
>12437_12437_2_CAD-ALK_CAD_chr2_27463229_ENST00000403525_ALK_chr2_29446394_ENST00000389048_length(transcript)=7689nt_BP=5548nt
CGCTGCCGCGCCCGGCTTCTCTCCAGCGCCCCGCGCCGTTAGCCACGTGGACCGACTCCGGCGCGCCGTCCTCACGTGGTTCCAGTGGAG
TTTGCAGTCCTTCCCGCTTCTCCGTACTCGCCCCCGCCTCTGAGCTCCCTTCCCATGGCGGCCCTAGTGTTGGAGGACGGGTCGGTCCTG
CGGGGCCAGCCCTTTGGGGCCGCCGTGTCGACTGCCGGGGAAGTGGTGTTTCAAACCGGCATGGTCGGCTACCCCGAGGCCCTCACCGAT
CCCTCCTACAAGGCACAGATCTTAGTGCTCACCTATCCTCTGATCGGCAACTATGGCATCCCCCCAGATGAAATGGATGAGTTCGGTCTC
TGCAAGTGGTTTGAATCCTCGGGCATCCACGTAGCAGCACTGGTAGTGGGAGAGTGCTGTCCTACTCCCAGCCACTGGAGTGCCACCCGC
ACCCTGCATGAGTGGCTGCAGCAGCATGGCATCCCTGGCTTGCAAGGAGTAGACACTCGGGAGCTGACCAAGAAGTTGCGGGAACAGGGG
TCTCTGCTGGGGAAGCTGGTCCAGAATGGAACAGAACCTTCATCCCTGCCATTCTTGGACCCCAATGCCCGCCCCCTGGTACCAGAGGTC
TCCATTAAGACTCCACGGGTATTCAATACAGGGGGTGCCCCTCGGATCCTTGCTTTGGACTGTGGCCTCAAGTATAATCAGATCCGATGC
CTCTGCCAGCGTGGGGCTGAGGTCACTGTGGTACCCTGGGACCATGCACTAGACAGCCAAGAGTATGAGGGTCTCTTCTTAAGTAATGGG
CCTGGTGACCCTGCCTCCTATCCCAGTGTCGTATCCACACTGAGCCGTGTTTTATCTGAGCCTAATCCCCGACCTGTCTTTGGGATCTGC
CTGGGACACCAGCTATTGGCCTTAGCCATTGGGGCCAAGACTTACAAGATGAGATATGGGAACCGAGGCCATAACCAGCCCTGCTTGTTG
GTGGGCTCTGGGCGCTGCTTTCTGACATCCCAGAACCATGGGTTTGCTGTGGAGACAGACTCACTGCCAGCAGACTGGGCTCCTCTCTTC
ACCAACGCCAATGATGGTTCCAATGAAGGCATTGTGCACAACAGCTTGCCTTTCTTCAGTGTCCAGTTTCACCCAGAGCACCAAGCTGGC
CCTTCAGATATGGAACTGCTTTTCGATATCTTTCTGGAAACTGTGAAAGAGGCCACAGCTGGGAACCCTGGGGGCCAGACAGTTAGAGAG
CGGCTGACTGAGCGCCTCTGTCCCCCTGGGATTCCCACTCCCGGCTCTGGACTTCCACCACCACGAAAGGTTCTGATCCTGGGCTCAGGG
GGCCTCTCCATTGGCCAAGCTGGAGAATTTGACTACTCGGGCTCTCAGGCAATTAAGGCCCTGAAGGAGGAAAACATCCAGACGTTGCTG
ATCAACCCCAATATTGCCACAGTGCAGACCTCCCAGGGGCTGGCCGACAAGGTCTATTTTCTTCCCATAACACCTCATTATGTAACCCAG
GTGATACGTAATGAACGCCCCGATGGTGTGTTACTGACTTTTGGGGGCCAGACTGCTCTGAACTGTGGTGTGGAGCTGACCAAGGCCGGG
GTGCTGGCTCGGTATGGGGTCCGGGTCCTGGGCACACCAGTGGAGACCATTGAGCTGACCGAGGATCGACGGGCCTTTGCTGCCAGAATG
GCAGAGATCGGAGAGCATGTGGCCCCGAGCGAGGCAGCAAATTCTCTTGAACAGGCCCAGGCAGCCGCTGAACGGCTGGGGTACCCTGTG
CTAGTGCGTGCAGCCTTTGCCCTGGGTGGCCTGGGCTCTGGCTTTGCCTCTAACAGGGAGGAGCTCTCTGCTCTCGTGGCCCCAGCTTTT
GCCCATACCAGCCAAGTGCTAGTAGACAAGTCTCTGAAGGGATGGAAGGAGATTGAGTACGAGGTGGTGAGAGACGCCTATGGCAACTGT
GTCACGTATTACATCATTGAAGTGAATGCCAGGCTCTCTCGCAGCTCTGCCCTGGCCAGTAAGGCCACAGGTTATCCACTGGCTTATGTG
GCAGCCAAGCTAGCATTGGGCATCCCTTTGCCTGAGCTCAGGAACTCTGTGACAGGGGGTACAGCAGCCTTTGAACCCAGCGTGGATTAT
TGTGTGGTGAAGATTCCTCGATGGGACCTTAGCAAGTTCCTGCGAGTCAGCACAAAGATTGGGAGCTGCATGAAGAGCGTTGGTGAAGTC
ATGGGCATTGGGCGTTCATTTGAGGAGGCCTTCCAGAAGGCCCTGCGCATGGTGGATGAGAACTGTGTGGGCTTTGATCACACAGTGAAA
CCAGTCAGCGATATGGAGTTGGAGACTCCAACAGATAAGCGGATTTTTGTGGTGGCAGCTGCTTTGTGGGCTGGTTATTCAGTGGACCGC
CTGTATGAGCTCACACGCATCGACCGCTGGTTCCTGCACCGAATGAAGCGTATCATCGCACATGCCCAGCTGCTAGAACAACACCGTGGA
CAGCCTTTGCCGCCAGACCTGCTGCAACAGGCCAAGTGTCTTGGCTTCTCAGACAAACAGATTGCCCTTGCAGTTCTGAGCACAGAGCTG
GCTGTTCGCAAGCTGCGTCAGGAACTGGGGATCTGTCCAGCAGTGAAACAGATTGACACAGTTGCAGCTGAGTGGCCAGCCCAGACAAAT
TACCTATACCTAACGTATTGGGGCACCACCCATGACCTCACCTTTCGAACACCTCATGTCCTAGTCCTTGGCTCTGGCGTCTACCGTATT
GGCTCTAGCGTTGAATTTGACTGGTGTGCTGTAGGCTGCATCCAGCAGCTCCGAAAGATGGGATATAAGACCATCATGGTGAACTATAAC
CCAGAGACAGTCAGCACCGACTATGACATGTGTGATCGACTCTACTTTGATGAGATCTCTTTTGAGGTGGTGATGGACATCTATGAGCTC
GAGAACCCTGAAGGTGTGATCCTATCCATGGGTGGACAGCTGCCCAACAACATGGCCATGGCGTTGCATCGGCAGCAGTGCCGGGTGCTG
GGCACCTCCCCTGAAGCCATTGACTCGGCTGAGAACCGTTTCAAGTTTTCCCGGCTCCTTGACACCATTGGTATCAGCCAGCCTCAGTGG
AGGGAGCTCAGTGACCTCGAGTCTGCTCGCCAATTCTGCCAGACCGTGGGGTACCCCTGTGTGGTGCGCCCCTCCTATGTGCTGAGCGGT
GCTGCTATGAATGTGGCCTACACGGATGGAGACCTGGAGCGCTTCCTGAGCAGCGCAGCAGCCGTCTCCAAAGAGCATCCCGTGGTCATC
TCCAAGTTCATCCAGGAGGCTAAGGAGATTGACGTGGATGCCGTGGCCTCTGATGGTGTGGTGGCAGCCATCGCCATCTCTGAGCATGTG
GAGAATGCAGGTGTGCATTCAGGTGATGCGACGCTGGTGACCCCCCCACAAGATATCACTGCCAAAACCCTGGAGCGGATCAAAGCCATT
GTGCATGCTGTGGGCCAGGAGCTACAGGTCACAGGACCCTTCAATCTGCAGCTCATTGCCAAGGATGACCAGCTGAAAGTTATTGAATGC
AACGTACGTGTCTCTCGCTCCTTCCCCTTCGTTTCCAAGACACTGGGTGTGGACCTAGTAGCCTTGGCCACGCGGGTCATCATGGGGGAA
GAAGTGGAACCTGTGGGGCTAATGACTGGTTCTGGAGTCGTGGGAGTAAAGGTGCCTCAGTTCTCCTTCTCCCGCTTGGCGGGTGCTGAC
GTGGTGTTGGGTGTGGAAATGACCAGTACTGGGGAGGTGGCCGGCTTTGGGGAGAGCCGCTGTGAGGCATACCTCAAGGCCATGCTAAGC
ACTGGCTTTAAGATCCCCAAGAAGAATATCCTGCTGACCATTGGCAGCTATAAGAACAAAAGCGAGCTGCTCCCAACTGTGCGGCTACTG
GAGAGCCTGGGCTACAGCCTCTATGCCAGTCTCGGCACAGCTGACTTCTACACTGAGCATGGCGTCAAGGTAACAGCTGTGGACTGGCAC
TTTGAGGAGGCTGTGGATGGTGAGTGCCCACCACAGCGGAGCATCCTGGAGCAGCTAGCTGAGAAAAACTTTGAGCTGGTGATTAACCTG
TCAATGCGTGGAGCTGGGGGCCGGCGTCTCTCTTCCTTTGTCACCAAGGGCTACCGCACCCGACGCTTGGCCGCTGACTTCTCCGTGCCC
CTAATCATCGATATCAAGTGCACCAAACTCTTTGTGGAGGCCCTAGGCCAGATCGGGCCAGCCCCTCCTTTGAAGGTGCATGTTGACTGT
ATGACCTCCCAAAAGCTTGTGCGACTGCCGGGATTGATTGATGTCCATGTGCACCTGCGGGAACCAGGTGGGACACATAAGGAGGACTTT
GCTTCAGGCACAGCCGCTGCCCTGGCTGGGGGTATCACCATGGTGTGTGCCATGCCTAATACCCGGCCCCCCATCATTGACGCCCCTGCT
CTGGCCCTGGCCCAGAAGCTGGCAGAGGCTGGCGCCCGGTGCGACTTTGCGCTATTCCTTGGGGCCTCGTCTGAAAATGCAGGAACCTTG
GGCACCGTGGCCGGGTCTGCAGCCGGGCTGAAGCTTTACCTCAATGAGACCTTCTCTGAGCTGCGGCTGGACAGCGTGGTCCAGTGGATG
GAGCATTTCGAGACATGGCCCTCCCACCTCCCCATTGTGGCTCACGCAGAGCAGCAAACCGTGGCTGCTGTCCTCATGGTGGCTCAGCTC
ACTCAGCGCTCAGTGCACATATGTCACGTGGCACGGAAGGAGGAGATCCTGCTAATTAAAGCTGCAAAGGCACGGGGCTTGCCAGTGACC
TGCGAGGTGGCTCCCCACCACCTGTTCCTAAGCCATGATGACCTGGAGCGCCTGGGGCCTGGGAAGGGGGAGGTCCGGCCTGAGCTTGGC
TCCCGCCAGGATGTGGAAGCCCTGTGGGAGAACATGGCTGTCATCGACTGCTTTGCCTCAGACCATGCTCCCCATACCTTGGAGGAGAAG
TGTGGGTCCAGGCCCCCACCTGGGTTCCCAGGGTTAGAGACCATGCTGCCACTACTCCTGACGGCTGTAAGCGAGGGCCGGCTCAGCCTG
GACGACCTGCTGCAGCGATTGCACCACAATCCTCGGCGCATCTTTCACCTGCCCCCGCAGGAGGACACCTATGTGGAGGTGGATCTGGAG
CATGAGTGGACAATTCCCAGCCACATGCCCTTCTCCAAGGCCCACTGGACACCTTTTGAAGGGCAGAAAGTGAAGGGCACCGTCCGCCGT
GTGGTCCTGCGAGGGGAGGTTGCCTATATCGATGGGCAGGTTCTGGTACCCCCGGGCTATGGACAGGATGTACGGAAGTGGCCACAGGGG
GCTGTTCCTCAGCTCCCACCCTCAGCCCCTGCCACTAGTGAGATGACCACGACACCTGAAAGACCCCGCCGTGGCATCCCAGGGCTTCCT
GATGGCCGCTTCCATCTGCCGCCCCGAATCCATCGAGCCTCCGACCCAGGTTTGCCAGTGTACCGCCGGAAGCACCAGGAGCTGCAAGCC
ATGCAGATGGAGCTGCAGAGCCCTGAGTACAAGCTGAGCAAGCTCCGCACCTCGACCATCATGACCGACTACAACCCCAACTACTGCTTT
GCTGGCAAGACCTCCTCCATCAGTGACCTGAAGGAGGTGCCGCGGAAAAACATCACCCTCATTCGGGGTCTGGGCCATGGCGCCTTTGGG
GAGGTGTATGAAGGCCAGGTGTCCGGAATGCCCAACGACCCAAGCCCCCTGCAAGTGGCTGTGAAGACGCTGCCTGAAGTGTGCTCTGAA
CAGGACGAACTGGATTTCCTCATGGAAGCCCTGATCATCAGCAAATTCAACCACCAGAACATTGTTCGCTGCATTGGGGTGAGCCTGCAA
TCCCTGCCCCGGTTCATCCTGCTGGAGCTCATGGCGGGGGGAGACCTCAAGTCCTTCCTCCGAGAGACCCGCCCTCGCCCGAGCCAGCCC
TCCTCCCTGGCCATGCTGGACCTTCTGCACGTGGCTCGGGACATTGCCTGTGGCTGTCAGTATTTGGAGGAAAACCACTTCATCCACCGA
GACATTGCTGCCAGAAACTGCCTCTTGACCTGTCCAGGCCCTGGAAGAGTGGCCAAGATTGGAGACTTCGGGATGGCCCGAGACATCTAC
AGGGCGAGCTACTATAGAAAGGGAGGCTGTGCCATGCTGCCAGTTAAGTGGATGCCCCCAGAGGCCTTCATGGAAGGAATATTCACTTCT
AAAACAGACACATGGTCCTTTGGAGTGCTGCTATGGGAAATCTTTTCTCTTGGATATATGCCATACCCCAGCAAAAGCAACCAGGAAGTT
CTGGAGTTTGTCACCAGTGGAGGCCGGATGGACCCACCCAAGAACTGCCCTGGGCCTGTATACCGGATAATGACTCAGTGCTGGCAACAT
CAGCCTGAAGACAGGCCCAACTTTGCCATCATTTTGGAGAGGATTGAATACTGCACCCAGGACCCGGATGTAATCAACACCGCTTTGCCG
ATAGAATATGGTCCACTTGTGGAAGAGGAAGAGAAAGTGCCTGTGAGGCCCAAGGACCCTGAGGGGGTTCCTCCTCTCCTGGTCTCTCAA
CAGGCAAAACGGGAGGAGGAGCGCAGCCCAGCTGCCCCACCACCTCTGCCTACCACCTCCTCTGGCAAGGCTGCAAAGAAACCCACAGCT
GCAGAGATCTCTGTTCGAGTCCCTAGAGGGCCGGCCGTGGAAGGGGGACACGTGAATATGGCATTCTCTCAGTCCAACCCTCCTTCGGAG
TTGCACAAGGTCCACGGATCCAGAAACAAGCCCACCAGCTTGTGGAACCCAACGTACGGCTCCTGGTTTACAGAGAAACCCACCAAAAAG
AATAATCCTATAGCAAAGAAGGAGCCACACGACAGGGGTAACCTGGGGCTGGAGGGAAGCTGTACTGTCCCACCTAACGTTGCAACTGGG
AGACTTCCGGGGGCCTCACTGCTCCTAGAGCCCTCTTCGCTGACTGCCAATATGAAGGAGGTACCTCTGTTCAGGCTACGTCACTTCCCT
TGTGGGAATGTCAATTACGGCTACCAGCAACAGGGCTTGCCCTTAGAAGCCGCTACTGCCCCTGGAGCTGGTCATTACGAGGATACCATT
CTGAAAAGCAAGAATAGCATGAACCAGCCTGGGCCCTGAGCTCGGTCGCACACTCACTTCTCTTCCTTGGGATCCCTAAGACCGTGGAGG
AGAGAGAGGCAATGGCTCCTTCACAAACCAGAGACCAAATGTCACGTTTTGTTTTGTGCCAACCTATTTTGAAGTACCACCAAAAAAGCT
GTATTTTGAAAATGCTTTAGAAAGGTTTTGAGCATGGGTTCATCCTATTCTTTCGAAAGAAGAAAATATCATAAAAATGAGTGATAAATA
CAAGGCCCAGATGTGGTTGCATAAGGTTTTTATGCATGTTTGTTGTATACTTCCTTATGCTTCTTTCAAATTGTGTGTGCTCTGCTTCAA
TGTAGTCAGAATTAGCTGCTTCTATGTTTCATAGTTGGGGTCATAGATGTTTCCTTGCCTTGTTGATGTGGACATGAGCCATTTGAGGGG

>12437_12437_2_CAD-ALK_CAD_chr2_27463229_ENST00000403525_ALK_chr2_29446394_ENST00000389048_length(amino acids)=2369AA_BP=1806
MSSLPMAALVLEDGSVLRGQPFGAAVSTAGEVVFQTGMVGYPEALTDPSYKAQILVLTYPLIGNYGIPPDEMDEFGLCKWFESSGIHVAA
LVVGECCPTPSHWSATRTLHEWLQQHGIPGLQGVDTRELTKKLREQGSLLGKLVQNGTEPSSLPFLDPNARPLVPEVSIKTPRVFNTGGA
PRILALDCGLKYNQIRCLCQRGAEVTVVPWDHALDSQEYEGLFLSNGPGDPASYPSVVSTLSRVLSEPNPRPVFGICLGHQLLALAIGAK
TYKMRYGNRGHNQPCLLVGSGRCFLTSQNHGFAVETDSLPADWAPLFTNANDGSNEGIVHNSLPFFSVQFHPEHQAGPSDMELLFDIFLE
TVKEATAGNPGGQTVRERLTERLCPPGIPTPGSGLPPPRKVLILGSGGLSIGQAGEFDYSGSQAIKALKEENIQTLLINPNIATVQTSQG
LADKVYFLPITPHYVTQVIRNERPDGVLLTFGGQTALNCGVELTKAGVLARYGVRVLGTPVETIELTEDRRAFAARMAEIGEHVAPSEAA
NSLEQAQAAAERLGYPVLVRAAFALGGLGSGFASNREELSALVAPAFAHTSQVLVDKSLKGWKEIEYEVVRDAYGNCVTYYIIEVNARLS
RSSALASKATGYPLAYVAAKLALGIPLPELRNSVTGGTAAFEPSVDYCVVKIPRWDLSKFLRVSTKIGSCMKSVGEVMGIGRSFEEAFQK
ALRMVDENCVGFDHTVKPVSDMELETPTDKRIFVVAAALWAGYSVDRLYELTRIDRWFLHRMKRIIAHAQLLEQHRGQPLPPDLLQQAKC
LGFSDKQIALAVLSTELAVRKLRQELGICPAVKQIDTVAAEWPAQTNYLYLTYWGTTHDLTFRTPHVLVLGSGVYRIGSSVEFDWCAVGC
IQQLRKMGYKTIMVNYNPETVSTDYDMCDRLYFDEISFEVVMDIYELENPEGVILSMGGQLPNNMAMALHRQQCRVLGTSPEAIDSAENR
FKFSRLLDTIGISQPQWRELSDLESARQFCQTVGYPCVVRPSYVLSGAAMNVAYTDGDLERFLSSAAAVSKEHPVVISKFIQEAKEIDVD
AVASDGVVAAIAISEHVENAGVHSGDATLVTPPQDITAKTLERIKAIVHAVGQELQVTGPFNLQLIAKDDQLKVIECNVRVSRSFPFVSK
TLGVDLVALATRVIMGEEVEPVGLMTGSGVVGVKVPQFSFSRLAGADVVLGVEMTSTGEVAGFGESRCEAYLKAMLSTGFKIPKKNILLT
IGSYKNKSELLPTVRLLESLGYSLYASLGTADFYTEHGVKVTAVDWHFEEAVDGECPPQRSILEQLAEKNFELVINLSMRGAGGRRLSSF
VTKGYRTRRLAADFSVPLIIDIKCTKLFVEALGQIGPAPPLKVHVDCMTSQKLVRLPGLIDVHVHLREPGGTHKEDFASGTAAALAGGIT
MVCAMPNTRPPIIDAPALALAQKLAEAGARCDFALFLGASSENAGTLGTVAGSAAGLKLYLNETFSELRLDSVVQWMEHFETWPSHLPIV
AHAEQQTVAAVLMVAQLTQRSVHICHVARKEEILLIKAAKARGLPVTCEVAPHHLFLSHDDLERLGPGKGEVRPELGSRQDVEALWENMA
VIDCFASDHAPHTLEEKCGSRPPPGFPGLETMLPLLLTAVSEGRLSLDDLLQRLHHNPRRIFHLPPQEDTYVEVDLEHEWTIPSHMPFSK
AHWTPFEGQKVKGTVRRVVLRGEVAYIDGQVLVPPGYGQDVRKWPQGAVPQLPPSAPATSEMTTTPERPRRGIPGLPDGRFHLPPRIHRA
SDPGLPVYRRKHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPND
PSPLQVAVKTLPEVCSEQDELDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVAR
DIACGCQYLEENHFIHRDIAARNCLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWE
IFSLGYMPYPSKSNQEVLEFVTSGGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEKV
PVRPKDPEGVPPLLVSQQAKREEERSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTS
LWNPTYGSWFTEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTANMKEVPLFRLRHFPCGNVNYGYQQQGL

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for CAD-ALK


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for CAD-ALK


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status
TgeneALKQ9UM73DB08865CrizotinibInhibitorSmall moleculeApproved
TgeneALKQ9UM73DB09063CeritinibAntagonistSmall moleculeApproved
TgeneALKQ9UM73DB11363AlectinibInhibitorSmall moleculeApproved|Investigational
TgeneALKQ9UM73DB12010FostamatinibInhibitorSmall moleculeApproved|Investigational
TgeneALKQ9UM73DB12130LorlatinibInhibitorSmall moleculeApproved|Investigational
TgeneALKQ9UM73DB12141GilteritinibInhibitorSmall moleculeApproved|Investigational
TgeneALKQ9UM73DB12267BrigatinibInhibitorSmall moleculeApproved|Investigational

Top

Related Diseases for CAD-ALK


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource
HgeneCADC4225320EPILEPTIC ENCEPHALOPATHY, EARLY INFANTILE, 504CTD_human;GENOMICS_ENGLAND;ORPHANET;UNIPROT
HgeneCADC0002875Cooley's anemia1CTD_human
HgeneCADC0002895Anemia, Sickle Cell1CTD_human
HgeneCADC0005283beta Thalassemia1CTD_human
HgeneCADC0019025Hemoglobin F Disease1CTD_human
HgeneCADC0023440Acute Erythroblastic Leukemia1CTD_human
HgeneCADC0037889Hereditary spherocytosis1CTD_human
HgeneCADC0085578Thalassemia Minor1CTD_human
HgeneCADC0271979Thalassemia Intermedia1CTD_human
HgeneCADC0949116Congenital hypoplastic anemia1CTD_human
TgeneC0007131Non-Small Cell Lung Carcinoma28CGI;CTD_human
TgeneC0027819Neuroblastoma13CGI;CTD_human;ORPHANET
TgeneC0152013Adenocarcinoma of lung (disorder)8CGI;CTD_human
TgeneC2751681NEUROBLASTOMA, SUSCEPTIBILITY TO, 38CLINGEN;UNIPROT
TgeneC0206180Ki-1+ Anaplastic Large Cell Lymphoma6CGI;CTD_human
TgeneC0334121Inflammatory Myofibroblastic Tumor4CGI;CTD_human;ORPHANET
TgeneC0018199Granuloma, Plasma Cell3CTD_human
TgeneC0007621Neoplastic Cell Transformation2CTD_human
TgeneC0027627Neoplasm Metastasis2CTD_human
TgeneC0238463Papillary thyroid carcinoma2ORPHANET
TgeneC0001973Alcoholic Intoxication, Chronic1PSYGENET
TgeneC0006118Brain Neoplasms1CGI;CTD_human
TgeneC0006142Malignant neoplasm of breast1CTD_human
TgeneC0007134Renal Cell Carcinoma1CTD_human
TgeneC0011570Mental Depression1PSYGENET
TgeneC0011581Depressive disorder1PSYGENET
TgeneC0027643Neoplasm Recurrence, Local1CTD_human
TgeneC0036341Schizophrenia1PSYGENET
TgeneC0079744Diffuse Large B-Cell Lymphoma1CTD_human
TgeneC0085269Plasma Cell Granuloma, Pulmonary1CTD_human
TgeneC0153633Malignant neoplasm of brain1CGI;CTD_human
TgeneC0278601Inflammatory Breast Carcinoma1CTD_human
TgeneC0279702Conventional (Clear Cell) Renal Cell Carcinoma1CTD_human
TgeneC0496899Benign neoplasm of brain, unspecified1CTD_human
TgeneC0678222Breast Carcinoma1CTD_human
TgeneC0750974Brain Tumor, Primary1CTD_human
TgeneC0750977Recurrent Brain Neoplasm1CTD_human
TgeneC0750979Primary malignant neoplasm of brain1CTD_human
TgeneC1257931Mammary Neoplasms, Human1CTD_human
TgeneC1266042Chromophobe Renal Cell Carcinoma1CTD_human
TgeneC1266043Sarcomatoid Renal Cell Carcinoma1CTD_human
TgeneC1266044Collecting Duct Carcinoma of the Kidney1CTD_human
TgeneC1306837Papillary Renal Cell Carcinoma1CTD_human
TgeneC1332079Anaplastic Large Cell Lymphoma, ALK-Positive1ORPHANET
TgeneC1458155Mammary Neoplasms1CTD_human
TgeneC1527390Neoplasms, Intracranial1CTD_human
TgeneC2931189Neural crest tumor1ORPHANET
TgeneC3899155hereditary neuroblastoma1GENOMICS_ENGLAND
TgeneC4704874Mammary Carcinoma, Human1CTD_human