Fusion Protein: SEC31A-ALK

Fusion Protein Summary

Fusion gene summary

Fusion partner gene information
Fusion gene name: SEC31A-ALK
FusionPDB ID: 80098
FusionGDB2.0 ID: 80098

Hgene
Gene symbol: SEC31A
Gene ID: 22872
Gene name: SEC31 homolog A, COPII coat complex component
Synonyms: ABP125|ABP130|HSPC275|HSPC334|NEDSOSB|SEC31L1
Cytomap: 4q21.22
Type of gene: protein-coding
Description: protein transport protein Sec31A; SEC31 homolog A, COPII coating complex component; SEC31-like protein 1; SEC31-related protein A; web1-like protein; yeast Sec31p homolog
Modification date: 20200313
UniProtAcc: O94979

Tgene
Gene symbol: ALK
Gene ID: 238
Gene name: ALK receptor tyrosine kinase
Synonyms: CD246|NBLST3
Cytomap: 2p23.2-p23.1
Type of gene: protein-coding
Description: ALK tyrosine kinase receptor; CD246 antigen; anaplastic lymphoma receptor tyrosine kinase; mutant anaplastic lymphoma kinase
Modification date: 20200329
UniProtAcc: Q96BT7

Ensembl transcripts involved in fusion gene (ENST ids, in the order listed on the source page):
ENST00000264405, ENST00000311785,
ENST00000326950, ENST00000348405,
ENST00000355196, ENST00000395310,
ENST00000432794, ENST00000443462,
ENST00000448323, ENST00000500777,
ENST00000505472, ENST00000505984,
ENST00000508479, ENST00000508502,
ENST00000509142, ENST00000513858,
ENST00000436790,
ENST00000431873,
ENST00000498037, ENST00000389048

Fusion gene scores for assessment (based on all fusion genes of FusionGDB 2.0)

Hgene (SEC31A)
* DoF score: 22 X 23 X 10 = 5060
# samples: 25
** MAII score: log2(25/5060*10) = -4.33913738491959
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs): DoF>8 and MAII<0

Tgene (ALK)
* DoF score: 56 X 74 X 20 = 82880
# samples: 57
** MAII score: log2(57/82880*10) = -7.18391827352181
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs): DoF>8 and MAII<0
Context (manual curation of fusion genes in FusionPDB)

PubMed: SEC31A [Title/Abstract] AND ALK [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpoint (based on all fusion genes of FusionGDB 2.0): SEC31A(83765539)-ALK(29446394), # samples: 1

Anticipated loss of major functional domain due to fusion event:
SEC31A-ALK appears to have lost the major protein functional domain of the Hgene partner, which is a CGC gene, because the partially deleted in-frame ORF does not retain it.
SEC31A-ALK appears to have lost the major protein functional domain of the Hgene partner, which is an essential gene, because the partially deleted in-frame ORF does not retain it.
SEC31A-ALK appears to have lost the major protein functional domain of the Hgene partner, which is a cell metabolism gene, because of the frame-shifted ORF.
SEC31A-ALK appears to have lost the major protein functional domain of the Tgene partner, which is a CGC gene, because of the frame-shifted ORF.
SEC31A-ALK appears to have lost the major protein functional domain of the Tgene partner, which is an IUPHAR drug target, because of the frame-shifted ORF.
SEC31A-ALK appears to have lost the major protein functional domain of the Tgene partner, which is a kinase, because of the frame-shifted ORF.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)
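
The two score definitions above can be reproduced with a few lines of Python. This is a minimal sketch, not FusionGDB code: the function names are made up for illustration, and the inputs are the partner, breakpoint, cancer-type and sample counts reported on this page for SEC31A and ALK.

```python
import math


def dof_score(n_partners: int, n_breakpoints: int, n_cancer_types: int) -> int:
    """Degree of Frequency: # partners X # break points X # cancer types."""
    return n_partners * n_breakpoints * n_cancer_types


def maii_score(n_samples: int, dof: int) -> float:
    """Major Active Isofusion Index: log2(# samples / DoF score * 10)."""
    return math.log2(n_samples / dof * 10)


def is_peginpcfg(dof: int, maii: float) -> bool:
    """Flag used on this page: DoF > 8 and MAII < 0."""
    return dof > 8 and maii < 0


# Counts as reported on this page (hypothetical variable names).
sec31a_dof = dof_score(22, 23, 10)
alk_dof = dof_score(56, 74, 20)
sec31a_maii = maii_score(25, sec31a_dof)
alk_maii = maii_score(57, alk_dof)

print(sec31a_dof, sec31a_maii, is_peginpcfg(sec31a_dof, sec31a_maii))
print(alk_dof, alk_maii, is_peginpcfg(alk_dof, alk_maii))
```

Both genes satisfy DoF>8 and MAII<0, which matches the peGinPCFGs label the page assigns to each partner.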

Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez

Partner  Gene    GO ID       GO term                                  PubMed ID
Hgene    SEC31A  GO:0051592  response to calcium ion                  17196169
Hgene    SEC31A  GO:0090110  cargo loading into COPII-coated vesicle  17499046|18843296
Tgene    ALK     GO:0016310  phosphorylation                          9174053
Tgene    ALK     GO:0046777  protein autophosphorylation              9174053

Fusion gene breakpoints across SEC31A (5'-gene)
[Figure not reproduced. On the source page, the image opens the UCSC Genome Browser with a custom track showing it.]

Fusion gene breakpoints across ALK (3'-gene)
[Figure not reproduced. On the source page, the image opens the UCSC Genome Browser with a custom track showing it.]


Fusion Gene Sample Information

Fusion gene information from FusionGDB2.0.
Fusion gene information from two resources (ChiTaRS 5.0 and ChimerDB 4.0)
* All genome coordinates were lifted over to hg19.
* On the source page, each breakpoint links to the gene structure around the breakpoint region in the UCSC Genome Browser.

Source      Disease  Sample    Hgene   Hchr  Hbp       Hstrand  Tgene  Tchr  Tbp       Tstrand
ChimerKB3   .        .         SEC31A  chr4  83763292  -        ALK    chr2  29445473  -
ChimerKB3   .        .         SEC31A  chr4  83763292  -        ALK    chr2  29446394  -
ChimerKB3   .        .         SEC31A  chr4  83769956  -        ALK    chr2  29446394  -
ChimerKB4   .        .         SEC31A  chr4  83769956  -        ALK    chr2  29446394  -
ChiTaRS5.0  N/A      KJ495955  SEC31A  chr4  83765539  -        ALK    chr2  29446394  -



Fusion ORF Analysis


Fusion information from ORFfinder translation of the full-length transcript sequences from FusionPDB.

Henst            Hgene   Hchr  Hbp       Hstrand  Tenst            Tgene  Tchr  Tbp       Tstrand  Seq length (transcript)  BP loci (transcript)  Predicted start (transcript)  Predicted stop (transcript)  Seq length (amino acids)
ENST00000355196  SEC31A  chr4  83763292  -        ENST00000389048  ALK    chr2  29445473  -        5300                     3346                  378                           3434                         1018
ENST00000348405  SEC31A  chr4  83763292  -        ENST00000389048  ALK    chr2  29446394  -        5045                     2904                  29                            4594                         1521
ENST00000395310  SEC31A  chr4  83763292  -        ENST00000389048  ALK    chr2  29446394  -        5292                     3151                  159                           4841                         1560
ENST00000326950  SEC31A  chr4  83763292  -        ENST00000389048  ALK    chr2  29446394  -        5153                     3012                  140                           4702                         1520
ENST00000432794  SEC31A  chr4  83763292  -        ENST00000389048  ALK    chr2  29446394  -        5273                     3132                  140                           4822                         1560
ENST00000448323  SEC31A  chr4  83763292  -        ENST00000389048  ALK    chr2  29446394  -        5270                     3129                  140                           4819                         1559
ENST00000505472  SEC31A  chr4  83763292  -        ENST00000389048  ALK    chr2  29446394  -        5203                     3062                  1                             4752                         1583
ENST00000355196  SEC31A  chr4  83769956  -        ENST00000389048  ALK    chr2  29446394  -        5021                     2880                  378                           2948                         856
ENST00000348405  SEC31A  chr4  83765539  -        ENST00000389048  ALK    chr2  29446394  -        4703                     2562                  29                            4252                         1407
ENST00000513858  SEC31A  chr4  83765539  -        ENST00000389048  ALK    chr2  29446394  -        4720                     2579                  46                            4269                         1407
ENST00000395310  SEC31A  chr4  83765539  -        ENST00000389048  ALK    chr2  29446394  -        4950                     2809                  159                           4499                         1446
ENST00000443462  SEC31A  chr4  83765539  -        ENST00000389048  ALK    chr2  29446394  -        4779                     2638                  0                             4328                         1442
ENST00000509142  SEC31A  chr4  83765539  -        ENST00000389048  ALK    chr2  29446394  -        4895                     2754                  107                           4444                         1445
ENST00000326950  SEC31A  chr4  83765539  -        ENST00000389048  ALK    chr2  29446394  -        4811                     2670                  140                           4360                         1406
ENST00000432794  SEC31A  chr4  83765539  -        ENST00000389048  ALK    chr2  29446394  -        4931                     2790                  140                           4480                         1446
ENST00000311785  SEC31A  chr4  83765539  -        ENST00000389048  ALK    chr2  29446394  -        4931                     2790                  140                           4480                         1446
ENST00000448323  SEC31A  chr4  83765539  -        ENST00000389048  ALK    chr2  29446394  -        4928                     2787                  140                           4477                         1445
ENST00000505472  SEC31A  chr4  83765539  -        ENST00000389048  ALK    chr2  29446394  -        4861                     2720                  1                             4410                         1469
ENST00000500777  SEC31A  chr4  83765539  -        ENST00000389048  ALK    chr2  29446394  -        4654                     2513                  4                             4203                         1399
ENST00000508502  SEC31A  chr4  83765539  -        ENST00000389048  ALK    chr2  29446394  -        4887                     2746                  96                            4436                         1446
ENST00000264405  SEC31A  chr4  83765539  -        ENST00000389048  ALK    chr2  29446394  -        4544                     2403                  485                           4093                         1202
ENST00000505984  SEC31A  chr4  83765539  -        ENST00000389048  ALK    chr2  29446394  -        4715                     2574                  41                            4264                         1407
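
The ORF table's numeric columns can be sanity-checked against each other. The sketch below assumes a relation that is not documented on the page but is consistent with the listed values: the amino-acid length equals (predicted stop - predicted start + 1) / 3 - 1, as if the start-stop span included the stop codon. The function name and the three sample rows (start, stop, amino acids) are chosen for illustration, reading the first table row as start 378, stop 3434 and 1018 amino acids.

```python
def orf_length_aa(start: int, stop: int) -> int:
    """Amino-acid length implied by transcript start/stop coordinates.

    Assumption (inferred from the table, not stated by the source):
    the span start..stop is inclusive and ends with the stop codon.
    """
    span = stop - start + 1
    if span % 3 != 0:
        raise ValueError(f"span {span} is not a multiple of 3")
    return span // 3 - 1  # drop the stop codon


# (predicted start, predicted stop, reported amino-acid length)
sample_rows = [
    (378, 3434, 1018),  # ENST00000355196, Tbp 29445473
    (29, 4594, 1521),   # ENST00000348405, Hbp 83763292
    (0, 4328, 1442),    # ENST00000443462, Hbp 83765539
]

mismatches = [row for row in sample_rows if orf_length_aa(row[0], row[1]) != row[2]]
print("rows inconsistent with the assumed relation:", mismatches)
```

A row that failed this check would suggest either a parsing error when splitting the fused numeric columns or a different coordinate convention for that transcript.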

DeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding-potential classifier based on a convolutional neural network trained by comparison with real Ribo-seq data. If the no-coding score is < 0.5 and the coding score is > 0.5, the in-frame fusion transcript is predicted as likely to be translated.

Henst            Tenst            Hgene   Hchr  Hbp       Hstrand  Tgene  Tchr  Tbp       Tstrand  No-coding score  Coding score
ENST00000348405  ENST00000389048  SEC31A  chr4  83765539  -        ALK    chr2  29446394  -        0.001098824      0.9989011
ENST00000513858  ENST00000389048  SEC31A  chr4  83765539  -        ALK    chr2  29446394  -        0.001120927      0.998879
ENST00000395310  ENST00000389048  SEC31A  chr4  83765539  -        ALK    chr2  29446394  -        0.001604982      0.998395
ENST00000443462  ENST00000389048  SEC31A  chr4  83765539  -        ALK    chr2  29446394  -        0.001274605      0.99872535
ENST00000509142  ENST00000389048  SEC31A  chr4  83765539  -        ALK    chr2  29446394  -        0.001489312      0.99851066
ENST00000326950  ENST00000389048  SEC31A  chr4  83765539  -        ALK    chr2  29446394  -        0.001242769      0.99875724
ENST00000432794  ENST00000389048  SEC31A  chr4  83765539  -        ALK    chr2  29446394  -        0.001614514      0.99838555
ENST00000311785  ENST00000389048  SEC31A  chr4  83765539  -        ALK    chr2  29446394  -        0.001614514      0.99838555
ENST00000448323  ENST00000389048  SEC31A  chr4  83765539  -        ALK    chr2  29446394  -        0.001566215      0.99843377
ENST00000505472  ENST00000389048  SEC31A  chr4  83765539  -        ALK    chr2  29446394  -        0.001011519      0.99898845
ENST00000500777  ENST00000389048  SEC31A  chr4  83765539  -        ALK    chr2  29446394  -        0.000945032      0.99905497
ENST00000508502  ENST00000389048  SEC31A  chr4  83765539  -        ALK    chr2  29446394  -        0.001499415      0.9985006
ENST00000264405  ENST00000389048  SEC31A  chr4  83765539  -        ALK    chr2  29446394  -        0.001203131      0.9987968
ENST00000505984  ENST00000389048  SEC31A  chr4  83765539  -        ALK    chr2  29446394  -        0.001102395      0.99889755
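
The decision rule stated for DeepORF scores (no-coding score < 0.5 and coding score > 0.5) is easy to apply programmatically. The following is a small illustrative sketch; the function name and the `scores` dictionary are hypothetical, and the values are three rows taken from the table above. It does not run DeepORF itself.

```python
def likely_translated(no_coding: float, coding: float, threshold: float = 0.5) -> bool:
    """Apply the page's rule: no-coding score < 0.5 and coding score > 0.5."""
    return no_coding < threshold and coding > threshold


# Henst -> (no-coding score, coding score), copied from the table above.
scores = {
    "ENST00000348405": (0.001098824, 0.9989011),
    "ENST00000500777": (0.000945032, 0.99905497),
    "ENST00000264405": (0.001203131, 0.9987968),
}

calls = {henst: likely_translated(nc, c) for henst, (nc, c) in scores.items()}
for henst, call in calls.items():
    print(henst, "likely translated" if call else "not predicted as translated")
```

Every listed SEC31A-ALK isoform has a coding score above 0.99, so all of them pass this rule.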


Fusion Amino Acid Sequences


For each individual full-length fusion transcript sequence from FusionPDB, we ran ORFfinder and chose the longest ORF among all the predicted ones.
>FusionGDB ID_FusionGDB isoform ID_FGname_Hgene_Hchr_Hbp_Henst_Tgene_Tchr_Tbp_Tenst_length(fusion AA) seq_BP
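
The header layout above can be split into named fields with a regular expression. This is a sketch under assumptions: the pattern is written against the headers as they appear in this listing (for example, the isoform ID is read as two underscore-joined numbers such as `80098_1`), it would fail for gene symbols that themselves contain underscores, and the function and field names are invented for illustration.

```python
import re

# Pattern inferred from the headers in this listing; not an official format spec.
HEADER_RE = re.compile(
    r"^>(?P<fusion_id>\d+)_(?P<isoform_id>\d+_\d+)_(?P<fg_name>[^_]+)"
    r"_(?P<hgene>[^_]+)_(?P<hchr>chr[^_]+)_(?P<hbp>\d+)_(?P<henst>ENST\d+)"
    r"_(?P<tgene>[^_]+)_(?P<tchr>chr[^_]+)_(?P<tbp>\d+)_(?P<tenst>ENST\d+)"
    r"_length\(amino acids\)=(?P<aa_length>\d+)AA_BP=(?P<seq_bp>\d+)$"
)


def parse_header(line: str) -> dict:
    """Split one FASTA-style header line into a dict of fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    # Convert the numeric fields so they can be compared or sorted.
    for key in ("hbp", "tbp", "aa_length", "seq_bp"):
        fields[key] = int(fields[key])
    return fields


example = (
    ">80098_80098_1_SEC31A-ALK_SEC31A_chr4_83763292_ENST00000326950"
    "_ALK_chr2_29446394_ENST00000389048_length(amino acids)=1520AA_BP=957"
)
record = parse_header(example)
print(record["fg_name"], record["henst"], record["aa_length"], record["seq_bp"])
```

Parsing the headers this way makes it straightforward to cross-check each sequence record against the ORF table, for instance by matching on `henst`, `hbp` and `tbp`.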

>80098_80098_1_SEC31A-ALK_SEC31A_chr4_83763292_ENST00000326950_ALK_chr2_29446394_ENST00000389048_length(amino acids)=1520AA_BP=957
MPKVLARMKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASLEIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSKG
DVSGVLIAGGENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPPEDISC
IAWNRQVQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQMWDLRFASSPLRVLENHARGI
LAIAWSMADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQKQVDKLSSSF
GNLDPFGTGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRSD
QLQQAVQSQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKEDLGKKHIKEEKEESEFLPSSGGTFNISVSGDIDGL
ITQALLTGNFESAVDLCLHDNRMADAIILAIAGGQELLARTQKKYFAKSQSKITRLITAVVMKNWKEIVESCDLKNWREALAAVLTYAKP
DEFSALCDLLGTRLENEGDSLLQTQACLCYICAGNVEKLVACWTKAQDGSHPLSLQDLIEKVVILRKAVQLTQAMDTSTVGVLLAAKMSQ
YANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESPKIPYEKQQLPKGRPGPVAGHHQMPRVQTQQYYPHGENPPPPG
FIMHGNVNPNAAGQLPTSPGHMHTQVPPYPQPQPYQPAQPYPFGTGGSAMYRPQQPVAPPTSNAYPNTPYISSASSYTGQSQLYAAQHQA
SSPTSSPATSFPPPPSSGASFQHGGPGAPPSSSAYALPPGTTGTLPAASELPASQRTVYRRKHQELQAMQMELQSPEYKLSKLRTSTIMT
DYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDELDFLMEALIISKFNHQNIV
RCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARNCLLTCPGPGRVAKIGD
FGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTSGGRMDPPKNCPGPVYR
IMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREEERSPAAPPPLPTTSSG
KAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAKKEPHDRGNLGLEGSCT

--------------------------------------------------------------

>80098_80098_2_SEC31A-ALK_SEC31A_chr4_83763292_ENST00000348405_ALK_chr2_29446394_ENST00000389048_length(amino acids)=1521AA_BP=958
MPKVLASRMKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASLEIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSK
GDVSGVLIAGGENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPPEDIS
CIAWNRQVQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQMWDLRFASSPLRVLENHARG
ILAIAWSMADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQKQVDKLSSS
FGNLDPFGTGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRS
DQLQQAVQSQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKEDLGKKHIKEEKEESEFLPSSGGTFNISVSGDIDG
LITQALLTGNFESAVDLCLHDNRMADAIILAIAGGQELLARTQKKYFAKSQSKITRLITAVVMKNWKEIVESCDLKNWREALAAVLTYAK
PDEFSALCDLLGTRLENEGDSLLQTQACLCYICAGNVEKLVACWTKAQDGSHPLSLQDLIEKVVILRKAVQLTQAMDTSTVGVLLAAKMS
QYANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESPKIPYEKQQLPKGRPGPVAGHHQMPRVQTQQYYPHGENPPPP
GFIMHGNVNPNAAGQLPTSPGHMHTQVPPYPQPQPYQPAQPYPFGTGGSAMYRPQQPVAPPTSNAYPNTPYISSASSYTGQSQLYAAQHQ
ASSPTSSPATSFPPPPSSGASFQHGGPGAPPSSSAYALPPGTTGTLPAASELPASQRTVYRRKHQELQAMQMELQSPEYKLSKLRTSTIM
TDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDELDFLMEALIISKFNHQNI
VRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARNCLLTCPGPGRVAKIG
DFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTSGGRMDPPKNCPGPVY
RIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREEERSPAAPPPLPTTSS
GKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAKKEPHDRGNLGLEGSC

--------------------------------------------------------------

>80098_80098_3_SEC31A-ALK_SEC31A_chr4_83763292_ENST00000355196_ALK_chr2_29445473_ENST00000389048_length(amino acids)=1018AA_BP=988
MKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASLEIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSKGDVSGVLI
AGGENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPPEDISCIAWNRQV
QHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQMWDLRFASSPLRVLENHARGILAIAWSM
ADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQKQVDKLSSSFGNLDPFG
TGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRSDQLQQAVQ
SQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKEDLGKKIALALNKVDGANVALKDSDQVAQSDGEESPAAEEQLL
GEHIKEEKEESEFLPSSGGTFNISVSGDIDGLITQALLTGNFESAVDLCLHDNRMADAIILAIAGGQELLARTQKKYFAKSQSKITRLIT
AVVMKNWKEIVESCDLKNWREALAAVLTYAKPDEFSALCDLLGTRLENEGDSLLQTQACLCYICAGNVEKLVACWTKAQDGSHPLSLQDL
IEKVVILRKAVQLTQAMDTSTVGVLLAAKMSQYANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESPKIPYEKQQLP
KGRPGPVAGHHQMPRVQTQQYYPHGENPPPPGFIMHGNVNPNAAGQLPTSPGHMHTQVPPYPQPQPYQPAQPYPFGTGGSAMYRPQQPVA
PPTSNAYPNTPYISSASSYTGQSQLYAAQHQASSPTSSPATSFPPPPSSGASFQHGGPGAPPSSSAYALPPGTTGTLPAASELPASQRTG

--------------------------------------------------------------

>80098_80098_4_SEC31A-ALK_SEC31A_chr4_83763292_ENST00000395310_ALK_chr2_29446394_ENST00000389048_length(amino acids)=1560AA_BP=997
MPKVLASRMKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASLEIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSK
GDVSGVLIAGGENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPPEDIS
CIAWNRQVQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQMWDLRFASSPLRVLENHARG
ILAIAWSMADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQKQVDKLSSS
FGNLDPFGTGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRS
DQLQQAVQSQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKEDLGKKIALALNKVDGANVALKDSDQVAQSDGEES
PAAEEQLLGEHIKEEKEESEFLPSSGGTFNISVSGDIDGLITQALLTGNFESAVDLCLHDNRMADAIILAIAGGQELLARTQKKYFAKSQ
SKITRLITAVVMKNWKEIVESCDLKNWREALAAVLTYAKPDEFSALCDLLGTRLENEGDSLLQTQACLCYICAGNVEKLVACWTKAQDGS
HPLSLQDLIEKVVILRKAVQLTQAMDTSTVGVLLAAKMSQYANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESPKI
PYEKQQLPKGRPGPVAGHHQMPRVQTQQYYPHGENPPPPGFIMHGNVNPNAAGQLPTSPGHMHTQVPPYPQPQPYQPAQPYPFGTGGSAM
YRPQQPVAPPTSNAYPNTPYISSASSYTGQSQLYAAQHQASSPTSSPATSFPPPPSSGASFQHGGPGAPPSSSAYALPPGTTGTLPAASE
LPASQRTVYRRKHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPN
DPSPLQVAVKTLPEVCSEQDELDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVA
RDIACGCQYLEENHFIHRDIAARNCLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLW
EIFSLGYMPYPSKSNQEVLEFVTSGGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEK
VPVRPKDPEGVPPLLVSQQAKREEERSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPT
SLWNPTYGSWFTEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTANMKEVPLFRLRHFPCGNVNYGYQQQG

--------------------------------------------------------------

>80098_80098_5_SEC31A-ALK_SEC31A_chr4_83763292_ENST00000432794_ALK_chr2_29446394_ENST00000389048_length(amino acids)=1560AA_BP=997
MPKVLASRMKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASLEIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSK
GDVSGVLIAGGENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPPEDIS
CIAWNRQVQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQMWDLRFASSPLRVLENHARG
ILAIAWSMADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQKQVDKLSSS
FGNLDPFGTGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRS
DQLQQAVQSQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKEDLGKKIALALNKVDGANVALKDSDQVAQSDGEES
PAAEEQLLGEHIKEEKEESEFLPSSGGTFNISVSGDIDGLITQALLTGNFESAVDLCLHDNRMADAIILAIAGGQELLARTQKKYFAKSQ
SKITRLITAVVMKNWKEIVESCDLKNWREALAAVLTYAKPDEFSALCDLLGTRLENEGDSLLQTQACLCYICAGNVEKLVACWTKAQDGS
HPLSLQDLIEKVVILRKAVQLTQAMDTSTVGVLLAAKMSQYANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESPKI
PYEKQQLPKGRPGPVAGHHQMPRVQTQQYYPHGENPPPPGFIMHGNVNPNAAGQLPTSPGHMHTQVPPYPQPQPYQPAQPYPFGTGGSAM
YRPQQPVAPPTSNAYPNTPYISSASSYTGQSQLYAAQHQASSPTSSPATSFPPPPSSGASFQHGGPGAPPSSSAYALPPGTTGTLPAASE
LPASQRTVYRRKHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPN
DPSPLQVAVKTLPEVCSEQDELDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVA
RDIACGCQYLEENHFIHRDIAARNCLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLW
EIFSLGYMPYPSKSNQEVLEFVTSGGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEK
VPVRPKDPEGVPPLLVSQQAKREEERSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPT
SLWNPTYGSWFTEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTANMKEVPLFRLRHFPCGNVNYGYQQQG

--------------------------------------------------------------

>80098_80098_6_SEC31A-ALK_SEC31A_chr4_83763292_ENST00000448323_ALK_chr2_29446394_ENST00000389048_length(amino acids)=1559AA_BP=996
MPKVLARMKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASLEIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSKG
DVSGVLIAGGENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPPEDISC
IAWNRQVQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQMWDLRFASSPLRVLENHARGI
LAIAWSMADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQKQVDKLSSSF
GNLDPFGTGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRSD
QLQQAVQSQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKEDLGKKIALALNKVDGANVALKDSDQVAQSDGEESP
AAEEQLLGEHIKEEKEESEFLPSSGGTFNISVSGDIDGLITQALLTGNFESAVDLCLHDNRMADAIILAIAGGQELLARTQKKYFAKSQS
KITRLITAVVMKNWKEIVESCDLKNWREALAAVLTYAKPDEFSALCDLLGTRLENEGDSLLQTQACLCYICAGNVEKLVACWTKAQDGSH
PLSLQDLIEKVVILRKAVQLTQAMDTSTVGVLLAAKMSQYANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESPKIP
YEKQQLPKGRPGPVAGHHQMPRVQTQQYYPHGENPPPPGFIMHGNVNPNAAGQLPTSPGHMHTQVPPYPQPQPYQPAQPYPFGTGGSAMY
RPQQPVAPPTSNAYPNTPYISSASSYTGQSQLYAAQHQASSPTSSPATSFPPPPSSGASFQHGGPGAPPSSSAYALPPGTTGTLPAASEL
PASQRTVYRRKHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPND
PSPLQVAVKTLPEVCSEQDELDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVAR
DIACGCQYLEENHFIHRDIAARNCLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWE
IFSLGYMPYPSKSNQEVLEFVTSGGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEKV
PVRPKDPEGVPPLLVSQQAKREEERSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTS
LWNPTYGSWFTEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTANMKEVPLFRLRHFPCGNVNYGYQQQGL

--------------------------------------------------------------

>80098_80098_7_SEC31A-ALK_SEC31A_chr4_83763292_ENST00000505472_ALK_chr2_29446394_ENST00000389048_length(amino acids)=1583AA_BP=1020
MKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASLEIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSKGDVSGVLI
AGGENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPPEDISCIAWNRQV
QHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQMWDLRFASSPLRVLENHARGILAIAWSM
ADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQKQVDKLSSSFGNLDPFG
TGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRSDQLQQAVQ
SQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKEDLGKKIALALNKVDGANVALKDSDQVAQSDGEESPAAEEQLL
GEHIKEEKEESEFLPSSGGTFNISVSGDIDGLITQALLTGNFESAVDLCLHDNRMADAIILAIAGGQELLARTQKKYFAKSQSKITRLIT
AVVMKNWKEIVESCDLKNWREALAAVLTYAKPDEFSALCDLLGTRLENEGDSLLQTQACLCYICAGNVEKLVACWTKAQDGSHPLSLQDL
IEKVVILRKAVQLTQAMDTSTVGVLLAAKMSQYANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESPKIPYEKQQLP
KGRPGPVAGHHQMPRVQTQQYYPHVRIAPTVTTWSNKTPTALPSHPPAASPSDTQGENPPPPGFIMHGNVNPNAAGQLPTSPGHMHTQVP
PYPQPQPYQPAQPYPFGTGGSAMYRPQQPVAPPTSNAYPNTPYISSASSYTGQSQLYAAQHQASSPTSSPATSFPPPPSSGASFQHGGPG
APPSSSAYALPPGTTGTLPAASELPASQRTVYRRKHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNIT
LIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDELDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSF
LRETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARNCLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMP
PEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTSGGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCT
QDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREEERSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVN
MAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTANMK

--------------------------------------------------------------

>80098_80098_8_SEC31A-ALK_SEC31A_chr4_83765539_ENST00000264405_ALK_chr2_29446394_ENST00000389048_length(amino acids)=1202AA_BP=640
MVKLVLLSIVLLKVTVPKLSNDLLQLDFMPIHRGILAIAWSMADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAV
LSAASFDGRISVYSIMGGSTDGLRQKQVDKLSSSFGNLDPFGTGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSFGGKLVTFE
NVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRSDQLQQAVQSQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKE
DLGKKHIKEEKEESEFLPSSGGTFNISVSGDIDGLITQALLTGNFESAVDLCLHDNRMADAIILAIAGGQELLARTQKKYFAKSQSKITR
LITAVVMKNWKEIVESCDLKNWREALAAVLTYAKPDEFSALCDLLGTRLENEGDSLLQTQACLCYICAGNVEKLVACWTKAQDGSHPLSL
QDLIEKVVILRKAVQLTQAMDTSTVGVLLAAKMSQYANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESPKIPYEKQ
QLPKGRPGPVAGHHQMPRVQTQQYYPHVRIAPTVTTWSNKTPTALPSHPPAASPSDTQGENPPPPGFIMHGNVNPNAAGQLPTSPGHMHT
QVPPYPQPQLYRRKHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGM
PNDPSPLQVAVKTLPEVCSEQDELDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLH
VARDIACGCQYLEENHFIHRDIAARNCLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVL
LWEIFSLGYMPYPSKSNQEVLEFVTSGGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEE
EKVPVRPKDPEGVPPLLVSQQAKREEERSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNK
PTSLWNPTYGSWFTEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTANMKEVPLFRLRHFPCGNVNYGYQQ

--------------------------------------------------------------

>80098_80098_9_SEC31A-ALK_SEC31A_chr4_83765539_ENST00000311785_ALK_chr2_29446394_ENST00000389048_length(amino acids)=1446AA_BP=884
MPKVLASRMKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASLEIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSK
GDVSGVLIAGGENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPPEDIS
CIAWNRQVQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQMWDLRFASSPLRVLENHARG
ILAIAWSMADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQKQVDKLSSS
FGNLDPFGTGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRS
DQLQQAVQSQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKEDLGKKIALALNKVDGANVALKDSDQVAQSDGEES
PAAEEQLLGEHIKEEKEESEFLPSSGGTFNISVSGDIDGLITQALLTGNFESAVDLCLHDNRMADAIILAIAGGQELLARTQKKYFAKSQ
SKITRLITAVVMKNWKEIVESCDLKNWREALAAVLTYAKPDEFSALCDLLGTRLENEGDSLLQTQACLCYICAGNVEKLVACWTKAQDGS
HPLSLQDLIEKVVILRKAVQLTQAMDTSTVGVLLAAKMSQYANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESPKI
PYEKQQLPKGRPGPVAGHHQMPRVQTQQYYPHGENPPPPGFIMHGNVNPNAAGQLPTSPGHMHTQVPPYPQPQLYRRKHQELQAMQMELQ
SPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDELDF
LMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARN
CLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTS
GGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREE
ERSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAK
KEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTANMKEVPLFRLRHFPCGNVNYGYQQQGLPLEAATAPGAGHYEDTILKSKNS

--------------------------------------------------------------

>80098_80098_10_SEC31A-ALK_SEC31A_chr4_83765539_ENST00000326950_ALK_chr2_29446394_ENST00000389048_length(amino acids)=1406AA_BP=844
MPKVLARMKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASLEIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSKG
DVSGVLIAGGENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPPEDISC
IAWNRQVQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQMWDLRFASSPLRVLENHARGI
LAIAWSMADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQKQVDKLSSSF
GNLDPFGTGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRSD
QLQQAVQSQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKEDLGKKHIKEEKEESEFLPSSGGTFNISVSGDIDGL
ITQALLTGNFESAVDLCLHDNRMADAIILAIAGGQELLARTQKKYFAKSQSKITRLITAVVMKNWKEIVESCDLKNWREALAAVLTYAKP
DEFSALCDLLGTRLENEGDSLLQTQACLCYICAGNVEKLVACWTKAQDGSHPLSLQDLIEKVVILRKAVQLTQAMDTSTVGVLLAAKMSQ
YANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESPKIPYEKQQLPKGRPGPVAGHHQMPRVQTQQYYPHGENPPPPG
FIMHGNVNPNAAGQLPTSPGHMHTQVPPYPQPQLYRRKHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRK
NITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDELDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDL
KSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARNCLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVK
WMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTSGGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIE
YCTQDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREEERSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGG
HVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTA

--------------------------------------------------------------

>80098_80098_11_SEC31A-ALK_SEC31A_chr4_83765539_ENST00000348405_ALK_chr2_29446394_ENST00000389048_length(amino acids)=1407AA_BP=845
MPKVLASRMKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASLEIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSK
GDVSGVLIAGGENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPPEDIS
CIAWNRQVQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQMWDLRFASSPLRVLENHARG
ILAIAWSMADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQKQVDKLSSS
FGNLDPFGTGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRS
DQLQQAVQSQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKEDLGKKHIKEEKEESEFLPSSGGTFNISVSGDIDG
LITQALLTGNFESAVDLCLHDNRMADAIILAIAGGQELLARTQKKYFAKSQSKITRLITAVVMKNWKEIVESCDLKNWREALAAVLTYAK
PDEFSALCDLLGTRLENEGDSLLQTQACLCYICAGNVEKLVACWTKAQDGSHPLSLQDLIEKVVILRKAVQLTQAMDTSTVGVLLAAKMS
QYANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESPKIPYEKQQLPKGRPGPVAGHHQMPRVQTQQYYPHGENPPPP
GFIMHGNVNPNAAGQLPTSPGHMHTQVPPYPQPQLYRRKHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPR
KNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDELDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGD
LKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARNCLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPV
KWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTSGGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERI
EYCTQDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREEERSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEG
GHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLT

--------------------------------------------------------------

>80098_80098_12_SEC31A-ALK_SEC31A_chr4_83765539_ENST00000395310_ALK_chr2_29446394_ENST00000389048_length(amino acids)=1446AA_BP=884
MPKVLASRMKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASLEIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSK
GDVSGVLIAGGENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPPEDIS
CIAWNRQVQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQMWDLRFASSPLRVLENHARG
ILAIAWSMADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQKQVDKLSSS
FGNLDPFGTGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRS
DQLQQAVQSQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKEDLGKKIALALNKVDGANVALKDSDQVAQSDGEES
PAAEEQLLGEHIKEEKEESEFLPSSGGTFNISVSGDIDGLITQALLTGNFESAVDLCLHDNRMADAIILAIAGGQELLARTQKKYFAKSQ
SKITRLITAVVMKNWKEIVESCDLKNWREALAAVLTYAKPDEFSALCDLLGTRLENEGDSLLQTQACLCYICAGNVEKLVACWTKAQDGS
HPLSLQDLIEKVVILRKAVQLTQAMDTSTVGVLLAAKMSQYANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESPKI
PYEKQQLPKGRPGPVAGHHQMPRVQTQQYYPHGENPPPPGFIMHGNVNPNAAGQLPTSPGHMHTQVPPYPQPQLYRRKHQELQAMQMELQ
SPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDELDF
LMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARN
CLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTS
GGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREE
ERSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAK
KEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTANMKEVPLFRLRHFPCGNVNYGYQQQGLPLEAATAPGAGHYEDTILKSKNS

--------------------------------------------------------------

>80098_80098_13_SEC31A-ALK_SEC31A_chr4_83765539_ENST00000432794_ALK_chr2_29446394_ENST00000389048_length(amino acids)=1446AA_BP=884
MPKVLASRMKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASLEIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSK
GDVSGVLIAGGENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPPEDIS
CIAWNRQVQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQMWDLRFASSPLRVLENHARG
ILAIAWSMADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQKQVDKLSSS
FGNLDPFGTGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRS
DQLQQAVQSQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKEDLGKKIALALNKVDGANVALKDSDQVAQSDGEES
PAAEEQLLGEHIKEEKEESEFLPSSGGTFNISVSGDIDGLITQALLTGNFESAVDLCLHDNRMADAIILAIAGGQELLARTQKKYFAKSQ
SKITRLITAVVMKNWKEIVESCDLKNWREALAAVLTYAKPDEFSALCDLLGTRLENEGDSLLQTQACLCYICAGNVEKLVACWTKAQDGS
HPLSLQDLIEKVVILRKAVQLTQAMDTSTVGVLLAAKMSQYANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESPKI
PYEKQQLPKGRPGPVAGHHQMPRVQTQQYYPHGENPPPPGFIMHGNVNPNAAGQLPTSPGHMHTQVPPYPQPQLYRRKHQELQAMQMELQ
SPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDELDF
LMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARN
CLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTS
GGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREE
ERSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAK
KEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTANMKEVPLFRLRHFPCGNVNYGYQQQGLPLEAATAPGAGHYEDTILKSKNS

--------------------------------------------------------------

>80098_80098_14_SEC31A-ALK_SEC31A_chr4_83765539_ENST00000443462_ALK_chr2_29446394_ENST00000389048_length(amino acids)=1442AA_BP=880
LAGEEGRRRMLGESDERCTNAGSGCRRSSPGTSAQQLDATFSTNASLEIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSKGDVS
GVLIAGGENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPPEDISCIAW
NRQVQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQMWDLRFASSPLRVLENHARGILAI
AWSMADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQKQVDKLSSSFGNL
DPFGTGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRSDQLQ
QAVQSQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKEDLGKKIALALNKVDGANVALKDSDQVAQSDGEESPAAE
EQLLGEHIKEEKEESEFLPSSGGTFNISVSGDIDGLITQALLTGNFESAVDLCLHDNRMADAIILAIAGGQELLARTQKKYFAKSQSKIT
RLITAVVMKNWKEIVESCDLKNWREALAAVLTYAKPDEFSALCDLLGTRLENEGDSLLQTQACLCYICAGNVEKLVACWTKAQDGSHPLS
LQDLIEKVVILRKAVQLTQAMDTSTVGVLLAAKMSQYANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESPKIPYEK
QQLPKGRPGPVAGHHQMPRVQTQQYYPHGENPPPPGFIMHGNVNPNAAGQLPTSPGHMHTQVPPYPQPQLYRRKHQELQAMQMELQSPEY
KLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDELDFLMEA
LIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARNCLLT
CPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTSGGRM
DPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREEERSP
AAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAKKEPH
DRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTANMKEVPLFRLRHFPCGNVNYGYQQQGLPLEAATAPGAGHYEDTILKSKNSMNQP

--------------------------------------------------------------

>80098_80098_15_SEC31A-ALK_SEC31A_chr4_83765539_ENST00000448323_ALK_chr2_29446394_ENST00000389048_length(amino acids)=1445AA_BP=883
MPKVLARMKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASLEIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSKG
DVSGVLIAGGENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPPEDISC
IAWNRQVQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQMWDLRFASSPLRVLENHARGI
LAIAWSMADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQKQVDKLSSSF
GNLDPFGTGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRSD
QLQQAVQSQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKEDLGKKIALALNKVDGANVALKDSDQVAQSDGEESP
AAEEQLLGEHIKEEKEESEFLPSSGGTFNISVSGDIDGLITQALLTGNFESAVDLCLHDNRMADAIILAIAGGQELLARTQKKYFAKSQS
KITRLITAVVMKNWKEIVESCDLKNWREALAAVLTYAKPDEFSALCDLLGTRLENEGDSLLQTQACLCYICAGNVEKLVACWTKAQDGSH
PLSLQDLIEKVVILRKAVQLTQAMDTSTVGVLLAAKMSQYANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESPKIP
YEKQQLPKGRPGPVAGHHQMPRVQTQQYYPHGENPPPPGFIMHGNVNPNAAGQLPTSPGHMHTQVPPYPQPQLYRRKHQELQAMQMELQS
PEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDELDFL
MEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARNC
LLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTSG
GRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREEE
RSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAKK
EPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTANMKEVPLFRLRHFPCGNVNYGYQQQGLPLEAATAPGAGHYEDTILKSKNSM

--------------------------------------------------------------

>80098_80098_16_SEC31A-ALK_SEC31A_chr4_83765539_ENST00000500777_ALK_chr2_29446394_ENST00000389048_length(amino acids)=1399AA_BP=837
MKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASLEIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSKGDVSGVLI
AGGENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPPEDISCIAWNRQV
QHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQMWDLRFASSPLRVLENHARGILAIAWSM
ADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQKQVDKLSSSFGNLDPFG
TGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRSDQLQQAVQ
SQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKEDLGKKHIKEEKEESEFLPSSGGTFNISVSGDIDGLITQALLT
GNFESAVDLCLHDNRMADAIILAIAGGQELLARTQKKYFAKSQSKITRLITAVVMKNWKEIVESCDLKNWREALAAVLTYAKPDEFSALC
DLLGTRLENEGDSLLQTQACLCYICAGNVEKLVACWTKAQDGSHPLSLQDLIEKVVILRKAVQLTQAMDTSTVGVLLAAKMSQYANLLAA
QGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESPKIPYEKQQLPKGRPGPVAGHHQMPRVQTQQYYPHGENPPPPGFIMHGNV
NPNAAGQLPTSPGHMHTQVPPYPQPQLYRRKHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRG
LGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDELDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRET
RPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARNCLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAF
MEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTSGGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPD
VINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREEERSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFS
QSNPPSELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTANMKEVPL

--------------------------------------------------------------

>80098_80098_17_SEC31A-ALK_SEC31A_chr4_83765539_ENST00000505472_ALK_chr2_29446394_ENST00000389048_length(amino acids)=1469AA_BP=907
MKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASLEIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSKGDVSGVLI
AGGENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPPEDISCIAWNRQV
QHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQMWDLRFASSPLRVLENHARGILAIAWSM
ADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQKQVDKLSSSFGNLDPFG
TGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRSDQLQQAVQ
SQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKEDLGKKIALALNKVDGANVALKDSDQVAQSDGEESPAAEEQLL
GEHIKEEKEESEFLPSSGGTFNISVSGDIDGLITQALLTGNFESAVDLCLHDNRMADAIILAIAGGQELLARTQKKYFAKSQSKITRLIT
AVVMKNWKEIVESCDLKNWREALAAVLTYAKPDEFSALCDLLGTRLENEGDSLLQTQACLCYICAGNVEKLVACWTKAQDGSHPLSLQDL
IEKVVILRKAVQLTQAMDTSTVGVLLAAKMSQYANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESPKIPYEKQQLP
KGRPGPVAGHHQMPRVQTQQYYPHVRIAPTVTTWSNKTPTALPSHPPAASPSDTQGENPPPPGFIMHGNVNPNAAGQLPTSPGHMHTQVP
PYPQPQLYRRKHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPND
PSPLQVAVKTLPEVCSEQDELDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVAR
DIACGCQYLEENHFIHRDIAARNCLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWE
IFSLGYMPYPSKSNQEVLEFVTSGGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEKV
PVRPKDPEGVPPLLVSQQAKREEERSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTS
LWNPTYGSWFTEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTANMKEVPLFRLRHFPCGNVNYGYQQQGL

--------------------------------------------------------------

>80098_80098_18_SEC31A-ALK_SEC31A_chr4_83765539_ENST00000505984_ALK_chr2_29446394_ENST00000389048_length(amino acids)=1407AA_BP=845
MPKVLASRMKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASLEIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSK
GDVSGVLIAGGENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPPEDIS
CIAWNRQVQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQMWDLRFASSPLRVLENHARG
ILAIAWSMADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQKQVDKLSSS
FGNLDPFGTGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRS
DQLQQAVQSQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKEDLGKKHIKEEKEESEFLPSSGGTFNISVSGDIDG
LITQALLTGNFESAVDLCLHDNRMADAIILAIAGGQELLARTQKKYFAKSQSKITRLITAVVMKNWKEIVESCDLKNWREALAAVLTYAK
PDEFSALCDLLGTRLENEGDSLLQTQACLCYICAGNVEKLVACWTKAQDGSHPLSLQDLIEKVVILRKAVQLTQAMDTSTVGVLLAAKMS
QYANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESPKIPYEKQQLPKGRPGPVAGHHQMPRVQTQQYYPHGENPPPP
GFIMHGNVNPNAAGQLPTSPGHMHTQVPPYPQPQLYRRKHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPR
KNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDELDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGD
LKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARNCLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPV
KWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTSGGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERI
EYCTQDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREEERSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEG
GHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLT

--------------------------------------------------------------

>80098_80098_19_SEC31A-ALK_SEC31A_chr4_83765539_ENST00000508502_ALK_chr2_29446394_ENST00000389048_length(amino acids)=1446AA_BP=884
MPKVLASRMKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASLEIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSK
GDVSGVLIAGGENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPPEDIS
CIAWNRQVQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQMWDLRFASSPLRVLENHARG
ILAIAWSMADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQKQVDKLSSS
FGNLDPFGTGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRS
DQLQQAVQSQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKEDLGKKIALALNKVDGANVALKDSDQVAQSDGEES
PAAEEQLLGEHIKEEKEESEFLPSSGGTFNISVSGDIDGLITQALLTGNFESAVDLCLHDNRMADAIILAIAGGQELLARTQKKYFAKSQ
SKITRLITAVVMKNWKEIVESCDLKNWREALAAVLTYAKPDEFSALCDLLGTRLENEGDSLLQTQACLCYICAGNVEKLVACWTKAQDGS
HPLSLQDLIEKVVILRKAVQLTQAMDTSTVGVLLAAKMSQYANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESPKI
PYEKQQLPKGRPGPVAGHHQMPRVQTQQYYPHGENPPPPGFIMHGNVNPNAAGQLPTSPGHMHTQVPPYPQPQLYRRKHQELQAMQMELQ
SPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDELDF
LMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARN
CLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTS
GGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREE
ERSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAK
KEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTANMKEVPLFRLRHFPCGNVNYGYQQQGLPLEAATAPGAGHYEDTILKSKNS

--------------------------------------------------------------

>80098_80098_20_SEC31A-ALK_SEC31A_chr4_83765539_ENST00000509142_ALK_chr2_29446394_ENST00000389048_length(amino acids)=1445AA_BP=883
MPKVLARMKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASLEIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSKG
DVSGVLIAGGENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPPEDISC
IAWNRQVQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQMWDLRFASSPLRVLENHARGI
LAIAWSMADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQKQVDKLSSSF
GNLDPFGTGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRSD
QLQQAVQSQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKEDLGKKIALALNKVDGANVALKDSDQVAQSDGEESP
AAEEQLLGEHIKEEKEESEFLPSSGGTFNISVSGDIDGLITQALLTGNFESAVDLCLHDNRMADAIILAIAGGQELLARTQKKYFAKSQS
KITRLITAVVMKNWKEIVESCDLKNWREALAAVLTYAKPDEFSALCDLLGTRLENEGDSLLQTQACLCYICAGNVEKLVACWTKAQDGSH
PLSLQDLIEKVVILRKAVQLTQAMDTSTVGVLLAAKMSQYANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESPKIP
YEKQQLPKGRPGPVAGHHQMPRVQTQQYYPHGENPPPPGFIMHGNVNPNAAGQLPTSPGHMHTQVPPYPQPQLYRRKHQELQAMQMELQS
PEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDELDFL
MEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARNC
LLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTSG
GRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREEE
RSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAKK
EPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTANMKEVPLFRLRHFPCGNVNYGYQQQGLPLEAATAPGAGHYEDTILKSKNSM

--------------------------------------------------------------

>80098_80098_21_SEC31A-ALK_SEC31A_chr4_83765539_ENST00000513858_ALK_chr2_29446394_ENST00000389048_length(amino acids)=1407AA_BP=845
MPKVLASRMKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASLEIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSK
GDVSGVLIAGGENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPPEDIS
CIAWNRQVQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQMWDLRFASSPLRVLENHARG
ILAIAWSMADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQKQVDKLSSS
FGNLDPFGTGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRS
DQLQQAVQSQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKEDLGKKHIKEEKEESEFLPSSGGTFNISVSGDIDG
LITQALLTGNFESAVDLCLHDNRMADAIILAIAGGQELLARTQKKYFAKSQSKITRLITAVVMKNWKEIVESCDLKNWREALAAVLTYAK
PDEFSALCDLLGTRLENEGDSLLQTQACLCYICAGNVEKLVACWTKAQDGSHPLSLQDLIEKVVILRKAVQLTQAMDTSTVGVLLAAKMS
QYANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESPKIPYEKQQLPKGRPGPVAGHHQMPRVQTQQYYPHGENPPPP
GFIMHGNVNPNAAGQLPTSPGHMHTQVPPYPQPQLYRRKHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPR
KNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDELDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGD
LKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARNCLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPV
KWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTSGGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERI
EYCTQDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREEERSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEG
GHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLT

--------------------------------------------------------------

>80098_80098_22_SEC31A-ALK_SEC31A_chr4_83769956_ENST00000355196_ALK_chr2_29446394_ENST00000389048_length(amino acids)=856AA_BP=834
MKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASLEIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSKGDVSGVLI
AGGENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPPEDISCIAWNRQV
QHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQMWDLRFASSPLRVLENHARGILAIAWSM
ADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQKQVDKLSSSFGNLDPFG
TGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRSDQLQQAVQ
SQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKEDLGKKIALALNKVDGANVALKDSDQVAQSDGEESPAAEEQLL
GEHIKEEKEESEFLPSSGGTFNISVSGDIDGLITQALLTGNFESAVDLCLHDNRMADAIILAIAGGQELLARTQKKYFAKSQSKITRLIT
AVVMKNWKEIVESCDLKNWREALAAVLTYAKPDEFSALCDLLGTRLENEGDSLLQTQACLCYICAGNVEKLVACWTKAQDGSHPLSLQDL
IEKVVILRKAVQLTQAMDTSTVGVLLAAKMSQYANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESPKIPYEKQQLP

--------------------------------------------------------------
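
The sequence records above pack their metadata into a single underscore-delimited header line. A minimal sketch of how such a header could be split into fields is shown below. The field names (`FusionHeader`, `bp_residue`, and so on) are hypothetical, and the layout is inferred only from the headers listed on this page, so gene symbols containing underscores would not parse. Reading `BP` as the index of the last Hgene-derived residue is also an assumption.

```python
import re
from typing import NamedTuple


class FusionHeader(NamedTuple):
    """Fields inferred from the header layout on this page (names are hypothetical)."""
    fusion_id: str
    index: int
    fusion_name: str
    hgene: str
    hchr: str
    hbp: int
    h_enst: str
    tgene: str
    tchr: str
    tbp: int
    t_enst: str
    length_aa: int
    bp_residue: int


# Pattern derived from the headers shown above; not an official format definition.
HEADER_RE = re.compile(
    r"^>(?P<fid>\d+)_\d+_(?P<idx>\d+)_(?P<name>[^_]+-[^_]+)"
    r"_(?P<hgene>[^_]+)_(?P<hchr>chr[^_]+)_(?P<hbp>\d+)_(?P<henst>ENST\d+)"
    r"_(?P<tgene>[^_]+)_(?P<tchr>chr[^_]+)_(?P<tbp>\d+)_(?P<tenst>ENST\d+)"
    r"_length\(amino acids\)=(?P<length>\d+)AA_BP=(?P<bp>\d+)$"
)


def parse_header(line: str) -> FusionHeader:
    """Parse one '>' header line; raise ValueError if it does not fit the pattern."""
    m = HEADER_RE.match(line.strip())
    if m is None:
        raise ValueError(f"unrecognised header: {line!r}")
    return FusionHeader(
        fusion_id=m["fid"],
        index=int(m["idx"]),
        fusion_name=m["name"],
        hgene=m["hgene"],
        hchr=m["hchr"],
        hbp=int(m["hbp"]),
        h_enst=m["henst"],
        tgene=m["tgene"],
        tchr=m["tchr"],
        tbp=int(m["tbp"]),
        t_enst=m["tenst"],
        length_aa=int(m["length"]),
        bp_residue=int(m["bp"]),
    )


if __name__ == "__main__":
    header = parse_header(
        ">80098_80098_14_SEC31A-ALK_SEC31A_chr4_83765539_ENST00000443462"
        "_ALK_chr2_29446394_ENST00000389048_length(amino acids)=1442AA_BP=880"
    )
    # Residues after the breakpoint, assuming BP is the last Hgene residue.
    print(header.length_aa - header.bp_residue)  # 562
```

For the records that share the ALK breakpoint chr2:29446394 and list a full-length product (for example 1442AA/BP=880 and 1445AA/BP=883), the difference between length and BP is 562 in each case.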


Fusion Protein Functional Features


Four levels of functional features of fusion genes
Go to the FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr4:83765539/chr2:29446394)
- FGviewer provides online visualization of which protein functional features are retained, across the DNA, RNA, protein, and pathological levels.
- How to search
1. Enter your fusion gene symbol.
2. Press the Tab key until the breakpoint information is filled in.
3. Scroll down and press the 'Search' tab twice.
4. Scroll down to find the hyperlink to the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
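
The FGviewer link given above embeds the two breakpoints directly in its path. As a small sketch, the same URL can be assembled from the breakpoint coordinates. The function name `fgviewer_url` is hypothetical, and the path pattern is generalised from the single link on this page, so its validity for other breakpoints is an assumption.

```python
def fgviewer_url(hchr: str, hbp: int, tchr: str, tbp: int) -> str:
    """Build an FGviewer link from head- and tail-gene breakpoints.

    The pattern is copied from the one link shown on this page; it is
    not taken from FGviewer documentation.
    """
    base = "https://ccsmweb.uth.edu/FGviewer"
    return f"{base}/{hchr}:{hbp}/{tchr}:{tbp}"


if __name__ == "__main__":
    # Most frequent SEC31A-ALK breakpoint listed on this page.
    print(fgviewer_url("chr4", 83765539, "chr2", 29446394))
```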

Main function of each fusion partner protein (from UniProt)

Hgene: SEC31A (UniProt O94979)
FUNCTION: Component of the coat protein complex II (COPII) which promotes the formation of transport vesicles from the endoplasmic reticulum (ER) (PubMed:10788476). The coat has two main functions, the physical deformation of the endoplasmic reticulum membrane into vesicles and the selection of cargo molecules (By similarity). {ECO:0000250|UniProtKB:Q9Z2Q1, ECO:0000269|PubMed:10788476}.

Tgene: ALK (UniProt Q96BT7)
FUNCTION: Catalyzes the methylation of 5-carboxymethyl uridine to 5-methylcarboxymethyl uridine at the wobble position of the anticodon loop in tRNA via its methyltransferase domain (PubMed:20123966, PubMed:20308323, PubMed:31079898). Catalyzes the last step in the formation of 5-methylcarboxymethyl uridine at the wobble position of the anticodon loop in target tRNA (PubMed:20123966, PubMed:20308323). Has a preference for tRNA(Arg) and tRNA(Glu), and does not bind tRNA(Lys) (PubMed:20308323). Binds tRNA and catalyzes the iron and alpha-ketoglutarate dependent hydroxylation of 5-methylcarboxymethyl uridine at the wobble position of the anticodon loop in tRNA via its dioxygenase domain, giving rise to 5-(S)-methoxycarbonylhydroxymethyluridine; has a preference for tRNA(Gly) (PubMed:21285950). Required for normal survival after DNA damage (PubMed:20308323). May inhibit apoptosis and promote cell survival and angiogenesis (PubMed:19293182). {ECO:0000269|PubMed:19293182, ECO:0000269|PubMed:20123966, ECO:0000269|PubMed:20308323, ECO:0000269|PubMed:21285950, ECO:0000269|PubMed:31079898}.

Retention analysis result of each fusion partner protein across 39 UniProt protein features: six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and three secondary structure features. Because of limited viewing space, only the retention information for the 13 regional features is shown here. All retention annotation results can be downloaded from the download page.

* A negative BPloci value means that the breakpoint is located before the CDS.
- Retained protein features among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000311785 | - | 21 | 26 | 842_848 | 875.3333333333334 | 1107.0 | Motif | Note=ALG-2-binding site motif-2 (ABS-2)
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000355196 | - | 23 | 29 | 842_848 | 875.3333333333334 | 1221.0 | Motif | Note=ALG-2-binding site motif-2 (ABS-2)
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000395310 | - | 21 | 27 | 842_848 | 875.3333333333334 | 1221.0 | Motif | Note=ALG-2-binding site motif-2 (ABS-2)
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000432794 | - | 21 | 28 | 842_848 | 875.3333333333334 | 1234.0 | Motif | Note=ALG-2-binding site motif-2 (ABS-2)
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000443462 | - | 20 | 26 | 842_848 | 870.3333333333334 | 1201.0 | Motif | Note=ALG-2-binding site motif-2 (ABS-2)
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000448323 | - | 21 | 27 | 842_848 | 875.3333333333334 | 1221.0 | Motif | Note=ALG-2-binding site motif-2 (ABS-2)
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000508502 | - | 21 | 27 | 842_848 | 875.3333333333334 | 1206.0 | Motif | Note=ALG-2-binding site motif-2 (ABS-2)
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000509142 | - | 21 | 26 | 842_848 | 875.3333333333334 | 1107.0 | Motif | Note=ALG-2-binding site motif-2 (ABS-2)
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000311785 | - | 21 | 26 | 120_160 | 875.3333333333334 | 1107.0 | Repeat | Note=WD 3
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000311785 | - | 21 | 26 | 166_206 | 875.3333333333334 | 1107.0 | Repeat | Note=WD 4
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000311785 | - | 21 | 26 | 209_254 | 875.3333333333334 | 1107.0 | Repeat | Note=WD 5
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000311785 | - | 21 | 26 | 258_298 | 875.3333333333334 | 1107.0 | Repeat | Note=WD 6
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000311785 | - | 21 | 26 | 301_342 | 875.3333333333334 | 1107.0 | Repeat | Note=WD 7
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000311785 | - | 21 | 26 | 4_47 | 875.3333333333334 | 1107.0 | Repeat | Note=WD 1
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000311785 | - | 21 | 26 | 68_111 | 875.3333333333334 | 1107.0 | Repeat | Note=WD 2
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000326950 | - | 19 | 25 | 120_160 | 836.3333333333334 | 1182.0 | Repeat | Note=WD 3
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000326950 | - | 19 | 25 | 166_206 | 836.3333333333334 | 1182.0 | Repeat | Note=WD 4
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000326950 | - | 19 | 25 | 209_254 | 836.3333333333334 | 1182.0 | Repeat | Note=WD 5
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000326950 | - | 19 | 25 | 258_298 | 836.3333333333334 | 1182.0 | Repeat | Note=WD 6
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000326950 | - | 19 | 25 | 301_342 | 836.3333333333334 | 1182.0 | Repeat | Note=WD 7
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000326950 | - | 19 | 25 | 4_47 | 836.3333333333334 | 1182.0 | Repeat | Note=WD 1
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000326950 | - | 19 | 25 | 68_111 | 836.3333333333334 | 1182.0 | Repeat | Note=WD 2
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000348405 | - | 19 | 25 | 120_160 | 836.3333333333334 | 1182.0 | Repeat | Note=WD 3
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000348405 | - | 19 | 25 | 166_206 | 836.3333333333334 | 1182.0 | Repeat | Note=WD 4
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000348405 | - | 19 | 25 | 209_254 | 836.3333333333334 | 1182.0 | Repeat | Note=WD 5
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000348405 | - | 19 | 25 | 258_298 | 836.3333333333334 | 1182.0 | Repeat | Note=WD 6
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000348405 | - | 19 | 25 | 301_342 | 836.3333333333334 | 1182.0 | Repeat | Note=WD 7
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000348405 | - | 19 | 25 | 4_47 | 836.3333333333334 | 1182.0 | Repeat | Note=WD 1
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000348405 | - | 19 | 25 | 68_111 | 836.3333333333334 | 1182.0 | Repeat | Note=WD 2
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000355196 | - | 23 | 29 | 120_160 | 875.3333333333334 | 1221.0 | Repeat | Note=WD 3
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000355196 | - | 23 | 29 | 166_206 | 875.3333333333334 | 1221.0 | Repeat | Note=WD 4
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000355196 | - | 23 | 29 | 209_254 | 875.3333333333334 | 1221.0 | Repeat | Note=WD 5
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000355196 | - | 23 | 29 | 258_298 | 875.3333333333334 | 1221.0 | Repeat | Note=WD 6
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000355196 | - | 23 | 29 | 301_342 | 875.3333333333334 | 1221.0 | Repeat | Note=WD 7
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000355196 | - | 23 | 29 | 4_47 | 875.3333333333334 | 1221.0 | Repeat | Note=WD 1
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000355196 | - | 23 | 29 | 68_111 | 875.3333333333334 | 1221.0 | Repeat | Note=WD 2
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000395310 | - | 21 | 27 | 120_160 | 875.3333333333334 | 1221.0 | Repeat | Note=WD 3
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000395310 | - | 21 | 27 | 166_206 | 875.3333333333334 | 1221.0 | Repeat | Note=WD 4
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000395310 | - | 21 | 27 | 209_254 | 875.3333333333334 | 1221.0 | Repeat | Note=WD 5
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000395310 | - | 21 | 27 | 258_298 | 875.3333333333334 | 1221.0 | Repeat | Note=WD 6
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000395310 | - | 21 | 27 | 301_342 | 875.3333333333334 | 1221.0 | Repeat | Note=WD 7
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000395310 | - | 21 | 27 | 4_47 | 875.3333333333334 | 1221.0 | Repeat | Note=WD 1
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000395310 | - | 21 | 27 | 68_111 | 875.3333333333334 | 1221.0 | Repeat | Note=WD 2
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000432794 | - | 21 | 28 | 120_160 | 875.3333333333334 | 1234.0 | Repeat | Note=WD 3
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000432794 | - | 21 | 28 | 166_206 | 875.3333333333334 | 1234.0 | Repeat | Note=WD 4
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000432794 | - | 21 | 28 | 209_254 | 875.3333333333334 | 1234.0 | Repeat | Note=WD 5
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000432794 | - | 21 | 28 | 258_298 | 875.3333333333334 | 1234.0 | Repeat | Note=WD 6
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000432794 | - | 21 | 28 | 301_342 | 875.3333333333334 | 1234.0 | Repeat | Note=WD 7
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000432794 | - | 21 | 28 | 4_47 | 875.3333333333334 | 1234.0 | Repeat | Note=WD 1
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000432794 | - | 21 | 28 | 68_111 | 875.3333333333334 | 1234.0 | Repeat | Note=WD 2
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000443462 | - | 20 | 26 | 120_160 | 870.3333333333334 | 1201.0 | Repeat | Note=WD 3
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000443462 | - | 20 | 26 | 166_206 | 870.3333333333334 | 1201.0 | Repeat | Note=WD 4
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000443462 | - | 20 | 26 | 209_254 | 870.3333333333334 | 1201.0 | Repeat | Note=WD 5
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000443462 | - | 20 | 26 | 258_298 | 870.3333333333334 | 1201.0 | Repeat | Note=WD 6
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000443462 | - | 20 | 26 | 301_342 | 870.3333333333334 | 1201.0 | Repeat | Note=WD 7
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000443462 | - | 20 | 26 | 4_47 | 870.3333333333334 | 1201.0 | Repeat | Note=WD 1
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000443462 | - | 20 | 26 | 68_111 | 870.3333333333334 | 1201.0 | Repeat | Note=WD 2
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000448323 | - | 21 | 27 | 120_160 | 875.3333333333334 | 1221.0 | Repeat | Note=WD 3
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000448323 | - | 21 | 27 | 166_206 | 875.3333333333334 | 1221.0 | Repeat | Note=WD 4
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000448323 | - | 21 | 27 | 209_254 | 875.3333333333334 | 1221.0 | Repeat | Note=WD 5
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000448323 | - | 21 | 27 | 258_298 | 875.3333333333334 | 1221.0 | Repeat | Note=WD 6
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000448323 | - | 21 | 27 | 301_342 | 875.3333333333334 | 1221.0 | Repeat | Note=WD 7
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000448323 | - | 21 | 27 | 4_47 | 875.3333333333334 | 1221.0 | Repeat | Note=WD 1
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000448323 | - | 21 | 27 | 68_111 | 875.3333333333334 | 1221.0 | Repeat | Note=WD 2
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000500777 | - | 18 | 23 | 120_160 | 836.3333333333334 | 1068.0 | Repeat | Note=WD 3
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000500777 | - | 18 | 23 | 166_206 | 836.3333333333334 | 1068.0 | Repeat | Note=WD 4
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000500777 | - | 18 | 23 | 209_254 | 836.3333333333334 | 1068.0 | Repeat | Note=WD 5
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000500777 | - | 18 | 23 | 258_298 | 836.3333333333334 | 1068.0 | Repeat | Note=WD 6
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000500777 | - | 18 | 23 | 301_342 | 836.3333333333334 | 1068.0 | Repeat | Note=WD 7
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000500777 | - | 18 | 23 | 4_47 | 836.3333333333334 | 1068.0 | Repeat | Note=WD 1
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000500777 | - | 18 | 23 | 68_111 | 836.3333333333334 | 1068.0 | Repeat | Note=WD 2
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000508502 | - | 21 | 27 | 120_160 | 875.3333333333334 | 1206.0 | Repeat | Note=WD 3
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000508502 | - | 21 | 27 | 166_206 | 875.3333333333334 | 1206.0 | Repeat | Note=WD 4
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000508502 | - | 21 | 27 | 209_254 | 875.3333333333334 | 1206.0 | Repeat | Note=WD 5
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000508502 | - | 21 | 27 | 258_298 | 875.3333333333334 | 1206.0 | Repeat | Note=WD 6
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000508502 | - | 21 | 27 | 301_342 | 875.3333333333334 | 1206.0 | Repeat | Note=WD 7
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000508502 | - | 21 | 27 | 4_47 | 875.3333333333334 | 1206.0 | Repeat | Note=WD 1
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000508502 | - | 21 | 27 | 68_111 | 875.3333333333334 | 1206.0 | Repeat | Note=WD 2
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000509142 | - | 21 | 26 | 120_160 | 875.3333333333334 | 1107.0 | Repeat | Note=WD 3
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000509142 | - | 21 | 26 | 166_206 | 875.3333333333334 | 1107.0 | Repeat | Note=WD 4
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000509142 | - | 21 | 26 | 209_254 | 875.3333333333334 | 1107.0 | Repeat | Note=WD 5
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000509142 | - | 21 | 26 | 258_298 | 875.3333333333334 | 1107.0 | Repeat | Note=WD 6
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000509142 | - | 21 | 26 | 301_342 | 875.3333333333334 | 1107.0 | Repeat | Note=WD 7
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000509142 | - | 21 | 26 | 4_47 | 875.3333333333334 | 1107.0 | Repeat | Note=WD 1
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000509142 | - | 21 | 26 | 68_111 | 875.3333333333334 | 1107.0 | Repeat | Note=WD 2
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000513858 | - | 19 | 24 | 120_160 | 836.3333333333334 | 1068.0 | Repeat | Note=WD 3
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000513858 | - | 19 | 24 | 166_206 | 836.3333333333334 | 1068.0 | Repeat | Note=WD 4
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000513858 | - | 19 | 24 | 209_254 | 836.3333333333334 | 1068.0 | Repeat | Note=WD 5
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000513858 | - | 19 | 24 | 258_298 | 836.3333333333334 | 1068.0 | Repeat | Note=WD 6
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000513858 | - | 19 | 24 | 301_342 | 836.3333333333334 | 1068.0 | Repeat | Note=WD 7
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000513858 | - | 19 | 24 | 4_47 | 836.3333333333334 | 1068.0 | Repeat | Note=WD 1
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000513858 | - | 19 | 24 | 68_111 | 836.3333333333334 | 1068.0 | Repeat | Note=WD 2
Tgene | ALK | chr4:83765539 | chr2:29446394 | ENST00000389048 |  | 18 | 29 | 1116_1392 | 1057.3333333333333 | 1621.0 | Domain | Protein kinase
Tgene | ALK | chr4:83765539 | chr2:29446394 | ENST00000389048 |  | 18 | 29 | 1197_1199 | 1057.3333333333333 | 1621.0 | Region | Note=Inhibitor binding
Tgene | ALK | chr4:83765539 | chr2:29446394 | ENST00000389048 |  | 18 | 29 | 1060_1620 | 1057.3333333333333 | 1621.0 | Topological domain | Cytoplasmic

- Not-retained protein features among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000311785 | - | 21 | 26 | 800_1091 | 875.3333333333334 | 1107.0 | Compositional bias | Note=Pro-rich
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000326950 | - | 19 | 25 | 800_1091 | 836.3333333333334 | 1182.0 | Compositional bias | Note=Pro-rich
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000348405 | - | 19 | 25 | 800_1091 | 836.3333333333334 | 1182.0 | Compositional bias | Note=Pro-rich
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000355196 | - | 23 | 29 | 800_1091 | 875.3333333333334 | 1221.0 | Compositional bias | Note=Pro-rich
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000395310 | - | 21 | 27 | 800_1091 | 875.3333333333334 | 1221.0 | Compositional bias | Note=Pro-rich
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000432794 | - | 21 | 28 | 800_1091 | 875.3333333333334 | 1234.0 | Compositional bias | Note=Pro-rich
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000443462 | - | 20 | 26 | 800_1091 | 870.3333333333334 | 1201.0 | Compositional bias | Note=Pro-rich
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000448323 | - | 21 | 27 | 800_1091 | 875.3333333333334 | 1221.0 | Compositional bias | Note=Pro-rich
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000500777 | - | 18 | 23 | 800_1091 | 836.3333333333334 | 1068.0 | Compositional bias | Note=Pro-rich
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000508502 | - | 21 | 27 | 800_1091 | 875.3333333333334 | 1206.0 | Compositional bias | Note=Pro-rich
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000509142 | - | 21 | 26 | 800_1091 | 875.3333333333334 | 1107.0 | Compositional bias | Note=Pro-rich
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000513858 | - | 19 | 24 | 800_1091 | 836.3333333333334 | 1068.0 | Compositional bias | Note=Pro-rich
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000326950 | - | 19 | 25 | 842_848 | 836.3333333333334 | 1182.0 | Motif | Note=ALG-2-binding site motif-2 (ABS-2)
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000348405 | - | 19 | 25 | 842_848 | 836.3333333333334 | 1182.0 | Motif | Note=ALG-2-binding site motif-2 (ABS-2)
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000500777 | - | 18 | 23 | 842_848 | 836.3333333333334 | 1068.0 | Motif | Note=ALG-2-binding site motif-2 (ABS-2)
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000513858 | - | 19 | 24 | 842_848 | 836.3333333333334 | 1068.0 | Motif | Note=ALG-2-binding site motif-2 (ABS-2)
Tgene | ALK | chr4:83765539 | chr2:29446394 | ENST00000389048 |  | 18 | 29 | 816_940 | 1057.3333333333333 | 1621.0 | Compositional bias | Note=Gly-rich
Tgene | ALK | chr4:83765539 | chr2:29446394 | ENST00000389048 |  | 18 | 29 | 264_427 | 1057.3333333333333 | 1621.0 | Domain | MAM 1
Tgene | ALK | chr4:83765539 | chr2:29446394 | ENST00000389048 |  | 18 | 29 | 437_473 | 1057.3333333333333 | 1621.0 | Domain | Note=LDL-receptor class A
Tgene | ALK | chr4:83765539 | chr2:29446394 | ENST00000389048 |  | 18 | 29 | 478_636 | 1057.3333333333333 | 1621.0 | Domain | MAM 2
Tgene | ALK | chr4:83765539 | chr2:29446394 | ENST00000389048 |  | 18 | 29 | 19_1038 | 1057.3333333333333 | 1621.0 | Topological domain | Extracellular
Tgene | ALK | chr4:83765539 | chr2:29446394 | ENST00000389048 |  | 18 | 29 | 1039_1059 | 1057.3333333333333 | 1621.0 | Transmembrane | Helical
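
The retained and not-retained rows above are consistent with a simple positional rule: an Hgene feature is kept when it ends at or before BPloci, and a Tgene feature is kept when it starts at or after BPloci. The sketch below encodes that rule. It is inferred from the rows on this page, not taken from a published FusionPDB definition, and the function name `is_retained` is hypothetical.

```python
def is_retained(partner: str, feature_loci: str, bp_loci: float) -> bool:
    """Guess whether a UniProt feature survives in the fusion protein.

    partner      -- "Hgene" (5' partner) or "Tgene" (3' partner)
    feature_loci -- "start_end" in protein coordinates, e.g. "842_848"
    bp_loci      -- breakpoint position in protein coordinates (BPloci)

    Rule inferred from the tables above (an assumption):
      Hgene keeps features lying entirely before the breakpoint,
      Tgene keeps features lying entirely after it.
    Features that span the breakpoint count as not retained.
    """
    start_text, end_text = feature_loci.split("_")
    start, end = int(start_text), int(end_text)
    if partner == "Hgene":
        return end <= bp_loci
    if partner == "Tgene":
        return start >= bp_loci
    raise ValueError(f"unknown partner: {partner!r}")


if __name__ == "__main__":
    # ABS-2 motif (842_848): kept for ENST00000311785, lost for ENST00000326950.
    print(is_retained("Hgene", "842_848", 875.3333333333334))  # True
    print(is_retained("Hgene", "842_848", 836.3333333333334))  # False
    # ALK kinase domain lies after the breakpoint; the transmembrane helix spans it.
    print(is_retained("Tgene", "1116_1392", 1057.3333333333333))  # True
    print(is_retained("Tgene", "1039_1059", 1057.3333333333333))  # False
```

Under this reading, the Pro-rich region (800_1091) is reported as not retained for every SEC31A transcript because it spans the breakpoint.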



Fusion Protein Structures

PDB and CIF files of the predicted fusion proteins
* Here we show the 3D structures of the fusion proteins using Mol*. AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100, and model confidence is displayed per residue from these pLDDT values. pLDDT is the model's prediction of its own score on the local Distance Difference Test (lDDT) and is a measure of local accuracy (from the AlphaFold website). To color-code individual residues, we converted the individual PDB files into CIF format.
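
AlphaFold writes pLDDT into the B-factor column of the PDB files it outputs, so the per-residue scores described above can be read back with a few lines of code. The sketch below assumes the downloadable files (for example 682.pdb) follow that convention and contain a single chain. The function names are hypothetical, and the confidence bands are the ones used by the AlphaFold Protein Structure Database, which may differ from the coloring applied on this page.

```python
from typing import Dict


def plddt_per_residue(pdb_text: str) -> Dict[int, float]:
    """Return {residue number: pLDDT} read from the C-alpha ATOM records.

    Uses the fixed PDB column layout: atom name in columns 13-16,
    residue number in columns 23-26, B-factor in columns 61-66.
    Assumes a single chain, so residue numbers are unique.
    """
    scores: Dict[int, float] = {}
    for line in pdb_text.splitlines():
        if line.startswith("ATOM") and line[12:16].strip() == "CA":
            scores[int(line[22:26])] = float(line[60:66])
    return scores


def confidence_band(plddt: float) -> str:
    """Map a pLDDT value to the AlphaFold Database confidence bands."""
    if plddt > 90:
        return "very high"
    if plddt > 70:
        return "confident"
    if plddt > 50:
        return "low"
    return "very low"


def mean_plddt(scores: Dict[int, float]) -> float:
    """Average pLDDT over all residues; raises ValueError on empty input."""
    if not scores:
        raise ValueError("no C-alpha records found")
    return sum(scores.values()) / len(scores)
```

A typical use would be `scores = plddt_per_residue(open("682.pdb").read())`, followed by comparing the values on either side of the reported breakpoint residue (834 for model 682).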
Fusion protein PDB link (fusion AA seq ID in FusionPDB) | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | AA seq | Len(AA seq)
PDB file (682) >>> 682.pdb
Fusion protein BP residue: 834
CIF file (682) >>> 682.cif
SEC31A | chr4 | 83769956 | - | ALK | chr2 | 29446394 | -
MKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASLEIFELDL
SDPSLDMKSCATFSSSHRYHKLIWGPYKMDSKGDVSGVLIAGGENGNIIL
YDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGANESEIYIW
DLNNFATPMTPGAKTQPPEDISCIAWNRQVQHILASASPSGRATVWDLRK
NEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQMWDLRFAS
SPLRVLENHARGILAIAWSMADPELLLSCGKDAKILCSNPNTGEVLYELP
TNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQKQVDKLS
SSFGNLDPFGTGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSF
GGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRSDQLQQAVQ
SQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKEDL
GKKIALALNKVDGANVALKDSDQVAQSDGEESPAAEEQLLGEHIKEEKEE
SEFLPSSGGTFNISVSGDIDGLITQALLTGNFESAVDLCLHDNRMADAII
LAIAGGQELLARTQKKYFAKSQSKITRLITAVVMKNWKEIVESCDLKNWR
EALAAVLTYAKPDEFSALCDLLGTRLENEGDSLLQTQACLCYICAGNVEK
LVACWTKAQDGSHPLSLQDLIEKVVILRKAVQLTQAMDTSTVGVLLAAKM
SQYANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESP
KIPYEKQQLPKGRPGPVAGHHQMPRVQTQQYYPHCTAGSTRSCKPCRWSC
Len(AA seq): 856
3D view using mol* of 682 (AA BP:834)
PDB file (759) >>> 759.pdb
Fusion protein BP residue: 988
CIF file (759) >>> 759.cif
SEC31A | chr4 | 83763292 | - | ALK | chr2 | 29445473 | -
MKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASLEIFELDL
SDPSLDMKSCATFSSSHRYHKLIWGPYKMDSKGDVSGVLIAGGENGNIIL
YDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGANESEIYIW
DLNNFATPMTPGAKTQPPEDISCIAWNRQVQHILASASPSGRATVWDLRK
NEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQMWDLRFAS
SPLRVLENHARGILAIAWSMADPELLLSCGKDAKILCSNPNTGEVLYELP
TNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQKQVDKLS
SSFGNLDPFGTGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSF
GGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRSDQLQQAVQ
SQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKEDL
GKKIALALNKVDGANVALKDSDQVAQSDGEESPAAEEQLLGEHIKEEKEE
SEFLPSSGGTFNISVSGDIDGLITQALLTGNFESAVDLCLHDNRMADAII
LAIAGGQELLARTQKKYFAKSQSKITRLITAVVMKNWKEIVESCDLKNWR
EALAAVLTYAKPDEFSALCDLLGTRLENEGDSLLQTQACLCYICAGNVEK
LVACWTKAQDGSHPLSLQDLIEKVVILRKAVQLTQAMDTSTVGVLLAAKM
SQYANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESP
KIPYEKQQLPKGRPGPVAGHHQMPRVQTQQYYPHGENPPPPGFIMHGNVN
PNAAGQLPTSPGHMHTQVPPYPQPQPYQPAQPYPFGTGGSAMYRPQQPVA
PPTSNAYPNTPYISSASSYTGQSQLYAAQHQASSPTSSPATSFPPPPSSG
ASFQHGGPGAPPSSSAYALPPGTTGTLPAASELPASQRTGVWAMAPLGRC
Len(AA seq): 1018
3D view using mol* of 759 (AA BP:988)
PDB file (947) >>> 947.pdb
Fusion protein BP residue: 957
CIF file (947) >>> 947.cif
SEC31A | chr4 | 83763292 | - | ALK | chr2 | 29446394 | -
MPKVLARMKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASL
EIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSKGDVSGVLIAGG
ENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGAN
ESEIYIWDLNNFATPMTPGAKTQPPEDISCIAWNRQVQHILASASPSGRA
TVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQM
WDLRFASSPLRVLENHARGILAIAWSMADPELLLSCGKDAKILCSNPNTG
EVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQ
KQVDKLSSSFGNLDPFGTGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRP
VGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRSD
QLQQAVQSQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELL
GYRKEDLGKKHIKEEKEESEFLPSSGGTFNISVSGDIDGLITQALLTGNF
ESAVDLCLHDNRMADAIILAIAGGQELLARTQKKYFAKSQSKITRLITAV
VMKNWKEIVESCDLKNWREALAAVLTYAKPDEFSALCDLLGTRLENEGDS
LLQTQACLCYICAGNVEKLVACWTKAQDGSHPLSLQDLIEKVVILRKAVQ
LTQAMDTSTVGVLLAAKMSQYANLLAAQGSIAAALAFLPDNTNQPNIMQL
RDRLCRAQGEPVAGHESPKIPYEKQQLPKGRPGPVAGHHQMPRVQTQQYY
PHGENPPPPGFIMHGNVNPNAAGQLPTSPGHMHTQVPPYPQPQPYQPAQP
YPFGTGGSAMYRPQQPVAPPTSNAYPNTPYISSASSYTGQSQLYAAQHQA
SSPTSSPATSFPPPPSSGASFQHGGPGAPPSSSAYALPPGTTGTLPAASE
LPASQRTVYRRKHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFAG
KTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVK
TLPEVCSEQDELDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMA
GGDLKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDI
AARNCLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEA
FMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTSGGRMDP
PKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIE
YGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREEERSPAAPPPLPTTSSG
KAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPT
SLWNPTYGSWFTEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGRL
PGASLLLEPSSLTANMKEVPLFRLRHFPCGNVNYGYQQQGLPLEAATAPG
1520
3D view using mol* of 947 (AA BP:957)
PDB file (948)
CIF file (948) >>> 948.cif
SEC31A chr4:83763292 (-) - ALK chr2:29446394 (-)
MPKVLASRMKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNAS
LEIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSKGDVSGVLIAG
GENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGA
NESEIYIWDLNNFATPMTPGAKTQPPEDISCIAWNRQVQHILASASPSGR
ATVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQ
MWDLRFASSPLRVLENHARGILAIAWSMADPELLLSCGKDAKILCSNPNT
GEVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLR
QKQVDKLSSSFGNLDPFGTGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRR
PVGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRS
DQLQQAVQSQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLEL
LGYRKEDLGKKHIKEEKEESEFLPSSGGTFNISVSGDIDGLITQALLTGN
FESAVDLCLHDNRMADAIILAIAGGQELLARTQKKYFAKSQSKITRLITA
VVMKNWKEIVESCDLKNWREALAAVLTYAKPDEFSALCDLLGTRLENEGD
SLLQTQACLCYICAGNVEKLVACWTKAQDGSHPLSLQDLIEKVVILRKAV
QLTQAMDTSTVGVLLAAKMSQYANLLAAQGSIAAALAFLPDNTNQPNIMQ
LRDRLCRAQGEPVAGHESPKIPYEKQQLPKGRPGPVAGHHQMPRVQTQQY
YPHGENPPPPGFIMHGNVNPNAAGQLPTSPGHMHTQVPPYPQPQPYQPAQ
PYPFGTGGSAMYRPQQPVAPPTSNAYPNTPYISSASSYTGQSQLYAAQHQ
ASSPTSSPATSFPPPPSSGASFQHGGPGAPPSSSAYALPPGTTGTLPAAS
ELPASQRTVYRRKHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFA
GKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAV
KTLPEVCSEQDELDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELM
AGGDLKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRD
IAARNCLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPE
AFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTSGGRMD
PPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPI
EYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREEERSPAAPPPLPTTSS
GKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKP
TSLWNPTYGSWFTEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGR
LPGASLLLEPSSLTANMKEVPLFRLRHFPCGNVNYGYQQQGLPLEAATAP
1521
3D view using mol* of 948 (AA BP:)
PDB file (962)
CIF file (962) >>> 962.cif
SEC31A chr4:83763292 (-) - ALK chr2:29446394 (-)
MPKVLARMKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASL
EIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSKGDVSGVLIAGG
ENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGAN
ESEIYIWDLNNFATPMTPGAKTQPPEDISCIAWNRQVQHILASASPSGRA
TVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQM
WDLRFASSPLRVLENHARGILAIAWSMADPELLLSCGKDAKILCSNPNTG
EVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQ
KQVDKLSSSFGNLDPFGTGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRP
VGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRSD
QLQQAVQSQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELL
GYRKEDLGKKIALALNKVDGANVALKDSDQVAQSDGEESPAAEEQLLGEH
IKEEKEESEFLPSSGGTFNISVSGDIDGLITQALLTGNFESAVDLCLHDN
RMADAIILAIAGGQELLARTQKKYFAKSQSKITRLITAVVMKNWKEIVES
CDLKNWREALAAVLTYAKPDEFSALCDLLGTRLENEGDSLLQTQACLCYI
CAGNVEKLVACWTKAQDGSHPLSLQDLIEKVVILRKAVQLTQAMDTSTVG
VLLAAKMSQYANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEP
VAGHESPKIPYEKQQLPKGRPGPVAGHHQMPRVQTQQYYPHGENPPPPGF
IMHGNVNPNAAGQLPTSPGHMHTQVPPYPQPQPYQPAQPYPFGTGGSAMY
RPQQPVAPPTSNAYPNTPYISSASSYTGQSQLYAAQHQASSPTSSPATSF
PPPPSSGASFQHGGPGAPPSSSAYALPPGTTGTLPAASELPASQRTVYRR
KHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEV
PRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDE
LDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRET
RPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARNCLLTCPG
PGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTD
TWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTSGGRMDPPKNCPGPVYRI
MTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEKV
PVRPKDPEGVPPLLVSQQAKREEERSPAAPPPLPTTSSGKAAKKPTAAEI
SVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWF
TEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSS
LTANMKEVPLFRLRHFPCGNVNYGYQQQGLPLEAATAPGAGHYEDTILKS
1559
3D view using mol* of 962 (AA BP:)
PDB file (963) >>> 963.pdb
CIF file (963) >>> 963.cif
Fusion protein BP residue: 997
SEC31A chr4:83763292 (-) - ALK chr2:29446394 (-)
MPKVLASRMKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNAS
LEIFELDLSDPSLDMKSCATFSSSHRYHKLIWGPYKMDSKGDVSGVLIAG
GENGNIILYDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGA
NESEIYIWDLNNFATPMTPGAKTQPPEDISCIAWNRQVQHILASASPSGR
ATVWDLRKNEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQ
MWDLRFASSPLRVLENHARGILAIAWSMADPELLLSCGKDAKILCSNPNT
GEVLYELPTNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLR
QKQVDKLSSSFGNLDPFGTGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRR
PVGASFSFGGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRS
DQLQQAVQSQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLEL
LGYRKEDLGKKIALALNKVDGANVALKDSDQVAQSDGEESPAAEEQLLGE
HIKEEKEESEFLPSSGGTFNISVSGDIDGLITQALLTGNFESAVDLCLHD
NRMADAIILAIAGGQELLARTQKKYFAKSQSKITRLITAVVMKNWKEIVE
SCDLKNWREALAAVLTYAKPDEFSALCDLLGTRLENEGDSLLQTQACLCY
ICAGNVEKLVACWTKAQDGSHPLSLQDLIEKVVILRKAVQLTQAMDTSTV
GVLLAAKMSQYANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGE
PVAGHESPKIPYEKQQLPKGRPGPVAGHHQMPRVQTQQYYPHGENPPPPG
FIMHGNVNPNAAGQLPTSPGHMHTQVPPYPQPQPYQPAQPYPFGTGGSAM
YRPQQPVAPPTSNAYPNTPYISSASSYTGQSQLYAAQHQASSPTSSPATS
FPPPPSSGASFQHGGPGAPPSSSAYALPPGTTGTLPAASELPASQRTVYR
RKHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKE
VPRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQD
ELDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRE
TRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARNCLLTCP
GPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKT
DTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTSGGRMDPPKNCPGPVYR
IMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEK
VPVRPKDPEGVPPLLVSQQAKREEERSPAAPPPLPTTSSGKAAKKPTAAE
ISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSW
FTEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPS
SLTANMKEVPLFRLRHFPCGNVNYGYQQQGLPLEAATAPGAGHYEDTILK
1560
3D view using mol* of 963 (AA BP:997)
PDB file (964)
CIF file (964) >>> 964.cif
SEC31A chr4:83763292 (-) - ALK chr2:29446394 (-)
MKLKEVDRTAMQAWSPAQNHPIYLATGTSAQQLDATFSTNASLEIFELDL
SDPSLDMKSCATFSSSHRYHKLIWGPYKMDSKGDVSGVLIAGGENGNIIL
YDPSKIIAGDKEVVIAQNDKHTGPVRALDVNIFQTNLVASGANESEIYIW
DLNNFATPMTPGAKTQPPEDISCIAWNRQVQHILASASPSGRATVWDLRK
NEPIIKVSDHSNRMHCSGLAWHPDVATQMVLASEDDRLPVIQMWDLRFAS
SPLRVLENHARGILAIAWSMADPELLLSCGKDAKILCSNPNTGEVLYELP
TNTQWCFDIQWCPRNPAVLSAASFDGRISVYSIMGGSTDGLRQKQVDKLS
SSFGNLDPFGTGQPLPPLQIPQQTAQHSIVLPLKKPPKWIRRPVGASFSF
GGKLVTFENVRMPSHQGAEQQQQQHHVFISQVVTEKEFLSRSDQLQQAVQ
SQGFINYCQKKIDASQTEFEKNVWSFLKVNFEDDSRGKYLELLGYRKEDL
GKKIALALNKVDGANVALKDSDQVAQSDGEESPAAEEQLLGEHIKEEKEE
SEFLPSSGGTFNISVSGDIDGLITQALLTGNFESAVDLCLHDNRMADAII
LAIAGGQELLARTQKKYFAKSQSKITRLITAVVMKNWKEIVESCDLKNWR
EALAAVLTYAKPDEFSALCDLLGTRLENEGDSLLQTQACLCYICAGNVEK
LVACWTKAQDGSHPLSLQDLIEKVVILRKAVQLTQAMDTSTVGVLLAAKM
SQYANLLAAQGSIAAALAFLPDNTNQPNIMQLRDRLCRAQGEPVAGHESP
KIPYEKQQLPKGRPGPVAGHHQMPRVQTQQYYPHVRIAPTVTTWSNKTPT
ALPSHPPAASPSDTQGENPPPPGFIMHGNVNPNAAGQLPTSPGHMHTQVP
PYPQPQPYQPAQPYPFGTGGSAMYRPQQPVAPPTSNAYPNTPYISSASSY
TGQSQLYAAQHQASSPTSSPATSFPPPPSSGASFQHGGPGAPPSSSAYAL
PPGTTGTLPAASELPASQRTVYRRKHQELQAMQMELQSPEYKLSKLRTST
IMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSG
MPNDPSPLQVAVKTLPEVCSEQDELDFLMEALIISKFNHQNIVRCIGVSL
QSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVARDIACGC
QYLEENHFIHRDIAARNCLLTCPGPGRVAKIGDFGMARDIYRASYYRKGG
CAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQE
VLEFVTSGGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCT
QDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREEERS
PAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPS
ELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAKKEPHDRGNLGLEG
SCTVPPNVATGRLPGASLLLEPSSLTANMKEVPLFRLRHFPCGNVNYGYQ
1583
3D view using mol* of 964 (AA BP:)



pLDDT score distribution

pLDDT score distribution of the predicted wild-type structures of two partner proteins from AlphaFold2
* AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100.
SEC31A_pLDDT.png
ALK_pLDDT.png

pLDDT score distribution of the predicted fusion protein structures from AlphaFold2
* AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100.
SEC31A_ALK_682_pLDDT.png (AA BP:834)
SEC31A_ALK_682_pLDDT_and_active_sites.png (AA BP:834)
SEC31A_ALK_682_violinplot.png (AA BP:834)
SEC31A_ALK_759_pLDDT.png (AA BP:988)
SEC31A_ALK_759_pLDDT_and_active_sites.png (AA BP:988)
SEC31A_ALK_759_violinplot.png (AA BP:988)
SEC31A_ALK_947_pLDDT.png (AA BP:957)
SEC31A_ALK_947_pLDDT_and_active_sites.png (AA BP:957)
SEC31A_ALK_947_violinplot.png (AA BP:957)
SEC31A_ALK_948_pLDDT_and_active_sites.png (AA BP:)
SEC31A_ALK_948_violinplot.png (AA BP:)
SEC31A_ALK_962_pLDDT_and_active_sites.png (AA BP:)
SEC31A_ALK_962_violinplot.png (AA BP:)
SEC31A_ALK_963_pLDDT.png (AA BP:997)
SEC31A_ALK_963_pLDDT_and_active_sites.png (AA BP:997)
SEC31A_ALK_963_violinplot.png (AA BP:997)
SEC31A_ALK_964_pLDDT_and_active_sites.png (AA BP:)
SEC31A_ALK_964_violinplot.png (AA BP:)

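AlphaFold writes the per-residue pLDDT into the B-factor column of its PDB output, so the distributions shown in the plots above can in principle be recomputed from a downloaded structure file. The sketch below is a minimal illustration that assumes the PDB files offered on this page follow that convention. The function names are hypothetical, and the four confidence bands are the ones commonly used by the AlphaFold database, not something stated on this page.

```python
from collections import Counter

def plddt_per_residue(pdb_text):
    """Return {residue_number: pLDDT} read from C-alpha ATOM records.

    Assumes an AlphaFold-style PDB file in which the B-factor field
    (columns 61-66) holds the pLDDT of the residue.
    """
    scores = {}
    for line in pdb_text.splitlines():
        # Fixed PDB columns: atom name 13-16, residue number 23-26
        if line.startswith("ATOM") and line[12:16].strip() == "CA":
            scores[int(line[22:26])] = float(line[60:66])
    return scores

def confidence_band(score):
    """Map a pLDDT value (0-100) to a commonly used confidence band."""
    if score >= 90:
        return "very high"
    if score >= 70:
        return "confident"
    if score >= 50:
        return "low"
    return "very low"

def band_counts(scores):
    """Count residues per confidence band."""
    return Counter(confidence_band(s) for s in scores.values())
```

Reading one of the downloaded files with `open(path).read()` and passing the text to `plddt_per_residue` gives the values behind a per-residue plot, and `band_counts` summarizes them in the way a violin plot would show visually.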


Ramachandran Plot of Fusion Protein Structure


Ramachandran plot of the torsional angles phi (φ) and psi (ψ) of the residues (amino acids) contained in this fusion protein peptide.
Fusion AA seq ID in FusionPDB and their Ramachandran plots
SEC31A_ALK_682.png
SEC31A_ALK_759.png
SEC31A_ALK_947.png
SEC31A_ALK_963.png

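Each point in a Ramachandran plot is a (φ, ψ) pair, and both angles are ordinary dihedral angles over four consecutive backbone atoms: φ over C(i-1), N(i), CA(i), C(i) and ψ over N(i), CA(i), C(i), N(i+1). The following sketch shows the standard dihedral calculation from four 3D points. It is illustrative only; the helper names are hypothetical and the page does not state which tool produced its plots.

```python
import math

def _sub(a, b):
    return tuple(x - y for x, y in zip(a, b))

def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def dihedral(p0, p1, p2, p3):
    """Signed dihedral angle in degrees for four points (x, y, z).

    phi(i) = dihedral(C[i-1], N[i], CA[i], C[i])
    psi(i) = dihedral(N[i], CA[i], C[i], N[i+1])
    """
    b0 = _sub(p1, p0)
    b1 = _sub(p2, p1)
    b2 = _sub(p3, p2)
    n1 = _cross(b0, b1)   # normal of the plane through p0, p1, p2
    n2 = _cross(b1, b2)   # normal of the plane through p1, p2, p3
    b1_len = math.sqrt(_dot(b1, b1))
    y = b1_len * _dot(b0, n2)
    x = _dot(n1, n2)
    return math.degrees(math.atan2(y, x))
```

A cis arrangement of the four points gives 0 degrees and a trans arrangement gives ±180 degrees, which are the two reference cases worth checking before applying the function to real backbone coordinates.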

Potential Active Site Information


The potential binding sites of these fusion proteins were identified using SiteMap, a module of the Schrödinger suite.
Fusion AA seq ID in FusionPDB | Site score | Size | D score | Volume | Exposure | Enclosure | Contact | Phobic | Philic | Balance | Don/Acc | Residues
682 | 1.092 | 326 | 1.02 | 937.076 | 0.377 | 0.835 | 1.079 | 0.337 | 1.292 | 0.261 | 1.011 | Chain A: 11,13,14,16,17,18,29,30,31,32,69,70,71,73,74,75,76,126,127,128,129,130,132,135,172,173,174,175,176,178,189,215,216,217,218,219,220,221,234,264,265,266,267,268,269,270,271,307,308,309,310,311,312,313,316,324
759 | 1.076 | 327 | 1.024 | 976.178 | 0.414 | 0.812 | 1.028 | 0.391 | 1.237 | 0.316 | 0.954 | Chain A: 11,12,13,14,16,17,18,31,32,70,71,72,73,74,75,76,126,127,128,129,130,131,132,135,172,173,174,175,176,178,217,218,219,220,221,234,264,265,266,267,268,269,270,271,307,308,309,310,311,312,313,316,324
947 | 1.063 | 335 | 1.06 | 1046.493 | 0.458 | 0.791 | 0.983 | 0.461 | 1.087 | 0.424 | 0.926 | Chain A: 18,20,21,23,24,25,36,38,39,77,78,80,81,82,83,133,134,135,136,137,139,142,179,180,181,182,185,196,222,223,224,225,226,227,228,241,271,272,273,274,275,276,277,278,314,315,316,317,318,319,320,323,331
948 | 1.064 | 323 | 1.035 | 1011.85 | 0.451 | 0.794 | 0.974 | 0.406 | 1.171 | 0.347 | 0.877 | Chain A: 19,20,21,22,23,24,25,26,39,40,78,79,80,81,82,83,84,134,135,136,137,138,140,180,181,182,183,186,197,223,224,225,226,227,228,229,242,272,273,274,275,276,277,278,279,315,316,317,318,319,320,321,324
962 | 1.075 | 325 | 1.027 | 992.299 | 0.418 | 0.811 | 1.028 | 0.317 | 1.225 | 0.259 | 0.901 | Chain A: 18,20,21,23,24,25,36,38,39,77,78,80,81,82,83,133,134,135,136,137,139,179,180,181,182,183,185,196,222,223,224,225,226,227,228,241,271,272,273,274,275,276,277,278,314,315,316,317,318,319,320,323
963 | 1.059 | 318 | 1.029 | 1027.285 | 0.459 | 0.787 | 0.976 | 0.376 | 1.175 | 0.32 | 0.922 | Chain A: 19,21,22,24,25,26,39,40,78,79,81,82,83,84,134,135,136,137,138,140,178,179,180,181,182,183,186,197,223,224,225,226,227,228,229,242,272,273,274,275,276,277,278,279,315,316,317,318,319,320,321,324
964 | 1.062 | 333 | 1.019 | 1000.188 | 0.439 | 0.791 | 0.999 | 0.333 | 1.213 | 0.275 | 1.016 | Chain A: 11,13,14,15,16,17,18,31,32,70,71,73,74,75,76,126,127,128,129,130,132,135,172,173,174,175,176,178,189,215,216,217,218,219,220,221,234,264,265,266,267,268,269,270,271,307,308,309,310,311,312,313,316

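The residue numbers in the SiteMap table are specific to each fusion sequence, so two models can describe the same pocket with shifted numbering. Sequence 947 as printed above starts with `MPKVLAR` followed by the N-terminus of sequence 759, which suggests a constant offset of 7. The sketch below, using only the first ten residues listed for each site, compares the two sites after applying that offset. The function names are hypothetical, and the offset logic assumes the two sequences are otherwise colinear over this region.

```python
def parse_residues(field):
    """Parse a 'Chain A: 11,12,13' field into a set of residue numbers."""
    _, _, numbers = field.partition(":")
    return {int(tok) for tok in numbers.split(",") if tok.strip()}

def jaccard(a, b):
    """Jaccard similarity of two sets (1.0 when both are empty)."""
    return len(a & b) / len(a | b) if (a or b) else 1.0

# N-terminal fragments of sequences 759 and 947 as printed on this page
seq_759_start = "MKLKEVDRTAMQAWSP"
seq_947_start = "MPKVLARMKLKEVDRTAMQAWSP"
offset = seq_947_start.find(seq_759_start)

# First ten site residues listed for each model (truncated for brevity)
site_759 = parse_residues("Chain A: 11,12,13,14,16,17,18,31,32,70")
site_947 = parse_residues("Chain A: 18,20,21,23,24,25,36,38,39,77")

shifted_759 = {r + offset for r in site_759}
similarity = round(jaccard(shifted_759, site_947), 3)
print(offset, similarity)  # prints: 7 0.818
```

Without the shift the two truncated lists share only residue 18, so renumbering matters before concluding that two models predict different pockets.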

Potentially Interacting Small Molecules through Virtual Screening


Molecules from the FDA-approved small molecule library were subjected to virtual screening using Glide.
Fusion AA seq ID in FusionPDB | ZINC ID | DrugBank ID | Drug name | Docking score | Glide gscore
682 | ZINC000003938482 | DB01263 | Posaconazole | -8.21983 | -10.3084
682 | ZINC000004097344 | DB01167 | Itraconazole | -8.2193 | -10.1908
682 | ZINC000028639340 | DB01263 | Posaconazole | -7.58262 | -9.67122
682 | ZINC000150588351 | DB11574 | Elbasvir | -7.83737 | -9.08097
682 | ZINC000150338819 | DB09027 | Ledipasvir | -8.2069 | -9.0305
682 | ZINC000003810860 | DB00973 | Ezetimibe | -8.96354 | -8.96414
682 | ZINC000003914596 | DB01232 | Saquinavir | -6.63604 | -8.89604
682 | ZINC000004474682 | DB00287 | Travoprost | -8.67898 | -8.67898
682 | ZINC000003830943 | DB01362 | Iohexol | -8.62698 | -8.62698
682 | ZINC000029571072 | DB06636 | Isavuconazonium | -8.18973 | -8.62173
682 | ZINC000148723177 | DB12267 | Brigatinib | -6.30944 | -8.55694
682 | ZINC000068204830 | DB09102 | Daclatasvir | -7.36878 | -8.51968
682 | ZINC000002005305 | DB11256 | Levomefolic acid | -8.47773 | -8.49393
682 | ZINC000008143864 | DB09134 | Ioversol | -8.48797 | -8.48797
682 | ZINC000096006013 | DB00565 | Cisatracurium | -8.46174 | -8.46174
682 | Gilteritinib | DB00565 | | -8.11772 | -8.14412
682 | ZINC000060183170 | DB01421 | Paromomycin | -6.68216 | -8.12986
682 | ZINC000003973334 | DB00990 | Exemestane | -8.07352 | -8.07352
682 | ZINC000100014909 | DB09079 | Nintedanib | -7.19621 | -8.05071
682 | ZINC000026985532 | DB01232 | Saquinavir | -8.03523 | -8.04843

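Some drugs appear more than once in the screening table because several ZINC entries map to the same compound, and the two score columns do not always agree on the order. The sketch below takes five rows from the table, keeps the most negative (most favourable) score per drug, and ranks by either column. The function name and the tuple layout are illustrative assumptions, not part of the FusionPDB pipeline.

```python
# (drug name, docking score, Glide gscore) taken from the table above
hits = [
    ("Posaconazole", -8.21983, -10.3084),
    ("Itraconazole", -8.2193, -10.1908),
    ("Posaconazole", -7.58262, -9.67122),
    ("Ezetimibe", -8.96354, -8.96414),
    ("Brigatinib", -6.30944, -8.55694),
]

def best_per_drug(rows, column):
    """Keep the lowest score per drug and return (name, score) pairs, best first.

    column 1 selects the docking score, column 2 the Glide gscore.
    """
    best = {}
    for row in rows:
        name, score = row[0], row[column]
        if name not in best or score < best[name]:
            best[name] = score
    return sorted(best.items(), key=lambda item: item[1])

by_gscore = best_per_drug(hits, 2)
by_docking = best_per_drug(hits, 1)
print([name for name, _ in by_gscore])
print([name for name, _ in by_docking])
```

On these five rows the gscore ranking starts with Posaconazole while the docking-score ranking starts with Ezetimibe, so it is worth stating which column a "top hit" refers to.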

Drug information from DrugBank of the top 20 interacting small molecules.
ZINC ID | DrugBank ID | Drug name | Drug type | SMILES | Drug group
ZINC000003938482 | DB01263 | Posaconazole | Small molecule | [H][C@@](C)(O)[C@]([H])(CC)N1N=CN(C1=O)C1=CC=C(C=C1)N1CCN(CC1)C1=CC=C(OC[C@]2([H])CO[C@](CN3C=NC=N3)(C2)C2=C(F)C=C(F)C=C2)C=C1 | Approved, Investigational, Vet_approved
ZINC000004097344 | DB01167 | Itraconazole | Small molecule | CCC(C)N1N=CN(C1=O)C1=CC=C(C=C1)N1CCN(CC1)C1=CC=C(OC[C@H]2CO[C@@](CN3C=NC=N3)(O2)C2=CC=C(Cl)C=C2Cl)C=C1 | Approved, Investigational
ZINC000150588351 | DB11574 | Elbasvir | Small molecule | [H][C@]1(CCCN1C(=O)[C@@H](NC(=O)OC)C(C)C)C1=NC=C(N1)C1=CC2=C(C=C1)N1[C@@H](OC3=C(C=CC(=C3)C3=CN=C(N3)[C@]3([H])CCCN3C(=O)[C@@H](NC(=O)OC)C(C)C)C1=C2)C1=CC=CC=C1 | Approved
ZINC000150338819 | DB09027 | Ledipasvir | Small molecule | COC(=O)N[C@@H](C(C)C)C(=O)N1CC2(CC2)C[C@H]1C1=NC(=CN1)C1=CC=C2C3=CC=C(C=C3C(F)(F)C2=C1)C1=CC=C2NC(=NC2=C1)[C@@H]1[C@H]2CC[C@H](C2)N1C(=O)[C@@H](NC(=O)OC)C(C)C | Approved
ZINC000003810860 | DB00973 | Ezetimibe | Small molecule | [H][C@]1(CC[C@H](O)C2=CC=C(F)C=C2)C(=O)N(C2=CC=C(F)C=C2)[C@]1([H])C1=CC=C(O)C=C1 | Approved
ZINC000003914596 | DB01232 | Saquinavir | Small molecule | [H][C@@]12CCCC[C@]1([H])CN(C[C@@H](O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(N)=O)NC(=O)C1=NC3=C(C=CC=C3)C=C1)[C@@H](C2)C(=O)NC(C)(C)C | Approved, Investigational
ZINC000004474682 | DB00287 | Travoprost | Small molecule | CC(C)OC(=O)CCCC=C/C[C@H]1[C@@H](O)C[C@@H](O)[C@@H]1C=C[C@@H](O)COC1=CC=CC(=C1)C(F)(F)F | Approved
ZINC000148723177 | DB12267 | Brigatinib | Small molecule | COC1=CC(=CC=C1NC1=NC=C(Cl)C(NC2=CC=CC=C2P(C)(C)=O)=N1)N1CCC(CC1)N1CCN(C)CC1 | Approved, Investigational
ZINC000068204830 | DB09102 | Daclatasvir | Small molecule | COC(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@H]1C1=NC=C(N1)C1=CC=C(C=C1)C1=CC=C(C=C1)C1=CN=C(N1)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)OC)C(C)C | Approved, Investigational
ZINC000002005305 | DB11256 | Levomefolic acid | Small molecule | CN1[C@@H](CNC2=CC=C(C=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O)CNC2=C1C(=O)N=C(N)N2 | Approved, Investigational
ZINC000008143864 | DB09134 | Ioversol | Small molecule | OCCN(C(=O)CO)C1=C(I)C(C(=O)NCC(O)CO)=C(I)C(C(=O)NCC(O)CO)=C1I | Approved
ZINC000096006013 | DB00565 | Cisatracurium | Small molecule | COC1=CC2=C(C=C1OC)[C@@H](CC1=CC(OC)=C(OC)C=C1)[N@@+](C)(CCC(=O)OCCCCCOC(=O)CC[N@@+]1(C)CCC3=C(C=C(OC)C(OC)=C3)[C@H]1CC1=CC(OC)=C(OC)C=C1)CC2 | Approved
ZINC000060183170 | DB01421 | Paromomycin | Small molecule | NC[C@@H]1O[C@H](O[C@@H]2[C@@H](CO)O[C@@H](O[C@@H]3[C@@H](O)[C@H](N)C[C@H](N)[C@H]3O[C@H]3O[C@H](CO)[C@@H](O)[C@H](O)[C@H]3N)[C@@H]2O)[C@H](N)[C@@H](O)[C@@H]1O | Approved, Investigational
ZINC000003973334 | DB00990 | Exemestane | Small molecule | [H][C@@]12CCC(=O)[C@@]1(C)CC[C@@]1([H])[C@@]2([H])CC(=C)C2=CC(=O)C=C[C@]12C | Approved, Investigational
ZINC000100014909 | DB09079 | Nintedanib | Small molecule | COC(=O)C1=CC=C2C(NC(=O)C2=C(/NC2=CC=C(C=C2)N(C)C(=O)CN2CCN(C)CC2)C2=CC=CC=C2)=C1 | Approved


Biochemical Features of Small Molecules


ADME (Absorption, Distribution, Metabolism, and Excretion) of drugs using QikProp (v3.9)
ZINC ID | mol_MW | dipole | SASA | FOSA | FISA | PISA | WPSA | volume | donorHB | accptHB | IP | Human Oral Absorption | Percent Human Oral Absorption | Rule Of Five | Rule Of Three
ZINC000003938482700.7876.5931133.73453.341130.579474.96374.8482110.183111.28.262182.41731
ZINC000003938482700.7878.5451114.625456.955127.996456.48273.1922101.798111.28.612182.5231
ZINC000004097344705.6427.2011108.356441.22893.993451.204121.9312059.909010.258.234176.93131
ZINC000004097344705.6429.2311077.094456.70894.986424.927100.4732039.22010.258.496177.44131
ZINC000028639340700.7877.4041124.08472.765117.692460.37673.2482110.438111.28.362185.02331
ZINC000028639340700.7879.0091124.607459.294130.195459.08576.0332100.905111.28.651182.08231
ZINC000150588351882.036.5721345.74717.517186.345441.87802635.9722.513.258.001168.97331
ZINC000150588351882.0315.5811298.014670.728196.39430.89602598.6552.513.258.107165.89131
ZINC000150588351882.037.6941389.327750.107173.462465.75802673.8482.513.258.124173.08531
ZINC000150588351882.032.2191373.023721.483195.055456.48502655.1922.513.258.091168.49731
ZINC000150588351882.0313.7611347.364706.26205.501435.60302635.2572.513.258.253164.82431
ZINC000150588351882.036.9221403.807740.212182.531481.06402678.7442.513.258.307171.99131
ZINC000150588351882.0313.991367.982729.766188.685449.53102678.6732.513.257.973170.20832
ZINC000150588351882.0310.6561391.126719.587194.166477.37302673.6242.513.257.937169.24432
ZINC000150588351882.038.2331375.056718.117197.908459.03202655.632.513.257.962167.78732
ZINC000150338819889.0124.3881369.588772.236205.681331.75859.9142637.5922.512.58.669166.60131
ZINC000150338819889.0128.2551369.279775.121203.738327.3763.052645.2712.512.58.582167.42631
ZINC000150338819889.0129.6191351.189790.212180.178319.4461.362612.5082.512.58.678171.81131
ZINC000150338819889.0123.7791408.248811.944183.707340.05872.5392660.5462.512.58.565173.19231
ZINC000150338819889.0129.4481400.543793.783195.881338.35672.5242661.7462.512.58.543170.18231
ZINC000150338819889.0124.8391395.432791.3195.813335.78172.5382661.1332.512.58.437170.20531
ZINC000150338819889.0127.521431.265780.081196.772394.6159.8022709.9662.512.58.583172.17231
ZINC000150338819889.0125.3721296.949757.843185.555289.1464.4112577.2052.512.58.658168.62631
ZINC000150338819889.01210.9691383.164796.537180.774333.32572.5282658.9422.512.58.466174.00331
ZINC000003810860409.4325.656707.47174.233136.697402.90893.6341265.98724.458.806191.96111
ZINC000003914596670.856.4391070.917508.781153.229408.90702096.549513.78.806234.97531
ZINC000003914596670.858.3711120.448500.908193.797425.74302136.94513.78.696225.23432
ZINC000003914596670.857.3031031.722510.522191.885329.31502046.15513.78.906224.80432
ZINC000004474682500.5547.814798.264377.787163.81139.044117.6221521.37537.859.535385.40612
ZINC000003830943821.1432.11797.152329.149341.9798.796117.2271489.14818.29.3561031
ZINC000148723177584.15.166976.551585.47160.71276.12754.2431801.327112.757.731378.04711
ZINC000148723177584.15.799978.721539.73584.801306.93847.2471814.982112.757.869273.55111
ZINC000068204830738.8859.1861185.225698.369189.608297.24802297.4812.512.58.424156.2531
ZINC000068204830738.88512.7281242.653732.724195.462314.46702325.8512.512.58.586156.99331
ZINC000068204830738.8855.6591215.343718.68189.061307.60202308.8542.512.58.456157.66831
ZINC000068204830738.8853.0361238.237724.613189.774323.84902337.4652.512.58.529158.58831
ZINC000068204830738.8856.6121209.345719.031182.477307.83702307.3472.512.58.288159.64831
ZINC000068204830738.8854.8491244.561737.115179.716327.73102332.7592.512.58.571160.79831
ZINC000002005305459.46111.831783.577206.681420.974155.92201372.0347.2512.257.6551021
ZINC000002005305459.46112.273767.829200.116415.916151.79601360.767.2512.257.5781021
ZINC000008143864807.1165.478759.618304.239335.9096.136113.3331427.748818.29.2111031
ZINC000096006013929.15816.0051354.4711103.56690.843160.06202802.8260102.877110032
ZINC000096006013929.15857.0381416.3451150.32299.098166.92402837.4680104.487110032
ZINC000096006013929.15839.5751507.5551192.433106.579208.54202903.0470104.759110032
ZINC000060183170615.6346.469852.379333.487518.892001659.2811828.89.3491032
ZINC000060183170615.6343.96880.798394.277486.521001692.3141828.89.3051032
ZINC000060183170615.6345.103837.174322.647514.527001660.7851828.89.2221032
ZINC000060183170615.6342.705864.842336.076528.766001662.1151828.89.4241032
ZINC000060183170615.6345.505857.834356.773501.061001653.7311828.89.0251032
ZINC000060183170615.6344.108857.637355.734501.903001653.7891828.89.1071032
ZINC000060183170615.6347.059842.213356.118486.095001655.7631828.89.11032
ZINC000060183170615.6347.013831.783339.851491.933001655.6381828.89.1961032
ZINC000060183170615.6349.322830.127345.166484.961001649.2491828.89.4381032
ZINC000060183170615.6346.26774.039316.84457.199001589.5781828.89.1661032
ZINC000060183170615.6347.054839.2337.172502.027001650.0581828.89.3551032
ZINC000003973334296.4085.128536.649327.243107.306102.10980.671049.882310000
ZINC000100014909539.6332.179878.075386.508171.123320.44401669.688111.58.456252.47111
ZINC000100014909539.6336.878916.216406.188186.696323.33201691.689111.58.516249.91611
ZINC000100014909539.6335.607823.609357.513187.71278.38601605.261111.58.279246.45911
ZINC000026985532670.852.9921090.203522.841198.112369.2502103.734513.78.881223.06232
ZINC000026985532670.858.5931062.24515.596176.807369.83702085.965513.79.245225.69832
ZINC000026985532670.852.7191077.343520.688191.958364.69702094.541513.78.97622532

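The "Rule Of Five" column in the ADME table counts how many of Lipinski's criteria a molecule violates. The sketch below counts violations using the classic thresholds (molecular weight above 500, logP above 5, more than 5 hydrogen-bond donors, more than 10 acceptors). Two assumptions apply: QikProp's exact cut-offs may differ slightly from these, and logP is not a column in the table above, so `log_p` is a hypothetical input supplied by the caller.

```python
def lipinski_violations(mol_mw, log_p, donor_hb, accpt_hb):
    """Count violations of the classic Lipinski rule of five.

    mol_mw, donor_hb and accpt_hb correspond to the mol_MW, donorHB and
    accptHB columns; log_p is NOT in the table and must come from elsewhere.
    """
    checks = [
        mol_mw > 500,    # molecular weight
        log_p > 5,       # octanol/water partition coefficient
        donor_hb > 5,    # hydrogen-bond donors
        accpt_hb > 10,   # hydrogen-bond acceptors
    ]
    return sum(checks)

# Illustrative call: weight and H-bond counts as read from the first
# table row (ZINC000003938482); the logP of 5.5 is a made-up placeholder.
print(lipinski_violations(700.787, 5.5, 1, 11.2))  # prints: 3
```

With a logP above 5 the count matches the value of 3 reported for that row, but this is a consistency check under the stated assumptions rather than a reproduction of the QikProp calculation.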


Drug Toxicity Information


Toxicity information of individual drugs using eToxPred
ZINC ID | SMILES | Surface Accessibility | Toxicity
ZINC000003938482 | CC[C@@H]([C@H](C)O)n1ncn(-c2ccc(N3CCN(c4ccc(OC[C@@H]5CO[C@@](Cn6cncn6)(c6ccc(F)cc6F)C5)cc4)CC3)cc2)c1=O | 0.031724381 | 0.429306683
ZINC000004097344 | CC[C@H](C)n1ncn(-c2ccc(N3CCN(c4ccc(OC[C@H]5CO[C@](Cn6cncn6)(c6ccc(Cl)cc6Cl)O5)cc4)CC3)cc2)c1=O | 0.044363286 | 0.444547752
ZINC000028639340 | CC[C@@H]([C@H](C)O)n1ncn(-c2ccc(N3CCN(c4ccc(OC[C@H]5CO[C@@](Cn6cncn6)(c6ccc(F)cc6F)C5)cc4)CC3)cc2)c1=O | 0.031724381 | 0.429306683
ZINC000150588351 | COC(=O)N[C@H](C(=O)N1CCC[C@H]1c1nc(-c2ccc3c(c2)O[C@@H](c2ccccc2)n2c-3cc3cc(-c4c[nH]c([C@@H]5CCCN5C(=O)[C@@H](NC(=O)OC)C(C)C)n4)ccc32)c[nH]1)C(C)C | 0.015875917 | 0.217012798
ZINC000150338819 | COC(=O)N[C@H](C(=O)N1CC2(CC2)C[C@H]1c1ncc(-c2ccc3c(c2)C(F)(F)c2cc(-c4ccc5nc([C@@H]6[C@H]7CC[C@H](C7)N6C(=O)[C@@H](NC(=O)OC)C(C)C)[nH]c5c4)ccc2-3)[nH]1)C(C)C | 0.004831095 | 0.217274953
ZINC000003810860 | O=C1[C@H](CC[C@H](O)c2ccc(F)cc2)[C@@H](c2ccc(O)cc2)N1c1ccc(F)cc1 | 0.113783441 | 0.30369071
ZINC000003914596 | CC(C)(C)NC(=O)[C@@H]1C[C@@H]2CCCC[C@@H]2CN1C[C@@H](O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(N)=O)NC(=O)c1ccc2ccccc2n1 | 0.039248776 | 0.333476835
ZINC000004474682 | CC(C)OC(=O)CCC/C=CC[C@H]1[C@@H](O)C[C@@H](O)[C@@H]1/C=C/[C@@H](O)COc1cccc(C(F)(F)F)c1 | 0.043771914 | 0.166030074
ZINC000003830943 | CC(=O)N(C[C@H](O)CO)c1c(I)c(C(=O)NC[C@H](O)CO)c(I)c(C(=O)NC[C@H](O)CO)c1I | 0.042332243 | 0.211666342
ZINC000029571072 | CNCC(=O)OCc1cccnc1N(C)C(=O)O[C@H](C)[n+]1cnn(C[C@](O)(c2cc(F)ccc2F)[C@@H](C)c2nc(-c3ccc(C#N)cc3)cs2)c1 | 0.025416027 | 0.235213823
ZINC000148723177 | COc1cc(N2CCC(N3CCN(C)CC3)CC2)ccc1Nc1ncc(Cl)c(Nc2ccccc2P(C)(C)=O)n1 | 0.142553556 | 0.1573686
ZINC000068204830 | COC(=O)N[C@H](C(=O)N1CCC[C@H]1c1ncc(-c2ccc(-c3ccc(-c4cnc([C@@H]5CCCN5C(=O)[C@@H](NC(=O)OC)C(C)C)[nH]4)cc3)cc2)[nH]1)C(C)C | 0.039971092 | 0.226481814
ZINC000002005305 | CN1c2c(nc(N)[nH]c2=O)NC[C@@H]1CNc1ccc(C(=O)N[C@@H](CCC(=O)O)C(=O)O)cc1 | 0.073417915 | 0.401979429
ZINC000008143864 | O=C(NC[C@H](O)CO)c1c(I)c(C(=O)NC[C@H](O)CO)c(I)c(N(CCO)C(=O)CO)c1I | 0.051780669 | 0.23149478
ZINC000096006013 | COc1ccc(C[C@@H]2c3cc(OC)c(OC)cc3CC[N+]2(C)CCC(=O)OCCCCCOC(=O)CC[N+]2(C)CCc3cc(OC)c(OC)cc3[C@H]2Cc2ccc(OC)c(OC)c2)cc1OC | 0.023699812 | 0.356880207
ZINC000060183170 | NC[C@@H]1O[C@H](O[C@@H]2[C@@H](CO)O[C@@H](O[C@@H]3[C@@H](O)[C@H](N)C[C@H](N)[C@H]3O[C@H]3O[C@H](CO)[C@@H](O)[C@H](O)[C@H]3N)[C@@H]2O)[C@H](N)[C@@H](O)[C@@H]1O | 0.010995771 | 0.270848186
ZINC000003973334 | C=C1C[C@@H]2[C@H](CC[C@]3(C)C(=O)CC[C@@H]23)[C@@]2(C)C=CC(=O)C=C12 | 0.036703817 | 0.383013774
ZINC000100014909 | COC(=O)c1ccc2c(c1)NC(=O)/C2=C(Nc1ccc(N(C)C(=O)CN2CCN(C)CC2)cc1)c1ccccc1 | 0.20006673 | 0.324562999
ZINC000026985532 | CC(C)(C)NC(=O)[C@@H]1C[C@H]2CCCC[C@@H]2CN1C[C@@H](O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(N)=O)NC(=O)c1ccc2ccccc2n1 | 0.039248776 | 0.333476835



Fusion Protein-Protein Interaction


Go to ChiPPI (Chimeric Protein-Protein Interactions) to see the chimeric protein-protein interactions on the ChiPPI page.


Protein-protein interactors with each fusion partner protein in wild-type from validated records (BIOGRID-3.4.160)
Gene | PPI interactors
SEC31A | SEC13, APC, TNNT1, PFDN1, ALG2, STAM, STAM2, CUL3, CDK9, HSPA8, VCAM1, ITGA4, REL, SHMT2, MOV10, NXF1, TFG, SCYL1, C6orf211, ATP6V1C1, ENO2, FDPS, NUDT13, SEC23A, SEC23B, SEC24B, ALDH5A1, ATP6V1D, CBS, CTH, CYB5B, DHX15, HNRNPA1, HSD17B10, NAPRT, NSF, SAR1A, TBCA, TPD52L2, TRIM25, TSTA3, TCEB2, NTRK1, TMEM17, XPO1, Sec24c, TSC22D2, NOP14, SEC23IP, NOC4L, GNPDA1, CCDC88A, BRCA1, KLHL12, PEF1, PDCD6, UBC, SEC24C, SEC24D, UBE2M, PIH1D1, ARIH1, USP8, ESR2, ARF1, POLE3, TBL1XR1, IGF2BP1, DEPDC5, BAZ1A, ARL5A, BMI1, RAB5C, MTF2, MAN1A1, NCOR1, MAN2A1, SLC39A9, TRAPPC12, TRAPPC10, TSC1, GCNT2, CARM1, HNRNPD, ARFGAP2, SLC30A5, UBE2J1, TRAPPC13, VPS51, HDAC8, DCAF7, EP300, SLC33A1, VPS45, IGF2R, RAB1B, TRAPPC11, VPS53, TRAPPC5, RABGEF1, HSP90AB1, TCF3, MAN1B1, B3GNT2, IGF1R, RICTOR, SEC24A, SLC39A7, MOGS, EZH2, EED, TRAPPC2, CTBP1, TRAPPC2L, MYC, TP53, BET1, VCP, KIAA1429, PRNP, APEX1, ABCC6, PPP1CA, BIRC3, LMBR1L, TRIM28, FHL5, PLEKHA4, N, LUZP1, ORF6, ORF7a, E, ORF14, ORF7b, MYOM2, CIT, CHMP4C, KIF14, KIF23, ACACA, Rnf183, LGALS9, RIN3, B3GAT1, CAV1, EBAG9, EMD, ERGIC2, GJA1, GJD3, HSD3B7, NUP155, ZFPL1, WDR5, GORASP1, TMPRSS2, FURIN, IFITM1, OPTN


Protein-protein interactors based on sequence similarity (STRING)
Gene | STRING network
SEC31A | (network image)
ALK | (network image)


Retained interactions in fusion protein (protein functional feature from UniProt).
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci* | BPloci | TotalLen | Still interaction with


Lost interactions due to fusion (protein functional feature from UniProt).
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci* | BPloci | TotalLen | Interaction lost with
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000311785 | - | 21 | 26 | 800_1113 | 875.3333333333334 | 1107.0 | PDCD6
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000326950 | - | 19 | 25 | 800_1113 | 836.3333333333334 | 1182.0 | PDCD6
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000348405 | - | 19 | 25 | 800_1113 | 836.3333333333334 | 1182.0 | PDCD6
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000355196 | - | 23 | 29 | 800_1113 | 875.3333333333334 | 1221.0 | PDCD6
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000395310 | - | 21 | 27 | 800_1113 | 875.3333333333334 | 1221.0 | PDCD6
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000432794 | - | 21 | 28 | 800_1113 | 875.3333333333334 | 1234.0 | PDCD6
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000443462 | - | 20 | 26 | 800_1113 | 870.3333333333334 | 1201.0 | PDCD6
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000448323 | - | 21 | 27 | 800_1113 | 875.3333333333334 | 1221.0 | PDCD6
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000500777 | - | 18 | 23 | 800_1113 | 836.3333333333334 | 1068.0 | PDCD6
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000508502 | - | 21 | 27 | 800_1113 | 875.3333333333334 | 1206.0 | PDCD6
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000509142 | - | 21 | 26 | 800_1113 | 875.3333333333334 | 1107.0 | PDCD6
Hgene | SEC31A | chr4:83765539 | chr2:29446394 | ENST00000513858 | - | 19 | 24 | 800_1113 | 836.3333333333334 | 1068.0 | PDCD6

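In every row of the lost-interactions table the breakpoint locus (BPloci) falls inside the protein feature range 800_1113, which is consistent with the PDCD6 interaction region being cut by the fusion. The sketch below classifies a head-gene feature relative to the breakpoint. It rests on an assumption not stated on this page: that the head partner contributes residues up to BPloci, so a feature is intact only if it ends at or before that position. The function name and the three labels are hypothetical.

```python
def hgene_feature_status(feature_loci, bp_locus):
    """Classify a head-gene protein feature relative to the breakpoint.

    feature_loci: 'start_end' string as in the 'Protein feature loci' column.
    bp_locus: breakpoint position in protein coordinates (BPloci column).
    Assumes the fusion keeps head-gene residues up to bp_locus.
    """
    start, end = (int(part) for part in feature_loci.split("_"))
    if end <= bp_locus:
        return "retained"    # whole feature lies before the breakpoint
    if start > bp_locus:
        return "lost"        # whole feature lies after the breakpoint
    return "truncated"       # breakpoint falls inside the feature

print(hgene_feature_status("800_1113", 875.3333333333334))  # prints: truncated
```

Under this reading, a truncated feature is reported by the page as a lost interaction, since only part of the 800 to 1113 region remains in the fusion protein.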


Related Drugs to SEC31A-ALK


Drugs used for this fusion-positive patient.
(Manual curation of PubMed, 04-30-2022 + MyCancerGenome)
Hgene | Tgene | Drug | Source | PMID
SEC31A | ALK | Nvp-Tae-684 (Alk Inhibitor) | PubMed | 25715771


Related Diseases to SEC31A-ALK


Diseases that have this fusion gene.
(Manual curation of PubMed, 04-30-2022 + MyCancerGenome)
Hgene | Tgene | Disease | Source | PMID
SEC31A | ALK | Lung Adenocarcinoma | PubMed | 25715771
SEC31A | ALK | Lung Adenocarcinoma | MyCancerGenome |
SEC31A | ALK | Acute Myeloid Leukemia | MyCancerGenome |
SEC31A | ALK | Cancer Of Unknown Primary | MyCancerGenome |
SEC31A | ALK | Diffuse Large B-Cell Lymphoma | MyCancerGenome |
SEC31A | ALK | Not Otherwise Specified | MyCancerGenome |
SEC31A | ALK | Glioblastoma | MyCancerGenome |

Diseases associated with fusion partners.
(DisGeNet 4.0)
Partner | Gene | Disease ID | Disease name | # pubmeds | Source
Hgene | SEC31A | C0079744 | Diffuse Large B-Cell Lymphoma | 1 | CTD_human
Tgene | ALK | C0007131 | Non-Small Cell Lung Carcinoma | 28 | CGI;CTD_human
Tgene | ALK | C0027819 | Neuroblastoma | 13 | CGI;CTD_human;ORPHANET
Tgene | ALK | C0152013 | Adenocarcinoma of lung (disorder) | 8 | CGI;CTD_human
Tgene | ALK | C2751681 | NEUROBLASTOMA, SUSCEPTIBILITY TO, 3 | 8 | CLINGEN;UNIPROT
Tgene | ALK | C0206180 | Ki-1+ Anaplastic Large Cell Lymphoma | 6 | CGI;CTD_human
Tgene | ALK | C0334121 | Inflammatory Myofibroblastic Tumor | 4 | CGI;CTD_human;ORPHANET
Tgene | ALK | C0018199 | Granuloma, Plasma Cell | 3 | CTD_human
Tgene | ALK | C0007621 | Neoplastic Cell Transformation | 2 | CTD_human
Tgene | ALK | C0027627 | Neoplasm Metastasis | 2 | CTD_human
Tgene | ALK | C0238463 | Papillary thyroid carcinoma | 2 | ORPHANET
Tgene | ALK | C0001973 | Alcoholic Intoxication, Chronic | 1 | PSYGENET
Tgene | ALK | C0006118 | Brain Neoplasms | 1 | CGI;CTD_human
Tgene | ALK | C0006142 | Malignant neoplasm of breast | 1 | CTD_human
Tgene | ALK | C0007134 | Renal Cell Carcinoma | 1 | CTD_human
Tgene | ALK | C0011570 | Mental Depression | 1 | PSYGENET
Tgene | ALK | C0011581 | Depressive disorder | 1 | PSYGENET
Tgene | ALK | C0027643 | Neoplasm Recurrence, Local | 1 | CTD_human
Tgene | ALK | C0036341 | Schizophrenia | 1 | PSYGENET
Tgene | ALK | C0079744 | Diffuse Large B-Cell Lymphoma | 1 | CTD_human
Tgene | ALK | C0085269 | Plasma Cell Granuloma, Pulmonary | 1 | CTD_human
Tgene | ALK | C0153633 | Malignant neoplasm of brain | 1 | CGI;CTD_human
Tgene | ALK | C0278601 | Inflammatory Breast Carcinoma | 1 | CTD_human
Tgene | ALK | C0279702 | Conventional (Clear Cell) Renal Cell Carcinoma | 1 | CTD_human
Tgene | ALK | C0496899 | Benign neoplasm of brain, unspecified | 1 | CTD_human
Tgene | ALK | C0678222 | Breast Carcinoma | 1 | CTD_human
Tgene | ALK | C0750974 | Brain Tumor, Primary | 1 | CTD_human
Tgene | ALK | C0750977 | Recurrent Brain Neoplasm | 1 | CTD_human
Tgene | ALK | C0750979 | Primary malignant neoplasm of brain | 1 | CTD_human
Tgene | ALK | C1257931 | Mammary Neoplasms, Human | 1 | CTD_human
Tgene | ALK | C1266042 | Chromophobe Renal Cell Carcinoma | 1 | CTD_human
Tgene | ALK | C1266043 | Sarcomatoid Renal Cell Carcinoma | 1 | CTD_human
Tgene | ALK | C1266044 | Collecting Duct Carcinoma of the Kidney | 1 | CTD_human
Tgene | ALK | C1306837 | Papillary Renal Cell Carcinoma | 1 | CTD_human
Tgene | ALK | C1332079 | Anaplastic Large Cell Lymphoma, ALK-Positive | 1 | ORPHANET
Tgene | ALK | C1458155 | Mammary Neoplasms | 1 | CTD_human
Tgene | ALK | C1527390 | Neoplasms, Intracranial | 1 | CTD_human
Tgene | ALK | C2931189 | Neural crest tumor | 1 | ORPHANET
Tgene | ALK | C3899155 | hereditary neuroblastoma | 1 | GENOMICS_ENGLAND
Tgene | ALK | C4704874 | Mammary Carcinoma, Human | 1 | CTD_human