FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:GTF2IRD1-ALK (FusionGDB2 ID:35245)

Fusion Gene Summary for GTF2IRD1-ALK

check button Fusion gene summary
Fusion gene informationFusion gene name: GTF2IRD1-ALK
Fusion gene ID: 35245
HgeneTgene
Gene symbol

GTF2IRD1

ALK

Gene ID

9569

238

Gene nameGTF2I repeat domain containing 1ALK receptor tyrosine kinase
SynonymsBEN|CREAM1|GTF3|MUSTRD1|RBAP2|WBS|WBSCR11|WBSCR12|hMusTRD1alpha1CD246|NBLST3
Cytomap

7q11.23

2p23.2-p23.1

Type of geneprotein-codingprotein-coding
Descriptiongeneral transcription factor II-I repeat domain-containing protein 1USE B1-binding proteinWilliams-Beuren syndrome chromosome region 11binding factor for early enhancergeneral transcription factor 3general transcription factor IIImuscle TFII-I repeaALK tyrosine kinase receptorCD246 antigenanaplastic lymphoma receptor tyrosine kinasemutant anaplastic lymphoma kinase
Modification date2020031320200329
UniProtAcc

Q9UHL9

Q96BT7

Ensembl transtripts involved in fusion geneENST00000489094, ENST00000265755, 
ENST00000424337, ENST00000455841, 
ENST00000476977, 
ENST00000389048, 
ENST00000431873, ENST00000498037, 
Fusion gene scores* DoF score11 X 6 X 9=59456 X 74 X 20=82880
# samples 1257
** MAII scorelog2(12/594*10)=-2.30742852519225
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(57/82880*10)=-7.18391827352181
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: GTF2IRD1 [Title/Abstract] AND ALK [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointGTF2IRD1(73935627)-ALK(29446394), # samples:3
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
TgeneALK

GO:0016310

phosphorylation

9174053

TgeneALK

GO:0046777

protein autophosphorylation

9174053


check buttonFusion gene breakpoints across GTF2IRD1 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across ALK (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4THCATCGA-EL-A4KD-01AGTF2IRD1chr7

73935627

-ALKchr2

29446394

-
ChimerDB4THCATCGA-EL-A4KD-01AGTF2IRD1chr7

73935627

+ALKchr2

29446394

-


Top

Fusion Gene ORF analysis for GTF2IRD1-ALK

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
3UTR-3CDSENST00000489094ENST00000389048GTF2IRD1chr7

73935627

+ALKchr2

29446394

-
3UTR-intronENST00000489094ENST00000431873GTF2IRD1chr7

73935627

+ALKchr2

29446394

-
3UTR-intronENST00000489094ENST00000498037GTF2IRD1chr7

73935627

+ALKchr2

29446394

-
5CDS-intronENST00000265755ENST00000431873GTF2IRD1chr7

73935627

+ALKchr2

29446394

-
5CDS-intronENST00000265755ENST00000498037GTF2IRD1chr7

73935627

+ALKchr2

29446394

-
5CDS-intronENST00000424337ENST00000431873GTF2IRD1chr7

73935627

+ALKchr2

29446394

-
5CDS-intronENST00000424337ENST00000498037GTF2IRD1chr7

73935627

+ALKchr2

29446394

-
5CDS-intronENST00000455841ENST00000431873GTF2IRD1chr7

73935627

+ALKchr2

29446394

-
5CDS-intronENST00000455841ENST00000498037GTF2IRD1chr7

73935627

+ALKchr2

29446394

-
5CDS-intronENST00000476977ENST00000431873GTF2IRD1chr7

73935627

+ALKchr2

29446394

-
5CDS-intronENST00000476977ENST00000498037GTF2IRD1chr7

73935627

+ALKchr2

29446394

-
In-frameENST00000265755ENST00000389048GTF2IRD1chr7

73935627

+ALKchr2

29446394

-
In-frameENST00000424337ENST00000389048GTF2IRD1chr7

73935627

+ALKchr2

29446394

-
In-frameENST00000455841ENST00000389048GTF2IRD1chr7

73935627

+ALKchr2

29446394

-
In-frameENST00000476977ENST00000389048GTF2IRD1chr7

73935627

+ALKchr2

29446394

-

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000265755GTF2IRD1chr773935627+ENST00000389048ALKchr229446394-354013993303089919
ENST00000455841GTF2IRD1chr773935627+ENST00000389048ALKchr229446394-345613151503005951
ENST00000424337GTF2IRD1chr773935627+ENST00000389048ALKchr229446394-32211080112770919
ENST00000476977GTF2IRD1chr773935627+ENST00000389048ALKchr229446394-4838269716914387898

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000265755ENST00000389048GTF2IRD1chr773935627+ALKchr229446394-0.0053528640.9946471
ENST00000455841ENST00000389048GTF2IRD1chr773935627+ALKchr229446394-0.0043992820.9956007
ENST00000424337ENST00000389048GTF2IRD1chr773935627+ALKchr229446394-0.0053839780.994616
ENST00000476977ENST00000389048GTF2IRD1chr773935627+ALKchr229446394-0.011810.98819005

Top

Fusion Genomic Features for GTF2IRD1-ALK


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.

Top

Fusion Protein Features for GTF2IRD1-ALK


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr7:73935627/chr2:29446394)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
GTF2IRD1

Q9UHL9

ALK

Q96BT7

FUNCTION: May be a transcription regulator involved in cell-cycle progression and skeletal muscle differentiation. May repress GTF2I transcriptional functions, by preventing its nuclear residency, or by inhibiting its transcriptional activation. May contribute to slow-twitch fiber type specificity during myogenesis and in regenerating muscles. Binds troponin I slow-muscle fiber enhancer (USE B1). Binds specifically and with high affinity to the EFG sequences derived from the early enhancer of HOXC8 (By similarity). {ECO:0000250, ECO:0000269|PubMed:11438732}.FUNCTION: Catalyzes the methylation of 5-carboxymethyl uridine to 5-methylcarboxymethyl uridine at the wobble position of the anticodon loop in tRNA via its methyltransferase domain (PubMed:20123966, PubMed:20308323, PubMed:31079898). Catalyzes the last step in the formation of 5-methylcarboxymethyl uridine at the wobble position of the anticodon loop in target tRNA (PubMed:20123966, PubMed:20308323). Has a preference for tRNA(Arg) and tRNA(Glu), and does not bind tRNA(Lys)(PubMed:20308323). Binds tRNA and catalyzes the iron and alpha-ketoglutarate dependent hydroxylation of 5-methylcarboxymethyl uridine at the wobble position of the anticodon loop in tRNA via its dioxygenase domain, giving rise to 5-(S)-methoxycarbonylhydroxymethyluridine; has a preference for tRNA(Gly) (PubMed:21285950). Required for normal survival after DNA damage (PubMed:20308323). May inhibit apoptosis and promote cell survival and angiogenesis (PubMed:19293182). {ECO:0000269|PubMed:19293182, ECO:0000269|PubMed:20123966, ECO:0000269|PubMed:20308323, ECO:0000269|PubMed:21285950, ECO:0000269|PubMed:31079898}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneGTF2IRD1chr7:73935627chr2:29446394ENST00000265755+727119_213335960.0RepeatNote=GTF2I-like 1
HgeneGTF2IRD1chr7:73935627chr2:29446394ENST00000424337+727119_213335945.0RepeatNote=GTF2I-like 1
HgeneGTF2IRD1chr7:73935627chr2:29446394ENST00000455841+727119_213367977.0RepeatNote=GTF2I-like 1
TgeneALKchr7:73935627chr2:29446394ENST0000038904818291116_139210571621.0DomainProtein kinase
TgeneALKchr7:73935627chr2:29446394ENST0000038904818291197_119910571621.0RegionNote=Inhibitor binding
TgeneALKchr7:73935627chr2:29446394ENST0000038904818291060_162010571621.0Topological domainCytoplasmic

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneGTF2IRD1chr7:73935627chr2:29446394ENST00000265755+727906_930335960.0Compositional biasNote=Ser-rich
HgeneGTF2IRD1chr7:73935627chr2:29446394ENST00000424337+727906_930335945.0Compositional biasNote=Ser-rich
HgeneGTF2IRD1chr7:73935627chr2:29446394ENST00000455841+727906_930367977.0Compositional biasNote=Ser-rich
HgeneGTF2IRD1chr7:73935627chr2:29446394ENST00000265755+727898_905335960.0MotifNuclear localization signal
HgeneGTF2IRD1chr7:73935627chr2:29446394ENST00000424337+727898_905335945.0MotifNuclear localization signal
HgeneGTF2IRD1chr7:73935627chr2:29446394ENST00000455841+727898_905367977.0MotifNuclear localization signal
HgeneGTF2IRD1chr7:73935627chr2:29446394ENST00000265755+727342_436335960.0RepeatNote=GTF2I-like 2
HgeneGTF2IRD1chr7:73935627chr2:29446394ENST00000265755+727556_650335960.0RepeatNote=GTF2I-like 3
HgeneGTF2IRD1chr7:73935627chr2:29446394ENST00000265755+727696_790335960.0RepeatNote=GTF2I-like 4
HgeneGTF2IRD1chr7:73935627chr2:29446394ENST00000265755+727793_887335960.0RepeatNote=GTF2I-like 5
HgeneGTF2IRD1chr7:73935627chr2:29446394ENST00000424337+727342_436335945.0RepeatNote=GTF2I-like 2
HgeneGTF2IRD1chr7:73935627chr2:29446394ENST00000424337+727556_650335945.0RepeatNote=GTF2I-like 3
HgeneGTF2IRD1chr7:73935627chr2:29446394ENST00000424337+727696_790335945.0RepeatNote=GTF2I-like 4
HgeneGTF2IRD1chr7:73935627chr2:29446394ENST00000424337+727793_887335945.0RepeatNote=GTF2I-like 5
HgeneGTF2IRD1chr7:73935627chr2:29446394ENST00000455841+727342_436367977.0RepeatNote=GTF2I-like 2
HgeneGTF2IRD1chr7:73935627chr2:29446394ENST00000455841+727556_650367977.0RepeatNote=GTF2I-like 3
HgeneGTF2IRD1chr7:73935627chr2:29446394ENST00000455841+727696_790367977.0RepeatNote=GTF2I-like 4
HgeneGTF2IRD1chr7:73935627chr2:29446394ENST00000455841+727793_887367977.0RepeatNote=GTF2I-like 5
TgeneALKchr7:73935627chr2:29446394ENST000003890481829816_94010571621.0Compositional biasNote=Gly-rich
TgeneALKchr7:73935627chr2:29446394ENST000003890481829264_42710571621.0DomainMAM 1
TgeneALKchr7:73935627chr2:29446394ENST000003890481829437_47310571621.0DomainNote=LDL-receptor class A
TgeneALKchr7:73935627chr2:29446394ENST000003890481829478_63610571621.0DomainMAM 2
TgeneALKchr7:73935627chr2:29446394ENST00000389048182919_103810571621.0Topological domainExtracellular
TgeneALKchr7:73935627chr2:29446394ENST0000038904818291039_105910571621.0TransmembraneHelical


Top

Fusion Gene Sequence for GTF2IRD1-ALK


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>35245_35245_1_GTF2IRD1-ALK_GTF2IRD1_chr7_73935627_ENST00000265755_ALK_chr2_29446394_ENST00000389048_length(transcript)=3540nt_BP=1399nt
TAAATGGCAGCCAATGGAGGGTGGTGTTGCGCGGGGCTGGGATTAGGGCCGGGGCGAATGGCTGGCAATCTTACTGGGATTACAGAACAA
AGAGCCTCCCCGCGCTCCCGCTCTCCGCTCCTCTCCCCGCGCCGCCCCGCCCTCCGCCGCAGCCCGCGCCGGGGGTGGGGGCCGCCGAGC
GCCAGCCCCCCGGCCGGCCGATTCCCCCCCCGCGCCCCCTCCCCGCGCCTCCCTCCCCGCCCTCGCCGCGCCGCCGTCCTCGCCTCCCTC
TGCCTCTCCTTCCCCCATTCTCCCGGATTAATTAAGGAGGCAGCGGCAGGAGGCTGAGTCCTGGCCGCGGGCCGGGGCCGGGGCGCCGCT
GGCAGGAGCGCTTGGGGATCCTCCAAGGCGACCATGGCCTTGCTGGGTAAGCGCTGTGACGTCCCCACCAACGGCTGCGGACCCGACCGC
TGGAACTCCGCGTTCACCCGCAAAGACGAGATCATCACCAGCCTCGTGTCTGCCTTAGACTCCATGTGCTCAGCGCTGTCCAAACTGAAC
GCCGAGGTGGCCTGTGTCGCCGTGCACGATGAGAGCGCCTTTGTGGTGGGCACAGAGAAGGGGAGAATGTTCCTGAATGCCCGGAAGGAG
CTACAGTCAGACTTCCTCAGGTTCTGCCGAGGGCCCCCGTGGAAGGATCCGGAGGCAGAGCACCCCAAGAAGGTGCAGCGGGGCGAGGGT
GGAGGCCGTAGCCTCCCTCGGTCCTCCCTGGAACATGGCTCAGATGTGTACCTTCTGCGGAAGATGGTAGAGGAGGTGTTTGATGTTCTT
TATAGCGAGGCCCTGGGAAGGGCCAGTGTGGTGCCACTGCCCTATGAGAGGCTGCTCAGGGAGCCAGGGCTGCTGGCCGTGCAGGGGCTG
CCCGAAGGCCTGGCCTTCCGAAGGCCAGCCGAGTATGACCCCAAGGCCCTCATGGCCATCCTGGAACACAGCCACCGCATCCGCTTCAAG
CTCAAGAGGCCACTTGAGGATGGCGGGCGGGACTCGAAGGCCCTGGTGGAGCTGAACGGTGTCTCCCTGATTCCCAAGGGGTCACGGGAC
TGTGGCCTGCATGGCCAGGCCCCCAAGGTGCCACCCCAGGACCTGCCCCCAACCGCCACCTCCTCCTCCATGGCCAGCTTCCTGTACAGC
ACGGCGCTCCCCAACCACGCCATCCGAGAGCTCAAGCAGGAAGCACCTTCCTGCCCCCTTGCCCCCAGCGACCTGGGCCTGAGTCGGCCC
ATGCCAGAGCCCAAGGCCACCGGTGCCCAAGACTTCTCCGACTGTTGTGGACAGAAGCCCACTGGGCCTGGTGGGCCTCTCATCCAGAAC
GTCCATGCCTCCAAGCGCATTCTCTTCTCCATCGTCCATGACAAGTCAGTGTACCGCCGGAAGCACCAGGAGCTGCAAGCCATGCAGATG
GAGCTGCAGAGCCCTGAGTACAAGCTGAGCAAGCTCCGCACCTCGACCATCATGACCGACTACAACCCCAACTACTGCTTTGCTGGCAAG
ACCTCCTCCATCAGTGACCTGAAGGAGGTGCCGCGGAAAAACATCACCCTCATTCGGGGTCTGGGCCATGGCGCCTTTGGGGAGGTGTAT
GAAGGCCAGGTGTCCGGAATGCCCAACGACCCAAGCCCCCTGCAAGTGGCTGTGAAGACGCTGCCTGAAGTGTGCTCTGAACAGGACGAA
CTGGATTTCCTCATGGAAGCCCTGATCATCAGCAAATTCAACCACCAGAACATTGTTCGCTGCATTGGGGTGAGCCTGCAATCCCTGCCC
CGGTTCATCCTGCTGGAGCTCATGGCGGGGGGAGACCTCAAGTCCTTCCTCCGAGAGACCCGCCCTCGCCCGAGCCAGCCCTCCTCCCTG
GCCATGCTGGACCTTCTGCACGTGGCTCGGGACATTGCCTGTGGCTGTCAGTATTTGGAGGAAAACCACTTCATCCACCGAGACATTGCT
GCCAGAAACTGCCTCTTGACCTGTCCAGGCCCTGGAAGAGTGGCCAAGATTGGAGACTTCGGGATGGCCCGAGACATCTACAGGGCGAGC
TACTATAGAAAGGGAGGCTGTGCCATGCTGCCAGTTAAGTGGATGCCCCCAGAGGCCTTCATGGAAGGAATATTCACTTCTAAAACAGAC
ACATGGTCCTTTGGAGTGCTGCTATGGGAAATCTTTTCTCTTGGATATATGCCATACCCCAGCAAAAGCAACCAGGAAGTTCTGGAGTTT
GTCACCAGTGGAGGCCGGATGGACCCACCCAAGAACTGCCCTGGGCCTGTATACCGGATAATGACTCAGTGCTGGCAACATCAGCCTGAA
GACAGGCCCAACTTTGCCATCATTTTGGAGAGGATTGAATACTGCACCCAGGACCCGGATGTAATCAACACCGCTTTGCCGATAGAATAT
GGTCCACTTGTGGAAGAGGAAGAGAAAGTGCCTGTGAGGCCCAAGGACCCTGAGGGGGTTCCTCCTCTCCTGGTCTCTCAACAGGCAAAA
CGGGAGGAGGAGCGCAGCCCAGCTGCCCCACCACCTCTGCCTACCACCTCCTCTGGCAAGGCTGCAAAGAAACCCACAGCTGCAGAGATC
TCTGTTCGAGTCCCTAGAGGGCCGGCCGTGGAAGGGGGACACGTGAATATGGCATTCTCTCAGTCCAACCCTCCTTCGGAGTTGCACAAG
GTCCACGGATCCAGAAACAAGCCCACCAGCTTGTGGAACCCAACGTACGGCTCCTGGTTTACAGAGAAACCCACCAAAAAGAATAATCCT
ATAGCAAAGAAGGAGCCACACGACAGGGGTAACCTGGGGCTGGAGGGAAGCTGTACTGTCCCACCTAACGTTGCAACTGGGAGACTTCCG
GGGGCCTCACTGCTCCTAGAGCCCTCTTCGCTGACTGCCAATATGAAGGAGGTACCTCTGTTCAGGCTACGTCACTTCCCTTGTGGGAAT
GTCAATTACGGCTACCAGCAACAGGGCTTGCCCTTAGAAGCCGCTACTGCCCCTGGAGCTGGTCATTACGAGGATACCATTCTGAAAAGC
AAGAATAGCATGAACCAGCCTGGGCCCTGAGCTCGGTCGCACACTCACTTCTCTTCCTTGGGATCCCTAAGACCGTGGAGGAGAGAGAGG
CAATGGCTCCTTCACAAACCAGAGACCAAATGTCACGTTTTGTTTTGTGCCAACCTATTTTGAAGTACCACCAAAAAAGCTGTATTTTGA
AAATGCTTTAGAAAGGTTTTGAGCATGGGTTCATCCTATTCTTTCGAAAGAAGAAAATATCATAAAAATGAGTGATAAATACAAGGCCCA
GATGTGGTTGCATAAGGTTTTTATGCATGTTTGTTGTATACTTCCTTATGCTTCTTTCAAATTGTGTGTGCTCTGCTTCAATGTAGTCAG
AATTAGCTGCTTCTATGTTTCATAGTTGGGGTCATAGATGTTTCCTTGCCTTGTTGATGTGGACATGAGCCATTTGAGGGGAGAGGGAAC

>35245_35245_1_GTF2IRD1-ALK_GTF2IRD1_chr7_73935627_ENST00000265755_ALK_chr2_29446394_ENST00000389048_length(amino acids)=919AA_BP=356
MAAGRGRGAAGRSAWGSSKATMALLGKRCDVPTNGCGPDRWNSAFTRKDEIITSLVSALDSMCSALSKLNAEVACVAVHDESAFVVGTEK
GRMFLNARKELQSDFLRFCRGPPWKDPEAEHPKKVQRGEGGGRSLPRSSLEHGSDVYLLRKMVEEVFDVLYSEALGRASVVPLPYERLLR
EPGLLAVQGLPEGLAFRRPAEYDPKALMAILEHSHRIRFKLKRPLEDGGRDSKALVELNGVSLIPKGSRDCGLHGQAPKVPPQDLPPTAT
SSSMASFLYSTALPNHAIRELKQEAPSCPLAPSDLGLSRPMPEPKATGAQDFSDCCGQKPTGPGGPLIQNVHASKRILFSIVHDKSVYRR
KHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKT
LPEVCSEQDELDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLE
ENHFIHRDIAARNCLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYP
SKSNQEVLEFVTSGGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGV
PPLLVSQQAKREEERSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWF
TEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTANMKEVPLFRLRHFPCGNVNYGYQQQGLPLEAATAPGA

--------------------------------------------------------------
>35245_35245_2_GTF2IRD1-ALK_GTF2IRD1_chr7_73935627_ENST00000424337_ALK_chr2_29446394_ENST00000389048_length(transcript)=3221nt_BP=1080nt
GAGGCTGAGTCCTGGCCGCGGGCCGGGGCCGGGGCGCCGCTGGCAGGAGCGCTTGGGGATCCTCCAAGGCGACCATGGCCTTGCTGGGTA
AGCGCTGTGACGTCCCCACCAACGGCTGCGGACCCGACCGCTGGAACTCCGCGTTCACCCGCAAAGACGAGATCATCACCAGCCTCGTGT
CTGCCTTAGACTCCATGTGCTCAGCGCTGTCCAAACTGAACGCCGAGGTGGCCTGTGTCGCCGTGCACGATGAGAGCGCCTTTGTGGTGG
GCACAGAGAAGGGGAGAATGTTCCTGAATGCCCGGAAGGAGCTACAGTCAGACTTCCTCAGGTTCTGCCGAGGGCCCCCGTGGAAGGATC
CGGAGGCAGAGCACCCCAAGAAGGTGCAGCGGGGCGAGGGTGGAGGCCGTAGCCTCCCTCGGTCCTCCCTGGAACATGGCTCAGATGTGT
ACCTTCTGCGGAAGATGGTAGAGGAGGTGTTTGATGTTCTTTATAGCGAGGCCCTGGGAAGGGCCAGTGTGGTGCCACTGCCCTATGAGA
GGCTGCTCAGGGAGCCAGGGCTGCTGGCCGTGCAGGGGCTGCCCGAAGGCCTGGCCTTCCGAAGGCCAGCCGAGTATGACCCCAAGGCCC
TCATGGCCATCCTGGAACACAGCCACCGCATCCGCTTCAAGCTCAAGAGGCCACTTGAGGATGGCGGGCGGGACTCGAAGGCCCTGGTGG
AGCTGAACGGTGTCTCCCTGATTCCCAAGGGGTCACGGGACTGTGGCCTGCATGGCCAGGCCCCCAAGGTGCCACCCCAGGACCTGCCCC
CAACCGCCACCTCCTCCTCCATGGCCAGCTTCCTGTACAGCACGGCGCTCCCCAACCACGCCATCCGAGAGCTCAAGCAGGAAGCACCTT
CCTGCCCCCTTGCCCCCAGCGACCTGGGCCTGAGTCGGCCCATGCCAGAGCCCAAGGCCACCGGTGCCCAAGACTTCTCCGACTGTTGTG
GACAGAAGCCCACTGGGCCTGGTGGGCCTCTCATCCAGAACGTCCATGCCTCCAAGCGCATTCTCTTCTCCATCGTCCATGACAAGTCAG
TGTACCGCCGGAAGCACCAGGAGCTGCAAGCCATGCAGATGGAGCTGCAGAGCCCTGAGTACAAGCTGAGCAAGCTCCGCACCTCGACCA
TCATGACCGACTACAACCCCAACTACTGCTTTGCTGGCAAGACCTCCTCCATCAGTGACCTGAAGGAGGTGCCGCGGAAAAACATCACCC
TCATTCGGGGTCTGGGCCATGGCGCCTTTGGGGAGGTGTATGAAGGCCAGGTGTCCGGAATGCCCAACGACCCAAGCCCCCTGCAAGTGG
CTGTGAAGACGCTGCCTGAAGTGTGCTCTGAACAGGACGAACTGGATTTCCTCATGGAAGCCCTGATCATCAGCAAATTCAACCACCAGA
ACATTGTTCGCTGCATTGGGGTGAGCCTGCAATCCCTGCCCCGGTTCATCCTGCTGGAGCTCATGGCGGGGGGAGACCTCAAGTCCTTCC
TCCGAGAGACCCGCCCTCGCCCGAGCCAGCCCTCCTCCCTGGCCATGCTGGACCTTCTGCACGTGGCTCGGGACATTGCCTGTGGCTGTC
AGTATTTGGAGGAAAACCACTTCATCCACCGAGACATTGCTGCCAGAAACTGCCTCTTGACCTGTCCAGGCCCTGGAAGAGTGGCCAAGA
TTGGAGACTTCGGGATGGCCCGAGACATCTACAGGGCGAGCTACTATAGAAAGGGAGGCTGTGCCATGCTGCCAGTTAAGTGGATGCCCC
CAGAGGCCTTCATGGAAGGAATATTCACTTCTAAAACAGACACATGGTCCTTTGGAGTGCTGCTATGGGAAATCTTTTCTCTTGGATATA
TGCCATACCCCAGCAAAAGCAACCAGGAAGTTCTGGAGTTTGTCACCAGTGGAGGCCGGATGGACCCACCCAAGAACTGCCCTGGGCCTG
TATACCGGATAATGACTCAGTGCTGGCAACATCAGCCTGAAGACAGGCCCAACTTTGCCATCATTTTGGAGAGGATTGAATACTGCACCC
AGGACCCGGATGTAATCAACACCGCTTTGCCGATAGAATATGGTCCACTTGTGGAAGAGGAAGAGAAAGTGCCTGTGAGGCCCAAGGACC
CTGAGGGGGTTCCTCCTCTCCTGGTCTCTCAACAGGCAAAACGGGAGGAGGAGCGCAGCCCAGCTGCCCCACCACCTCTGCCTACCACCT
CCTCTGGCAAGGCTGCAAAGAAACCCACAGCTGCAGAGATCTCTGTTCGAGTCCCTAGAGGGCCGGCCGTGGAAGGGGGACACGTGAATA
TGGCATTCTCTCAGTCCAACCCTCCTTCGGAGTTGCACAAGGTCCACGGATCCAGAAACAAGCCCACCAGCTTGTGGAACCCAACGTACG
GCTCCTGGTTTACAGAGAAACCCACCAAAAAGAATAATCCTATAGCAAAGAAGGAGCCACACGACAGGGGTAACCTGGGGCTGGAGGGAA
GCTGTACTGTCCCACCTAACGTTGCAACTGGGAGACTTCCGGGGGCCTCACTGCTCCTAGAGCCCTCTTCGCTGACTGCCAATATGAAGG
AGGTACCTCTGTTCAGGCTACGTCACTTCCCTTGTGGGAATGTCAATTACGGCTACCAGCAACAGGGCTTGCCCTTAGAAGCCGCTACTG
CCCCTGGAGCTGGTCATTACGAGGATACCATTCTGAAAAGCAAGAATAGCATGAACCAGCCTGGGCCCTGAGCTCGGTCGCACACTCACT
TCTCTTCCTTGGGATCCCTAAGACCGTGGAGGAGAGAGAGGCAATGGCTCCTTCACAAACCAGAGACCAAATGTCACGTTTTGTTTTGTG
CCAACCTATTTTGAAGTACCACCAAAAAAGCTGTATTTTGAAAATGCTTTAGAAAGGTTTTGAGCATGGGTTCATCCTATTCTTTCGAAA
GAAGAAAATATCATAAAAATGAGTGATAAATACAAGGCCCAGATGTGGTTGCATAAGGTTTTTATGCATGTTTGTTGTATACTTCCTTAT
GCTTCTTTCAAATTGTGTGTGCTCTGCTTCAATGTAGTCAGAATTAGCTGCTTCTATGTTTCATAGTTGGGGTCATAGATGTTTCCTTGC

>35245_35245_2_GTF2IRD1-ALK_GTF2IRD1_chr7_73935627_ENST00000424337_ALK_chr2_29446394_ENST00000389048_length(amino acids)=919AA_BP=356
MAAGRGRGAAGRSAWGSSKATMALLGKRCDVPTNGCGPDRWNSAFTRKDEIITSLVSALDSMCSALSKLNAEVACVAVHDESAFVVGTEK
GRMFLNARKELQSDFLRFCRGPPWKDPEAEHPKKVQRGEGGGRSLPRSSLEHGSDVYLLRKMVEEVFDVLYSEALGRASVVPLPYERLLR
EPGLLAVQGLPEGLAFRRPAEYDPKALMAILEHSHRIRFKLKRPLEDGGRDSKALVELNGVSLIPKGSRDCGLHGQAPKVPPQDLPPTAT
SSSMASFLYSTALPNHAIRELKQEAPSCPLAPSDLGLSRPMPEPKATGAQDFSDCCGQKPTGPGGPLIQNVHASKRILFSIVHDKSVYRR
KHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKT
LPEVCSEQDELDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLE
ENHFIHRDIAARNCLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYP
SKSNQEVLEFVTSGGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGV
PPLLVSQQAKREEERSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWF
TEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTANMKEVPLFRLRHFPCGNVNYGYQQQGLPLEAATAPGA

--------------------------------------------------------------
>35245_35245_3_GTF2IRD1-ALK_GTF2IRD1_chr7_73935627_ENST00000455841_ALK_chr2_29446394_ENST00000389048_length(transcript)=3456nt_BP=1315nt
GCCAGCCCCCCGGCCGGCCGATTCCCCCCCCGCGCCCCCTCCCCGCGCCTCCCTCCCCGCCCTCGCCGCGCCGCCGTCCTCGCCTCCCTC
TGCCTCTCCTTCCCCCATTCTCCCGGATTAATTAAGGAGGCAGCGGCAGGAGGCTGAGTCCTGGCCGCGGGCCGGGGCCGGGGCGCCGCT
GGCAGGAGCGCTTGGGGATCCTCCAAGGCGACCATGGCCTTGCTGGGTAAGCGCTGTGACGTCCCCACCAACGGCTGCGGACCCGACCGC
TGGAACTCCGCGTTCACCCGCAAAGACGAGATCATCACCAGCCTCGTGTCTGCCTTAGACTCCATGTGCTCAGCGCTGTCCAAACTGAAC
GCCGAGGTGGCCTGTGTCGCCGTGCACGATGAGAGCGCCTTTGTGGTGGGCACAGAGAAGGGGAGAATGTTCCTGAATGCCCGGAAGGAG
CTACAGTCAGACTTCCTCAGGTTCTGCCTCTCCGCAGCTCAGCACAGGGCAGCGACATCCCAGCTCGAAGGCCGGGTGGTGAGACGGGTG
CTCACTGTGGCCTCGCGTGCTCTGTGTCCCACAGGAGGGCCCCCGTGGAAGGATCCGGAGGCAGAGCACCCCAAGAAGGTGCAGCGGGGC
GAGGGTGGAGGCCGTAGCCTCCCTCGGTCCTCCCTGGAACATGGCTCAGATGTGTACCTTCTGCGGAAGATGGTAGAGGAGGTGTTTGAT
GTTCTTTATAGCGAGGCCCTGGGAAGGGCCAGTGTGGTGCCACTGCCCTATGAGAGGCTGCTCAGGGAGCCAGGGCTGCTGGCCGTGCAG
GGGCTGCCCGAAGGCCTGGCCTTCCGAAGGCCAGCCGAGTATGACCCCAAGGCCCTCATGGCCATCCTGGAACACAGCCACCGCATCCGC
TTCAAGCTCAAGAGGCCACTTGAGGATGGCGGGCGGGACTCGAAGGCCCTGGTGGAGCTGAACGGTGTCTCCCTGATTCCCAAGGGGTCA
CGGGACTGTGGCCTGCATGGCCAGGCCCCCAAGGTGCCACCCCAGGACCTGCCCCCAACCGCCACCTCCTCCTCCATGGCCAGCTTCCTG
TACAGCACGGCGCTCCCCAACCACGCCATCCGAGAGCTCAAGCAGGAAGCACCTTCCTGCCCCCTTGCCCCCAGCGACCTGGGCCTGAGT
CGGCCCATGCCAGAGCCCAAGGCCACCGGTGCCCAAGACTTCTCCGACTGTTGTGGACAGAAGCCCACTGGGCCTGGTGGGCCTCTCATC
CAGAACGTCCATGCCTCCAAGCGCATTCTCTTCTCCATCGTCCATGACAAGTCAGTGTACCGCCGGAAGCACCAGGAGCTGCAAGCCATG
CAGATGGAGCTGCAGAGCCCTGAGTACAAGCTGAGCAAGCTCCGCACCTCGACCATCATGACCGACTACAACCCCAACTACTGCTTTGCT
GGCAAGACCTCCTCCATCAGTGACCTGAAGGAGGTGCCGCGGAAAAACATCACCCTCATTCGGGGTCTGGGCCATGGCGCCTTTGGGGAG
GTGTATGAAGGCCAGGTGTCCGGAATGCCCAACGACCCAAGCCCCCTGCAAGTGGCTGTGAAGACGCTGCCTGAAGTGTGCTCTGAACAG
GACGAACTGGATTTCCTCATGGAAGCCCTGATCATCAGCAAATTCAACCACCAGAACATTGTTCGCTGCATTGGGGTGAGCCTGCAATCC
CTGCCCCGGTTCATCCTGCTGGAGCTCATGGCGGGGGGAGACCTCAAGTCCTTCCTCCGAGAGACCCGCCCTCGCCCGAGCCAGCCCTCC
TCCCTGGCCATGCTGGACCTTCTGCACGTGGCTCGGGACATTGCCTGTGGCTGTCAGTATTTGGAGGAAAACCACTTCATCCACCGAGAC
ATTGCTGCCAGAAACTGCCTCTTGACCTGTCCAGGCCCTGGAAGAGTGGCCAAGATTGGAGACTTCGGGATGGCCCGAGACATCTACAGG
GCGAGCTACTATAGAAAGGGAGGCTGTGCCATGCTGCCAGTTAAGTGGATGCCCCCAGAGGCCTTCATGGAAGGAATATTCACTTCTAAA
ACAGACACATGGTCCTTTGGAGTGCTGCTATGGGAAATCTTTTCTCTTGGATATATGCCATACCCCAGCAAAAGCAACCAGGAAGTTCTG
GAGTTTGTCACCAGTGGAGGCCGGATGGACCCACCCAAGAACTGCCCTGGGCCTGTATACCGGATAATGACTCAGTGCTGGCAACATCAG
CCTGAAGACAGGCCCAACTTTGCCATCATTTTGGAGAGGATTGAATACTGCACCCAGGACCCGGATGTAATCAACACCGCTTTGCCGATA
GAATATGGTCCACTTGTGGAAGAGGAAGAGAAAGTGCCTGTGAGGCCCAAGGACCCTGAGGGGGTTCCTCCTCTCCTGGTCTCTCAACAG
GCAAAACGGGAGGAGGAGCGCAGCCCAGCTGCCCCACCACCTCTGCCTACCACCTCCTCTGGCAAGGCTGCAAAGAAACCCACAGCTGCA
GAGATCTCTGTTCGAGTCCCTAGAGGGCCGGCCGTGGAAGGGGGACACGTGAATATGGCATTCTCTCAGTCCAACCCTCCTTCGGAGTTG
CACAAGGTCCACGGATCCAGAAACAAGCCCACCAGCTTGTGGAACCCAACGTACGGCTCCTGGTTTACAGAGAAACCCACCAAAAAGAAT
AATCCTATAGCAAAGAAGGAGCCACACGACAGGGGTAACCTGGGGCTGGAGGGAAGCTGTACTGTCCCACCTAACGTTGCAACTGGGAGA
CTTCCGGGGGCCTCACTGCTCCTAGAGCCCTCTTCGCTGACTGCCAATATGAAGGAGGTACCTCTGTTCAGGCTACGTCACTTCCCTTGT
GGGAATGTCAATTACGGCTACCAGCAACAGGGCTTGCCCTTAGAAGCCGCTACTGCCCCTGGAGCTGGTCATTACGAGGATACCATTCTG
AAAAGCAAGAATAGCATGAACCAGCCTGGGCCCTGAGCTCGGTCGCACACTCACTTCTCTTCCTTGGGATCCCTAAGACCGTGGAGGAGA
GAGAGGCAATGGCTCCTTCACAAACCAGAGACCAAATGTCACGTTTTGTTTTGTGCCAACCTATTTTGAAGTACCACCAAAAAAGCTGTA
TTTTGAAAATGCTTTAGAAAGGTTTTGAGCATGGGTTCATCCTATTCTTTCGAAAGAAGAAAATATCATAAAAATGAGTGATAAATACAA
GGCCCAGATGTGGTTGCATAAGGTTTTTATGCATGTTTGTTGTATACTTCCTTATGCTTCTTTCAAATTGTGTGTGCTCTGCTTCAATGT
AGTCAGAATTAGCTGCTTCTATGTTTCATAGTTGGGGTCATAGATGTTTCCTTGCCTTGTTGATGTGGACATGAGCCATTTGAGGGGAGA

>35245_35245_3_GTF2IRD1-ALK_GTF2IRD1_chr7_73935627_ENST00000455841_ALK_chr2_29446394_ENST00000389048_length(amino acids)=951AA_BP=388
MAAGRGRGAAGRSAWGSSKATMALLGKRCDVPTNGCGPDRWNSAFTRKDEIITSLVSALDSMCSALSKLNAEVACVAVHDESAFVVGTEK
GRMFLNARKELQSDFLRFCLSAAQHRAATSQLEGRVVRRVLTVASRALCPTGGPPWKDPEAEHPKKVQRGEGGGRSLPRSSLEHGSDVYL
LRKMVEEVFDVLYSEALGRASVVPLPYERLLREPGLLAVQGLPEGLAFRRPAEYDPKALMAILEHSHRIRFKLKRPLEDGGRDSKALVEL
NGVSLIPKGSRDCGLHGQAPKVPPQDLPPTATSSSMASFLYSTALPNHAIRELKQEAPSCPLAPSDLGLSRPMPEPKATGAQDFSDCCGQ
KPTGPGGPLIQNVHASKRILFSIVHDKSVYRRKHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLI
RGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDELDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLR
ETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARNCLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPE
AFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTSGGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQD
PDVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREEERSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMA
FSQSNPPSELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTANMKEV

--------------------------------------------------------------
>35245_35245_4_GTF2IRD1-ALK_GTF2IRD1_chr7_73935627_ENST00000476977_ALK_chr2_29446394_ENST00000389048_length(transcript)=4838nt_BP=2697nt
AACATTTAGCAGCAAACTCAACATACATTTTGGCCAAACGCCCACCGGCCAGTTGTTTAAATCAATATTTATCCACAGCCAAAAAGCAGA
GGAGAGCCAGAACTGAGGCCAGAGCCGAGCTCTGATGCATCTCTCATTTCTCGGGATGTTTCTGTCCCTGTGGTTGGACACCTCTGGCCT
TGTGAAGTGTGATGCACTGTCACATCTCCTGTTCTGTGTCATTGGCCAATGATCATATATCATGACCTGCTGGAAGGCCTGTCTGTGGCT
GGGACCACACGCCTTGGGCCTTATGCACACTGGGCACTGGTCGGGATCCTGGGGTGCAACAGTGGCAGGCAGACCTGGTTTCTGCCCTCA
AAGAGCTTACAGATGGCAGGGGCACCCATGGCGGCAGAAGACACCCCAGGCCTGGTGCCCTTTGTGGTGCCAGCCCCATGGTCCTCCTGC
CTGGGCCTTCCCTACCCCATTGGGTGCAGAAACTCCCTGTCTGCAGGTAGGAGACAGAGGGTAGGTTTTCAGGCTCCCTTGGGAACTGCA
GCCCTGCTCTCTGCCATCAACACCCAGCAGGGGCCACACAGAGAGCACCGGGACTGAGCCCATAGAGGGGAACCGAGAGGCCCCTGCCTC
TAGTCTCTGCCTTCTTTGCTTGGATTGGTGCAGGGACAGCTGCCTTGAGGGCAGGCCCTGGCACTGGGGCAGCCTGTGGGTGCCCCCTGG
GTCAAGAAGGAGAGGGGCAGGGTAGAACCAGGAGCCAAAGGAGGCTGATCTTTGCATCTCATGGGTGCCCAGCTGGACACTGTCATACCC
AGGAAGCCTGTGCCATGCCATGGGGACCCACAACTGGGGGCCCTGGACTTGAGGGGGAGGATGCAGCTCTGTCCCCCAGGAACCCCATTG
CAACAGGACACAGTCCTGCCCTGGGGAGCCCCTGACCTGAGACAAAGCAGCCTCGGCCCTGCTGTATCTTTCCATACCCCTGATGCCAAG
TCTCCTGGCTAGGAGGGAAACTGAGGCTGGAAGGCCTCGGCGGGGGTGGCATTGGCCTCGGGAGCATGTGGCTTGATGCAGAAATGTGAC
GGCAGAGCTCAGAGGCATGCGGAAGGGAGGGGAGGACATCACCGGCTCCTGACCCAGCTGGGCTTCAGGTTGGGGGTACAGGAGGTGGGC
AAGCAGGTTGGACAATTAAAAGCTTCGATGAGGCTGGGTGAGTGGCTTATGCCTGTATTTCCAACACTTTGGGAGGCTGAGGTGGGCAGA
TCACCTGAGGCCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGCCAGACGTGTGGT
GGCACCTGTAATCCCAGCTACCCGGGAGGCTGAGGCAGGAGAATCACTCGAACCCAGGAAGGGGAGGTTGCAGTGAGCCAAGATTGCACC
ACTGCACTATAGCCTGGGCAACAGAGTGAGACTCTGTCTCGAAATAAAATTAAATTTAAAATTTAAAAAAGCTTCAAGGACAACCAGCAG
ATGATGGCAGGACCAGGAAGGGTGCTTCAGGCAGCGGGAACTGAACCTGCACAGACTAGGGAAGCATGAACATCGGTACATCTGGGAGTG
GCACAGCTTAGAGGGCCTGGAGCATGGAGTGTGAGGGGGAACTGGAGTGAGGGACAGGAGACCAGGCGACCATGGCCTTGCTGGGTAAGC
GCTGTGACGTCCCCACCAACGGCTGCGGACCCGACCGCTGGAACTCCGCGTTCACCCGCAAAGACGAGATCATCACCAGCCTCGTGTCTG
CCTTAGACTCCATGTGCTCAGCGCTGTCCAAACTGAACGCCGAGGTGGCCTGTGTCGCCGTGCACGATGAGAGCGCCTTTGTGGTGGGCA
CAGAGAAGGGGAGAATGTTCCTGAATGCCCGGAAGGAGCTACAGTCAGACTTCCTCAGGTTCTGCCGAGGGCCCCCGTGGAAGGATCCGG
AGGCAGAGCACCCCAAGAAGGTGCAGCGGGGCGAGGGTGGAGGCCGTAGCCTCCCTCGGTCCTCCCTGGAACATGGCTCAGATGTGTACC
TTCTGCGGAAGATGGTAGAGGAGGTGTTTGATGTTCTTTATAGCGAGGCCCTGGGAAGGGCCAGTGTGGTGCCACTGCCCTATGAGAGGC
TGCTCAGGGAGCCAGGGCTGCTGGCCGTGCAGGGGCTGCCCGAAGGCCTGGCCTTCCGAAGGCCAGCCGAGTATGACCCCAAGGCCCTCA
TGGCCATCCTGGAACACAGCCACCGCATCCGCTTCAAGCTCAAGAGGCCACTTGAGGATGGCGGGCGGGACTCGAAGGCCCTGGTGGAGC
TGAACGGTGTCTCCCTGATTCCCAAGGGGTCACGGGACTGTGGCCTGCATGGCCAGGCCCCCAAGGTGCCACCCCAGGACCTGCCCCCAA
CCGCCACCTCCTCCTCCATGGCCAGCTTCCTGTACAGCACGGCGCTCCCCAACCACGCCATCCGAGAGCTCAAGCAGGAAGCACCTTCCT
GCCCCCTTGCCCCCAGCGACCTGGGCCTGAGTCGGCCCATGCCAGAGCCCAAGGCCACCGGTGCCCAAGACTTCTCCGACTGTTGTGGAC
AGAAGCCCACTGGGCCTGGTGGGCCTCTCATCCAGAACGTCCATGCCTCCAAGCGCATTCTCTTCTCCATCGTCCATGACAAGTCAGTGT
ACCGCCGGAAGCACCAGGAGCTGCAAGCCATGCAGATGGAGCTGCAGAGCCCTGAGTACAAGCTGAGCAAGCTCCGCACCTCGACCATCA
TGACCGACTACAACCCCAACTACTGCTTTGCTGGCAAGACCTCCTCCATCAGTGACCTGAAGGAGGTGCCGCGGAAAAACATCACCCTCA
TTCGGGGTCTGGGCCATGGCGCCTTTGGGGAGGTGTATGAAGGCCAGGTGTCCGGAATGCCCAACGACCCAAGCCCCCTGCAAGTGGCTG
TGAAGACGCTGCCTGAAGTGTGCTCTGAACAGGACGAACTGGATTTCCTCATGGAAGCCCTGATCATCAGCAAATTCAACCACCAGAACA
TTGTTCGCTGCATTGGGGTGAGCCTGCAATCCCTGCCCCGGTTCATCCTGCTGGAGCTCATGGCGGGGGGAGACCTCAAGTCCTTCCTCC
GAGAGACCCGCCCTCGCCCGAGCCAGCCCTCCTCCCTGGCCATGCTGGACCTTCTGCACGTGGCTCGGGACATTGCCTGTGGCTGTCAGT
ATTTGGAGGAAAACCACTTCATCCACCGAGACATTGCTGCCAGAAACTGCCTCTTGACCTGTCCAGGCCCTGGAAGAGTGGCCAAGATTG
GAGACTTCGGGATGGCCCGAGACATCTACAGGGCGAGCTACTATAGAAAGGGAGGCTGTGCCATGCTGCCAGTTAAGTGGATGCCCCCAG
AGGCCTTCATGGAAGGAATATTCACTTCTAAAACAGACACATGGTCCTTTGGAGTGCTGCTATGGGAAATCTTTTCTCTTGGATATATGC
CATACCCCAGCAAAAGCAACCAGGAAGTTCTGGAGTTTGTCACCAGTGGAGGCCGGATGGACCCACCCAAGAACTGCCCTGGGCCTGTAT
ACCGGATAATGACTCAGTGCTGGCAACATCAGCCTGAAGACAGGCCCAACTTTGCCATCATTTTGGAGAGGATTGAATACTGCACCCAGG
ACCCGGATGTAATCAACACCGCTTTGCCGATAGAATATGGTCCACTTGTGGAAGAGGAAGAGAAAGTGCCTGTGAGGCCCAAGGACCCTG
AGGGGGTTCCTCCTCTCCTGGTCTCTCAACAGGCAAAACGGGAGGAGGAGCGCAGCCCAGCTGCCCCACCACCTCTGCCTACCACCTCCT
CTGGCAAGGCTGCAAAGAAACCCACAGCTGCAGAGATCTCTGTTCGAGTCCCTAGAGGGCCGGCCGTGGAAGGGGGACACGTGAATATGG
CATTCTCTCAGTCCAACCCTCCTTCGGAGTTGCACAAGGTCCACGGATCCAGAAACAAGCCCACCAGCTTGTGGAACCCAACGTACGGCT
CCTGGTTTACAGAGAAACCCACCAAAAAGAATAATCCTATAGCAAAGAAGGAGCCACACGACAGGGGTAACCTGGGGCTGGAGGGAAGCT
GTACTGTCCCACCTAACGTTGCAACTGGGAGACTTCCGGGGGCCTCACTGCTCCTAGAGCCCTCTTCGCTGACTGCCAATATGAAGGAGG
TACCTCTGTTCAGGCTACGTCACTTCCCTTGTGGGAATGTCAATTACGGCTACCAGCAACAGGGCTTGCCCTTAGAAGCCGCTACTGCCC
CTGGAGCTGGTCATTACGAGGATACCATTCTGAAAAGCAAGAATAGCATGAACCAGCCTGGGCCCTGAGCTCGGTCGCACACTCACTTCT
CTTCCTTGGGATCCCTAAGACCGTGGAGGAGAGAGAGGCAATGGCTCCTTCACAAACCAGAGACCAAATGTCACGTTTTGTTTTGTGCCA
ACCTATTTTGAAGTACCACCAAAAAAGCTGTATTTTGAAAATGCTTTAGAAAGGTTTTGAGCATGGGTTCATCCTATTCTTTCGAAAGAA
GAAAATATCATAAAAATGAGTGATAAATACAAGGCCCAGATGTGGTTGCATAAGGTTTTTATGCATGTTTGTTGTATACTTCCTTATGCT
TCTTTCAAATTGTGTGTGCTCTGCTTCAATGTAGTCAGAATTAGCTGCTTCTATGTTTCATAGTTGGGGTCATAGATGTTTCCTTGCCTT

>35245_35245_4_GTF2IRD1-ALK_GTF2IRD1_chr7_73935627_ENST00000476977_ALK_chr2_29446394_ENST00000389048_length(amino acids)=898AA_BP=335
MALLGKRCDVPTNGCGPDRWNSAFTRKDEIITSLVSALDSMCSALSKLNAEVACVAVHDESAFVVGTEKGRMFLNARKELQSDFLRFCRG
PPWKDPEAEHPKKVQRGEGGGRSLPRSSLEHGSDVYLLRKMVEEVFDVLYSEALGRASVVPLPYERLLREPGLLAVQGLPEGLAFRRPAE
YDPKALMAILEHSHRIRFKLKRPLEDGGRDSKALVELNGVSLIPKGSRDCGLHGQAPKVPPQDLPPTATSSSMASFLYSTALPNHAIREL
KQEAPSCPLAPSDLGLSRPMPEPKATGAQDFSDCCGQKPTGPGGPLIQNVHASKRILFSIVHDKSVYRRKHQELQAMQMELQSPEYKLSK
LRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDELDFLMEALIIS
KFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARNCLLTCPGP
GRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTSGGRMDPPK
NCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREEERSPAAPP
PLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAKKEPHDRGN

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for GTF2IRD1-ALK


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for GTF2IRD1-ALK


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for GTF2IRD1-ALK


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource