Fusion Gene Studies
in Kim Lab

Center for Computational Systems Medicine
Fusion gene: GTF2IRD1-ALK (FusionGDB2 ID: HG9569TG238)

Fusion Gene Summary for GTF2IRD1-ALK

Fusion gene summary

Fusion gene information
Fusion gene name: GTF2IRD1-ALK
Fusion gene ID: hg9569tg238

| | Hgene | Tgene |
|---|---|---|
| Gene symbol | GTF2IRD1 | ALK |
| Gene ID | 9569 | 238 |
| Gene name | GTF2I repeat domain containing 1 | ALK receptor tyrosine kinase |
| Synonyms | BEN, CREAM1, GTF3, MUSTRD1, RBAP2, WBS, WBSCR11, WBSCR12, hMusTRD1alpha1 | CD246, NBLST3 |
| Cytomap | 7q11.23 | 2p23.2-p23.1 |
| Type of gene | protein-coding | protein-coding |
| Description | general transcription factor II-I repeat domain-containing protein 1; USE B1-binding protein; Williams-Beuren syndrome chromosome region 11; binding factor for early enhancer; general transcription factor 3; general transcription factor III; muscle TFII-I repea | ALK tyrosine kinase receptor; CD246 antigen; anaplastic lymphoma receptor tyrosine kinase; mutant anaplastic lymphoma kinase |
| Modification date | 20200313 | 20200329 |
| UniProtAcc | . | Q9UM73 |
| Ensembl transcripts involved in fusion gene | ENST00000489094, ENST00000265755, ENST00000424337, ENST00000455841, ENST00000476977 | |
| * DoF score | 11 X 6 X 9=594 | 56 X 74 X 20=82880 |
| # samples | 12 | 57 |
| ** MAII score | log2(12/594*10)=-2.30742852519225; possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs), DoF>8 and MAII<0 | log2(57/82880*10)=-7.18391827352181; possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs), DoF>8 and MAII<0 |
Context

PubMed: GTF2IRD1 [Title/Abstract] AND ALK [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpoint: GTF2IRD1(73935627)-ALK(29446394), # samples: 3
Anticipated loss of major functional domain due to fusion event:
GTF2IRD1-ALK appears to have lost the major protein functional domain of the Hgene partner, which is a CGC gene, because that domain is not retained in the partially deleted in-frame ORF.
GTF2IRD1-ALK appears to have lost the major protein functional domain of the Hgene partner, which is an essential gene, because that domain is not retained in the partially deleted in-frame ORF.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)
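
The two score definitions above can be checked against the values reported for this entry. The following is a minimal sketch that assumes nothing beyond the two formulas and the counts shown on this page; the function names (`dof_score`, `maii_score`, `is_peginpcfg`) are illustrative and are not part of FusionGDB.

```python
import math


def dof_score(n_partners: int, n_breakpoints: int, n_cancer_types: int) -> int:
    """Degree of Frequency = # partners X # break points X # cancer types."""
    return n_partners * n_breakpoints * n_cancer_types


def maii_score(n_samples: int, dof: int) -> float:
    """Major Active Isofusion Index = log2(# samples / DoF score * 10)."""
    return math.log2(n_samples / dof * 10)


def is_peginpcfg(dof: int, maii: float) -> bool:
    """Criterion quoted on the page: DoF > 8 and MAII < 0."""
    return dof > 8 and maii < 0


# Counts reported on this page for the two partner genes.
h_dof = dof_score(11, 6, 9)      # GTF2IRD1 (Hgene)
t_dof = dof_score(56, 74, 20)    # ALK (Tgene)
h_maii = maii_score(12, h_dof)
t_maii = maii_score(57, t_dof)

print(h_dof, round(h_maii, 4), is_peginpcfg(h_dof, h_maii))
print(t_dof, round(t_maii, 4), is_peginpcfg(t_dof, t_maii))
```

Both partners satisfy the quoted criterion, which matches the peGinPCFGs label shown for each gene.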

Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez

| Partner | Gene | GO ID | GO term | PubMed ID |
|---|---|---|---|---|
| Tgene | ALK | GO:0016310 | phosphorylation | 9174053 |
| Tgene | ALK | GO:0046777 | protein autophosphorylation | 9174053 |


Fusion gene breakpoints across GTF2IRD1 (5'-gene)
[Figure: breakpoints across the GTF2IRD1 gene structure. On the original page, clicking the image opens the UCSC Genome Browser with a custom track in a new window.]
Fusion gene breakpoints across ALK (3'-gene)
[Figure: breakpoints across the ALK gene structure. On the original page, clicking the image opens the UCSC Genome Browser with a custom track in a new window.]

Fusion gene information
* All genome coordinates were lifted over to hg19.
* Click on the breakpoint to see the gene structure around the breakpoint region in the UCSC Genome Browser.

| Source | Disease | Sample | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand |
|---|---|---|---|---|---|---|---|---|---|---|
| ChimerDB4 | THCA | TCGA-EL-A4KD-01A | GTF2IRD1 | chr7 | 73935627 | - | ALK | chr2 | 29446394 | - |
| ChimerDB4 | THCA | TCGA-EL-A4KD-01A | GTF2IRD1 | chr7 | 73935627 | + | ALK | chr2 | 29446394 | - |



Fusion Gene ORF analysis for GTF2IRD1-ALK

Open reading frame (ORF) analysis of fusion genes based on Ensembl gene isoform structure.
* Click on the breakpoint to see the gene structure around the breakpoint region in the UCSC Genome Browser.

| ORF | Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand |
|---|---|---|---|---|---|---|---|---|---|---|
| 3UTR-3CDS | ENST00000489094 | ENST00000389048 | GTF2IRD1 | chr7 | 73935627 | + | ALK | chr2 | 29446394 | - |
| 3UTR-intron | ENST00000489094 | ENST00000431873 | GTF2IRD1 | chr7 | 73935627 | + | ALK | chr2 | 29446394 | - |
| 3UTR-intron | ENST00000489094 | ENST00000498037 | GTF2IRD1 | chr7 | 73935627 | + | ALK | chr2 | 29446394 | - |
| 5CDS-intron | ENST00000265755 | ENST00000431873 | GTF2IRD1 | chr7 | 73935627 | + | ALK | chr2 | 29446394 | - |
| 5CDS-intron | ENST00000265755 | ENST00000498037 | GTF2IRD1 | chr7 | 73935627 | + | ALK | chr2 | 29446394 | - |
| 5CDS-intron | ENST00000424337 | ENST00000431873 | GTF2IRD1 | chr7 | 73935627 | + | ALK | chr2 | 29446394 | - |
| 5CDS-intron | ENST00000424337 | ENST00000498037 | GTF2IRD1 | chr7 | 73935627 | + | ALK | chr2 | 29446394 | - |
| 5CDS-intron | ENST00000455841 | ENST00000431873 | GTF2IRD1 | chr7 | 73935627 | + | ALK | chr2 | 29446394 | - |
| 5CDS-intron | ENST00000455841 | ENST00000498037 | GTF2IRD1 | chr7 | 73935627 | + | ALK | chr2 | 29446394 | - |
| 5CDS-intron | ENST00000476977 | ENST00000431873 | GTF2IRD1 | chr7 | 73935627 | + | ALK | chr2 | 29446394 | - |
| 5CDS-intron | ENST00000476977 | ENST00000498037 | GTF2IRD1 | chr7 | 73935627 | + | ALK | chr2 | 29446394 | - |
| In-frame | ENST00000265755 | ENST00000389048 | GTF2IRD1 | chr7 | 73935627 | + | ALK | chr2 | 29446394 | - |
| In-frame | ENST00000424337 | ENST00000389048 | GTF2IRD1 | chr7 | 73935627 | + | ALK | chr2 | 29446394 | - |
| In-frame | ENST00000455841 | ENST00000389048 | GTF2IRD1 | chr7 | 73935627 | + | ALK | chr2 | 29446394 | - |
| In-frame | ENST00000476977 | ENST00000389048 | GTF2IRD1 | chr7 | 73935627 | + | ALK | chr2 | 29446394 | - |

ORFfinder result based on the fusion transcript sequence of in-frame fusion genes.

| Henst | Hgene | Hchr | Hbp | Hstrand | Tenst | Tgene | Tchr | Tbp | Tstrand | Seq length (transcript) | BP loci (transcript) | Predicted start (transcript) | Predicted stop (transcript) | Seq length (amino acids) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ENST00000265755 | GTF2IRD1 | chr7 | 73935627 | + | ENST00000389048 | ALK | chr2 | 29446394 | - | 3540 | 1399 | 330 | 3089 | 919 |
| ENST00000455841 | GTF2IRD1 | chr7 | 73935627 | + | ENST00000389048 | ALK | chr2 | 29446394 | - | 3456 | 1315 | 150 | 3005 | 951 |
| ENST00000424337 | GTF2IRD1 | chr7 | 73935627 | + | ENST00000389048 | ALK | chr2 | 29446394 | - | 3221 | 1080 | 11 | 2770 | 919 |
| ENST00000476977 | GTF2IRD1 | chr7 | 73935627 | + | ENST00000389048 | ALK | chr2 | 29446394 | - | 4838 | 2697 | 1691 | 4387 | 898 |
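
As a consistency check on the ORFfinder coordinates, the predicted start and stop positions can be related to the reported protein length. The sketch below assumes that the start and stop are 1-based inclusive transcript positions and that the stop position includes the stop codon. That convention is an inference from the numbers on this page, not something the page states, and `orf_protein_length` is a hypothetical helper name.

```python
def orf_protein_length(start: int, stop: int) -> int:
    """Amino acid count of an ORF given 1-based inclusive transcript coordinates.

    Assumes the stop coordinate is the last base of the stop codon, so one
    codon is subtracted from the total.
    """
    n_nt = stop - start + 1
    if n_nt % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    return n_nt // 3 - 1


# (Henst, predicted start, predicted stop, reported amino acid length)
orf_rows = [
    ("ENST00000265755", 330, 3089, 919),
    ("ENST00000455841", 150, 3005, 951),
    ("ENST00000424337", 11, 2770, 919),
    ("ENST00000476977", 1691, 4387, 898),
]

for henst, start, stop, reported in orf_rows:
    print(henst, orf_protein_length(start, stop), reported)
```

Under this assumed convention, all four computed lengths agree with the reported amino acid lengths.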

DeepORF prediction of coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on a convolutional neural network and built by comparison with real Ribo-seq data. If the no-coding score is < 0.5 and the coding score is > 0.5, the in-frame fusion transcript is predicted as likely to be translated.

| Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | No-coding score | Coding score |
|---|---|---|---|---|---|---|---|---|---|---|---|
| ENST00000265755 | ENST00000389048 | GTF2IRD1 | chr7 | 73935627 | + | ALK | chr2 | 29446394 | - | 0.005352864 | 0.9946471 |
| ENST00000455841 | ENST00000389048 | GTF2IRD1 | chr7 | 73935627 | + | ALK | chr2 | 29446394 | - | 0.004399282 | 0.9956007 |
| ENST00000424337 | ENST00000389048 | GTF2IRD1 | chr7 | 73935627 | + | ALK | chr2 | 29446394 | - | 0.005383978 | 0.994616 |
| ENST00000476977 | ENST00000389048 | GTF2IRD1 | chr7 | 73935627 | + | ALK | chr2 | 29446394 | - | 0.01181 | 0.98819005 |
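
The decision rule quoted for DeepORF (no-coding score < 0.5 and coding score > 0.5) can be applied to the four score pairs listed for this fusion. This is only a sketch of the thresholding step; it does not reproduce the DeepORF model itself, and the function name `predicted_translated` is an illustrative assumption.

```python
def predicted_translated(no_coding: float, coding: float, threshold: float = 0.5) -> bool:
    """Apply the quoted rule: no-coding score < 0.5 and coding score > 0.5."""
    return no_coding < threshold and coding > threshold


# Henst -> (no-coding score, coding score), as listed on this page.
deeporf_scores = {
    "ENST00000265755": (0.005352864, 0.9946471),
    "ENST00000455841": (0.004399282, 0.9956007),
    "ENST00000424337": (0.005383978, 0.994616),
    "ENST00000476977": (0.01181, 0.98819005),
}

calls = {henst: predicted_translated(nc, c) for henst, (nc, c) in deeporf_scores.items()}
for henst, call in calls.items():
    print(henst, "likely translated" if call else "not predicted as translated")
```

By this rule, all four in-frame fusion transcripts are predicted as likely translated.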


Fusion Genomic Features for GTF2IRD1-ALK


FusionAI prediction of potential fusion gene breakpoints based on the pre-mRNA sequence context (+/- 5 kb for each partner gene, 20 kb of sequence in total). FusionAI is a fusion gene breakpoint classifier based on a convolutional neural network, built by comparing the fusion-positive and fusion-negative sequence contexts of ~20K fusion genes. From this, we can estimate how likely each individual sequence within the 20 kb genomic sequence is to be used as a gene fusion breakpoint.
| Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | 1-p | p (fusion gene breakpoint) |

Distribution of 44 human genomic feature loci across the 20 kb fusion breakpoint regions. We integrated 44 types of human genomic feature loci in five broad categories: virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are on the help page.
[Figure: genomic feature distribution]


Fusion Protein Features for GTF2IRD1-ALK


Four levels of functional features of fusion genes
Go to the FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr7:73935627/chr2:29446394)
- FGviewer provides online visualization of the retention of protein functional features across the DNA, RNA, protein, and pathological levels.
- How to search
1. Enter your fusion gene symbol.
2. Press the Tab key until the breakpoint information is filled in.
3. Scroll down and press the 'Search' tab twice.
4. Scroll down to find the hyperlink to the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.

Main function of each fusion partner protein (from UniProt)

| Hgene | Tgene |
|---|---|
| . | ALK (Q9UM73) |

FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.

FUNCTION: Neuronal receptor tyrosine kinase that is essentially and transiently expressed in specific regions of the central and peripheral nervous systems and plays an important role in the genesis and differentiation of the nervous system. Transduces signals from ligands at the cell surface, through specific activation of the mitogen-activated protein kinase (MAPK) pathway. Phosphorylates almost exclusively at the first tyrosine of the Y-x-x-x-Y-Y motif. Following activation by ligand, ALK induces tyrosine phosphorylation of CBL, FRS2, IRS1 and SHC1, as well as of the MAP kinases MAPK1/ERK2 and MAPK3/ERK1. Acts as a receptor for ligands pleiotrophin (PTN), a secreted growth factor, and midkine (MDK), a PTN-related factor, thus participating in PTN and MDK signal transduction. PTN-binding induces MAPK pathway activation, which is important for the anti-apoptotic signaling of PTN and regulation of cell proliferation. MDK-binding induces phosphorylation of the ALK target insulin receptor substrate (IRS1), activates mitogen-activated protein kinases (MAPKs) and PI3-kinase, resulting also in cell proliferation induction. Drives NF-kappa-B activation, probably through IRS1 and the activation of the AKT serine/threonine kinase. Recruitment of IRS1 to activated ALK and the activation of NF-kappa-B are essential for the autocrine growth and survival signaling of MDK. Thinness gene involved in the resistance to weight gain: in hypothalamic neurons, controls energy expenditure acting as a negative regulator of white adipose tissue lipolysis and sympathetic tone to fine-tune energy homeostasis (By similarity). {ECO:0000250|UniProtKB:P97793, ECO:0000269|PubMed:11121404, ECO:0000269|PubMed:11278720, ECO:0000269|PubMed:11387242, ECO:0000269|PubMed:11809760, ECO:0000269|PubMed:12107166, ECO:0000269|PubMed:12122009, ECO:0000269|PubMed:15226403, ECO:0000269|PubMed:15908427, ECO:0000269|PubMed:16317043, ECO:0000269|PubMed:16878150, ECO:0000269|PubMed:17274988}.

Retention analysis of each fusion partner protein across 39 UniProt protein features: six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and three secondary structure features. Because of limited viewing space, only the retention information for the 13 region features is shown here. All retention annotation results can be downloaded from the download page.

* A negative BPloci value means that the breakpoint is located before the CDS.
- In-frame and retained protein features among the 13 region features.

| Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Hgene | GTF2IRD1 | chr7:73935627 | chr2:29446394 | ENST00000265755 | + | 7 | 27 | 119_213 | 335 | 960.0 | Repeat | Note=GTF2I-like 1 |
| Hgene | GTF2IRD1 | chr7:73935627 | chr2:29446394 | ENST00000424337 | + | 7 | 27 | 119_213 | 335 | 945.0 | Repeat | Note=GTF2I-like 1 |
| Hgene | GTF2IRD1 | chr7:73935627 | chr2:29446394 | ENST00000455841 | + | 7 | 27 | 119_213 | 367 | 977.0 | Repeat | Note=GTF2I-like 1 |
| Tgene | ALK | chr7:73935627 | chr2:29446394 | ENST00000389048 | | 18 | 29 | 1116_1392 | 1057 | 1621.0 | Domain | Protein kinase |
| Tgene | ALK | chr7:73935627 | chr2:29446394 | ENST00000389048 | | 18 | 29 | 1197_1199 | 1057 | 1621.0 | Region | Note=Inhibitor binding |
| Tgene | ALK | chr7:73935627 | chr2:29446394 | ENST00000389048 | | 18 | 29 | 1060_1620 | 1057 | 1621.0 | Topological domain | Cytoplasmic |

- In-frame and not-retained protein features among the 13 region features.

| Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Hgene | GTF2IRD1 | chr7:73935627 | chr2:29446394 | ENST00000265755 | + | 7 | 27 | 906_930 | 335 | 960.0 | Compositional bias | Note=Ser-rich |
| Hgene | GTF2IRD1 | chr7:73935627 | chr2:29446394 | ENST00000424337 | + | 7 | 27 | 906_930 | 335 | 945.0 | Compositional bias | Note=Ser-rich |
| Hgene | GTF2IRD1 | chr7:73935627 | chr2:29446394 | ENST00000455841 | + | 7 | 27 | 906_930 | 367 | 977.0 | Compositional bias | Note=Ser-rich |
| Hgene | GTF2IRD1 | chr7:73935627 | chr2:29446394 | ENST00000265755 | + | 7 | 27 | 898_905 | 335 | 960.0 | Motif | Nuclear localization signal |
| Hgene | GTF2IRD1 | chr7:73935627 | chr2:29446394 | ENST00000424337 | + | 7 | 27 | 898_905 | 335 | 945.0 | Motif | Nuclear localization signal |
| Hgene | GTF2IRD1 | chr7:73935627 | chr2:29446394 | ENST00000455841 | + | 7 | 27 | 898_905 | 367 | 977.0 | Motif | Nuclear localization signal |
| Hgene | GTF2IRD1 | chr7:73935627 | chr2:29446394 | ENST00000265755 | + | 7 | 27 | 342_436 | 335 | 960.0 | Repeat | Note=GTF2I-like 2 |
| Hgene | GTF2IRD1 | chr7:73935627 | chr2:29446394 | ENST00000265755 | + | 7 | 27 | 556_650 | 335 | 960.0 | Repeat | Note=GTF2I-like 3 |
| Hgene | GTF2IRD1 | chr7:73935627 | chr2:29446394 | ENST00000265755 | + | 7 | 27 | 696_790 | 335 | 960.0 | Repeat | Note=GTF2I-like 4 |
| Hgene | GTF2IRD1 | chr7:73935627 | chr2:29446394 | ENST00000265755 | + | 7 | 27 | 793_887 | 335 | 960.0 | Repeat | Note=GTF2I-like 5 |
| Hgene | GTF2IRD1 | chr7:73935627 | chr2:29446394 | ENST00000424337 | + | 7 | 27 | 342_436 | 335 | 945.0 | Repeat | Note=GTF2I-like 2 |
| Hgene | GTF2IRD1 | chr7:73935627 | chr2:29446394 | ENST00000424337 | + | 7 | 27 | 556_650 | 335 | 945.0 | Repeat | Note=GTF2I-like 3 |
| Hgene | GTF2IRD1 | chr7:73935627 | chr2:29446394 | ENST00000424337 | + | 7 | 27 | 696_790 | 335 | 945.0 | Repeat | Note=GTF2I-like 4 |
| Hgene | GTF2IRD1 | chr7:73935627 | chr2:29446394 | ENST00000424337 | + | 7 | 27 | 793_887 | 335 | 945.0 | Repeat | Note=GTF2I-like 5 |
| Hgene | GTF2IRD1 | chr7:73935627 | chr2:29446394 | ENST00000455841 | + | 7 | 27 | 342_436 | 367 | 977.0 | Repeat | Note=GTF2I-like 2 |
| Hgene | GTF2IRD1 | chr7:73935627 | chr2:29446394 | ENST00000455841 | + | 7 | 27 | 556_650 | 367 | 977.0 | Repeat | Note=GTF2I-like 3 |
| Hgene | GTF2IRD1 | chr7:73935627 | chr2:29446394 | ENST00000455841 | + | 7 | 27 | 696_790 | 367 | 977.0 | Repeat | Note=GTF2I-like 4 |
| Hgene | GTF2IRD1 | chr7:73935627 | chr2:29446394 | ENST00000455841 | + | 7 | 27 | 793_887 | 367 | 977.0 | Repeat | Note=GTF2I-like 5 |
| Tgene | ALK | chr7:73935627 | chr2:29446394 | ENST00000389048 | | 18 | 29 | 816_940 | 1057 | 1621.0 | Compositional bias | Note=Gly-rich |
| Tgene | ALK | chr7:73935627 | chr2:29446394 | ENST00000389048 | | 18 | 29 | 264_427 | 1057 | 1621.0 | Domain | MAM 1 |
| Tgene | ALK | chr7:73935627 | chr2:29446394 | ENST00000389048 | | 18 | 29 | 437_473 | 1057 | 1621.0 | Domain | Note=LDL-receptor class A |
| Tgene | ALK | chr7:73935627 | chr2:29446394 | ENST00000389048 | | 18 | 29 | 478_636 | 1057 | 1621.0 | Domain | MAM 2 |
| Tgene | ALK | chr7:73935627 | chr2:29446394 | ENST00000389048 | | 18 | 29 | 19_1038 | 1057 | 1621.0 | Topological domain | Extracellular |
| Tgene | ALK | chr7:73935627 | chr2:29446394 | ENST00000389048 | | 18 | 29 | 1039_1059 | 1057 | 1621.0 | Transmembrane | Helical |
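
The retained and not-retained listings above are consistent with a simple interval rule, sketched below. The rule is an assumption inferred from the rows on this page, not a documented FusionGDB algorithm: a 5'-partner (Hgene) feature counts as retained when it ends at or before the breakpoint residue (BPloci), and a 3'-partner (Tgene) feature counts as retained when it starts at or after it. The function name `is_retained` is hypothetical.

```python
def is_retained(partner: str, feature_start: int, feature_end: int, bp_locus: int) -> bool:
    """Assumed retention rule for a protein feature given the breakpoint residue.

    Hgene (5' partner) keeps residues up to the breakpoint, so the whole
    feature must end at or before it. Tgene (3' partner) keeps residues from
    the breakpoint onward, so the whole feature must start at or after it.
    """
    if partner == "Hgene":
        return feature_end <= bp_locus
    if partner == "Tgene":
        return feature_start >= bp_locus
    raise ValueError(f"unknown partner: {partner!r}")


# (partner, feature start, feature end, BPloci, feature note), taken from the rows above.
examples = [
    ("Hgene", 119, 213, 335, "GTF2I-like 1"),
    ("Hgene", 342, 436, 335, "GTF2I-like 2"),
    ("Tgene", 1116, 1392, 1057, "Protein kinase"),
    ("Tgene", 1039, 1059, 1057, "Transmembrane, helical"),
    ("Tgene", 19, 1038, 1057, "Extracellular"),
]

for partner, start, end, bp, note in examples:
    status = "retained" if is_retained(partner, start, end, bp) else "not retained"
    print(partner, note, status)
```

Under this rule the ALK protein kinase domain is retained while the transmembrane and extracellular regions are not, in agreement with the listings. How a feature that straddles the breakpoint is treated in the actual pipeline is not stated on the page; the sketch treats it as not retained.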



Fusion Gene Sequence for GTF2IRD1-ALK


For in-frame fusion transcripts, we provide the fusion transcript sequences and the fusion amino acid sequences. To obtain each fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among all predicted ones.
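
The idea of choosing the longest ORF can be illustrated with a small scan over the three forward reading frames. This is a simplified sketch, not NCBI ORFfinder: it considers only ATG start codons, only the forward strand, and only ORFs closed by a stop codon. The function name `longest_orf` and the toy sequence are assumptions made for illustration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_orf(seq: str):
    """Return (start, stop, n_codons) for the longest ATG..stop ORF.

    Coordinates are 1-based and inclusive, and stop is the last base of the
    stop codon. n_codons excludes the stop codon. Returns None if no complete
    ORF is found on the forward strand.
    """
    seq = seq.upper()
    best = None
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first ATG since the previous stop in this frame
            elif start is not None and codon in STOP_CODONS:
                n_codons = (i - start) // 3
                if best is None or n_codons > best[2]:
                    best = (start + 1, i + 3, n_codons)
                start = None  # look for the next ORF in this frame
    return best


# Toy sequence with two ORFs in different frames; the longer one is reported.
toy = "CCATGAAATTTGGGTAGCCATGCCCTAA"
print(longest_orf(toy))
```

For a real fusion transcript, the returned start and stop would play the role of the "Predicted start" and "Predicted stop" transcript positions, under the assumptions stated above.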
>35245_35245_1_GTF2IRD1-ALK_GTF2IRD1_chr7_73935627_ENST00000265755_ALK_chr2_29446394_ENST00000389048_length(transcript)=3540nt_BP=1399nt
TAAATGGCAGCCAATGGAGGGTGGTGTTGCGCGGGGCTGGGATTAGGGCCGGGGCGAATGGCTGGCAATCTTACTGGGATTACAGAACAA
AGAGCCTCCCCGCGCTCCCGCTCTCCGCTCCTCTCCCCGCGCCGCCCCGCCCTCCGCCGCAGCCCGCGCCGGGGGTGGGGGCCGCCGAGC
GCCAGCCCCCCGGCCGGCCGATTCCCCCCCCGCGCCCCCTCCCCGCGCCTCCCTCCCCGCCCTCGCCGCGCCGCCGTCCTCGCCTCCCTC
TGCCTCTCCTTCCCCCATTCTCCCGGATTAATTAAGGAGGCAGCGGCAGGAGGCTGAGTCCTGGCCGCGGGCCGGGGCCGGGGCGCCGCT
GGCAGGAGCGCTTGGGGATCCTCCAAGGCGACCATGGCCTTGCTGGGTAAGCGCTGTGACGTCCCCACCAACGGCTGCGGACCCGACCGC
TGGAACTCCGCGTTCACCCGCAAAGACGAGATCATCACCAGCCTCGTGTCTGCCTTAGACTCCATGTGCTCAGCGCTGTCCAAACTGAAC
GCCGAGGTGGCCTGTGTCGCCGTGCACGATGAGAGCGCCTTTGTGGTGGGCACAGAGAAGGGGAGAATGTTCCTGAATGCCCGGAAGGAG
CTACAGTCAGACTTCCTCAGGTTCTGCCGAGGGCCCCCGTGGAAGGATCCGGAGGCAGAGCACCCCAAGAAGGTGCAGCGGGGCGAGGGT
GGAGGCCGTAGCCTCCCTCGGTCCTCCCTGGAACATGGCTCAGATGTGTACCTTCTGCGGAAGATGGTAGAGGAGGTGTTTGATGTTCTT
TATAGCGAGGCCCTGGGAAGGGCCAGTGTGGTGCCACTGCCCTATGAGAGGCTGCTCAGGGAGCCAGGGCTGCTGGCCGTGCAGGGGCTG
CCCGAAGGCCTGGCCTTCCGAAGGCCAGCCGAGTATGACCCCAAGGCCCTCATGGCCATCCTGGAACACAGCCACCGCATCCGCTTCAAG
CTCAAGAGGCCACTTGAGGATGGCGGGCGGGACTCGAAGGCCCTGGTGGAGCTGAACGGTGTCTCCCTGATTCCCAAGGGGTCACGGGAC
TGTGGCCTGCATGGCCAGGCCCCCAAGGTGCCACCCCAGGACCTGCCCCCAACCGCCACCTCCTCCTCCATGGCCAGCTTCCTGTACAGC
ACGGCGCTCCCCAACCACGCCATCCGAGAGCTCAAGCAGGAAGCACCTTCCTGCCCCCTTGCCCCCAGCGACCTGGGCCTGAGTCGGCCC
ATGCCAGAGCCCAAGGCCACCGGTGCCCAAGACTTCTCCGACTGTTGTGGACAGAAGCCCACTGGGCCTGGTGGGCCTCTCATCCAGAAC
GTCCATGCCTCCAAGCGCATTCTCTTCTCCATCGTCCATGACAAGTCAGTGTACCGCCGGAAGCACCAGGAGCTGCAAGCCATGCAGATG
GAGCTGCAGAGCCCTGAGTACAAGCTGAGCAAGCTCCGCACCTCGACCATCATGACCGACTACAACCCCAACTACTGCTTTGCTGGCAAG
ACCTCCTCCATCAGTGACCTGAAGGAGGTGCCGCGGAAAAACATCACCCTCATTCGGGGTCTGGGCCATGGCGCCTTTGGGGAGGTGTAT
GAAGGCCAGGTGTCCGGAATGCCCAACGACCCAAGCCCCCTGCAAGTGGCTGTGAAGACGCTGCCTGAAGTGTGCTCTGAACAGGACGAA
CTGGATTTCCTCATGGAAGCCCTGATCATCAGCAAATTCAACCACCAGAACATTGTTCGCTGCATTGGGGTGAGCCTGCAATCCCTGCCC
CGGTTCATCCTGCTGGAGCTCATGGCGGGGGGAGACCTCAAGTCCTTCCTCCGAGAGACCCGCCCTCGCCCGAGCCAGCCCTCCTCCCTG
GCCATGCTGGACCTTCTGCACGTGGCTCGGGACATTGCCTGTGGCTGTCAGTATTTGGAGGAAAACCACTTCATCCACCGAGACATTGCT
GCCAGAAACTGCCTCTTGACCTGTCCAGGCCCTGGAAGAGTGGCCAAGATTGGAGACTTCGGGATGGCCCGAGACATCTACAGGGCGAGC
TACTATAGAAAGGGAGGCTGTGCCATGCTGCCAGTTAAGTGGATGCCCCCAGAGGCCTTCATGGAAGGAATATTCACTTCTAAAACAGAC
ACATGGTCCTTTGGAGTGCTGCTATGGGAAATCTTTTCTCTTGGATATATGCCATACCCCAGCAAAAGCAACCAGGAAGTTCTGGAGTTT
GTCACCAGTGGAGGCCGGATGGACCCACCCAAGAACTGCCCTGGGCCTGTATACCGGATAATGACTCAGTGCTGGCAACATCAGCCTGAA
GACAGGCCCAACTTTGCCATCATTTTGGAGAGGATTGAATACTGCACCCAGGACCCGGATGTAATCAACACCGCTTTGCCGATAGAATAT
GGTCCACTTGTGGAAGAGGAAGAGAAAGTGCCTGTGAGGCCCAAGGACCCTGAGGGGGTTCCTCCTCTCCTGGTCTCTCAACAGGCAAAA
CGGGAGGAGGAGCGCAGCCCAGCTGCCCCACCACCTCTGCCTACCACCTCCTCTGGCAAGGCTGCAAAGAAACCCACAGCTGCAGAGATC
TCTGTTCGAGTCCCTAGAGGGCCGGCCGTGGAAGGGGGACACGTGAATATGGCATTCTCTCAGTCCAACCCTCCTTCGGAGTTGCACAAG
GTCCACGGATCCAGAAACAAGCCCACCAGCTTGTGGAACCCAACGTACGGCTCCTGGTTTACAGAGAAACCCACCAAAAAGAATAATCCT
ATAGCAAAGAAGGAGCCACACGACAGGGGTAACCTGGGGCTGGAGGGAAGCTGTACTGTCCCACCTAACGTTGCAACTGGGAGACTTCCG
GGGGCCTCACTGCTCCTAGAGCCCTCTTCGCTGACTGCCAATATGAAGGAGGTACCTCTGTTCAGGCTACGTCACTTCCCTTGTGGGAAT
GTCAATTACGGCTACCAGCAACAGGGCTTGCCCTTAGAAGCCGCTACTGCCCCTGGAGCTGGTCATTACGAGGATACCATTCTGAAAAGC
AAGAATAGCATGAACCAGCCTGGGCCCTGAGCTCGGTCGCACACTCACTTCTCTTCCTTGGGATCCCTAAGACCGTGGAGGAGAGAGAGG
CAATGGCTCCTTCACAAACCAGAGACCAAATGTCACGTTTTGTTTTGTGCCAACCTATTTTGAAGTACCACCAAAAAAGCTGTATTTTGA
AAATGCTTTAGAAAGGTTTTGAGCATGGGTTCATCCTATTCTTTCGAAAGAAGAAAATATCATAAAAATGAGTGATAAATACAAGGCCCA
GATGTGGTTGCATAAGGTTTTTATGCATGTTTGTTGTATACTTCCTTATGCTTCTTTCAAATTGTGTGTGCTCTGCTTCAATGTAGTCAG
AATTAGCTGCTTCTATGTTTCATAGTTGGGGTCATAGATGTTTCCTTGCCTTGTTGATGTGGACATGAGCCATTTGAGGGGAGAGGGAAC

>35245_35245_1_GTF2IRD1-ALK_GTF2IRD1_chr7_73935627_ENST00000265755_ALK_chr2_29446394_ENST00000389048_length(amino acids)=919AA_BP=356
MAAGRGRGAAGRSAWGSSKATMALLGKRCDVPTNGCGPDRWNSAFTRKDEIITSLVSALDSMCSALSKLNAEVACVAVHDESAFVVGTEK
GRMFLNARKELQSDFLRFCRGPPWKDPEAEHPKKVQRGEGGGRSLPRSSLEHGSDVYLLRKMVEEVFDVLYSEALGRASVVPLPYERLLR
EPGLLAVQGLPEGLAFRRPAEYDPKALMAILEHSHRIRFKLKRPLEDGGRDSKALVELNGVSLIPKGSRDCGLHGQAPKVPPQDLPPTAT
SSSMASFLYSTALPNHAIRELKQEAPSCPLAPSDLGLSRPMPEPKATGAQDFSDCCGQKPTGPGGPLIQNVHASKRILFSIVHDKSVYRR
KHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKT
LPEVCSEQDELDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLE
ENHFIHRDIAARNCLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYP
SKSNQEVLEFVTSGGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGV
PPLLVSQQAKREEERSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWF
TEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTANMKEVPLFRLRHFPCGNVNYGYQQQGLPLEAATAPGA

--------------------------------------------------------------
>35245_35245_2_GTF2IRD1-ALK_GTF2IRD1_chr7_73935627_ENST00000424337_ALK_chr2_29446394_ENST00000389048_length(transcript)=3221nt_BP=1080nt
GAGGCTGAGTCCTGGCCGCGGGCCGGGGCCGGGGCGCCGCTGGCAGGAGCGCTTGGGGATCCTCCAAGGCGACCATGGCCTTGCTGGGTA
AGCGCTGTGACGTCCCCACCAACGGCTGCGGACCCGACCGCTGGAACTCCGCGTTCACCCGCAAAGACGAGATCATCACCAGCCTCGTGT
CTGCCTTAGACTCCATGTGCTCAGCGCTGTCCAAACTGAACGCCGAGGTGGCCTGTGTCGCCGTGCACGATGAGAGCGCCTTTGTGGTGG
GCACAGAGAAGGGGAGAATGTTCCTGAATGCCCGGAAGGAGCTACAGTCAGACTTCCTCAGGTTCTGCCGAGGGCCCCCGTGGAAGGATC
CGGAGGCAGAGCACCCCAAGAAGGTGCAGCGGGGCGAGGGTGGAGGCCGTAGCCTCCCTCGGTCCTCCCTGGAACATGGCTCAGATGTGT
ACCTTCTGCGGAAGATGGTAGAGGAGGTGTTTGATGTTCTTTATAGCGAGGCCCTGGGAAGGGCCAGTGTGGTGCCACTGCCCTATGAGA
GGCTGCTCAGGGAGCCAGGGCTGCTGGCCGTGCAGGGGCTGCCCGAAGGCCTGGCCTTCCGAAGGCCAGCCGAGTATGACCCCAAGGCCC
TCATGGCCATCCTGGAACACAGCCACCGCATCCGCTTCAAGCTCAAGAGGCCACTTGAGGATGGCGGGCGGGACTCGAAGGCCCTGGTGG
AGCTGAACGGTGTCTCCCTGATTCCCAAGGGGTCACGGGACTGTGGCCTGCATGGCCAGGCCCCCAAGGTGCCACCCCAGGACCTGCCCC
CAACCGCCACCTCCTCCTCCATGGCCAGCTTCCTGTACAGCACGGCGCTCCCCAACCACGCCATCCGAGAGCTCAAGCAGGAAGCACCTT
CCTGCCCCCTTGCCCCCAGCGACCTGGGCCTGAGTCGGCCCATGCCAGAGCCCAAGGCCACCGGTGCCCAAGACTTCTCCGACTGTTGTG
GACAGAAGCCCACTGGGCCTGGTGGGCCTCTCATCCAGAACGTCCATGCCTCCAAGCGCATTCTCTTCTCCATCGTCCATGACAAGTCAG
TGTACCGCCGGAAGCACCAGGAGCTGCAAGCCATGCAGATGGAGCTGCAGAGCCCTGAGTACAAGCTGAGCAAGCTCCGCACCTCGACCA
TCATGACCGACTACAACCCCAACTACTGCTTTGCTGGCAAGACCTCCTCCATCAGTGACCTGAAGGAGGTGCCGCGGAAAAACATCACCC
TCATTCGGGGTCTGGGCCATGGCGCCTTTGGGGAGGTGTATGAAGGCCAGGTGTCCGGAATGCCCAACGACCCAAGCCCCCTGCAAGTGG
CTGTGAAGACGCTGCCTGAAGTGTGCTCTGAACAGGACGAACTGGATTTCCTCATGGAAGCCCTGATCATCAGCAAATTCAACCACCAGA
ACATTGTTCGCTGCATTGGGGTGAGCCTGCAATCCCTGCCCCGGTTCATCCTGCTGGAGCTCATGGCGGGGGGAGACCTCAAGTCCTTCC
TCCGAGAGACCCGCCCTCGCCCGAGCCAGCCCTCCTCCCTGGCCATGCTGGACCTTCTGCACGTGGCTCGGGACATTGCCTGTGGCTGTC
AGTATTTGGAGGAAAACCACTTCATCCACCGAGACATTGCTGCCAGAAACTGCCTCTTGACCTGTCCAGGCCCTGGAAGAGTGGCCAAGA
TTGGAGACTTCGGGATGGCCCGAGACATCTACAGGGCGAGCTACTATAGAAAGGGAGGCTGTGCCATGCTGCCAGTTAAGTGGATGCCCC
CAGAGGCCTTCATGGAAGGAATATTCACTTCTAAAACAGACACATGGTCCTTTGGAGTGCTGCTATGGGAAATCTTTTCTCTTGGATATA
TGCCATACCCCAGCAAAAGCAACCAGGAAGTTCTGGAGTTTGTCACCAGTGGAGGCCGGATGGACCCACCCAAGAACTGCCCTGGGCCTG
TATACCGGATAATGACTCAGTGCTGGCAACATCAGCCTGAAGACAGGCCCAACTTTGCCATCATTTTGGAGAGGATTGAATACTGCACCC
AGGACCCGGATGTAATCAACACCGCTTTGCCGATAGAATATGGTCCACTTGTGGAAGAGGAAGAGAAAGTGCCTGTGAGGCCCAAGGACC
CTGAGGGGGTTCCTCCTCTCCTGGTCTCTCAACAGGCAAAACGGGAGGAGGAGCGCAGCCCAGCTGCCCCACCACCTCTGCCTACCACCT
CCTCTGGCAAGGCTGCAAAGAAACCCACAGCTGCAGAGATCTCTGTTCGAGTCCCTAGAGGGCCGGCCGTGGAAGGGGGACACGTGAATA
TGGCATTCTCTCAGTCCAACCCTCCTTCGGAGTTGCACAAGGTCCACGGATCCAGAAACAAGCCCACCAGCTTGTGGAACCCAACGTACG
GCTCCTGGTTTACAGAGAAACCCACCAAAAAGAATAATCCTATAGCAAAGAAGGAGCCACACGACAGGGGTAACCTGGGGCTGGAGGGAA
GCTGTACTGTCCCACCTAACGTTGCAACTGGGAGACTTCCGGGGGCCTCACTGCTCCTAGAGCCCTCTTCGCTGACTGCCAATATGAAGG
AGGTACCTCTGTTCAGGCTACGTCACTTCCCTTGTGGGAATGTCAATTACGGCTACCAGCAACAGGGCTTGCCCTTAGAAGCCGCTACTG
CCCCTGGAGCTGGTCATTACGAGGATACCATTCTGAAAAGCAAGAATAGCATGAACCAGCCTGGGCCCTGAGCTCGGTCGCACACTCACT
TCTCTTCCTTGGGATCCCTAAGACCGTGGAGGAGAGAGAGGCAATGGCTCCTTCACAAACCAGAGACCAAATGTCACGTTTTGTTTTGTG
CCAACCTATTTTGAAGTACCACCAAAAAAGCTGTATTTTGAAAATGCTTTAGAAAGGTTTTGAGCATGGGTTCATCCTATTCTTTCGAAA
GAAGAAAATATCATAAAAATGAGTGATAAATACAAGGCCCAGATGTGGTTGCATAAGGTTTTTATGCATGTTTGTTGTATACTTCCTTAT
GCTTCTTTCAAATTGTGTGTGCTCTGCTTCAATGTAGTCAGAATTAGCTGCTTCTATGTTTCATAGTTGGGGTCATAGATGTTTCCTTGC

>35245_35245_2_GTF2IRD1-ALK_GTF2IRD1_chr7_73935627_ENST00000424337_ALK_chr2_29446394_ENST00000389048_length(amino acids)=919AA_BP=356
MAAGRGRGAAGRSAWGSSKATMALLGKRCDVPTNGCGPDRWNSAFTRKDEIITSLVSALDSMCSALSKLNAEVACVAVHDESAFVVGTEK
GRMFLNARKELQSDFLRFCRGPPWKDPEAEHPKKVQRGEGGGRSLPRSSLEHGSDVYLLRKMVEEVFDVLYSEALGRASVVPLPYERLLR
EPGLLAVQGLPEGLAFRRPAEYDPKALMAILEHSHRIRFKLKRPLEDGGRDSKALVELNGVSLIPKGSRDCGLHGQAPKVPPQDLPPTAT
SSSMASFLYSTALPNHAIRELKQEAPSCPLAPSDLGLSRPMPEPKATGAQDFSDCCGQKPTGPGGPLIQNVHASKRILFSIVHDKSVYRR
KHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKT
LPEVCSEQDELDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLE
ENHFIHRDIAARNCLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYP
SKSNQEVLEFVTSGGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGV
PPLLVSQQAKREEERSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWF
TEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTANMKEVPLFRLRHFPCGNVNYGYQQQGLPLEAATAPGA

--------------------------------------------------------------
>35245_35245_3_GTF2IRD1-ALK_GTF2IRD1_chr7_73935627_ENST00000455841_ALK_chr2_29446394_ENST00000389048_length(transcript)=3456nt_BP=1315nt
GCCAGCCCCCCGGCCGGCCGATTCCCCCCCCGCGCCCCCTCCCCGCGCCTCCCTCCCCGCCCTCGCCGCGCCGCCGTCCTCGCCTCCCTC
TGCCTCTCCTTCCCCCATTCTCCCGGATTAATTAAGGAGGCAGCGGCAGGAGGCTGAGTCCTGGCCGCGGGCCGGGGCCGGGGCGCCGCT
GGCAGGAGCGCTTGGGGATCCTCCAAGGCGACCATGGCCTTGCTGGGTAAGCGCTGTGACGTCCCCACCAACGGCTGCGGACCCGACCGC
TGGAACTCCGCGTTCACCCGCAAAGACGAGATCATCACCAGCCTCGTGTCTGCCTTAGACTCCATGTGCTCAGCGCTGTCCAAACTGAAC
GCCGAGGTGGCCTGTGTCGCCGTGCACGATGAGAGCGCCTTTGTGGTGGGCACAGAGAAGGGGAGAATGTTCCTGAATGCCCGGAAGGAG
CTACAGTCAGACTTCCTCAGGTTCTGCCTCTCCGCAGCTCAGCACAGGGCAGCGACATCCCAGCTCGAAGGCCGGGTGGTGAGACGGGTG
CTCACTGTGGCCTCGCGTGCTCTGTGTCCCACAGGAGGGCCCCCGTGGAAGGATCCGGAGGCAGAGCACCCCAAGAAGGTGCAGCGGGGC
GAGGGTGGAGGCCGTAGCCTCCCTCGGTCCTCCCTGGAACATGGCTCAGATGTGTACCTTCTGCGGAAGATGGTAGAGGAGGTGTTTGAT
GTTCTTTATAGCGAGGCCCTGGGAAGGGCCAGTGTGGTGCCACTGCCCTATGAGAGGCTGCTCAGGGAGCCAGGGCTGCTGGCCGTGCAG
GGGCTGCCCGAAGGCCTGGCCTTCCGAAGGCCAGCCGAGTATGACCCCAAGGCCCTCATGGCCATCCTGGAACACAGCCACCGCATCCGC
TTCAAGCTCAAGAGGCCACTTGAGGATGGCGGGCGGGACTCGAAGGCCCTGGTGGAGCTGAACGGTGTCTCCCTGATTCCCAAGGGGTCA
CGGGACTGTGGCCTGCATGGCCAGGCCCCCAAGGTGCCACCCCAGGACCTGCCCCCAACCGCCACCTCCTCCTCCATGGCCAGCTTCCTG
TACAGCACGGCGCTCCCCAACCACGCCATCCGAGAGCTCAAGCAGGAAGCACCTTCCTGCCCCCTTGCCCCCAGCGACCTGGGCCTGAGT
CGGCCCATGCCAGAGCCCAAGGCCACCGGTGCCCAAGACTTCTCCGACTGTTGTGGACAGAAGCCCACTGGGCCTGGTGGGCCTCTCATC
CAGAACGTCCATGCCTCCAAGCGCATTCTCTTCTCCATCGTCCATGACAAGTCAGTGTACCGCCGGAAGCACCAGGAGCTGCAAGCCATG
CAGATGGAGCTGCAGAGCCCTGAGTACAAGCTGAGCAAGCTCCGCACCTCGACCATCATGACCGACTACAACCCCAACTACTGCTTTGCT
GGCAAGACCTCCTCCATCAGTGACCTGAAGGAGGTGCCGCGGAAAAACATCACCCTCATTCGGGGTCTGGGCCATGGCGCCTTTGGGGAG
GTGTATGAAGGCCAGGTGTCCGGAATGCCCAACGACCCAAGCCCCCTGCAAGTGGCTGTGAAGACGCTGCCTGAAGTGTGCTCTGAACAG
GACGAACTGGATTTCCTCATGGAAGCCCTGATCATCAGCAAATTCAACCACCAGAACATTGTTCGCTGCATTGGGGTGAGCCTGCAATCC
CTGCCCCGGTTCATCCTGCTGGAGCTCATGGCGGGGGGAGACCTCAAGTCCTTCCTCCGAGAGACCCGCCCTCGCCCGAGCCAGCCCTCC
TCCCTGGCCATGCTGGACCTTCTGCACGTGGCTCGGGACATTGCCTGTGGCTGTCAGTATTTGGAGGAAAACCACTTCATCCACCGAGAC
ATTGCTGCCAGAAACTGCCTCTTGACCTGTCCAGGCCCTGGAAGAGTGGCCAAGATTGGAGACTTCGGGATGGCCCGAGACATCTACAGG
GCGAGCTACTATAGAAAGGGAGGCTGTGCCATGCTGCCAGTTAAGTGGATGCCCCCAGAGGCCTTCATGGAAGGAATATTCACTTCTAAA
ACAGACACATGGTCCTTTGGAGTGCTGCTATGGGAAATCTTTTCTCTTGGATATATGCCATACCCCAGCAAAAGCAACCAGGAAGTTCTG
GAGTTTGTCACCAGTGGAGGCCGGATGGACCCACCCAAGAACTGCCCTGGGCCTGTATACCGGATAATGACTCAGTGCTGGCAACATCAG
CCTGAAGACAGGCCCAACTTTGCCATCATTTTGGAGAGGATTGAATACTGCACCCAGGACCCGGATGTAATCAACACCGCTTTGCCGATA
GAATATGGTCCACTTGTGGAAGAGGAAGAGAAAGTGCCTGTGAGGCCCAAGGACCCTGAGGGGGTTCCTCCTCTCCTGGTCTCTCAACAG
GCAAAACGGGAGGAGGAGCGCAGCCCAGCTGCCCCACCACCTCTGCCTACCACCTCCTCTGGCAAGGCTGCAAAGAAACCCACAGCTGCA
GAGATCTCTGTTCGAGTCCCTAGAGGGCCGGCCGTGGAAGGGGGACACGTGAATATGGCATTCTCTCAGTCCAACCCTCCTTCGGAGTTG
CACAAGGTCCACGGATCCAGAAACAAGCCCACCAGCTTGTGGAACCCAACGTACGGCTCCTGGTTTACAGAGAAACCCACCAAAAAGAAT
AATCCTATAGCAAAGAAGGAGCCACACGACAGGGGTAACCTGGGGCTGGAGGGAAGCTGTACTGTCCCACCTAACGTTGCAACTGGGAGA
CTTCCGGGGGCCTCACTGCTCCTAGAGCCCTCTTCGCTGACTGCCAATATGAAGGAGGTACCTCTGTTCAGGCTACGTCACTTCCCTTGT
GGGAATGTCAATTACGGCTACCAGCAACAGGGCTTGCCCTTAGAAGCCGCTACTGCCCCTGGAGCTGGTCATTACGAGGATACCATTCTG
AAAAGCAAGAATAGCATGAACCAGCCTGGGCCCTGAGCTCGGTCGCACACTCACTTCTCTTCCTTGGGATCCCTAAGACCGTGGAGGAGA
GAGAGGCAATGGCTCCTTCACAAACCAGAGACCAAATGTCACGTTTTGTTTTGTGCCAACCTATTTTGAAGTACCACCAAAAAAGCTGTA
TTTTGAAAATGCTTTAGAAAGGTTTTGAGCATGGGTTCATCCTATTCTTTCGAAAGAAGAAAATATCATAAAAATGAGTGATAAATACAA
GGCCCAGATGTGGTTGCATAAGGTTTTTATGCATGTTTGTTGTATACTTCCTTATGCTTCTTTCAAATTGTGTGTGCTCTGCTTCAATGT
AGTCAGAATTAGCTGCTTCTATGTTTCATAGTTGGGGTCATAGATGTTTCCTTGCCTTGTTGATGTGGACATGAGCCATTTGAGGGGAGA

>35245_35245_3_GTF2IRD1-ALK_GTF2IRD1_chr7_73935627_ENST00000455841_ALK_chr2_29446394_ENST00000389048_length(amino acids)=951AA_BP=388
MAAGRGRGAAGRSAWGSSKATMALLGKRCDVPTNGCGPDRWNSAFTRKDEIITSLVSALDSMCSALSKLNAEVACVAVHDESAFVVGTEK
GRMFLNARKELQSDFLRFCLSAAQHRAATSQLEGRVVRRVLTVASRALCPTGGPPWKDPEAEHPKKVQRGEGGGRSLPRSSLEHGSDVYL
LRKMVEEVFDVLYSEALGRASVVPLPYERLLREPGLLAVQGLPEGLAFRRPAEYDPKALMAILEHSHRIRFKLKRPLEDGGRDSKALVEL
NGVSLIPKGSRDCGLHGQAPKVPPQDLPPTATSSSMASFLYSTALPNHAIRELKQEAPSCPLAPSDLGLSRPMPEPKATGAQDFSDCCGQ
KPTGPGGPLIQNVHASKRILFSIVHDKSVYRRKHQELQAMQMELQSPEYKLSKLRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLI
RGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDELDFLMEALIISKFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLR
ETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARNCLLTCPGPGRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPE
AFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTSGGRMDPPKNCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQD
PDVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREEERSPAAPPPLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMA
FSQSNPPSELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAKKEPHDRGNLGLEGSCTVPPNVATGRLPGASLLLEPSSLTANMKEV

--------------------------------------------------------------
>35245_35245_4_GTF2IRD1-ALK_GTF2IRD1_chr7_73935627_ENST00000476977_ALK_chr2_29446394_ENST00000389048_length(transcript)=4838nt_BP=2697nt
AACATTTAGCAGCAAACTCAACATACATTTTGGCCAAACGCCCACCGGCCAGTTGTTTAAATCAATATTTATCCACAGCCAAAAAGCAGA
GGAGAGCCAGAACTGAGGCCAGAGCCGAGCTCTGATGCATCTCTCATTTCTCGGGATGTTTCTGTCCCTGTGGTTGGACACCTCTGGCCT
TGTGAAGTGTGATGCACTGTCACATCTCCTGTTCTGTGTCATTGGCCAATGATCATATATCATGACCTGCTGGAAGGCCTGTCTGTGGCT
GGGACCACACGCCTTGGGCCTTATGCACACTGGGCACTGGTCGGGATCCTGGGGTGCAACAGTGGCAGGCAGACCTGGTTTCTGCCCTCA
AAGAGCTTACAGATGGCAGGGGCACCCATGGCGGCAGAAGACACCCCAGGCCTGGTGCCCTTTGTGGTGCCAGCCCCATGGTCCTCCTGC
CTGGGCCTTCCCTACCCCATTGGGTGCAGAAACTCCCTGTCTGCAGGTAGGAGACAGAGGGTAGGTTTTCAGGCTCCCTTGGGAACTGCA
GCCCTGCTCTCTGCCATCAACACCCAGCAGGGGCCACACAGAGAGCACCGGGACTGAGCCCATAGAGGGGAACCGAGAGGCCCCTGCCTC
TAGTCTCTGCCTTCTTTGCTTGGATTGGTGCAGGGACAGCTGCCTTGAGGGCAGGCCCTGGCACTGGGGCAGCCTGTGGGTGCCCCCTGG
GTCAAGAAGGAGAGGGGCAGGGTAGAACCAGGAGCCAAAGGAGGCTGATCTTTGCATCTCATGGGTGCCCAGCTGGACACTGTCATACCC
AGGAAGCCTGTGCCATGCCATGGGGACCCACAACTGGGGGCCCTGGACTTGAGGGGGAGGATGCAGCTCTGTCCCCCAGGAACCCCATTG
CAACAGGACACAGTCCTGCCCTGGGGAGCCCCTGACCTGAGACAAAGCAGCCTCGGCCCTGCTGTATCTTTCCATACCCCTGATGCCAAG
TCTCCTGGCTAGGAGGGAAACTGAGGCTGGAAGGCCTCGGCGGGGGTGGCATTGGCCTCGGGAGCATGTGGCTTGATGCAGAAATGTGAC
GGCAGAGCTCAGAGGCATGCGGAAGGGAGGGGAGGACATCACCGGCTCCTGACCCAGCTGGGCTTCAGGTTGGGGGTACAGGAGGTGGGC
AAGCAGGTTGGACAATTAAAAGCTTCGATGAGGCTGGGTGAGTGGCTTATGCCTGTATTTCCAACACTTTGGGAGGCTGAGGTGGGCAGA
TCACCTGAGGCCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGCCAGACGTGTGGT
GGCACCTGTAATCCCAGCTACCCGGGAGGCTGAGGCAGGAGAATCACTCGAACCCAGGAAGGGGAGGTTGCAGTGAGCCAAGATTGCACC
ACTGCACTATAGCCTGGGCAACAGAGTGAGACTCTGTCTCGAAATAAAATTAAATTTAAAATTTAAAAAAGCTTCAAGGACAACCAGCAG
ATGATGGCAGGACCAGGAAGGGTGCTTCAGGCAGCGGGAACTGAACCTGCACAGACTAGGGAAGCATGAACATCGGTACATCTGGGAGTG
GCACAGCTTAGAGGGCCTGGAGCATGGAGTGTGAGGGGGAACTGGAGTGAGGGACAGGAGACCAGGCGACCATGGCCTTGCTGGGTAAGC
GCTGTGACGTCCCCACCAACGGCTGCGGACCCGACCGCTGGAACTCCGCGTTCACCCGCAAAGACGAGATCATCACCAGCCTCGTGTCTG
CCTTAGACTCCATGTGCTCAGCGCTGTCCAAACTGAACGCCGAGGTGGCCTGTGTCGCCGTGCACGATGAGAGCGCCTTTGTGGTGGGCA
CAGAGAAGGGGAGAATGTTCCTGAATGCCCGGAAGGAGCTACAGTCAGACTTCCTCAGGTTCTGCCGAGGGCCCCCGTGGAAGGATCCGG
AGGCAGAGCACCCCAAGAAGGTGCAGCGGGGCGAGGGTGGAGGCCGTAGCCTCCCTCGGTCCTCCCTGGAACATGGCTCAGATGTGTACC
TTCTGCGGAAGATGGTAGAGGAGGTGTTTGATGTTCTTTATAGCGAGGCCCTGGGAAGGGCCAGTGTGGTGCCACTGCCCTATGAGAGGC
TGCTCAGGGAGCCAGGGCTGCTGGCCGTGCAGGGGCTGCCCGAAGGCCTGGCCTTCCGAAGGCCAGCCGAGTATGACCCCAAGGCCCTCA
TGGCCATCCTGGAACACAGCCACCGCATCCGCTTCAAGCTCAAGAGGCCACTTGAGGATGGCGGGCGGGACTCGAAGGCCCTGGTGGAGC
TGAACGGTGTCTCCCTGATTCCCAAGGGGTCACGGGACTGTGGCCTGCATGGCCAGGCCCCCAAGGTGCCACCCCAGGACCTGCCCCCAA
CCGCCACCTCCTCCTCCATGGCCAGCTTCCTGTACAGCACGGCGCTCCCCAACCACGCCATCCGAGAGCTCAAGCAGGAAGCACCTTCCT
GCCCCCTTGCCCCCAGCGACCTGGGCCTGAGTCGGCCCATGCCAGAGCCCAAGGCCACCGGTGCCCAAGACTTCTCCGACTGTTGTGGAC
AGAAGCCCACTGGGCCTGGTGGGCCTCTCATCCAGAACGTCCATGCCTCCAAGCGCATTCTCTTCTCCATCGTCCATGACAAGTCAGTGT
ACCGCCGGAAGCACCAGGAGCTGCAAGCCATGCAGATGGAGCTGCAGAGCCCTGAGTACAAGCTGAGCAAGCTCCGCACCTCGACCATCA
TGACCGACTACAACCCCAACTACTGCTTTGCTGGCAAGACCTCCTCCATCAGTGACCTGAAGGAGGTGCCGCGGAAAAACATCACCCTCA
TTCGGGGTCTGGGCCATGGCGCCTTTGGGGAGGTGTATGAAGGCCAGGTGTCCGGAATGCCCAACGACCCAAGCCCCCTGCAAGTGGCTG
TGAAGACGCTGCCTGAAGTGTGCTCTGAACAGGACGAACTGGATTTCCTCATGGAAGCCCTGATCATCAGCAAATTCAACCACCAGAACA
TTGTTCGCTGCATTGGGGTGAGCCTGCAATCCCTGCCCCGGTTCATCCTGCTGGAGCTCATGGCGGGGGGAGACCTCAAGTCCTTCCTCC
GAGAGACCCGCCCTCGCCCGAGCCAGCCCTCCTCCCTGGCCATGCTGGACCTTCTGCACGTGGCTCGGGACATTGCCTGTGGCTGTCAGT
ATTTGGAGGAAAACCACTTCATCCACCGAGACATTGCTGCCAGAAACTGCCTCTTGACCTGTCCAGGCCCTGGAAGAGTGGCCAAGATTG
GAGACTTCGGGATGGCCCGAGACATCTACAGGGCGAGCTACTATAGAAAGGGAGGCTGTGCCATGCTGCCAGTTAAGTGGATGCCCCCAG
AGGCCTTCATGGAAGGAATATTCACTTCTAAAACAGACACATGGTCCTTTGGAGTGCTGCTATGGGAAATCTTTTCTCTTGGATATATGC
CATACCCCAGCAAAAGCAACCAGGAAGTTCTGGAGTTTGTCACCAGTGGAGGCCGGATGGACCCACCCAAGAACTGCCCTGGGCCTGTAT
ACCGGATAATGACTCAGTGCTGGCAACATCAGCCTGAAGACAGGCCCAACTTTGCCATCATTTTGGAGAGGATTGAATACTGCACCCAGG
ACCCGGATGTAATCAACACCGCTTTGCCGATAGAATATGGTCCACTTGTGGAAGAGGAAGAGAAAGTGCCTGTGAGGCCCAAGGACCCTG
AGGGGGTTCCTCCTCTCCTGGTCTCTCAACAGGCAAAACGGGAGGAGGAGCGCAGCCCAGCTGCCCCACCACCTCTGCCTACCACCTCCT
CTGGCAAGGCTGCAAAGAAACCCACAGCTGCAGAGATCTCTGTTCGAGTCCCTAGAGGGCCGGCCGTGGAAGGGGGACACGTGAATATGG
CATTCTCTCAGTCCAACCCTCCTTCGGAGTTGCACAAGGTCCACGGATCCAGAAACAAGCCCACCAGCTTGTGGAACCCAACGTACGGCT
CCTGGTTTACAGAGAAACCCACCAAAAAGAATAATCCTATAGCAAAGAAGGAGCCACACGACAGGGGTAACCTGGGGCTGGAGGGAAGCT
GTACTGTCCCACCTAACGTTGCAACTGGGAGACTTCCGGGGGCCTCACTGCTCCTAGAGCCCTCTTCGCTGACTGCCAATATGAAGGAGG
TACCTCTGTTCAGGCTACGTCACTTCCCTTGTGGGAATGTCAATTACGGCTACCAGCAACAGGGCTTGCCCTTAGAAGCCGCTACTGCCC
CTGGAGCTGGTCATTACGAGGATACCATTCTGAAAAGCAAGAATAGCATGAACCAGCCTGGGCCCTGAGCTCGGTCGCACACTCACTTCT
CTTCCTTGGGATCCCTAAGACCGTGGAGGAGAGAGAGGCAATGGCTCCTTCACAAACCAGAGACCAAATGTCACGTTTTGTTTTGTGCCA
ACCTATTTTGAAGTACCACCAAAAAAGCTGTATTTTGAAAATGCTTTAGAAAGGTTTTGAGCATGGGTTCATCCTATTCTTTCGAAAGAA
GAAAATATCATAAAAATGAGTGATAAATACAAGGCCCAGATGTGGTTGCATAAGGTTTTTATGCATGTTTGTTGTATACTTCCTTATGCT
TCTTTCAAATTGTGTGTGCTCTGCTTCAATGTAGTCAGAATTAGCTGCTTCTATGTTTCATAGTTGGGGTCATAGATGTTTCCTTGCCTT

>35245_35245_4_GTF2IRD1-ALK_GTF2IRD1_chr7_73935627_ENST00000476977_ALK_chr2_29446394_ENST00000389048_length(amino acids)=898AA_BP=335
MALLGKRCDVPTNGCGPDRWNSAFTRKDEIITSLVSALDSMCSALSKLNAEVACVAVHDESAFVVGTEKGRMFLNARKELQSDFLRFCRG
PPWKDPEAEHPKKVQRGEGGGRSLPRSSLEHGSDVYLLRKMVEEVFDVLYSEALGRASVVPLPYERLLREPGLLAVQGLPEGLAFRRPAE
YDPKALMAILEHSHRIRFKLKRPLEDGGRDSKALVELNGVSLIPKGSRDCGLHGQAPKVPPQDLPPTATSSSMASFLYSTALPNHAIREL
KQEAPSCPLAPSDLGLSRPMPEPKATGAQDFSDCCGQKPTGPGGPLIQNVHASKRILFSIVHDKSVYRRKHQELQAMQMELQSPEYKLSK
LRTSTIMTDYNPNYCFAGKTSSISDLKEVPRKNITLIRGLGHGAFGEVYEGQVSGMPNDPSPLQVAVKTLPEVCSEQDELDFLMEALIIS
KFNHQNIVRCIGVSLQSLPRFILLELMAGGDLKSFLRETRPRPSQPSSLAMLDLLHVARDIACGCQYLEENHFIHRDIAARNCLLTCPGP
GRVAKIGDFGMARDIYRASYYRKGGCAMLPVKWMPPEAFMEGIFTSKTDTWSFGVLLWEIFSLGYMPYPSKSNQEVLEFVTSGGRMDPPK
NCPGPVYRIMTQCWQHQPEDRPNFAIILERIEYCTQDPDVINTALPIEYGPLVEEEEKVPVRPKDPEGVPPLLVSQQAKREEERSPAAPP
PLPTTSSGKAAKKPTAAEISVRVPRGPAVEGGHVNMAFSQSNPPSELHKVHGSRNKPTSLWNPTYGSWFTEKPTKKNNPIAKKEPHDRGN
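
The two record headers above pack the fusion partners, chromosomes, breakpoint coordinates, Ensembl transcript IDs, and declared lengths into one underscore-delimited line. The following is a minimal sketch of how such a header might be split into named fields, and how a transcript fragment can be translated to check it against the amino acid record. The field layout is inferred only from the two headers shown here, and it assumes gene symbols contain no underscores. The names `HEADER_RE`, `parse_fusion_header`, and `translate` are hypothetical helpers, not part of FusionGDB2. The meaning of the three leading numbers and of the `BP` value is not stated in this section, so they are kept as opaque fields.

```python
import re

# Header layout inferred from the two records above (an assumption, not a
# documented FusionGDB2 format). Gene symbols are assumed to contain no "_".
HEADER_RE = re.compile(
    r"^>(?P<record_id>\d+_\d+_\d+)_(?P<fusion>[^_]+)"
    r"_(?P<hgene>[^_]+)_(?P<hchr>chr[^_]+)_(?P<hbp>\d+)_(?P<henst>ENST\d+)"
    r"_(?P<tgene>[^_]+)_(?P<tchr>chr[^_]+)_(?P<tbp>\d+)_(?P<tenst>ENST\d+)"
    r"_length\((?P<kind>transcript|amino acids)\)=(?P<length>\d+)(?:nt|AA)"
    r"_BP=(?P<bp>\d+)(?:nt)?$"
)


def parse_fusion_header(line):
    """Split one FASTA header line into a dict of named fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not follow the assumed layout")
    fields = match.groupdict()
    # Coordinates and lengths are more useful as integers.
    for key in ("hbp", "tbp", "length", "bp"):
        fields[key] = int(fields[key])
    return fields


# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(nt, start=0):
    """Translate nucleotides from `start` until a stop codon or the end.

    Codons that are not in the table (for example, ones containing N)
    are reported as "X".
    """
    protein = []
    for i in range(start, len(nt) - 2, 3):
        residue = CODON_TABLE.get(nt[i:i + 3].upper(), "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)
```

For example, `parse_fusion_header` applied to the transcript header above yields `hgene == "GTF2IRD1"`, `tbp == 29446394`, and `length == 4838`. Translating the transcript fragment `ATGGCCTTGCTGGGTAAGCGCTGTGACGTCCCC` (copied from the transcript record, across one line wrap) gives `MALLGKRCDVP`, which matches the start of the amino acid record.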

--------------------------------------------------------------


Fusion Gene PPI Analysis for GTF2IRD1-ALK


Go to the ChiPPI (Chimeric Protein-Protein Interactions) page to see the chimeric PPI interactions.


Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
Hgene | Hgene's interactors | Tgene | Tgene's interactors


- Retained PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Still interaction with


- Lost PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with


- Retained PPIs, but lost function due to frame-shift fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with



Related Drugs for GTF2IRD1-ALK


Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
Partner | Gene | UniProtAcc | DrugBank ID | Drug name | Drug activity | Drug type | Drug status
Tgene | ALK | Q9UM73 | DB08865 | Crizotinib | Inhibitor | Small molecule | Approved
Tgene | ALK | Q9UM73 | DB09063 | Ceritinib | Antagonist | Small molecule | Approved
Tgene | ALK | Q9UM73 | DB11363 | Alectinib | Inhibitor | Small molecule | Approved|Investigational
Tgene | ALK | Q9UM73 | DB12010 | Fostamatinib | Inhibitor | Small molecule | Approved|Investigational
Tgene | ALK | Q9UM73 | DB12130 | Lorlatinib | Inhibitor | Small molecule | Approved|Investigational
Tgene | ALK | Q9UM73 | DB12141 | Gilteritinib | Inhibitor | Small molecule | Approved|Investigational
Tgene | ALK | Q9UM73 | DB12267 | Brigatinib | Inhibitor | Small molecule | Approved|Investigational


Related Diseases for GTF2IRD1-ALK


Diseases associated with fusion partners.
(DisGeNet 4.0)
Partner | Gene | Disease ID | Disease name | # pubmeds | Source
Hgene | GTF2IRD1 | C0175702 | Williams Syndrome | 1 | CTD_human
Hgene | GTF2IRD1 | C0376634 | Craniofacial Abnormalities | 1 | CTD_human
Tgene | ALK | C0007131 | Non-Small Cell Lung Carcinoma | 28 | CGI;CTD_human
Tgene | ALK | C0027819 | Neuroblastoma | 13 | CGI;CTD_human;ORPHANET
Tgene | ALK | C0152013 | Adenocarcinoma of lung (disorder) | 8 | CGI;CTD_human
Tgene | ALK | C2751681 | NEUROBLASTOMA, SUSCEPTIBILITY TO, 3 | 8 | CLINGEN;UNIPROT
Tgene | ALK | C0206180 | Ki-1+ Anaplastic Large Cell Lymphoma | 6 | CGI;CTD_human
Tgene | ALK | C0334121 | Inflammatory Myofibroblastic Tumor | 4 | CGI;CTD_human;ORPHANET
Tgene | ALK | C0018199 | Granuloma, Plasma Cell | 3 | CTD_human
Tgene | ALK | C0007621 | Neoplastic Cell Transformation | 2 | CTD_human
Tgene | ALK | C0027627 | Neoplasm Metastasis | 2 | CTD_human
Tgene | ALK | C0238463 | Papillary thyroid carcinoma | 2 | ORPHANET
Tgene | ALK | C0001973 | Alcoholic Intoxication, Chronic | 1 | PSYGENET
Tgene | ALK | C0006118 | Brain Neoplasms | 1 | CGI;CTD_human
Tgene | ALK | C0006142 | Malignant neoplasm of breast | 1 | CTD_human
Tgene | ALK | C0007134 | Renal Cell Carcinoma | 1 | CTD_human
Tgene | ALK | C0011570 | Mental Depression | 1 | PSYGENET
Tgene | ALK | C0011581 | Depressive disorder | 1 | PSYGENET
Tgene | ALK | C0027643 | Neoplasm Recurrence, Local | 1 | CTD_human
Tgene | ALK | C0036341 | Schizophrenia | 1 | PSYGENET
Tgene | ALK | C0079744 | Diffuse Large B-Cell Lymphoma | 1 | CTD_human
Tgene | ALK | C0085269 | Plasma Cell Granuloma, Pulmonary | 1 | CTD_human
Tgene | ALK | C0153633 | Malignant neoplasm of brain | 1 | CGI;CTD_human
Tgene | ALK | C0278601 | Inflammatory Breast Carcinoma | 1 | CTD_human
Tgene | ALK | C0279702 | Conventional (Clear Cell) Renal Cell Carcinoma | 1 | CTD_human
Tgene | ALK | C0496899 | Benign neoplasm of brain, unspecified | 1 | CTD_human
Tgene | ALK | C0678222 | Breast Carcinoma | 1 | CTD_human
Tgene | ALK | C0750974 | Brain Tumor, Primary | 1 | CTD_human
Tgene | ALK | C0750977 | Recurrent Brain Neoplasm | 1 | CTD_human
Tgene | ALK | C0750979 | Primary malignant neoplasm of brain | 1 | CTD_human
Tgene | ALK | C1257931 | Mammary Neoplasms, Human | 1 | CTD_human
Tgene | ALK | C1266042 | Chromophobe Renal Cell Carcinoma | 1 | CTD_human
Tgene | ALK | C1266043 | Sarcomatoid Renal Cell Carcinoma | 1 | CTD_human
Tgene | ALK | C1266044 | Collecting Duct Carcinoma of the Kidney | 1 | CTD_human
Tgene | ALK | C1306837 | Papillary Renal Cell Carcinoma | 1 | CTD_human
Tgene | ALK | C1332079 | Anaplastic Large Cell Lymphoma, ALK-Positive | 1 | ORPHANET
Tgene | ALK | C1458155 | Mammary Neoplasms | 1 | CTD_human
Tgene | ALK | C1527390 | Neoplasms, Intracranial | 1 | CTD_human
Tgene | ALK | C2931189 | Neural crest tumor | 1 | ORPHANET
Tgene | ALK | C3899155 | hereditary neuroblastoma | 1 | GENOMICS_ENGLAND
Tgene | ALK | C4704874 | Mammary Carcinoma, Human | 1 | CTD_human