Fusion Gene Studies
in Kim Lab

Center for Computational Systems Medicine

Fusion gene: CD74-NTRK1 (FusionGDB2 ID: HG972TG4914)

Fusion Gene Summary for CD74-NTRK1

Fusion gene summary
Fusion gene name: CD74-NTRK1
Fusion gene ID: hg972tg4914

Gene symbol
  Hgene: CD74
  Tgene: NTRK1
Gene ID
  Hgene: 972
  Tgene: 4914
Gene name
  Hgene: CD74 molecule
  Tgene: neurotrophic receptor tyrosine kinase 1
Synonyms
  Hgene: DHLAG|HLADG|II|Ia-GAMMA|p33
  Tgene: MTC|TRK|TRK1|TRKA|Trk-A|p140-TrkA
Cytomap
  Hgene: 5q33.1
  Tgene: 1q23.1
Type of gene
  Hgene: protein-coding
  Tgene: protein-coding
Description
  Hgene: HLA class II histocompatibility antigen gamma chain; CD74 antigen (invariant polypeptide of major histocompatibility complex, class II antigen-associated); CD74 molecule, major histocompatibility complex, class II invariant chain; HLA-DR antigens-associated
  Tgene: high affinity nerve growth factor receptor; Oncogene TRK; TRK1-transforming tyrosine kinase protein; gp140trk; neurotrophic tyrosine kinase, receptor, type 1; tropomyosin-related kinase A; tyrosine kinase receptor A
Modification date
  Hgene: 20200313
  Tgene: 20200313
UniProtAcc
  Hgene: P04233
  Tgene: P04629
Ensembl transcripts involved in fusion gene: ENST00000377795, ENST00000009530, ENST00000353334, ENST00000524315
Fusion gene scores
  * DoF score
    Hgene: 43 X 51 X 20 = 43860
    Tgene: 25 X 24 X 13 = 7800
  # samples
    Hgene: 59
    Tgene: 33
  ** MAII score
    Hgene: log2(59/43860*10) = -6.21604704731175
      possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs). DoF>8 and MAII<0
    Tgene: log2(33/7800*10) = -4.56293619439116
      possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs). DoF>8 and MAII<0
Context

PubMed: CD74 [Title/Abstract] AND NTRK1 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpoint: CD74(149782126)-NTRK1(156844360), # samples: 2
Anticipated loss of major functional domain due to fusion event:
CD74-NTRK1 appears to have lost the major protein functional domain of the Hgene partner, which is a CGC gene, because the partially deleted in-frame ORF does not retain that domain.
CD74-NTRK1 appears to have lost the major protein functional domain of the Hgene partner, which is an essential gene, because the partially deleted in-frame ORF does not retain that domain.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)
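
The two score definitions above can be checked with a few lines of Python. This is a minimal sketch: the function names are illustrative and not part of FusionGDB2, and the counts are the ones reported for CD74 and NTRK1 in the summary.

```python
import math


def dof_score(n_partners: int, n_breakpoints: int, n_cancer_types: int) -> int:
    """Degree of Frequency = # partners x # break points x # cancer types."""
    return n_partners * n_breakpoints * n_cancer_types


def maii_score(n_samples: int, dof: int) -> float:
    """Major Active Isofusion Index = log2(# samples / DoF score * 10)."""
    return math.log2(n_samples / dof * 10)


def is_pegin_pcfg(dof: int, maii: float) -> bool:
    """Criterion printed on the page for peGinPCFGs: DoF > 8 and MAII < 0."""
    return dof > 8 and maii < 0


# Counts taken from the summary for each partner gene.
cd74_dof = dof_score(43, 51, 20)
cd74_maii = maii_score(59, cd74_dof)
ntrk1_dof = dof_score(25, 24, 13)
ntrk1_maii = maii_score(33, ntrk1_dof)

print(cd74_dof, round(cd74_maii, 4), is_pegin_pcfg(cd74_dof, cd74_maii))
print(ntrk1_dof, round(ntrk1_maii, 4), is_pegin_pcfg(ntrk1_dof, ntrk1_maii))
```

Both genes have a large DoF score and a negative MAII score, so both meet the stated criterion.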

Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
Partner | Gene | GO ID | GO term | PubMed ID
Hgene | CD74 | GO:0001516 | prostaglandin biosynthetic process | 12782713
Hgene | CD74 | GO:0001934 | positive regulation of protein phosphorylation | 24942581
Hgene | CD74 | GO:0002792 | negative regulation of peptide secretion | 19849849
Hgene | CD74 | GO:0033674 | positive regulation of kinase activity | 24942581
Hgene | CD74 | GO:0043066 | negative regulation of apoptotic process | 12782713
Hgene | CD74 | GO:0043123 | positive regulation of I-kappaB kinase/NF-kappaB signaling | 24942581
Hgene | CD74 | GO:0043410 | positive regulation of MAPK cascade | 24942581
Hgene | CD74 | GO:0043518 | negative regulation of DNA damage response, signal transduction by p53 class mediator | 17045821
Hgene | CD74 | GO:0045657 | positive regulation of monocyte differentiation | 24942581
Hgene | CD74 | GO:0045893 | positive regulation of transcription, DNA-templated | 24942581
Hgene | CD74 | GO:0046598 | positive regulation of viral entry into host cell | 24942581
Hgene | CD74 | GO:0050731 | positive regulation of peptidyl-tyrosine phosphorylation | 17045821
Hgene | CD74 | GO:0070374 | positive regulation of ERK1 and ERK2 cascade | 17045821|24942581
Tgene | NTRK1 | GO:0006468 | protein phosphorylation | 15488758
Tgene | NTRK1 | GO:0008285 | negative regulation of cell proliferation | 15488758
Tgene | NTRK1 | GO:0010976 | positive regulation of neuron projection development | 15488758
Tgene | NTRK1 | GO:0018108 | peptidyl-tyrosine phosphorylation | 2927393
Tgene | NTRK1 | GO:0043547 | positive regulation of GTPase activity | 15488758
Tgene | NTRK1 | GO:0046579 | positive regulation of Ras protein signal transduction | 15488758
Tgene | NTRK1 | GO:0046777 | protein autophosphorylation | 15488758
Tgene | NTRK1 | GO:0048011 | neurotrophin TRK receptor signaling pathway | 15488758
Tgene | NTRK1 | GO:0051092 | positive regulation of NF-kappaB transcription factor activity | 15488758
Tgene | NTRK1 | GO:0070374 | positive regulation of ERK1 and ERK2 cascade | 15488758
Tgene | NTRK1 | GO:1904646 | cellular response to amyloid-beta | 11927634


Fusion gene breakpoints across CD74 (5'-gene)
[Figure: all structure]
Fusion gene breakpoints across NTRK1 (3'-gene)
[Figure: all structure]

Fusion gene information
* All genome coordinates were lifted over to hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
Source | Disease | Sample | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
ChimerDB4 | non small cell lung cancer | HG532018 | CD74 | chr5 | 149792499 |  | NTRK1 | chr1 | 156844360 |


Fusion Gene ORF analysis for CD74-NTRK1

Open reading frame (ORF) analysis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORF | Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
intron-3CDS | ENST00000009530 | ENST00000358660 | CD74 | chr5 | 149792499 |  | NTRK1 | chr1 | 156844360 |
intron-3CDS | ENST00000009530 | ENST00000368196 | CD74 | chr5 | 149792499 |  | NTRK1 | chr1 | 156844360 |
intron-3CDS | ENST00000009530 | ENST00000392302 | CD74 | chr5 | 149792499 |  | NTRK1 | chr1 | 156844360 |
intron-3CDS | ENST00000009530 | ENST00000524377 | CD74 | chr5 | 149792499 |  | NTRK1 | chr1 | 156844360 |
intron-3CDS | ENST00000353334 | ENST00000358660 | CD74 | chr5 | 149792499 |  | NTRK1 | chr1 | 156844360 |
intron-3CDS | ENST00000353334 | ENST00000368196 | CD74 | chr5 | 149792499 |  | NTRK1 | chr1 | 156844360 |
intron-3CDS | ENST00000353334 | ENST00000392302 | CD74 | chr5 | 149792499 |  | NTRK1 | chr1 | 156844360 |
intron-3CDS | ENST00000353334 | ENST00000524377 | CD74 | chr5 | 149792499 |  | NTRK1 | chr1 | 156844360 |
intron-3CDS | ENST00000377795 | ENST00000358660 | CD74 | chr5 | 149792499 |  | NTRK1 | chr1 | 156844360 |
intron-3CDS | ENST00000377795 | ENST00000368196 | CD74 | chr5 | 149792499 |  | NTRK1 | chr1 | 156844360 |
intron-3CDS | ENST00000377795 | ENST00000392302 | CD74 | chr5 | 149792499 |  | NTRK1 | chr1 | 156844360 |
intron-3CDS | ENST00000377795 | ENST00000524377 | CD74 | chr5 | 149792499 |  | NTRK1 | chr1 | 156844360 |
intron-3CDS | ENST00000524315 | ENST00000358660 | CD74 | chr5 | 149792499 |  | NTRK1 | chr1 | 156844360 |
intron-3CDS | ENST00000524315 | ENST00000368196 | CD74 | chr5 | 149792499 |  | NTRK1 | chr1 | 156844360 |
intron-3CDS | ENST00000524315 | ENST00000392302 | CD74 | chr5 | 149792499 |  | NTRK1 | chr1 | 156844360 |
intron-3CDS | ENST00000524315 | ENST00000524377 | CD74 | chr5 | 149792499 |  | NTRK1 | chr1 | 156844360 |
intron-intron | ENST00000009530 | ENST00000531606 | CD74 | chr5 | 149792499 |  | NTRK1 | chr1 | 156844360 |
intron-intron | ENST00000353334 | ENST00000531606 | CD74 | chr5 | 149792499 |  | NTRK1 | chr1 | 156844360 |
intron-intron | ENST00000377795 | ENST00000531606 | CD74 | chr5 | 149792499 |  | NTRK1 | chr1 | 156844360 |
intron-intron | ENST00000524315 | ENST00000531606 | CD74 | chr5 | 149792499 |  | NTRK1 | chr1 | 156844360 |

ORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | Seq length (transcript) | BP loci (transcript) | Predicted start (transcript) | Predicted stop (transcript) | Seq length (amino acids)

DeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a convolutional neural network classifier of coding potential, built by comparison with real Ribo-seq data. If the no-coding score is < 0.5 and the coding score is > 0.5, the in-frame fusion transcript is predicted as likely to be translated.
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | No-coding score | Coding score
ENST00000353334 | ENST00000392302 | CD74 | chr5 | 149782126 | - | NTRK1 | chr1 | 156844360 | + | 0.03999295 | 0.9600071
ENST00000353334 | ENST00000368196 | CD74 | chr5 | 149782126 | - | NTRK1 | chr1 | 156844360 | + | 0.03808969 | 0.9619103
ENST00000353334 | ENST00000524377 | CD74 | chr5 | 149782126 | - | NTRK1 | chr1 | 156844360 | + | 0.043101184 | 0.95689887
ENST00000353334 | ENST00000358660 | CD74 | chr5 | 149782126 | - | NTRK1 | chr1 | 156844360 | + | 0.040439736 | 0.9595602
ENST00000009530 | ENST00000392302 | CD74 | chr5 | 149782126 | - | NTRK1 | chr1 | 156844360 | + | 0.04795496 | 0.9520451
ENST00000009530 | ENST00000368196 | CD74 | chr5 | 149782126 | - | NTRK1 | chr1 | 156844360 | + | 0.046329632 | 0.95367044
ENST00000009530 | ENST00000524377 | CD74 | chr5 | 149782126 | - | NTRK1 | chr1 | 156844360 | + | 0.04869425 | 0.9513057
ENST00000009530 | ENST00000358660 | CD74 | chr5 | 149782126 | - | NTRK1 | chr1 | 156844360 | + | 0.0472736 | 0.95272636
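
The decision rule stated for DeepORF (no-coding score < 0.5 and coding score > 0.5) can be applied to the scores listed above as follows. This is only a sketch of the thresholding step; the function name is hypothetical and the classifier itself is not reproduced here.

```python
from typing import List, Tuple


def likely_translated(no_coding_score: float, coding_score: float,
                      threshold: float = 0.5) -> bool:
    """Apply the stated rule: no-coding score < 0.5 and coding score > 0.5."""
    return no_coding_score < threshold and coding_score > threshold


# (Henst, Tenst, no-coding score, coding score) for three of the listed rows.
rows: List[Tuple[str, str, float, float]] = [
    ("ENST00000353334", "ENST00000392302", 0.03999295, 0.9600071),
    ("ENST00000353334", "ENST00000368196", 0.03808969, 0.9619103),
    ("ENST00000009530", "ENST00000358660", 0.0472736, 0.95272636),
]

calls = {(henst, tenst): likely_translated(nc, c) for henst, tenst, nc, c in rows}
for pair, call in calls.items():
    print(pair, "likely translated" if call else "not predicted as translated")
```

Every CD74-NTRK1 transcript pair in the table has a no-coding score below 0.05, so all of them pass the rule.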


Fusion Genomic Features for CD74-NTRK1


FusionAI prediction of the potential fusion gene breakpoint based on the pre-mRNA sequence context (+/- 5 kb in each partner gene, 20 kb of sequence in total). FusionAI is a convolutional neural network classifier of fusion gene breakpoints, built by comparing the fusion-positive and fusion-negative sequence contexts of ~20K fusion genes. The scores indicate how likely the sequences within the 20 kb genomic context are to be used as gene fusion breakpoints.
Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | 1-p | p (fusion gene breakpoint)
CD74 | chr5 | 149782125 | - | NTRK1 | chr1 | 156844362 | + | 0.000180041 | 0.99981993
CD74 | chr5 | 149782125 | - | NTRK1 | chr1 | 156844362 | + | 0.000180041 | 0.99981993

Distribution of 44 human genomic feature loci across the 20 kb fusion breakpoint regions. We integrated loci information for 44 types of human genomic features in five broad categories: virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are on the help page.
[Figure: genomic feature]

Distribution of 44 human genomic feature loci across the 20 kb fusion breakpoint regions that overlap with the regions in the top 1% of feature importance scores. More details are on the help page.
[Figure: genomic feature of top 1%]


Fusion Protein Features for CD74-NTRK1


Four levels of functional features of fusion genes
Go to the FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr5:149782126/chr1:156844360)
- FGviewer provides an online visualization of which protein functional features are retained, across the DNA, RNA, protein, and pathological levels.
- How to search
1. Enter your fusion gene symbol.
2. Press the Tab key until the breakpoint information is filled in.
3. Scroll down and press the 'Search' tab twice.
4. Scroll down to find the hyperlink to the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
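
As an alternative to the manual search, the direct link above suggests a simple URL pattern. The helper below is a sketch that assumes the pattern `<base>/<Hchr>:<Hbp>/<Tchr>:<Tbp>` generalizes beyond this one example; only the single URL shown on this page is confirmed, and the function name is hypothetical.

```python
def fgviewer_url(h_chr: str, h_bp: int, t_chr: str, t_bp: int,
                 base: str = "https://ccsmweb.uth.edu/FGviewer") -> str:
    """Build an FGviewer link from the two breakpoints (assumed URL pattern)."""
    return f"{base}/{h_chr}:{h_bp}/{t_chr}:{t_bp}"


# Most frequent CD74-NTRK1 breakpoint reported on this page.
url = fgviewer_url("chr5", 149782126, "chr1", 156844360)
print(url)
```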

Main function of each fusion partner protein. (from UniProt)

Hgene: CD74 (P04233)
FUNCTION: Plays a critical role in MHC class II antigen processing by stabilizing peptide-free class II alpha/beta heterodimers in a complex soon after their synthesis and directing transport of the complex from the endoplasmic reticulum to the endosomal/lysosomal system where the antigen processing and binding of antigenic peptides to MHC class II takes place. Serves as cell surface receptor for the cytokine MIF.
FUNCTION: [Class-II-associated invariant chain peptide]: Binds to the peptide-binding site of MHC class II alpha/beta heterodimers forming an alpha-beta-CLIP complex, thereby preventing the loading of antigenic peptides to the MHC class II complex until its release by HLA-DM in the endosome. {ECO:0000269|PubMed:1448172}.
FUNCTION: [Isoform p41]: Stabilizes the conformation of mature CTSL by binding to its active site and serving as a chaperone to help maintain a pool of mature enzyme in endocytic compartments and extracellular space of antigen-presenting cells (APCs). Has antiviral activity by stymieing the endosomal entry of Ebola virus and coronaviruses, including SARS-CoV-2 (PubMed:32855215). Disrupts cathepsin-mediated Ebola virus glycoprotein processing, which prevents viral fusion and entry. This antiviral activity is specific to p41 isoform (PubMed:32855215). {ECO:0000250|UniProtKB:P04441, ECO:0000269|PubMed:32855215}.

Tgene: NTRK1 (P04629)
FUNCTION: Receptor tyrosine kinase involved in the development and the maturation of the central and peripheral nervous systems through regulation of proliferation, differentiation and survival of sympathetic and nervous neurons. High affinity receptor for NGF which is its primary ligand (PubMed:1850821, PubMed:1849459, PubMed:1281417, PubMed:8325889, PubMed:15488758, PubMed:22649032, PubMed:17196528, PubMed:27445338). Can also bind and be activated by NTF3/neurotrophin-3. However, NTF3 only supports axonal extension through NTRK1 but has no effect on neuron survival (By similarity). Upon dimeric NGF ligand-binding, undergoes homodimerization, autophosphorylation and activation (PubMed:1281417). Recruits, phosphorylates and/or activates several downstream effectors including SHC1, FRS2, SH2B1, SH2B2 and PLCG1 that regulate distinct overlapping signaling cascades driving cell survival and differentiation. Through SHC1 and FRS2 activates a GRB2-Ras-MAPK cascade that regulates cell differentiation and survival. Through PLCG1 controls NF-Kappa-B activation and the transcription of genes involved in cell survival. Through SHC1 and SH2B1 controls a Ras-PI3 kinase-AKT1 signaling cascade that is also regulating survival. In absence of ligand and activation, may promote cell death, making the survival of neurons dependent on trophic factors. {ECO:0000250|UniProtKB:P35739, ECO:0000250|UniProtKB:Q3UFB7, ECO:0000269|PubMed:11244088, ECO:0000269|PubMed:1281417, ECO:0000269|PubMed:15488758, ECO:0000269|PubMed:17196528, ECO:0000269|PubMed:1849459, ECO:0000269|PubMed:1850821, ECO:0000269|PubMed:22649032, ECO:0000269|PubMed:27445338, ECO:0000269|PubMed:27676246, ECO:0000269|PubMed:8155326, ECO:0000269|PubMed:8325889}.
FUNCTION: [Isoform TrkA-III]: Resistant to NGF, it constitutively activates AKT1 and NF-kappa-B and is unable to activate the Ras-MAPK signaling cascade. Antagonizes the anti-proliferative NGF-NTRK1 signaling that promotes neuronal precursors differentiation. Isoform TrkA-III promotes angiogenesis and has oncogenic activity when overexpressed. {ECO:0000269|PubMed:15488758}.

Retention analysis of each fusion partner protein across 39 UniProt protein features: six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and three secondary structure features. Because of limited viewing space, only the retention information for the 13 regional features is shown here. All retention annotation results can be downloaded from the download page.

* A negative BPloci value means that the break point is located before the CDS.
- In-frame and retained protein features among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Hgene | CD74 | chr5:149782126 | chr1:156844360 | ENST00000009530 | - | 8 | 9 | 210_271 | 293 | 297.0 | Domain | Thyroglobulin type-1
Hgene | CD74 | chr5:149782126 | chr1:156844360 | ENST00000009530 | - | 8 | 9 | 1_46 | 293 | 297.0 | Topological domain | Cytoplasmic
Hgene | CD74 | chr5:149782126 | chr1:156844360 | ENST00000009530 | - | 8 | 9 | 73_296 | 293 | 297.0 | Topological domain | Extracellular
Hgene | CD74 | chr5:149782126 | chr1:156844360 | ENST00000353334 | - | 7 | 8 | 1_46 | 229 | 233.0 | Topological domain | Cytoplasmic
Hgene | CD74 | chr5:149782126 | chr1:156844360 | ENST00000377795 | - | 5 | 6 | 1_46 | 168 | 280.6666666666667 | Topological domain | Cytoplasmic
Hgene | CD74 | chr5:149782126 | chr1:156844360 | ENST00000009530 | - | 8 | 9 | 47_72 | 293 | 297.0 | Transmembrane | Helical; Signal-anchor for type II membrane protein
Hgene | CD74 | chr5:149782126 | chr1:156844360 | ENST00000353334 | - | 7 | 8 | 47_72 | 229 | 233.0 | Transmembrane | Helical; Signal-anchor for type II membrane protein
Hgene | CD74 | chr5:149782126 | chr1:156844360 | ENST00000377795 | - | 5 | 6 | 47_72 | 168 | 280.6666666666667 | Transmembrane | Helical; Signal-anchor for type II membrane protein
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000368196 |  | 7 | 16 | 510_781 | 392 | 791.0 | Domain | Protein kinase
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000392302 |  | 8 | 17 | 510_781 | 362 | 761.0 | Domain | Protein kinase
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000524377 |  | 8 | 17 | 510_781 | 398 | 797.0 | Domain | Protein kinase
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000368196 |  | 7 | 16 | 537_541 | 392 | 791.0 | Motif | DXXLL
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000368196 |  | 7 | 16 | 607_611 | 392 | 791.0 | Motif | DXXLL
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000392302 |  | 8 | 17 | 537_541 | 362 | 761.0 | Motif | DXXLL
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000392302 |  | 8 | 17 | 607_611 | 362 | 761.0 | Motif | DXXLL
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000524377 |  | 8 | 17 | 537_541 | 398 | 797.0 | Motif | DXXLL
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000524377 |  | 8 | 17 | 607_611 | 398 | 797.0 | Motif | DXXLL
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000368196 |  | 7 | 16 | 516_524 | 392 | 791.0 | Nucleotide binding | ATP
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000392302 |  | 8 | 17 | 516_524 | 362 | 761.0 | Nucleotide binding | ATP
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000524377 |  | 8 | 17 | 516_524 | 398 | 797.0 | Nucleotide binding | ATP
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000368196 |  | 7 | 16 | 440_796 | 392 | 791.0 | Topological domain | Cytoplasmic
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000392302 |  | 8 | 17 | 440_796 | 362 | 761.0 | Topological domain | Cytoplasmic
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000524377 |  | 8 | 17 | 440_796 | 398 | 797.0 | Topological domain | Cytoplasmic
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000368196 |  | 7 | 16 | 424_439 | 392 | 791.0 | Transmembrane | Helical
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000392302 |  | 8 | 17 | 424_439 | 362 | 761.0 | Transmembrane | Helical
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000524377 |  | 8 | 17 | 424_439 | 398 | 797.0 | Transmembrane | Helical

- In-frame and not-retained protein features among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Hgene | CD74 | chr5:149782126 | chr1:156844360 | ENST00000353334 | - | 7 | 8 | 210_271 | 229 | 233.0 | Domain | Thyroglobulin type-1
Hgene | CD74 | chr5:149782126 | chr1:156844360 | ENST00000377795 | - | 5 | 6 | 210_271 | 168 | 280.6666666666667 | Domain | Thyroglobulin type-1
Hgene | CD74 | chr5:149782126 | chr1:156844360 | ENST00000353334 | - | 7 | 8 | 73_296 | 229 | 233.0 | Topological domain | Extracellular
Hgene | CD74 | chr5:149782126 | chr1:156844360 | ENST00000377795 | - | 5 | 6 | 73_296 | 168 | 280.6666666666667 | Topological domain | Extracellular
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000368196 |  | 7 | 16 | 148_193 | 392 | 791.0 | Domain | Note=LRRCT
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000368196 |  | 7 | 16 | 194_283 | 392 | 791.0 | Domain | Ig-like C2-type 1
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000368196 |  | 7 | 16 | 299_365 | 392 | 791.0 | Domain | Ig-like C2-type 2
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000392302 |  | 8 | 17 | 148_193 | 362 | 761.0 | Domain | Note=LRRCT
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000392302 |  | 8 | 17 | 194_283 | 362 | 761.0 | Domain | Ig-like C2-type 1
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000392302 |  | 8 | 17 | 299_365 | 362 | 761.0 | Domain | Ig-like C2-type 2
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000524377 |  | 8 | 17 | 148_193 | 398 | 797.0 | Domain | Note=LRRCT
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000524377 |  | 8 | 17 | 194_283 | 398 | 797.0 | Domain | Ig-like C2-type 1
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000524377 |  | 8 | 17 | 299_365 | 398 | 797.0 | Domain | Ig-like C2-type 2
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000368196 |  | 7 | 16 | 116_137 | 392 | 791.0 | Repeat | Note=LRR 2
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000368196 |  | 7 | 16 | 90_113 | 392 | 791.0 | Repeat | Note=LRR 1
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000392302 |  | 8 | 17 | 116_137 | 362 | 761.0 | Repeat | Note=LRR 2
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000392302 |  | 8 | 17 | 90_113 | 362 | 761.0 | Repeat | Note=LRR 1
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000524377 |  | 8 | 17 | 116_137 | 398 | 797.0 | Repeat | Note=LRR 2
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000524377 |  | 8 | 17 | 90_113 | 398 | 797.0 | Repeat | Note=LRR 1
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000368196 |  | 7 | 16 | 33_423 | 392 | 791.0 | Topological domain | Extracellular
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000392302 |  | 8 | 17 | 33_423 | 362 | 761.0 | Topological domain | Extracellular
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000524377 |  | 8 | 17 | 33_423 | 398 | 797.0 | Topological domain | Extracellular



Fusion Gene Sequence for CD74-NTRK1


For in-frame fusion transcripts, we provide the fusion transcript sequences and the fusion amino acid sequences. To obtain each fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among all predicted ones.
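
The idea of choosing the longest ORF can be illustrated with a small Python sketch. It is not ORFfinder: as a simplifying assumption it scans only the three forward-strand frames, accepts only ATG as a start codon, and uses the standard genetic code. The function names and the toy sequence are hypothetical.

```python
from typing import Optional, Tuple

STOP_CODONS = {"TAA", "TAG", "TGA"}

# Standard genetic code, codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: _AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(_BASES)
    for j, b2 in enumerate(_BASES)
    for k, b3 in enumerate(_BASES)
}


def longest_orf(seq: str) -> Optional[Tuple[int, int]]:
    """Return (start, end) of the longest ATG..stop ORF on the forward strand.

    Coordinates are 0-based and end-exclusive, and include the stop codon.
    Returns None if no complete ORF is found.
    """
    seq = seq.upper()
    best = None
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                if best is None or (i + 3 - start) > (best[1] - best[0]):
                    best = (start, i + 3)
                start = None
    return best


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3].upper(), "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Toy sequence with two ORFs in the same frame; the first one is longer.
toy = "AGATGCACAGGTAAATGAAATGA"
start, end = longest_orf(toy)
print(start, end, translate(toy[start:end]))
```

As a sanity check against the records listed here, translating the 18 nucleotides that begin at the first ATG of the ENST00000009530 transcripts (ATGCACAGGAGGAGAAGC) gives MHRRRS, which matches the start of the corresponding amino acid sequences.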
>14643_14643_1_CD74-NTRK1_CD74_chr5_149782126_ENST00000009530_NTRK1_chr1_156844360_ENST00000358660_length(transcript)=2156nt_BP=882nt
AGATGCACAGGAGGAGAAGCAGGAGCTGTCGGGAAGATCAGAAGCCAGTCATGGATGACCAGCGCGACCTTATCTCCAACAATGAGCAAC
TGCCCATGCTGGGCCGGCGCCCTGGGGCCCCGGAGAGCAAGTGCAGCCGCGGAGCCCTGTACACAGGCTTTTCCATCCTGGTGACTCTGC
TCCTCGCTGGCCAGGCCACCACCGCCTACTTCCTGTACCAGCAGCAGGGCCGGCTGGACAAACTGACAGTCACCTCCCAGAACCTGCAGC
TGGAGAACCTGCGCATGAAGCTTCCCAAGCCTCCCAAGCCTGTGAGCAAGATGCGCATGGCCACCCCGCTGCTGATGCAGGCGCTGCCCA
TGGGAGCCCTGCCCCAGGGGCCCATGCAGAATGCCACCAAGTATGGCAACATGACAGAGGACCATGTGATGCACCTGCTCCAGAATGCTG
ACCCCCTGAAGGTGTACCCGCCACTGAAGGGGAGCTTCCCGGAGAACCTGAGACACCTTAAGAACACCATGGAGACCATAGACTGGAAGG
TCTTTGAGAGCTGGATGCACCATTGGCTCCTGTTTGAAATGAGCAGGCACTCCTTGGAGCAAAAGCCCACTGACGCTCCACCGAAAGTAC
TGACCAAGTGCCAGGAAGAGGTCAGCCACATCCCTGCTGTCCACCCGGGTTCATTCAGGCCCAAGTGCGACGAGAACGGCAACTATCTGC
CACTCCAGTGCTATGGGAGCATCGGCTACTGCTGGTGTGTCTTCCCCAACGGCACGGAGGTCCCCAACACCAGAAGCCGCGGGCACCATA
ACTGCAGTGAGTCACTGGAACTGGAGGACCCGTCTTCTGGGCTGGGTGTGACCAAGCAGGATCTGGGCCCAGACACTAACAGCACATCTG
GAGACCCGGTGGAGAAGAAGGACGAAACACCTTTTGGGGTCTCGGTGGCTGTGGGCCTGGCCGTCTTTGCCTGCCTCTTCCTTTCTACGC
TGCTCCTTGTGCTCAACAAATGTGGACGGAGAAACAAGTTTGGGATCAACCGCCCGGCTGTGCTGGCTCCAGAGGATGGGCTGGCCATGT
CCCTGCATTTCATGACATTGGGTGGCAGCTCCCTGTCCCCCACCGAGGGCAAAGGCTCTGGGCTCCAAGGCCACATCATCGAGAACCCAC
AATACTTCAGTGATGCCTCCCCCTCAGGTGTTCACCACATCAAGCGCCGGGACATCGTGCTCAAGTGGGAGCTGGGGGAGGGCGCCTTTG
GGAAGGTCTTCCTTGCTGAGTGCCACAACCTCCTGCCTGAGCAGGACAAGATGCTGGTGGCTGTCAAGGCACTGAAGGAGGCGTCCGAGA
GTGCTCGGCAGGACTTCCAGCGTGAGGCTGAGCTGCTCACCATGCTGCAGCACCAGCACATCGTGCGCTTCTTCGGCGTCTGCACCGAGG
GCCGCCCCCTGCTCATGGTCTTTGAGTATATGCGGCACGGGGACCTCAACCGCTTCCTCCGATCCCATGGACCTGATGCCAAGCTGCTGG
CTGGTGGGGAGGATGTGGCTCCAGGCCCCCTGGGTCTGGGGCAGCTGCTGGCCGTGGCTAGCCAGGTCGCTGCGGGGATGGTGTACCTGG
CGGGTCTGCATTTTGTGCACCGGGACCTGGCCACACGCAACTGTCTAGTGGGCCAGGGACTGGTGGTCAAGATTGGTGATTTTGGCATGA
GCAGGGATATCTACAGCACCGACTATTACCGTGTGGGAGGCCGCACCATGCTGCCCATTCGCTGGATGCCGCCCGAGAGCATCCTGTACC
GTAAGTTCACCACCGAGAGCGACGTGTGGAGCTTCGGCGTGGTGCTCTGGGAGATCTTCACCTACGGCAAGCAGCCCTGGTACCAGCTCT
CCAACACGGAGGCAATCGACTGCATCACGCAGGGACGTGAGTTGGAGCGGCCACGTGCCTGCCCACCAGAGGTCTACGCCATCATGCGGG
GCTGCTGGCAGCGGGAGCCCCAGCAACGCCACAGCATCAAGGATGTGCACGCCCGGCTGCAAGCCCTGGCCCAGGCACCTCCTGTCTACC

>14643_14643_1_CD74-NTRK1_CD74_chr5_149782126_ENST00000009530_NTRK1_chr1_156844360_ENST00000358660_length(amino acids)=694AA_BP=293
MHRRRSRSCREDQKPVMDDQRDLISNNEQLPMLGRRPGAPESKCSRGALYTGFSILVTLLLAGQATTAYFLYQQQGRLDKLTVTSQNLQL
ENLRMKLPKPPKPVSKMRMATPLLMQALPMGALPQGPMQNATKYGNMTEDHVMHLLQNADPLKVYPPLKGSFPENLRHLKNTMETIDWKV
FESWMHHWLLFEMSRHSLEQKPTDAPPKVLTKCQEEVSHIPAVHPGSFRPKCDENGNYLPLQCYGSIGYCWCVFPNGTEVPNTRSRGHHN
CSESLELEDPSSGLGVTKQDLGPDTNSTSGDPVEKKDETPFGVSVAVGLAVFACLFLSTLLLVLNKCGRRNKFGINRPAVLAPEDGLAMS
LHFMTLGGSSLSPTEGKGSGLQGHIIENPQYFSDASPSGVHHIKRRDIVLKWELGEGAFGKVFLAECHNLLPEQDKMLVAVKALKEASES
ARQDFQREAELLTMLQHQHIVRFFGVCTEGRPLLMVFEYMRHGDLNRFLRSHGPDAKLLAGGEDVAPGPLGLGQLLAVASQVAAGMVYLA
GLHFVHRDLATRNCLVGQGLVVKIGDFGMSRDIYSTDYYRVGGRTMLPIRWMPPESILYRKFTTESDVWSFGVVLWEIFTYGKQPWYQLS

--------------------------------------------------------------
>14643_14643_2_CD74-NTRK1_CD74_chr5_149782126_ENST00000009530_NTRK1_chr1_156844360_ENST00000368196_length(transcript)=2286nt_BP=882nt
AGATGCACAGGAGGAGAAGCAGGAGCTGTCGGGAAGATCAGAAGCCAGTCATGGATGACCAGCGCGACCTTATCTCCAACAATGAGCAAC
TGCCCATGCTGGGCCGGCGCCCTGGGGCCCCGGAGAGCAAGTGCAGCCGCGGAGCCCTGTACACAGGCTTTTCCATCCTGGTGACTCTGC
TCCTCGCTGGCCAGGCCACCACCGCCTACTTCCTGTACCAGCAGCAGGGCCGGCTGGACAAACTGACAGTCACCTCCCAGAACCTGCAGC
TGGAGAACCTGCGCATGAAGCTTCCCAAGCCTCCCAAGCCTGTGAGCAAGATGCGCATGGCCACCCCGCTGCTGATGCAGGCGCTGCCCA
TGGGAGCCCTGCCCCAGGGGCCCATGCAGAATGCCACCAAGTATGGCAACATGACAGAGGACCATGTGATGCACCTGCTCCAGAATGCTG
ACCCCCTGAAGGTGTACCCGCCACTGAAGGGGAGCTTCCCGGAGAACCTGAGACACCTTAAGAACACCATGGAGACCATAGACTGGAAGG
TCTTTGAGAGCTGGATGCACCATTGGCTCCTGTTTGAAATGAGCAGGCACTCCTTGGAGCAAAAGCCCACTGACGCTCCACCGAAAGTAC
TGACCAAGTGCCAGGAAGAGGTCAGCCACATCCCTGCTGTCCACCCGGGTTCATTCAGGCCCAAGTGCGACGAGAACGGCAACTATCTGC
CACTCCAGTGCTATGGGAGCATCGGCTACTGCTGGTGTGTCTTCCCCAACGGCACGGAGGTCCCCAACACCAGAAGCCGCGGGCACCATA
ACTGCAGTGAGTCACTGGAACTGGAGGACCCGTCTTCTGGGCTGGGTGTGACCAAGCAGGATCTGGGCCCAGACACTAACAGCACATCTG
GAGACCCGGTGGAGAAGAAGGACGAAACACCTTTTGGGGTCTCGGTGGCTGTGGGCCTGGCCGTCTTTGCCTGCCTCTTCCTTTCTACGC
TGCTCCTTGTGCTCAACAAATGTGGACGGAGAAACAAGTTTGGGATCAACCGCCCGGCTGTGCTGGCTCCAGAGGATGGGCTGGCCATGT
CCCTGCATTTCATGACATTGGGTGGCAGCTCCCTGTCCCCCACCGAGGGCAAAGGCTCTGGGCTCCAAGGCCACATCATCGAGAACCCAC
AATACTTCAGTGATGCCTGTGTTCACCACATCAAGCGCCGGGACATCGTGCTCAAGTGGGAGCTGGGGGAGGGCGCCTTTGGGAAGGTCT
TCCTTGCTGAGTGCCACAACCTCCTGCCTGAGCAGGACAAGATGCTGGTGGCTGTCAAGGCACTGAAGGAGGCGTCCGAGAGTGCTCGGC
AGGACTTCCAGCGTGAGGCTGAGCTGCTCACCATGCTGCAGCACCAGCACATCGTGCGCTTCTTCGGCGTCTGCACCGAGGGCCGCCCCC
TGCTCATGGTCTTTGAGTATATGCGGCACGGGGACCTCAACCGCTTCCTCCGATCCCATGGACCTGATGCCAAGCTGCTGGCTGGTGGGG
AGGATGTGGCTCCAGGCCCCCTGGGTCTGGGGCAGCTGCTGGCCGTGGCTAGCCAGGTCGCTGCGGGGATGGTGTACCTGGCGGGTCTGC
ATTTTGTGCACCGGGACCTGGCCACACGCAACTGTCTAGTGGGCCAGGGACTGGTGGTCAAGATTGGTGATTTTGGCATGAGCAGGGATA
TCTACAGCACCGACTATTACCGTGTGGGAGGCCGCACCATGCTGCCCATTCGCTGGATGCCGCCCGAGAGCATCCTGTACCGTAAGTTCA
CCACCGAGAGCGACGTGTGGAGCTTCGGCGTGGTGCTCTGGGAGATCTTCACCTACGGCAAGCAGCCCTGGTACCAGCTCTCCAACACGG
AGGCAATCGACTGCATCACGCAGGGACGTGAGTTGGAGCGGCCACGTGCCTGCCCACCAGAGGTCTACGCCATCATGCGGGGCTGCTGGC
AGCGGGAGCCCCAGCAACGCCACAGCATCAAGGATGTGCACGCCCGGCTGCAAGCCCTGGCCCAGGCACCTCCTGTCTACCTGGATGTCC
TGGGCTAGGGGGCCGGCCCAGGGGCTGGGAGTGGTTAGCCGGAATACTGGGGCCTGCCCTCAGCATCCCCCATAGCTCCCAGCAGCCCCA
GGGTGATCTCAAAGTATCTAATTCACCCTCAGCATGTGGGAAGGGACAGGTGGGGGCTGGGAGTAGAGGATGTTCCTGCTTCTCTAGGCA

>14643_14643_2_CD74-NTRK1_CD74_chr5_149782126_ENST00000009530_NTRK1_chr1_156844360_ENST00000368196_length(amino acids)=691AA_BP=293
MHRRRSRSCREDQKPVMDDQRDLISNNEQLPMLGRRPGAPESKCSRGALYTGFSILVTLLLAGQATTAYFLYQQQGRLDKLTVTSQNLQL
ENLRMKLPKPPKPVSKMRMATPLLMQALPMGALPQGPMQNATKYGNMTEDHVMHLLQNADPLKVYPPLKGSFPENLRHLKNTMETIDWKV
FESWMHHWLLFEMSRHSLEQKPTDAPPKVLTKCQEEVSHIPAVHPGSFRPKCDENGNYLPLQCYGSIGYCWCVFPNGTEVPNTRSRGHHN
CSESLELEDPSSGLGVTKQDLGPDTNSTSGDPVEKKDETPFGVSVAVGLAVFACLFLSTLLLVLNKCGRRNKFGINRPAVLAPEDGLAMS
LHFMTLGGSSLSPTEGKGSGLQGHIIENPQYFSDACVHHIKRRDIVLKWELGEGAFGKVFLAECHNLLPEQDKMLVAVKALKEASESARQ
DFQREAELLTMLQHQHIVRFFGVCTEGRPLLMVFEYMRHGDLNRFLRSHGPDAKLLAGGEDVAPGPLGLGQLLAVASQVAAGMVYLAGLH
FVHRDLATRNCLVGQGLVVKIGDFGMSRDIYSTDYYRVGGRTMLPIRWMPPESILYRKFTTESDVWSFGVVLWEIFTYGKQPWYQLSNTE

--------------------------------------------------------------
>14643_14643_3_CD74-NTRK1_CD74_chr5_149782126_ENST00000009530_NTRK1_chr1_156844360_ENST00000392302_length(transcript)=2230nt_BP=882nt
AGATGCACAGGAGGAGAAGCAGGAGCTGTCGGGAAGATCAGAAGCCAGTCATGGATGACCAGCGCGACCTTATCTCCAACAATGAGCAAC
TGCCCATGCTGGGCCGGCGCCCTGGGGCCCCGGAGAGCAAGTGCAGCCGCGGAGCCCTGTACACAGGCTTTTCCATCCTGGTGACTCTGC
TCCTCGCTGGCCAGGCCACCACCGCCTACTTCCTGTACCAGCAGCAGGGCCGGCTGGACAAACTGACAGTCACCTCCCAGAACCTGCAGC
TGGAGAACCTGCGCATGAAGCTTCCCAAGCCTCCCAAGCCTGTGAGCAAGATGCGCATGGCCACCCCGCTGCTGATGCAGGCGCTGCCCA
TGGGAGCCCTGCCCCAGGGGCCCATGCAGAATGCCACCAAGTATGGCAACATGACAGAGGACCATGTGATGCACCTGCTCCAGAATGCTG
ACCCCCTGAAGGTGTACCCGCCACTGAAGGGGAGCTTCCCGGAGAACCTGAGACACCTTAAGAACACCATGGAGACCATAGACTGGAAGG
TCTTTGAGAGCTGGATGCACCATTGGCTCCTGTTTGAAATGAGCAGGCACTCCTTGGAGCAAAAGCCCACTGACGCTCCACCGAAAGTAC
TGACCAAGTGCCAGGAAGAGGTCAGCCACATCCCTGCTGTCCACCCGGGTTCATTCAGGCCCAAGTGCGACGAGAACGGCAACTATCTGC
CACTCCAGTGCTATGGGAGCATCGGCTACTGCTGGTGTGTCTTCCCCAACGGCACGGAGGTCCCCAACACCAGAAGCCGCGGGCACCATA
ACTGCAGTGAGTCACTGGAACTGGAGGACCCGTCTTCTGGGCTGGGTGTGACCAAGCAGGATCTGGGCCCAGACACTAACAGCACATCTG
GAGACCCGGTGGAGAAGAAGGACGAAACACCTTTTGGGGTCTCGGTGGCTGTGGGCCTGGCCGTCTTTGCCTGCCTCTTCCTTTCTACGC
TGCTCCTTGTGCTCAACAAATGTGGACGGAGAAACAAGTTTGGGATCAACCGCCCGGCTGTGCTGGCTCCAGAGGATGGGCTGGCCATGT
CCCTGCATTTCATGACATTGGGTGGCAGCTCCCTGTCCCCCACCGAGGGCAAAGGCTCTGGGCTCCAAGGCCACATCATCGAGAACCCAC
AATACTTCAGTGATGCCTGTGTTCACCACATCAAGCGCCGGGACATCGTGCTCAAGTGGGAGCTGGGGGAGGGCGCCTTTGGGAAGGTCT
TCCTTGCTGAGTGCCACAACCTCCTGCCTGAGCAGGACAAGATGCTGGTGGCTGTCAAGGCACTGAAGGAGGCGTCCGAGAGTGCTCGGC
AGGACTTCCAGCGTGAGGCTGAGCTGCTCACCATGCTGCAGCACCAGCACATCGTGCGCTTCTTCGGCGTCTGCACCGAGGGCCGCCCCC
TGCTCATGGTCTTTGAGTATATGCGGCACGGGGACCTCAACCGCTTCCTCCGATCCCATGGACCTGATGCCAAGCTGCTGGCTGGTGGGG
AGGATGTGGCTCCAGGCCCCCTGGGTCTGGGGCAGCTGCTGGCCGTGGCTAGCCAGGTCGCTGCGGGGATGGTGTACCTGGCGGGTCTGC
ATTTTGTGCACCGGGACCTGGCCACACGCAACTGTCTAGTGGGCCAGGGACTGGTGGTCAAGATTGGTGATTTTGGCATGAGCAGGGATA
TCTACAGCACCGACTATTACCGTGTGGGAGGCCGCACCATGCTGCCCATTCGCTGGATGCCGCCCGAGAGCATCCTGTACCGTAAGTTCA
CCACCGAGAGCGACGTGTGGAGCTTCGGCGTGGTGCTCTGGGAGATCTTCACCTACGGCAAGCAGCCCTGGTACCAGCTCTCCAACACGG
AGGCAATCGACTGCATCACGCAGGGACGTGAGTTGGAGCGGCCACGTGCCTGCCCACCAGAGGTCTACGCCATCATGCGGGGCTGCTGGC
AGCGGGAGCCCCAGCAACGCCACAGCATCAAGGATGTGCACGCCCGGCTGCAAGCCCTGGCCCAGGCACCTCCTGTCTACCTGGATGTCC
TGGGCTAGGGGGCCGGCCCAGGGGCTGGGAGTGGTTAGCCGGAATACTGGGGCCTGCCCTCAGCATCCCCCATAGCTCCCAGCAGCCCCA

>14643_14643_3_CD74-NTRK1_CD74_chr5_149782126_ENST00000009530_NTRK1_chr1_156844360_ENST00000392302_length(amino acids)=691AA_BP=293
MHRRRSRSCREDQKPVMDDQRDLISNNEQLPMLGRRPGAPESKCSRGALYTGFSILVTLLLAGQATTAYFLYQQQGRLDKLTVTSQNLQL
ENLRMKLPKPPKPVSKMRMATPLLMQALPMGALPQGPMQNATKYGNMTEDHVMHLLQNADPLKVYPPLKGSFPENLRHLKNTMETIDWKV
FESWMHHWLLFEMSRHSLEQKPTDAPPKVLTKCQEEVSHIPAVHPGSFRPKCDENGNYLPLQCYGSIGYCWCVFPNGTEVPNTRSRGHHN
CSESLELEDPSSGLGVTKQDLGPDTNSTSGDPVEKKDETPFGVSVAVGLAVFACLFLSTLLLVLNKCGRRNKFGINRPAVLAPEDGLAMS
LHFMTLGGSSLSPTEGKGSGLQGHIIENPQYFSDACVHHIKRRDIVLKWELGEGAFGKVFLAECHNLLPEQDKMLVAVKALKEASESARQ
DFQREAELLTMLQHQHIVRFFGVCTEGRPLLMVFEYMRHGDLNRFLRSHGPDAKLLAGGEDVAPGPLGLGQLLAVASQVAAGMVYLAGLH
FVHRDLATRNCLVGQGLVVKIGDFGMSRDIYSTDYYRVGGRTMLPIRWMPPESILYRKFTTESDVWSFGVVLWEIFTYGKQPWYQLSNTE

--------------------------------------------------------------
>14643_14643_4_CD74-NTRK1_CD74_chr5_149782126_ENST00000009530_NTRK1_chr1_156844360_ENST00000524377_length(transcript)=2078nt_BP=882nt
AGATGCACAGGAGGAGAAGCAGGAGCTGTCGGGAAGATCAGAAGCCAGTCATGGATGACCAGCGCGACCTTATCTCCAACAATGAGCAAC
TGCCCATGCTGGGCCGGCGCCCTGGGGCCCCGGAGAGCAAGTGCAGCCGCGGAGCCCTGTACACAGGCTTTTCCATCCTGGTGACTCTGC
TCCTCGCTGGCCAGGCCACCACCGCCTACTTCCTGTACCAGCAGCAGGGCCGGCTGGACAAACTGACAGTCACCTCCCAGAACCTGCAGC
TGGAGAACCTGCGCATGAAGCTTCCCAAGCCTCCCAAGCCTGTGAGCAAGATGCGCATGGCCACCCCGCTGCTGATGCAGGCGCTGCCCA
TGGGAGCCCTGCCCCAGGGGCCCATGCAGAATGCCACCAAGTATGGCAACATGACAGAGGACCATGTGATGCACCTGCTCCAGAATGCTG
ACCCCCTGAAGGTGTACCCGCCACTGAAGGGGAGCTTCCCGGAGAACCTGAGACACCTTAAGAACACCATGGAGACCATAGACTGGAAGG
TCTTTGAGAGCTGGATGCACCATTGGCTCCTGTTTGAAATGAGCAGGCACTCCTTGGAGCAAAAGCCCACTGACGCTCCACCGAAAGTAC
TGACCAAGTGCCAGGAAGAGGTCAGCCACATCCCTGCTGTCCACCCGGGTTCATTCAGGCCCAAGTGCGACGAGAACGGCAACTATCTGC
CACTCCAGTGCTATGGGAGCATCGGCTACTGCTGGTGTGTCTTCCCCAACGGCACGGAGGTCCCCAACACCAGAAGCCGCGGGCACCATA
ACTGCAGTGAGTCACTGGAACTGGAGGACCCGTCTTCTGGGCTGGGTGTGACCAAGCAGGATCTGGGCCCAGACACTAACAGCACATCTG
GAGACCCGGTGGAGAAGAAGGACGAAACACCTTTTGGGGTCTCGGTGGCTGTGGGCCTGGCCGTCTTTGCCTGCCTCTTCCTTTCTACGC
TGCTCCTTGTGCTCAACAAATGTGGACGGAGAAACAAGTTTGGGATCAACCGCCCGGCTGTGCTGGCTCCAGAGGATGGGCTGGCCATGT
CCCTGCATTTCATGACATTGGGTGGCAGCTCCCTGTCCCCCACCGAGGGCAAAGGCTCTGGGCTCCAAGGCCACATCATCGAGAACCCAC
AATACTTCAGTGATGCCTGTGTTCACCACATCAAGCGCCGGGACATCGTGCTCAAGTGGGAGCTGGGGGAGGGCGCCTTTGGGAAGGTCT
TCCTTGCTGAGTGCCACAACCTCCTGCCTGAGCAGGACAAGATGCTGGTGGCTGTCAAGGCACTGAAGGAGGCGTCCGAGAGTGCTCGGC
AGGACTTCCAGCGTGAGGCTGAGCTGCTCACCATGCTGCAGCACCAGCACATCGTGCGCTTCTTCGGCGTCTGCACCGAGGGCCGCCCCC
TGCTCATGGTCTTTGAGTATATGCGGCACGGGGACCTCAACCGCTTCCTCCGATCCCATGGACCTGATGCCAAGCTGCTGGCTGGTGGGG
AGGATGTGGCTCCAGGCCCCCTGGGTCTGGGGCAGCTGCTGGCCGTGGCTAGCCAGGTCGCTGCGGGGATGGTGTACCTGGCGGGTCTGC
ATTTTGTGCACCGGGACCTGGCCACACGCAACTGTCTAGTGGGCCAGGGACTGGTGGTCAAGATTGGTGATTTTGGCATGAGCAGGGATA
TCTACAGCACCGACTATTACCGTGTGGGAGGCCGCACCATGCTGCCCATTCGCTGGATGCCGCCCGAGAGCATCCTGTACCGTAAGTTCA
CCACCGAGAGCGACGTGTGGAGCTTCGGCGTGGTGCTCTGGGAGATCTTCACCTACGGCAAGCAGCCCTGGTACCAGCTCTCCAACACGG
AGGCAATCGACTGCATCACGCAGGGACGTGAGTTGGAGCGGCCACGTGCCTGCCCACCAGAGGTCTACGCCATCATGCGGGGCTGCTGGC
AGCGGGAGCCCCAGCAACGCCACAGCATCAAGGATGTGCACGCCCGGCTGCAAGCCCTGGCCCAGGCACCTCCTGTCTACCTGGATGTCC

>14643_14643_4_CD74-NTRK1_CD74_chr5_149782126_ENST00000009530_NTRK1_chr1_156844360_ENST00000524377_length(amino acids)=692AA_BP=293
MHRRRSRSCREDQKPVMDDQRDLISNNEQLPMLGRRPGAPESKCSRGALYTGFSILVTLLLAGQATTAYFLYQQQGRLDKLTVTSQNLQL
ENLRMKLPKPPKPVSKMRMATPLLMQALPMGALPQGPMQNATKYGNMTEDHVMHLLQNADPLKVYPPLKGSFPENLRHLKNTMETIDWKV
FESWMHHWLLFEMSRHSLEQKPTDAPPKVLTKCQEEVSHIPAVHPGSFRPKCDENGNYLPLQCYGSIGYCWCVFPNGTEVPNTRSRGHHN
CSESLELEDPSSGLGVTKQDLGPDTNSTSGDPVEKKDETPFGVSVAVGLAVFACLFLSTLLLVLNKCGRRNKFGINRPAVLAPEDGLAMS
LHFMTLGGSSLSPTEGKGSGLQGHIIENPQYFSDACVHHIKRRDIVLKWELGEGAFGKVFLAECHNLLPEQDKMLVAVKALKEASESARQ
DFQREAELLTMLQHQHIVRFFGVCTEGRPLLMVFEYMRHGDLNRFLRSHGPDAKLLAGGEDVAPGPLGLGQLLAVASQVAAGMVYLAGLH
FVHRDLATRNCLVGQGLVVKIGDFGMSRDIYSTDYYRVGGRTMLPIRWMPPESILYRKFTTESDVWSFGVVLWEIFTYGKQPWYQLSNTE

--------------------------------------------------------------
>14643_14643_5_CD74-NTRK1_CD74_chr5_149782126_ENST00000353334_NTRK1_chr1_156844360_ENST00000358660_length(transcript)=2142nt_BP=868nt
GGGAGCCCCCCCGCCCCACATCCTGCCCCGCAAAAGGCAGCTTCACCAAAGTGGGGTATTTCCAGCCTTTGTAGCTTTCACTTCCACATC
TACCAAGTGGGCGGAGTGGCCTTCTGTGGACGAATCAGATTCCTCTCCAGCACCGACTTTAAGAGGCGAGCCGGGGGGTCAGGGTCCCAG
ATGCACAGGAGGAGAAGCAGGAGCTGTCGGGAAGATCAGAAGCCAGTCATGGATGACCAGCGCGACCTTATCTCCAACAATGAGCAACTG
CCCATGCTGGGCCGGCGCCCTGGGGCCCCGGAGAGCAAGTGCAGCCGCGGAGCCCTGTACACAGGCTTTTCCATCCTGGTGACTCTGCTC
CTCGCTGGCCAGGCCACCACCGCCTACTTCCTGTACCAGCAGCAGGGCCGGCTGGACAAACTGACAGTCACCTCCCAGAACCTGCAGCTG
GAGAACCTGCGCATGAAGCTTCCCAAGCCTCCCAAGCCTGTGAGCAAGATGCGCATGGCCACCCCGCTGCTGATGCAGGCGCTGCCCATG
GGAGCCCTGCCCCAGGGGCCCATGCAGAATGCCACCAAGTATGGCAACATGACAGAGGACCATGTGATGCACCTGCTCCAGAATGCTGAC
CCCCTGAAGGTGTACCCGCCACTGAAGGGGAGCTTCCCGGAGAACCTGAGACACCTTAAGAACACCATGGAGACCATAGACTGGAAGGTC
TTTGAGAGCTGGATGCACCATTGGCTCCTGTTTGAAATGAGCAGGCACTCCTTGGAGCAAAAGCCCACTGACGCTCCACCGAAAGAGTCA
CTGGAACTGGAGGACCCGTCTTCTGGGCTGGGTGTGACCAAGCAGGATCTGGGCCCAGACACTAACAGCACATCTGGAGACCCGGTGGAG
AAGAAGGACGAAACACCTTTTGGGGTCTCGGTGGCTGTGGGCCTGGCCGTCTTTGCCTGCCTCTTCCTTTCTACGCTGCTCCTTGTGCTC
AACAAATGTGGACGGAGAAACAAGTTTGGGATCAACCGCCCGGCTGTGCTGGCTCCAGAGGATGGGCTGGCCATGTCCCTGCATTTCATG
ACATTGGGTGGCAGCTCCCTGTCCCCCACCGAGGGCAAAGGCTCTGGGCTCCAAGGCCACATCATCGAGAACCCACAATACTTCAGTGAT
GCCTCCCCCTCAGGTGTTCACCACATCAAGCGCCGGGACATCGTGCTCAAGTGGGAGCTGGGGGAGGGCGCCTTTGGGAAGGTCTTCCTT
GCTGAGTGCCACAACCTCCTGCCTGAGCAGGACAAGATGCTGGTGGCTGTCAAGGCACTGAAGGAGGCGTCCGAGAGTGCTCGGCAGGAC
TTCCAGCGTGAGGCTGAGCTGCTCACCATGCTGCAGCACCAGCACATCGTGCGCTTCTTCGGCGTCTGCACCGAGGGCCGCCCCCTGCTC
ATGGTCTTTGAGTATATGCGGCACGGGGACCTCAACCGCTTCCTCCGATCCCATGGACCTGATGCCAAGCTGCTGGCTGGTGGGGAGGAT
GTGGCTCCAGGCCCCCTGGGTCTGGGGCAGCTGCTGGCCGTGGCTAGCCAGGTCGCTGCGGGGATGGTGTACCTGGCGGGTCTGCATTTT
GTGCACCGGGACCTGGCCACACGCAACTGTCTAGTGGGCCAGGGACTGGTGGTCAAGATTGGTGATTTTGGCATGAGCAGGGATATCTAC
AGCACCGACTATTACCGTGTGGGAGGCCGCACCATGCTGCCCATTCGCTGGATGCCGCCCGAGAGCATCCTGTACCGTAAGTTCACCACC
GAGAGCGACGTGTGGAGCTTCGGCGTGGTGCTCTGGGAGATCTTCACCTACGGCAAGCAGCCCTGGTACCAGCTCTCCAACACGGAGGCA
ATCGACTGCATCACGCAGGGACGTGAGTTGGAGCGGCCACGTGCCTGCCCACCAGAGGTCTACGCCATCATGCGGGGCTGCTGGCAGCGG
GAGCCCCAGCAACGCCACAGCATCAAGGATGTGCACGCCCGGCTGCAAGCCCTGGCCCAGGCACCTCCTGTCTACCTGGATGTCCTGGGC

>14643_14643_5_CD74-NTRK1_CD74_chr5_149782126_ENST00000353334_NTRK1_chr1_156844360_ENST00000358660_length(amino acids)=630AA_BP=229
MHRRRSRSCREDQKPVMDDQRDLISNNEQLPMLGRRPGAPESKCSRGALYTGFSILVTLLLAGQATTAYFLYQQQGRLDKLTVTSQNLQL
ENLRMKLPKPPKPVSKMRMATPLLMQALPMGALPQGPMQNATKYGNMTEDHVMHLLQNADPLKVYPPLKGSFPENLRHLKNTMETIDWKV
FESWMHHWLLFEMSRHSLEQKPTDAPPKESLELEDPSSGLGVTKQDLGPDTNSTSGDPVEKKDETPFGVSVAVGLAVFACLFLSTLLLVL
NKCGRRNKFGINRPAVLAPEDGLAMSLHFMTLGGSSLSPTEGKGSGLQGHIIENPQYFSDASPSGVHHIKRRDIVLKWELGEGAFGKVFL
AECHNLLPEQDKMLVAVKALKEASESARQDFQREAELLTMLQHQHIVRFFGVCTEGRPLLMVFEYMRHGDLNRFLRSHGPDAKLLAGGED
VAPGPLGLGQLLAVASQVAAGMVYLAGLHFVHRDLATRNCLVGQGLVVKIGDFGMSRDIYSTDYYRVGGRTMLPIRWMPPESILYRKFTT
ESDVWSFGVVLWEIFTYGKQPWYQLSNTEAIDCITQGRELERPRACPPEVYAIMRGCWQREPQQRHSIKDVHARLQALAQAPPVYLDVLG

--------------------------------------------------------------
>14643_14643_6_CD74-NTRK1_CD74_chr5_149782126_ENST00000353334_NTRK1_chr1_156844360_ENST00000368196_length(transcript)=2272nt_BP=868nt
GGGAGCCCCCCCGCCCCACATCCTGCCCCGCAAAAGGCAGCTTCACCAAAGTGGGGTATTTCCAGCCTTTGTAGCTTTCACTTCCACATC
TACCAAGTGGGCGGAGTGGCCTTCTGTGGACGAATCAGATTCCTCTCCAGCACCGACTTTAAGAGGCGAGCCGGGGGGTCAGGGTCCCAG
ATGCACAGGAGGAGAAGCAGGAGCTGTCGGGAAGATCAGAAGCCAGTCATGGATGACCAGCGCGACCTTATCTCCAACAATGAGCAACTG
CCCATGCTGGGCCGGCGCCCTGGGGCCCCGGAGAGCAAGTGCAGCCGCGGAGCCCTGTACACAGGCTTTTCCATCCTGGTGACTCTGCTC
CTCGCTGGCCAGGCCACCACCGCCTACTTCCTGTACCAGCAGCAGGGCCGGCTGGACAAACTGACAGTCACCTCCCAGAACCTGCAGCTG
GAGAACCTGCGCATGAAGCTTCCCAAGCCTCCCAAGCCTGTGAGCAAGATGCGCATGGCCACCCCGCTGCTGATGCAGGCGCTGCCCATG
GGAGCCCTGCCCCAGGGGCCCATGCAGAATGCCACCAAGTATGGCAACATGACAGAGGACCATGTGATGCACCTGCTCCAGAATGCTGAC
CCCCTGAAGGTGTACCCGCCACTGAAGGGGAGCTTCCCGGAGAACCTGAGACACCTTAAGAACACCATGGAGACCATAGACTGGAAGGTC
TTTGAGAGCTGGATGCACCATTGGCTCCTGTTTGAAATGAGCAGGCACTCCTTGGAGCAAAAGCCCACTGACGCTCCACCGAAAGAGTCA
CTGGAACTGGAGGACCCGTCTTCTGGGCTGGGTGTGACCAAGCAGGATCTGGGCCCAGACACTAACAGCACATCTGGAGACCCGGTGGAG
AAGAAGGACGAAACACCTTTTGGGGTCTCGGTGGCTGTGGGCCTGGCCGTCTTTGCCTGCCTCTTCCTTTCTACGCTGCTCCTTGTGCTC
AACAAATGTGGACGGAGAAACAAGTTTGGGATCAACCGCCCGGCTGTGCTGGCTCCAGAGGATGGGCTGGCCATGTCCCTGCATTTCATG
ACATTGGGTGGCAGCTCCCTGTCCCCCACCGAGGGCAAAGGCTCTGGGCTCCAAGGCCACATCATCGAGAACCCACAATACTTCAGTGAT
GCCTGTGTTCACCACATCAAGCGCCGGGACATCGTGCTCAAGTGGGAGCTGGGGGAGGGCGCCTTTGGGAAGGTCTTCCTTGCTGAGTGC
CACAACCTCCTGCCTGAGCAGGACAAGATGCTGGTGGCTGTCAAGGCACTGAAGGAGGCGTCCGAGAGTGCTCGGCAGGACTTCCAGCGT
GAGGCTGAGCTGCTCACCATGCTGCAGCACCAGCACATCGTGCGCTTCTTCGGCGTCTGCACCGAGGGCCGCCCCCTGCTCATGGTCTTT
GAGTATATGCGGCACGGGGACCTCAACCGCTTCCTCCGATCCCATGGACCTGATGCCAAGCTGCTGGCTGGTGGGGAGGATGTGGCTCCA
GGCCCCCTGGGTCTGGGGCAGCTGCTGGCCGTGGCTAGCCAGGTCGCTGCGGGGATGGTGTACCTGGCGGGTCTGCATTTTGTGCACCGG
GACCTGGCCACACGCAACTGTCTAGTGGGCCAGGGACTGGTGGTCAAGATTGGTGATTTTGGCATGAGCAGGGATATCTACAGCACCGAC
TATTACCGTGTGGGAGGCCGCACCATGCTGCCCATTCGCTGGATGCCGCCCGAGAGCATCCTGTACCGTAAGTTCACCACCGAGAGCGAC
GTGTGGAGCTTCGGCGTGGTGCTCTGGGAGATCTTCACCTACGGCAAGCAGCCCTGGTACCAGCTCTCCAACACGGAGGCAATCGACTGC
ATCACGCAGGGACGTGAGTTGGAGCGGCCACGTGCCTGCCCACCAGAGGTCTACGCCATCATGCGGGGCTGCTGGCAGCGGGAGCCCCAG
CAACGCCACAGCATCAAGGATGTGCACGCCCGGCTGCAAGCCCTGGCCCAGGCACCTCCTGTCTACCTGGATGTCCTGGGCTAGGGGGCC
GGCCCAGGGGCTGGGAGTGGTTAGCCGGAATACTGGGGCCTGCCCTCAGCATCCCCCATAGCTCCCAGCAGCCCCAGGGTGATCTCAAAG
TATCTAATTCACCCTCAGCATGTGGGAAGGGACAGGTGGGGGCTGGGAGTAGAGGATGTTCCTGCTTCTCTAGGCAAGGTCCCGTCATAG

>14643_14643_6_CD74-NTRK1_CD74_chr5_149782126_ENST00000353334_NTRK1_chr1_156844360_ENST00000368196_length(amino acids)=627AA_BP=229
MHRRRSRSCREDQKPVMDDQRDLISNNEQLPMLGRRPGAPESKCSRGALYTGFSILVTLLLAGQATTAYFLYQQQGRLDKLTVTSQNLQL
ENLRMKLPKPPKPVSKMRMATPLLMQALPMGALPQGPMQNATKYGNMTEDHVMHLLQNADPLKVYPPLKGSFPENLRHLKNTMETIDWKV
FESWMHHWLLFEMSRHSLEQKPTDAPPKESLELEDPSSGLGVTKQDLGPDTNSTSGDPVEKKDETPFGVSVAVGLAVFACLFLSTLLLVL
NKCGRRNKFGINRPAVLAPEDGLAMSLHFMTLGGSSLSPTEGKGSGLQGHIIENPQYFSDACVHHIKRRDIVLKWELGEGAFGKVFLAEC
HNLLPEQDKMLVAVKALKEASESARQDFQREAELLTMLQHQHIVRFFGVCTEGRPLLMVFEYMRHGDLNRFLRSHGPDAKLLAGGEDVAP
GPLGLGQLLAVASQVAAGMVYLAGLHFVHRDLATRNCLVGQGLVVKIGDFGMSRDIYSTDYYRVGGRTMLPIRWMPPESILYRKFTTESD

--------------------------------------------------------------
>14643_14643_7_CD74-NTRK1_CD74_chr5_149782126_ENST00000353334_NTRK1_chr1_156844360_ENST00000392302_length(transcript)=2216nt_BP=868nt
GGGAGCCCCCCCGCCCCACATCCTGCCCCGCAAAAGGCAGCTTCACCAAAGTGGGGTATTTCCAGCCTTTGTAGCTTTCACTTCCACATC
TACCAAGTGGGCGGAGTGGCCTTCTGTGGACGAATCAGATTCCTCTCCAGCACCGACTTTAAGAGGCGAGCCGGGGGGTCAGGGTCCCAG
ATGCACAGGAGGAGAAGCAGGAGCTGTCGGGAAGATCAGAAGCCAGTCATGGATGACCAGCGCGACCTTATCTCCAACAATGAGCAACTG
CCCATGCTGGGCCGGCGCCCTGGGGCCCCGGAGAGCAAGTGCAGCCGCGGAGCCCTGTACACAGGCTTTTCCATCCTGGTGACTCTGCTC
CTCGCTGGCCAGGCCACCACCGCCTACTTCCTGTACCAGCAGCAGGGCCGGCTGGACAAACTGACAGTCACCTCCCAGAACCTGCAGCTG
GAGAACCTGCGCATGAAGCTTCCCAAGCCTCCCAAGCCTGTGAGCAAGATGCGCATGGCCACCCCGCTGCTGATGCAGGCGCTGCCCATG
GGAGCCCTGCCCCAGGGGCCCATGCAGAATGCCACCAAGTATGGCAACATGACAGAGGACCATGTGATGCACCTGCTCCAGAATGCTGAC
CCCCTGAAGGTGTACCCGCCACTGAAGGGGAGCTTCCCGGAGAACCTGAGACACCTTAAGAACACCATGGAGACCATAGACTGGAAGGTC
TTTGAGAGCTGGATGCACCATTGGCTCCTGTTTGAAATGAGCAGGCACTCCTTGGAGCAAAAGCCCACTGACGCTCCACCGAAAGAGTCA
CTGGAACTGGAGGACCCGTCTTCTGGGCTGGGTGTGACCAAGCAGGATCTGGGCCCAGACACTAACAGCACATCTGGAGACCCGGTGGAG
AAGAAGGACGAAACACCTTTTGGGGTCTCGGTGGCTGTGGGCCTGGCCGTCTTTGCCTGCCTCTTCCTTTCTACGCTGCTCCTTGTGCTC
AACAAATGTGGACGGAGAAACAAGTTTGGGATCAACCGCCCGGCTGTGCTGGCTCCAGAGGATGGGCTGGCCATGTCCCTGCATTTCATG
ACATTGGGTGGCAGCTCCCTGTCCCCCACCGAGGGCAAAGGCTCTGGGCTCCAAGGCCACATCATCGAGAACCCACAATACTTCAGTGAT
GCCTGTGTTCACCACATCAAGCGCCGGGACATCGTGCTCAAGTGGGAGCTGGGGGAGGGCGCCTTTGGGAAGGTCTTCCTTGCTGAGTGC
CACAACCTCCTGCCTGAGCAGGACAAGATGCTGGTGGCTGTCAAGGCACTGAAGGAGGCGTCCGAGAGTGCTCGGCAGGACTTCCAGCGT
GAGGCTGAGCTGCTCACCATGCTGCAGCACCAGCACATCGTGCGCTTCTTCGGCGTCTGCACCGAGGGCCGCCCCCTGCTCATGGTCTTT
GAGTATATGCGGCACGGGGACCTCAACCGCTTCCTCCGATCCCATGGACCTGATGCCAAGCTGCTGGCTGGTGGGGAGGATGTGGCTCCA
GGCCCCCTGGGTCTGGGGCAGCTGCTGGCCGTGGCTAGCCAGGTCGCTGCGGGGATGGTGTACCTGGCGGGTCTGCATTTTGTGCACCGG
GACCTGGCCACACGCAACTGTCTAGTGGGCCAGGGACTGGTGGTCAAGATTGGTGATTTTGGCATGAGCAGGGATATCTACAGCACCGAC
TATTACCGTGTGGGAGGCCGCACCATGCTGCCCATTCGCTGGATGCCGCCCGAGAGCATCCTGTACCGTAAGTTCACCACCGAGAGCGAC
GTGTGGAGCTTCGGCGTGGTGCTCTGGGAGATCTTCACCTACGGCAAGCAGCCCTGGTACCAGCTCTCCAACACGGAGGCAATCGACTGC
ATCACGCAGGGACGTGAGTTGGAGCGGCCACGTGCCTGCCCACCAGAGGTCTACGCCATCATGCGGGGCTGCTGGCAGCGGGAGCCCCAG
CAACGCCACAGCATCAAGGATGTGCACGCCCGGCTGCAAGCCCTGGCCCAGGCACCTCCTGTCTACCTGGATGTCCTGGGCTAGGGGGCC
GGCCCAGGGGCTGGGAGTGGTTAGCCGGAATACTGGGGCCTGCCCTCAGCATCCCCCATAGCTCCCAGCAGCCCCAGGGTGATCTCAAAG

>14643_14643_7_CD74-NTRK1_CD74_chr5_149782126_ENST00000353334_NTRK1_chr1_156844360_ENST00000392302_length(amino acids)=627AA_BP=229
MHRRRSRSCREDQKPVMDDQRDLISNNEQLPMLGRRPGAPESKCSRGALYTGFSILVTLLLAGQATTAYFLYQQQGRLDKLTVTSQNLQL
ENLRMKLPKPPKPVSKMRMATPLLMQALPMGALPQGPMQNATKYGNMTEDHVMHLLQNADPLKVYPPLKGSFPENLRHLKNTMETIDWKV
FESWMHHWLLFEMSRHSLEQKPTDAPPKESLELEDPSSGLGVTKQDLGPDTNSTSGDPVEKKDETPFGVSVAVGLAVFACLFLSTLLLVL
NKCGRRNKFGINRPAVLAPEDGLAMSLHFMTLGGSSLSPTEGKGSGLQGHIIENPQYFSDACVHHIKRRDIVLKWELGEGAFGKVFLAEC
HNLLPEQDKMLVAVKALKEASESARQDFQREAELLTMLQHQHIVRFFGVCTEGRPLLMVFEYMRHGDLNRFLRSHGPDAKLLAGGEDVAP
GPLGLGQLLAVASQVAAGMVYLAGLHFVHRDLATRNCLVGQGLVVKIGDFGMSRDIYSTDYYRVGGRTMLPIRWMPPESILYRKFTTESD

--------------------------------------------------------------
>14643_14643_8_CD74-NTRK1_CD74_chr5_149782126_ENST00000353334_NTRK1_chr1_156844360_ENST00000524377_length(transcript)=2064nt_BP=868nt
GGGAGCCCCCCCGCCCCACATCCTGCCCCGCAAAAGGCAGCTTCACCAAAGTGGGGTATTTCCAGCCTTTGTAGCTTTCACTTCCACATC
TACCAAGTGGGCGGAGTGGCCTTCTGTGGACGAATCAGATTCCTCTCCAGCACCGACTTTAAGAGGCGAGCCGGGGGGTCAGGGTCCCAG
ATGCACAGGAGGAGAAGCAGGAGCTGTCGGGAAGATCAGAAGCCAGTCATGGATGACCAGCGCGACCTTATCTCCAACAATGAGCAACTG
CCCATGCTGGGCCGGCGCCCTGGGGCCCCGGAGAGCAAGTGCAGCCGCGGAGCCCTGTACACAGGCTTTTCCATCCTGGTGACTCTGCTC
CTCGCTGGCCAGGCCACCACCGCCTACTTCCTGTACCAGCAGCAGGGCCGGCTGGACAAACTGACAGTCACCTCCCAGAACCTGCAGCTG
GAGAACCTGCGCATGAAGCTTCCCAAGCCTCCCAAGCCTGTGAGCAAGATGCGCATGGCCACCCCGCTGCTGATGCAGGCGCTGCCCATG
GGAGCCCTGCCCCAGGGGCCCATGCAGAATGCCACCAAGTATGGCAACATGACAGAGGACCATGTGATGCACCTGCTCCAGAATGCTGAC
CCCCTGAAGGTGTACCCGCCACTGAAGGGGAGCTTCCCGGAGAACCTGAGACACCTTAAGAACACCATGGAGACCATAGACTGGAAGGTC
TTTGAGAGCTGGATGCACCATTGGCTCCTGTTTGAAATGAGCAGGCACTCCTTGGAGCAAAAGCCCACTGACGCTCCACCGAAAGAGTCA
CTGGAACTGGAGGACCCGTCTTCTGGGCTGGGTGTGACCAAGCAGGATCTGGGCCCAGACACTAACAGCACATCTGGAGACCCGGTGGAG
AAGAAGGACGAAACACCTTTTGGGGTCTCGGTGGCTGTGGGCCTGGCCGTCTTTGCCTGCCTCTTCCTTTCTACGCTGCTCCTTGTGCTC
AACAAATGTGGACGGAGAAACAAGTTTGGGATCAACCGCCCGGCTGTGCTGGCTCCAGAGGATGGGCTGGCCATGTCCCTGCATTTCATG
ACATTGGGTGGCAGCTCCCTGTCCCCCACCGAGGGCAAAGGCTCTGGGCTCCAAGGCCACATCATCGAGAACCCACAATACTTCAGTGAT
GCCTGTGTTCACCACATCAAGCGCCGGGACATCGTGCTCAAGTGGGAGCTGGGGGAGGGCGCCTTTGGGAAGGTCTTCCTTGCTGAGTGC
CACAACCTCCTGCCTGAGCAGGACAAGATGCTGGTGGCTGTCAAGGCACTGAAGGAGGCGTCCGAGAGTGCTCGGCAGGACTTCCAGCGT
GAGGCTGAGCTGCTCACCATGCTGCAGCACCAGCACATCGTGCGCTTCTTCGGCGTCTGCACCGAGGGCCGCCCCCTGCTCATGGTCTTT
GAGTATATGCGGCACGGGGACCTCAACCGCTTCCTCCGATCCCATGGACCTGATGCCAAGCTGCTGGCTGGTGGGGAGGATGTGGCTCCA
GGCCCCCTGGGTCTGGGGCAGCTGCTGGCCGTGGCTAGCCAGGTCGCTGCGGGGATGGTGTACCTGGCGGGTCTGCATTTTGTGCACCGG
GACCTGGCCACACGCAACTGTCTAGTGGGCCAGGGACTGGTGGTCAAGATTGGTGATTTTGGCATGAGCAGGGATATCTACAGCACCGAC
TATTACCGTGTGGGAGGCCGCACCATGCTGCCCATTCGCTGGATGCCGCCCGAGAGCATCCTGTACCGTAAGTTCACCACCGAGAGCGAC
GTGTGGAGCTTCGGCGTGGTGCTCTGGGAGATCTTCACCTACGGCAAGCAGCCCTGGTACCAGCTCTCCAACACGGAGGCAATCGACTGC
ATCACGCAGGGACGTGAGTTGGAGCGGCCACGTGCCTGCCCACCAGAGGTCTACGCCATCATGCGGGGCTGCTGGCAGCGGGAGCCCCAG

>14643_14643_8_CD74-NTRK1_CD74_chr5_149782126_ENST00000353334_NTRK1_chr1_156844360_ENST00000524377_length(amino acids)=627AA_BP=229
MHRRRSRSCREDQKPVMDDQRDLISNNEQLPMLGRRPGAPESKCSRGALYTGFSILVTLLLAGQATTAYFLYQQQGRLDKLTVTSQNLQL
ENLRMKLPKPPKPVSKMRMATPLLMQALPMGALPQGPMQNATKYGNMTEDHVMHLLQNADPLKVYPPLKGSFPENLRHLKNTMETIDWKV
FESWMHHWLLFEMSRHSLEQKPTDAPPKESLELEDPSSGLGVTKQDLGPDTNSTSGDPVEKKDETPFGVSVAVGLAVFACLFLSTLLLVL
NKCGRRNKFGINRPAVLAPEDGLAMSLHFMTLGGSSLSPTEGKGSGLQGHIIENPQYFSDACVHHIKRRDIVLKWELGEGAFGKVFLAEC
HNLLPEQDKMLVAVKALKEASESARQDFQREAELLTMLQHQHIVRFFGVCTEGRPLLMVFEYMRHGDLNRFLRSHGPDAKLLAGGEDVAP
GPLGLGQLLAVASQVAAGMVYLAGLHFVHRDLATRNCLVGQGLVVKIGDFGMSRDIYSTDYYRVGGRTMLPIRWMPPESILYRKFTTESD

--------------------------------------------------------------
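The sequence records above carry their metadata in the FASTA header line. A minimal sketch of how such a header could be split into fields, assuming the underscore-delimited layout seen in the records on this page; the function name `parse_fusion_header` and the field name `record_id` are hypothetical labels chosen for illustration, not part of FusionGDB2, and gene symbols that contain underscores would not match this pattern.

```python
import re

# Pattern inferred from the headers shown on this page (an assumption,
# not an official FusionGDB2 specification).
HEADER_RE = re.compile(
    r"^>(?P<record_id>\d+_\d+_\d+)_(?P<fusion>[^_]+)_"
    r"(?P<hgene>[^_]+)_(?P<hchr>chr[^_]+)_(?P<hbp>\d+)_(?P<henst>ENST\d+)_"
    r"(?P<tgene>[^_]+)_(?P<tchr>chr[^_]+)_(?P<tbp>\d+)_(?P<tenst>ENST\d+)_"
    r"length\((?P<kind>transcript|amino acids)\)=(?P<length>\d+)(?:nt|AA)_"
    r"BP=(?P<bp>\d+)(?:nt)?$"
)


def parse_fusion_header(header: str) -> dict:
    """Split one header line into a dict; numeric fields become ints."""
    match = HEADER_RE.match(header.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {header!r}")
    fields = match.groupdict()
    for key in ("hbp", "tbp", "length", "bp"):
        fields[key] = int(fields[key])
    return fields


if __name__ == "__main__":
    example = (
        ">14643_14643_4_CD74-NTRK1_CD74_chr5_149782126_ENST00000009530_"
        "NTRK1_chr1_156844360_ENST00000524377_length(transcript)=2078nt_BP=882nt"
    )
    info = parse_fusion_header(example)
    print(info["hgene"], info["tgene"], info["length"], info["bp"])
```

Keeping the genomic breakpoints (`hbp`, `tbp`) separate from the within-sequence breakpoint (`bp`) makes it easier to pair each transcript record with its amino-acid record.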


Fusion Gene PPI Analysis for CD74-NTRK1


See the ChiPPI (Chimeric Protein-Protein Interactions) page for the chimeric PPI interactions of this fusion.


Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
Hgene | Hgene's interactors | Tgene | Tgene's interactors


- Retained PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Still interaction with
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000368196 |  | 7 | 16 | 469_490 | 392.3333333333333 | 791.0 | SQSTM1
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000392302 |  | 8 | 17 | 469_490 | 362.3333333333333 | 761.0 | SQSTM1
Tgene | NTRK1 | chr5:149782126 | chr1:156844360 | ENST00000524377 |  | 8 | 17 | 469_490 | 398.3333333333333 | 797.0 | SQSTM1
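
In the retained-PPI rows above, the protein feature loci (469_490) lie downstream of each breakpoint locus (for example 392.33), which fits a 3' partner (Tgene) keeping everything after its breakpoint. The sketch below illustrates that comparison. It is an assumption about how such a call could be made, not the documented FusionGDB2 method, and `feature_retained` is a hypothetical helper name.

```python
def feature_retained(partner: str, feature_loci: str, bp_loci: float) -> bool:
    """Return True if a protein feature survives in the fusion protein.

    Assumed rule (not taken from FusionGDB2 documentation):
      - Hgene (5' partner) keeps residues before its breakpoint.
      - Tgene (3' partner) keeps residues after its breakpoint.
    feature_loci uses the "start_end" form shown in the table, e.g. "469_490".
    """
    start, end = (int(part) for part in feature_loci.split("_"))
    if partner == "Hgene":
        return end < bp_loci
    if partner == "Tgene":
        return start > bp_loci
    raise ValueError(f"unknown partner: {partner!r}")


if __name__ == "__main__":
    # Values copied from the three NTRK1 rows in the table above.
    for bp in (392.3333333333333, 362.3333333333333, 398.3333333333333):
        print(bp, feature_retained("Tgene", "469_490", bp))
```

Under this rule all three NTRK1 transcripts keep the 469_490 feature, matching the SQSTM1 interaction listed as retained.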


- Lost PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with


- Retained PPIs, but lost function due to frame-shift fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with



Related Drugs for CD74-NTRK1


Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
Partner | Gene | UniProtAcc | DrugBank ID | Drug name | Drug activity | Drug type | Drug status
Tgene | NTRK1 | P04629 | DB00321 | Amitriptyline | Activator|Agonist | Small molecule | Approved
Tgene | NTRK1 | P04629 | DB00619 | Imatinib | Antagonist | Small molecule | Approved
Tgene | NTRK1 | P04629 | DB08896 | Regorafenib | Inhibitor | Small molecule | Approved
Tgene | NTRK1 | P04629 | DB11986 | Entrectinib | Inhibitor | Small molecule | Approved|Investigational
Tgene | NTRK1 | P04629 | DB12010 | Fostamatinib | Inhibitor | Small molecule | Approved|Investigational
Tgene | NTRK1 | P04629 | DB13926 | Cenegermin | Stimulator | Biotech | Approved|Investigational
Tgene | NTRK1 | P04629 | DB14723 | Larotrectinib | Inhibitor | Small molecule | Approved|Investigational
Tgene | NTRK1 | P04629 | DB15822 | Pralsetinib | Inhibitor | Small molecule | Approved|Investigational


Related Diseases for CD74-NTRK1


Diseases associated with fusion partners.
(DisGeNet 4.0)
Partner | Gene | Disease ID | Disease name | # pubmeds | Source
Hgene | CD74 | C0006142 | Malignant neoplasm of breast | 1 | CTD_human
Hgene | CD74 | C0007131 | Non-Small Cell Lung Carcinoma | 1 | CTD_human
Hgene | CD74 | C0023893 | Liver Cirrhosis, Experimental | 1 | CTD_human
Hgene | CD74 | C0162557 | Liver Failure, Acute | 1 | CTD_human
Hgene | CD74 | C0678222 | Breast Carcinoma | 1 | CTD_human
Hgene | CD74 | C1257931 | Mammary Neoplasms, Human | 1 | CTD_human
Hgene | CD74 | C1458155 | Mammary Neoplasms | 1 | CTD_human
Hgene | CD74 | C4704874 | Mammary Carcinoma, Human | 1 | CTD_human
Tgene | NTRK1 | C0020074 | HSAN Type IV | 17 | CTD_human;GENOMICS_ENGLAND;UNIPROT
Tgene | NTRK1 | C0238463 | Papillary thyroid carcinoma | 3 | ORPHANET
Tgene | NTRK1 | C0002768 | Congenital Pain Insensitivity | 1 | ORPHANET
Tgene | NTRK1 | C0005586 | Bipolar Disorder | 1 | CTD_human
Tgene | NTRK1 | C0005587 | Depression, Bipolar | 1 | CTD_human
Tgene | NTRK1 | C0017638 | Glioma | 1 | CTD_human
Tgene | NTRK1 | C0020075 | Hereditary Sensory Autonomic Neuropathy, Type 5 | 1 | CTD_human;ORPHANET
Tgene | NTRK1 | C0024713 | Manic Disorder | 1 | CTD_human
Tgene | NTRK1 | C0027796 | Neuralgia | 1 | CTD_human
Tgene | NTRK1 | C0027819 | Neuroblastoma | 1 | CTD_human
Tgene | NTRK1 | C0033958 | Psychosis, Brief Reactive | 1 | CTD_human
Tgene | NTRK1 | C0033975 | Psychotic Disorders | 1 | CTD_human
Tgene | NTRK1 | C0036337 | Schizoaffective Disorder | 1 | CTD_human
Tgene | NTRK1 | C0036341 | Schizophrenia | 1 | CTD_human
Tgene | NTRK1 | C0036358 | Schizophreniform Disorders | 1 | CTD_human
Tgene | NTRK1 | C0038870 | Neuralgia, Supraorbital | 1 | CTD_human
Tgene | NTRK1 | C0042656 | Neuralgia, Vidian | 1 | CTD_human
Tgene | NTRK1 | C0234247 | Neuralgia, Atypical | 1 | CTD_human
Tgene | NTRK1 | C0234249 | Neuralgia, Stump | 1 | CTD_human
Tgene | NTRK1 | C0259783 | mixed gliomas | 1 | CTD_human
Tgene | NTRK1 | C0273115 | Lung Injury | 1 | CTD_human
Tgene | NTRK1 | C0338831 | Manic | 1 | CTD_human
Tgene | NTRK1 | C0423711 | Neuralgia, Perineal | 1 | CTD_human
Tgene | NTRK1 | C0423712 | Neuralgia, Iliohypogastric Nerve | 1 | CTD_human
Tgene | NTRK1 | C0555198 | Malignant Glioma | 1 | CTD_human
Tgene | NTRK1 | C0598589 | Inherited neuropathies | 1 | GENOMICS_ENGLAND
Tgene | NTRK1 | C0751371 | Neuralgia, Ilioinguinal | 1 | CTD_human
Tgene | NTRK1 | C0751372 | Nerve Pain | 1 | CTD_human
Tgene | NTRK1 | C0751373 | Paroxysmal Nerve Pain | 1 | CTD_human
Tgene | NTRK1 | C0752347 | Lewy Body Disease | 1 | CTD_human
Tgene | NTRK1 | C1833921 | Familial medullary thyroid carcinoma | 1 | CTD_human;GENOMICS_ENGLAND;ORPHANET
Tgene | NTRK1 | C2350344 | Chronic Lung Injury | 1 | CTD_human