Fusion Gene Studies
in Kim Lab

FusionBase FusionGDB FusionGDB2 FusionPDB FusionNeoAntigen FusionAI FusionNW FGviewer Publication Contact
FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:ATXN7L1-EGFR (FusionGDB2 ID:HG222255TG1956)

Fusion Gene Summary for ATXN7L1-EGFR

check button Fusion gene summary
Fusion gene informationFusion gene name: ATXN7L1-EGFR
Fusion gene ID: hg222255tg1956
HgeneTgene
Gene symbol

ATXN7L1

EGFR

Gene ID

222255

1956

Gene nameataxin 7 like 1epidermal growth factor receptor
SynonymsATXN7L4ERBB|ERBB1|HER1|NISBD2|PIG61|mENA
Cytomap('ATXN7L1')('EGFR')

7q22.3

7p11.2

Type of geneprotein-codingprotein-coding
Descriptionataxin-7-like protein 1ataxin 7-like 4ataxin-7-like protein 4epidermal growth factor receptoravian erythroblastic leukemia viral (v-erb-b) oncogene homologcell growth inhibiting protein 40cell proliferation-inducing protein 61epidermal growth factor receptor tyrosine kinase domainerb-b2 receptor tyrosine kinas
Modification date2020031320200329
UniProtAcc..
Ensembl transtripts involved in fusion geneENST00000318724, ENST00000419735, 
ENST00000388807, ENST00000472910, 
ENST00000477775, ENST00000478915, 
ENST00000318724, ENST00000388807, 
ENST00000472910, ENST00000478915, 
ENST00000419735, ENST00000477775, 
Fusion gene scores* DoF score13 X 10 X 5=65017 X 20 X 8=2720
# samples 1322
** MAII scorelog2(13/650*10)=-2.32192809488736
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(22/2720*10)=-3.62803122261304
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: ATXN7L1 [Title/Abstract] AND EGFR [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointATXN7L1(105516257)-EGFR(55209978), # samples:1
Anticipated loss of major functional domain due to fusion event.EGFR-ATXN7L1 seems lost the major protein functional domain in Hgene partner, which is a CGC by not retaining the major functional domain in the partially deleted in-frame ORF.
EGFR-ATXN7L1 seems lost the major protein functional domain in Hgene partner, which is a essential gene by not retaining the major functional domain in the partially deleted in-frame ORF.
ATXN7L1-EGFR seems lost the major protein functional domain in Hgene partner, which is a CGC by not retaining the major functional domain in the partially deleted in-frame ORF.
ATXN7L1-EGFR seems lost the major protein functional domain in Hgene partner, which is a CGC by not retaining the major functional domain in the partially deleted in-frame ORF.
ATXN7L1-EGFR seems lost the major protein functional domain in Hgene partner, which is a essential gene by not retaining the major functional domain in the partially deleted in-frame ORF.
ATXN7L1-EGFR seems lost the major protein functional domain in Hgene partner, which is a essential gene by not retaining the major functional domain in the partially deleted in-frame ORF.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
TgeneEGFR

GO:0001934

positive regulation of protein phosphorylation

20551055

TgeneEGFR

GO:0007165

signal transduction

10572067

TgeneEGFR

GO:0007166

cell surface receptor signaling pathway

7736574

TgeneEGFR

GO:0007173

epidermal growth factor receptor signaling pathway

7736574|12435727

TgeneEGFR

GO:0008283

cell proliferation

17115032

TgeneEGFR

GO:0008284

positive regulation of cell proliferation

7736574

TgeneEGFR

GO:0010750

positive regulation of nitric oxide mediated signal transduction

12828935

TgeneEGFR

GO:0018108

peptidyl-tyrosine phosphorylation

22732145

TgeneEGFR

GO:0030307

positive regulation of cell growth

15467833

TgeneEGFR

GO:0042177

negative regulation of protein catabolic process

17115032

TgeneEGFR

GO:0042327

positive regulation of phosphorylation

15082764

TgeneEGFR

GO:0043406

positive regulation of MAP kinase activity

10572067

TgeneEGFR

GO:0045739

positive regulation of DNA repair

17115032

TgeneEGFR

GO:0045740

positive regulation of DNA replication

17115032

TgeneEGFR

GO:0045944

positive regulation of transcription by RNA polymerase II

20551055

TgeneEGFR

GO:0050679

positive regulation of epithelial cell proliferation

10572067

TgeneEGFR

GO:0050999

regulation of nitric-oxide synthase activity

12828935

TgeneEGFR

GO:0070141

response to UV-A

18483258

TgeneEGFR

GO:0070374

positive regulation of ERK1 and ERK2 cascade

20551055

TgeneEGFR

GO:0071392

cellular response to estradiol stimulus

20551055

TgeneEGFR

GO:1900020

positive regulation of protein kinase C activity

22732145

TgeneEGFR

GO:1903078

positive regulation of protein localization to plasma membrane

22732145


check buttonFusion gene breakpoints across ATXN7L1 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure
check buttonFusion gene breakpoints across EGFR (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4STADTCGA-VQ-AA6IATXN7L1chr7

105516257

-EGFRchr7

55209978

+


Top

Fusion Gene ORF analysis for ATXN7L1-EGFR

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
5CDS-5UTRENST00000318724ENST00000454757ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
5CDS-5UTRENST00000419735ENST00000454757ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
5CDS-intronENST00000318724ENST00000463948ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
5CDS-intronENST00000419735ENST00000463948ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
In-frameENST00000318724ENST00000275493ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
In-frameENST00000318724ENST00000342916ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
In-frameENST00000318724ENST00000344576ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
In-frameENST00000318724ENST00000420316ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
In-frameENST00000318724ENST00000442591ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
In-frameENST00000318724ENST00000455089ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
In-frameENST00000419735ENST00000275493ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
In-frameENST00000419735ENST00000342916ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
In-frameENST00000419735ENST00000344576ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
In-frameENST00000419735ENST00000420316ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
In-frameENST00000419735ENST00000442591ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
In-frameENST00000419735ENST00000455089ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-3CDSENST00000388807ENST00000275493ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-3CDSENST00000388807ENST00000342916ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-3CDSENST00000388807ENST00000344576ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-3CDSENST00000388807ENST00000420316ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-3CDSENST00000388807ENST00000442591ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-3CDSENST00000388807ENST00000455089ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-3CDSENST00000472910ENST00000275493ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-3CDSENST00000472910ENST00000342916ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-3CDSENST00000472910ENST00000344576ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-3CDSENST00000472910ENST00000420316ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-3CDSENST00000472910ENST00000442591ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-3CDSENST00000472910ENST00000455089ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-3CDSENST00000477775ENST00000275493ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-3CDSENST00000477775ENST00000342916ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-3CDSENST00000477775ENST00000344576ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-3CDSENST00000477775ENST00000420316ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-3CDSENST00000477775ENST00000442591ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-3CDSENST00000477775ENST00000455089ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-3CDSENST00000478915ENST00000275493ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-3CDSENST00000478915ENST00000342916ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-3CDSENST00000478915ENST00000344576ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-3CDSENST00000478915ENST00000420316ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-3CDSENST00000478915ENST00000442591ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-3CDSENST00000478915ENST00000455089ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-5UTRENST00000388807ENST00000454757ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-5UTRENST00000472910ENST00000454757ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-5UTRENST00000477775ENST00000454757ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-5UTRENST00000478915ENST00000454757ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-intronENST00000388807ENST00000463948ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-intronENST00000472910ENST00000463948ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-intronENST00000477775ENST00000463948ATXN7L1chr7

105516257

-EGFRchr7

55209978

+
intron-intronENST00000478915ENST00000463948ATXN7L1chr7

105516257

-EGFRchr7

55209978

+

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000419735ATXN7L1chr7105516257-ENST00000455089EGFRchr755209978+37952964634831145
ENST00000419735ATXN7L1chr7105516257-ENST00000342916EGFRchr755209978+2201296462094682
ENST00000419735ATXN7L1chr7105516257-ENST00000344576EGFRchr755209978+2827296462325759
ENST00000419735ATXN7L1chr7105516257-ENST00000420316EGFRchr755209978+1534296461425459
ENST00000419735ATXN7L1chr7105516257-ENST00000275493EGFRchr755209978+98522964638401264
ENST00000419735ATXN7L1chr7105516257-ENST00000442591EGFRchr755209978+2577296462181711
ENST00000318724ATXN7L1chr7105516257-ENST00000455089EGFRchr755209978+37732742434611145
ENST00000318724ATXN7L1chr7105516257-ENST00000342916EGFRchr755209978+2179274242072682
ENST00000318724ATXN7L1chr7105516257-ENST00000344576EGFRchr755209978+2805274242303759
ENST00000318724ATXN7L1chr7105516257-ENST00000420316EGFRchr755209978+1512274241403459
ENST00000318724ATXN7L1chr7105516257-ENST00000275493EGFRchr755209978+98302742438181264
ENST00000318724ATXN7L1chr7105516257-ENST00000442591EGFRchr755209978+2555274242159711

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000419735ENST00000455089ATXN7L1chr7105516257-EGFRchr755209978+0.0004929830.99950707
ENST00000419735ENST00000342916ATXN7L1chr7105516257-EGFRchr755209978+0.0008270070.999173
ENST00000419735ENST00000344576ATXN7L1chr7105516257-EGFRchr755209978+0.0007466530.99925333
ENST00000419735ENST00000420316ATXN7L1chr7105516257-EGFRchr755209978+0.0009930290.99900705
ENST00000419735ENST00000275493ATXN7L1chr7105516257-EGFRchr755209978+0.0002994210.99970067
ENST00000419735ENST00000442591ATXN7L1chr7105516257-EGFRchr755209978+0.0007391620.99926084
ENST00000318724ENST00000455089ATXN7L1chr7105516257-EGFRchr755209978+0.0004697610.9995303
ENST00000318724ENST00000342916ATXN7L1chr7105516257-EGFRchr755209978+0.0007463930.9992536
ENST00000318724ENST00000344576ATXN7L1chr7105516257-EGFRchr755209978+0.0006929650.99930704
ENST00000318724ENST00000420316ATXN7L1chr7105516257-EGFRchr755209978+0.000905110.99909496
ENST00000318724ENST00000275493ATXN7L1chr7105516257-EGFRchr755209978+0.0002885780.99971145
ENST00000318724ENST00000442591ATXN7L1chr7105516257-EGFRchr755209978+0.0006800090.99932003

Top

Fusion Genomic Features for ATXN7L1-EGFR


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)
ATXN7L1chr7105516257-EGFRchr755209978+5.18E-081
ATXN7L1chr7105516257-EGFRchr755209978+5.18E-081

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.
genomic feature of top 1%

Top

Fusion Protein Features for ATXN7L1-EGFR


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr7:105516257/chr7:55209978)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
..
FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
TgeneEGFRchr7:105516257chr7:55209978ENST00000275493028712_979291211.0DomainProtein kinase
TgeneEGFRchr7:105516257chr7:55209978ENST00000342916016712_97929629.0DomainProtein kinase
TgeneEGFRchr7:105516257chr7:55209978ENST00000344576016712_97929706.0DomainProtein kinase
TgeneEGFRchr7:105516257chr7:55209978ENST00000420316010712_97929406.0DomainProtein kinase
TgeneEGFRchr7:105516257chr7:55209978ENST00000275493028718_726291211.0Nucleotide bindingATP
TgeneEGFRchr7:105516257chr7:55209978ENST00000275493028790_791291211.0Nucleotide bindingATP
TgeneEGFRchr7:105516257chr7:55209978ENST00000342916016718_72629629.0Nucleotide bindingATP
TgeneEGFRchr7:105516257chr7:55209978ENST00000342916016790_79129629.0Nucleotide bindingATP
TgeneEGFRchr7:105516257chr7:55209978ENST00000344576016718_72629706.0Nucleotide bindingATP
TgeneEGFRchr7:105516257chr7:55209978ENST00000344576016790_79129706.0Nucleotide bindingATP
TgeneEGFRchr7:105516257chr7:55209978ENST00000420316010718_72629406.0Nucleotide bindingATP
TgeneEGFRchr7:105516257chr7:55209978ENST00000420316010790_79129406.0Nucleotide bindingATP
TgeneEGFRchr7:105516257chr7:55209978ENST00000275493028688_704291211.0RegionNote=Important for dimerization%2C phosphorylation and activation
TgeneEGFRchr7:105516257chr7:55209978ENST00000342916016688_70429629.0RegionNote=Important for dimerization%2C phosphorylation and activation
TgeneEGFRchr7:105516257chr7:55209978ENST00000344576016688_70429706.0RegionNote=Important for dimerization%2C phosphorylation and activation
TgeneEGFRchr7:105516257chr7:55209978ENST00000420316010688_70429406.0RegionNote=Important for dimerization%2C phosphorylation and activation
TgeneEGFRchr7:105516257chr7:55209978ENST00000275493028390_600291211.0RepeatNote=Approximate
TgeneEGFRchr7:105516257chr7:55209978ENST0000027549302875_300291211.0RepeatNote=Approximate
TgeneEGFRchr7:105516257chr7:55209978ENST00000342916016390_60029629.0RepeatNote=Approximate
TgeneEGFRchr7:105516257chr7:55209978ENST0000034291601675_30029629.0RepeatNote=Approximate
TgeneEGFRchr7:105516257chr7:55209978ENST00000344576016390_60029706.0RepeatNote=Approximate
TgeneEGFRchr7:105516257chr7:55209978ENST0000034457601675_30029706.0RepeatNote=Approximate
TgeneEGFRchr7:105516257chr7:55209978ENST00000420316010390_60029406.0RepeatNote=Approximate
TgeneEGFRchr7:105516257chr7:55209978ENST0000042031601075_30029406.0RepeatNote=Approximate
TgeneEGFRchr7:105516257chr7:55209978ENST00000275493028669_1210291211.0Topological domainCytoplasmic
TgeneEGFRchr7:105516257chr7:55209978ENST00000342916016669_121029629.0Topological domainCytoplasmic
TgeneEGFRchr7:105516257chr7:55209978ENST00000344576016669_121029706.0Topological domainCytoplasmic
TgeneEGFRchr7:105516257chr7:55209978ENST00000420316010669_121029406.0Topological domainCytoplasmic
TgeneEGFRchr7:105516257chr7:55209978ENST00000275493028646_668291211.0TransmembraneHelical
TgeneEGFRchr7:105516257chr7:55209978ENST00000342916016646_66829629.0TransmembraneHelical
TgeneEGFRchr7:105516257chr7:55209978ENST00000344576016646_66829706.0TransmembraneHelical
TgeneEGFRchr7:105516257chr7:55209978ENST00000420316010646_66829406.0TransmembraneHelical

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneATXN7L1chr7:105516257chr7:55209978ENST00000318724-24364_41783147.0Compositional biasNote=Ser-rich
HgeneATXN7L1chr7:105516257chr7:55209978ENST00000318724-24536_84783147.0Compositional biasNote=Ser-rich
HgeneATXN7L1chr7:105516257chr7:55209978ENST00000419735-212364_41783862.0Compositional biasNote=Ser-rich
HgeneATXN7L1chr7:105516257chr7:55209978ENST00000419735-212536_84783862.0Compositional biasNote=Ser-rich
HgeneATXN7L1chr7:105516257chr7:55209978ENST00000477775-110364_4170739.0Compositional biasNote=Ser-rich
HgeneATXN7L1chr7:105516257chr7:55209978ENST00000477775-110536_8470739.0Compositional biasNote=Ser-rich
HgeneATXN7L1chr7:105516257chr7:55209978ENST00000318724-24284_35183147.0DomainSCA7
HgeneATXN7L1chr7:105516257chr7:55209978ENST00000419735-212284_35183862.0DomainSCA7
HgeneATXN7L1chr7:105516257chr7:55209978ENST00000477775-110284_3510739.0DomainSCA7
TgeneEGFRchr7:105516257chr7:55209978ENST0000027549302825_645291211.0Topological domainExtracellular
TgeneEGFRchr7:105516257chr7:55209978ENST0000034291601625_64529629.0Topological domainExtracellular
TgeneEGFRchr7:105516257chr7:55209978ENST0000034457601625_64529706.0Topological domainExtracellular
TgeneEGFRchr7:105516257chr7:55209978ENST0000042031601025_64529406.0Topological domainExtracellular


Top

Fusion Gene Sequence for ATXN7L1-EGFR


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>8454_8454_1_ATXN7L1-EGFR_ATXN7L1_chr7_105516257_ENST00000318724_EGFR_chr7_55209978_ENST00000275493_length(transcript)=9830nt_BP=274nt
ACTCAATGTCGGAACGTTCCGAAGATGACGTCGGAGCGTTCTCGAATCCCGTGTCTCTCGGCTGCTGCTGCCGAAGGAACAGGGAAAAAG
CAACAAGAAGGAAGAGCAATGGCGACACTGGATCGCAAAGTGCCCAGTCCGGAGGCGTTTCTGGGCAAACCCTGGTCCTCCTGGATCGAC
GCCGCCAAATTACACTGCTCCGACAATGTAGATTTAGAAGAGGCTGGAAAAGAGGGTGGAAAAAGCAGGGAGGTTATGAGGCTTAATAAA
GAAGTTTGCCAAGGCACGAGTAACAAGCTCACGCAGTTGGGCACTTTTGAAGATCATTTTCTCAGCCTCCAGAGGATGTTCAATAACTGT
GAGGTGGTCCTTGGGAATTTGGAAATTACCTATGTGCAGAGGAATTATGATCTTTCCTTCTTAAAGACCATCCAGGAGGTGGCTGGTTAT
GTCCTCATTGCCCTCAACACAGTGGAGCGAATTCCTTTGGAAAACCTGCAGATCATCAGAGGAAATATGTACTACGAAAATTCCTATGCC
TTAGCAGTCTTATCTAACTATGATGCAAATAAAACCGGACTGAAGGAGCTGCCCATGAGAAATTTACAGGAAATCCTGCATGGCGCCGTG
CGGTTCAGCAACAACCCTGCCCTGTGCAACGTGGAGAGCATCCAGTGGCGGGACATAGTCAGCAGTGACTTTCTCAGCAACATGTCGATG
GACTTCCAGAACCACCTGGGCAGCTGCCAAAAGTGTGATCCAAGCTGTCCCAATGGGAGCTGCTGGGGTGCAGGAGAGGAGAACTGCCAG
AAACTGACCAAAATCATCTGTGCCCAGCAGTGCTCCGGGCGCTGCCGTGGCAAGTCCCCCAGTGACTGCTGCCACAACCAGTGTGCTGCA
GGCTGCACAGGCCCCCGGGAGAGCGACTGCCTGGTCTGCCGCAAATTCCGAGACGAAGCCACGTGCAAGGACACCTGCCCCCCACTCATG
CTCTACAACCCCACCACGTACCAGATGGATGTGAACCCCGAGGGCAAATACAGCTTTGGTGCCACCTGCGTGAAGAAGTGTCCCCGTAAT
TATGTGGTGACAGATCACGGCTCGTGCGTCCGAGCCTGTGGGGCCGACAGCTATGAGATGGAGGAAGACGGCGTCCGCAAGTGTAAGAAG
TGCGAAGGGCCTTGCCGCAAAGTGTGTAACGGAATAGGTATTGGTGAATTTAAAGACTCACTCTCCATAAATGCTACGAATATTAAACAC
TTCAAAAACTGCACCTCCATCAGTGGCGATCTCCACATCCTGCCGGTGGCATTTAGGGGTGACTCCTTCACACATACTCCTCCTCTGGAT
CCACAGGAACTGGATATTCTGAAAACCGTAAAGGAAATCACAGGGTTTTTGCTGATTCAGGCTTGGCCTGAAAACAGGACGGACCTCCAT
GCCTTTGAGAACCTAGAAATCATACGCGGCAGGACCAAGCAACATGGTCAGTTTTCTCTTGCAGTCGTCAGCCTGAACATAACATCCTTG
GGATTACGCTCCCTCAAGGAGATAAGTGATGGAGATGTGATAATTTCAGGAAACAAAAATTTGTGCTATGCAAATACAATAAACTGGAAA
AAACTGTTTGGGACCTCCGGTCAGAAAACCAAAATTATAAGCAACAGAGGTGAAAACAGCTGCAAGGCCACAGGCCAGGTCTGCCATGCC
TTGTGCTCCCCCGAGGGCTGCTGGGGCCCGGAGCCCAGGGACTGCGTCTCTTGCCGGAATGTCAGCCGAGGCAGGGAATGCGTGGACAAG
TGCAACCTTCTGGAGGGTGAGCCAAGGGAGTTTGTGGAGAACTCTGAGTGCATACAGTGCCACCCAGAGTGCCTGCCTCAGGCCATGAAC
ATCACCTGCACAGGACGGGGACCAGACAACTGTATCCAGTGTGCCCACTACATTGACGGCCCCCACTGCGTCAAGACCTGCCCGGCAGGA
GTCATGGGAGAAAACAACACCCTGGTCTGGAAGTACGCAGACGCCGGCCATGTGTGCCACCTGTGCCATCCAAACTGCACCTACGGATGC
ACTGGGCCAGGTCTTGAAGGCTGTCCAACGAATGGGCCTAAGATCCCGTCCATCGCCACTGGGATGGTGGGGGCCCTCCTCTTGCTGCTG
GTGGTGGCCCTGGGGATCGGCCTCTTCATGCGAAGGCGCCACATCGTTCGGAAGCGCACGCTGCGGAGGCTGCTGCAGGAGAGGGAGCTT
GTGGAGCCTCTTACACCCAGTGGAGAAGCTCCCAACCAAGCTCTCTTGAGGATCTTGAAGGAAACTGAATTCAAAAAGATCAAAGTGCTG
GGCTCCGGTGCGTTCGGCACGGTGTATAAGGGACTCTGGATCCCAGAAGGTGAGAAAGTTAAAATTCCCGTCGCTATCAAGGAATTAAGA
GAAGCAACATCTCCGAAAGCCAACAAGGAAATCCTCGATGAAGCCTACGTGATGGCCAGCGTGGACAACCCCCACGTGTGCCGCCTGCTG
GGCATCTGCCTCACCTCCACCGTGCAGCTCATCACGCAGCTCATGCCCTTCGGCTGCCTCCTGGACTATGTCCGGGAACACAAAGACAAT
ATTGGCTCCCAGTACCTGCTCAACTGGTGTGTGCAGATCGCAAAGGGCATGAACTACTTGGAGGACCGTCGCTTGGTGCACCGCGACCTG
GCAGCCAGGAACGTACTGGTGAAAACACCGCAGCATGTCAAGATCACAGATTTTGGGCTGGCCAAACTGCTGGGTGCGGAAGAGAAAGAA
TACCATGCAGAAGGAGGCAAAGTGCCTATCAAGTGGATGGCATTGGAATCAATTTTACACAGAATCTATACCCACCAGAGTGATGTCTGG
AGCTACGGGGTGACTGTTTGGGAGTTGATGACCTTTGGATCCAAGCCATATGACGGAATCCCTGCCAGCGAGATCTCCTCCATCCTGGAG
AAAGGAGAACGCCTCCCTCAGCCACCCATATGTACCATCGATGTCTACATGATCATGGTCAAGTGCTGGATGATAGACGCAGATAGTCGC
CCAAAGTTCCGTGAGTTGATCATCGAATTCTCCAAAATGGCCCGAGACCCCCAGCGCTACCTTGTCATTCAGGGGGATGAAAGAATGCAT
TTGCCAAGTCCTACAGACTCCAACTTCTACCGTGCCCTGATGGATGAAGAAGACATGGACGACGTGGTGGATGCCGACGAGTACCTCATC
CCACAGCAGGGCTTCTTCAGCAGCCCCTCCACGTCACGGACTCCCCTCCTGAGCTCTCTGAGTGCAACCAGCAACAATTCCACCGTGGCT
TGCATTGATAGAAATGGGCTGCAAAGCTGTCCCATCAAGGAAGACAGCTTCTTGCAGCGATACAGCTCAGACCCCACAGGCGCCTTGACT
GAGGACAGCATAGACGACACCTTCCTCCCAGTGCCTGAATACATAAACCAGTCCGTTCCCAAAAGGCCCGCTGGCTCTGTGCAGAATCCT
GTCTATCACAATCAGCCTCTGAACCCCGCGCCCAGCAGAGACCCACACTACCAGGACCCCCACAGCACTGCAGTGGGCAACCCCGAGTAT
CTCAACACTGTCCAGCCCACCTGTGTCAACAGCACATTCGACAGCCCTGCCCACTGGGCCCAGAAAGGCAGCCACCAAATTAGCCTGGAC
AACCCTGACTACCAGCAGGACTTCTTTCCCAAGGAAGCCAAGCCAAATGGCATCTTTAAGGGCTCCACAGCTGAAAATGCAGAATACCTA
AGGGTCGCGCCACAAAGCAGTGAATTTATTGGAGCATGACCACGGAGGATAGTATGAGCCCTAAAAATCCAGACTCTTTCGATACCCAGG
ACCAAGCCACAGCAGGTCCTCCATCCCAACAGCCATGCCCGCATTAGCTCTTAGACCCACAGACTGGTTTTGCAACGTTTACACCGACTA
GCCAGGAAGTACTTCCACCTCGGGCACATTTTGGGAAGTTGCATTCCTTTGTCTTCAAACTGTGAAGCATTTACAGAAACGCATCCAGCA
AGAATATTGTCCCTTTGAGCAGAAATTTATCTTTCAAAGAGGTATATTTGAAAAAAAAAAAAAGTATATGTGAGGATTTTTATTGATTGG
GGATCTTGGAGTTTTTCATTGTCGCTATTGATTTTTACTTCAATGGGCTCTTCCAACAAGGAAGAAGCTTGCTGGTAGCACTTGCTACCC
TGAGTTCATCCAGGCCCAACTGTGAGCAAGGAGCACAAGCCACAAGTCTTCCAGAGGATGCTTGATTCCAGTGGTTCTGCTTCAAGGCTT
CCACTGCAAAACACTAAAGATCCAAGAAGGCCTTCATGGCCCCAGCAGGCCGGATCGGTACTGTATCAAGTCATGGCAGGTACAGTAGGA
TAAGCCACTCTGTCCCTTCCTGGGCAAAGAAGAAACGGAGGGGATGGAATTCTTCCTTAGACTTACTTTTGTAAAAATGTCCCCACGGTA
CTTACTCCCCACTGATGGACCAGTGGTTTCCAGTCATGAGCGTTAGACTGACTTGTTTGTCTTCCATTCCATTGTTTTGAAACTCAGTAT
GCTGCCCCTGTCTTGCTGTCATGAAATCAGCAAGAGAGGATGACACATCAAATAATAACTCGGATTCCAGCCCACATTGGATTCATCAGC
ATTTGGACCAATAGCCCACAGCTGAGAATGTGGAATACCTAAGGATAGCACCGCTTTTGTTCTCGCAAAAACGTATCTCCTAATTTGAGG
CTCAGATGAAATGCATCAGGTCCTTTGGGGCATAGATCAGAAGACTACAAAAATGAAGCTGCTCTGAAATCTCCTTTAGCCATCACCCCA
ACCCCCCAAAATTAGTTTGTGTTACTTATGGAAGATAGTTTTCTCCTTTTACTTCACTTCAAAAGCTTTTTACTCAAAGAGTATATGTTC
CCTCCAGGTCAGCTGCCCCCAAACCCCCTCCTTACGCTTTGTCACACAAAAAGTGTCTCTGCCTTGAGTCATCTATTCAAGCACTTACAG
CTCTGGCCACAACAGGGCATTTTACAGGTGCGAATGACAGTAGCATTATGAGTAGTGTGGAATTCAGGTAGTAAATATGAAACTAGGGTT
TGAAATTGATAATGCTTTCACAACATTTGCAGATGTTTTAGAAGGAAAAAAGTTCCTTCCTAAAATAATTTCTCTACAATTGGAAGATTG
GAAGATTCAGCTAGTTAGGAGCCCACCTTTTTTCCTAATCTGTGTGTGCCCTGTAACCTGACTGGTTAACAGCAGTCCTTTGTAAACAGT
GTTTTAAACTCTCCTAGTCAATATCCACCCCATCCAATTTATCAAGGAAGAAATGGTTCAGAAAATATTTTCAGCCTACAGTTATGTTCA
GTCACACACACATACAAAATGTTCCTTTTGCTTTTAAAGTAATTTTTGACTCCCAGATCAGTCAGAGCCCCTACAGCATTGTTAAGAAAG
TATTTGATTTTTGTCTCAATGAAAATAAAACTATATTCATTTCCACTCTATTATGCTCTCAAATACCCCTAAGCATCTATACTAGCCTGG
TATGGGTATGAAAGATACAAAGATAAATAAAACATAGTCCCTGATTCTAAGAAATTCACAATTTAGCAAAGGAAATGGACTCATAGATGC
TAACCTTAAAACAACGTGACAAATGCCAGACAGGACCCATCAGCCAGGCACTGTGAGAGCACAGAGCAGGGAGGTTGGGTCCTGCCTGAG
GAGACCTGGAAGGGAGGCCTCACAGGAGGATGACCAGGTCTCAGTCAGCGGGGAGGTGGAAAGTGCAGGTGCATCAGGGGCACCCTGACC
GAGGAAACAGCTGCCAGAGGCCTCCACTGCTAAAGTCCACATAAGGCTGAGGTCAGTCACCCTAAACAACCTGCTCCCTCTAAGCCAGGG
GATGAGCTTGGAGCATCCCACAAGTTCCCTAAAAGTTGCAGCCCCCAGGGGGATTTTGAGCTATCATCTCTGCACATGCTTAGTGAGAAG
ACTACACAACATTTCTAAGAATCTGAGATTTTATATTGTCAGTTAACCACTTTCATTATTCATTCACCTCAGGACATGCAGAAATATTTC
AGTCAGAACTGGGAAACAGAAGGACCTACATTCTGCTGTCACTTATGTGTCAAGAAGCAGATGATCGATGAGGCAGGTCAGTTGTAAGTG
AGTCACATTGTAGCATTAAATTCTAGTATTTTTGTAGTTTGAAACAGTAACTTAATAAAAGAGCAAAAGCTATTCTAGCTTTCTTCTTCA
TATTTTAATTTTCCACCATAAAGTTTAGTTGCTAAATTCTATTAATTTTAAGATTGTGCTTCCCAAAATAGTTCTCACTTCATCTGTCCA
GGGAGGCACAGTTCTGTCTGGTAGAAGCCGCAAAGCCCTTAGCCTCTTCACGGATCTGGCGACTGTGATGGGCAGGTCAGGAGAGGAGCT
GCCCAAAGTCCCATGATTTTCACCTAACAGCCCTGATCAGTCAGTACTCAAAGCTTGGACTCCATCCCTGAAGGTCTTCCTGATTGATAG
CCTGGCCTTAATACCCTACAGAAAGCCTGTCCATTGGCTGTTTCTTCCTCAGTCAGTTCCTGGAAGACCTTACCCCATGACCCCAGCTTC
AGATGTGGTCTTTGGAAACAGAGGTCGAAGGAAAGTAAGGAGCTGAGAGCTCACATTCATAGGTGCCGCCAGCCTTCGTGCATCTTCTTG
CATCATCTCTAAGGAGCTCCTCTAATTACACCATGCCCGTCACCCCATGAGGGATCAGAGAAGGGATGAGTCTTCTAAACTCTATATTCG
CTGTGAGTCCAGGTTGTAAGGGGGAGCACTGTGGATGCATCCTATTGCACTCCAGCTGATGACACCAAAGCTTAGGTGTTTGCTGAAAGT
TCTTGATGTTGTGACTTACCACCCCTGCCTCACAACTGCAGACATAAGGGGACTATGGATTGCTTAGCAGGAAAGGCACTGGTTCTCAAG
GGCGGCTGCCCTTGGGAATCTTCTGGTCCCAACCAGAAAGACTGTGGCTTGATTTTCTCAGGTGCAGCCCAGCCGTAGGGCCTTTTCAGA
GCACCCCCTGGTTATTGCAACATTCATCAAAGTTTCTAGAACCTCTGGCCTAAAGGAAGGGCCTGGTGGGATCTACTTGGCACTCGCTGG
GGGGCCACCCCCCAGTGCCACTCTCACTAGGCCTCTGATTGCACTTGTGTAGGATGAAGCTGGTGGGTGATGGGAACTCAGCACCTCCCC
TCAGGCAGAAAAGAATCATCTGTGGAGCTTCAAAAGAAGGGGCCTGGAGTCTCTGCAGACCAATTCAACCCAAATCTCGGGGGCTCTTTC
ATGATTCTAATGGGCAACCAGGGTTGAAACCCTTATTTCTAGGGTCTTCAGTTGTACAAGACTGTGGGTCTGTACCAGAGCCCCCGTCAG
AGTAGAATAAAAGGCTGGGTAGGGTAGAGATTCCCATGTGCAGTGGAGAGAACAATCTGCAGTCACTGATAAGCCTGAGACTTGGCTCAT
TTCAAAAGCGTTCAATTCATCCTCACCAGCAGTTCAGCTGGAAAGGGGCAAATACCCCCACCTGAGCTTTGAAAACGCCCTGGGACCCTC
TGCATTCTCTAAGTAAGTTATAGAAACCAGTCTCTTCCCTCCTTTGTGAGTGAGCTGCTATTCCACGTAGGCAACACCTGTTGAAATTGC
CCTCAATGTCTACTCTGCATTTCTTTCTTGTGATAAGCACACACTTTTATTGCAACATAATGATCTGCTCACATTTCCTTGCCTGGGGGC
TGTAAAACCTTACAGAACAGAAATCCTTGCCTCTTTCACCAGCCACACCTGCCATACCAGGGGTACAGCTTTGTACTATTGAAGACACAG
ACAGGATTTTTAAATGTAAATCTATTTTTGTAACTTTGTTGCGGGATATAGTTCTCTTTATGTAGCACTGAACTTTGTACAATATATTTT
TAGAAACTCATTTTTCTACTAAAACAAACACAGTTTACTTTAGAGAGACTGCAATAGAATCAAAATTTGAAACTGAAATCTTTGTTTAAA
AGGGTTAAGTTGAGGCAAGAGGAAAGCCCTTTCTCTCTCTTATAAAAAGGCACAACCTCATTGGGGAGCTAAGCTAGGTCATTGTCATGG
TGAAGAAGAGAAGCATCGTTTTTATATTTAGGAAATTTTAAAAGATGATGGAAAGCACATTTAGCTTGGTCTGAGGCAGGTTCTGTTGGG
GCAGTGTTAATGGAAAGGGCTCACTGTTGTTACTACTAGAAAAATCCAGTTGCATGCCATACTCTCATCATCTGCCAGTGTAACCCTGTA
CATGTAAGAAAAGCAATAACATAGCACTTTGTTGGTTTATATATATAATGTGACTTCAATGCAAATTTTATTTTTATATTTACAATTGAT
ATGCATTTACCAGTATAAACTAGACATGTCTGGAGAGCCTAATAATGTTCAGCACACTTTGGTTAGTTCACCAACAGTCTTACCAAGCCT
GGGCCCAGCCACCCTAGAGAAGTTATTCAGCCCTGGCTGCAGTGACATCACCTGAGGAGCTTTTAAAAGCTTGAAGCCCAGCTACACCTC
AGACCGATTAAACGCAAATCTCTGGGGCTGAAACCCAAGCATTCGTAGTTTTTAAAGCTCCTGAGGTCATTCCAATGTGCGGCCAAAGTT
GAGAACTACTGGCCTAGGGATTAGCCACAAGGACATGGACTTGGAGGCAAATTCTGCAGGTGTATGTGATTCTCAGGCCTAGAGAGCTAA
GACACAAAGACCTCCACATCTGTCGCTGAGAGTCAAGAACCTGAACAGAGTTTCCATGAAGGTTCTCCAAGCACTAGAAGGGAGAGTGTC
TAAACAATGGTTGAAAAGCAAAGGAAATATAAAACAGACACCTCTTTCCATTTCCTAAGGTTTCTCTCTTTATTAAGGGTGGACTAGTAA
TAAAATATAATATTCTTGCTGCTTATGCAGCTGACATTGTTGCCCTCCCTAAAGCAACCAAGTAGCCTTTATTTCCCACAGTGAAAGAAA
ACGCTGGCCTATCAGTTACATTACAAAAGGCAGATTTCAAGAGGATTGAGTAAGTAGTTGGATGGCTTTCATAAAAACAAGAATTCAAGA
AGAGGATTCATGCTTTAAGAAACATTTGTTATACATTCCTCACAAATTATACCTGGGATAAAAACTATGTAGCAGGCAGTGTGTTTTCCT
TCCATGTCTCTCTGCACTACCTGCAGTGTGTCCTCTGAGGCTGCAAGTCTGTCCTATCTGAATTCCCAGCAGAAGCACTAAGAAGCTCCA
CCCTATCACCTAGCAGATAAAACTATGGGGAAAACTTAAATCTGTGCATACATTTCTGGATGCATTTACTTATCTTTAAAAAAAAAGGAA
TCCTATGACCTGATTTGGCCACAAAAATAATCTTGCTGTACAATACAATCTCTTGGAAATTAAGAGATCCTATGGATTTGATGACTGGTA
TTAGAGGTGACAATGTAACCGATTAACAACAGACAGCAATAACTTCGTTTTAGAAACATTCAAGCAATAGCTTTATAGCTTCAACATATG
GTACGTTTTAACCTTGAAAGTTTTGCAATGATGAAAGCAGTATTTGTACAAATGAAAAGCAGAATTCTCTTTTATATGGTTTATACTGTT
GATCAGAAATGTTGATTGTGCATTGAGTATTAAAAAATTAGATGTATATTATTCATTGTTCTTTACTCCTGAGTACCTTATAATAATAAT

>8454_8454_1_ATXN7L1-EGFR_ATXN7L1_chr7_105516257_ENST00000318724_EGFR_chr7_55209978_ENST00000275493_length(amino acids)=1264AA_BP=83
MTSERSRIPCLSAAAAEGTGKKQQEGRAMATLDRKVPSPEAFLGKPWSSWIDAAKLHCSDNVDLEEAGKEGGKSREVMRLNKEVCQGTSN
KLTQLGTFEDHFLSLQRMFNNCEVVLGNLEITYVQRNYDLSFLKTIQEVAGYVLIALNTVERIPLENLQIIRGNMYYENSYALAVLSNYD
ANKTGLKELPMRNLQEILHGAVRFSNNPALCNVESIQWRDIVSSDFLSNMSMDFQNHLGSCQKCDPSCPNGSCWGAGEENCQKLTKIICA
QQCSGRCRGKSPSDCCHNQCAAGCTGPRESDCLVCRKFRDEATCKDTCPPLMLYNPTTYQMDVNPEGKYSFGATCVKKCPRNYVVTDHGS
CVRACGADSYEMEEDGVRKCKKCEGPCRKVCNGIGIGEFKDSLSINATNIKHFKNCTSISGDLHILPVAFRGDSFTHTPPLDPQELDILK
TVKEITGFLLIQAWPENRTDLHAFENLEIIRGRTKQHGQFSLAVVSLNITSLGLRSLKEISDGDVIISGNKNLCYANTINWKKLFGTSGQ
KTKIISNRGENSCKATGQVCHALCSPEGCWGPEPRDCVSCRNVSRGRECVDKCNLLEGEPREFVENSECIQCHPECLPQAMNITCTGRGP
DNCIQCAHYIDGPHCVKTCPAGVMGENNTLVWKYADAGHVCHLCHPNCTYGCTGPGLEGCPTNGPKIPSIATGMVGALLLLLVVALGIGL
FMRRRHIVRKRTLRRLLQERELVEPLTPSGEAPNQALLRILKETEFKKIKVLGSGAFGTVYKGLWIPEGEKVKIPVAIKELREATSPKAN
KEILDEAYVMASVDNPHVCRLLGICLTSTVQLITQLMPFGCLLDYVREHKDNIGSQYLLNWCVQIAKGMNYLEDRRLVHRDLAARNVLVK
TPQHVKITDFGLAKLLGAEEKEYHAEGGKVPIKWMALESILHRIYTHQSDVWSYGVTVWELMTFGSKPYDGIPASEISSILEKGERLPQP
PICTIDVYMIMVKCWMIDADSRPKFRELIIEFSKMARDPQRYLVIQGDERMHLPSPTDSNFYRALMDEEDMDDVVDADEYLIPQQGFFSS
PSTSRTPLLSSLSATSNNSTVACIDRNGLQSCPIKEDSFLQRYSSDPTGALTEDSIDDTFLPVPEYINQSVPKRPAGSVQNPVYHNQPLN
PAPSRDPHYQDPHSTAVGNPEYLNTVQPTCVNSTFDSPAHWAQKGSHQISLDNPDYQQDFFPKEAKPNGIFKGSTAENAEYLRVAPQSSE

--------------------------------------------------------------
>8454_8454_2_ATXN7L1-EGFR_ATXN7L1_chr7_105516257_ENST00000318724_EGFR_chr7_55209978_ENST00000342916_length(transcript)=2179nt_BP=274nt
ACTCAATGTCGGAACGTTCCGAAGATGACGTCGGAGCGTTCTCGAATCCCGTGTCTCTCGGCTGCTGCTGCCGAAGGAACAGGGAAAAAG
CAACAAGAAGGAAGAGCAATGGCGACACTGGATCGCAAAGTGCCCAGTCCGGAGGCGTTTCTGGGCAAACCCTGGTCCTCCTGGATCGAC
GCCGCCAAATTACACTGCTCCGACAATGTAGATTTAGAAGAGGCTGGAAAAGAGGGTGGAAAAAGCAGGGAGGTTATGAGGCTTAATAAA
GAAGTTTGCCAAGGCACGAGTAACAAGCTCACGCAGTTGGGCACTTTTGAAGATCATTTTCTCAGCCTCCAGAGGATGTTCAATAACTGT
GAGGTGGTCCTTGGGAATTTGGAAATTACCTATGTGCAGAGGAATTATGATCTTTCCTTCTTAAAGACCATCCAGGAGGTGGCTGGTTAT
GTCCTCATTGCCCTCAACACAGTGGAGCGAATTCCTTTGGAAAACCTGCAGATCATCAGAGGAAATATGTACTACGAAAATTCCTATGCC
TTAGCAGTCTTATCTAACTATGATGCAAATAAAACCGGACTGAAGGAGCTGCCCATGAGAAATTTACAGGAAATCCTGCATGGCGCCGTG
CGGTTCAGCAACAACCCTGCCCTGTGCAACGTGGAGAGCATCCAGTGGCGGGACATAGTCAGCAGTGACTTTCTCAGCAACATGTCGATG
GACTTCCAGAACCACCTGGGCAGCTGCCAAAAGTGTGATCCAAGCTGTCCCAATGGGAGCTGCTGGGGTGCAGGAGAGGAGAACTGCCAG
AAACTGACCAAAATCATCTGTGCCCAGCAGTGCTCCGGGCGCTGCCGTGGCAAGTCCCCCAGTGACTGCTGCCACAACCAGTGTGCTGCA
GGCTGCACAGGCCCCCGGGAGAGCGACTGCCTGGTCTGCCGCAAATTCCGAGACGAAGCCACGTGCAAGGACACCTGCCCCCCACTCATG
CTCTACAACCCCACCACGTACCAGATGGATGTGAACCCCGAGGGCAAATACAGCTTTGGTGCCACCTGCGTGAAGAAGTGTCCCCGTAAT
TATGTGGTGACAGATCACGGCTCGTGCGTCCGAGCCTGTGGGGCCGACAGCTATGAGATGGAGGAAGACGGCGTCCGCAAGTGTAAGAAG
TGCGAAGGGCCTTGCCGCAAAGTGTGTAACGGAATAGGTATTGGTGAATTTAAAGACTCACTCTCCATAAATGCTACGAATATTAAACAC
TTCAAAAACTGCACCTCCATCAGTGGCGATCTCCACATCCTGCCGGTGGCATTTAGGGGTGACTCCTTCACACATACTCCTCCTCTGGAT
CCACAGGAACTGGATATTCTGAAAACCGTAAAGGAAATCACAGGGTTTTTGCTGATTCAGGCTTGGCCTGAAAACAGGACGGACCTCCAT
GCCTTTGAGAACCTAGAAATCATACGCGGCAGGACCAAGCAACATGGTCAGTTTTCTCTTGCAGTCGTCAGCCTGAACATAACATCCTTG
GGATTACGCTCCCTCAAGGAGATAAGTGATGGAGATGTGATAATTTCAGGAAACAAAAATTTGTGCTATGCAAATACAATAAACTGGAAA
AAACTGTTTGGGACCTCCGGTCAGAAAACCAAAATTATAAGCAACAGAGGTGAAAACAGCTGCAAGGCCACAGGCCAGGTCTGCCATGCC
TTGTGCTCCCCCGAGGGCTGCTGGGGCCCGGAGCCCAGGGACTGCGTCTCTTGCCGGAATGTCAGCCGAGGCAGGGAATGCGTGGACAAG
TGCAACCTTCTGGAGGGTGAGCCAAGGGAGTTTGTGGAGAACTCTGAGTGCATACAGTGCCACCCAGAGTGCCTGCCTCAGGCCATGAAC
ATCACCTGCACAGGACGGGGACCAGACAACTGTATCCAGTGTGCCCACTACATTGACGGCCCCCACTGCGTCAAGACCTGCCCGGCAGGA
GTCATGGGAGAAAACAACACCCTGGTCTGGAAGTACGCAGACGCCGGCCATGTGTGCCACCTGTGCCATCCAAACTGCACCTACGGGTCC
TAATAAATCTTCACTGTCTGACTTTAGTCTCCCACTAAAACTGCATTTCCTTTCTACAATTTCAATTTCTCCCTTTGCTTCAAATAAAGT

>8454_8454_2_ATXN7L1-EGFR_ATXN7L1_chr7_105516257_ENST00000318724_EGFR_chr7_55209978_ENST00000342916_length(amino acids)=682AA_BP=83
MTSERSRIPCLSAAAAEGTGKKQQEGRAMATLDRKVPSPEAFLGKPWSSWIDAAKLHCSDNVDLEEAGKEGGKSREVMRLNKEVCQGTSN
KLTQLGTFEDHFLSLQRMFNNCEVVLGNLEITYVQRNYDLSFLKTIQEVAGYVLIALNTVERIPLENLQIIRGNMYYENSYALAVLSNYD
ANKTGLKELPMRNLQEILHGAVRFSNNPALCNVESIQWRDIVSSDFLSNMSMDFQNHLGSCQKCDPSCPNGSCWGAGEENCQKLTKIICA
QQCSGRCRGKSPSDCCHNQCAAGCTGPRESDCLVCRKFRDEATCKDTCPPLMLYNPTTYQMDVNPEGKYSFGATCVKKCPRNYVVTDHGS
CVRACGADSYEMEEDGVRKCKKCEGPCRKVCNGIGIGEFKDSLSINATNIKHFKNCTSISGDLHILPVAFRGDSFTHTPPLDPQELDILK
TVKEITGFLLIQAWPENRTDLHAFENLEIIRGRTKQHGQFSLAVVSLNITSLGLRSLKEISDGDVIISGNKNLCYANTINWKKLFGTSGQ
KTKIISNRGENSCKATGQVCHALCSPEGCWGPEPRDCVSCRNVSRGRECVDKCNLLEGEPREFVENSECIQCHPECLPQAMNITCTGRGP

--------------------------------------------------------------
>8454_8454_3_ATXN7L1-EGFR_ATXN7L1_chr7_105516257_ENST00000318724_EGFR_chr7_55209978_ENST00000344576_length(transcript)=2805nt_BP=274nt
ACTCAATGTCGGAACGTTCCGAAGATGACGTCGGAGCGTTCTCGAATCCCGTGTCTCTCGGCTGCTGCTGCCGAAGGAACAGGGAAAAAG
CAACAAGAAGGAAGAGCAATGGCGACACTGGATCGCAAAGTGCCCAGTCCGGAGGCGTTTCTGGGCAAACCCTGGTCCTCCTGGATCGAC
GCCGCCAAATTACACTGCTCCGACAATGTAGATTTAGAAGAGGCTGGAAAAGAGGGTGGAAAAAGCAGGGAGGTTATGAGGCTTAATAAA
GAAGTTTGCCAAGGCACGAGTAACAAGCTCACGCAGTTGGGCACTTTTGAAGATCATTTTCTCAGCCTCCAGAGGATGTTCAATAACTGT
GAGGTGGTCCTTGGGAATTTGGAAATTACCTATGTGCAGAGGAATTATGATCTTTCCTTCTTAAAGACCATCCAGGAGGTGGCTGGTTAT
GTCCTCATTGCCCTCAACACAGTGGAGCGAATTCCTTTGGAAAACCTGCAGATCATCAGAGGAAATATGTACTACGAAAATTCCTATGCC
TTAGCAGTCTTATCTAACTATGATGCAAATAAAACCGGACTGAAGGAGCTGCCCATGAGAAATTTACAGGAAATCCTGCATGGCGCCGTG
CGGTTCAGCAACAACCCTGCCCTGTGCAACGTGGAGAGCATCCAGTGGCGGGACATAGTCAGCAGTGACTTTCTCAGCAACATGTCGATG
GACTTCCAGAACCACCTGGGCAGCTGCCAAAAGTGTGATCCAAGCTGTCCCAATGGGAGCTGCTGGGGTGCAGGAGAGGAGAACTGCCAG
AAACTGACCAAAATCATCTGTGCCCAGCAGTGCTCCGGGCGCTGCCGTGGCAAGTCCCCCAGTGACTGCTGCCACAACCAGTGTGCTGCA
GGCTGCACAGGCCCCCGGGAGAGCGACTGCCTGGTCTGCCGCAAATTCCGAGACGAAGCCACGTGCAAGGACACCTGCCCCCCACTCATG
CTCTACAACCCCACCACGTACCAGATGGATGTGAACCCCGAGGGCAAATACAGCTTTGGTGCCACCTGCGTGAAGAAGTGTCCCCGTAAT
TATGTGGTGACAGATCACGGCTCGTGCGTCCGAGCCTGTGGGGCCGACAGCTATGAGATGGAGGAAGACGGCGTCCGCAAGTGTAAGAAG
TGCGAAGGGCCTTGCCGCAAAGTGTGTAACGGAATAGGTATTGGTGAATTTAAAGACTCACTCTCCATAAATGCTACGAATATTAAACAC
TTCAAAAACTGCACCTCCATCAGTGGCGATCTCCACATCCTGCCGGTGGCATTTAGGGGTGACTCCTTCACACATACTCCTCCTCTGGAT
CCACAGGAACTGGATATTCTGAAAACCGTAAAGGAAATCACAGGGTTTTTGCTGATTCAGGCTTGGCCTGAAAACAGGACGGACCTCCAT
GCCTTTGAGAACCTAGAAATCATACGCGGCAGGACCAAGCAACATGGTCAGTTTTCTCTTGCAGTCGTCAGCCTGAACATAACATCCTTG
GGATTACGCTCCCTCAAGGAGATAAGTGATGGAGATGTGATAATTTCAGGAAACAAAAATTTGTGCTATGCAAATACAATAAACTGGAAA
AAACTGTTTGGGACCTCCGGTCAGAAAACCAAAATTATAAGCAACAGAGGTGAAAACAGCTGCAAGGCCACAGGCCAGGTCTGCCATGCC
TTGTGCTCCCCCGAGGGCTGCTGGGGCCCGGAGCCCAGGGACTGCGTCTCTTGCCGGAATGTCAGCCGAGGCAGGGAATGCGTGGACAAG
TGCAACCTTCTGGAGGGTGAGCCAAGGGAGTTTGTGGAGAACTCTGAGTGCATACAGTGCCACCCAGAGTGCCTGCCTCAGGCCATGAAC
ATCACCTGCACAGGACGGGGACCAGACAACTGTATCCAGTGTGCCCACTACATTGACGGCCCCCACTGCGTCAAGACCTGCCCGGCAGGA
GTCATGGGAGAAAACAACACCCTGGTCTGGAAGTACGCAGACGCCGGCCATGTGTGCCACCTGTGCCATCCAAACTGCACCTACGGGCCA
GGAAATGAGAGTCTCAAAGCCATGTTATTCTGCCTTTTTAAACTATCATCCTGTAATCAAAGTAATGATGGCAGCGTGTCCCACCAGAGC
GGGAGCCCAGCTGCTCAGGAGTCATGCTTAGGATGGATCCCTTCTCTTCTGCCGTCAGAGTTTCAGCTGGGTTGGGGTGGATGCAGCCAC
CTCCATGCCTGGCCTTCTGCATCTGTGATCATCACGGCCTCCTCCTGCCACTGAGCCTCATGCCTTCACGTGTCTGTTCCCCCCGCTTTT
CCTTTCTGCCACCCCTGCACGTGGGCCGCCAGGTTCCCAAGAGTATCCTACCCATTTCCTTCCTTCCACTCCCTTTGCCAGTGCCTCTCA
CCCCAACTAGTAGCTAACCATCACCCCCAGGACTGACCTCTTCCTCCTCGCTGCCAGATGATTGTTCAAAGCACAGAATTTGTCAGAAAC
CTGCAGGGACTCCATGCTGCCAGCCTTCTCCGTAATTAGCATGGCCCCAGTCCATGCTTCTAGCCTTGGTTCCTTCTGCCCCTCTGTTTG
AAATTCTAGAGCCAGCTGTGGGACAATTATCTGTGTCAAAAGCCAGATGTGAAAACATCTCAATAACAAACTGGCTGCTTTGTTCAATGC
TAGAACAACGCCTGTCACAGAGTAGAAACTCAAAAATATTTGCTGAGTGAATGAACAAATGAATAAATGCATAATAAATAATTAACCACC

>8454_8454_3_ATXN7L1-EGFR_ATXN7L1_chr7_105516257_ENST00000318724_EGFR_chr7_55209978_ENST00000344576_length(amino acids)=759AA_BP=83
MTSERSRIPCLSAAAAEGTGKKQQEGRAMATLDRKVPSPEAFLGKPWSSWIDAAKLHCSDNVDLEEAGKEGGKSREVMRLNKEVCQGTSN
KLTQLGTFEDHFLSLQRMFNNCEVVLGNLEITYVQRNYDLSFLKTIQEVAGYVLIALNTVERIPLENLQIIRGNMYYENSYALAVLSNYD
ANKTGLKELPMRNLQEILHGAVRFSNNPALCNVESIQWRDIVSSDFLSNMSMDFQNHLGSCQKCDPSCPNGSCWGAGEENCQKLTKIICA
QQCSGRCRGKSPSDCCHNQCAAGCTGPRESDCLVCRKFRDEATCKDTCPPLMLYNPTTYQMDVNPEGKYSFGATCVKKCPRNYVVTDHGS
CVRACGADSYEMEEDGVRKCKKCEGPCRKVCNGIGIGEFKDSLSINATNIKHFKNCTSISGDLHILPVAFRGDSFTHTPPLDPQELDILK
TVKEITGFLLIQAWPENRTDLHAFENLEIIRGRTKQHGQFSLAVVSLNITSLGLRSLKEISDGDVIISGNKNLCYANTINWKKLFGTSGQ
KTKIISNRGENSCKATGQVCHALCSPEGCWGPEPRDCVSCRNVSRGRECVDKCNLLEGEPREFVENSECIQCHPECLPQAMNITCTGRGP
DNCIQCAHYIDGPHCVKTCPAGVMGENNTLVWKYADAGHVCHLCHPNCTYGPGNESLKAMLFCLFKLSSCNQSNDGSVSHQSGSPAAQES

--------------------------------------------------------------
>8454_8454_4_ATXN7L1-EGFR_ATXN7L1_chr7_105516257_ENST00000318724_EGFR_chr7_55209978_ENST00000420316_length(transcript)=1512nt_BP=274nt
ACTCAATGTCGGAACGTTCCGAAGATGACGTCGGAGCGTTCTCGAATCCCGTGTCTCTCGGCTGCTGCTGCCGAAGGAACAGGGAAAAAG
CAACAAGAAGGAAGAGCAATGGCGACACTGGATCGCAAAGTGCCCAGTCCGGAGGCGTTTCTGGGCAAACCCTGGTCCTCCTGGATCGAC
GCCGCCAAATTACACTGCTCCGACAATGTAGATTTAGAAGAGGCTGGAAAAGAGGGTGGAAAAAGCAGGGAGGTTATGAGGCTTAATAAA
GAAGTTTGCCAAGGCACGAGTAACAAGCTCACGCAGTTGGGCACTTTTGAAGATCATTTTCTCAGCCTCCAGAGGATGTTCAATAACTGT
GAGGTGGTCCTTGGGAATTTGGAAATTACCTATGTGCAGAGGAATTATGATCTTTCCTTCTTAAAGACCATCCAGGAGGTGGCTGGTTAT
GTCCTCATTGCCCTCAACACAGTGGAGCGAATTCCTTTGGAAAACCTGCAGATCATCAGAGGAAATATGTACTACGAAAATTCCTATGCC
TTAGCAGTCTTATCTAACTATGATGCAAATAAAACCGGACTGAAGGAGCTGCCCATGAGAAATTTACAGGAAATCCTGCATGGCGCCGTG
CGGTTCAGCAACAACCCTGCCCTGTGCAACGTGGAGAGCATCCAGTGGCGGGACATAGTCAGCAGTGACTTTCTCAGCAACATGTCGATG
GACTTCCAGAACCACCTGGGCAGCTGCCAAAAGTGTGATCCAAGCTGTCCCAATGGGAGCTGCTGGGGTGCAGGAGAGGAGAACTGCCAG
AAACTGACCAAAATCATCTGTGCCCAGCAGTGCTCCGGGCGCTGCCGTGGCAAGTCCCCCAGTGACTGCTGCCACAACCAGTGTGCTGCA
GGCTGCACAGGCCCCCGGGAGAGCGACTGCCTGGTCTGCCGCAAATTCCGAGACGAAGCCACGTGCAAGGACACCTGCCCCCCACTCATG
CTCTACAACCCCACCACGTACCAGATGGATGTGAACCCCGAGGGCAAATACAGCTTTGGTGCCACCTGCGTGAAGAAGTGTCCCCGTAAT
TATGTGGTGACAGATCACGGCTCGTGCGTCCGAGCCTGTGGGGCCGACAGCTATGAGATGGAGGAAGACGGCGTCCGCAAGTGTAAGAAG
TGCGAAGGGCCTTGCCGCAAAGTGTGTAACGGAATAGGTATTGGTGAATTTAAAGACTCACTCTCCATAAATGCTACGAATATTAAACAC
TTCAAAAACTGCACCTCCATCAGTGGCGATCTCCACATCCTGCCGGTGGCATTTAGGGGTGACTCCTTCACACATACTCCTCCTCTGGAT
CCACAGGAACTGGATATTCTGAAAACCGTAAAGGAAATCACAGGTTTGAGCTGAATTATCACATGAATATAAATGGGAAATCAGTGTTTT

>8454_8454_4_ATXN7L1-EGFR_ATXN7L1_chr7_105516257_ENST00000318724_EGFR_chr7_55209978_ENST00000420316_length(amino acids)=459AA_BP=83
MTSERSRIPCLSAAAAEGTGKKQQEGRAMATLDRKVPSPEAFLGKPWSSWIDAAKLHCSDNVDLEEAGKEGGKSREVMRLNKEVCQGTSN
KLTQLGTFEDHFLSLQRMFNNCEVVLGNLEITYVQRNYDLSFLKTIQEVAGYVLIALNTVERIPLENLQIIRGNMYYENSYALAVLSNYD
ANKTGLKELPMRNLQEILHGAVRFSNNPALCNVESIQWRDIVSSDFLSNMSMDFQNHLGSCQKCDPSCPNGSCWGAGEENCQKLTKIICA
QQCSGRCRGKSPSDCCHNQCAAGCTGPRESDCLVCRKFRDEATCKDTCPPLMLYNPTTYQMDVNPEGKYSFGATCVKKCPRNYVVTDHGS
CVRACGADSYEMEEDGVRKCKKCEGPCRKVCNGIGIGEFKDSLSINATNIKHFKNCTSISGDLHILPVAFRGDSFTHTPPLDPQELDILK

--------------------------------------------------------------
>8454_8454_5_ATXN7L1-EGFR_ATXN7L1_chr7_105516257_ENST00000318724_EGFR_chr7_55209978_ENST00000442591_length(transcript)=2555nt_BP=274nt
ACTCAATGTCGGAACGTTCCGAAGATGACGTCGGAGCGTTCTCGAATCCCGTGTCTCTCGGCTGCTGCTGCCGAAGGAACAGGGAAAAAG
CAACAAGAAGGAAGAGCAATGGCGACACTGGATCGCAAAGTGCCCAGTCCGGAGGCGTTTCTGGGCAAACCCTGGTCCTCCTGGATCGAC
GCCGCCAAATTACACTGCTCCGACAATGTAGATTTAGAAGAGGCTGGAAAAGAGGGTGGAAAAAGCAGGGAGGTTATGAGGCTTAATAAA
GAAGTTTGCCAAGGCACGAGTAACAAGCTCACGCAGTTGGGCACTTTTGAAGATCATTTTCTCAGCCTCCAGAGGATGTTCAATAACTGT
GAGGTGGTCCTTGGGAATTTGGAAATTACCTATGTGCAGAGGAATTATGATCTTTCCTTCTTAAAGACCATCCAGGAGGTGGCTGGTTAT
GTCCTCATTGCCCTCAACACAGTGGAGCGAATTCCTTTGGAAAACCTGCAGATCATCAGAGGAAATATGTACTACGAAAATTCCTATGCC
TTAGCAGTCTTATCTAACTATGATGCAAATAAAACCGGACTGAAGGAGCTGCCCATGAGAAATTTACAGGAAATCCTGCATGGCGCCGTG
CGGTTCAGCAACAACCCTGCCCTGTGCAACGTGGAGAGCATCCAGTGGCGGGACATAGTCAGCAGTGACTTTCTCAGCAACATGTCGATG
GACTTCCAGAACCACCTGGGCAGCTGCCAAAAGTGTGATCCAAGCTGTCCCAATGGGAGCTGCTGGGGTGCAGGAGAGGAGAACTGCCAG
AAACTGACCAAAATCATCTGTGCCCAGCAGTGCTCCGGGCGCTGCCGTGGCAAGTCCCCCAGTGACTGCTGCCACAACCAGTGTGCTGCA
GGCTGCACAGGCCCCCGGGAGAGCGACTGCCTGGTCTGCCGCAAATTCCGAGACGAAGCCACGTGCAAGGACACCTGCCCCCCACTCATG
CTCTACAACCCCACCACGTACCAGATGGATGTGAACCCCGAGGGCAAATACAGCTTTGGTGCCACCTGCGTGAAGAAGTGTCCCCGTAAT
TATGTGGTGACAGATCACGGCTCGTGCGTCCGAGCCTGTGGGGCCGACAGCTATGAGATGGAGGAAGACGGCGTCCGCAAGTGTAAGAAG
TGCGAAGGGCCTTGCCGCAAAGTGTGTAACGGAATAGGTATTGGTGAATTTAAAGACTCACTCTCCATAAATGCTACGAATATTAAACAC
TTCAAAAACTGCACCTCCATCAGTGGCGATCTCCACATCCTGCCGGTGGCATTTAGGGGTGACTCCTTCACACATACTCCTCCTCTGGAT
CCACAGGAACTGGATATTCTGAAAACCGTAAAGGAAATCACAGGGTTTTTGCTGATTCAGGCTTGGCCTGAAAACAGGACGGACCTCCAT
GCCTTTGAGAACCTAGAAATCATACGCGGCAGGACCAAGCAACATGGTCAGTTTTCTCTTGCAGTCGTCAGCCTGAACATAACATCCTTG
GGATTACGCTCCCTCAAGGAGATAAGTGATGGAGATGTGATAATTTCAGGAAACAAAAATTTGTGCTATGCAAATACAATAAACTGGAAA
AAACTGTTTGGGACCTCCGGTCAGAAAACCAAAATTATAAGCAACAGAGGTGAAAACAGCTGCAAGGCCACAGGCCAGGTCTGCCATGCC
TTGTGCTCCCCCGAGGGCTGCTGGGGCCCGGAGCCCAGGGACTGCGTCTCTTGCCGGAATGTCAGCCGAGGCAGGGAATGCGTGGACAAG
TGCAACCTTCTGGAGGGTGAGCCAAGGGAGTTTGTGGAGAACTCTGAGTGCATACAGTGCCACCCAGAGTGCCTGCCTCAGGCCATGAAC
ATCACCTGCACAGGACGGGGACCAGACAACTGTATCCAGTGTGCCCACTACATTGACGGCCCCCACTGCGTCAAGACCTGCCCGGCAGGA
GTCATGGGAGAAAACAACACCCTGGTCTGGAAGTACGCAGACGCCGGCCATGTGTGCCACCTGTGCCATCCAAACTGCACCTACGGATGC
ACTGGGCCAGGTCTTGAAGGCTGTCCAACGAATGGAAGCTACATAGTGTCTCACTTTCCAAGATCATTCTACAAGATGTCAGTGCACTGA
AACATGCAGGGGCGTGTTGAGTGCCAAGGATCTTGACAAGTTGTTTTGAAGATGGCATTTTGCTAAGTCCCTGAGGGTCACTGGTCCTCA
AAGCGGCATGGCGGCATGGCGTGGCTGGTTCTGCCACATGCCAGCTGTGTGACCTCTGAGACTCCACTTCTTCAGTGCTGAAAATAAAGA
AGGAGTTTTACTAAGGACCAAACAAGATAATGAATGTGAAACTGCTCCACGAACCCCAAAGAATTATGCACATAGATGCGATCATTAAGA
TGCGAAGCCATCGAGTTACCACCTGGCATGCTTAAACTGTAAAGAGTGGGTCAAAGTAAACTGAATTGGAAAATCCAAAGTTATGCAGAA

>8454_8454_5_ATXN7L1-EGFR_ATXN7L1_chr7_105516257_ENST00000318724_EGFR_chr7_55209978_ENST00000442591_length(amino acids)=711AA_BP=83
MTSERSRIPCLSAAAAEGTGKKQQEGRAMATLDRKVPSPEAFLGKPWSSWIDAAKLHCSDNVDLEEAGKEGGKSREVMRLNKEVCQGTSN
KLTQLGTFEDHFLSLQRMFNNCEVVLGNLEITYVQRNYDLSFLKTIQEVAGYVLIALNTVERIPLENLQIIRGNMYYENSYALAVLSNYD
ANKTGLKELPMRNLQEILHGAVRFSNNPALCNVESIQWRDIVSSDFLSNMSMDFQNHLGSCQKCDPSCPNGSCWGAGEENCQKLTKIICA
QQCSGRCRGKSPSDCCHNQCAAGCTGPRESDCLVCRKFRDEATCKDTCPPLMLYNPTTYQMDVNPEGKYSFGATCVKKCPRNYVVTDHGS
CVRACGADSYEMEEDGVRKCKKCEGPCRKVCNGIGIGEFKDSLSINATNIKHFKNCTSISGDLHILPVAFRGDSFTHTPPLDPQELDILK
TVKEITGFLLIQAWPENRTDLHAFENLEIIRGRTKQHGQFSLAVVSLNITSLGLRSLKEISDGDVIISGNKNLCYANTINWKKLFGTSGQ
KTKIISNRGENSCKATGQVCHALCSPEGCWGPEPRDCVSCRNVSRGRECVDKCNLLEGEPREFVENSECIQCHPECLPQAMNITCTGRGP

--------------------------------------------------------------
>8454_8454_6_ATXN7L1-EGFR_ATXN7L1_chr7_105516257_ENST00000318724_EGFR_chr7_55209978_ENST00000455089_length(transcript)=3773nt_BP=274nt
ACTCAATGTCGGAACGTTCCGAAGATGACGTCGGAGCGTTCTCGAATCCCGTGTCTCTCGGCTGCTGCTGCCGAAGGAACAGGGAAAAAG
CAACAAGAAGGAAGAGCAATGGCGACACTGGATCGCAAAGTGCCCAGTCCGGAGGCGTTTCTGGGCAAACCCTGGTCCTCCTGGATCGAC
GCCGCCAAATTACACTGCTCCGACAATGTAGATTTAGAAGAGGCTGGAAAAGAGGGTGGAAAAAGCAGGGAGGTTATGAGGCTTAATAAA
GAAGTTTGCCAAGGCACGAGTAACAAGCTCACGCAGTTGGGCACTTTTGAAGATCATTTTCTCAGCCTCCAGAGGATGTTCAATAACTGT
GAGGTGGTCCTTGGGAATTTGGAAATTACCTATGTGCAGAGGAATTATGATCTTTCCTTCTTAAAGACCATCCAGGAGGTGGCTGGTTAT
GTCCTCATTGCCCTCAACACAGTGGAGCGAATTCCTTTGGAAAACCTGCAGATCATCAGAGGAAATATGTACTACGAAAATTCCTATGCC
TTAGCAGTCTTATCTAACTATGATGCAAATAAAACCGGACTGAAGGAGCTGCCCATGAGAAATTTACAGGGCCAAAAGTGTGATCCAAGC
TGTCCCAATGGGAGCTGCTGGGGTGCAGGAGAGGAGAACTGCCAGAAACTGACCAAAATCATCTGTGCCCAGCAGTGCTCCGGGCGCTGC
CGTGGCAAGTCCCCCAGTGACTGCTGCCACAACCAGTGTGCTGCAGGCTGCACAGGCCCCCGGGAGAGCGACTGCCTGGTCTGCCGCAAA
TTCCGAGACGAAGCCACGTGCAAGGACACCTGCCCCCCACTCATGCTCTACAACCCCACCACGTACCAGATGGATGTGAACCCCGAGGGC
AAATACAGCTTTGGTGCCACCTGCGTGAAGAAGTGTCCCCGTAATTATGTGGTGACAGATCACGGCTCGTGCGTCCGAGCCTGTGGGGCC
GACAGCTATGAGATGGAGGAAGACGGCGTCCGCAAGTGTAAGAAGTGCGAAGGGCCTTGCCGCAAAGTGTGTAACGGAATAGGTATTGGT
GAATTTAAAGACTCACTCTCCATAAATGCTACGAATATTAAACACTTCAAAAACTGCACCTCCATCAGTGGCGATCTCCACATCCTGCCG
GTGGCATTTAGGGGTGACTCCTTCACACATACTCCTCCTCTGGATCCACAGGAACTGGATATTCTGAAAACCGTAAAGGAAATCACAGGG
TTTTTGCTGATTCAGGCTTGGCCTGAAAACAGGACGGACCTCCATGCCTTTGAGAACCTAGAAATCATACGCGGCAGGACCAAGCAACAT
GGTCAGTTTTCTCTTGCAGTCGTCAGCCTGAACATAACATCCTTGGGATTACGCTCCCTCAAGGAGATAAGTGATGGAGATGTGATAATT
TCAGGAAACAAAAATTTGTGCTATGCAAATACAATAAACTGGAAAAAACTGTTTGGGACCTCCGGTCAGAAAACCAAAATTATAAGCAAC
AGAGGTGAAAACAGCTGCAAGGCCACAGGCCAGGTCTGCCATGCCTTGTGCTCCCCCGAGGGCTGCTGGGGCCCGGAGCCCAGGGACTGC
GTCTCTTGCCGGAATGTCAGCCGAGGCAGGGAATGCGTGGACAAGTGCAACCTTCTGGAGGGTGAGCCAAGGGAGTTTGTGGAGAACTCT
GAGTGCATACAGTGCCACCCAGAGTGCCTGCCTCAGGCCATGAACATCACCTGCACAGGACGGGGACCAGACAACTGTATCCAGTGTGCC
CACTACATTGACGGCCCCCACTGCGTCAAGACCTGCCCGGCAGGAGTCATGGGAGAAAACAACACCCTGGTCTGGAAGTACGCAGACGCC
GGCCATGTGTGCCACCTGTGCCATCCAAACTGCACCTACGGATGCACTGGGCCAGGTCTTGAAGGCTGTCCAACGAATGGGCCTAAGATC
CCGTCCATCGCCACTGGGATGGTGGGGGCCCTCCTCTTGCTGCTGGTGGTGGCCCTGGGGATCGGCCTCTTCATGCGAAGGCGCCACATC
GTTCGGAAGCGCACGCTGCGGAGGCTGCTGCAGGAGAGGGAGCTTGTGGAGCCTCTTACACCCAGTGGAGAAGCTCCCAACCAAGCTCTC
TTGAGGATCTTGAAGGAAACTGAATTCAAAAAGATCAAAGTGCTGGGCTCCGGTGCGTTCGGCACGGTGTATAAGGGACTCTGGATCCCA
GAAGGTGAGAAAGTTAAAATTCCCGTCGCTATCAAGGAATTAAGAGAAGCAACATCTCCGAAAGCCAACAAGGAAATCCTCGATGAAGCC
TACGTGATGGCCAGCGTGGACAACCCCCACGTGTGCCGCCTGCTGGGCATCTGCCTCACCTCCACCGTGCAGCTCATCACGCAGCTCATG
CCCTTCGGCTGCCTCCTGGACTATGTCCGGGAACACAAAGACAATATTGGCTCCCAGTACCTGCTCAACTGGTGTGTGCAGATCGCAAAG
GGCATGAACTACTTGGAGGACCGTCGCTTGGTGCACCGCGACCTGGCAGCCAGGAACGTACTGGTGAAAACACCGCAGCATGTCAAGATC
ACAGATTTTGGGCTGGCCAAACTGCTGGGTGCGGAAGAGAAAGAATACCATGCAGAAGGAGGCAAAGTGCCTATCAAGTGGATGGCATTG
GAATCAATTTTACACAGAATCTATACCCACCAGAGTGATGTCTGGAGCTACGGGGTGACTGTTTGGGAGTTGATGACCTTTGGATCCAAG
CCATATGACGGAATCCCTGCCAGCGAGATCTCCTCCATCCTGGAGAAAGGAGAACGCCTCCCTCAGCCACCCATATGTACCATCGATGTC
TACATGATCATGGTCAAGTGCTGGATGATAGACGCAGATAGTCGCCCAAAGTTCCGTGAGTTGATCATCGAATTCTCCAAAATGGCCCGA
GACCCCCAGCGCTACCTTGTCATTCAGGGGGATGAAAGAATGCATTTGCCAAGTCCTACAGACTCCAACTTCTACCGTGCCCTGATGGAT
GAAGAAGACATGGACGACGTGGTGGATGCCGACGAGTACCTCATCCCACAGCAGGGCTTCTTCAGCAGCCCCTCCACGTCACGGACTCCC
CTCCTGAGCTCTCTGAGTGCAACCAGCAACAATTCCACCGTGGCTTGCATTGATAGAAATGGGCTGCAAAGCTGTCCCATCAAGGAAGAC
AGCTTCTTGCAGCGATACAGCTCAGACCCCACAGGCGCCTTGACTGAGGACAGCATAGACGACACCTTCCTCCCAGTGCCTGGTGAGTGG
CTTGTCTGGAAACAGTCCTGCTCCTCAACCTCCTCGACCCACTCAGCAGCAGCCAGTCTCCAGTGTCCAAGCCAGGTGCTCCCTCCAGCA
TCTCCAGAGGGGGAAACAGTGGCAGATTTGCAGACACAGTGAAGGGCGTAAGGAGCAGATAAACACATGACCGAGCCTGCACAAGCTCTT
TGTTGTGTCTGGTTGTTTGCTGTACCTCTGTTGTAAGAATGAATCTGCAAAATTTCTAGCTTATGAAGCAAATCACGGACATACACATCT
GTGTGTGTGAGTGTTCATGATGTGTGTACATCTGTGTATGTGTGTGTGTGTATGTGTGTGTTTGTGACAGATTTGATCCCTGTTCTCTCT

>8454_8454_6_ATXN7L1-EGFR_ATXN7L1_chr7_105516257_ENST00000318724_EGFR_chr7_55209978_ENST00000455089_length(amino acids)=1145AA_BP=83
MTSERSRIPCLSAAAAEGTGKKQQEGRAMATLDRKVPSPEAFLGKPWSSWIDAAKLHCSDNVDLEEAGKEGGKSREVMRLNKEVCQGTSN
KLTQLGTFEDHFLSLQRMFNNCEVVLGNLEITYVQRNYDLSFLKTIQEVAGYVLIALNTVERIPLENLQIIRGNMYYENSYALAVLSNYD
ANKTGLKELPMRNLQGQKCDPSCPNGSCWGAGEENCQKLTKIICAQQCSGRCRGKSPSDCCHNQCAAGCTGPRESDCLVCRKFRDEATCK
DTCPPLMLYNPTTYQMDVNPEGKYSFGATCVKKCPRNYVVTDHGSCVRACGADSYEMEEDGVRKCKKCEGPCRKVCNGIGIGEFKDSLSI
NATNIKHFKNCTSISGDLHILPVAFRGDSFTHTPPLDPQELDILKTVKEITGFLLIQAWPENRTDLHAFENLEIIRGRTKQHGQFSLAVV
SLNITSLGLRSLKEISDGDVIISGNKNLCYANTINWKKLFGTSGQKTKIISNRGENSCKATGQVCHALCSPEGCWGPEPRDCVSCRNVSR
GRECVDKCNLLEGEPREFVENSECIQCHPECLPQAMNITCTGRGPDNCIQCAHYIDGPHCVKTCPAGVMGENNTLVWKYADAGHVCHLCH
PNCTYGCTGPGLEGCPTNGPKIPSIATGMVGALLLLLVVALGIGLFMRRRHIVRKRTLRRLLQERELVEPLTPSGEAPNQALLRILKETE
FKKIKVLGSGAFGTVYKGLWIPEGEKVKIPVAIKELREATSPKANKEILDEAYVMASVDNPHVCRLLGICLTSTVQLITQLMPFGCLLDY
VREHKDNIGSQYLLNWCVQIAKGMNYLEDRRLVHRDLAARNVLVKTPQHVKITDFGLAKLLGAEEKEYHAEGGKVPIKWMALESILHRIY
THQSDVWSYGVTVWELMTFGSKPYDGIPASEISSILEKGERLPQPPICTIDVYMIMVKCWMIDADSRPKFRELIIEFSKMARDPQRYLVI
QGDERMHLPSPTDSNFYRALMDEEDMDDVVDADEYLIPQQGFFSSPSTSRTPLLSSLSATSNNSTVACIDRNGLQSCPIKEDSFLQRYSS

--------------------------------------------------------------
>8454_8454_7_ATXN7L1-EGFR_ATXN7L1_chr7_105516257_ENST00000419735_EGFR_chr7_55209978_ENST00000275493_length(transcript)=9852nt_BP=296nt
CCCTCCCTCCCCCTTTCAGAACACTCAATGTCGGAACGTTCCGAAGATGACGTCGGAGCGTTCTCGAATCCCGTGTCTCTCGGCTGCTGC
TGCCGAAGGAACAGGGAAAAAGCAACAAGAAGGAAGAGCAATGGCGACACTGGATCGCAAAGTGCCCAGTCCGGAGGCGTTTCTGGGCAA
ACCCTGGTCCTCCTGGATCGACGCCGCCAAATTACACTGCTCCGACAATGTAGATTTAGAAGAGGCTGGAAAAGAGGGTGGAAAAAGCAG
GGAGGTTATGAGGCTTAATAAAGAAGTTTGCCAAGGCACGAGTAACAAGCTCACGCAGTTGGGCACTTTTGAAGATCATTTTCTCAGCCT
CCAGAGGATGTTCAATAACTGTGAGGTGGTCCTTGGGAATTTGGAAATTACCTATGTGCAGAGGAATTATGATCTTTCCTTCTTAAAGAC
CATCCAGGAGGTGGCTGGTTATGTCCTCATTGCCCTCAACACAGTGGAGCGAATTCCTTTGGAAAACCTGCAGATCATCAGAGGAAATAT
GTACTACGAAAATTCCTATGCCTTAGCAGTCTTATCTAACTATGATGCAAATAAAACCGGACTGAAGGAGCTGCCCATGAGAAATTTACA
GGAAATCCTGCATGGCGCCGTGCGGTTCAGCAACAACCCTGCCCTGTGCAACGTGGAGAGCATCCAGTGGCGGGACATAGTCAGCAGTGA
CTTTCTCAGCAACATGTCGATGGACTTCCAGAACCACCTGGGCAGCTGCCAAAAGTGTGATCCAAGCTGTCCCAATGGGAGCTGCTGGGG
TGCAGGAGAGGAGAACTGCCAGAAACTGACCAAAATCATCTGTGCCCAGCAGTGCTCCGGGCGCTGCCGTGGCAAGTCCCCCAGTGACTG
CTGCCACAACCAGTGTGCTGCAGGCTGCACAGGCCCCCGGGAGAGCGACTGCCTGGTCTGCCGCAAATTCCGAGACGAAGCCACGTGCAA
GGACACCTGCCCCCCACTCATGCTCTACAACCCCACCACGTACCAGATGGATGTGAACCCCGAGGGCAAATACAGCTTTGGTGCCACCTG
CGTGAAGAAGTGTCCCCGTAATTATGTGGTGACAGATCACGGCTCGTGCGTCCGAGCCTGTGGGGCCGACAGCTATGAGATGGAGGAAGA
CGGCGTCCGCAAGTGTAAGAAGTGCGAAGGGCCTTGCCGCAAAGTGTGTAACGGAATAGGTATTGGTGAATTTAAAGACTCACTCTCCAT
AAATGCTACGAATATTAAACACTTCAAAAACTGCACCTCCATCAGTGGCGATCTCCACATCCTGCCGGTGGCATTTAGGGGTGACTCCTT
CACACATACTCCTCCTCTGGATCCACAGGAACTGGATATTCTGAAAACCGTAAAGGAAATCACAGGGTTTTTGCTGATTCAGGCTTGGCC
TGAAAACAGGACGGACCTCCATGCCTTTGAGAACCTAGAAATCATACGCGGCAGGACCAAGCAACATGGTCAGTTTTCTCTTGCAGTCGT
CAGCCTGAACATAACATCCTTGGGATTACGCTCCCTCAAGGAGATAAGTGATGGAGATGTGATAATTTCAGGAAACAAAAATTTGTGCTA
TGCAAATACAATAAACTGGAAAAAACTGTTTGGGACCTCCGGTCAGAAAACCAAAATTATAAGCAACAGAGGTGAAAACAGCTGCAAGGC
CACAGGCCAGGTCTGCCATGCCTTGTGCTCCCCCGAGGGCTGCTGGGGCCCGGAGCCCAGGGACTGCGTCTCTTGCCGGAATGTCAGCCG
AGGCAGGGAATGCGTGGACAAGTGCAACCTTCTGGAGGGTGAGCCAAGGGAGTTTGTGGAGAACTCTGAGTGCATACAGTGCCACCCAGA
GTGCCTGCCTCAGGCCATGAACATCACCTGCACAGGACGGGGACCAGACAACTGTATCCAGTGTGCCCACTACATTGACGGCCCCCACTG
CGTCAAGACCTGCCCGGCAGGAGTCATGGGAGAAAACAACACCCTGGTCTGGAAGTACGCAGACGCCGGCCATGTGTGCCACCTGTGCCA
TCCAAACTGCACCTACGGATGCACTGGGCCAGGTCTTGAAGGCTGTCCAACGAATGGGCCTAAGATCCCGTCCATCGCCACTGGGATGGT
GGGGGCCCTCCTCTTGCTGCTGGTGGTGGCCCTGGGGATCGGCCTCTTCATGCGAAGGCGCCACATCGTTCGGAAGCGCACGCTGCGGAG
GCTGCTGCAGGAGAGGGAGCTTGTGGAGCCTCTTACACCCAGTGGAGAAGCTCCCAACCAAGCTCTCTTGAGGATCTTGAAGGAAACTGA
ATTCAAAAAGATCAAAGTGCTGGGCTCCGGTGCGTTCGGCACGGTGTATAAGGGACTCTGGATCCCAGAAGGTGAGAAAGTTAAAATTCC
CGTCGCTATCAAGGAATTAAGAGAAGCAACATCTCCGAAAGCCAACAAGGAAATCCTCGATGAAGCCTACGTGATGGCCAGCGTGGACAA
CCCCCACGTGTGCCGCCTGCTGGGCATCTGCCTCACCTCCACCGTGCAGCTCATCACGCAGCTCATGCCCTTCGGCTGCCTCCTGGACTA
TGTCCGGGAACACAAAGACAATATTGGCTCCCAGTACCTGCTCAACTGGTGTGTGCAGATCGCAAAGGGCATGAACTACTTGGAGGACCG
TCGCTTGGTGCACCGCGACCTGGCAGCCAGGAACGTACTGGTGAAAACACCGCAGCATGTCAAGATCACAGATTTTGGGCTGGCCAAACT
GCTGGGTGCGGAAGAGAAAGAATACCATGCAGAAGGAGGCAAAGTGCCTATCAAGTGGATGGCATTGGAATCAATTTTACACAGAATCTA
TACCCACCAGAGTGATGTCTGGAGCTACGGGGTGACTGTTTGGGAGTTGATGACCTTTGGATCCAAGCCATATGACGGAATCCCTGCCAG
CGAGATCTCCTCCATCCTGGAGAAAGGAGAACGCCTCCCTCAGCCACCCATATGTACCATCGATGTCTACATGATCATGGTCAAGTGCTG
GATGATAGACGCAGATAGTCGCCCAAAGTTCCGTGAGTTGATCATCGAATTCTCCAAAATGGCCCGAGACCCCCAGCGCTACCTTGTCAT
TCAGGGGGATGAAAGAATGCATTTGCCAAGTCCTACAGACTCCAACTTCTACCGTGCCCTGATGGATGAAGAAGACATGGACGACGTGGT
GGATGCCGACGAGTACCTCATCCCACAGCAGGGCTTCTTCAGCAGCCCCTCCACGTCACGGACTCCCCTCCTGAGCTCTCTGAGTGCAAC
CAGCAACAATTCCACCGTGGCTTGCATTGATAGAAATGGGCTGCAAAGCTGTCCCATCAAGGAAGACAGCTTCTTGCAGCGATACAGCTC
AGACCCCACAGGCGCCTTGACTGAGGACAGCATAGACGACACCTTCCTCCCAGTGCCTGAATACATAAACCAGTCCGTTCCCAAAAGGCC
CGCTGGCTCTGTGCAGAATCCTGTCTATCACAATCAGCCTCTGAACCCCGCGCCCAGCAGAGACCCACACTACCAGGACCCCCACAGCAC
TGCAGTGGGCAACCCCGAGTATCTCAACACTGTCCAGCCCACCTGTGTCAACAGCACATTCGACAGCCCTGCCCACTGGGCCCAGAAAGG
CAGCCACCAAATTAGCCTGGACAACCCTGACTACCAGCAGGACTTCTTTCCCAAGGAAGCCAAGCCAAATGGCATCTTTAAGGGCTCCAC
AGCTGAAAATGCAGAATACCTAAGGGTCGCGCCACAAAGCAGTGAATTTATTGGAGCATGACCACGGAGGATAGTATGAGCCCTAAAAAT
CCAGACTCTTTCGATACCCAGGACCAAGCCACAGCAGGTCCTCCATCCCAACAGCCATGCCCGCATTAGCTCTTAGACCCACAGACTGGT
TTTGCAACGTTTACACCGACTAGCCAGGAAGTACTTCCACCTCGGGCACATTTTGGGAAGTTGCATTCCTTTGTCTTCAAACTGTGAAGC
ATTTACAGAAACGCATCCAGCAAGAATATTGTCCCTTTGAGCAGAAATTTATCTTTCAAAGAGGTATATTTGAAAAAAAAAAAAAGTATA
TGTGAGGATTTTTATTGATTGGGGATCTTGGAGTTTTTCATTGTCGCTATTGATTTTTACTTCAATGGGCTCTTCCAACAAGGAAGAAGC
TTGCTGGTAGCACTTGCTACCCTGAGTTCATCCAGGCCCAACTGTGAGCAAGGAGCACAAGCCACAAGTCTTCCAGAGGATGCTTGATTC
CAGTGGTTCTGCTTCAAGGCTTCCACTGCAAAACACTAAAGATCCAAGAAGGCCTTCATGGCCCCAGCAGGCCGGATCGGTACTGTATCA
AGTCATGGCAGGTACAGTAGGATAAGCCACTCTGTCCCTTCCTGGGCAAAGAAGAAACGGAGGGGATGGAATTCTTCCTTAGACTTACTT
TTGTAAAAATGTCCCCACGGTACTTACTCCCCACTGATGGACCAGTGGTTTCCAGTCATGAGCGTTAGACTGACTTGTTTGTCTTCCATT
CCATTGTTTTGAAACTCAGTATGCTGCCCCTGTCTTGCTGTCATGAAATCAGCAAGAGAGGATGACACATCAAATAATAACTCGGATTCC
AGCCCACATTGGATTCATCAGCATTTGGACCAATAGCCCACAGCTGAGAATGTGGAATACCTAAGGATAGCACCGCTTTTGTTCTCGCAA
AAACGTATCTCCTAATTTGAGGCTCAGATGAAATGCATCAGGTCCTTTGGGGCATAGATCAGAAGACTACAAAAATGAAGCTGCTCTGAA
ATCTCCTTTAGCCATCACCCCAACCCCCCAAAATTAGTTTGTGTTACTTATGGAAGATAGTTTTCTCCTTTTACTTCACTTCAAAAGCTT
TTTACTCAAAGAGTATATGTTCCCTCCAGGTCAGCTGCCCCCAAACCCCCTCCTTACGCTTTGTCACACAAAAAGTGTCTCTGCCTTGAG
TCATCTATTCAAGCACTTACAGCTCTGGCCACAACAGGGCATTTTACAGGTGCGAATGACAGTAGCATTATGAGTAGTGTGGAATTCAGG
TAGTAAATATGAAACTAGGGTTTGAAATTGATAATGCTTTCACAACATTTGCAGATGTTTTAGAAGGAAAAAAGTTCCTTCCTAAAATAA
TTTCTCTACAATTGGAAGATTGGAAGATTCAGCTAGTTAGGAGCCCACCTTTTTTCCTAATCTGTGTGTGCCCTGTAACCTGACTGGTTA
ACAGCAGTCCTTTGTAAACAGTGTTTTAAACTCTCCTAGTCAATATCCACCCCATCCAATTTATCAAGGAAGAAATGGTTCAGAAAATAT
TTTCAGCCTACAGTTATGTTCAGTCACACACACATACAAAATGTTCCTTTTGCTTTTAAAGTAATTTTTGACTCCCAGATCAGTCAGAGC
CCCTACAGCATTGTTAAGAAAGTATTTGATTTTTGTCTCAATGAAAATAAAACTATATTCATTTCCACTCTATTATGCTCTCAAATACCC
CTAAGCATCTATACTAGCCTGGTATGGGTATGAAAGATACAAAGATAAATAAAACATAGTCCCTGATTCTAAGAAATTCACAATTTAGCA
AAGGAAATGGACTCATAGATGCTAACCTTAAAACAACGTGACAAATGCCAGACAGGACCCATCAGCCAGGCACTGTGAGAGCACAGAGCA
GGGAGGTTGGGTCCTGCCTGAGGAGACCTGGAAGGGAGGCCTCACAGGAGGATGACCAGGTCTCAGTCAGCGGGGAGGTGGAAAGTGCAG
GTGCATCAGGGGCACCCTGACCGAGGAAACAGCTGCCAGAGGCCTCCACTGCTAAAGTCCACATAAGGCTGAGGTCAGTCACCCTAAACA
ACCTGCTCCCTCTAAGCCAGGGGATGAGCTTGGAGCATCCCACAAGTTCCCTAAAAGTTGCAGCCCCCAGGGGGATTTTGAGCTATCATC
TCTGCACATGCTTAGTGAGAAGACTACACAACATTTCTAAGAATCTGAGATTTTATATTGTCAGTTAACCACTTTCATTATTCATTCACC
TCAGGACATGCAGAAATATTTCAGTCAGAACTGGGAAACAGAAGGACCTACATTCTGCTGTCACTTATGTGTCAAGAAGCAGATGATCGA
TGAGGCAGGTCAGTTGTAAGTGAGTCACATTGTAGCATTAAATTCTAGTATTTTTGTAGTTTGAAACAGTAACTTAATAAAAGAGCAAAA
GCTATTCTAGCTTTCTTCTTCATATTTTAATTTTCCACCATAAAGTTTAGTTGCTAAATTCTATTAATTTTAAGATTGTGCTTCCCAAAA
TAGTTCTCACTTCATCTGTCCAGGGAGGCACAGTTCTGTCTGGTAGAAGCCGCAAAGCCCTTAGCCTCTTCACGGATCTGGCGACTGTGA
TGGGCAGGTCAGGAGAGGAGCTGCCCAAAGTCCCATGATTTTCACCTAACAGCCCTGATCAGTCAGTACTCAAAGCTTGGACTCCATCCC
TGAAGGTCTTCCTGATTGATAGCCTGGCCTTAATACCCTACAGAAAGCCTGTCCATTGGCTGTTTCTTCCTCAGTCAGTTCCTGGAAGAC
CTTACCCCATGACCCCAGCTTCAGATGTGGTCTTTGGAAACAGAGGTCGAAGGAAAGTAAGGAGCTGAGAGCTCACATTCATAGGTGCCG
CCAGCCTTCGTGCATCTTCTTGCATCATCTCTAAGGAGCTCCTCTAATTACACCATGCCCGTCACCCCATGAGGGATCAGAGAAGGGATG
AGTCTTCTAAACTCTATATTCGCTGTGAGTCCAGGTTGTAAGGGGGAGCACTGTGGATGCATCCTATTGCACTCCAGCTGATGACACCAA
AGCTTAGGTGTTTGCTGAAAGTTCTTGATGTTGTGACTTACCACCCCTGCCTCACAACTGCAGACATAAGGGGACTATGGATTGCTTAGC
AGGAAAGGCACTGGTTCTCAAGGGCGGCTGCCCTTGGGAATCTTCTGGTCCCAACCAGAAAGACTGTGGCTTGATTTTCTCAGGTGCAGC
CCAGCCGTAGGGCCTTTTCAGAGCACCCCCTGGTTATTGCAACATTCATCAAAGTTTCTAGAACCTCTGGCCTAAAGGAAGGGCCTGGTG
GGATCTACTTGGCACTCGCTGGGGGGCCACCCCCCAGTGCCACTCTCACTAGGCCTCTGATTGCACTTGTGTAGGATGAAGCTGGTGGGT
GATGGGAACTCAGCACCTCCCCTCAGGCAGAAAAGAATCATCTGTGGAGCTTCAAAAGAAGGGGCCTGGAGTCTCTGCAGACCAATTCAA
CCCAAATCTCGGGGGCTCTTTCATGATTCTAATGGGCAACCAGGGTTGAAACCCTTATTTCTAGGGTCTTCAGTTGTACAAGACTGTGGG
TCTGTACCAGAGCCCCCGTCAGAGTAGAATAAAAGGCTGGGTAGGGTAGAGATTCCCATGTGCAGTGGAGAGAACAATCTGCAGTCACTG
ATAAGCCTGAGACTTGGCTCATTTCAAAAGCGTTCAATTCATCCTCACCAGCAGTTCAGCTGGAAAGGGGCAAATACCCCCACCTGAGCT
TTGAAAACGCCCTGGGACCCTCTGCATTCTCTAAGTAAGTTATAGAAACCAGTCTCTTCCCTCCTTTGTGAGTGAGCTGCTATTCCACGT
AGGCAACACCTGTTGAAATTGCCCTCAATGTCTACTCTGCATTTCTTTCTTGTGATAAGCACACACTTTTATTGCAACATAATGATCTGC
TCACATTTCCTTGCCTGGGGGCTGTAAAACCTTACAGAACAGAAATCCTTGCCTCTTTCACCAGCCACACCTGCCATACCAGGGGTACAG
CTTTGTACTATTGAAGACACAGACAGGATTTTTAAATGTAAATCTATTTTTGTAACTTTGTTGCGGGATATAGTTCTCTTTATGTAGCAC
TGAACTTTGTACAATATATTTTTAGAAACTCATTTTTCTACTAAAACAAACACAGTTTACTTTAGAGAGACTGCAATAGAATCAAAATTT
GAAACTGAAATCTTTGTTTAAAAGGGTTAAGTTGAGGCAAGAGGAAAGCCCTTTCTCTCTCTTATAAAAAGGCACAACCTCATTGGGGAG
CTAAGCTAGGTCATTGTCATGGTGAAGAAGAGAAGCATCGTTTTTATATTTAGGAAATTTTAAAAGATGATGGAAAGCACATTTAGCTTG
GTCTGAGGCAGGTTCTGTTGGGGCAGTGTTAATGGAAAGGGCTCACTGTTGTTACTACTAGAAAAATCCAGTTGCATGCCATACTCTCAT
CATCTGCCAGTGTAACCCTGTACATGTAAGAAAAGCAATAACATAGCACTTTGTTGGTTTATATATATAATGTGACTTCAATGCAAATTT
TATTTTTATATTTACAATTGATATGCATTTACCAGTATAAACTAGACATGTCTGGAGAGCCTAATAATGTTCAGCACACTTTGGTTAGTT
CACCAACAGTCTTACCAAGCCTGGGCCCAGCCACCCTAGAGAAGTTATTCAGCCCTGGCTGCAGTGACATCACCTGAGGAGCTTTTAAAA
GCTTGAAGCCCAGCTACACCTCAGACCGATTAAACGCAAATCTCTGGGGCTGAAACCCAAGCATTCGTAGTTTTTAAAGCTCCTGAGGTC
ATTCCAATGTGCGGCCAAAGTTGAGAACTACTGGCCTAGGGATTAGCCACAAGGACATGGACTTGGAGGCAAATTCTGCAGGTGTATGTG
ATTCTCAGGCCTAGAGAGCTAAGACACAAAGACCTCCACATCTGTCGCTGAGAGTCAAGAACCTGAACAGAGTTTCCATGAAGGTTCTCC
AAGCACTAGAAGGGAGAGTGTCTAAACAATGGTTGAAAAGCAAAGGAAATATAAAACAGACACCTCTTTCCATTTCCTAAGGTTTCTCTC
TTTATTAAGGGTGGACTAGTAATAAAATATAATATTCTTGCTGCTTATGCAGCTGACATTGTTGCCCTCCCTAAAGCAACCAAGTAGCCT
TTATTTCCCACAGTGAAAGAAAACGCTGGCCTATCAGTTACATTACAAAAGGCAGATTTCAAGAGGATTGAGTAAGTAGTTGGATGGCTT
TCATAAAAACAAGAATTCAAGAAGAGGATTCATGCTTTAAGAAACATTTGTTATACATTCCTCACAAATTATACCTGGGATAAAAACTAT
GTAGCAGGCAGTGTGTTTTCCTTCCATGTCTCTCTGCACTACCTGCAGTGTGTCCTCTGAGGCTGCAAGTCTGTCCTATCTGAATTCCCA
GCAGAAGCACTAAGAAGCTCCACCCTATCACCTAGCAGATAAAACTATGGGGAAAACTTAAATCTGTGCATACATTTCTGGATGCATTTA
CTTATCTTTAAAAAAAAAGGAATCCTATGACCTGATTTGGCCACAAAAATAATCTTGCTGTACAATACAATCTCTTGGAAATTAAGAGAT
CCTATGGATTTGATGACTGGTATTAGAGGTGACAATGTAACCGATTAACAACAGACAGCAATAACTTCGTTTTAGAAACATTCAAGCAAT
AGCTTTATAGCTTCAACATATGGTACGTTTTAACCTTGAAAGTTTTGCAATGATGAAAGCAGTATTTGTACAAATGAAAAGCAGAATTCT
CTTTTATATGGTTTATACTGTTGATCAGAAATGTTGATTGTGCATTGAGTATTAAAAAATTAGATGTATATTATTCATTGTTCTTTACTC

>8454_8454_7_ATXN7L1-EGFR_ATXN7L1_chr7_105516257_ENST00000419735_EGFR_chr7_55209978_ENST00000275493_length(amino acids)=1264AA_BP=83
MTSERSRIPCLSAAAAEGTGKKQQEGRAMATLDRKVPSPEAFLGKPWSSWIDAAKLHCSDNVDLEEAGKEGGKSREVMRLNKEVCQGTSN
KLTQLGTFEDHFLSLQRMFNNCEVVLGNLEITYVQRNYDLSFLKTIQEVAGYVLIALNTVERIPLENLQIIRGNMYYENSYALAVLSNYD
ANKTGLKELPMRNLQEILHGAVRFSNNPALCNVESIQWRDIVSSDFLSNMSMDFQNHLGSCQKCDPSCPNGSCWGAGEENCQKLTKIICA
QQCSGRCRGKSPSDCCHNQCAAGCTGPRESDCLVCRKFRDEATCKDTCPPLMLYNPTTYQMDVNPEGKYSFGATCVKKCPRNYVVTDHGS
CVRACGADSYEMEEDGVRKCKKCEGPCRKVCNGIGIGEFKDSLSINATNIKHFKNCTSISGDLHILPVAFRGDSFTHTPPLDPQELDILK
TVKEITGFLLIQAWPENRTDLHAFENLEIIRGRTKQHGQFSLAVVSLNITSLGLRSLKEISDGDVIISGNKNLCYANTINWKKLFGTSGQ
KTKIISNRGENSCKATGQVCHALCSPEGCWGPEPRDCVSCRNVSRGRECVDKCNLLEGEPREFVENSECIQCHPECLPQAMNITCTGRGP
DNCIQCAHYIDGPHCVKTCPAGVMGENNTLVWKYADAGHVCHLCHPNCTYGCTGPGLEGCPTNGPKIPSIATGMVGALLLLLVVALGIGL
FMRRRHIVRKRTLRRLLQERELVEPLTPSGEAPNQALLRILKETEFKKIKVLGSGAFGTVYKGLWIPEGEKVKIPVAIKELREATSPKAN
KEILDEAYVMASVDNPHVCRLLGICLTSTVQLITQLMPFGCLLDYVREHKDNIGSQYLLNWCVQIAKGMNYLEDRRLVHRDLAARNVLVK
TPQHVKITDFGLAKLLGAEEKEYHAEGGKVPIKWMALESILHRIYTHQSDVWSYGVTVWELMTFGSKPYDGIPASEISSILEKGERLPQP
PICTIDVYMIMVKCWMIDADSRPKFRELIIEFSKMARDPQRYLVIQGDERMHLPSPTDSNFYRALMDEEDMDDVVDADEYLIPQQGFFSS
PSTSRTPLLSSLSATSNNSTVACIDRNGLQSCPIKEDSFLQRYSSDPTGALTEDSIDDTFLPVPEYINQSVPKRPAGSVQNPVYHNQPLN
PAPSRDPHYQDPHSTAVGNPEYLNTVQPTCVNSTFDSPAHWAQKGSHQISLDNPDYQQDFFPKEAKPNGIFKGSTAENAEYLRVAPQSSE

--------------------------------------------------------------
>8454_8454_8_ATXN7L1-EGFR_ATXN7L1_chr7_105516257_ENST00000419735_EGFR_chr7_55209978_ENST00000342916_length(transcript)=2201nt_BP=296nt
CCCTCCCTCCCCCTTTCAGAACACTCAATGTCGGAACGTTCCGAAGATGACGTCGGAGCGTTCTCGAATCCCGTGTCTCTCGGCTGCTGC
TGCCGAAGGAACAGGGAAAAAGCAACAAGAAGGAAGAGCAATGGCGACACTGGATCGCAAAGTGCCCAGTCCGGAGGCGTTTCTGGGCAA
ACCCTGGTCCTCCTGGATCGACGCCGCCAAATTACACTGCTCCGACAATGTAGATTTAGAAGAGGCTGGAAAAGAGGGTGGAAAAAGCAG
GGAGGTTATGAGGCTTAATAAAGAAGTTTGCCAAGGCACGAGTAACAAGCTCACGCAGTTGGGCACTTTTGAAGATCATTTTCTCAGCCT
CCAGAGGATGTTCAATAACTGTGAGGTGGTCCTTGGGAATTTGGAAATTACCTATGTGCAGAGGAATTATGATCTTTCCTTCTTAAAGAC
CATCCAGGAGGTGGCTGGTTATGTCCTCATTGCCCTCAACACAGTGGAGCGAATTCCTTTGGAAAACCTGCAGATCATCAGAGGAAATAT
GTACTACGAAAATTCCTATGCCTTAGCAGTCTTATCTAACTATGATGCAAATAAAACCGGACTGAAGGAGCTGCCCATGAGAAATTTACA
GGAAATCCTGCATGGCGCCGTGCGGTTCAGCAACAACCCTGCCCTGTGCAACGTGGAGAGCATCCAGTGGCGGGACATAGTCAGCAGTGA
CTTTCTCAGCAACATGTCGATGGACTTCCAGAACCACCTGGGCAGCTGCCAAAAGTGTGATCCAAGCTGTCCCAATGGGAGCTGCTGGGG
TGCAGGAGAGGAGAACTGCCAGAAACTGACCAAAATCATCTGTGCCCAGCAGTGCTCCGGGCGCTGCCGTGGCAAGTCCCCCAGTGACTG
CTGCCACAACCAGTGTGCTGCAGGCTGCACAGGCCCCCGGGAGAGCGACTGCCTGGTCTGCCGCAAATTCCGAGACGAAGCCACGTGCAA
GGACACCTGCCCCCCACTCATGCTCTACAACCCCACCACGTACCAGATGGATGTGAACCCCGAGGGCAAATACAGCTTTGGTGCCACCTG
CGTGAAGAAGTGTCCCCGTAATTATGTGGTGACAGATCACGGCTCGTGCGTCCGAGCCTGTGGGGCCGACAGCTATGAGATGGAGGAAGA
CGGCGTCCGCAAGTGTAAGAAGTGCGAAGGGCCTTGCCGCAAAGTGTGTAACGGAATAGGTATTGGTGAATTTAAAGACTCACTCTCCAT
AAATGCTACGAATATTAAACACTTCAAAAACTGCACCTCCATCAGTGGCGATCTCCACATCCTGCCGGTGGCATTTAGGGGTGACTCCTT
CACACATACTCCTCCTCTGGATCCACAGGAACTGGATATTCTGAAAACCGTAAAGGAAATCACAGGGTTTTTGCTGATTCAGGCTTGGCC
TGAAAACAGGACGGACCTCCATGCCTTTGAGAACCTAGAAATCATACGCGGCAGGACCAAGCAACATGGTCAGTTTTCTCTTGCAGTCGT
CAGCCTGAACATAACATCCTTGGGATTACGCTCCCTCAAGGAGATAAGTGATGGAGATGTGATAATTTCAGGAAACAAAAATTTGTGCTA
TGCAAATACAATAAACTGGAAAAAACTGTTTGGGACCTCCGGTCAGAAAACCAAAATTATAAGCAACAGAGGTGAAAACAGCTGCAAGGC
CACAGGCCAGGTCTGCCATGCCTTGTGCTCCCCCGAGGGCTGCTGGGGCCCGGAGCCCAGGGACTGCGTCTCTTGCCGGAATGTCAGCCG
AGGCAGGGAATGCGTGGACAAGTGCAACCTTCTGGAGGGTGAGCCAAGGGAGTTTGTGGAGAACTCTGAGTGCATACAGTGCCACCCAGA
GTGCCTGCCTCAGGCCATGAACATCACCTGCACAGGACGGGGACCAGACAACTGTATCCAGTGTGCCCACTACATTGACGGCCCCCACTG
CGTCAAGACCTGCCCGGCAGGAGTCATGGGAGAAAACAACACCCTGGTCTGGAAGTACGCAGACGCCGGCCATGTGTGCCACCTGTGCCA
TCCAAACTGCACCTACGGGTCCTAATAAATCTTCACTGTCTGACTTTAGTCTCCCACTAAAACTGCATTTCCTTTCTACAATTTCAATTT

>8454_8454_8_ATXN7L1-EGFR_ATXN7L1_chr7_105516257_ENST00000419735_EGFR_chr7_55209978_ENST00000342916_length(amino acids)=682AA_BP=83
MTSERSRIPCLSAAAAEGTGKKQQEGRAMATLDRKVPSPEAFLGKPWSSWIDAAKLHCSDNVDLEEAGKEGGKSREVMRLNKEVCQGTSN
KLTQLGTFEDHFLSLQRMFNNCEVVLGNLEITYVQRNYDLSFLKTIQEVAGYVLIALNTVERIPLENLQIIRGNMYYENSYALAVLSNYD
ANKTGLKELPMRNLQEILHGAVRFSNNPALCNVESIQWRDIVSSDFLSNMSMDFQNHLGSCQKCDPSCPNGSCWGAGEENCQKLTKIICA
QQCSGRCRGKSPSDCCHNQCAAGCTGPRESDCLVCRKFRDEATCKDTCPPLMLYNPTTYQMDVNPEGKYSFGATCVKKCPRNYVVTDHGS
CVRACGADSYEMEEDGVRKCKKCEGPCRKVCNGIGIGEFKDSLSINATNIKHFKNCTSISGDLHILPVAFRGDSFTHTPPLDPQELDILK
TVKEITGFLLIQAWPENRTDLHAFENLEIIRGRTKQHGQFSLAVVSLNITSLGLRSLKEISDGDVIISGNKNLCYANTINWKKLFGTSGQ
KTKIISNRGENSCKATGQVCHALCSPEGCWGPEPRDCVSCRNVSRGRECVDKCNLLEGEPREFVENSECIQCHPECLPQAMNITCTGRGP

--------------------------------------------------------------
>8454_8454_9_ATXN7L1-EGFR_ATXN7L1_chr7_105516257_ENST00000419735_EGFR_chr7_55209978_ENST00000344576_length(transcript)=2827nt_BP=296nt
CCCTCCCTCCCCCTTTCAGAACACTCAATGTCGGAACGTTCCGAAGATGACGTCGGAGCGTTCTCGAATCCCGTGTCTCTCGGCTGCTGC
TGCCGAAGGAACAGGGAAAAAGCAACAAGAAGGAAGAGCAATGGCGACACTGGATCGCAAAGTGCCCAGTCCGGAGGCGTTTCTGGGCAA
ACCCTGGTCCTCCTGGATCGACGCCGCCAAATTACACTGCTCCGACAATGTAGATTTAGAAGAGGCTGGAAAAGAGGGTGGAAAAAGCAG
GGAGGTTATGAGGCTTAATAAAGAAGTTTGCCAAGGCACGAGTAACAAGCTCACGCAGTTGGGCACTTTTGAAGATCATTTTCTCAGCCT
CCAGAGGATGTTCAATAACTGTGAGGTGGTCCTTGGGAATTTGGAAATTACCTATGTGCAGAGGAATTATGATCTTTCCTTCTTAAAGAC
CATCCAGGAGGTGGCTGGTTATGTCCTCATTGCCCTCAACACAGTGGAGCGAATTCCTTTGGAAAACCTGCAGATCATCAGAGGAAATAT
GTACTACGAAAATTCCTATGCCTTAGCAGTCTTATCTAACTATGATGCAAATAAAACCGGACTGAAGGAGCTGCCCATGAGAAATTTACA
GGAAATCCTGCATGGCGCCGTGCGGTTCAGCAACAACCCTGCCCTGTGCAACGTGGAGAGCATCCAGTGGCGGGACATAGTCAGCAGTGA
CTTTCTCAGCAACATGTCGATGGACTTCCAGAACCACCTGGGCAGCTGCCAAAAGTGTGATCCAAGCTGTCCCAATGGGAGCTGCTGGGG
TGCAGGAGAGGAGAACTGCCAGAAACTGACCAAAATCATCTGTGCCCAGCAGTGCTCCGGGCGCTGCCGTGGCAAGTCCCCCAGTGACTG
CTGCCACAACCAGTGTGCTGCAGGCTGCACAGGCCCCCGGGAGAGCGACTGCCTGGTCTGCCGCAAATTCCGAGACGAAGCCACGTGCAA
GGACACCTGCCCCCCACTCATGCTCTACAACCCCACCACGTACCAGATGGATGTGAACCCCGAGGGCAAATACAGCTTTGGTGCCACCTG
CGTGAAGAAGTGTCCCCGTAATTATGTGGTGACAGATCACGGCTCGTGCGTCCGAGCCTGTGGGGCCGACAGCTATGAGATGGAGGAAGA
CGGCGTCCGCAAGTGTAAGAAGTGCGAAGGGCCTTGCCGCAAAGTGTGTAACGGAATAGGTATTGGTGAATTTAAAGACTCACTCTCCAT
AAATGCTACGAATATTAAACACTTCAAAAACTGCACCTCCATCAGTGGCGATCTCCACATCCTGCCGGTGGCATTTAGGGGTGACTCCTT
CACACATACTCCTCCTCTGGATCCACAGGAACTGGATATTCTGAAAACCGTAAAGGAAATCACAGGGTTTTTGCTGATTCAGGCTTGGCC
TGAAAACAGGACGGACCTCCATGCCTTTGAGAACCTAGAAATCATACGCGGCAGGACCAAGCAACATGGTCAGTTTTCTCTTGCAGTCGT
CAGCCTGAACATAACATCCTTGGGATTACGCTCCCTCAAGGAGATAAGTGATGGAGATGTGATAATTTCAGGAAACAAAAATTTGTGCTA
TGCAAATACAATAAACTGGAAAAAACTGTTTGGGACCTCCGGTCAGAAAACCAAAATTATAAGCAACAGAGGTGAAAACAGCTGCAAGGC
CACAGGCCAGGTCTGCCATGCCTTGTGCTCCCCCGAGGGCTGCTGGGGCCCGGAGCCCAGGGACTGCGTCTCTTGCCGGAATGTCAGCCG
AGGCAGGGAATGCGTGGACAAGTGCAACCTTCTGGAGGGTGAGCCAAGGGAGTTTGTGGAGAACTCTGAGTGCATACAGTGCCACCCAGA
GTGCCTGCCTCAGGCCATGAACATCACCTGCACAGGACGGGGACCAGACAACTGTATCCAGTGTGCCCACTACATTGACGGCCCCCACTG
CGTCAAGACCTGCCCGGCAGGAGTCATGGGAGAAAACAACACCCTGGTCTGGAAGTACGCAGACGCCGGCCATGTGTGCCACCTGTGCCA
TCCAAACTGCACCTACGGGCCAGGAAATGAGAGTCTCAAAGCCATGTTATTCTGCCTTTTTAAACTATCATCCTGTAATCAAAGTAATGA
TGGCAGCGTGTCCCACCAGAGCGGGAGCCCAGCTGCTCAGGAGTCATGCTTAGGATGGATCCCTTCTCTTCTGCCGTCAGAGTTTCAGCT
GGGTTGGGGTGGATGCAGCCACCTCCATGCCTGGCCTTCTGCATCTGTGATCATCACGGCCTCCTCCTGCCACTGAGCCTCATGCCTTCA
CGTGTCTGTTCCCCCCGCTTTTCCTTTCTGCCACCCCTGCACGTGGGCCGCCAGGTTCCCAAGAGTATCCTACCCATTTCCTTCCTTCCA
CTCCCTTTGCCAGTGCCTCTCACCCCAACTAGTAGCTAACCATCACCCCCAGGACTGACCTCTTCCTCCTCGCTGCCAGATGATTGTTCA
AAGCACAGAATTTGTCAGAAACCTGCAGGGACTCCATGCTGCCAGCCTTCTCCGTAATTAGCATGGCCCCAGTCCATGCTTCTAGCCTTG
GTTCCTTCTGCCCCTCTGTTTGAAATTCTAGAGCCAGCTGTGGGACAATTATCTGTGTCAAAAGCCAGATGTGAAAACATCTCAATAACA
AACTGGCTGCTTTGTTCAATGCTAGAACAACGCCTGTCACAGAGTAGAAACTCAAAAATATTTGCTGAGTGAATGAACAAATGAATAAAT

>8454_8454_9_ATXN7L1-EGFR_ATXN7L1_chr7_105516257_ENST00000419735_EGFR_chr7_55209978_ENST00000344576_length(amino acids)=759AA_BP=83
MTSERSRIPCLSAAAAEGTGKKQQEGRAMATLDRKVPSPEAFLGKPWSSWIDAAKLHCSDNVDLEEAGKEGGKSREVMRLNKEVCQGTSN
KLTQLGTFEDHFLSLQRMFNNCEVVLGNLEITYVQRNYDLSFLKTIQEVAGYVLIALNTVERIPLENLQIIRGNMYYENSYALAVLSNYD
ANKTGLKELPMRNLQEILHGAVRFSNNPALCNVESIQWRDIVSSDFLSNMSMDFQNHLGSCQKCDPSCPNGSCWGAGEENCQKLTKIICA
QQCSGRCRGKSPSDCCHNQCAAGCTGPRESDCLVCRKFRDEATCKDTCPPLMLYNPTTYQMDVNPEGKYSFGATCVKKCPRNYVVTDHGS
CVRACGADSYEMEEDGVRKCKKCEGPCRKVCNGIGIGEFKDSLSINATNIKHFKNCTSISGDLHILPVAFRGDSFTHTPPLDPQELDILK
TVKEITGFLLIQAWPENRTDLHAFENLEIIRGRTKQHGQFSLAVVSLNITSLGLRSLKEISDGDVIISGNKNLCYANTINWKKLFGTSGQ
KTKIISNRGENSCKATGQVCHALCSPEGCWGPEPRDCVSCRNVSRGRECVDKCNLLEGEPREFVENSECIQCHPECLPQAMNITCTGRGP
DNCIQCAHYIDGPHCVKTCPAGVMGENNTLVWKYADAGHVCHLCHPNCTYGPGNESLKAMLFCLFKLSSCNQSNDGSVSHQSGSPAAQES

--------------------------------------------------------------
>8454_8454_10_ATXN7L1-EGFR_ATXN7L1_chr7_105516257_ENST00000419735_EGFR_chr7_55209978_ENST00000420316_length(transcript)=1534nt_BP=296nt
CCCTCCCTCCCCCTTTCAGAACACTCAATGTCGGAACGTTCCGAAGATGACGTCGGAGCGTTCTCGAATCCCGTGTCTCTCGGCTGCTGC
TGCCGAAGGAACAGGGAAAAAGCAACAAGAAGGAAGAGCAATGGCGACACTGGATCGCAAAGTGCCCAGTCCGGAGGCGTTTCTGGGCAA
ACCCTGGTCCTCCTGGATCGACGCCGCCAAATTACACTGCTCCGACAATGTAGATTTAGAAGAGGCTGGAAAAGAGGGTGGAAAAAGCAG
GGAGGTTATGAGGCTTAATAAAGAAGTTTGCCAAGGCACGAGTAACAAGCTCACGCAGTTGGGCACTTTTGAAGATCATTTTCTCAGCCT
CCAGAGGATGTTCAATAACTGTGAGGTGGTCCTTGGGAATTTGGAAATTACCTATGTGCAGAGGAATTATGATCTTTCCTTCTTAAAGAC
CATCCAGGAGGTGGCTGGTTATGTCCTCATTGCCCTCAACACAGTGGAGCGAATTCCTTTGGAAAACCTGCAGATCATCAGAGGAAATAT
GTACTACGAAAATTCCTATGCCTTAGCAGTCTTATCTAACTATGATGCAAATAAAACCGGACTGAAGGAGCTGCCCATGAGAAATTTACA
GGAAATCCTGCATGGCGCCGTGCGGTTCAGCAACAACCCTGCCCTGTGCAACGTGGAGAGCATCCAGTGGCGGGACATAGTCAGCAGTGA
CTTTCTCAGCAACATGTCGATGGACTTCCAGAACCACCTGGGCAGCTGCCAAAAGTGTGATCCAAGCTGTCCCAATGGGAGCTGCTGGGG
TGCAGGAGAGGAGAACTGCCAGAAACTGACCAAAATCATCTGTGCCCAGCAGTGCTCCGGGCGCTGCCGTGGCAAGTCCCCCAGTGACTG
CTGCCACAACCAGTGTGCTGCAGGCTGCACAGGCCCCCGGGAGAGCGACTGCCTGGTCTGCCGCAAATTCCGAGACGAAGCCACGTGCAA
GGACACCTGCCCCCCACTCATGCTCTACAACCCCACCACGTACCAGATGGATGTGAACCCCGAGGGCAAATACAGCTTTGGTGCCACCTG
CGTGAAGAAGTGTCCCCGTAATTATGTGGTGACAGATCACGGCTCGTGCGTCCGAGCCTGTGGGGCCGACAGCTATGAGATGGAGGAAGA
CGGCGTCCGCAAGTGTAAGAAGTGCGAAGGGCCTTGCCGCAAAGTGTGTAACGGAATAGGTATTGGTGAATTTAAAGACTCACTCTCCAT
AAATGCTACGAATATTAAACACTTCAAAAACTGCACCTCCATCAGTGGCGATCTCCACATCCTGCCGGTGGCATTTAGGGGTGACTCCTT
CACACATACTCCTCCTCTGGATCCACAGGAACTGGATATTCTGAAAACCGTAAAGGAAATCACAGGTTTGAGCTGAATTATCACATGAAT
ATAAATGGGAAATCAGTGTTTTAGAGAGAGAACTTTTCGACATATTTCCTGTTCCCTTGGAATAAAAACATTTCTTCTGAAATTTTACCG

>8454_8454_10_ATXN7L1-EGFR_ATXN7L1_chr7_105516257_ENST00000419735_EGFR_chr7_55209978_ENST00000420316_length(amino acids)=459AA_BP=83
MTSERSRIPCLSAAAAEGTGKKQQEGRAMATLDRKVPSPEAFLGKPWSSWIDAAKLHCSDNVDLEEAGKEGGKSREVMRLNKEVCQGTSN
KLTQLGTFEDHFLSLQRMFNNCEVVLGNLEITYVQRNYDLSFLKTIQEVAGYVLIALNTVERIPLENLQIIRGNMYYENSYALAVLSNYD
ANKTGLKELPMRNLQEILHGAVRFSNNPALCNVESIQWRDIVSSDFLSNMSMDFQNHLGSCQKCDPSCPNGSCWGAGEENCQKLTKIICA
QQCSGRCRGKSPSDCCHNQCAAGCTGPRESDCLVCRKFRDEATCKDTCPPLMLYNPTTYQMDVNPEGKYSFGATCVKKCPRNYVVTDHGS
CVRACGADSYEMEEDGVRKCKKCEGPCRKVCNGIGIGEFKDSLSINATNIKHFKNCTSISGDLHILPVAFRGDSFTHTPPLDPQELDILK

--------------------------------------------------------------
>8454_8454_11_ATXN7L1-EGFR_ATXN7L1_chr7_105516257_ENST00000419735_EGFR_chr7_55209978_ENST00000442591_length(transcript)=2577nt_BP=296nt
CCCTCCCTCCCCCTTTCAGAACACTCAATGTCGGAACGTTCCGAAGATGACGTCGGAGCGTTCTCGAATCCCGTGTCTCTCGGCTGCTGC
TGCCGAAGGAACAGGGAAAAAGCAACAAGAAGGAAGAGCAATGGCGACACTGGATCGCAAAGTGCCCAGTCCGGAGGCGTTTCTGGGCAA
ACCCTGGTCCTCCTGGATCGACGCCGCCAAATTACACTGCTCCGACAATGTAGATTTAGAAGAGGCTGGAAAAGAGGGTGGAAAAAGCAG
GGAGGTTATGAGGCTTAATAAAGAAGTTTGCCAAGGCACGAGTAACAAGCTCACGCAGTTGGGCACTTTTGAAGATCATTTTCTCAGCCT
CCAGAGGATGTTCAATAACTGTGAGGTGGTCCTTGGGAATTTGGAAATTACCTATGTGCAGAGGAATTATGATCTTTCCTTCTTAAAGAC
CATCCAGGAGGTGGCTGGTTATGTCCTCATTGCCCTCAACACAGTGGAGCGAATTCCTTTGGAAAACCTGCAGATCATCAGAGGAAATAT
GTACTACGAAAATTCCTATGCCTTAGCAGTCTTATCTAACTATGATGCAAATAAAACCGGACTGAAGGAGCTGCCCATGAGAAATTTACA
GGAAATCCTGCATGGCGCCGTGCGGTTCAGCAACAACCCTGCCCTGTGCAACGTGGAGAGCATCCAGTGGCGGGACATAGTCAGCAGTGA
CTTTCTCAGCAACATGTCGATGGACTTCCAGAACCACCTGGGCAGCTGCCAAAAGTGTGATCCAAGCTGTCCCAATGGGAGCTGCTGGGG
TGCAGGAGAGGAGAACTGCCAGAAACTGACCAAAATCATCTGTGCCCAGCAGTGCTCCGGGCGCTGCCGTGGCAAGTCCCCCAGTGACTG
CTGCCACAACCAGTGTGCTGCAGGCTGCACAGGCCCCCGGGAGAGCGACTGCCTGGTCTGCCGCAAATTCCGAGACGAAGCCACGTGCAA
GGACACCTGCCCCCCACTCATGCTCTACAACCCCACCACGTACCAGATGGATGTGAACCCCGAGGGCAAATACAGCTTTGGTGCCACCTG
CGTGAAGAAGTGTCCCCGTAATTATGTGGTGACAGATCACGGCTCGTGCGTCCGAGCCTGTGGGGCCGACAGCTATGAGATGGAGGAAGA
CGGCGTCCGCAAGTGTAAGAAGTGCGAAGGGCCTTGCCGCAAAGTGTGTAACGGAATAGGTATTGGTGAATTTAAAGACTCACTCTCCAT
AAATGCTACGAATATTAAACACTTCAAAAACTGCACCTCCATCAGTGGCGATCTCCACATCCTGCCGGTGGCATTTAGGGGTGACTCCTT
CACACATACTCCTCCTCTGGATCCACAGGAACTGGATATTCTGAAAACCGTAAAGGAAATCACAGGGTTTTTGCTGATTCAGGCTTGGCC
TGAAAACAGGACGGACCTCCATGCCTTTGAGAACCTAGAAATCATACGCGGCAGGACCAAGCAACATGGTCAGTTTTCTCTTGCAGTCGT
CAGCCTGAACATAACATCCTTGGGATTACGCTCCCTCAAGGAGATAAGTGATGGAGATGTGATAATTTCAGGAAACAAAAATTTGTGCTA
TGCAAATACAATAAACTGGAAAAAACTGTTTGGGACCTCCGGTCAGAAAACCAAAATTATAAGCAACAGAGGTGAAAACAGCTGCAAGGC
CACAGGCCAGGTCTGCCATGCCTTGTGCTCCCCCGAGGGCTGCTGGGGCCCGGAGCCCAGGGACTGCGTCTCTTGCCGGAATGTCAGCCG
AGGCAGGGAATGCGTGGACAAGTGCAACCTTCTGGAGGGTGAGCCAAGGGAGTTTGTGGAGAACTCTGAGTGCATACAGTGCCACCCAGA
GTGCCTGCCTCAGGCCATGAACATCACCTGCACAGGACGGGGACCAGACAACTGTATCCAGTGTGCCCACTACATTGACGGCCCCCACTG
CGTCAAGACCTGCCCGGCAGGAGTCATGGGAGAAAACAACACCCTGGTCTGGAAGTACGCAGACGCCGGCCATGTGTGCCACCTGTGCCA
TCCAAACTGCACCTACGGATGCACTGGGCCAGGTCTTGAAGGCTGTCCAACGAATGGAAGCTACATAGTGTCTCACTTTCCAAGATCATT
CTACAAGATGTCAGTGCACTGAAACATGCAGGGGCGTGTTGAGTGCCAAGGATCTTGACAAGTTGTTTTGAAGATGGCATTTTGCTAAGT
CCCTGAGGGTCACTGGTCCTCAAAGCGGCATGGCGGCATGGCGTGGCTGGTTCTGCCACATGCCAGCTGTGTGACCTCTGAGACTCCACT
TCTTCAGTGCTGAAAATAAAGAAGGAGTTTTACTAAGGACCAAACAAGATAATGAATGTGAAACTGCTCCACGAACCCCAAAGAATTATG
CACATAGATGCGATCATTAAGATGCGAAGCCATCGAGTTACCACCTGGCATGCTTAAACTGTAAAGAGTGGGTCAAAGTAAACTGAATTG

>8454_8454_11_ATXN7L1-EGFR_ATXN7L1_chr7_105516257_ENST00000419735_EGFR_chr7_55209978_ENST00000442591_length(amino acids)=711AA_BP=83
MTSERSRIPCLSAAAAEGTGKKQQEGRAMATLDRKVPSPEAFLGKPWSSWIDAAKLHCSDNVDLEEAGKEGGKSREVMRLNKEVCQGTSN
KLTQLGTFEDHFLSLQRMFNNCEVVLGNLEITYVQRNYDLSFLKTIQEVAGYVLIALNTVERIPLENLQIIRGNMYYENSYALAVLSNYD
ANKTGLKELPMRNLQEILHGAVRFSNNPALCNVESIQWRDIVSSDFLSNMSMDFQNHLGSCQKCDPSCPNGSCWGAGEENCQKLTKIICA
QQCSGRCRGKSPSDCCHNQCAAGCTGPRESDCLVCRKFRDEATCKDTCPPLMLYNPTTYQMDVNPEGKYSFGATCVKKCPRNYVVTDHGS
CVRACGADSYEMEEDGVRKCKKCEGPCRKVCNGIGIGEFKDSLSINATNIKHFKNCTSISGDLHILPVAFRGDSFTHTPPLDPQELDILK
TVKEITGFLLIQAWPENRTDLHAFENLEIIRGRTKQHGQFSLAVVSLNITSLGLRSLKEISDGDVIISGNKNLCYANTINWKKLFGTSGQ
KTKIISNRGENSCKATGQVCHALCSPEGCWGPEPRDCVSCRNVSRGRECVDKCNLLEGEPREFVENSECIQCHPECLPQAMNITCTGRGP

--------------------------------------------------------------
>8454_8454_12_ATXN7L1-EGFR_ATXN7L1_chr7_105516257_ENST00000419735_EGFR_chr7_55209978_ENST00000455089_length(transcript)=3795nt_BP=296nt
CCCTCCCTCCCCCTTTCAGAACACTCAATGTCGGAACGTTCCGAAGATGACGTCGGAGCGTTCTCGAATCCCGTGTCTCTCGGCTGCTGC
TGCCGAAGGAACAGGGAAAAAGCAACAAGAAGGAAGAGCAATGGCGACACTGGATCGCAAAGTGCCCAGTCCGGAGGCGTTTCTGGGCAA
ACCCTGGTCCTCCTGGATCGACGCCGCCAAATTACACTGCTCCGACAATGTAGATTTAGAAGAGGCTGGAAAAGAGGGTGGAAAAAGCAG
GGAGGTTATGAGGCTTAATAAAGAAGTTTGCCAAGGCACGAGTAACAAGCTCACGCAGTTGGGCACTTTTGAAGATCATTTTCTCAGCCT
CCAGAGGATGTTCAATAACTGTGAGGTGGTCCTTGGGAATTTGGAAATTACCTATGTGCAGAGGAATTATGATCTTTCCTTCTTAAAGAC
CATCCAGGAGGTGGCTGGTTATGTCCTCATTGCCCTCAACACAGTGGAGCGAATTCCTTTGGAAAACCTGCAGATCATCAGAGGAAATAT
GTACTACGAAAATTCCTATGCCTTAGCAGTCTTATCTAACTATGATGCAAATAAAACCGGACTGAAGGAGCTGCCCATGAGAAATTTACA
GGGCCAAAAGTGTGATCCAAGCTGTCCCAATGGGAGCTGCTGGGGTGCAGGAGAGGAGAACTGCCAGAAACTGACCAAAATCATCTGTGC
CCAGCAGTGCTCCGGGCGCTGCCGTGGCAAGTCCCCCAGTGACTGCTGCCACAACCAGTGTGCTGCAGGCTGCACAGGCCCCCGGGAGAG
CGACTGCCTGGTCTGCCGCAAATTCCGAGACGAAGCCACGTGCAAGGACACCTGCCCCCCACTCATGCTCTACAACCCCACCACGTACCA
GATGGATGTGAACCCCGAGGGCAAATACAGCTTTGGTGCCACCTGCGTGAAGAAGTGTCCCCGTAATTATGTGGTGACAGATCACGGCTC
GTGCGTCCGAGCCTGTGGGGCCGACAGCTATGAGATGGAGGAAGACGGCGTCCGCAAGTGTAAGAAGTGCGAAGGGCCTTGCCGCAAAGT
GTGTAACGGAATAGGTATTGGTGAATTTAAAGACTCACTCTCCATAAATGCTACGAATATTAAACACTTCAAAAACTGCACCTCCATCAG
TGGCGATCTCCACATCCTGCCGGTGGCATTTAGGGGTGACTCCTTCACACATACTCCTCCTCTGGATCCACAGGAACTGGATATTCTGAA
AACCGTAAAGGAAATCACAGGGTTTTTGCTGATTCAGGCTTGGCCTGAAAACAGGACGGACCTCCATGCCTTTGAGAACCTAGAAATCAT
ACGCGGCAGGACCAAGCAACATGGTCAGTTTTCTCTTGCAGTCGTCAGCCTGAACATAACATCCTTGGGATTACGCTCCCTCAAGGAGAT
AAGTGATGGAGATGTGATAATTTCAGGAAACAAAAATTTGTGCTATGCAAATACAATAAACTGGAAAAAACTGTTTGGGACCTCCGGTCA
GAAAACCAAAATTATAAGCAACAGAGGTGAAAACAGCTGCAAGGCCACAGGCCAGGTCTGCCATGCCTTGTGCTCCCCCGAGGGCTGCTG
GGGCCCGGAGCCCAGGGACTGCGTCTCTTGCCGGAATGTCAGCCGAGGCAGGGAATGCGTGGACAAGTGCAACCTTCTGGAGGGTGAGCC
AAGGGAGTTTGTGGAGAACTCTGAGTGCATACAGTGCCACCCAGAGTGCCTGCCTCAGGCCATGAACATCACCTGCACAGGACGGGGACC
AGACAACTGTATCCAGTGTGCCCACTACATTGACGGCCCCCACTGCGTCAAGACCTGCCCGGCAGGAGTCATGGGAGAAAACAACACCCT
GGTCTGGAAGTACGCAGACGCCGGCCATGTGTGCCACCTGTGCCATCCAAACTGCACCTACGGATGCACTGGGCCAGGTCTTGAAGGCTG
TCCAACGAATGGGCCTAAGATCCCGTCCATCGCCACTGGGATGGTGGGGGCCCTCCTCTTGCTGCTGGTGGTGGCCCTGGGGATCGGCCT
CTTCATGCGAAGGCGCCACATCGTTCGGAAGCGCACGCTGCGGAGGCTGCTGCAGGAGAGGGAGCTTGTGGAGCCTCTTACACCCAGTGG
AGAAGCTCCCAACCAAGCTCTCTTGAGGATCTTGAAGGAAACTGAATTCAAAAAGATCAAAGTGCTGGGCTCCGGTGCGTTCGGCACGGT
GTATAAGGGACTCTGGATCCCAGAAGGTGAGAAAGTTAAAATTCCCGTCGCTATCAAGGAATTAAGAGAAGCAACATCTCCGAAAGCCAA
CAAGGAAATCCTCGATGAAGCCTACGTGATGGCCAGCGTGGACAACCCCCACGTGTGCCGCCTGCTGGGCATCTGCCTCACCTCCACCGT
GCAGCTCATCACGCAGCTCATGCCCTTCGGCTGCCTCCTGGACTATGTCCGGGAACACAAAGACAATATTGGCTCCCAGTACCTGCTCAA
CTGGTGTGTGCAGATCGCAAAGGGCATGAACTACTTGGAGGACCGTCGCTTGGTGCACCGCGACCTGGCAGCCAGGAACGTACTGGTGAA
AACACCGCAGCATGTCAAGATCACAGATTTTGGGCTGGCCAAACTGCTGGGTGCGGAAGAGAAAGAATACCATGCAGAAGGAGGCAAAGT
GCCTATCAAGTGGATGGCATTGGAATCAATTTTACACAGAATCTATACCCACCAGAGTGATGTCTGGAGCTACGGGGTGACTGTTTGGGA
GTTGATGACCTTTGGATCCAAGCCATATGACGGAATCCCTGCCAGCGAGATCTCCTCCATCCTGGAGAAAGGAGAACGCCTCCCTCAGCC
ACCCATATGTACCATCGATGTCTACATGATCATGGTCAAGTGCTGGATGATAGACGCAGATAGTCGCCCAAAGTTCCGTGAGTTGATCAT
CGAATTCTCCAAAATGGCCCGAGACCCCCAGCGCTACCTTGTCATTCAGGGGGATGAAAGAATGCATTTGCCAAGTCCTACAGACTCCAA
CTTCTACCGTGCCCTGATGGATGAAGAAGACATGGACGACGTGGTGGATGCCGACGAGTACCTCATCCCACAGCAGGGCTTCTTCAGCAG
CCCCTCCACGTCACGGACTCCCCTCCTGAGCTCTCTGAGTGCAACCAGCAACAATTCCACCGTGGCTTGCATTGATAGAAATGGGCTGCA
AAGCTGTCCCATCAAGGAAGACAGCTTCTTGCAGCGATACAGCTCAGACCCCACAGGCGCCTTGACTGAGGACAGCATAGACGACACCTT
CCTCCCAGTGCCTGGTGAGTGGCTTGTCTGGAAACAGTCCTGCTCCTCAACCTCCTCGACCCACTCAGCAGCAGCCAGTCTCCAGTGTCC
AAGCCAGGTGCTCCCTCCAGCATCTCCAGAGGGGGAAACAGTGGCAGATTTGCAGACACAGTGAAGGGCGTAAGGAGCAGATAAACACAT
GACCGAGCCTGCACAAGCTCTTTGTTGTGTCTGGTTGTTTGCTGTACCTCTGTTGTAAGAATGAATCTGCAAAATTTCTAGCTTATGAAG
CAAATCACGGACATACACATCTGTGTGTGTGAGTGTTCATGATGTGTGTACATCTGTGTATGTGTGTGTGTGTATGTGTGTGTTTGTGAC
AGATTTGATCCCTGTTCTCTCTGCTGGCTCTATCTTGACCTGTGAAACGTATATTTAACTAATTAAATATTAGTTAATATTAATAAATTT

>8454_8454_12_ATXN7L1-EGFR_ATXN7L1_chr7_105516257_ENST00000419735_EGFR_chr7_55209978_ENST00000455089_length(amino acids)=1145AA_BP=83
MTSERSRIPCLSAAAAEGTGKKQQEGRAMATLDRKVPSPEAFLGKPWSSWIDAAKLHCSDNVDLEEAGKEGGKSREVMRLNKEVCQGTSN
KLTQLGTFEDHFLSLQRMFNNCEVVLGNLEITYVQRNYDLSFLKTIQEVAGYVLIALNTVERIPLENLQIIRGNMYYENSYALAVLSNYD
ANKTGLKELPMRNLQGQKCDPSCPNGSCWGAGEENCQKLTKIICAQQCSGRCRGKSPSDCCHNQCAAGCTGPRESDCLVCRKFRDEATCK
DTCPPLMLYNPTTYQMDVNPEGKYSFGATCVKKCPRNYVVTDHGSCVRACGADSYEMEEDGVRKCKKCEGPCRKVCNGIGIGEFKDSLSI
NATNIKHFKNCTSISGDLHILPVAFRGDSFTHTPPLDPQELDILKTVKEITGFLLIQAWPENRTDLHAFENLEIIRGRTKQHGQFSLAVV
SLNITSLGLRSLKEISDGDVIISGNKNLCYANTINWKKLFGTSGQKTKIISNRGENSCKATGQVCHALCSPEGCWGPEPRDCVSCRNVSR
GRECVDKCNLLEGEPREFVENSECIQCHPECLPQAMNITCTGRGPDNCIQCAHYIDGPHCVKTCPAGVMGENNTLVWKYADAGHVCHLCH
PNCTYGCTGPGLEGCPTNGPKIPSIATGMVGALLLLLVVALGIGLFMRRRHIVRKRTLRRLLQERELVEPLTPSGEAPNQALLRILKETE
FKKIKVLGSGAFGTVYKGLWIPEGEKVKIPVAIKELREATSPKANKEILDEAYVMASVDNPHVCRLLGICLTSTVQLITQLMPFGCLLDY
VREHKDNIGSQYLLNWCVQIAKGMNYLEDRRLVHRDLAARNVLVKTPQHVKITDFGLAKLLGAEEKEYHAEGGKVPIKWMALESILHRIY
THQSDVWSYGVTVWELMTFGSKPYDGIPASEISSILEKGERLPQPPICTIDVYMIMVKCWMIDADSRPKFRELIIEFSKMARDPQRYLVI
QGDERMHLPSPTDSNFYRALMDEEDMDDVVDADEYLIPQQGFFSSPSTSRTPLLSSLSATSNNSTVACIDRNGLQSCPIKEDSFLQRYSS

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for ATXN7L1-EGFR


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for ATXN7L1-EGFR


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for ATXN7L1-EGFR


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource
TgeneC0007131Non-Small Cell Lung Carcinoma16CGI;CTD_human
TgeneC0024121Lung Neoplasms7CGI;CTD_human
TgeneC0242379Malignant neoplasm of lung7CGI;CTD_human
TgeneC0006142Malignant neoplasm of breast6CTD_human
TgeneC0678222Breast Carcinoma6CTD_human
TgeneC1257931Mammary Neoplasms, Human6CTD_human
TgeneC1458155Mammary Neoplasms6CTD_human
TgeneC4704874Mammary Carcinoma, Human6CTD_human
TgeneC0001418Adenocarcinoma5CTD_human
TgeneC0205641Adenocarcinoma, Basal Cell5CTD_human
TgeneC0205642Adenocarcinoma, Oxyphilic5CTD_human
TgeneC0205643Carcinoma, Cribriform5CTD_human
TgeneC0205644Carcinoma, Granular Cell5CTD_human
TgeneC0205645Adenocarcinoma, Tubular5CTD_human
TgeneC0334588Giant Cell Glioblastoma5CTD_human;ORPHANET
TgeneC0007137Squamous cell carcinoma3CTD_human
TgeneC0007873Uterine Cervical Neoplasm3CTD_human
TgeneC0014859Esophageal Neoplasms3CGI;CTD_human
TgeneC0017636Glioblastoma3CGI;CTD_human
TgeneC0018671Head and Neck Neoplasms3CGI;CTD_human
TgeneC0018675Head Neoplasms3CTD_human
TgeneC0024623Malignant neoplasm of stomach3CTD_human
TgeneC0027533Neck Neoplasms3CTD_human
TgeneC0027627Neoplasm Metastasis3CTD_human
TgeneC0033578Prostatic Neoplasms3CTD_human
TgeneC0038356Stomach Neoplasms3CTD_human
TgeneC0278996Malignant Head and Neck Neoplasm3CGI;CTD_human
TgeneC0376358Malignant neoplasm of prostate3CTD_human
TgeneC0546837Malignant neoplasm of esophagus3CGI;CTD_human
TgeneC0746787Cancer of Neck3CTD_human
TgeneC0751177Cancer of Head3CTD_human
TgeneC0887900Upper Aerodigestive Tract Neoplasms3CTD_human
TgeneC1621958Glioblastoma Multiforme3CTD_human
TgeneC1708349Hereditary Diffuse Gastric Cancer3CTD_human
TgeneC4048328cervical cancer3CTD_human
TgeneC0005684Malignant neoplasm of urinary bladder2CTD_human
TgeneC0005695Bladder Neoplasm2CTD_human
TgeneC0007102Malignant tumor of colon2CTD_human
TgeneC0009375Colonic Neoplasms2CTD_human
TgeneC0009402Colorectal Carcinoma2CTD_human
TgeneC0009404Colorectal Neoplasms2CTD_human
TgeneC0024668Mammary Neoplasms, Experimental2CTD_human
TgeneC0027626Neoplasm Invasiveness2CTD_human
TgeneC0027643Neoplasm Recurrence, Local2CTD_human
TgeneC0152013Adenocarcinoma of lung (disorder)2CGI;CTD_human
TgeneC0206726gliosarcoma2ORPHANET
TgeneC0919267ovarian neoplasm2CTD_human
TgeneC1140680Malignant neoplasm of ovary2CTD_human
TgeneC4015130INFLAMMATORY SKIN AND BOWEL DISEASE, NEONATAL, 22CTD_human;GENOMICS_ENGLAND;UNIPROT
TgeneC0001973Alcoholic Intoxication, Chronic1PSYGENET
TgeneC0003865Arthritis, Adjuvant-Induced1CTD_human
TgeneC0005396Bile Duct Neoplasms1CTD_human
TgeneC0007097Carcinoma1CTD_human
TgeneC0007113Rectal Carcinoma1CTD_human
TgeneC0007193Cardiomyopathy, Dilated1CTD_human
TgeneC0011603Dermatitis1CTD_human
TgeneC0011860Diabetes Mellitus, Non-Insulin-Dependent1CTD_human
TgeneC0014175Endometriosis1CTD_human
TgeneC0016978gallbladder neoplasm1CTD_human
TgeneC0021655Insulin Resistance1CTD_human
TgeneC0022660Kidney Failure, Acute1CTD_human
TgeneC0024667Animal Mammary Neoplasms1CTD_human
TgeneC0024809Marijuana Abuse1PSYGENET
TgeneC0025500Mesothelioma1CTD_human
TgeneC0027439Nasopharyngeal Neoplasms1CTD_human
TgeneC0029463Osteosarcoma1CTD_human
TgeneC0030297Pancreatic Neoplasm1CTD_human
TgeneC0030354Papilloma1CTD_human
TgeneC0032580Adenomatous Polyposis Coli1CTD_human
TgeneC0034885Rectal Neoplasms1CTD_human
TgeneC0041696Unipolar Depression1PSYGENET
TgeneC0085548Autosomal Recessive Polycystic Kidney Disease1CTD_human
TgeneC0085762Alcohol abuse1PSYGENET
TgeneC0149925Small cell carcinoma of lung1CTD_human
TgeneC0153452Malignant neoplasm of gallbladder1CTD_human
TgeneC0205696Anaplastic carcinoma1CTD_human
TgeneC0205697Carcinoma, Spindle-Cell1CTD_human
TgeneC0205698Undifferentiated carcinoma1CTD_human
TgeneC0205699Carcinomatosis1CTD_human
TgeneC0205874Papilloma, Squamous Cell1CTD_human
TgeneC0205875Papillomatosis1CTD_human
TgeneC0206686Adrenocortical carcinoma1CTD_human
TgeneC0206698Cholangiocarcinoma1CTD_human
TgeneC0235874Disease Exacerbation1CTD_human
TgeneC0238301Cancer of Nasopharynx1CTD_human
TgeneC0263454Chloracne1CTD_human
TgeneC0269102Endometrioma1CTD_human
TgeneC0279626Squamous cell carcinoma of esophagus1CGI;CTD_human
TgeneC0345905Intrahepatic Cholangiocarcinoma1CTD_human
TgeneC0345967Malignant mesothelioma1CTD_human
TgeneC0346647Malignant neoplasm of pancreas1CTD_human
TgeneC0376634Craniofacial Abnormalities1CTD_human
TgeneC0740277Bile duct carcinoma1CTD_human
TgeneC0920563Insulin Sensitivity1CTD_human
TgeneC0971858Arthritis, Collagen-Induced1CTD_human
TgeneC0993582Arthritis, Experimental1CTD_human
TgeneC1257925Mammary Carcinoma, Animal1CTD_human
TgeneC1269683Major Depressive Disorder1PSYGENET
TgeneC1449563Cardiomyopathy, Familial Idiopathic1CTD_human
TgeneC1565662Acute Kidney Insufficiency1CTD_human
TgeneC2239176Liver carcinoma1CTD_human
TgeneC2609414Acute kidney injury1CTD_human
TgeneC2713442Polyposis, Adenomatous Intestinal1CTD_human
TgeneC2713443Familial Intestinal Polyposis1CTD_human
TgeneC3805278Extrahepatic Cholangiocarcinoma1CTD_human
TgeneC4751120Neonatal inflammatory skin and bowel disease1ORPHANET