Fusion Gene Studies
in Kim Lab

Center for Computational Systems Medicine

Fusion gene:AURKA-SOGA1 (FusionGDB2 ID:HG6790TG140710)

Fusion Gene Summary for AURKA-SOGA1

Fusion gene summary

Fusion gene information
Fusion gene name: AURKA-SOGA1
Fusion gene ID: hg6790tg140710

Field | Hgene | Tgene
Gene symbol | AURKA | SOGA1
Gene ID | 6790 | 140710
Gene name | aurora kinase A | suppressor of glucose, autophagy associated 1
Synonyms | AIK, ARK1, AURA, BTAK, PPP1R47, STK15, STK6, STK7 | C20orf117, KIAA0889, SOGA
Cytomap | 20q13.2 | 20q11.23
Type of gene | protein-coding | protein-coding
Description | aurora kinase A; aurora 2; aurora/IPL1-like kinase; aurora/IPL1-related kinase 1; breast tumor-amplified kinase; protein phosphatase 1, regulatory subunit 47; serine/threonine protein kinase 15; serine/threonine-protein kinase 6; serine/threonine-protein kinase a | protein SOGA1; SOGA family member 1; suppressor of glucose by autophagy; suppressor of glucose from autophagy; suppressor of glucose, autophagy-associated protein 1
Modification date | 20200329 | 20200313
UniProtAcc | . | .
Fusion gene scores: * DoF score | 6 X 8 X 3 = 144 | 10 X 10 X 4 = 400
# samples | 8 | 12
** MAII score | log2(8/144*10) = -0.84799690655495; possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs), DoF>8 and MAII<0 | log2(12/400*10) = -1.73696559416621; possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs), DoF>8 and MAII<0

Ensembl transcripts involved in fusion gene: ENST00000312783, ENST00000347343, ENST00000371356, ENST00000395907, ENST00000395909, ENST00000395911, ENST00000395913, ENST00000395914, ENST00000395915

Context: PubMed: AURKA [Title/Abstract] AND SOGA1 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpoint: AURKA(54956489)-SOGA1(35445872), # samples: 2
Anticipated loss of major functional domain due to fusion event:
- AURKA-SOGA1 appears to have lost the major protein functional domain of the Hgene partner, which is a Cancer Gene Census (CGC) gene, because the partially deleted in-frame ORF does not retain it.
- AURKA-SOGA1 appears to have lost the major protein functional domain of the Hgene partner, which is an essential gene, because the partially deleted in-frame ORF does not retain it.
- AURKA-SOGA1 appears to have lost the major protein functional domain of the Hgene partner, which is an epigenetic factor, due to the frame-shifted ORF.
- AURKA-SOGA1 appears to have lost the major protein functional domain of the Hgene partner, which is an essential gene, due to the frame-shifted ORF.
- AURKA-SOGA1 appears to have lost the major protein functional domain of the Hgene partner, which is an IUPHAR drug target, due to the frame-shifted ORF.
- AURKA-SOGA1 appears to have lost the major protein functional domain of the Hgene partner, which is a kinase, due to the frame-shifted ORF.
- AURKA-SOGA1 appears to have lost the major protein functional domain of the Tgene partner, which is an essential gene, due to the frame-shifted ORF.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)
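
The two score definitions above can be checked with a short calculation. The sketch below is only an illustration of the stated formulas using the partner, breakpoint, cancer-type, and sample counts listed for AURKA and SOGA1 on this page; the function names are hypothetical and are not part of FusionGDB2.

```python
import math


def dof_score(n_partners: int, n_breakpoints: int, n_cancer_types: int) -> int:
    """Degree of Frequency = # partners X # break points X # cancer types."""
    return n_partners * n_breakpoints * n_cancer_types


def maii_score(n_samples: int, dof: int) -> float:
    """Major Active Isofusion Index = log2(# samples / DoF score * 10)."""
    return math.log2(n_samples / dof * 10)


def is_peginpcfg(dof: int, maii: float) -> bool:
    """Criterion shown on the page for peGinPCFGs: DoF > 8 and MAII < 0."""
    return dof > 8 and maii < 0


# Counts taken from the summary table on this page.
aurka_dof = dof_score(6, 8, 3)
aurka_maii = maii_score(8, aurka_dof)
soga1_dof = dof_score(10, 10, 4)
soga1_maii = maii_score(12, soga1_dof)

print("AURKA", aurka_dof, aurka_maii, is_peginpcfg(aurka_dof, aurka_maii))
print("SOGA1", soga1_dof, soga1_maii, is_peginpcfg(soga1_dof, soga1_maii))
```

Both genes satisfy DoF>8 and MAII<0, which matches the peGinPCFGs annotation in the summary.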

Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
Partner | Gene | GO ID | GO term | PubMed ID
Hgene | AURKA | GO:0006468 | protein phosphorylation | 21600873, 21820309
Hgene | AURKA | GO:0009611 | response to wounding | 19435814
Hgene | AURKA | GO:0032091 | negative regulation of protein binding | 21820309
Hgene | AURKA | GO:0097421 | liver regeneration | 19435814


Fusion gene breakpoints across AURKA (5'-gene)
[Figure: breakpoints on the AURKA gene structure, shown as a UCSC genome browser custom track]
Fusion gene breakpoints across SOGA1 (3'-gene)
[Figure: breakpoints on the SOGA1 gene structure, shown as a UCSC genome browser custom track]

Fusion gene information
* All genome coordinates were lifted over to hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
Source | Disease | Sample | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
ChimerDB4 | OV | TCGA-61-1728-01A | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
ChimerDB4 | OV | TCGA-61-1728 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -


Fusion Gene ORF analysis for AURKA-SOGA1

Open reading frame (ORF) analysis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORF | Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
5CDS-intron | ENST00000312783 | ENST00000357779 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000312783 | ENST00000357779 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000312783 | ENST00000456801 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000312783 | ENST00000456801 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000347343 | ENST00000357779 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000347343 | ENST00000357779 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000347343 | ENST00000456801 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000347343 | ENST00000456801 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000371356 | ENST00000357779 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000371356 | ENST00000357779 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000371356 | ENST00000456801 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000371356 | ENST00000456801 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000395907 | ENST00000357779 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000395907 | ENST00000357779 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000395907 | ENST00000456801 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000395907 | ENST00000456801 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000395909 | ENST00000357779 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000395909 | ENST00000357779 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000395909 | ENST00000456801 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000395909 | ENST00000456801 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000395911 | ENST00000357779 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000395911 | ENST00000357779 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000395911 | ENST00000456801 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000395911 | ENST00000456801 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000395913 | ENST00000357779 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000395913 | ENST00000357779 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000395913 | ENST00000456801 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000395913 | ENST00000456801 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000395914 | ENST00000357779 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000395914 | ENST00000357779 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000395914 | ENST00000456801 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000395914 | ENST00000456801 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000395915 | ENST00000357779 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000395915 | ENST00000357779 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000395915 | ENST00000456801 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
5CDS-intron | ENST00000395915 | ENST00000456801 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
Frame-shift | ENST00000347343 | ENST00000237536 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
Frame-shift | ENST00000347343 | ENST00000237536 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
Frame-shift | ENST00000347343 | ENST00000279034 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
Frame-shift | ENST00000347343 | ENST00000279034 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
Frame-shift | ENST00000395907 | ENST00000237536 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
Frame-shift | ENST00000395907 | ENST00000237536 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
Frame-shift | ENST00000395907 | ENST00000279034 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
Frame-shift | ENST00000395907 | ENST00000279034 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
Frame-shift | ENST00000395909 | ENST00000237536 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
Frame-shift | ENST00000395909 | ENST00000237536 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
Frame-shift | ENST00000395909 | ENST00000279034 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
Frame-shift | ENST00000395909 | ENST00000279034 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
Frame-shift | ENST00000395911 | ENST00000237536 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
Frame-shift | ENST00000395911 | ENST00000237536 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
Frame-shift | ENST00000395911 | ENST00000279034 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
Frame-shift | ENST00000395911 | ENST00000279034 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
Frame-shift | ENST00000395914 | ENST00000237536 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
Frame-shift | ENST00000395914 | ENST00000237536 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
Frame-shift | ENST00000395914 | ENST00000279034 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
Frame-shift | ENST00000395914 | ENST00000279034 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
Frame-shift | ENST00000395915 | ENST00000237536 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
Frame-shift | ENST00000395915 | ENST00000237536 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
Frame-shift | ENST00000395915 | ENST00000279034 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
Frame-shift | ENST00000395915 | ENST00000279034 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
In-frame | ENST00000312783 | ENST00000237536 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
In-frame | ENST00000312783 | ENST00000237536 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
In-frame | ENST00000312783 | ENST00000279034 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
In-frame | ENST00000312783 | ENST00000279034 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
In-frame | ENST00000371356 | ENST00000237536 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
In-frame | ENST00000371356 | ENST00000237536 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
In-frame | ENST00000371356 | ENST00000279034 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
In-frame | ENST00000371356 | ENST00000279034 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
In-frame | ENST00000395913 | ENST00000237536 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
In-frame | ENST00000395913 | ENST00000237536 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -
In-frame | ENST00000395913 | ENST00000279034 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | -
In-frame | ENST00000395913 | ENST00000279034 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | -

ORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
Henst | Hgene | Hchr | Hbp | Hstrand | Tenst | Tgene | Tchr | Tbp | Tstrand | Seq length (transcript) | BP loci (transcript) | Predicted start (transcript) | Predicted stop (transcript) | Seq length (amino acids)
ENST00000371356 | AURKA | chr20 | 54956489 | - | ENST00000237536 | SOGA1 | chr20 | 35445872 | - | 13795 | 837 | 18 | 4751 | 1577
ENST00000371356 | AURKA | chr20 | 54956489 | - | ENST00000279034 | SOGA1 | chr20 | 35445872 | - | 3856 | 837 | 18 | 3530 | 1170
ENST00000312783 | AURKA | chr20 | 54956489 | - | ENST00000237536 | SOGA1 | chr20 | 35445872 | - | 13906 | 948 | 243 | 4862 | 1539
ENST00000312783 | AURKA | chr20 | 54956489 | - | ENST00000279034 | SOGA1 | chr20 | 35445872 | - | 3967 | 948 | 243 | 3641 | 1132
ENST00000395913 | AURKA | chr20 | 54956489 | - | ENST00000237536 | SOGA1 | chr20 | 35445872 | - | 13852 | 894 | 159 | 4808 | 1549
ENST00000395913 | AURKA | chr20 | 54956489 | - | ENST00000279034 | SOGA1 | chr20 | 35445872 | - | 3913 | 894 | 159 | 3587 | 1142
ENST00000371356 | AURKA | chr20 | 54956488 | - | ENST00000237536 | SOGA1 | chr20 | 35445872 | - | 13795 | 837 | 18 | 4751 | 1577
ENST00000371356 | AURKA | chr20 | 54956488 | - | ENST00000279034 | SOGA1 | chr20 | 35445872 | - | 3856 | 837 | 18 | 3530 | 1170
ENST00000312783 | AURKA | chr20 | 54956488 | - | ENST00000237536 | SOGA1 | chr20 | 35445872 | - | 13906 | 948 | 243 | 4862 | 1539
ENST00000312783 | AURKA | chr20 | 54956488 | - | ENST00000279034 | SOGA1 | chr20 | 35445872 | - | 3967 | 948 | 243 | 3641 | 1132
ENST00000395913 | AURKA | chr20 | 54956488 | - | ENST00000237536 | SOGA1 | chr20 | 35445872 | - | 13852 | 894 | 159 | 4808 | 1549
ENST00000395913 | AURKA | chr20 | 54956488 | - | ENST00000279034 | SOGA1 | chr20 | 35445872 | - | 3913 | 894 | 159 | 3587 | 1142
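
The ORFfinder columns can be cross-checked against each other. The sketch below assumes that the predicted start and stop are inclusive coordinates in the same coordinate system and that the stop coordinate covers the stop codon; the page does not state this, but the assumption is consistent with all six distinct rows of the table. The helper names are hypothetical.

```python
def orf_amino_acids(start: int, stop: int) -> int:
    """Amino acid count of an ORF spanning start..stop (inclusive),
    assuming the span ends with a stop codon that is not translated."""
    span = stop - start + 1
    if span % 3 != 0:
        raise ValueError("ORF span is not a multiple of three")
    return span // 3 - 1


def spans_breakpoint(start: int, stop: int, bp_locus: int) -> bool:
    """True if the transcript breakpoint lies inside the predicted ORF."""
    return start < bp_locus < stop


# (BP loci, predicted start, predicted stop, reported amino acid length)
orf_rows = [
    (837, 18, 4751, 1577),
    (837, 18, 3530, 1170),
    (948, 243, 4862, 1539),
    (948, 243, 3641, 1132),
    (894, 159, 4808, 1549),
    (894, 159, 3587, 1142),
]

for bp, start, stop, reported in orf_rows:
    computed = orf_amino_acids(start, stop)
    print(start, stop, computed, computed == reported, spans_breakpoint(start, stop, bp))
```

Under these assumptions every computed length agrees with the reported one, and each predicted ORF extends across the breakpoint into the SOGA1 part of the transcript.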

DeepORF prediction of coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on a convolutional neural network, built by comparison with real Ribo-seq data. If the no-coding score is < 0.5 and the coding score is > 0.5, the in-frame fusion transcript is predicted as likely to be translated.
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | No-coding score | Coding score
ENST00000371356 | ENST00000237536 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - | 0.000988644 | 0.9990113
ENST00000371356 | ENST00000279034 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - | 0.008388676 | 0.9916113
ENST00000312783 | ENST00000237536 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - | 0.00114484 | 0.9988551
ENST00000312783 | ENST00000279034 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - | 0.009892589 | 0.9901074
ENST00000395913 | ENST00000237536 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - | 0.001056832 | 0.99894315
ENST00000395913 | ENST00000279034 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - | 0.009010821 | 0.99098915
ENST00000371356 | ENST00000237536 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - | 0.000988644 | 0.9990113
ENST00000371356 | ENST00000279034 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - | 0.008388676 | 0.9916113
ENST00000312783 | ENST00000237536 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - | 0.00114484 | 0.9988551
ENST00000312783 | ENST00000279034 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - | 0.009892589 | 0.9901074
ENST00000395913 | ENST00000237536 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - | 0.001056832 | 0.99894315
ENST00000395913 | ENST00000279034 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - | 0.009010821 | 0.99098915
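
The decision rule stated for DeepORF can be applied to the scores in this table as follows. This is a minimal sketch of the thresholding step only, not of the DeepORF classifier itself; the function name and the dictionary layout are hypothetical.

```python
def likely_translated(no_coding: float, coding: float, threshold: float = 0.5) -> bool:
    """Rule from the page: no-coding score < 0.5 and coding score > 0.5."""
    return no_coding < threshold and coding > threshold


# (Henst, Tenst) -> (no-coding score, coding score); the scores are the same
# for both reported AURKA breakpoints (54956489 and 54956488).
deeporf_scores = {
    ("ENST00000371356", "ENST00000237536"): (0.000988644, 0.9990113),
    ("ENST00000371356", "ENST00000279034"): (0.008388676, 0.9916113),
    ("ENST00000312783", "ENST00000237536"): (0.00114484, 0.9988551),
    ("ENST00000312783", "ENST00000279034"): (0.009892589, 0.9901074),
    ("ENST00000395913", "ENST00000237536"): (0.001056832, 0.99894315),
    ("ENST00000395913", "ENST00000279034"): (0.009010821, 0.99098915),
}

calls = {pair: likely_translated(*scores) for pair, scores in deeporf_scores.items()}
for (henst, tenst), call in calls.items():
    print(henst, tenst, "likely translated" if call else "not predicted as translated")
```

By this rule, all six in-frame transcript pairs listed here are predicted as likely to be translated.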


Fusion Genomic Features for AURKA-SOGA1


FusionAI prediction of potential fusion gene breakpoints based on the pre-mRNA sequence context (+/- 5 kb for each partner gene, 20 kb of sequence in total). FusionAI is a fusion gene breakpoint classifier based on a convolutional neural network that compares the fusion-positive and fusion-negative sequence contexts of ~20K fusion genes. It gives the relative potential of each position in the 20 kb genomic sequence to be used as a gene fusion breakpoint.
Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | 1-p | p (fusion gene breakpoint)

Distribution of 44 human genomic feature loci across the 20 kb fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci across five broad categories: virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are on the help page.
[Figure: genomic feature distribution]


Fusion Protein Features for AURKA-SOGA1


Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr20:54956489/chr20:35445872)
- FGviewer provides online visualization of the retention of protein functional features across the DNA, RNA, protein, and pathological levels.
- How to search
1. Enter your fusion gene symbol.
2. Press the Tab key until the breakpoint information is filled in.
3. Scroll down and press the 'Search' tab twice.
4. Scroll down to find the hyperlink to the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

Main function of each fusion partner protein. (from UniProt)
Hgene | Tgene
. | .
FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}. | FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.

Retention analysis results for each fusion partner protein across 39 UniProt protein features: six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and three secondary structure features. Because of limited viewing space, only the retention information for the 13 regional features is shown here. All retention annotation results can be downloaded from the download page.

* A negative BPloci value means that the breakpoint is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000312783 | - | 7 | 10 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000347343 | - | 6 | 9 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000371356 | - | 7 | 10 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395909 | - | 8 | 11 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395911 | - | 7 | 10 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395913 | - | 6 | 9 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395914 | - | 7 | 10 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395915 | - | 6 | 9 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000312783 | - | 7 | 10 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000347343 | - | 6 | 9 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000371356 | - | 7 | 10 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395909 | - | 8 | 11 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395911 | - | 7 | 10 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395913 | - | 6 | 9 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395914 | - | 7 | 10 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395915 | - | 6 | 9 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP

- In-frame and not-retained protein feature among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000312783 | - | 7 | 10 | 133_383 | 235 | 404.0 | Domain | Protein kinase
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000347343 | - | 6 | 9 | 133_383 | 235 | 404.0 | Domain | Protein kinase
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000371356 | - | 7 | 10 | 133_383 | 235 | 404.0 | Domain | Protein kinase
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395909 | - | 8 | 11 | 133_383 | 235 | 404.0 | Domain | Protein kinase
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395911 | - | 7 | 10 | 133_383 | 235 | 404.0 | Domain | Protein kinase
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395913 | - | 6 | 9 | 133_383 | 235 | 404.0 | Domain | Protein kinase
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395914 | - | 7 | 10 | 133_383 | 235 | 404.0 | Domain | Protein kinase
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395915 | - | 6 | 9 | 133_383 | 235 | 404.0 | Domain | Protein kinase
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000312783 | - | 7 | 10 | 133_383 | 235 | 404.0 | Domain | Protein kinase
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000347343 | - | 6 | 9 | 133_383 | 235 | 404.0 | Domain | Protein kinase
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000371356 | - | 7 | 10 | 133_383 | 235 | 404.0 | Domain | Protein kinase
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395909 | - | 8 | 11 | 133_383 | 235 | 404.0 | Domain | Protein kinase
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395911 | - | 7 | 10 | 133_383 | 235 | 404.0 | Domain | Protein kinase
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395913 | - | 6 | 9 | 133_383 | 235 | 404.0 | Domain | Protein kinase
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395914 | - | 7 | 10 | 133_383 | 235 | 404.0 | Domain | Protein kinase
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395915 | - | 6 | 9 | 133_383 | 235 | 404.0 | Domain | Protein kinase
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000312783 | - | 7 | 10 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000347343 | - | 6 | 9 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000371356 | - | 7 | 10 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395909 | - | 8 | 11 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395911 | - | 7 | 10 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395913 | - | 6 | 9 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395914 | - | 7 | 10 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395915 | - | 6 | 9 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000312783 | - | 7 | 10 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000347343 | - | 6 | 9 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000371356 | - | 7 | 10 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395909 | - | 8 | 11 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395911 | - | 7 | 10 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395913 | - | 6 | 9 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395914 | - | 7 | 10 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395915 | - | 6 | 9 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000312783 | - | 7 | 10 | 280_293 | 235 | 404.0 | Region | Activation segment
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000347343 | - | 6 | 9 | 280_293 | 235 | 404.0 | Region | Activation segment
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000371356 | - | 7 | 10 | 280_293 | 235 | 404.0 | Region | Activation segment
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395909 | - | 8 | 11 | 280_293 | 235 | 404.0 | Region | Activation segment
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395911 | - | 7 | 10 | 280_293 | 235 | 404.0 | Region | Activation segment
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395913 | - | 6 | 9 | 280_293 | 235 | 404.0 | Region | Activation segment
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395914 | - | 7 | 10 | 280_293 | 235 | 404.0 | Region | Activation segment
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395915 | - | 6 | 9 | 280_293 | 235 | 404.0 | Region | Activation segment
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000312783 | - | 7 | 10 | 280_293 | 235 | 404.0 | Region | Activation segment
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000347343 | - | 6 | 9 | 280_293 | 235 | 404.0 | Region | Activation segment
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000371356 | - | 7 | 10 | 280_293 | 235 | 404.0 | Region | Activation segment
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395909 | - | 8 | 11 | 280_293 | 235 | 404.0 | Region | Activation segment
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395911 | - | 7 | 10 | 280_293 | 235 | 404.0 | Region | Activation segment
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395913 | - | 6 | 9 | 280_293 | 235 | 404.0 | Region | Activation segment
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395914 | - | 7 | 10 | 280_293 | 235 | 404.0 | Region | Activation segment
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395915 | - | 6 | 9 | 280_293 | 235 | 404.0 | Region | Activation segment
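
The split between retained and not-retained features can be reproduced from the feature loci and the BPloci value. The sketch below assumes that, for the 5' partner (Hgene), a feature counts as retained only when it lies entirely at or before the breakpoint locus in the protein; the exact boundary handling used by FusionGDB2 is not stated on this page, and the function name is hypothetical.

```python
def hgene_feature_retained(feature_loci: str, bp_locus: int) -> bool:
    """Return True if a feature such as '211_213' ends at or before the
    breakpoint locus, i.e. lies fully within the retained 5' portion.
    Assumption: a feature that spans the breakpoint counts as not retained."""
    start_text, end_text = feature_loci.split("_")
    start, end = int(start_text), int(end_text)
    if start > end:
        raise ValueError("feature start must not exceed feature end")
    return end <= bp_locus


# AURKA features listed above, all with BPloci = 235.
aurka_features = [
    ("211_213", "Nucleotide binding", "ATP"),
    ("133_383", "Domain", "Protein kinase"),
    ("260_261", "Nucleotide binding", "ATP"),
    ("280_293", "Region", "Activation segment"),
]

retention = {
    (loci, note): hgene_feature_retained(loci, 235)
    for loci, _feature, note in aurka_features
}
for (loci, note), kept in retention.items():
    print(loci, note, "retained" if kept else "not retained")
```

With BPloci = 235, only the ATP-binding site at 211_213 is kept, while the protein kinase domain (133_383), which spans the breakpoint, is classified as not retained, in agreement with the two tables.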



Fusion Gene Sequence for AURKA-SOGA1


For in-frame fusion transcripts, we provide the fusion transcript sequences and the fusion amino acid sequences. To obtain the fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among all predicted ones.
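
As an illustration of the "longest ORF" selection described above, the following sketch scans the three forward reading frames of a transcript for ATG-to-stop ORFs and keeps the longest one. It is a simplified stand-in and not ORFfinder itself: it ignores the reverse strand, alternative start codons, and ORFs without a stop codon, and the function name is hypothetical.

```python
from typing import Tuple

STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_forward_orf(seq: str) -> Tuple[int, int]:
    """Return (start, stop) of the longest ATG..stop ORF on the forward strand,
    as 1-based inclusive coordinates with the stop codon included.
    Returns (0, 0) if no complete ORF is found."""
    seq = seq.upper()
    best = (0, 0)
    best_len = 0
    for frame in range(3):
        orf_start = None  # 0-based index of the first ATG of the open ORF
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if orf_start is None:
                if codon == "ATG":
                    orf_start = i
            elif codon in STOP_CODONS:
                length = i + 3 - orf_start
                if length > best_len:
                    best_len = length
                    best = (orf_start + 1, i + 3)
                orf_start = None
    return best


# Toy sequence: a single ORF ATG AAA TTT TAG starting at position 3.
toy = "CCATGAAATTTTAGGG"
start, stop = longest_forward_orf(toy)
print(start, stop, toy[start - 1:stop])
```

On the toy sequence this reports the ORF at positions 3 to 14 (ATGAAATTTTAG); the same scan could be run on the fusion transcript sequences listed here, keeping in mind that ORFfinder's own options and coordinate conventions may differ.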
>8490_8490_1_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000312783_SOGA1_chr20_35445872_ENST00000237536_length(transcript)=13906nt_BP=948nt
ACAAGGCAGCCTCGCTCGAGCGCAGGCCAATCGGCTTTCTAGCTAGAGGGTTTAACTCCTATTTAAAAAGAAGAACCTTTGAATTCTAAC
GGCTGAGCTCTTGGAAGACTTGGGTCCTTGGGTCGCAGGGTCTCACTCCATTGCCCAGGCCAGAGTGCGGGGATATTTGATAAGAAACTT
CAGTGAAGGCCGGGCGCGGTGGCTCATGCCCGTAATCCCAGCATTTTCGGAGGCCGAGGCATCATGGACCGATCTAAAGAAAACTGCATT
TCAGGACCTGTTAAGGCTACAGCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTGACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTA
AATAGTGGCCAGGCTCAGCGGGTCTTGTGTCCTTCAAATTCTTCCCAGCGCATTCCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAG
CCGGTTCAGAATCAGAAGCAGAAGCAATTGCAGGCAACCAGTGTACCTCATCCTGTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAG
CAGCCCCTGCCATCGGCACCTGAAAATAATCCTGAGGAGGAACTGGCATCAAAACAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCT
TTGGAAGACTTTGAAATTGGTCGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTTTATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTG
GCTCTTAAAGTGTTATTTAAAGCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAGCTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGG
CATCCTAATATTCTTAGACTGTATGGTTATTTCCATGATGCTACCAGAGTCTACCTAATTCTGGAATATGCACCACTTGGAACAGTTTAT
AGAGAACTTCAGAAACTTTCAAAGTTTGATGAGCAGAGAACTGCTACTCATTCCTTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCC
GATAAGAAGACTTTGGTGCAGGAGGACAGTGCAGACCTGAAGTGCCAGTTGCACTTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAG
CTCACTAAGCTTGCCAAGGAGAATGACAGCATGAAGGAGGAGCTGCTGAAGTACCGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCA
GCCGAGGAGCTGGCCGATGCCCCCCACTCGCGGGAGACCGAGCTGAAGGTGCACCTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGC
CGCCGCATCGTGGAGCTGGAGGTGGAGAACCGAGGCCTGCGGGCTGAGATGGACGACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCT
GAGGCACGCCTGGCCTTCTCCGCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTGGCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAG
GAGGCCGAGCTGCTGCGGCGCTCCTCTGCCGAGCTCGAGGACCAGAACAAGCTGCTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCAC
GAGCTGGACGTGGCGCTGTCGGAGGACAGTTGTTCTGTGCTCAGCGAACCTTCACAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGC
GAGCTCAGCGGCAAGGTCAAGAAGCTGCAGTACGAGAACCGCGTGCTCCTCTCCAACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGT
ACGCGGCCCATGCTGGAGACGGACGCCGAGGCCGGGGACTCTGCCCAGTGTGTGCCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCG
GTCCGACTCTGCAGAGCCAGGGAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAGGCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTG
GCTGATGCCAATGGCTTCACGGCTGGCCTCCGGCTGTGTCTGGACAACGAGTGTGCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGC
GAGGGCCCCAGGGACACCAAGCTCATCCATGCCATCCTGGTGCGCCTGAGCGTGCTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCA
GATGCAGTCCTCGGGTGCTCTGTCAAGGAACAGCAGGAGTCCTTCTCATCACTGCCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATT
CTTCTGGCAAAAGACCTTGGCTCAGACTTTCAGCCACCTGACTTCAGGGACCTGCCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGC
ACTGGTGACTTGGACTCTAAGCCCGACCCCAGCCGGAGCTTCAGGCCTTACCGAGCTGAAGACAATGATTCCTATGCCTCTGAGATCAAG
GAGCTGCAGCTGGTGCTGGCTGAGGCCCACGACAGCCTCCGGGGCTTGCAAGAGCAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAG
GCCGACAATTTCAACCAGAAAATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCGCTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGC
CTCCAGCGGAGGCTGGAGCAGAAATTCTGGAGCCAGGAGAAGAACATGCTGGTGCAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTG
CTCTTCATGAAGCTCAGGTGGTTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTGCCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGC
ATGAAGGAGCTGTACTTGCTGATGGAGGAAGAGGAGATAAACGCTCAGCATTCTGATAACAAGGCCTGCACGGGGGACAGCTGGACCCAG
AACACGCCCAATGAGTACATCAAGACACTGGCCGACATGAAGGTGACGCTGAAGGAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGT
CTGACGGAGCTTCAGCAACAGTTTGCCAAGGCCAAGGCTACCTGGGAGACAGAGCGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAG
CTGAAGACAGGGAAGGGGGCCGGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCCCTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTA
GCTGAGTCCTACAGCGCTGTCATGGAGCTGACTCGGCAGCTGCAGATCAGTGAGCGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAG
CGGCTGCAGGGTGAGAAGCAGCAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAACCGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGG
GTCCTGAAGCACTCGGAGCTGGAGAAGCAGGACAACAGCTGGAAGGAGACACGCAGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAA
GTTGAGCTTGGAGGAAATGGTTTAAAGAGAACCAAATCTGTTTCTTCCATGTCTGAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTT
GCTGGCGGAGATGCCCGGGGCAAGAAGCTGCCTAACAACCCTGCCTTTGGCTTTGTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACC
AAGGAGAAGCCTGGGCTCTCGTCGAGGGACTGCAACCACCTGGGTGCCCTGGCCTGCCAGGACCCCCCAGGGAGGCAGATGCAGCGCAGC
TACACGGCTCCTGACAAGACGGGCATCCGAGTCTACTATAGTCCCCCGGTGGCCCGGCGCCTCGGAGTCCCTGTGGTTCATGACAAAGAG
GGCAAGATCATTATCGAGCCCGGCTTCCTCTTCACCACAGCCAAGCCCAAAGAGTCGGCCGAGGCTGATGGGCTGGCTGAGAGCTCCTAT
GGTCGGTGGCTCTGCAACTTCTCACGGCAGCGCCTGGACGGAGGCTCAGCGGGCAGCCCCTCGGCGGCCGGGCCTGGCTTCCCAGCGGCC
CTGCATGACTTTGAGATGTCAGGCAACATGAGTGATGACATGAAGGAGATCACCAACTGTGTGCGCCAGGCCATGCGCTCCGGCTCACTG
GAGAGGAAAGTGAAGAGCACATCCAGCCAGACGGTGGGCCTGGCCAGTGTGGGCACACAGACCATCCGCACGGTCAGCGTGGGCCTGCAG
ACCGACCCACCCCGCAGCAGCCTCCATGGCAAGGCCTGGTCACCCCGCAGCTCTTCGCTCGTGTCTGTGCGCAGCAAGCAGATCTCCTCC
TCCCTGGACAAGGTCCATTCGCGCATCGAGCGGCCCTGCTGCTCCCCCAAGTATGGCTCACCAAAGCTCCAGAGGCGGTCTGTGTCCAAG
CTGGACAGCAGCAAGGACCGCAGCCTGTGGAACCTGCACCAGGGCAAGCAGAACGGCTCGGCCTGGGCCCGCTCCACCACCACGCGGGAC
AGCCCTGTATTGAGAAACATCAACGATGGACTCTCCAGCCTCTTCAGTGTGGTGGAGCACTCAGGGAGCACGGAGTCTGTCTGGAAACTA
GGCATGTCTGAGACGCGGGCCAAGCCCGAGCCTCCCAAGTACGGCATTGTGCAGGAATTCTTCCGTAATGTGTGTGGCCGGGCACCGAGC
CCCACCTCATCAGCAGGAGAGGAGGGCACCAAGAAGCCAGAGCCCCTCTCCCCAGCCAGCTACCATCAGCCAGAGGGTGTGGCCAGGATC
CTGAACAAGAAGGCAGCCAAGTTGGGCAGCAGTGAGGAGGTCAGACTCACCATGCTCCCCCAGGTGGGGAAGGATGGTGTCCTCCGGGAC
GGAGATGGAGCCGTGGTCCTTCCCAATGAGGACGCTGTTTGTGACTGTAGTACCCAGTCTCTCACCTCCTGCTTCGCCCGATCGTCCCGC
TCTGCCATCCGCCACTCTCCTTCCAAGTGCAGGCTGCACCCTTCAGAGTCCAGCTGGGGTGGGGAGGAGAGGGCACTCCCCCCCAGCGAG
TGACAGAGCAGCCAAGCTCCCCGCCTCAACCAGCCCAGCCCCTGGATAGCAGAAGGGAACCAGCAGAGACGAGACGAGGTGAGGCGAGGG
GCTGTGTCCTCAGCATTGCCTGGCCCTGGAGGGACAGCAGTGATGCCACTGCCAGAATGCAGCTTTCACATCAAGGTAAAGCCGGGTCTC
CTGCTGGCCCCTGGGTGGTGAGCTTCGACTTCCCAGGGGAAGGCAGTGAGTGGGAGAGAGACCAAACCTGGGCTTCCCAAGCATCCACTG
AGAGATCTGTCAAGAGCCGATCCCTGGGTCCTAAGAGAGAGCCTTGCCTGGTTCTGCCCATGCCACCCTCTTGGAAGAGCCCAAGAAGGA
TACATGTCTGGCCATGCCTTTGGGGAAAAGGAGTCGGAGAGATGTTTCCTGCTGACCATCCACCCCTTCATTTGGGAGGAGACACTGCTG
AGAAGAACAGGCTTTGCTCTAGGGCTCCATGTTTGGTTCCTGGTGGAGCCCTGTTGGGCATCATCACCATCACCTCCTTCTCTCCACCAC
CTCCTCCTCCCAGCCCCACTGCTCTAGATTGCTGGGACACTAGGGAGTATGATAGGGCAGTAGCCAGGGCCATTGCTTAGTGTCCTGGAG
CCCTGGATCTCCCTGCCCATAGCCTGGATGCAGCAAGAGCTGGGAGGCGAAGTGGAAACATGCAGGGCTCAGGGTTGGGGAGTGATTGCA
ATTGCCTTCCTTGCCAAAGTGACTTGGGGCCCCAACGTTCCCAGCAGACCCCTTGAGGACAGAAATAGGTAGAGTCAGTCTCAAGACCTG
GTGCATAGATAAATGCCTAAATACACTGCCTTGATCTCAAGTGATCTCAGAGGCCTCTTTCCCTGGCACCCTGAGAGGCAGCAGGCACTA
CATCTCCACTGTGTTTACATCCTGCAGCTGGTCGGGGGGCAAAGATATTCCCAGTAAGAGATTCTTGGTTGGCCAGGTCAGGCCCAGGAG
AACACCAAGAGGCCAGAGCCCAGGACACAGCAGTAGACTGGGGCCTGGAAACACGTATCTTGCCTAGATTGTTTATTTGAATTTTTCCTA
CTATAAATATTTAAGGTGGTTTACTTTATTTTAATAATTTAATTTACCCCAAAGTCCCTAAGGTAATTTATTGGAGGTTGAAACATGCAT
TCTTGCCACTGGGACAACATGAGGCCTCTAACAGCACGGGCAGGCATGGGGTCCCCTGGGTGGACGAGGCCGCTTGGCAGCCAGGTTTGG
AGACCTGGCCTCCTGGTCAGCTTTGGAGGGCCCCTCAACAGAGCTGGAGCCCTGCACCCCAACACGGCTGGCCATGTGGCCTCAGAACAC
TACTTATTACTCAATGCCTGGTACTTGGCCAGCCCCAGCGGTCAGTCTATAAATACTCACTGACAAGGTGGAGGGCTGGACGGCCATCAC
CACTCCCCAGACGTTCTCCATTGCCTGTCTCATTTCCCTCCCTCCCTCTGACACCTTTCTTCATGAGTCGAACGTGGATTACTAAAGCTC
TATTAAGAGTGTGGAGATCCCTCCAAGTTTCCCAAATGAGAACTCACAGGAAAACAGGACTGAACTTTGAGAATGTTGTTTATCGCAGCT
TTGCACATAAACCTGAGTGTCTCCCAGCCTGCCTCGGTTCTCACCAGCCTGCCAGCCTTTTCACCAGCCTCTCTCCTTAGCCTTATGGCC
TTTCACGGCTCTTCTCCCTGCCCCAGCTCTGCTGCCCGCCCTTCCTCACGTCCCCTGTGAGCTGCCTGAGCCATTGGTTGGATTTCGATG
TGGCTCATTGCAGCATGTGGGGCAGCGCCTCCCATGGCCTCGCCTTGGTGCCGGTGAACCCCTTTTGGTTGCACACATGCTCCCCACACA
CACATAGACATCAGCCTTCCTGTAACTGACTGGGGACCCAGAGTGGAAACACCAGGATGGATCAGCTTGTCTGCAGAATTGCCCATCAGG
AAGACCAAAAGCCAGTAGCTTTGCTGATCTGCCCCCAGGACTCTGGAGGCGCCCCTGCACTCCCACCTCCCACCTGCCAGTTCCCAGACC
CACCCATTCGGGATCACCTGGACCAGTTACCCCAAGTCCTGCATCTCCCTTCCCTGCAGGCTGAACACCAGGGTCATGCCAGTCCCGCCA
GCCGCCTCCTCCATGCCCCAGTGACTGGTGTGGGCAGAGCAGGCAGCCAGTGGAGCTGTGGGCCAGTTCCGCTCTTGGATGCTGCTGCTC
TCACCCATGAGGTCAGGGGGGCCCTCCAAGGTTATCTCCAGGTGAGGGGATTCACATCAGGCCACAAGCCACCAGAGGCCTTCTGCCACC
TCCCAGAGCGACAGCCAGGGAGGCTGCGTACTCAGCCTCGGGGAGAAATCCCCGTGGGACCTGAGCCCCAAGACCTACGGACCACTCAGC
CTTACCATCGTACCGTCCAGGATTGTCCTTGCCATCTTTGTTGTCTCAGCCAGACCTTGGTTTTCAGTAAAGCCCCAGTTTCTACTTCCT
GCATGCCACTGTGCAAGGCCACTCATCACTGTTCCTGCAGAAGCCTCTGGACGTGGGGCTGGATGGGGTTGAAAATGTTACATGTAAATA
TTGGTTTGGTTCGGTTTTTAGCATTTTACTTGGTAACTGGTTGTTTTCTTTTTTGGGGTGGGGGGATTGGTTTGTAAAAATTCTCTACTC
TTTTGGAATGTGATTTCTAAGTTTGTTGGTTTCTTCAAATGCCTTTTAAGTCTTGGTAACATTCCCAAAGCAGAAAACTGCCTGACCCAC
AGTGGGGATTCCCTGGAGAATTGGGGTCCCAAGAAGGAATGCTGCCCTTCTCGAACCCGTTCTCCCCCTTCCTCCTGCCTCTCTGCCTTT
TACTGCTATTCCCTTCTTCTCCTCCTTTATCCTTCTTTCTGTTTTCCCCATCTCCACTCTCTCTTCAACCAAAGTCCCAAGGAACCCTCG
GGGCTCAATCCCCCATAGACCACTTGGCTTGGGTCCATGGGGTTGGCATCAGTTGGTTGGCGGAAATGGGGGACCAGTTGGCATGATGGC
CCTAAACTGGGAAACCTCATGTTTCTTATGTCTCACCTCTTTCCAGAGCCAAATCAGCCCCTTTTGGAATGATGACTTCATTGGAATGCA
AATCAAGTCATTTTGGTGCATCAGTGGCTCTTAGGCCTGCACACACGAGACATCAGAATCCAATCCTCTGACCCTGTGCCAGCCCTTTCC
CCCAGTTTATTTCCCACCAAAGGCTGACCTCTAAGAGGTCTTGCTTTCTATGAACTCAAGATGGGTCCCACCTCTAGGTGTCCCCAGGTG
CACTCTTCTACCGGTTGGCTTCCGATGTGACAAGGCCAAGGGCCCAAAGACTTGACCCTCTTACACCCTTGCTGACATGGTTCCATCATG
TCCACCCGCATGCACTTTTATGGTTTCATCACCCAGCCTCTTCTCCTCTGGCCCACCCAGCGTCCAGGCTCTTTCTCCCTCTCCCCTCCT
ATCTAGAATGTCCCCTGCTTCTAGCCTCACCAGACCCCCCAAGCTCCCACTACTTCTTCCATAATAATAGTAATAACAATGGTTATCATC
ATCCCCTGCACATCCCGCCTAAAGCACTTTACTATATAGAAAACGTTTCCCCTGGCCGGGCATGGTGGCTCACGCCTGAAATCCCAGCTC
TTTGGGAGGCTGAGGCGAGCGGATCAGTTGAGGTCAGGAGTTCAACGCCAGCCTGGCCAACGTGGTGAATCCCTGTCTGTACTAAAAGTA
CAAAAAATTAGCTGAGCATGGTGGTGCGTGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGTGGGAGAATCGCTTGAGCCCAGGAGGCGG
AGGTTGCAGGAGCAGAGATTGCGCCACTGCACTCCAGCCTGGATGACAGAGTGAGACCCAATCTCAAAAAAGAAATCGTTTCCCACCCCA
CATCTCCTTCAGACCTCTCAGGGACAACTCTGGGAGGCAGCCTTGGCAGGACATGGGTTAGTGCGCCCATTTTGCTGTGAGGAAACTGAG
GTACAGGTCTCATCCCAGAGCATGAGAAGTCACTGAGTTTAGATGAGAACTTGGGTCCAACTCTGTCCTGTTTGCTGTGCAAATCCGCTG
CCCTGCTGGGGGCTTTTGGTGGGTCCAGAATACCCAGAATATGCTGCTGGCCAACCCAGGCATAAAACAAGTCCATTCTAGATCACTGAG
CCTTGTGTATTCCAGAGGGTGATCTGAGGTCCCCATTCAGCAGAATTCTCTGAGGGCATGTTCAGAATGTAGATTCCTGGGCCCCACCTT
GAATTTGCATGTTTAACAAACTCTCCTGGGGTTGAGGGGTGGGTGCAGTGGTCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGG
GTGGATCACTTGAGCTCAGGAGTTCGAGCCTGGCCAATATGGTGAAACCCTGTCTCTACTAAAAATGCAAAAATTAGCCAGGTGTGGTGG
CACATGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGTGGGGAGCAGCGGTTGCAGTGAGCCGAGATT
GTGCCATTGCGCTCTAGCCTGGGCAACGGAATGAGACTTGTCTCAAAAAAAATAAAAAATAAAACCAGCCCTCCCCGGGGGATCTTAGGC
ACTATTGGCCACACCATTGGTGTTCGTGGCCCTGATTATTAGGCTATCTTTCTTTTTTTAAGTTTTTTTAGATTTATTTTTTATTTTATT
TATTTATTTATTTATTATTTATTATTATTATTTTTAGAGACAGGGGGTCTCCCTATGTTGCCCAGGCTGGTTTCAAACTCCTGGGCTCAA
GTGATCTGCCCTCCTCAGCCTCCCAAAGTGCTGGGGTTACAGGCAGGCATCAGCCACCGTGCCAGGTCATCTTCCTTTTTCTTTTTTTTT
TGGAGACAGAGTCTTGCTCTGTCATTCAGGCTGGAGTACAGTGGCGCCATTTCAGCTCACCGCAGCCTCCACCTCCCAGGTTCAAGCAGT
TCTCCTGCCTCAACCTCCCGAGTAACTGGGATTACAGGTGTGTGCCACCACACCCGACTAATTTTTTTATTTTTAGTAGAGACGGGGTTT
CACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCACCCACCTCGGCCTCCCAAAGTACTGGGATTATAGGCGTGAGC
CACGGCATCCAGCCTCATCTTTCTTTTAACCAATAAACATGATGCTGTATCTTAAAAAGAGCACTGAGCAGGGACTTAAGGGATCGAGTC
CTCAACCAAACTGATTTAATTACTCAGGATTTTCAAAAAGCATCAGAGGCTATTTACAATCTTAATCATAGGGGTTCAGTAAAATAAAAA
TAAGAAGTAAAAAAGCAAGAGAAATTATTCTGTAAATCTAACTGGTGTAATTCCCATAATCATGCAATTAAGTTTTACTCTTGAGTTTCC
TGACAGCCATTGGTAAAAAGAGAAACACATCAGGATTTATAATTTTTATCATCCAATTATGGGAAGCAAGCATGTTGGCCCCAGGAGACG
AACTCTTCTACTAATTTATAGCATGTATCCTTTCATCAAGGGCCACTGAGCCAGTTGGTGAGTCAACGGGTGAACCTAAGATGCAAGGAT
GTTTTCCAGGTGACTATTTAAAAAAATAAATAATCCACCAACGTGATTGACCTTGGCGAGATCATGTTTCTAGTCTATACCTCAGTTTCC
CCATCTGTAAAGTGAGGATAATGTCCCACCCCATGTAACTGTGGTGAGGACCAACTGCAACACTGTGCCTGCGAGTCTCCTTGGAAAAGT
GTAAGGTTCTACACAAATGGAAAGTGATCTGATCACACTCAGTGTCCCCAGCCCAGCCTTTCAGTGCCCTGGCCCTGGGGTGGGGGACAA
TACTCTCCTCACCCCCTTCACTAGTCTTCATGAATAGCAAGGAGGCCATAACATAATTTGGTCTAAACCCCTTCCTTTTTAAAAGAATGA
TGGCAAAATGTGCATAACATTAAATTTACCATTTTAAAGATTTCTAAGTGTCTCGAAGTACATTTGCAATGTGTAACTGCCACCTCCAGA
ACTTTTTCATCATCCTAAACTGAAACTCTGTACCCACTAAACAGTAACTCCCCACTCCCCCTGTCCCCAGTCCCTGGTAACCTCTATTCT
ACTTTTTTTTTTTCTCTGTGGATTGGCCTATTCTAGGTATCTCTTTTAAGTAGAATCATATAGTATTTGTCCTTTTGTGTCCAGCCCCCT
CATTTTTTGAGATGAGGAATCAGGCCCAGAGAAGGCAGTGGCTCACCCAAGGTCACATCGCAAACCAGAGGCAGAGCCAAGACCAGAACC
CAGGTTTCCTGACTCCTAACAGGTCTTGCATCCCCAGTAGTGAATGAGACACTGCTTTGACTTTCTGTAATCTTGGTTTAGCCCCTTCCT
TTCTCTGGGCTCAGTCTGCTTCCCACCACTCACCCATTCTTTCTTTATCATCTCCCTCCAAAGCCTCTTGTCCTCCTGCCTCCTCTTCTC
CTTGGCTGGTTCCTGCCAGCAAATGGAGTGGCCCTGGTCCATGCCAGGTTTTCCTCTTCTGGGTCCGGAGCTCACTATAGTATTCAGCCC
TCAGTCCTCCCAGGATGTTCTTCCCAGCACTGCTGCCTCGTGCGGATTTTCCCGTAACCTCAGTAACTGGCTTCTTGTCCCCCTGCTTCC
TACCAGGGAAGCCTTCCTGTCCGCGTCTGTGGTTTCCCAACTCTGATATTTGCTCTCAAATGTGGTGGTGTCCTGGTTCTGTGTTTATTT
ATTTTGTGTTTTCTCACACACACAAAAAAAAGCCTCTTCCTCCAATGCATCAGGAGGCACCAGCCCTGCCAGCCCTTCTCACTGGGCTCA
CCCTGCCCCAGCAACCCCCCGGTGCCACCCTTAAGCCACACTGTCTTCTCTGTAAGCAGCCTGCCAGCAGCAGCCCCAGCACTTTGCAAT
GGGCGTGTGTGTGGTGGTGGGTGGGGGGGGCTTGGATCCCTCCTTTTTCCTCCCCTGCCCTGCCCAGGCCCAGATGGCCTTGACTGTAAA
GCAGGTGCTGCCTGACAGGTTCTTCTCTCCCTGTCTCTGGTCATTGATCCATCTCTTTGTCCATTCAGTATCCAACCATCCTCTCCATTC
TCCTCTGGACCTCACCACTCTCAGAGCTGCTTGTCCTGGCAGAATCTACAGTTCACCCCAACTCTATGCCTTACCCCTCCCAACCCAACA
GCATTTGCAGTTTGCAAAATATACAGACCCAAGTCCTGAGGGGACTGAGGACATGATGCTGGGCCCAAGTCTCCTGCTCAGGGCTTCTCT
CCAATGCCAGCCCTGCCACTCCTTCCTCACCCTCCTTGGAGCCTCCTCTGCTGCTTGTCTATCCCAACGGCCCTGCTCCCCTCCCTTCCT
GCCCTTCACCAGCTTTCTGGACACCATGCCCTGAGGAAGGGACCTTTGGTTTTCTCTAAACATCTTTGAAGGGCTGAGGCAGTCAGGGCT
GGCTGCCTTGTCACTCTTTATTTGGAAGCCACTCAAACCATTCCCAAGAAGAGGGACCTCAGCTGGCAATCTGGAAACCTGGCCCAGGTC
TGGGCAGATGTCTTCACTTCTCCTACCTTCCCAGTCTTGTGATCCTGTGATGAGCACCAGGATGGCCCTGTGGTCCCTAGAGCACCCCTC
ATGCTGTAGGGTCCTGCAGCCCCATCCTTTCTCTACTGGGCCCTGGTATCCTGGCTCCTCTCTCAGCTCTGCCACTGATCTCTGTGCCTT
AGTTTACTTCTCTGCACGGGGGACTCACCCCAAGACCATTTCCAGCAGCTTCCCAGGTGATGTGGTGCCCCAAGGCTGGGCTTTGCCAGC
TGTGGCCCAGCTCCTTAGTGCTGCCCAGGAGACACCAGGCTGCTCAGAATGAGGTGACTGCGGGCACCATTCTCAGCCAGTGGTTCTTGT
ATTGCATTCCAGCAGCAGGAATATCACCTGGGAACTTGATAGAAGTGCAGATTAGCAGCCCCACCCAAGACCCACTGAATTAGAGCTTGT
GGAGTGGGGCCCTACAAGCTGGGGTTTTAAGGAGCCCTCCAAATGATTCTGACGCATAAGAATATGCCAACTGCTGATCTGGGCTAGCCA
TTAGTAGAGCCTGGGGAGGGACTGGGACTGGCTAGGCCAAGAACAGGTGGAAAACACCAGCCTTATCTGGACTCCTGAGATTGGGAACCA
CCACCAACAAAAACCAACCCTATAGTCGCTCCTCTTGGAAGAGGAAGAGAAGTTGAAGGGCCTGGAGAAAGCACACATTGTTTGTTTCCC
TGCTCCTGCTCACCTCTCTCACTTGTCTTGGTTTTACAAAAGGCTGTGTGGATGGTGCCAGCCAGGGAGGGGGTGGGAGTCCTGGGGAGG
CAGGAGGCAGAAGACCCTGACTGTTTCTCCCTTGGGAACCTCACCATAGGCCAGATAGCGCCTCTTCAAACTGAAAGAAATCTTAACTCC
ACAAAGAAAGCATCCTAAATCCCCAGTTCCTCCTCCTCCCAACCCCAGGGATACCTTGTAGACAGTGCCAAAAAACAGCTCCAACCCCCA
GCAGCTGGGAAGAGAGCCAGAAGCTGCCCTTCCTCCTCATCCTGGCCCCTCCCCAGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCA
GCCTGGTTTCCTGGTGAGGGTCCTGCAGTCATGGGCCCTGGGGGACCCCCAGGGCAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCT
TGGGCCAACTGTCCACCTCTCTTGTCCACTATTCTCTCCTTTCCACTTCTGTCTTCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAG
GACAACTGCAGTCCAATCCACCAGCTCTCCCTGCCCCTGTGTCTTATTTCAGACATGAGAATAACTGTACAGTGTAAACTTATAAAGCGT

>8490_8490_1_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000312783_SOGA1_chr20_35445872_ENST00000237536_length(amino acids)=1539AA_BP=235
MDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSNSSQRIPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSR
PLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRR
EVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATHSLKKRGTRSLGKADKKTLVQEDSADLKCQLHFAK
EESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMK
DHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELAKFRSEHELDVALSEDSCSVLSEPSQEE
LAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAAL
VSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELNAFTRKADAVLGCSVKEQQESFSSLPPL
GSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLS
QERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQFKHNFLLLFMKLRWFLKRWRQGKVLPSE
GDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAE
LKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQEKLQLVERLQGEKQQVEQQVKELQNRLS
QLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSS
EPGDPEKDTKEKPGLSSRDCNHLGALACQDPPGRQMQRSYTAPDKTGIRVYYSPPVARRLGVPVVHDKEGKIIIEPGFLFTTAKPKESAE
ADGLAESSYGRWLCNFSRQRLDGGSAGSPSAAGPGFPAALHDFEMSGNMSDDMKEITNCVRQAMRSGSLERKVKSTSSQTVGLASVGTQT
IRTVSVGLQTDPPRSSLHGKAWSPRSSSLVSVRSKQISSSLDKVHSRIERPCCSPKYGSPKLQRRSVSKLDSSKDRSLWNLHQGKQNGSA
WARSTTTRDSPVLRNINDGLSSLFSVVEHSGSTESVWKLGMSETRAKPEPPKYGIVQEFFRNVCGRAPSPTSSAGEEGTKKPEPLSPASY
HQPEGVARILNKKAAKLGSSEEVRLTMLPQVGKDGVLRDGDGAVVLPNEDAVCDCSTQSLTSCFARSSRSAIRHSPSKCRLHPSESSWGG
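
The record headers in this section follow a fixed underscore-delimited layout: two leading numeric fields, a record index, the fusion name, then gene symbol, chromosome, breakpoint coordinate and Ensembl transcript ID for each partner, followed by `length(...)` and `BP`. The following is a minimal sketch of a parser for that layout. It is not a FusionGDB tool: the function name `parse_header` and the dictionary keys are illustrative, the meaning of the two leading numeric fields is not stated here so they are kept as opaque strings, and the pattern assumes gene symbols contain no underscores.

```python
import re

# Pattern inferred from the headers shown in this section (an assumption,
# not an official format specification).
HEADER_RE = re.compile(
    r"^>(?P<field1>\d+)_(?P<field2>\d+)_(?P<index>\d+)_"
    r"(?P<fusion>[^_]+)_"
    r"(?P<hgene>[^_]+)_(?P<hchr>chr[^_]+)_(?P<hbp>\d+)_(?P<henst>ENST\d+)_"
    r"(?P<tgene>[^_]+)_(?P<tchr>chr[^_]+)_(?P<tbp>\d+)_(?P<tenst>ENST\d+)_"
    r"length\((?P<kind>transcript|amino acids)\)=(?P<length>\d+)(?:nt|AA)_"
    r"BP=(?P<bp>\d+)(?:nt)?$"
)


def parse_header(line):
    """Return the header fields as a dict, or None if the line does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        return None
    fields = match.groupdict()
    # Convert the purely numeric fields; IDs and symbols stay as strings.
    for key in ("index", "hbp", "tbp", "length", "bp"):
        fields[key] = int(fields[key])
    return fields


if __name__ == "__main__":
    example = (
        ">8490_8490_2_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000312783_"
        "SOGA1_chr20_35445872_ENST00000279034_length(transcript)=3967nt_BP=948nt"
    )
    print(parse_header(example))
```

Separator lines and sequence lines return `None`, so the same function can be used to split the listing into records.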

--------------------------------------------------------------
>8490_8490_2_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000312783_SOGA1_chr20_35445872_ENST00000279034_length(transcript)=3967nt_BP=948nt
ACAAGGCAGCCTCGCTCGAGCGCAGGCCAATCGGCTTTCTAGCTAGAGGGTTTAACTCCTATTTAAAAAGAAGAACCTTTGAATTCTAAC
GGCTGAGCTCTTGGAAGACTTGGGTCCTTGGGTCGCAGGGTCTCACTCCATTGCCCAGGCCAGAGTGCGGGGATATTTGATAAGAAACTT
CAGTGAAGGCCGGGCGCGGTGGCTCATGCCCGTAATCCCAGCATTTTCGGAGGCCGAGGCATCATGGACCGATCTAAAGAAAACTGCATT
TCAGGACCTGTTAAGGCTACAGCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTGACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTA
AATAGTGGCCAGGCTCAGCGGGTCTTGTGTCCTTCAAATTCTTCCCAGCGCATTCCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAG
CCGGTTCAGAATCAGAAGCAGAAGCAATTGCAGGCAACCAGTGTACCTCATCCTGTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAG
CAGCCCCTGCCATCGGCACCTGAAAATAATCCTGAGGAGGAACTGGCATCAAAACAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCT
TTGGAAGACTTTGAAATTGGTCGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTTTATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTG
GCTCTTAAAGTGTTATTTAAAGCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAGCTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGG
CATCCTAATATTCTTAGACTGTATGGTTATTTCCATGATGCTACCAGAGTCTACCTAATTCTGGAATATGCACCACTTGGAACAGTTTAT
AGAGAACTTCAGAAACTTTCAAAGTTTGATGAGCAGAGAACTGCTACTCATTCCTTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCC
GATAAGAAGACTTTGGTGCAGGAGGACAGTGCAGACCTGAAGTGCCAGTTGCACTTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAG
CTCACTAAGCTTGCCAAGGAGAATGACAGCATGAAGGAGGAGCTGCTGAAGTACCGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCA
GCCGAGGAGCTGGCCGATGCCCCCCACTCGCGGGAGACCGAGCTGAAGGTGCACCTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGC
CGCCGCATCGTGGAGCTGGAGGTGGAGAACCGAGGCCTGCGGGCTGAGATGGACGACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCT
GAGGCACGCCTGGCCTTCTCCGCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTGGCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAG
GAGGCCGAGCTGCTGCGGCGCTCCTCTGCCGAGCTCGAGGACCAGAACAAGCTGCTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCAC
GAGCTGGACGTGGCGCTGTCGGAGGACAGTTGTTCTGTGCTCAGCGAACCTTCACAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGC
GAGCTCAGCGGCAAGGTCAAGAAGCTGCAGTACGAGAACCGCGTGCTCCTCTCCAACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGT
ACGCGGCCCATGCTGGAGACGGACGCCGAGGCCGGGGACTCTGCCCAGTGTGTGCCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCG
GTCCGACTCTGCAGAGCCAGGGAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAGGCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTG
GCTGATGCCAATGGCTTCACGGCTGGCCTCCGGCTGTGTCTGGACAACGAGTGTGCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGC
GAGGGCCCCAGGGACACCAAGCTCATCCATGCCATCCTGGTGCGCCTGAGCGTGCTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCA
GATGCAGTCCTCGGGTGCTCTGTCAAGGAACAGCAGGAGTCCTTCTCATCACTGCCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATT
CTTCTGGCAAAAGACCTTGGCTCAGACTTTCAGCCACCTGACTTCAGGGACCTGCCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGC
ACTGGTGACTTGGACTCTAAGCCCGACCCCAGCCGGAGCTTCAGGCCTTACCGAGCTGAAGACAATGATTCCTATGCCTCTGAGATCAAG
GAGCTGCAGCTGGTGCTGGCTGAGGCCCACGACAGCCTCCGGGGCTTGCAAGAGCAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAG
GCCGACAATTTCAACCAGAAAATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCGCTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGC
CTCCAGCGGAGGCTGGAGCAGAAATTCTGGAGCCAGGAGAAGAACATGCTGGTGCAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTG
CTCTTCATGAAGCTCAGGTGGTTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTGCCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGC
ATGAAGGAGCTGTACTTGCTGATGGAGGAAGAGGAGATAAACGCTCAGCATTCTGATAACAAGGCCTGCACGGGGGACAGCTGGACCCAG
AACACGCCCAATGAGTACATCAAGACACTGGCCGACATGAAGGTGACGCTGAAGGAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGT
CTGACGGAGCTTCAGCAACAGTTTGCCAAGGCCAAGGCTACCTGGGAGACAGAGCGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAG
CTGAAGACAGGGAAGGGGGCCGGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCCCTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTA
GCTGAGTCCTACAGCGCTGTCATGGAGCTGACTCGGCAGCTGCAGATCAGTGAGCGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAG
CGGCTGCAGGGTGAGAAGCAGCAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAACCGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGG
GTCCTGAAGCACTCGGAGCTGGAGAAGCAGGACAACAGCTGGAAGGAGACACGCAGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAA
GTTGAGCTTGGAGGAAATGGTTTAAAGAGAACCAAATCTGTTTCTTCCATGTCTGAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTT
GCTGGCGGAGATGCCCGGGGCAAGAAGCTGCCTAACAACCCTGCCTTTGGCTTTGTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACC
AAGGAGAAGCCTGGGCTCTCGTCGAGGGACTGCAACCACCTGGGTGCCCTGGCCTGCCAGGACCCCCCAGGGAGCCAGAAGCTGCCCTTC
CTCCTCATCCTGGCCCCTCCCCAGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCAGCCTGGTTTCCTGGTGAGGGTCCTGCAGTCAT
GGGCCCTGGGGGACCCCCAGGGCAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCTTGGGCCAACTGTCCACCTCTCTTGTCCACTAT
TCTCTCCTTTCCACTTCTGTCTTCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAGGACAACTGCAGTCCAATCCACCAGCTCTCCCT
GCCCCTGTGTCTTATTTCAGACATGAGAATAACTGTACAGTGTAAACTTATAAAGCGTTTTTAATGGTTGTAGATTGGAAATAAAGTATG

>8490_8490_2_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000312783_SOGA1_chr20_35445872_ENST00000279034_length(amino acids)=1132AA_BP=235
MDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSNSSQRIPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSR
PLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRR
EVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATHSLKKRGTRSLGKADKKTLVQEDSADLKCQLHFAK
EESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMK
DHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELAKFRSEHELDVALSEDSCSVLSEPSQEE
LAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAAL
VSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELNAFTRKADAVLGCSVKEQQESFSSLPPL
GSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLS
QERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQFKHNFLLLFMKLRWFLKRWRQGKVLPSE
GDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAE
LKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQEKLQLVERLQGEKQQVEQQVKELQNRLS
QLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSS
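
The amino-acid record above can be related to its transcript record by translating from the ORF start codon with the standard genetic code. The sketch below does this in plain Python. It is an illustration only, not the method FusionGDB uses for its ORF analysis; `translate` and `CODON_TABLE` are names introduced here. The example input is the last 27 nt of the third sequence line of the `ENST00000312783`/`ENST00000279034` transcript record, which begins with the ATG that starts this protein.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nt, start=0):
    """Translate nt from 0-based offset `start` until a stop codon or the end.

    Codons containing characters other than A/C/G/T are reported as 'X'.
    """
    protein = []
    for pos in range(start, len(nt) - 2, 3):
        aa = CODON_TABLE.get(nt[pos:pos + 3].upper(), "X")
        if aa == "*":  # stop codon ends the ORF
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First nine codons of the ORF, copied from the transcript record.
    print(translate("ATGGACCGATCTAAAGAAAACTGCATT"))  # MDRSKENCI
```

The output matches the first nine residues of the protein record (`MDRSKENCI...`). Counting 90 nt per sequence line, this ATG appears to sit at position 244 of the transcript, so the transcript breakpoint at `BP=948nt` falls (948 - 244 + 1) / 3 = 235 codons into the ORF, which agrees with `BP=235` in the amino-acid header.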

--------------------------------------------------------------
>8490_8490_3_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000371356_SOGA1_chr20_35445872_ENST00000237536_length(transcript)=13795nt_BP=837nt
CTTGGAAGACTTGGGTCCTTGGGTCGCAGGCTGGAGTGCAATGGTGTGATCTCAGCTCACTGCAACCTCTGCTTCCTGGGTTTAAGTGAT
TCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCATCATGGACCGATCTAAAGAAAACTGCATTTCAGGACCTGTTAAGGCTACA
GCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTGACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTAAATAGTGGCCAGGCTCAGCGG
GTCTTGTGTCCTTCAAATTCTTCCCAGCGCATTCCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAGCCGGTTCAGAATCAGAAGCAG
AAGCAATTGCAGGCAACCAGTGTACCTCATCCTGTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCT
GAAAATAATCCTGAGGAGGAACTGGCATCAAAACAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCTTTGGAAGACTTTGAAATTGGT
CGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTTTATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTGGCTCTTAAAGTGTTATTTAAA
GCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAGCTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGGCATCCTAATATTCTTAGACTG
TATGGTTATTTCCATGATGCTACCAGAGTCTACCTAATTCTGGAATATGCACCACTTGGAACAGTTTATAGAGAACTTCAGAAACTTTCA
AAGTTTGATGAGCAGAGAACTGCTACTCATTCCTTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCCGATAAGAAGACTTTGGTGCAG
GAGGACAGTGCAGACCTGAAGTGCCAGTTGCACTTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAGCTCACTAAGCTTGCCAAGGAG
AATGACAGCATGAAGGAGGAGCTGCTGAAGTACCGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCAGCCGAGGAGCTGGCCGATGCC
CCCCACTCGCGGGAGACCGAGCTGAAGGTGCACCTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGCCGCCGCATCGTGGAGCTGGAG
GTGGAGAACCGAGGCCTGCGGGCTGAGATGGACGACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCTGAGGCACGCCTGGCCTTCTCC
GCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTGGCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAGGAGGCCGAGCTGCTGCGGCGC
TCCTCTGCCGAGCTCGAGGACCAGAACAAGCTGCTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCACGAGCTGGACGTGGCGCTGTCG
GAGGACAGTTGTTCTGTGCTCAGCGAACCTTCACAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGCGAGCTCAGCGGCAAGGTCAAG
AAGCTGCAGTACGAGAACCGCGTGCTCCTCTCCAACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGTACGCGGCCCATGCTGGAGACG
GACGCCGAGGCCGGGGACTCTGCCCAGTGTGTGCCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCGGTCCGACTCTGCAGAGCCAGG
GAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAGGCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTGGCTGATGCCAATGGCTTCACG
GCTGGCCTCCGGCTGTGTCTGGACAACGAGTGTGCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGCGAGGGCCCCAGGGACACCAAG
CTCATCCATGCCATCCTGGTGCGCCTGAGCGTGCTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCAGATGCAGTCCTCGGGTGCTCT
GTCAAGGAACAGCAGGAGTCCTTCTCATCACTGCCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATTCTTCTGGCAAAAGACCTTGGC
TCAGACTTTCAGCCACCTGACTTCAGGGACCTGCCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGCACTGGTGACTTGGACTCTAAG
CCCGACCCCAGCCGGAGCTTCAGGCCTTACCGAGCTGAAGACAATGATTCCTATGCCTCTGAGATCAAGGAGCTGCAGCTGGTGCTGGCT
GAGGCCCACGACAGCCTCCGGGGCTTGCAAGAGCAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAGGCCGACAATTTCAACCAGAAA
ATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCGCTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGCCTCCAGCGGAGGCTGGAGCAG
AAATTCTGGAGCCAGGAGAAGAACATGCTGGTGCAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTGCTCTTCATGAAGCTCAGGTGG
TTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTGCCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGCATGAAGGAGCTGTACTTGCTG
ATGGAGGAAGAGGAGATAAACGCTCAGCATTCTGATAACAAGGCCTGCACGGGGGACAGCTGGACCCAGAACACGCCCAATGAGTACATC
AAGACACTGGCCGACATGAAGGTGACGCTGAAGGAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGTCTGACGGAGCTTCAGCAACAG
TTTGCCAAGGCCAAGGCTACCTGGGAGACAGAGCGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAGCTGAAGACAGGGAAGGGGGCC
GGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCCCTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTAGCTGAGTCCTACAGCGCTGTC
ATGGAGCTGACTCGGCAGCTGCAGATCAGTGAGCGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAGCGGCTGCAGGGTGAGAAGCAG
CAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAACCGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGGGTCCTGAAGCACTCGGAGCTG
GAGAAGCAGGACAACAGCTGGAAGGAGACACGCAGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAAGTTGAGCTTGGAGGAAATGGT
TTAAAGAGAACCAAATCTGTTTCTTCCATGTCTGAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTTGCTGGCGGAGATGCCCGGGGC
AAGAAGCTGCCTAACAACCCTGCCTTTGGCTTTGTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACCAAGGAGAAGCCTGGGCTCTCG
TCGAGGGACTGCAACCACCTGGGTGCCCTGGCCTGCCAGGACCCCCCAGGGAGGCAGATGCAGCGCAGCTACACGGCTCCTGACAAGACG
GGCATCCGAGTCTACTATAGTCCCCCGGTGGCCCGGCGCCTCGGAGTCCCTGTGGTTCATGACAAAGAGGGCAAGATCATTATCGAGCCC
GGCTTCCTCTTCACCACAGCCAAGCCCAAAGAGTCGGCCGAGGCTGATGGGCTGGCTGAGAGCTCCTATGGTCGGTGGCTCTGCAACTTC
TCACGGCAGCGCCTGGACGGAGGCTCAGCGGGCAGCCCCTCGGCGGCCGGGCCTGGCTTCCCAGCGGCCCTGCATGACTTTGAGATGTCA
GGCAACATGAGTGATGACATGAAGGAGATCACCAACTGTGTGCGCCAGGCCATGCGCTCCGGCTCACTGGAGAGGAAAGTGAAGAGCACA
TCCAGCCAGACGGTGGGCCTGGCCAGTGTGGGCACACAGACCATCCGCACGGTCAGCGTGGGCCTGCAGACCGACCCACCCCGCAGCAGC
CTCCATGGCAAGGCCTGGTCACCCCGCAGCTCTTCGCTCGTGTCTGTGCGCAGCAAGCAGATCTCCTCCTCCCTGGACAAGGTCCATTCG
CGCATCGAGCGGCCCTGCTGCTCCCCCAAGTATGGCTCACCAAAGCTCCAGAGGCGGTCTGTGTCCAAGCTGGACAGCAGCAAGGACCGC
AGCCTGTGGAACCTGCACCAGGGCAAGCAGAACGGCTCGGCCTGGGCCCGCTCCACCACCACGCGGGACAGCCCTGTATTGAGAAACATC
AACGATGGACTCTCCAGCCTCTTCAGTGTGGTGGAGCACTCAGGGAGCACGGAGTCTGTCTGGAAACTAGGCATGTCTGAGACGCGGGCC
AAGCCCGAGCCTCCCAAGTACGGCATTGTGCAGGAATTCTTCCGTAATGTGTGTGGCCGGGCACCGAGCCCCACCTCATCAGCAGGAGAG
GAGGGCACCAAGAAGCCAGAGCCCCTCTCCCCAGCCAGCTACCATCAGCCAGAGGGTGTGGCCAGGATCCTGAACAAGAAGGCAGCCAAG
TTGGGCAGCAGTGAGGAGGTCAGACTCACCATGCTCCCCCAGGTGGGGAAGGATGGTGTCCTCCGGGACGGAGATGGAGCCGTGGTCCTT
CCCAATGAGGACGCTGTTTGTGACTGTAGTACCCAGTCTCTCACCTCCTGCTTCGCCCGATCGTCCCGCTCTGCCATCCGCCACTCTCCT
TCCAAGTGCAGGCTGCACCCTTCAGAGTCCAGCTGGGGTGGGGAGGAGAGGGCACTCCCCCCCAGCGAGTGACAGAGCAGCCAAGCTCCC
CGCCTCAACCAGCCCAGCCCCTGGATAGCAGAAGGGAACCAGCAGAGACGAGACGAGGTGAGGCGAGGGGCTGTGTCCTCAGCATTGCCT
GGCCCTGGAGGGACAGCAGTGATGCCACTGCCAGAATGCAGCTTTCACATCAAGGTAAAGCCGGGTCTCCTGCTGGCCCCTGGGTGGTGA
GCTTCGACTTCCCAGGGGAAGGCAGTGAGTGGGAGAGAGACCAAACCTGGGCTTCCCAAGCATCCACTGAGAGATCTGTCAAGAGCCGAT
CCCTGGGTCCTAAGAGAGAGCCTTGCCTGGTTCTGCCCATGCCACCCTCTTGGAAGAGCCCAAGAAGGATACATGTCTGGCCATGCCTTT
GGGGAAAAGGAGTCGGAGAGATGTTTCCTGCTGACCATCCACCCCTTCATTTGGGAGGAGACACTGCTGAGAAGAACAGGCTTTGCTCTA
GGGCTCCATGTTTGGTTCCTGGTGGAGCCCTGTTGGGCATCATCACCATCACCTCCTTCTCTCCACCACCTCCTCCTCCCAGCCCCACTG
CTCTAGATTGCTGGGACACTAGGGAGTATGATAGGGCAGTAGCCAGGGCCATTGCTTAGTGTCCTGGAGCCCTGGATCTCCCTGCCCATA
GCCTGGATGCAGCAAGAGCTGGGAGGCGAAGTGGAAACATGCAGGGCTCAGGGTTGGGGAGTGATTGCAATTGCCTTCCTTGCCAAAGTG
ACTTGGGGCCCCAACGTTCCCAGCAGACCCCTTGAGGACAGAAATAGGTAGAGTCAGTCTCAAGACCTGGTGCATAGATAAATGCCTAAA
TACACTGCCTTGATCTCAAGTGATCTCAGAGGCCTCTTTCCCTGGCACCCTGAGAGGCAGCAGGCACTACATCTCCACTGTGTTTACATC
CTGCAGCTGGTCGGGGGGCAAAGATATTCCCAGTAAGAGATTCTTGGTTGGCCAGGTCAGGCCCAGGAGAACACCAAGAGGCCAGAGCCC
AGGACACAGCAGTAGACTGGGGCCTGGAAACACGTATCTTGCCTAGATTGTTTATTTGAATTTTTCCTACTATAAATATTTAAGGTGGTT
TACTTTATTTTAATAATTTAATTTACCCCAAAGTCCCTAAGGTAATTTATTGGAGGTTGAAACATGCATTCTTGCCACTGGGACAACATG
AGGCCTCTAACAGCACGGGCAGGCATGGGGTCCCCTGGGTGGACGAGGCCGCTTGGCAGCCAGGTTTGGAGACCTGGCCTCCTGGTCAGC
TTTGGAGGGCCCCTCAACAGAGCTGGAGCCCTGCACCCCAACACGGCTGGCCATGTGGCCTCAGAACACTACTTATTACTCAATGCCTGG
TACTTGGCCAGCCCCAGCGGTCAGTCTATAAATACTCACTGACAAGGTGGAGGGCTGGACGGCCATCACCACTCCCCAGACGTTCTCCAT
TGCCTGTCTCATTTCCCTCCCTCCCTCTGACACCTTTCTTCATGAGTCGAACGTGGATTACTAAAGCTCTATTAAGAGTGTGGAGATCCC
TCCAAGTTTCCCAAATGAGAACTCACAGGAAAACAGGACTGAACTTTGAGAATGTTGTTTATCGCAGCTTTGCACATAAACCTGAGTGTC
TCCCAGCCTGCCTCGGTTCTCACCAGCCTGCCAGCCTTTTCACCAGCCTCTCTCCTTAGCCTTATGGCCTTTCACGGCTCTTCTCCCTGC
CCCAGCTCTGCTGCCCGCCCTTCCTCACGTCCCCTGTGAGCTGCCTGAGCCATTGGTTGGATTTCGATGTGGCTCATTGCAGCATGTGGG
GCAGCGCCTCCCATGGCCTCGCCTTGGTGCCGGTGAACCCCTTTTGGTTGCACACATGCTCCCCACACACACATAGACATCAGCCTTCCT
GTAACTGACTGGGGACCCAGAGTGGAAACACCAGGATGGATCAGCTTGTCTGCAGAATTGCCCATCAGGAAGACCAAAAGCCAGTAGCTT
TGCTGATCTGCCCCCAGGACTCTGGAGGCGCCCCTGCACTCCCACCTCCCACCTGCCAGTTCCCAGACCCACCCATTCGGGATCACCTGG
ACCAGTTACCCCAAGTCCTGCATCTCCCTTCCCTGCAGGCTGAACACCAGGGTCATGCCAGTCCCGCCAGCCGCCTCCTCCATGCCCCAG
TGACTGGTGTGGGCAGAGCAGGCAGCCAGTGGAGCTGTGGGCCAGTTCCGCTCTTGGATGCTGCTGCTCTCACCCATGAGGTCAGGGGGG
CCCTCCAAGGTTATCTCCAGGTGAGGGGATTCACATCAGGCCACAAGCCACCAGAGGCCTTCTGCCACCTCCCAGAGCGACAGCCAGGGA
GGCTGCGTACTCAGCCTCGGGGAGAAATCCCCGTGGGACCTGAGCCCCAAGACCTACGGACCACTCAGCCTTACCATCGTACCGTCCAGG
ATTGTCCTTGCCATCTTTGTTGTCTCAGCCAGACCTTGGTTTTCAGTAAAGCCCCAGTTTCTACTTCCTGCATGCCACTGTGCAAGGCCA
CTCATCACTGTTCCTGCAGAAGCCTCTGGACGTGGGGCTGGATGGGGTTGAAAATGTTACATGTAAATATTGGTTTGGTTCGGTTTTTAG
CATTTTACTTGGTAACTGGTTGTTTTCTTTTTTGGGGTGGGGGGATTGGTTTGTAAAAATTCTCTACTCTTTTGGAATGTGATTTCTAAG
TTTGTTGGTTTCTTCAAATGCCTTTTAAGTCTTGGTAACATTCCCAAAGCAGAAAACTGCCTGACCCACAGTGGGGATTCCCTGGAGAAT
TGGGGTCCCAAGAAGGAATGCTGCCCTTCTCGAACCCGTTCTCCCCCTTCCTCCTGCCTCTCTGCCTTTTACTGCTATTCCCTTCTTCTC
CTCCTTTATCCTTCTTTCTGTTTTCCCCATCTCCACTCTCTCTTCAACCAAAGTCCCAAGGAACCCTCGGGGCTCAATCCCCCATAGACC
ACTTGGCTTGGGTCCATGGGGTTGGCATCAGTTGGTTGGCGGAAATGGGGGACCAGTTGGCATGATGGCCCTAAACTGGGAAACCTCATG
TTTCTTATGTCTCACCTCTTTCCAGAGCCAAATCAGCCCCTTTTGGAATGATGACTTCATTGGAATGCAAATCAAGTCATTTTGGTGCAT
CAGTGGCTCTTAGGCCTGCACACACGAGACATCAGAATCCAATCCTCTGACCCTGTGCCAGCCCTTTCCCCCAGTTTATTTCCCACCAAA
GGCTGACCTCTAAGAGGTCTTGCTTTCTATGAACTCAAGATGGGTCCCACCTCTAGGTGTCCCCAGGTGCACTCTTCTACCGGTTGGCTT
CCGATGTGACAAGGCCAAGGGCCCAAAGACTTGACCCTCTTACACCCTTGCTGACATGGTTCCATCATGTCCACCCGCATGCACTTTTAT
GGTTTCATCACCCAGCCTCTTCTCCTCTGGCCCACCCAGCGTCCAGGCTCTTTCTCCCTCTCCCCTCCTATCTAGAATGTCCCCTGCTTC
TAGCCTCACCAGACCCCCCAAGCTCCCACTACTTCTTCCATAATAATAGTAATAACAATGGTTATCATCATCCCCTGCACATCCCGCCTA
AAGCACTTTACTATATAGAAAACGTTTCCCCTGGCCGGGCATGGTGGCTCACGCCTGAAATCCCAGCTCTTTGGGAGGCTGAGGCGAGCG
GATCAGTTGAGGTCAGGAGTTCAACGCCAGCCTGGCCAACGTGGTGAATCCCTGTCTGTACTAAAAGTACAAAAAATTAGCTGAGCATGG
TGGTGCGTGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGTGGGAGAATCGCTTGAGCCCAGGAGGCGGAGGTTGCAGGAGCAGAGATTG
CGCCACTGCACTCCAGCCTGGATGACAGAGTGAGACCCAATCTCAAAAAAGAAATCGTTTCCCACCCCACATCTCCTTCAGACCTCTCAG
GGACAACTCTGGGAGGCAGCCTTGGCAGGACATGGGTTAGTGCGCCCATTTTGCTGTGAGGAAACTGAGGTACAGGTCTCATCCCAGAGC
ATGAGAAGTCACTGAGTTTAGATGAGAACTTGGGTCCAACTCTGTCCTGTTTGCTGTGCAAATCCGCTGCCCTGCTGGGGGCTTTTGGTG
GGTCCAGAATACCCAGAATATGCTGCTGGCCAACCCAGGCATAAAACAAGTCCATTCTAGATCACTGAGCCTTGTGTATTCCAGAGGGTG
ATCTGAGGTCCCCATTCAGCAGAATTCTCTGAGGGCATGTTCAGAATGTAGATTCCTGGGCCCCACCTTGAATTTGCATGTTTAACAAAC
TCTCCTGGGGTTGAGGGGTGGGTGCAGTGGTCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGTGGATCACTTGAGCTCAGGA
GTTCGAGCCTGGCCAATATGGTGAAACCCTGTCTCTACTAAAAATGCAAAAATTAGCCAGGTGTGGTGGCACATGCCTGTAATCCCAGCT
ACTTGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGTGGGGAGCAGCGGTTGCAGTGAGCCGAGATTGTGCCATTGCGCTCTAGCCTG
GGCAACGGAATGAGACTTGTCTCAAAAAAAATAAAAAATAAAACCAGCCCTCCCCGGGGGATCTTAGGCACTATTGGCCACACCATTGGT
GTTCGTGGCCCTGATTATTAGGCTATCTTTCTTTTTTTAAGTTTTTTTAGATTTATTTTTTATTTTATTTATTTATTTATTTATTATTTA
TTATTATTATTTTTAGAGACAGGGGGTCTCCCTATGTTGCCCAGGCTGGTTTCAAACTCCTGGGCTCAAGTGATCTGCCCTCCTCAGCCT
CCCAAAGTGCTGGGGTTACAGGCAGGCATCAGCCACCGTGCCAGGTCATCTTCCTTTTTCTTTTTTTTTTGGAGACAGAGTCTTGCTCTG
TCATTCAGGCTGGAGTACAGTGGCGCCATTTCAGCTCACCGCAGCCTCCACCTCCCAGGTTCAAGCAGTTCTCCTGCCTCAACCTCCCGA
GTAACTGGGATTACAGGTGTGTGCCACCACACCCGACTAATTTTTTTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGGT
CTTGAACTCCTGACCTCAGGTGATCCACCCACCTCGGCCTCCCAAAGTACTGGGATTATAGGCGTGAGCCACGGCATCCAGCCTCATCTT
TCTTTTAACCAATAAACATGATGCTGTATCTTAAAAAGAGCACTGAGCAGGGACTTAAGGGATCGAGTCCTCAACCAAACTGATTTAATT
ACTCAGGATTTTCAAAAAGCATCAGAGGCTATTTACAATCTTAATCATAGGGGTTCAGTAAAATAAAAATAAGAAGTAAAAAAGCAAGAG
AAATTATTCTGTAAATCTAACTGGTGTAATTCCCATAATCATGCAATTAAGTTTTACTCTTGAGTTTCCTGACAGCCATTGGTAAAAAGA
GAAACACATCAGGATTTATAATTTTTATCATCCAATTATGGGAAGCAAGCATGTTGGCCCCAGGAGACGAACTCTTCTACTAATTTATAG
CATGTATCCTTTCATCAAGGGCCACTGAGCCAGTTGGTGAGTCAACGGGTGAACCTAAGATGCAAGGATGTTTTCCAGGTGACTATTTAA
AAAAATAAATAATCCACCAACGTGATTGACCTTGGCGAGATCATGTTTCTAGTCTATACCTCAGTTTCCCCATCTGTAAAGTGAGGATAA
TGTCCCACCCCATGTAACTGTGGTGAGGACCAACTGCAACACTGTGCCTGCGAGTCTCCTTGGAAAAGTGTAAGGTTCTACACAAATGGA
AAGTGATCTGATCACACTCAGTGTCCCCAGCCCAGCCTTTCAGTGCCCTGGCCCTGGGGTGGGGGACAATACTCTCCTCACCCCCTTCAC
TAGTCTTCATGAATAGCAAGGAGGCCATAACATAATTTGGTCTAAACCCCTTCCTTTTTAAAAGAATGATGGCAAAATGTGCATAACATT
AAATTTACCATTTTAAAGATTTCTAAGTGTCTCGAAGTACATTTGCAATGTGTAACTGCCACCTCCAGAACTTTTTCATCATCCTAAACT
GAAACTCTGTACCCACTAAACAGTAACTCCCCACTCCCCCTGTCCCCAGTCCCTGGTAACCTCTATTCTACTTTTTTTTTTTCTCTGTGG
ATTGGCCTATTCTAGGTATCTCTTTTAAGTAGAATCATATAGTATTTGTCCTTTTGTGTCCAGCCCCCTCATTTTTTGAGATGAGGAATC
AGGCCCAGAGAAGGCAGTGGCTCACCCAAGGTCACATCGCAAACCAGAGGCAGAGCCAAGACCAGAACCCAGGTTTCCTGACTCCTAACA
GGTCTTGCATCCCCAGTAGTGAATGAGACACTGCTTTGACTTTCTGTAATCTTGGTTTAGCCCCTTCCTTTCTCTGGGCTCAGTCTGCTT
CCCACCACTCACCCATTCTTTCTTTATCATCTCCCTCCAAAGCCTCTTGTCCTCCTGCCTCCTCTTCTCCTTGGCTGGTTCCTGCCAGCA
AATGGAGTGGCCCTGGTCCATGCCAGGTTTTCCTCTTCTGGGTCCGGAGCTCACTATAGTATTCAGCCCTCAGTCCTCCCAGGATGTTCT
TCCCAGCACTGCTGCCTCGTGCGGATTTTCCCGTAACCTCAGTAACTGGCTTCTTGTCCCCCTGCTTCCTACCAGGGAAGCCTTCCTGTC
CGCGTCTGTGGTTTCCCAACTCTGATATTTGCTCTCAAATGTGGTGGTGTCCTGGTTCTGTGTTTATTTATTTTGTGTTTTCTCACACAC
ACAAAAAAAAGCCTCTTCCTCCAATGCATCAGGAGGCACCAGCCCTGCCAGCCCTTCTCACTGGGCTCACCCTGCCCCAGCAACCCCCCG
GTGCCACCCTTAAGCCACACTGTCTTCTCTGTAAGCAGCCTGCCAGCAGCAGCCCCAGCACTTTGCAATGGGCGTGTGTGTGGTGGTGGG
TGGGGGGGGCTTGGATCCCTCCTTTTTCCTCCCCTGCCCTGCCCAGGCCCAGATGGCCTTGACTGTAAAGCAGGTGCTGCCTGACAGGTT
CTTCTCTCCCTGTCTCTGGTCATTGATCCATCTCTTTGTCCATTCAGTATCCAACCATCCTCTCCATTCTCCTCTGGACCTCACCACTCT
CAGAGCTGCTTGTCCTGGCAGAATCTACAGTTCACCCCAACTCTATGCCTTACCCCTCCCAACCCAACAGCATTTGCAGTTTGCAAAATA
TACAGACCCAAGTCCTGAGGGGACTGAGGACATGATGCTGGGCCCAAGTCTCCTGCTCAGGGCTTCTCTCCAATGCCAGCCCTGCCACTC
CTTCCTCACCCTCCTTGGAGCCTCCTCTGCTGCTTGTCTATCCCAACGGCCCTGCTCCCCTCCCTTCCTGCCCTTCACCAGCTTTCTGGA
CACCATGCCCTGAGGAAGGGACCTTTGGTTTTCTCTAAACATCTTTGAAGGGCTGAGGCAGTCAGGGCTGGCTGCCTTGTCACTCTTTAT
TTGGAAGCCACTCAAACCATTCCCAAGAAGAGGGACCTCAGCTGGCAATCTGGAAACCTGGCCCAGGTCTGGGCAGATGTCTTCACTTCT
CCTACCTTCCCAGTCTTGTGATCCTGTGATGAGCACCAGGATGGCCCTGTGGTCCCTAGAGCACCCCTCATGCTGTAGGGTCCTGCAGCC
CCATCCTTTCTCTACTGGGCCCTGGTATCCTGGCTCCTCTCTCAGCTCTGCCACTGATCTCTGTGCCTTAGTTTACTTCTCTGCACGGGG
GACTCACCCCAAGACCATTTCCAGCAGCTTCCCAGGTGATGTGGTGCCCCAAGGCTGGGCTTTGCCAGCTGTGGCCCAGCTCCTTAGTGC
TGCCCAGGAGACACCAGGCTGCTCAGAATGAGGTGACTGCGGGCACCATTCTCAGCCAGTGGTTCTTGTATTGCATTCCAGCAGCAGGAA
TATCACCTGGGAACTTGATAGAAGTGCAGATTAGCAGCCCCACCCAAGACCCACTGAATTAGAGCTTGTGGAGTGGGGCCCTACAAGCTG
GGGTTTTAAGGAGCCCTCCAAATGATTCTGACGCATAAGAATATGCCAACTGCTGATCTGGGCTAGCCATTAGTAGAGCCTGGGGAGGGA
CTGGGACTGGCTAGGCCAAGAACAGGTGGAAAACACCAGCCTTATCTGGACTCCTGAGATTGGGAACCACCACCAACAAAAACCAACCCT
ATAGTCGCTCCTCTTGGAAGAGGAAGAGAAGTTGAAGGGCCTGGAGAAAGCACACATTGTTTGTTTCCCTGCTCCTGCTCACCTCTCTCA
CTTGTCTTGGTTTTACAAAAGGCTGTGTGGATGGTGCCAGCCAGGGAGGGGGTGGGAGTCCTGGGGAGGCAGGAGGCAGAAGACCCTGAC
TGTTTCTCCCTTGGGAACCTCACCATAGGCCAGATAGCGCCTCTTCAAACTGAAAGAAATCTTAACTCCACAAAGAAAGCATCCTAAATC
CCCAGTTCCTCCTCCTCCCAACCCCAGGGATACCTTGTAGACAGTGCCAAAAAACAGCTCCAACCCCCAGCAGCTGGGAAGAGAGCCAGA
AGCTGCCCTTCCTCCTCATCCTGGCCCCTCCCCAGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCAGCCTGGTTTCCTGGTGAGGGT
CCTGCAGTCATGGGCCCTGGGGGACCCCCAGGGCAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCTTGGGCCAACTGTCCACCTCTC
TTGTCCACTATTCTCTCCTTTCCACTTCTGTCTTCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAGGACAACTGCAGTCCAATCCAC
CAGCTCTCCCTGCCCCTGTGTCTTATTTCAGACATGAGAATAACTGTACAGTGTAAACTTATAAAGCGTTTTTAATGGTTGTAGATTGGA

>8490_8490_3_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000371356_SOGA1_chr20_35445872_ENST00000237536_length(amino acids)=1577AA_BP=3
MGRRLECNGVISAHCNLCFLGLSDSPASASRVAGITGIMDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSN
SSQRIPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSRPLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKG
KFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQR
TATHSLKKRGTRSLGKADKKTLVQEDSADLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRET
ELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMKDHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELE
DQNKLLLNELAKFRSEHELDVALSEDSCSVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGD
SAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAALVSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAIL
VRLSVLQQELNAFTRKADAVLGCSVKEQQESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRS
FRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLSQERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQE
KNMLVQESQQFKHNFLLLFMKLRWFLKRWRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADM
KVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAELKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQ
LQISERNWSQEKLQLVERLQGEKQQVEQQVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKS
VSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSSEPGDPEKDTKEKPGLSSRDCNHLGALACQDPPGRQMQRSYTAPDKTGIRVYY
SPPVARRLGVPVVHDKEGKIIIEPGFLFTTAKPKESAEADGLAESSYGRWLCNFSRQRLDGGSAGSPSAAGPGFPAALHDFEMSGNMSDD
MKEITNCVRQAMRSGSLERKVKSTSSQTVGLASVGTQTIRTVSVGLQTDPPRSSLHGKAWSPRSSSLVSVRSKQISSSLDKVHSRIERPC
CSPKYGSPKLQRRSVSKLDSSKDRSLWNLHQGKQNGSAWARSTTTRDSPVLRNINDGLSSLFSVVEHSGSTESVWKLGMSETRAKPEPPK
YGIVQEFFRNVCGRAPSPTSSAGEEGTKKPEPLSPASYHQPEGVARILNKKAAKLGSSEEVRLTMLPQVGKDGVLRDGDGAVVLPNEDAV

--------------------------------------------------------------
>8490_8490_4_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000371356_SOGA1_chr20_35445872_ENST00000279034_length(transcript)=3856nt_BP=837nt
CTTGGAAGACTTGGGTCCTTGGGTCGCAGGCTGGAGTGCAATGGTGTGATCTCAGCTCACTGCAACCTCTGCTTCCTGGGTTTAAGTGAT
TCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCATCATGGACCGATCTAAAGAAAACTGCATTTCAGGACCTGTTAAGGCTACA
GCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTGACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTAAATAGTGGCCAGGCTCAGCGG
GTCTTGTGTCCTTCAAATTCTTCCCAGCGCATTCCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAGCCGGTTCAGAATCAGAAGCAG
AAGCAATTGCAGGCAACCAGTGTACCTCATCCTGTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCT
GAAAATAATCCTGAGGAGGAACTGGCATCAAAACAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCTTTGGAAGACTTTGAAATTGGT
CGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTTTATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTGGCTCTTAAAGTGTTATTTAAA
GCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAGCTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGGCATCCTAATATTCTTAGACTG
TATGGTTATTTCCATGATGCTACCAGAGTCTACCTAATTCTGGAATATGCACCACTTGGAACAGTTTATAGAGAACTTCAGAAACTTTCA
AAGTTTGATGAGCAGAGAACTGCTACTCATTCCTTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCCGATAAGAAGACTTTGGTGCAG
GAGGACAGTGCAGACCTGAAGTGCCAGTTGCACTTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAGCTCACTAAGCTTGCCAAGGAG
AATGACAGCATGAAGGAGGAGCTGCTGAAGTACCGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCAGCCGAGGAGCTGGCCGATGCC
CCCCACTCGCGGGAGACCGAGCTGAAGGTGCACCTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGCCGCCGCATCGTGGAGCTGGAG
GTGGAGAACCGAGGCCTGCGGGCTGAGATGGACGACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCTGAGGCACGCCTGGCCTTCTCC
GCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTGGCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAGGAGGCCGAGCTGCTGCGGCGC
TCCTCTGCCGAGCTCGAGGACCAGAACAAGCTGCTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCACGAGCTGGACGTGGCGCTGTCG
GAGGACAGTTGTTCTGTGCTCAGCGAACCTTCACAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGCGAGCTCAGCGGCAAGGTCAAG
AAGCTGCAGTACGAGAACCGCGTGCTCCTCTCCAACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGTACGCGGCCCATGCTGGAGACG
GACGCCGAGGCCGGGGACTCTGCCCAGTGTGTGCCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCGGTCCGACTCTGCAGAGCCAGG
GAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAGGCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTGGCTGATGCCAATGGCTTCACG
GCTGGCCTCCGGCTGTGTCTGGACAACGAGTGTGCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGCGAGGGCCCCAGGGACACCAAG
CTCATCCATGCCATCCTGGTGCGCCTGAGCGTGCTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCAGATGCAGTCCTCGGGTGCTCT
GTCAAGGAACAGCAGGAGTCCTTCTCATCACTGCCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATTCTTCTGGCAAAAGACCTTGGC
TCAGACTTTCAGCCACCTGACTTCAGGGACCTGCCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGCACTGGTGACTTGGACTCTAAG
CCCGACCCCAGCCGGAGCTTCAGGCCTTACCGAGCTGAAGACAATGATTCCTATGCCTCTGAGATCAAGGAGCTGCAGCTGGTGCTGGCT
GAGGCCCACGACAGCCTCCGGGGCTTGCAAGAGCAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAGGCCGACAATTTCAACCAGAAA
ATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCGCTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGCCTCCAGCGGAGGCTGGAGCAG
AAATTCTGGAGCCAGGAGAAGAACATGCTGGTGCAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTGCTCTTCATGAAGCTCAGGTGG
TTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTGCCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGCATGAAGGAGCTGTACTTGCTG
ATGGAGGAAGAGGAGATAAACGCTCAGCATTCTGATAACAAGGCCTGCACGGGGGACAGCTGGACCCAGAACACGCCCAATGAGTACATC
AAGACACTGGCCGACATGAAGGTGACGCTGAAGGAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGTCTGACGGAGCTTCAGCAACAG
TTTGCCAAGGCCAAGGCTACCTGGGAGACAGAGCGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAGCTGAAGACAGGGAAGGGGGCC
GGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCCCTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTAGCTGAGTCCTACAGCGCTGTC
ATGGAGCTGACTCGGCAGCTGCAGATCAGTGAGCGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAGCGGCTGCAGGGTGAGAAGCAG
CAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAACCGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGGGTCCTGAAGCACTCGGAGCTG
GAGAAGCAGGACAACAGCTGGAAGGAGACACGCAGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAAGTTGAGCTTGGAGGAAATGGT
TTAAAGAGAACCAAATCTGTTTCTTCCATGTCTGAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTTGCTGGCGGAGATGCCCGGGGC
AAGAAGCTGCCTAACAACCCTGCCTTTGGCTTTGTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACCAAGGAGAAGCCTGGGCTCTCG
TCGAGGGACTGCAACCACCTGGGTGCCCTGGCCTGCCAGGACCCCCCAGGGAGCCAGAAGCTGCCCTTCCTCCTCATCCTGGCCCCTCCC
CAGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCAGCCTGGTTTCCTGGTGAGGGTCCTGCAGTCATGGGCCCTGGGGGACCCCCAGG
GCAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCTTGGGCCAACTGTCCACCTCTCTTGTCCACTATTCTCTCCTTTCCACTTCTGTC
TTCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAGGACAACTGCAGTCCAATCCACCAGCTCTCCCTGCCCCTGTGTCTTATTTCAGA

>8490_8490_4_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000371356_SOGA1_chr20_35445872_ENST00000279034_length(amino acids)=1170AA_BP=3
MGRRLECNGVISAHCNLCFLGLSDSPASASRVAGITGIMDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSN
SSQRIPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSRPLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKG
KFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQR
TATHSLKKRGTRSLGKADKKTLVQEDSADLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRET
ELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMKDHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELE
DQNKLLLNELAKFRSEHELDVALSEDSCSVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGD
SAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAALVSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAIL
VRLSVLQQELNAFTRKADAVLGCSVKEQQESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRS
FRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLSQERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQE
KNMLVQESQQFKHNFLLLFMKLRWFLKRWRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADM
KVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAELKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQ
LQISERNWSQEKLQLVERLQGEKQQVEQQVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKS
VSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSSEPGDPEKDTKEKPGLSSRDCNHLGALACQDPPGSQKLPFLLILAPPQPPPIL

--------------------------------------------------------------
>8490_8490_5_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000395913_SOGA1_chr20_35445872_ENST00000237536_length(transcript)=13852nt_BP=894nt
CTTAAACGCGACTCAAGGCGTCGGGTTTGTTGTCAACCAATCACAAGGCAGCCTCGCTCGAGCGCAGGCCAATCGGCTTTCTAGCTAGAG
GGTTTAACTCCTATTTAAAAAGAAGAACCTTTGAATTCTAACGGCTGAGCTCTTGGAAGACTTGGGTCCTTGGGTCGCAGGTGGGAGCCG
ACGGGCATCATGGACCGATCTAAAGAAAACTGCATTTCAGGACCTGTTAAGGCTACAGCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTG
ACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTAAATAGTGGCCAGGCTCAGCGGGTCTTGTGTCCTTCAAATTCTTCCCAGCGCATT
CCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAGCCGGTTCAGAATCAGAAGCAGAAGCAATTGCAGGCAACCAGTGTACCTCATCCT
GTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCTGAAAATAATCCTGAGGAGGAACTGGCATCAAAA
CAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCTTTGGAAGACTTTGAAATTGGTCGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTT
TATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTGGCTCTTAAAGTGTTATTTAAAGCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAG
CTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGGCATCCTAATATTCTTAGACTGTATGGTTATTTCCATGATGCTACCAGAGTCTAC
CTAATTCTGGAATATGCACCACTTGGAACAGTTTATAGAGAACTTCAGAAACTTTCAAAGTTTGATGAGCAGAGAACTGCTACTCATTCC
TTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCCGATAAGAAGACTTTGGTGCAGGAGGACAGTGCAGACCTGAAGTGCCAGTTGCAC
TTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAGCTCACTAAGCTTGCCAAGGAGAATGACAGCATGAAGGAGGAGCTGCTGAAGTAC
CGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCAGCCGAGGAGCTGGCCGATGCCCCCCACTCGCGGGAGACCGAGCTGAAGGTGCAC
CTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGCCGCCGCATCGTGGAGCTGGAGGTGGAGAACCGAGGCCTGCGGGCTGAGATGGAC
GACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCTGAGGCACGCCTGGCCTTCTCCGCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTG
GCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAGGAGGCCGAGCTGCTGCGGCGCTCCTCTGCCGAGCTCGAGGACCAGAACAAGCTG
CTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCACGAGCTGGACGTGGCGCTGTCGGAGGACAGTTGTTCTGTGCTCAGCGAACCTTCA
CAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGCGAGCTCAGCGGCAAGGTCAAGAAGCTGCAGTACGAGAACCGCGTGCTCCTCTCC
AACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGTACGCGGCCCATGCTGGAGACGGACGCCGAGGCCGGGGACTCTGCCCAGTGTGTG
CCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCGGTCCGACTCTGCAGAGCCAGGGAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAG
GCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTGGCTGATGCCAATGGCTTCACGGCTGGCCTCCGGCTGTGTCTGGACAACGAGTGT
GCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGCGAGGGCCCCAGGGACACCAAGCTCATCCATGCCATCCTGGTGCGCCTGAGCGTG
CTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCAGATGCAGTCCTCGGGTGCTCTGTCAAGGAACAGCAGGAGTCCTTCTCATCACTG
CCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATTCTTCTGGCAAAAGACCTTGGCTCAGACTTTCAGCCACCTGACTTCAGGGACCTG
CCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGCACTGGTGACTTGGACTCTAAGCCCGACCCCAGCCGGAGCTTCAGGCCTTACCGA
GCTGAAGACAATGATTCCTATGCCTCTGAGATCAAGGAGCTGCAGCTGGTGCTGGCTGAGGCCCACGACAGCCTCCGGGGCTTGCAAGAG
CAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAGGCCGACAATTTCAACCAGAAAATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCG
CTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGCCTCCAGCGGAGGCTGGAGCAGAAATTCTGGAGCCAGGAGAAGAACATGCTGGTG
CAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTGCTCTTCATGAAGCTCAGGTGGTTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTG
CCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGCATGAAGGAGCTGTACTTGCTGATGGAGGAAGAGGAGATAAACGCTCAGCATTCT
GATAACAAGGCCTGCACGGGGGACAGCTGGACCCAGAACACGCCCAATGAGTACATCAAGACACTGGCCGACATGAAGGTGACGCTGAAG
GAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGTCTGACGGAGCTTCAGCAACAGTTTGCCAAGGCCAAGGCTACCTGGGAGACAGAG
CGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAGCTGAAGACAGGGAAGGGGGCCGGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCC
CTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTAGCTGAGTCCTACAGCGCTGTCATGGAGCTGACTCGGCAGCTGCAGATCAGTGAG
CGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAGCGGCTGCAGGGTGAGAAGCAGCAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAAC
CGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGGGTCCTGAAGCACTCGGAGCTGGAGAAGCAGGACAACAGCTGGAAGGAGACACGC
AGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAAGTTGAGCTTGGAGGAAATGGTTTAAAGAGAACCAAATCTGTTTCTTCCATGTCT
GAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTTGCTGGCGGAGATGCCCGGGGCAAGAAGCTGCCTAACAACCCTGCCTTTGGCTTT
GTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACCAAGGAGAAGCCTGGGCTCTCGTCGAGGGACTGCAACCACCTGGGTGCCCTGGCC
TGCCAGGACCCCCCAGGGAGGCAGATGCAGCGCAGCTACACGGCTCCTGACAAGACGGGCATCCGAGTCTACTATAGTCCCCCGGTGGCC
CGGCGCCTCGGAGTCCCTGTGGTTCATGACAAAGAGGGCAAGATCATTATCGAGCCCGGCTTCCTCTTCACCACAGCCAAGCCCAAAGAG
TCGGCCGAGGCTGATGGGCTGGCTGAGAGCTCCTATGGTCGGTGGCTCTGCAACTTCTCACGGCAGCGCCTGGACGGAGGCTCAGCGGGC
AGCCCCTCGGCGGCCGGGCCTGGCTTCCCAGCGGCCCTGCATGACTTTGAGATGTCAGGCAACATGAGTGATGACATGAAGGAGATCACC
AACTGTGTGCGCCAGGCCATGCGCTCCGGCTCACTGGAGAGGAAAGTGAAGAGCACATCCAGCCAGACGGTGGGCCTGGCCAGTGTGGGC
ACACAGACCATCCGCACGGTCAGCGTGGGCCTGCAGACCGACCCACCCCGCAGCAGCCTCCATGGCAAGGCCTGGTCACCCCGCAGCTCT
TCGCTCGTGTCTGTGCGCAGCAAGCAGATCTCCTCCTCCCTGGACAAGGTCCATTCGCGCATCGAGCGGCCCTGCTGCTCCCCCAAGTAT
GGCTCACCAAAGCTCCAGAGGCGGTCTGTGTCCAAGCTGGACAGCAGCAAGGACCGCAGCCTGTGGAACCTGCACCAGGGCAAGCAGAAC
GGCTCGGCCTGGGCCCGCTCCACCACCACGCGGGACAGCCCTGTATTGAGAAACATCAACGATGGACTCTCCAGCCTCTTCAGTGTGGTG
GAGCACTCAGGGAGCACGGAGTCTGTCTGGAAACTAGGCATGTCTGAGACGCGGGCCAAGCCCGAGCCTCCCAAGTACGGCATTGTGCAG
GAATTCTTCCGTAATGTGTGTGGCCGGGCACCGAGCCCCACCTCATCAGCAGGAGAGGAGGGCACCAAGAAGCCAGAGCCCCTCTCCCCA
GCCAGCTACCATCAGCCAGAGGGTGTGGCCAGGATCCTGAACAAGAAGGCAGCCAAGTTGGGCAGCAGTGAGGAGGTCAGACTCACCATG
CTCCCCCAGGTGGGGAAGGATGGTGTCCTCCGGGACGGAGATGGAGCCGTGGTCCTTCCCAATGAGGACGCTGTTTGTGACTGTAGTACC
CAGTCTCTCACCTCCTGCTTCGCCCGATCGTCCCGCTCTGCCATCCGCCACTCTCCTTCCAAGTGCAGGCTGCACCCTTCAGAGTCCAGC
TGGGGTGGGGAGGAGAGGGCACTCCCCCCCAGCGAGTGACAGAGCAGCCAAGCTCCCCGCCTCAACCAGCCCAGCCCCTGGATAGCAGAA
GGGAACCAGCAGAGACGAGACGAGGTGAGGCGAGGGGCTGTGTCCTCAGCATTGCCTGGCCCTGGAGGGACAGCAGTGATGCCACTGCCA
GAATGCAGCTTTCACATCAAGGTAAAGCCGGGTCTCCTGCTGGCCCCTGGGTGGTGAGCTTCGACTTCCCAGGGGAAGGCAGTGAGTGGG
AGAGAGACCAAACCTGGGCTTCCCAAGCATCCACTGAGAGATCTGTCAAGAGCCGATCCCTGGGTCCTAAGAGAGAGCCTTGCCTGGTTC
TGCCCATGCCACCCTCTTGGAAGAGCCCAAGAAGGATACATGTCTGGCCATGCCTTTGGGGAAAAGGAGTCGGAGAGATGTTTCCTGCTG
ACCATCCACCCCTTCATTTGGGAGGAGACACTGCTGAGAAGAACAGGCTTTGCTCTAGGGCTCCATGTTTGGTTCCTGGTGGAGCCCTGT
TGGGCATCATCACCATCACCTCCTTCTCTCCACCACCTCCTCCTCCCAGCCCCACTGCTCTAGATTGCTGGGACACTAGGGAGTATGATA
GGGCAGTAGCCAGGGCCATTGCTTAGTGTCCTGGAGCCCTGGATCTCCCTGCCCATAGCCTGGATGCAGCAAGAGCTGGGAGGCGAAGTG
GAAACATGCAGGGCTCAGGGTTGGGGAGTGATTGCAATTGCCTTCCTTGCCAAAGTGACTTGGGGCCCCAACGTTCCCAGCAGACCCCTT
GAGGACAGAAATAGGTAGAGTCAGTCTCAAGACCTGGTGCATAGATAAATGCCTAAATACACTGCCTTGATCTCAAGTGATCTCAGAGGC
CTCTTTCCCTGGCACCCTGAGAGGCAGCAGGCACTACATCTCCACTGTGTTTACATCCTGCAGCTGGTCGGGGGGCAAAGATATTCCCAG
TAAGAGATTCTTGGTTGGCCAGGTCAGGCCCAGGAGAACACCAAGAGGCCAGAGCCCAGGACACAGCAGTAGACTGGGGCCTGGAAACAC
GTATCTTGCCTAGATTGTTTATTTGAATTTTTCCTACTATAAATATTTAAGGTGGTTTACTTTATTTTAATAATTTAATTTACCCCAAAG
TCCCTAAGGTAATTTATTGGAGGTTGAAACATGCATTCTTGCCACTGGGACAACATGAGGCCTCTAACAGCACGGGCAGGCATGGGGTCC
CCTGGGTGGACGAGGCCGCTTGGCAGCCAGGTTTGGAGACCTGGCCTCCTGGTCAGCTTTGGAGGGCCCCTCAACAGAGCTGGAGCCCTG
CACCCCAACACGGCTGGCCATGTGGCCTCAGAACACTACTTATTACTCAATGCCTGGTACTTGGCCAGCCCCAGCGGTCAGTCTATAAAT
ACTCACTGACAAGGTGGAGGGCTGGACGGCCATCACCACTCCCCAGACGTTCTCCATTGCCTGTCTCATTTCCCTCCCTCCCTCTGACAC
CTTTCTTCATGAGTCGAACGTGGATTACTAAAGCTCTATTAAGAGTGTGGAGATCCCTCCAAGTTTCCCAAATGAGAACTCACAGGAAAA
CAGGACTGAACTTTGAGAATGTTGTTTATCGCAGCTTTGCACATAAACCTGAGTGTCTCCCAGCCTGCCTCGGTTCTCACCAGCCTGCCA
GCCTTTTCACCAGCCTCTCTCCTTAGCCTTATGGCCTTTCACGGCTCTTCTCCCTGCCCCAGCTCTGCTGCCCGCCCTTCCTCACGTCCC
CTGTGAGCTGCCTGAGCCATTGGTTGGATTTCGATGTGGCTCATTGCAGCATGTGGGGCAGCGCCTCCCATGGCCTCGCCTTGGTGCCGG
TGAACCCCTTTTGGTTGCACACATGCTCCCCACACACACATAGACATCAGCCTTCCTGTAACTGACTGGGGACCCAGAGTGGAAACACCA
GGATGGATCAGCTTGTCTGCAGAATTGCCCATCAGGAAGACCAAAAGCCAGTAGCTTTGCTGATCTGCCCCCAGGACTCTGGAGGCGCCC
CTGCACTCCCACCTCCCACCTGCCAGTTCCCAGACCCACCCATTCGGGATCACCTGGACCAGTTACCCCAAGTCCTGCATCTCCCTTCCC
TGCAGGCTGAACACCAGGGTCATGCCAGTCCCGCCAGCCGCCTCCTCCATGCCCCAGTGACTGGTGTGGGCAGAGCAGGCAGCCAGTGGA
GCTGTGGGCCAGTTCCGCTCTTGGATGCTGCTGCTCTCACCCATGAGGTCAGGGGGGCCCTCCAAGGTTATCTCCAGGTGAGGGGATTCA
CATCAGGCCACAAGCCACCAGAGGCCTTCTGCCACCTCCCAGAGCGACAGCCAGGGAGGCTGCGTACTCAGCCTCGGGGAGAAATCCCCG
TGGGACCTGAGCCCCAAGACCTACGGACCACTCAGCCTTACCATCGTACCGTCCAGGATTGTCCTTGCCATCTTTGTTGTCTCAGCCAGA
CCTTGGTTTTCAGTAAAGCCCCAGTTTCTACTTCCTGCATGCCACTGTGCAAGGCCACTCATCACTGTTCCTGCAGAAGCCTCTGGACGT
GGGGCTGGATGGGGTTGAAAATGTTACATGTAAATATTGGTTTGGTTCGGTTTTTAGCATTTTACTTGGTAACTGGTTGTTTTCTTTTTT
GGGGTGGGGGGATTGGTTTGTAAAAATTCTCTACTCTTTTGGAATGTGATTTCTAAGTTTGTTGGTTTCTTCAAATGCCTTTTAAGTCTT
GGTAACATTCCCAAAGCAGAAAACTGCCTGACCCACAGTGGGGATTCCCTGGAGAATTGGGGTCCCAAGAAGGAATGCTGCCCTTCTCGA
ACCCGTTCTCCCCCTTCCTCCTGCCTCTCTGCCTTTTACTGCTATTCCCTTCTTCTCCTCCTTTATCCTTCTTTCTGTTTTCCCCATCTC
CACTCTCTCTTCAACCAAAGTCCCAAGGAACCCTCGGGGCTCAATCCCCCATAGACCACTTGGCTTGGGTCCATGGGGTTGGCATCAGTT
GGTTGGCGGAAATGGGGGACCAGTTGGCATGATGGCCCTAAACTGGGAAACCTCATGTTTCTTATGTCTCACCTCTTTCCAGAGCCAAAT
CAGCCCCTTTTGGAATGATGACTTCATTGGAATGCAAATCAAGTCATTTTGGTGCATCAGTGGCTCTTAGGCCTGCACACACGAGACATC
AGAATCCAATCCTCTGACCCTGTGCCAGCCCTTTCCCCCAGTTTATTTCCCACCAAAGGCTGACCTCTAAGAGGTCTTGCTTTCTATGAA
CTCAAGATGGGTCCCACCTCTAGGTGTCCCCAGGTGCACTCTTCTACCGGTTGGCTTCCGATGTGACAAGGCCAAGGGCCCAAAGACTTG
ACCCTCTTACACCCTTGCTGACATGGTTCCATCATGTCCACCCGCATGCACTTTTATGGTTTCATCACCCAGCCTCTTCTCCTCTGGCCC
ACCCAGCGTCCAGGCTCTTTCTCCCTCTCCCCTCCTATCTAGAATGTCCCCTGCTTCTAGCCTCACCAGACCCCCCAAGCTCCCACTACT
TCTTCCATAATAATAGTAATAACAATGGTTATCATCATCCCCTGCACATCCCGCCTAAAGCACTTTACTATATAGAAAACGTTTCCCCTG
GCCGGGCATGGTGGCTCACGCCTGAAATCCCAGCTCTTTGGGAGGCTGAGGCGAGCGGATCAGTTGAGGTCAGGAGTTCAACGCCAGCCT
GGCCAACGTGGTGAATCCCTGTCTGTACTAAAAGTACAAAAAATTAGCTGAGCATGGTGGTGCGTGCCTGTAATCCCAGCTACTCGGGAG
GCTGAGGTGGGAGAATCGCTTGAGCCCAGGAGGCGGAGGTTGCAGGAGCAGAGATTGCGCCACTGCACTCCAGCCTGGATGACAGAGTGA
GACCCAATCTCAAAAAAGAAATCGTTTCCCACCCCACATCTCCTTCAGACCTCTCAGGGACAACTCTGGGAGGCAGCCTTGGCAGGACAT
GGGTTAGTGCGCCCATTTTGCTGTGAGGAAACTGAGGTACAGGTCTCATCCCAGAGCATGAGAAGTCACTGAGTTTAGATGAGAACTTGG
GTCCAACTCTGTCCTGTTTGCTGTGCAAATCCGCTGCCCTGCTGGGGGCTTTTGGTGGGTCCAGAATACCCAGAATATGCTGCTGGCCAA
CCCAGGCATAAAACAAGTCCATTCTAGATCACTGAGCCTTGTGTATTCCAGAGGGTGATCTGAGGTCCCCATTCAGCAGAATTCTCTGAG
GGCATGTTCAGAATGTAGATTCCTGGGCCCCACCTTGAATTTGCATGTTTAACAAACTCTCCTGGGGTTGAGGGGTGGGTGCAGTGGTCA
CACCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGTGGATCACTTGAGCTCAGGAGTTCGAGCCTGGCCAATATGGTGAAACCCTGTC
TCTACTAAAAATGCAAAAATTAGCCAGGTGTGGTGGCACATGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCACTTGAA
CCTGGTGGGGAGCAGCGGTTGCAGTGAGCCGAGATTGTGCCATTGCGCTCTAGCCTGGGCAACGGAATGAGACTTGTCTCAAAAAAAATA
AAAAATAAAACCAGCCCTCCCCGGGGGATCTTAGGCACTATTGGCCACACCATTGGTGTTCGTGGCCCTGATTATTAGGCTATCTTTCTT
TTTTTAAGTTTTTTTAGATTTATTTTTTATTTTATTTATTTATTTATTTATTATTTATTATTATTATTTTTAGAGACAGGGGGTCTCCCT
ATGTTGCCCAGGCTGGTTTCAAACTCCTGGGCTCAAGTGATCTGCCCTCCTCAGCCTCCCAAAGTGCTGGGGTTACAGGCAGGCATCAGC
CACCGTGCCAGGTCATCTTCCTTTTTCTTTTTTTTTTGGAGACAGAGTCTTGCTCTGTCATTCAGGCTGGAGTACAGTGGCGCCATTTCA
GCTCACCGCAGCCTCCACCTCCCAGGTTCAAGCAGTTCTCCTGCCTCAACCTCCCGAGTAACTGGGATTACAGGTGTGTGCCACCACACC
CGACTAATTTTTTTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCACCCACC
TCGGCCTCCCAAAGTACTGGGATTATAGGCGTGAGCCACGGCATCCAGCCTCATCTTTCTTTTAACCAATAAACATGATGCTGTATCTTA
AAAAGAGCACTGAGCAGGGACTTAAGGGATCGAGTCCTCAACCAAACTGATTTAATTACTCAGGATTTTCAAAAAGCATCAGAGGCTATT
TACAATCTTAATCATAGGGGTTCAGTAAAATAAAAATAAGAAGTAAAAAAGCAAGAGAAATTATTCTGTAAATCTAACTGGTGTAATTCC
CATAATCATGCAATTAAGTTTTACTCTTGAGTTTCCTGACAGCCATTGGTAAAAAGAGAAACACATCAGGATTTATAATTTTTATCATCC
AATTATGGGAAGCAAGCATGTTGGCCCCAGGAGACGAACTCTTCTACTAATTTATAGCATGTATCCTTTCATCAAGGGCCACTGAGCCAG
TTGGTGAGTCAACGGGTGAACCTAAGATGCAAGGATGTTTTCCAGGTGACTATTTAAAAAAATAAATAATCCACCAACGTGATTGACCTT
GGCGAGATCATGTTTCTAGTCTATACCTCAGTTTCCCCATCTGTAAAGTGAGGATAATGTCCCACCCCATGTAACTGTGGTGAGGACCAA
CTGCAACACTGTGCCTGCGAGTCTCCTTGGAAAAGTGTAAGGTTCTACACAAATGGAAAGTGATCTGATCACACTCAGTGTCCCCAGCCC
AGCCTTTCAGTGCCCTGGCCCTGGGGTGGGGGACAATACTCTCCTCACCCCCTTCACTAGTCTTCATGAATAGCAAGGAGGCCATAACAT
AATTTGGTCTAAACCCCTTCCTTTTTAAAAGAATGATGGCAAAATGTGCATAACATTAAATTTACCATTTTAAAGATTTCTAAGTGTCTC
GAAGTACATTTGCAATGTGTAACTGCCACCTCCAGAACTTTTTCATCATCCTAAACTGAAACTCTGTACCCACTAAACAGTAACTCCCCA
CTCCCCCTGTCCCCAGTCCCTGGTAACCTCTATTCTACTTTTTTTTTTTCTCTGTGGATTGGCCTATTCTAGGTATCTCTTTTAAGTAGA
ATCATATAGTATTTGTCCTTTTGTGTCCAGCCCCCTCATTTTTTGAGATGAGGAATCAGGCCCAGAGAAGGCAGTGGCTCACCCAAGGTC
ACATCGCAAACCAGAGGCAGAGCCAAGACCAGAACCCAGGTTTCCTGACTCCTAACAGGTCTTGCATCCCCAGTAGTGAATGAGACACTG
CTTTGACTTTCTGTAATCTTGGTTTAGCCCCTTCCTTTCTCTGGGCTCAGTCTGCTTCCCACCACTCACCCATTCTTTCTTTATCATCTC
CCTCCAAAGCCTCTTGTCCTCCTGCCTCCTCTTCTCCTTGGCTGGTTCCTGCCAGCAAATGGAGTGGCCCTGGTCCATGCCAGGTTTTCC
TCTTCTGGGTCCGGAGCTCACTATAGTATTCAGCCCTCAGTCCTCCCAGGATGTTCTTCCCAGCACTGCTGCCTCGTGCGGATTTTCCCG
TAACCTCAGTAACTGGCTTCTTGTCCCCCTGCTTCCTACCAGGGAAGCCTTCCTGTCCGCGTCTGTGGTTTCCCAACTCTGATATTTGCT
CTCAAATGTGGTGGTGTCCTGGTTCTGTGTTTATTTATTTTGTGTTTTCTCACACACACAAAAAAAAGCCTCTTCCTCCAATGCATCAGG
AGGCACCAGCCCTGCCAGCCCTTCTCACTGGGCTCACCCTGCCCCAGCAACCCCCCGGTGCCACCCTTAAGCCACACTGTCTTCTCTGTA
AGCAGCCTGCCAGCAGCAGCCCCAGCACTTTGCAATGGGCGTGTGTGTGGTGGTGGGTGGGGGGGGCTTGGATCCCTCCTTTTTCCTCCC
CTGCCCTGCCCAGGCCCAGATGGCCTTGACTGTAAAGCAGGTGCTGCCTGACAGGTTCTTCTCTCCCTGTCTCTGGTCATTGATCCATCT
CTTTGTCCATTCAGTATCCAACCATCCTCTCCATTCTCCTCTGGACCTCACCACTCTCAGAGCTGCTTGTCCTGGCAGAATCTACAGTTC
ACCCCAACTCTATGCCTTACCCCTCCCAACCCAACAGCATTTGCAGTTTGCAAAATATACAGACCCAAGTCCTGAGGGGACTGAGGACAT
GATGCTGGGCCCAAGTCTCCTGCTCAGGGCTTCTCTCCAATGCCAGCCCTGCCACTCCTTCCTCACCCTCCTTGGAGCCTCCTCTGCTGC
TTGTCTATCCCAACGGCCCTGCTCCCCTCCCTTCCTGCCCTTCACCAGCTTTCTGGACACCATGCCCTGAGGAAGGGACCTTTGGTTTTC
TCTAAACATCTTTGAAGGGCTGAGGCAGTCAGGGCTGGCTGCCTTGTCACTCTTTATTTGGAAGCCACTCAAACCATTCCCAAGAAGAGG
GACCTCAGCTGGCAATCTGGAAACCTGGCCCAGGTCTGGGCAGATGTCTTCACTTCTCCTACCTTCCCAGTCTTGTGATCCTGTGATGAG
CACCAGGATGGCCCTGTGGTCCCTAGAGCACCCCTCATGCTGTAGGGTCCTGCAGCCCCATCCTTTCTCTACTGGGCCCTGGTATCCTGG
CTCCTCTCTCAGCTCTGCCACTGATCTCTGTGCCTTAGTTTACTTCTCTGCACGGGGGACTCACCCCAAGACCATTTCCAGCAGCTTCCC
AGGTGATGTGGTGCCCCAAGGCTGGGCTTTGCCAGCTGTGGCCCAGCTCCTTAGTGCTGCCCAGGAGACACCAGGCTGCTCAGAATGAGG
TGACTGCGGGCACCATTCTCAGCCAGTGGTTCTTGTATTGCATTCCAGCAGCAGGAATATCACCTGGGAACTTGATAGAAGTGCAGATTA
GCAGCCCCACCCAAGACCCACTGAATTAGAGCTTGTGGAGTGGGGCCCTACAAGCTGGGGTTTTAAGGAGCCCTCCAAATGATTCTGACG
CATAAGAATATGCCAACTGCTGATCTGGGCTAGCCATTAGTAGAGCCTGGGGAGGGACTGGGACTGGCTAGGCCAAGAACAGGTGGAAAA
CACCAGCCTTATCTGGACTCCTGAGATTGGGAACCACCACCAACAAAAACCAACCCTATAGTCGCTCCTCTTGGAAGAGGAAGAGAAGTT
GAAGGGCCTGGAGAAAGCACACATTGTTTGTTTCCCTGCTCCTGCTCACCTCTCTCACTTGTCTTGGTTTTACAAAAGGCTGTGTGGATG
GTGCCAGCCAGGGAGGGGGTGGGAGTCCTGGGGAGGCAGGAGGCAGAAGACCCTGACTGTTTCTCCCTTGGGAACCTCACCATAGGCCAG
ATAGCGCCTCTTCAAACTGAAAGAAATCTTAACTCCACAAAGAAAGCATCCTAAATCCCCAGTTCCTCCTCCTCCCAACCCCAGGGATAC
CTTGTAGACAGTGCCAAAAAACAGCTCCAACCCCCAGCAGCTGGGAAGAGAGCCAGAAGCTGCCCTTCCTCCTCATCCTGGCCCCTCCCC
AGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCAGCCTGGTTTCCTGGTGAGGGTCCTGCAGTCATGGGCCCTGGGGGACCCCCAGGG
CAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCTTGGGCCAACTGTCCACCTCTCTTGTCCACTATTCTCTCCTTTCCACTTCTGTCT
TCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAGGACAACTGCAGTCCAATCCACCAGCTCTCCCTGCCCCTGTGTCTTATTTCAGAC

>8490_8490_5_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000395913_SOGA1_chr20_35445872_ENST00000237536_length(amino acids)=1549AA_BP=245
MGRRWEPTGIMDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSNSSQRIPLQAQKLVSSHKPVQNQKQKQLQ
ATSVPHPVSRPLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLE
KAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATHSLKKRGTRSLGKADKKTLVQEDSA
DLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENR
GLRAEMDDMKDHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELAKFRSEHELDVALSEDSC
SVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEV
LPGLREQAALVSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELNAFTRKADAVLGCSVKEQ
QESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHD
SLRGLQEQLSQERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQFKHNFLLLFMKLRWFLKR
WRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKA
KATWETERAELKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQEKLQLVERLQGEKQQVEQ
QVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLP
NNPAFGFVSSEPGDPEKDTKEKPGLSSRDCNHLGALACQDPPGRQMQRSYTAPDKTGIRVYYSPPVARRLGVPVVHDKEGKIIIEPGFLF
TTAKPKESAEADGLAESSYGRWLCNFSRQRLDGGSAGSPSAAGPGFPAALHDFEMSGNMSDDMKEITNCVRQAMRSGSLERKVKSTSSQT
VGLASVGTQTIRTVSVGLQTDPPRSSLHGKAWSPRSSSLVSVRSKQISSSLDKVHSRIERPCCSPKYGSPKLQRRSVSKLDSSKDRSLWN
LHQGKQNGSAWARSTTTRDSPVLRNINDGLSSLFSVVEHSGSTESVWKLGMSETRAKPEPPKYGIVQEFFRNVCGRAPSPTSSAGEEGTK
KPEPLSPASYHQPEGVARILNKKAAKLGSSEEVRLTMLPQVGKDGVLRDGDGAVVLPNEDAVCDCSTQSLTSCFARSSRSAIRHSPSKCR

--------------------------------------------------------------
>8490_8490_6_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000395913_SOGA1_chr20_35445872_ENST00000279034_length(transcript)=3913nt_BP=894nt
CTTAAACGCGACTCAAGGCGTCGGGTTTGTTGTCAACCAATCACAAGGCAGCCTCGCTCGAGCGCAGGCCAATCGGCTTTCTAGCTAGAG
GGTTTAACTCCTATTTAAAAAGAAGAACCTTTGAATTCTAACGGCTGAGCTCTTGGAAGACTTGGGTCCTTGGGTCGCAGGTGGGAGCCG
ACGGGCATCATGGACCGATCTAAAGAAAACTGCATTTCAGGACCTGTTAAGGCTACAGCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTG
ACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTAAATAGTGGCCAGGCTCAGCGGGTCTTGTGTCCTTCAAATTCTTCCCAGCGCATT
CCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAGCCGGTTCAGAATCAGAAGCAGAAGCAATTGCAGGCAACCAGTGTACCTCATCCT
GTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCTGAAAATAATCCTGAGGAGGAACTGGCATCAAAA
CAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCTTTGGAAGACTTTGAAATTGGTCGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTT
TATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTGGCTCTTAAAGTGTTATTTAAAGCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAG
CTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGGCATCCTAATATTCTTAGACTGTATGGTTATTTCCATGATGCTACCAGAGTCTAC
CTAATTCTGGAATATGCACCACTTGGAACAGTTTATAGAGAACTTCAGAAACTTTCAAAGTTTGATGAGCAGAGAACTGCTACTCATTCC
TTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCCGATAAGAAGACTTTGGTGCAGGAGGACAGTGCAGACCTGAAGTGCCAGTTGCAC
TTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAGCTCACTAAGCTTGCCAAGGAGAATGACAGCATGAAGGAGGAGCTGCTGAAGTAC
CGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCAGCCGAGGAGCTGGCCGATGCCCCCCACTCGCGGGAGACCGAGCTGAAGGTGCAC
CTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGCCGCCGCATCGTGGAGCTGGAGGTGGAGAACCGAGGCCTGCGGGCTGAGATGGAC
GACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCTGAGGCACGCCTGGCCTTCTCCGCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTG
GCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAGGAGGCCGAGCTGCTGCGGCGCTCCTCTGCCGAGCTCGAGGACCAGAACAAGCTG
CTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCACGAGCTGGACGTGGCGCTGTCGGAGGACAGTTGTTCTGTGCTCAGCGAACCTTCA
CAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGCGAGCTCAGCGGCAAGGTCAAGAAGCTGCAGTACGAGAACCGCGTGCTCCTCTCC
AACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGTACGCGGCCCATGCTGGAGACGGACGCCGAGGCCGGGGACTCTGCCCAGTGTGTG
CCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCGGTCCGACTCTGCAGAGCCAGGGAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAG
GCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTGGCTGATGCCAATGGCTTCACGGCTGGCCTCCGGCTGTGTCTGGACAACGAGTGT
GCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGCGAGGGCCCCAGGGACACCAAGCTCATCCATGCCATCCTGGTGCGCCTGAGCGTG
CTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCAGATGCAGTCCTCGGGTGCTCTGTCAAGGAACAGCAGGAGTCCTTCTCATCACTG
CCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATTCTTCTGGCAAAAGACCTTGGCTCAGACTTTCAGCCACCTGACTTCAGGGACCTG
CCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGCACTGGTGACTTGGACTCTAAGCCCGACCCCAGCCGGAGCTTCAGGCCTTACCGA
GCTGAAGACAATGATTCCTATGCCTCTGAGATCAAGGAGCTGCAGCTGGTGCTGGCTGAGGCCCACGACAGCCTCCGGGGCTTGCAAGAG
CAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAGGCCGACAATTTCAACCAGAAAATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCG
CTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGCCTCCAGCGGAGGCTGGAGCAGAAATTCTGGAGCCAGGAGAAGAACATGCTGGTG
CAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTGCTCTTCATGAAGCTCAGGTGGTTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTG
CCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGCATGAAGGAGCTGTACTTGCTGATGGAGGAAGAGGAGATAAACGCTCAGCATTCT
GATAACAAGGCCTGCACGGGGGACAGCTGGACCCAGAACACGCCCAATGAGTACATCAAGACACTGGCCGACATGAAGGTGACGCTGAAG
GAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGTCTGACGGAGCTTCAGCAACAGTTTGCCAAGGCCAAGGCTACCTGGGAGACAGAG
CGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAGCTGAAGACAGGGAAGGGGGCCGGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCC
CTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTAGCTGAGTCCTACAGCGCTGTCATGGAGCTGACTCGGCAGCTGCAGATCAGTGAG
CGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAGCGGCTGCAGGGTGAGAAGCAGCAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAAC
CGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGGGTCCTGAAGCACTCGGAGCTGGAGAAGCAGGACAACAGCTGGAAGGAGACACGC
AGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAAGTTGAGCTTGGAGGAAATGGTTTAAAGAGAACCAAATCTGTTTCTTCCATGTCT
GAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTTGCTGGCGGAGATGCCCGGGGCAAGAAGCTGCCTAACAACCCTGCCTTTGGCTTT
GTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACCAAGGAGAAGCCTGGGCTCTCGTCGAGGGACTGCAACCACCTGGGTGCCCTGGCC
TGCCAGGACCCCCCAGGGAGCCAGAAGCTGCCCTTCCTCCTCATCCTGGCCCCTCCCCAGCCCCCGCCAATACTGTGAACCCCCTTCCCA
CTCAGCCTGGTTTCCTGGTGAGGGTCCTGCAGTCATGGGCCCTGGGGGACCCCCAGGGCAAGGCCCATGGGAGGGAAGGGACCAAGGGCA
TCCTTGGGCCAACTGTCCACCTCTCTTGTCCACTATTCTCTCCTTTCCACTTCTGTCTTCAAAAGGCTCCTTCCTAGGATGGATCGGGTG
CTAGGACAACTGCAGTCCAATCCACCAGCTCTCCCTGCCCCTGTGTCTTATTTCAGACATGAGAATAACTGTACAGTGTAAACTTATAAA

>8490_8490_6_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000395913_SOGA1_chr20_35445872_ENST00000279034_length(amino acids)=1142AA_BP=245
MGRRWEPTGIMDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSNSSQRIPLQAQKLVSSHKPVQNQKQKQLQ
ATSVPHPVSRPLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLE
KAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATHSLKKRGTRSLGKADKKTLVQEDSA
DLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENR
GLRAEMDDMKDHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELAKFRSEHELDVALSEDSC
SVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEV
LPGLREQAALVSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELNAFTRKADAVLGCSVKEQ
QESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHD
SLRGLQEQLSQERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQFKHNFLLLFMKLRWFLKR
WRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKA
KATWETERAELKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQEKLQLVERLQGEKQQVEQ
QVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLP

--------------------------------------------------------------
>8490_8490_7_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000312783_SOGA1_chr20_35445872_ENST00000237536_length(transcript)=13906nt_BP=948nt
ACAAGGCAGCCTCGCTCGAGCGCAGGCCAATCGGCTTTCTAGCTAGAGGGTTTAACTCCTATTTAAAAAGAAGAACCTTTGAATTCTAAC
GGCTGAGCTCTTGGAAGACTTGGGTCCTTGGGTCGCAGGGTCTCACTCCATTGCCCAGGCCAGAGTGCGGGGATATTTGATAAGAAACTT
CAGTGAAGGCCGGGCGCGGTGGCTCATGCCCGTAATCCCAGCATTTTCGGAGGCCGAGGCATCATGGACCGATCTAAAGAAAACTGCATT
TCAGGACCTGTTAAGGCTACAGCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTGACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTA
AATAGTGGCCAGGCTCAGCGGGTCTTGTGTCCTTCAAATTCTTCCCAGCGCATTCCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAG
CCGGTTCAGAATCAGAAGCAGAAGCAATTGCAGGCAACCAGTGTACCTCATCCTGTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAG
CAGCCCCTGCCATCGGCACCTGAAAATAATCCTGAGGAGGAACTGGCATCAAAACAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCT
TTGGAAGACTTTGAAATTGGTCGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTTTATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTG
GCTCTTAAAGTGTTATTTAAAGCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAGCTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGG
CATCCTAATATTCTTAGACTGTATGGTTATTTCCATGATGCTACCAGAGTCTACCTAATTCTGGAATATGCACCACTTGGAACAGTTTAT
AGAGAACTTCAGAAACTTTCAAAGTTTGATGAGCAGAGAACTGCTACTCATTCCTTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCC
GATAAGAAGACTTTGGTGCAGGAGGACAGTGCAGACCTGAAGTGCCAGTTGCACTTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAG
CTCACTAAGCTTGCCAAGGAGAATGACAGCATGAAGGAGGAGCTGCTGAAGTACCGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCA
GCCGAGGAGCTGGCCGATGCCCCCCACTCGCGGGAGACCGAGCTGAAGGTGCACCTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGC
CGCCGCATCGTGGAGCTGGAGGTGGAGAACCGAGGCCTGCGGGCTGAGATGGACGACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCT
GAGGCACGCCTGGCCTTCTCCGCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTGGCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAG
GAGGCCGAGCTGCTGCGGCGCTCCTCTGCCGAGCTCGAGGACCAGAACAAGCTGCTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCAC
GAGCTGGACGTGGCGCTGTCGGAGGACAGTTGTTCTGTGCTCAGCGAACCTTCACAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGC
GAGCTCAGCGGCAAGGTCAAGAAGCTGCAGTACGAGAACCGCGTGCTCCTCTCCAACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGT
ACGCGGCCCATGCTGGAGACGGACGCCGAGGCCGGGGACTCTGCCCAGTGTGTGCCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCG
GTCCGACTCTGCAGAGCCAGGGAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAGGCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTG
GCTGATGCCAATGGCTTCACGGCTGGCCTCCGGCTGTGTCTGGACAACGAGTGTGCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGC
GAGGGCCCCAGGGACACCAAGCTCATCCATGCCATCCTGGTGCGCCTGAGCGTGCTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCA
GATGCAGTCCTCGGGTGCTCTGTCAAGGAACAGCAGGAGTCCTTCTCATCACTGCCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATT
CTTCTGGCAAAAGACCTTGGCTCAGACTTTCAGCCACCTGACTTCAGGGACCTGCCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGC
ACTGGTGACTTGGACTCTAAGCCCGACCCCAGCCGGAGCTTCAGGCCTTACCGAGCTGAAGACAATGATTCCTATGCCTCTGAGATCAAG
GAGCTGCAGCTGGTGCTGGCTGAGGCCCACGACAGCCTCCGGGGCTTGCAAGAGCAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAG
GCCGACAATTTCAACCAGAAAATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCGCTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGC
CTCCAGCGGAGGCTGGAGCAGAAATTCTGGAGCCAGGAGAAGAACATGCTGGTGCAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTG
CTCTTCATGAAGCTCAGGTGGTTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTGCCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGC
ATGAAGGAGCTGTACTTGCTGATGGAGGAAGAGGAGATAAACGCTCAGCATTCTGATAACAAGGCCTGCACGGGGGACAGCTGGACCCAG
AACACGCCCAATGAGTACATCAAGACACTGGCCGACATGAAGGTGACGCTGAAGGAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGT
CTGACGGAGCTTCAGCAACAGTTTGCCAAGGCCAAGGCTACCTGGGAGACAGAGCGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAG
CTGAAGACAGGGAAGGGGGCCGGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCCCTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTA
GCTGAGTCCTACAGCGCTGTCATGGAGCTGACTCGGCAGCTGCAGATCAGTGAGCGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAG
CGGCTGCAGGGTGAGAAGCAGCAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAACCGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGG
GTCCTGAAGCACTCGGAGCTGGAGAAGCAGGACAACAGCTGGAAGGAGACACGCAGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAA
GTTGAGCTTGGAGGAAATGGTTTAAAGAGAACCAAATCTGTTTCTTCCATGTCTGAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTT
GCTGGCGGAGATGCCCGGGGCAAGAAGCTGCCTAACAACCCTGCCTTTGGCTTTGTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACC
AAGGAGAAGCCTGGGCTCTCGTCGAGGGACTGCAACCACCTGGGTGCCCTGGCCTGCCAGGACCCCCCAGGGAGGCAGATGCAGCGCAGC
TACACGGCTCCTGACAAGACGGGCATCCGAGTCTACTATAGTCCCCCGGTGGCCCGGCGCCTCGGAGTCCCTGTGGTTCATGACAAAGAG
GGCAAGATCATTATCGAGCCCGGCTTCCTCTTCACCACAGCCAAGCCCAAAGAGTCGGCCGAGGCTGATGGGCTGGCTGAGAGCTCCTAT
GGTCGGTGGCTCTGCAACTTCTCACGGCAGCGCCTGGACGGAGGCTCAGCGGGCAGCCCCTCGGCGGCCGGGCCTGGCTTCCCAGCGGCC
CTGCATGACTTTGAGATGTCAGGCAACATGAGTGATGACATGAAGGAGATCACCAACTGTGTGCGCCAGGCCATGCGCTCCGGCTCACTG
GAGAGGAAAGTGAAGAGCACATCCAGCCAGACGGTGGGCCTGGCCAGTGTGGGCACACAGACCATCCGCACGGTCAGCGTGGGCCTGCAG
ACCGACCCACCCCGCAGCAGCCTCCATGGCAAGGCCTGGTCACCCCGCAGCTCTTCGCTCGTGTCTGTGCGCAGCAAGCAGATCTCCTCC
TCCCTGGACAAGGTCCATTCGCGCATCGAGCGGCCCTGCTGCTCCCCCAAGTATGGCTCACCAAAGCTCCAGAGGCGGTCTGTGTCCAAG
CTGGACAGCAGCAAGGACCGCAGCCTGTGGAACCTGCACCAGGGCAAGCAGAACGGCTCGGCCTGGGCCCGCTCCACCACCACGCGGGAC
AGCCCTGTATTGAGAAACATCAACGATGGACTCTCCAGCCTCTTCAGTGTGGTGGAGCACTCAGGGAGCACGGAGTCTGTCTGGAAACTA
GGCATGTCTGAGACGCGGGCCAAGCCCGAGCCTCCCAAGTACGGCATTGTGCAGGAATTCTTCCGTAATGTGTGTGGCCGGGCACCGAGC
CCCACCTCATCAGCAGGAGAGGAGGGCACCAAGAAGCCAGAGCCCCTCTCCCCAGCCAGCTACCATCAGCCAGAGGGTGTGGCCAGGATC
CTGAACAAGAAGGCAGCCAAGTTGGGCAGCAGTGAGGAGGTCAGACTCACCATGCTCCCCCAGGTGGGGAAGGATGGTGTCCTCCGGGAC
GGAGATGGAGCCGTGGTCCTTCCCAATGAGGACGCTGTTTGTGACTGTAGTACCCAGTCTCTCACCTCCTGCTTCGCCCGATCGTCCCGC
TCTGCCATCCGCCACTCTCCTTCCAAGTGCAGGCTGCACCCTTCAGAGTCCAGCTGGGGTGGGGAGGAGAGGGCACTCCCCCCCAGCGAG
TGACAGAGCAGCCAAGCTCCCCGCCTCAACCAGCCCAGCCCCTGGATAGCAGAAGGGAACCAGCAGAGACGAGACGAGGTGAGGCGAGGG
GCTGTGTCCTCAGCATTGCCTGGCCCTGGAGGGACAGCAGTGATGCCACTGCCAGAATGCAGCTTTCACATCAAGGTAAAGCCGGGTCTC
CTGCTGGCCCCTGGGTGGTGAGCTTCGACTTCCCAGGGGAAGGCAGTGAGTGGGAGAGAGACCAAACCTGGGCTTCCCAAGCATCCACTG
AGAGATCTGTCAAGAGCCGATCCCTGGGTCCTAAGAGAGAGCCTTGCCTGGTTCTGCCCATGCCACCCTCTTGGAAGAGCCCAAGAAGGA
TACATGTCTGGCCATGCCTTTGGGGAAAAGGAGTCGGAGAGATGTTTCCTGCTGACCATCCACCCCTTCATTTGGGAGGAGACACTGCTG
AGAAGAACAGGCTTTGCTCTAGGGCTCCATGTTTGGTTCCTGGTGGAGCCCTGTTGGGCATCATCACCATCACCTCCTTCTCTCCACCAC
CTCCTCCTCCCAGCCCCACTGCTCTAGATTGCTGGGACACTAGGGAGTATGATAGGGCAGTAGCCAGGGCCATTGCTTAGTGTCCTGGAG
CCCTGGATCTCCCTGCCCATAGCCTGGATGCAGCAAGAGCTGGGAGGCGAAGTGGAAACATGCAGGGCTCAGGGTTGGGGAGTGATTGCA
ATTGCCTTCCTTGCCAAAGTGACTTGGGGCCCCAACGTTCCCAGCAGACCCCTTGAGGACAGAAATAGGTAGAGTCAGTCTCAAGACCTG
GTGCATAGATAAATGCCTAAATACACTGCCTTGATCTCAAGTGATCTCAGAGGCCTCTTTCCCTGGCACCCTGAGAGGCAGCAGGCACTA
CATCTCCACTGTGTTTACATCCTGCAGCTGGTCGGGGGGCAAAGATATTCCCAGTAAGAGATTCTTGGTTGGCCAGGTCAGGCCCAGGAG
AACACCAAGAGGCCAGAGCCCAGGACACAGCAGTAGACTGGGGCCTGGAAACACGTATCTTGCCTAGATTGTTTATTTGAATTTTTCCTA
CTATAAATATTTAAGGTGGTTTACTTTATTTTAATAATTTAATTTACCCCAAAGTCCCTAAGGTAATTTATTGGAGGTTGAAACATGCAT
TCTTGCCACTGGGACAACATGAGGCCTCTAACAGCACGGGCAGGCATGGGGTCCCCTGGGTGGACGAGGCCGCTTGGCAGCCAGGTTTGG
AGACCTGGCCTCCTGGTCAGCTTTGGAGGGCCCCTCAACAGAGCTGGAGCCCTGCACCCCAACACGGCTGGCCATGTGGCCTCAGAACAC
TACTTATTACTCAATGCCTGGTACTTGGCCAGCCCCAGCGGTCAGTCTATAAATACTCACTGACAAGGTGGAGGGCTGGACGGCCATCAC
CACTCCCCAGACGTTCTCCATTGCCTGTCTCATTTCCCTCCCTCCCTCTGACACCTTTCTTCATGAGTCGAACGTGGATTACTAAAGCTC
TATTAAGAGTGTGGAGATCCCTCCAAGTTTCCCAAATGAGAACTCACAGGAAAACAGGACTGAACTTTGAGAATGTTGTTTATCGCAGCT
TTGCACATAAACCTGAGTGTCTCCCAGCCTGCCTCGGTTCTCACCAGCCTGCCAGCCTTTTCACCAGCCTCTCTCCTTAGCCTTATGGCC
TTTCACGGCTCTTCTCCCTGCCCCAGCTCTGCTGCCCGCCCTTCCTCACGTCCCCTGTGAGCTGCCTGAGCCATTGGTTGGATTTCGATG
TGGCTCATTGCAGCATGTGGGGCAGCGCCTCCCATGGCCTCGCCTTGGTGCCGGTGAACCCCTTTTGGTTGCACACATGCTCCCCACACA
CACATAGACATCAGCCTTCCTGTAACTGACTGGGGACCCAGAGTGGAAACACCAGGATGGATCAGCTTGTCTGCAGAATTGCCCATCAGG
AAGACCAAAAGCCAGTAGCTTTGCTGATCTGCCCCCAGGACTCTGGAGGCGCCCCTGCACTCCCACCTCCCACCTGCCAGTTCCCAGACC
CACCCATTCGGGATCACCTGGACCAGTTACCCCAAGTCCTGCATCTCCCTTCCCTGCAGGCTGAACACCAGGGTCATGCCAGTCCCGCCA
GCCGCCTCCTCCATGCCCCAGTGACTGGTGTGGGCAGAGCAGGCAGCCAGTGGAGCTGTGGGCCAGTTCCGCTCTTGGATGCTGCTGCTC
TCACCCATGAGGTCAGGGGGGCCCTCCAAGGTTATCTCCAGGTGAGGGGATTCACATCAGGCCACAAGCCACCAGAGGCCTTCTGCCACC
TCCCAGAGCGACAGCCAGGGAGGCTGCGTACTCAGCCTCGGGGAGAAATCCCCGTGGGACCTGAGCCCCAAGACCTACGGACCACTCAGC
CTTACCATCGTACCGTCCAGGATTGTCCTTGCCATCTTTGTTGTCTCAGCCAGACCTTGGTTTTCAGTAAAGCCCCAGTTTCTACTTCCT
GCATGCCACTGTGCAAGGCCACTCATCACTGTTCCTGCAGAAGCCTCTGGACGTGGGGCTGGATGGGGTTGAAAATGTTACATGTAAATA
TTGGTTTGGTTCGGTTTTTAGCATTTTACTTGGTAACTGGTTGTTTTCTTTTTTGGGGTGGGGGGATTGGTTTGTAAAAATTCTCTACTC
TTTTGGAATGTGATTTCTAAGTTTGTTGGTTTCTTCAAATGCCTTTTAAGTCTTGGTAACATTCCCAAAGCAGAAAACTGCCTGACCCAC
AGTGGGGATTCCCTGGAGAATTGGGGTCCCAAGAAGGAATGCTGCCCTTCTCGAACCCGTTCTCCCCCTTCCTCCTGCCTCTCTGCCTTT
TACTGCTATTCCCTTCTTCTCCTCCTTTATCCTTCTTTCTGTTTTCCCCATCTCCACTCTCTCTTCAACCAAAGTCCCAAGGAACCCTCG
GGGCTCAATCCCCCATAGACCACTTGGCTTGGGTCCATGGGGTTGGCATCAGTTGGTTGGCGGAAATGGGGGACCAGTTGGCATGATGGC
CCTAAACTGGGAAACCTCATGTTTCTTATGTCTCACCTCTTTCCAGAGCCAAATCAGCCCCTTTTGGAATGATGACTTCATTGGAATGCA
AATCAAGTCATTTTGGTGCATCAGTGGCTCTTAGGCCTGCACACACGAGACATCAGAATCCAATCCTCTGACCCTGTGCCAGCCCTTTCC
CCCAGTTTATTTCCCACCAAAGGCTGACCTCTAAGAGGTCTTGCTTTCTATGAACTCAAGATGGGTCCCACCTCTAGGTGTCCCCAGGTG
CACTCTTCTACCGGTTGGCTTCCGATGTGACAAGGCCAAGGGCCCAAAGACTTGACCCTCTTACACCCTTGCTGACATGGTTCCATCATG
TCCACCCGCATGCACTTTTATGGTTTCATCACCCAGCCTCTTCTCCTCTGGCCCACCCAGCGTCCAGGCTCTTTCTCCCTCTCCCCTCCT
ATCTAGAATGTCCCCTGCTTCTAGCCTCACCAGACCCCCCAAGCTCCCACTACTTCTTCCATAATAATAGTAATAACAATGGTTATCATC
ATCCCCTGCACATCCCGCCTAAAGCACTTTACTATATAGAAAACGTTTCCCCTGGCCGGGCATGGTGGCTCACGCCTGAAATCCCAGCTC
TTTGGGAGGCTGAGGCGAGCGGATCAGTTGAGGTCAGGAGTTCAACGCCAGCCTGGCCAACGTGGTGAATCCCTGTCTGTACTAAAAGTA
CAAAAAATTAGCTGAGCATGGTGGTGCGTGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGTGGGAGAATCGCTTGAGCCCAGGAGGCGG
AGGTTGCAGGAGCAGAGATTGCGCCACTGCACTCCAGCCTGGATGACAGAGTGAGACCCAATCTCAAAAAAGAAATCGTTTCCCACCCCA
CATCTCCTTCAGACCTCTCAGGGACAACTCTGGGAGGCAGCCTTGGCAGGACATGGGTTAGTGCGCCCATTTTGCTGTGAGGAAACTGAG
GTACAGGTCTCATCCCAGAGCATGAGAAGTCACTGAGTTTAGATGAGAACTTGGGTCCAACTCTGTCCTGTTTGCTGTGCAAATCCGCTG
CCCTGCTGGGGGCTTTTGGTGGGTCCAGAATACCCAGAATATGCTGCTGGCCAACCCAGGCATAAAACAAGTCCATTCTAGATCACTGAG
CCTTGTGTATTCCAGAGGGTGATCTGAGGTCCCCATTCAGCAGAATTCTCTGAGGGCATGTTCAGAATGTAGATTCCTGGGCCCCACCTT
GAATTTGCATGTTTAACAAACTCTCCTGGGGTTGAGGGGTGGGTGCAGTGGTCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGG
GTGGATCACTTGAGCTCAGGAGTTCGAGCCTGGCCAATATGGTGAAACCCTGTCTCTACTAAAAATGCAAAAATTAGCCAGGTGTGGTGG
CACATGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGTGGGGAGCAGCGGTTGCAGTGAGCCGAGATT
GTGCCATTGCGCTCTAGCCTGGGCAACGGAATGAGACTTGTCTCAAAAAAAATAAAAAATAAAACCAGCCCTCCCCGGGGGATCTTAGGC
ACTATTGGCCACACCATTGGTGTTCGTGGCCCTGATTATTAGGCTATCTTTCTTTTTTTAAGTTTTTTTAGATTTATTTTTTATTTTATT
TATTTATTTATTTATTATTTATTATTATTATTTTTAGAGACAGGGGGTCTCCCTATGTTGCCCAGGCTGGTTTCAAACTCCTGGGCTCAA
GTGATCTGCCCTCCTCAGCCTCCCAAAGTGCTGGGGTTACAGGCAGGCATCAGCCACCGTGCCAGGTCATCTTCCTTTTTCTTTTTTTTT
TGGAGACAGAGTCTTGCTCTGTCATTCAGGCTGGAGTACAGTGGCGCCATTTCAGCTCACCGCAGCCTCCACCTCCCAGGTTCAAGCAGT
TCTCCTGCCTCAACCTCCCGAGTAACTGGGATTACAGGTGTGTGCCACCACACCCGACTAATTTTTTTATTTTTAGTAGAGACGGGGTTT
CACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCACCCACCTCGGCCTCCCAAAGTACTGGGATTATAGGCGTGAGC
CACGGCATCCAGCCTCATCTTTCTTTTAACCAATAAACATGATGCTGTATCTTAAAAAGAGCACTGAGCAGGGACTTAAGGGATCGAGTC
CTCAACCAAACTGATTTAATTACTCAGGATTTTCAAAAAGCATCAGAGGCTATTTACAATCTTAATCATAGGGGTTCAGTAAAATAAAAA
TAAGAAGTAAAAAAGCAAGAGAAATTATTCTGTAAATCTAACTGGTGTAATTCCCATAATCATGCAATTAAGTTTTACTCTTGAGTTTCC
TGACAGCCATTGGTAAAAAGAGAAACACATCAGGATTTATAATTTTTATCATCCAATTATGGGAAGCAAGCATGTTGGCCCCAGGAGACG
AACTCTTCTACTAATTTATAGCATGTATCCTTTCATCAAGGGCCACTGAGCCAGTTGGTGAGTCAACGGGTGAACCTAAGATGCAAGGAT
GTTTTCCAGGTGACTATTTAAAAAAATAAATAATCCACCAACGTGATTGACCTTGGCGAGATCATGTTTCTAGTCTATACCTCAGTTTCC
CCATCTGTAAAGTGAGGATAATGTCCCACCCCATGTAACTGTGGTGAGGACCAACTGCAACACTGTGCCTGCGAGTCTCCTTGGAAAAGT
GTAAGGTTCTACACAAATGGAAAGTGATCTGATCACACTCAGTGTCCCCAGCCCAGCCTTTCAGTGCCCTGGCCCTGGGGTGGGGGACAA
TACTCTCCTCACCCCCTTCACTAGTCTTCATGAATAGCAAGGAGGCCATAACATAATTTGGTCTAAACCCCTTCCTTTTTAAAAGAATGA
TGGCAAAATGTGCATAACATTAAATTTACCATTTTAAAGATTTCTAAGTGTCTCGAAGTACATTTGCAATGTGTAACTGCCACCTCCAGA
ACTTTTTCATCATCCTAAACTGAAACTCTGTACCCACTAAACAGTAACTCCCCACTCCCCCTGTCCCCAGTCCCTGGTAACCTCTATTCT
ACTTTTTTTTTTTCTCTGTGGATTGGCCTATTCTAGGTATCTCTTTTAAGTAGAATCATATAGTATTTGTCCTTTTGTGTCCAGCCCCCT
CATTTTTTGAGATGAGGAATCAGGCCCAGAGAAGGCAGTGGCTCACCCAAGGTCACATCGCAAACCAGAGGCAGAGCCAAGACCAGAACC
CAGGTTTCCTGACTCCTAACAGGTCTTGCATCCCCAGTAGTGAATGAGACACTGCTTTGACTTTCTGTAATCTTGGTTTAGCCCCTTCCT
TTCTCTGGGCTCAGTCTGCTTCCCACCACTCACCCATTCTTTCTTTATCATCTCCCTCCAAAGCCTCTTGTCCTCCTGCCTCCTCTTCTC
CTTGGCTGGTTCCTGCCAGCAAATGGAGTGGCCCTGGTCCATGCCAGGTTTTCCTCTTCTGGGTCCGGAGCTCACTATAGTATTCAGCCC
TCAGTCCTCCCAGGATGTTCTTCCCAGCACTGCTGCCTCGTGCGGATTTTCCCGTAACCTCAGTAACTGGCTTCTTGTCCCCCTGCTTCC
TACCAGGGAAGCCTTCCTGTCCGCGTCTGTGGTTTCCCAACTCTGATATTTGCTCTCAAATGTGGTGGTGTCCTGGTTCTGTGTTTATTT
ATTTTGTGTTTTCTCACACACACAAAAAAAAGCCTCTTCCTCCAATGCATCAGGAGGCACCAGCCCTGCCAGCCCTTCTCACTGGGCTCA
CCCTGCCCCAGCAACCCCCCGGTGCCACCCTTAAGCCACACTGTCTTCTCTGTAAGCAGCCTGCCAGCAGCAGCCCCAGCACTTTGCAAT
GGGCGTGTGTGTGGTGGTGGGTGGGGGGGGCTTGGATCCCTCCTTTTTCCTCCCCTGCCCTGCCCAGGCCCAGATGGCCTTGACTGTAAA
GCAGGTGCTGCCTGACAGGTTCTTCTCTCCCTGTCTCTGGTCATTGATCCATCTCTTTGTCCATTCAGTATCCAACCATCCTCTCCATTC
TCCTCTGGACCTCACCACTCTCAGAGCTGCTTGTCCTGGCAGAATCTACAGTTCACCCCAACTCTATGCCTTACCCCTCCCAACCCAACA
GCATTTGCAGTTTGCAAAATATACAGACCCAAGTCCTGAGGGGACTGAGGACATGATGCTGGGCCCAAGTCTCCTGCTCAGGGCTTCTCT
CCAATGCCAGCCCTGCCACTCCTTCCTCACCCTCCTTGGAGCCTCCTCTGCTGCTTGTCTATCCCAACGGCCCTGCTCCCCTCCCTTCCT
GCCCTTCACCAGCTTTCTGGACACCATGCCCTGAGGAAGGGACCTTTGGTTTTCTCTAAACATCTTTGAAGGGCTGAGGCAGTCAGGGCT
GGCTGCCTTGTCACTCTTTATTTGGAAGCCACTCAAACCATTCCCAAGAAGAGGGACCTCAGCTGGCAATCTGGAAACCTGGCCCAGGTC
TGGGCAGATGTCTTCACTTCTCCTACCTTCCCAGTCTTGTGATCCTGTGATGAGCACCAGGATGGCCCTGTGGTCCCTAGAGCACCCCTC
ATGCTGTAGGGTCCTGCAGCCCCATCCTTTCTCTACTGGGCCCTGGTATCCTGGCTCCTCTCTCAGCTCTGCCACTGATCTCTGTGCCTT
AGTTTACTTCTCTGCACGGGGGACTCACCCCAAGACCATTTCCAGCAGCTTCCCAGGTGATGTGGTGCCCCAAGGCTGGGCTTTGCCAGC
TGTGGCCCAGCTCCTTAGTGCTGCCCAGGAGACACCAGGCTGCTCAGAATGAGGTGACTGCGGGCACCATTCTCAGCCAGTGGTTCTTGT
ATTGCATTCCAGCAGCAGGAATATCACCTGGGAACTTGATAGAAGTGCAGATTAGCAGCCCCACCCAAGACCCACTGAATTAGAGCTTGT
GGAGTGGGGCCCTACAAGCTGGGGTTTTAAGGAGCCCTCCAAATGATTCTGACGCATAAGAATATGCCAACTGCTGATCTGGGCTAGCCA
TTAGTAGAGCCTGGGGAGGGACTGGGACTGGCTAGGCCAAGAACAGGTGGAAAACACCAGCCTTATCTGGACTCCTGAGATTGGGAACCA
CCACCAACAAAAACCAACCCTATAGTCGCTCCTCTTGGAAGAGGAAGAGAAGTTGAAGGGCCTGGAGAAAGCACACATTGTTTGTTTCCC
TGCTCCTGCTCACCTCTCTCACTTGTCTTGGTTTTACAAAAGGCTGTGTGGATGGTGCCAGCCAGGGAGGGGGTGGGAGTCCTGGGGAGG
CAGGAGGCAGAAGACCCTGACTGTTTCTCCCTTGGGAACCTCACCATAGGCCAGATAGCGCCTCTTCAAACTGAAAGAAATCTTAACTCC
ACAAAGAAAGCATCCTAAATCCCCAGTTCCTCCTCCTCCCAACCCCAGGGATACCTTGTAGACAGTGCCAAAAAACAGCTCCAACCCCCA
GCAGCTGGGAAGAGAGCCAGAAGCTGCCCTTCCTCCTCATCCTGGCCCCTCCCCAGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCA
GCCTGGTTTCCTGGTGAGGGTCCTGCAGTCATGGGCCCTGGGGGACCCCCAGGGCAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCT
TGGGCCAACTGTCCACCTCTCTTGTCCACTATTCTCTCCTTTCCACTTCTGTCTTCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAG
GACAACTGCAGTCCAATCCACCAGCTCTCCCTGCCCCTGTGTCTTATTTCAGACATGAGAATAACTGTACAGTGTAAACTTATAAAGCGT

>8490_8490_7_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000312783_SOGA1_chr20_35445872_ENST00000237536_length(amino acids)=1539AA_BP=235
MDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSNSSQRIPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSR
PLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRR
EVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATHSLKKRGTRSLGKADKKTLVQEDSADLKCQLHFAK
EESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMK
DHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELAKFRSEHELDVALSEDSCSVLSEPSQEE
LAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAAL
VSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELNAFTRKADAVLGCSVKEQQESFSSLPPL
GSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLS
QERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQFKHNFLLLFMKLRWFLKRWRQGKVLPSE
GDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAE
LKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQEKLQLVERLQGEKQQVEQQVKELQNRLS
QLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSS
EPGDPEKDTKEKPGLSSRDCNHLGALACQDPPGRQMQRSYTAPDKTGIRVYYSPPVARRLGVPVVHDKEGKIIIEPGFLFTTAKPKESAE
ADGLAESSYGRWLCNFSRQRLDGGSAGSPSAAGPGFPAALHDFEMSGNMSDDMKEITNCVRQAMRSGSLERKVKSTSSQTVGLASVGTQT
IRTVSVGLQTDPPRSSLHGKAWSPRSSSLVSVRSKQISSSLDKVHSRIERPCCSPKYGSPKLQRRSVSKLDSSKDRSLWNLHQGKQNGSA
WARSTTTRDSPVLRNINDGLSSLFSVVEHSGSTESVWKLGMSETRAKPEPPKYGIVQEFFRNVCGRAPSPTSSAGEEGTKKPEPLSPASY
HQPEGVARILNKKAAKLGSSEEVRLTMLPQVGKDGVLRDGDGAVVLPNEDAVCDCSTQSLTSCFARSSRSAIRHSPSKCRLHPSESSWGG

--------------------------------------------------------------
>8490_8490_8_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000312783_SOGA1_chr20_35445872_ENST00000279034_length(transcript)=3967nt_BP=948nt
ACAAGGCAGCCTCGCTCGAGCGCAGGCCAATCGGCTTTCTAGCTAGAGGGTTTAACTCCTATTTAAAAAGAAGAACCTTTGAATTCTAAC
GGCTGAGCTCTTGGAAGACTTGGGTCCTTGGGTCGCAGGGTCTCACTCCATTGCCCAGGCCAGAGTGCGGGGATATTTGATAAGAAACTT
CAGTGAAGGCCGGGCGCGGTGGCTCATGCCCGTAATCCCAGCATTTTCGGAGGCCGAGGCATCATGGACCGATCTAAAGAAAACTGCATT
TCAGGACCTGTTAAGGCTACAGCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTGACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTA
AATAGTGGCCAGGCTCAGCGGGTCTTGTGTCCTTCAAATTCTTCCCAGCGCATTCCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAG
CCGGTTCAGAATCAGAAGCAGAAGCAATTGCAGGCAACCAGTGTACCTCATCCTGTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAG
CAGCCCCTGCCATCGGCACCTGAAAATAATCCTGAGGAGGAACTGGCATCAAAACAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCT
TTGGAAGACTTTGAAATTGGTCGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTTTATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTG
GCTCTTAAAGTGTTATTTAAAGCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAGCTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGG
CATCCTAATATTCTTAGACTGTATGGTTATTTCCATGATGCTACCAGAGTCTACCTAATTCTGGAATATGCACCACTTGGAACAGTTTAT
AGAGAACTTCAGAAACTTTCAAAGTTTGATGAGCAGAGAACTGCTACTCATTCCTTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCC
GATAAGAAGACTTTGGTGCAGGAGGACAGTGCAGACCTGAAGTGCCAGTTGCACTTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAG
CTCACTAAGCTTGCCAAGGAGAATGACAGCATGAAGGAGGAGCTGCTGAAGTACCGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCA
GCCGAGGAGCTGGCCGATGCCCCCCACTCGCGGGAGACCGAGCTGAAGGTGCACCTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGC
CGCCGCATCGTGGAGCTGGAGGTGGAGAACCGAGGCCTGCGGGCTGAGATGGACGACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCT
GAGGCACGCCTGGCCTTCTCCGCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTGGCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAG
GAGGCCGAGCTGCTGCGGCGCTCCTCTGCCGAGCTCGAGGACCAGAACAAGCTGCTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCAC
GAGCTGGACGTGGCGCTGTCGGAGGACAGTTGTTCTGTGCTCAGCGAACCTTCACAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGC
GAGCTCAGCGGCAAGGTCAAGAAGCTGCAGTACGAGAACCGCGTGCTCCTCTCCAACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGT
ACGCGGCCCATGCTGGAGACGGACGCCGAGGCCGGGGACTCTGCCCAGTGTGTGCCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCG
GTCCGACTCTGCAGAGCCAGGGAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAGGCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTG
GCTGATGCCAATGGCTTCACGGCTGGCCTCCGGCTGTGTCTGGACAACGAGTGTGCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGC
GAGGGCCCCAGGGACACCAAGCTCATCCATGCCATCCTGGTGCGCCTGAGCGTGCTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCA
GATGCAGTCCTCGGGTGCTCTGTCAAGGAACAGCAGGAGTCCTTCTCATCACTGCCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATT
CTTCTGGCAAAAGACCTTGGCTCAGACTTTCAGCCACCTGACTTCAGGGACCTGCCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGC
ACTGGTGACTTGGACTCTAAGCCCGACCCCAGCCGGAGCTTCAGGCCTTACCGAGCTGAAGACAATGATTCCTATGCCTCTGAGATCAAG
GAGCTGCAGCTGGTGCTGGCTGAGGCCCACGACAGCCTCCGGGGCTTGCAAGAGCAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAG
GCCGACAATTTCAACCAGAAAATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCGCTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGC
CTCCAGCGGAGGCTGGAGCAGAAATTCTGGAGCCAGGAGAAGAACATGCTGGTGCAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTG
CTCTTCATGAAGCTCAGGTGGTTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTGCCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGC
ATGAAGGAGCTGTACTTGCTGATGGAGGAAGAGGAGATAAACGCTCAGCATTCTGATAACAAGGCCTGCACGGGGGACAGCTGGACCCAG
AACACGCCCAATGAGTACATCAAGACACTGGCCGACATGAAGGTGACGCTGAAGGAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGT
CTGACGGAGCTTCAGCAACAGTTTGCCAAGGCCAAGGCTACCTGGGAGACAGAGCGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAG
CTGAAGACAGGGAAGGGGGCCGGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCCCTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTA
GCTGAGTCCTACAGCGCTGTCATGGAGCTGACTCGGCAGCTGCAGATCAGTGAGCGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAG
CGGCTGCAGGGTGAGAAGCAGCAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAACCGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGG
GTCCTGAAGCACTCGGAGCTGGAGAAGCAGGACAACAGCTGGAAGGAGACACGCAGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAA
GTTGAGCTTGGAGGAAATGGTTTAAAGAGAACCAAATCTGTTTCTTCCATGTCTGAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTT
GCTGGCGGAGATGCCCGGGGCAAGAAGCTGCCTAACAACCCTGCCTTTGGCTTTGTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACC
AAGGAGAAGCCTGGGCTCTCGTCGAGGGACTGCAACCACCTGGGTGCCCTGGCCTGCCAGGACCCCCCAGGGAGCCAGAAGCTGCCCTTC
CTCCTCATCCTGGCCCCTCCCCAGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCAGCCTGGTTTCCTGGTGAGGGTCCTGCAGTCAT
GGGCCCTGGGGGACCCCCAGGGCAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCTTGGGCCAACTGTCCACCTCTCTTGTCCACTAT
TCTCTCCTTTCCACTTCTGTCTTCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAGGACAACTGCAGTCCAATCCACCAGCTCTCCCT
GCCCCTGTGTCTTATTTCAGACATGAGAATAACTGTACAGTGTAAACTTATAAAGCGTTTTTAATGGTTGTAGATTGGAAATAAAGTATG

>8490_8490_8_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000312783_SOGA1_chr20_35445872_ENST00000279034_length(amino acids)=1132AA_BP=235
MDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSNSSQRIPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSR
PLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRR
EVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATHSLKKRGTRSLGKADKKTLVQEDSADLKCQLHFAK
EESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMK
DHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELAKFRSEHELDVALSEDSCSVLSEPSQEE
LAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAAL
VSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELNAFTRKADAVLGCSVKEQQESFSSLPPL
GSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLS
QERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQFKHNFLLLFMKLRWFLKRWRQGKVLPSE
GDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAE
LKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQEKLQLVERLQGEKQQVEQQVKELQNRLS
QLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSS

--------------------------------------------------------------
>8490_8490_9_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000371356_SOGA1_chr20_35445872_ENST00000237536_length(transcript)=13795nt_BP=837nt
CTTGGAAGACTTGGGTCCTTGGGTCGCAGGCTGGAGTGCAATGGTGTGATCTCAGCTCACTGCAACCTCTGCTTCCTGGGTTTAAGTGAT
TCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCATCATGGACCGATCTAAAGAAAACTGCATTTCAGGACCTGTTAAGGCTACA
GCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTGACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTAAATAGTGGCCAGGCTCAGCGG
GTCTTGTGTCCTTCAAATTCTTCCCAGCGCATTCCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAGCCGGTTCAGAATCAGAAGCAG
AAGCAATTGCAGGCAACCAGTGTACCTCATCCTGTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCT
GAAAATAATCCTGAGGAGGAACTGGCATCAAAACAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCTTTGGAAGACTTTGAAATTGGT
CGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTTTATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTGGCTCTTAAAGTGTTATTTAAA
GCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAGCTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGGCATCCTAATATTCTTAGACTG
TATGGTTATTTCCATGATGCTACCAGAGTCTACCTAATTCTGGAATATGCACCACTTGGAACAGTTTATAGAGAACTTCAGAAACTTTCA
AAGTTTGATGAGCAGAGAACTGCTACTCATTCCTTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCCGATAAGAAGACTTTGGTGCAG
GAGGACAGTGCAGACCTGAAGTGCCAGTTGCACTTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAGCTCACTAAGCTTGCCAAGGAG
AATGACAGCATGAAGGAGGAGCTGCTGAAGTACCGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCAGCCGAGGAGCTGGCCGATGCC
CCCCACTCGCGGGAGACCGAGCTGAAGGTGCACCTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGCCGCCGCATCGTGGAGCTGGAG
GTGGAGAACCGAGGCCTGCGGGCTGAGATGGACGACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCTGAGGCACGCCTGGCCTTCTCC
GCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTGGCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAGGAGGCCGAGCTGCTGCGGCGC
TCCTCTGCCGAGCTCGAGGACCAGAACAAGCTGCTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCACGAGCTGGACGTGGCGCTGTCG
GAGGACAGTTGTTCTGTGCTCAGCGAACCTTCACAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGCGAGCTCAGCGGCAAGGTCAAG
AAGCTGCAGTACGAGAACCGCGTGCTCCTCTCCAACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGTACGCGGCCCATGCTGGAGACG
GACGCCGAGGCCGGGGACTCTGCCCAGTGTGTGCCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCGGTCCGACTCTGCAGAGCCAGG
GAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAGGCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTGGCTGATGCCAATGGCTTCACG
GCTGGCCTCCGGCTGTGTCTGGACAACGAGTGTGCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGCGAGGGCCCCAGGGACACCAAG
CTCATCCATGCCATCCTGGTGCGCCTGAGCGTGCTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCAGATGCAGTCCTCGGGTGCTCT
GTCAAGGAACAGCAGGAGTCCTTCTCATCACTGCCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATTCTTCTGGCAAAAGACCTTGGC
TCAGACTTTCAGCCACCTGACTTCAGGGACCTGCCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGCACTGGTGACTTGGACTCTAAG
CCCGACCCCAGCCGGAGCTTCAGGCCTTACCGAGCTGAAGACAATGATTCCTATGCCTCTGAGATCAAGGAGCTGCAGCTGGTGCTGGCT
GAGGCCCACGACAGCCTCCGGGGCTTGCAAGAGCAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAGGCCGACAATTTCAACCAGAAA
ATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCGCTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGCCTCCAGCGGAGGCTGGAGCAG
AAATTCTGGAGCCAGGAGAAGAACATGCTGGTGCAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTGCTCTTCATGAAGCTCAGGTGG
TTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTGCCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGCATGAAGGAGCTGTACTTGCTG
ATGGAGGAAGAGGAGATAAACGCTCAGCATTCTGATAACAAGGCCTGCACGGGGGACAGCTGGACCCAGAACACGCCCAATGAGTACATC
AAGACACTGGCCGACATGAAGGTGACGCTGAAGGAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGTCTGACGGAGCTTCAGCAACAG
TTTGCCAAGGCCAAGGCTACCTGGGAGACAGAGCGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAGCTGAAGACAGGGAAGGGGGCC
GGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCCCTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTAGCTGAGTCCTACAGCGCTGTC
ATGGAGCTGACTCGGCAGCTGCAGATCAGTGAGCGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAGCGGCTGCAGGGTGAGAAGCAG
CAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAACCGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGGGTCCTGAAGCACTCGGAGCTG
GAGAAGCAGGACAACAGCTGGAAGGAGACACGCAGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAAGTTGAGCTTGGAGGAAATGGT
TTAAAGAGAACCAAATCTGTTTCTTCCATGTCTGAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTTGCTGGCGGAGATGCCCGGGGC
AAGAAGCTGCCTAACAACCCTGCCTTTGGCTTTGTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACCAAGGAGAAGCCTGGGCTCTCG
TCGAGGGACTGCAACCACCTGGGTGCCCTGGCCTGCCAGGACCCCCCAGGGAGGCAGATGCAGCGCAGCTACACGGCTCCTGACAAGACG
GGCATCCGAGTCTACTATAGTCCCCCGGTGGCCCGGCGCCTCGGAGTCCCTGTGGTTCATGACAAAGAGGGCAAGATCATTATCGAGCCC
GGCTTCCTCTTCACCACAGCCAAGCCCAAAGAGTCGGCCGAGGCTGATGGGCTGGCTGAGAGCTCCTATGGTCGGTGGCTCTGCAACTTC
TCACGGCAGCGCCTGGACGGAGGCTCAGCGGGCAGCCCCTCGGCGGCCGGGCCTGGCTTCCCAGCGGCCCTGCATGACTTTGAGATGTCA
GGCAACATGAGTGATGACATGAAGGAGATCACCAACTGTGTGCGCCAGGCCATGCGCTCCGGCTCACTGGAGAGGAAAGTGAAGAGCACA
TCCAGCCAGACGGTGGGCCTGGCCAGTGTGGGCACACAGACCATCCGCACGGTCAGCGTGGGCCTGCAGACCGACCCACCCCGCAGCAGC
CTCCATGGCAAGGCCTGGTCACCCCGCAGCTCTTCGCTCGTGTCTGTGCGCAGCAAGCAGATCTCCTCCTCCCTGGACAAGGTCCATTCG
CGCATCGAGCGGCCCTGCTGCTCCCCCAAGTATGGCTCACCAAAGCTCCAGAGGCGGTCTGTGTCCAAGCTGGACAGCAGCAAGGACCGC
AGCCTGTGGAACCTGCACCAGGGCAAGCAGAACGGCTCGGCCTGGGCCCGCTCCACCACCACGCGGGACAGCCCTGTATTGAGAAACATC
AACGATGGACTCTCCAGCCTCTTCAGTGTGGTGGAGCACTCAGGGAGCACGGAGTCTGTCTGGAAACTAGGCATGTCTGAGACGCGGGCC
AAGCCCGAGCCTCCCAAGTACGGCATTGTGCAGGAATTCTTCCGTAATGTGTGTGGCCGGGCACCGAGCCCCACCTCATCAGCAGGAGAG
GAGGGCACCAAGAAGCCAGAGCCCCTCTCCCCAGCCAGCTACCATCAGCCAGAGGGTGTGGCCAGGATCCTGAACAAGAAGGCAGCCAAG
TTGGGCAGCAGTGAGGAGGTCAGACTCACCATGCTCCCCCAGGTGGGGAAGGATGGTGTCCTCCGGGACGGAGATGGAGCCGTGGTCCTT
CCCAATGAGGACGCTGTTTGTGACTGTAGTACCCAGTCTCTCACCTCCTGCTTCGCCCGATCGTCCCGCTCTGCCATCCGCCACTCTCCT
TCCAAGTGCAGGCTGCACCCTTCAGAGTCCAGCTGGGGTGGGGAGGAGAGGGCACTCCCCCCCAGCGAGTGACAGAGCAGCCAAGCTCCC
CGCCTCAACCAGCCCAGCCCCTGGATAGCAGAAGGGAACCAGCAGAGACGAGACGAGGTGAGGCGAGGGGCTGTGTCCTCAGCATTGCCT
GGCCCTGGAGGGACAGCAGTGATGCCACTGCCAGAATGCAGCTTTCACATCAAGGTAAAGCCGGGTCTCCTGCTGGCCCCTGGGTGGTGA
GCTTCGACTTCCCAGGGGAAGGCAGTGAGTGGGAGAGAGACCAAACCTGGGCTTCCCAAGCATCCACTGAGAGATCTGTCAAGAGCCGAT
CCCTGGGTCCTAAGAGAGAGCCTTGCCTGGTTCTGCCCATGCCACCCTCTTGGAAGAGCCCAAGAAGGATACATGTCTGGCCATGCCTTT
GGGGAAAAGGAGTCGGAGAGATGTTTCCTGCTGACCATCCACCCCTTCATTTGGGAGGAGACACTGCTGAGAAGAACAGGCTTTGCTCTA
GGGCTCCATGTTTGGTTCCTGGTGGAGCCCTGTTGGGCATCATCACCATCACCTCCTTCTCTCCACCACCTCCTCCTCCCAGCCCCACTG
CTCTAGATTGCTGGGACACTAGGGAGTATGATAGGGCAGTAGCCAGGGCCATTGCTTAGTGTCCTGGAGCCCTGGATCTCCCTGCCCATA
GCCTGGATGCAGCAAGAGCTGGGAGGCGAAGTGGAAACATGCAGGGCTCAGGGTTGGGGAGTGATTGCAATTGCCTTCCTTGCCAAAGTG
ACTTGGGGCCCCAACGTTCCCAGCAGACCCCTTGAGGACAGAAATAGGTAGAGTCAGTCTCAAGACCTGGTGCATAGATAAATGCCTAAA
TACACTGCCTTGATCTCAAGTGATCTCAGAGGCCTCTTTCCCTGGCACCCTGAGAGGCAGCAGGCACTACATCTCCACTGTGTTTACATC
CTGCAGCTGGTCGGGGGGCAAAGATATTCCCAGTAAGAGATTCTTGGTTGGCCAGGTCAGGCCCAGGAGAACACCAAGAGGCCAGAGCCC
AGGACACAGCAGTAGACTGGGGCCTGGAAACACGTATCTTGCCTAGATTGTTTATTTGAATTTTTCCTACTATAAATATTTAAGGTGGTT
TACTTTATTTTAATAATTTAATTTACCCCAAAGTCCCTAAGGTAATTTATTGGAGGTTGAAACATGCATTCTTGCCACTGGGACAACATG
AGGCCTCTAACAGCACGGGCAGGCATGGGGTCCCCTGGGTGGACGAGGCCGCTTGGCAGCCAGGTTTGGAGACCTGGCCTCCTGGTCAGC
TTTGGAGGGCCCCTCAACAGAGCTGGAGCCCTGCACCCCAACACGGCTGGCCATGTGGCCTCAGAACACTACTTATTACTCAATGCCTGG
TACTTGGCCAGCCCCAGCGGTCAGTCTATAAATACTCACTGACAAGGTGGAGGGCTGGACGGCCATCACCACTCCCCAGACGTTCTCCAT
TGCCTGTCTCATTTCCCTCCCTCCCTCTGACACCTTTCTTCATGAGTCGAACGTGGATTACTAAAGCTCTATTAAGAGTGTGGAGATCCC
TCCAAGTTTCCCAAATGAGAACTCACAGGAAAACAGGACTGAACTTTGAGAATGTTGTTTATCGCAGCTTTGCACATAAACCTGAGTGTC
TCCCAGCCTGCCTCGGTTCTCACCAGCCTGCCAGCCTTTTCACCAGCCTCTCTCCTTAGCCTTATGGCCTTTCACGGCTCTTCTCCCTGC
CCCAGCTCTGCTGCCCGCCCTTCCTCACGTCCCCTGTGAGCTGCCTGAGCCATTGGTTGGATTTCGATGTGGCTCATTGCAGCATGTGGG
GCAGCGCCTCCCATGGCCTCGCCTTGGTGCCGGTGAACCCCTTTTGGTTGCACACATGCTCCCCACACACACATAGACATCAGCCTTCCT
GTAACTGACTGGGGACCCAGAGTGGAAACACCAGGATGGATCAGCTTGTCTGCAGAATTGCCCATCAGGAAGACCAAAAGCCAGTAGCTT
TGCTGATCTGCCCCCAGGACTCTGGAGGCGCCCCTGCACTCCCACCTCCCACCTGCCAGTTCCCAGACCCACCCATTCGGGATCACCTGG
ACCAGTTACCCCAAGTCCTGCATCTCCCTTCCCTGCAGGCTGAACACCAGGGTCATGCCAGTCCCGCCAGCCGCCTCCTCCATGCCCCAG
TGACTGGTGTGGGCAGAGCAGGCAGCCAGTGGAGCTGTGGGCCAGTTCCGCTCTTGGATGCTGCTGCTCTCACCCATGAGGTCAGGGGGG
CCCTCCAAGGTTATCTCCAGGTGAGGGGATTCACATCAGGCCACAAGCCACCAGAGGCCTTCTGCCACCTCCCAGAGCGACAGCCAGGGA
GGCTGCGTACTCAGCCTCGGGGAGAAATCCCCGTGGGACCTGAGCCCCAAGACCTACGGACCACTCAGCCTTACCATCGTACCGTCCAGG
ATTGTCCTTGCCATCTTTGTTGTCTCAGCCAGACCTTGGTTTTCAGTAAAGCCCCAGTTTCTACTTCCTGCATGCCACTGTGCAAGGCCA
CTCATCACTGTTCCTGCAGAAGCCTCTGGACGTGGGGCTGGATGGGGTTGAAAATGTTACATGTAAATATTGGTTTGGTTCGGTTTTTAG
CATTTTACTTGGTAACTGGTTGTTTTCTTTTTTGGGGTGGGGGGATTGGTTTGTAAAAATTCTCTACTCTTTTGGAATGTGATTTCTAAG
TTTGTTGGTTTCTTCAAATGCCTTTTAAGTCTTGGTAACATTCCCAAAGCAGAAAACTGCCTGACCCACAGTGGGGATTCCCTGGAGAAT
TGGGGTCCCAAGAAGGAATGCTGCCCTTCTCGAACCCGTTCTCCCCCTTCCTCCTGCCTCTCTGCCTTTTACTGCTATTCCCTTCTTCTC
CTCCTTTATCCTTCTTTCTGTTTTCCCCATCTCCACTCTCTCTTCAACCAAAGTCCCAAGGAACCCTCGGGGCTCAATCCCCCATAGACC
ACTTGGCTTGGGTCCATGGGGTTGGCATCAGTTGGTTGGCGGAAATGGGGGACCAGTTGGCATGATGGCCCTAAACTGGGAAACCTCATG
TTTCTTATGTCTCACCTCTTTCCAGAGCCAAATCAGCCCCTTTTGGAATGATGACTTCATTGGAATGCAAATCAAGTCATTTTGGTGCAT
CAGTGGCTCTTAGGCCTGCACACACGAGACATCAGAATCCAATCCTCTGACCCTGTGCCAGCCCTTTCCCCCAGTTTATTTCCCACCAAA
GGCTGACCTCTAAGAGGTCTTGCTTTCTATGAACTCAAGATGGGTCCCACCTCTAGGTGTCCCCAGGTGCACTCTTCTACCGGTTGGCTT
CCGATGTGACAAGGCCAAGGGCCCAAAGACTTGACCCTCTTACACCCTTGCTGACATGGTTCCATCATGTCCACCCGCATGCACTTTTAT
GGTTTCATCACCCAGCCTCTTCTCCTCTGGCCCACCCAGCGTCCAGGCTCTTTCTCCCTCTCCCCTCCTATCTAGAATGTCCCCTGCTTC
TAGCCTCACCAGACCCCCCAAGCTCCCACTACTTCTTCCATAATAATAGTAATAACAATGGTTATCATCATCCCCTGCACATCCCGCCTA
AAGCACTTTACTATATAGAAAACGTTTCCCCTGGCCGGGCATGGTGGCTCACGCCTGAAATCCCAGCTCTTTGGGAGGCTGAGGCGAGCG
GATCAGTTGAGGTCAGGAGTTCAACGCCAGCCTGGCCAACGTGGTGAATCCCTGTCTGTACTAAAAGTACAAAAAATTAGCTGAGCATGG
TGGTGCGTGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGTGGGAGAATCGCTTGAGCCCAGGAGGCGGAGGTTGCAGGAGCAGAGATTG
CGCCACTGCACTCCAGCCTGGATGACAGAGTGAGACCCAATCTCAAAAAAGAAATCGTTTCCCACCCCACATCTCCTTCAGACCTCTCAG
GGACAACTCTGGGAGGCAGCCTTGGCAGGACATGGGTTAGTGCGCCCATTTTGCTGTGAGGAAACTGAGGTACAGGTCTCATCCCAGAGC
ATGAGAAGTCACTGAGTTTAGATGAGAACTTGGGTCCAACTCTGTCCTGTTTGCTGTGCAAATCCGCTGCCCTGCTGGGGGCTTTTGGTG
GGTCCAGAATACCCAGAATATGCTGCTGGCCAACCCAGGCATAAAACAAGTCCATTCTAGATCACTGAGCCTTGTGTATTCCAGAGGGTG
ATCTGAGGTCCCCATTCAGCAGAATTCTCTGAGGGCATGTTCAGAATGTAGATTCCTGGGCCCCACCTTGAATTTGCATGTTTAACAAAC
TCTCCTGGGGTTGAGGGGTGGGTGCAGTGGTCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGTGGATCACTTGAGCTCAGGA
GTTCGAGCCTGGCCAATATGGTGAAACCCTGTCTCTACTAAAAATGCAAAAATTAGCCAGGTGTGGTGGCACATGCCTGTAATCCCAGCT
ACTTGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGTGGGGAGCAGCGGTTGCAGTGAGCCGAGATTGTGCCATTGCGCTCTAGCCTG
GGCAACGGAATGAGACTTGTCTCAAAAAAAATAAAAAATAAAACCAGCCCTCCCCGGGGGATCTTAGGCACTATTGGCCACACCATTGGT
GTTCGTGGCCCTGATTATTAGGCTATCTTTCTTTTTTTAAGTTTTTTTAGATTTATTTTTTATTTTATTTATTTATTTATTTATTATTTA
TTATTATTATTTTTAGAGACAGGGGGTCTCCCTATGTTGCCCAGGCTGGTTTCAAACTCCTGGGCTCAAGTGATCTGCCCTCCTCAGCCT
CCCAAAGTGCTGGGGTTACAGGCAGGCATCAGCCACCGTGCCAGGTCATCTTCCTTTTTCTTTTTTTTTTGGAGACAGAGTCTTGCTCTG
TCATTCAGGCTGGAGTACAGTGGCGCCATTTCAGCTCACCGCAGCCTCCACCTCCCAGGTTCAAGCAGTTCTCCTGCCTCAACCTCCCGA
GTAACTGGGATTACAGGTGTGTGCCACCACACCCGACTAATTTTTTTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGGT
CTTGAACTCCTGACCTCAGGTGATCCACCCACCTCGGCCTCCCAAAGTACTGGGATTATAGGCGTGAGCCACGGCATCCAGCCTCATCTT
TCTTTTAACCAATAAACATGATGCTGTATCTTAAAAAGAGCACTGAGCAGGGACTTAAGGGATCGAGTCCTCAACCAAACTGATTTAATT
ACTCAGGATTTTCAAAAAGCATCAGAGGCTATTTACAATCTTAATCATAGGGGTTCAGTAAAATAAAAATAAGAAGTAAAAAAGCAAGAG
AAATTATTCTGTAAATCTAACTGGTGTAATTCCCATAATCATGCAATTAAGTTTTACTCTTGAGTTTCCTGACAGCCATTGGTAAAAAGA
GAAACACATCAGGATTTATAATTTTTATCATCCAATTATGGGAAGCAAGCATGTTGGCCCCAGGAGACGAACTCTTCTACTAATTTATAG
CATGTATCCTTTCATCAAGGGCCACTGAGCCAGTTGGTGAGTCAACGGGTGAACCTAAGATGCAAGGATGTTTTCCAGGTGACTATTTAA
AAAAATAAATAATCCACCAACGTGATTGACCTTGGCGAGATCATGTTTCTAGTCTATACCTCAGTTTCCCCATCTGTAAAGTGAGGATAA
TGTCCCACCCCATGTAACTGTGGTGAGGACCAACTGCAACACTGTGCCTGCGAGTCTCCTTGGAAAAGTGTAAGGTTCTACACAAATGGA
AAGTGATCTGATCACACTCAGTGTCCCCAGCCCAGCCTTTCAGTGCCCTGGCCCTGGGGTGGGGGACAATACTCTCCTCACCCCCTTCAC
TAGTCTTCATGAATAGCAAGGAGGCCATAACATAATTTGGTCTAAACCCCTTCCTTTTTAAAAGAATGATGGCAAAATGTGCATAACATT
AAATTTACCATTTTAAAGATTTCTAAGTGTCTCGAAGTACATTTGCAATGTGTAACTGCCACCTCCAGAACTTTTTCATCATCCTAAACT
GAAACTCTGTACCCACTAAACAGTAACTCCCCACTCCCCCTGTCCCCAGTCCCTGGTAACCTCTATTCTACTTTTTTTTTTTCTCTGTGG
ATTGGCCTATTCTAGGTATCTCTTTTAAGTAGAATCATATAGTATTTGTCCTTTTGTGTCCAGCCCCCTCATTTTTTGAGATGAGGAATC
AGGCCCAGAGAAGGCAGTGGCTCACCCAAGGTCACATCGCAAACCAGAGGCAGAGCCAAGACCAGAACCCAGGTTTCCTGACTCCTAACA
GGTCTTGCATCCCCAGTAGTGAATGAGACACTGCTTTGACTTTCTGTAATCTTGGTTTAGCCCCTTCCTTTCTCTGGGCTCAGTCTGCTT
CCCACCACTCACCCATTCTTTCTTTATCATCTCCCTCCAAAGCCTCTTGTCCTCCTGCCTCCTCTTCTCCTTGGCTGGTTCCTGCCAGCA
AATGGAGTGGCCCTGGTCCATGCCAGGTTTTCCTCTTCTGGGTCCGGAGCTCACTATAGTATTCAGCCCTCAGTCCTCCCAGGATGTTCT
TCCCAGCACTGCTGCCTCGTGCGGATTTTCCCGTAACCTCAGTAACTGGCTTCTTGTCCCCCTGCTTCCTACCAGGGAAGCCTTCCTGTC
CGCGTCTGTGGTTTCCCAACTCTGATATTTGCTCTCAAATGTGGTGGTGTCCTGGTTCTGTGTTTATTTATTTTGTGTTTTCTCACACAC
ACAAAAAAAAGCCTCTTCCTCCAATGCATCAGGAGGCACCAGCCCTGCCAGCCCTTCTCACTGGGCTCACCCTGCCCCAGCAACCCCCCG
GTGCCACCCTTAAGCCACACTGTCTTCTCTGTAAGCAGCCTGCCAGCAGCAGCCCCAGCACTTTGCAATGGGCGTGTGTGTGGTGGTGGG
TGGGGGGGGCTTGGATCCCTCCTTTTTCCTCCCCTGCCCTGCCCAGGCCCAGATGGCCTTGACTGTAAAGCAGGTGCTGCCTGACAGGTT
CTTCTCTCCCTGTCTCTGGTCATTGATCCATCTCTTTGTCCATTCAGTATCCAACCATCCTCTCCATTCTCCTCTGGACCTCACCACTCT
CAGAGCTGCTTGTCCTGGCAGAATCTACAGTTCACCCCAACTCTATGCCTTACCCCTCCCAACCCAACAGCATTTGCAGTTTGCAAAATA
TACAGACCCAAGTCCTGAGGGGACTGAGGACATGATGCTGGGCCCAAGTCTCCTGCTCAGGGCTTCTCTCCAATGCCAGCCCTGCCACTC
CTTCCTCACCCTCCTTGGAGCCTCCTCTGCTGCTTGTCTATCCCAACGGCCCTGCTCCCCTCCCTTCCTGCCCTTCACCAGCTTTCTGGA
CACCATGCCCTGAGGAAGGGACCTTTGGTTTTCTCTAAACATCTTTGAAGGGCTGAGGCAGTCAGGGCTGGCTGCCTTGTCACTCTTTAT
TTGGAAGCCACTCAAACCATTCCCAAGAAGAGGGACCTCAGCTGGCAATCTGGAAACCTGGCCCAGGTCTGGGCAGATGTCTTCACTTCT
CCTACCTTCCCAGTCTTGTGATCCTGTGATGAGCACCAGGATGGCCCTGTGGTCCCTAGAGCACCCCTCATGCTGTAGGGTCCTGCAGCC
CCATCCTTTCTCTACTGGGCCCTGGTATCCTGGCTCCTCTCTCAGCTCTGCCACTGATCTCTGTGCCTTAGTTTACTTCTCTGCACGGGG
GACTCACCCCAAGACCATTTCCAGCAGCTTCCCAGGTGATGTGGTGCCCCAAGGCTGGGCTTTGCCAGCTGTGGCCCAGCTCCTTAGTGC
TGCCCAGGAGACACCAGGCTGCTCAGAATGAGGTGACTGCGGGCACCATTCTCAGCCAGTGGTTCTTGTATTGCATTCCAGCAGCAGGAA
TATCACCTGGGAACTTGATAGAAGTGCAGATTAGCAGCCCCACCCAAGACCCACTGAATTAGAGCTTGTGGAGTGGGGCCCTACAAGCTG
GGGTTTTAAGGAGCCCTCCAAATGATTCTGACGCATAAGAATATGCCAACTGCTGATCTGGGCTAGCCATTAGTAGAGCCTGGGGAGGGA
CTGGGACTGGCTAGGCCAAGAACAGGTGGAAAACACCAGCCTTATCTGGACTCCTGAGATTGGGAACCACCACCAACAAAAACCAACCCT
ATAGTCGCTCCTCTTGGAAGAGGAAGAGAAGTTGAAGGGCCTGGAGAAAGCACACATTGTTTGTTTCCCTGCTCCTGCTCACCTCTCTCA
CTTGTCTTGGTTTTACAAAAGGCTGTGTGGATGGTGCCAGCCAGGGAGGGGGTGGGAGTCCTGGGGAGGCAGGAGGCAGAAGACCCTGAC
TGTTTCTCCCTTGGGAACCTCACCATAGGCCAGATAGCGCCTCTTCAAACTGAAAGAAATCTTAACTCCACAAAGAAAGCATCCTAAATC
CCCAGTTCCTCCTCCTCCCAACCCCAGGGATACCTTGTAGACAGTGCCAAAAAACAGCTCCAACCCCCAGCAGCTGGGAAGAGAGCCAGA
AGCTGCCCTTCCTCCTCATCCTGGCCCCTCCCCAGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCAGCCTGGTTTCCTGGTGAGGGT
CCTGCAGTCATGGGCCCTGGGGGACCCCCAGGGCAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCTTGGGCCAACTGTCCACCTCTC
TTGTCCACTATTCTCTCCTTTCCACTTCTGTCTTCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAGGACAACTGCAGTCCAATCCAC
CAGCTCTCCCTGCCCCTGTGTCTTATTTCAGACATGAGAATAACTGTACAGTGTAAACTTATAAAGCGTTTTTAATGGTTGTAGATTGGA

>8490_8490_9_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000371356_SOGA1_chr20_35445872_ENST00000237536_length(amino acids)=1577AA_BP=3
MGRRLECNGVISAHCNLCFLGLSDSPASASRVAGITGIMDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSN
SSQRIPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSRPLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKG
KFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQR
TATHSLKKRGTRSLGKADKKTLVQEDSADLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRET
ELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMKDHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELE
DQNKLLLNELAKFRSEHELDVALSEDSCSVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGD
SAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAALVSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAIL
VRLSVLQQELNAFTRKADAVLGCSVKEQQESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRS
FRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLSQERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQE
KNMLVQESQQFKHNFLLLFMKLRWFLKRWRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADM
KVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAELKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQ
LQISERNWSQEKLQLVERLQGEKQQVEQQVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKS
VSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSSEPGDPEKDTKEKPGLSSRDCNHLGALACQDPPGRQMQRSYTAPDKTGIRVYY
SPPVARRLGVPVVHDKEGKIIIEPGFLFTTAKPKESAEADGLAESSYGRWLCNFSRQRLDGGSAGSPSAAGPGFPAALHDFEMSGNMSDD
MKEITNCVRQAMRSGSLERKVKSTSSQTVGLASVGTQTIRTVSVGLQTDPPRSSLHGKAWSPRSSSLVSVRSKQISSSLDKVHSRIERPC
CSPKYGSPKLQRRSVSKLDSSKDRSLWNLHQGKQNGSAWARSTTTRDSPVLRNINDGLSSLFSVVEHSGSTESVWKLGMSETRAKPEPPK
YGIVQEFFRNVCGRAPSPTSSAGEEGTKKPEPLSPASYHQPEGVARILNKKAAKLGSSEEVRLTMLPQVGKDGVLRDGDGAVVLPNEDAV

--------------------------------------------------------------
>8490_8490_10_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000371356_SOGA1_chr20_35445872_ENST00000279034_length(transcript)=3856nt_BP=837nt
CTTGGAAGACTTGGGTCCTTGGGTCGCAGGCTGGAGTGCAATGGTGTGATCTCAGCTCACTGCAACCTCTGCTTCCTGGGTTTAAGTGAT
TCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCATCATGGACCGATCTAAAGAAAACTGCATTTCAGGACCTGTTAAGGCTACA
GCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTGACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTAAATAGTGGCCAGGCTCAGCGG
GTCTTGTGTCCTTCAAATTCTTCCCAGCGCATTCCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAGCCGGTTCAGAATCAGAAGCAG
AAGCAATTGCAGGCAACCAGTGTACCTCATCCTGTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCT
GAAAATAATCCTGAGGAGGAACTGGCATCAAAACAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCTTTGGAAGACTTTGAAATTGGT
CGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTTTATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTGGCTCTTAAAGTGTTATTTAAA
GCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAGCTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGGCATCCTAATATTCTTAGACTG
TATGGTTATTTCCATGATGCTACCAGAGTCTACCTAATTCTGGAATATGCACCACTTGGAACAGTTTATAGAGAACTTCAGAAACTTTCA
AAGTTTGATGAGCAGAGAACTGCTACTCATTCCTTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCCGATAAGAAGACTTTGGTGCAG
GAGGACAGTGCAGACCTGAAGTGCCAGTTGCACTTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAGCTCACTAAGCTTGCCAAGGAG
AATGACAGCATGAAGGAGGAGCTGCTGAAGTACCGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCAGCCGAGGAGCTGGCCGATGCC
CCCCACTCGCGGGAGACCGAGCTGAAGGTGCACCTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGCCGCCGCATCGTGGAGCTGGAG
GTGGAGAACCGAGGCCTGCGGGCTGAGATGGACGACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCTGAGGCACGCCTGGCCTTCTCC
GCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTGGCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAGGAGGCCGAGCTGCTGCGGCGC
TCCTCTGCCGAGCTCGAGGACCAGAACAAGCTGCTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCACGAGCTGGACGTGGCGCTGTCG
GAGGACAGTTGTTCTGTGCTCAGCGAACCTTCACAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGCGAGCTCAGCGGCAAGGTCAAG
AAGCTGCAGTACGAGAACCGCGTGCTCCTCTCCAACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGTACGCGGCCCATGCTGGAGACG
GACGCCGAGGCCGGGGACTCTGCCCAGTGTGTGCCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCGGTCCGACTCTGCAGAGCCAGG
GAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAGGCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTGGCTGATGCCAATGGCTTCACG
GCTGGCCTCCGGCTGTGTCTGGACAACGAGTGTGCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGCGAGGGCCCCAGGGACACCAAG
CTCATCCATGCCATCCTGGTGCGCCTGAGCGTGCTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCAGATGCAGTCCTCGGGTGCTCT
GTCAAGGAACAGCAGGAGTCCTTCTCATCACTGCCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATTCTTCTGGCAAAAGACCTTGGC
TCAGACTTTCAGCCACCTGACTTCAGGGACCTGCCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGCACTGGTGACTTGGACTCTAAG
CCCGACCCCAGCCGGAGCTTCAGGCCTTACCGAGCTGAAGACAATGATTCCTATGCCTCTGAGATCAAGGAGCTGCAGCTGGTGCTGGCT
GAGGCCCACGACAGCCTCCGGGGCTTGCAAGAGCAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAGGCCGACAATTTCAACCAGAAA
ATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCGCTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGCCTCCAGCGGAGGCTGGAGCAG
AAATTCTGGAGCCAGGAGAAGAACATGCTGGTGCAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTGCTCTTCATGAAGCTCAGGTGG
TTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTGCCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGCATGAAGGAGCTGTACTTGCTG
ATGGAGGAAGAGGAGATAAACGCTCAGCATTCTGATAACAAGGCCTGCACGGGGGACAGCTGGACCCAGAACACGCCCAATGAGTACATC
AAGACACTGGCCGACATGAAGGTGACGCTGAAGGAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGTCTGACGGAGCTTCAGCAACAG
TTTGCCAAGGCCAAGGCTACCTGGGAGACAGAGCGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAGCTGAAGACAGGGAAGGGGGCC
GGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCCCTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTAGCTGAGTCCTACAGCGCTGTC
ATGGAGCTGACTCGGCAGCTGCAGATCAGTGAGCGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAGCGGCTGCAGGGTGAGAAGCAG
CAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAACCGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGGGTCCTGAAGCACTCGGAGCTG
GAGAAGCAGGACAACAGCTGGAAGGAGACACGCAGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAAGTTGAGCTTGGAGGAAATGGT
TTAAAGAGAACCAAATCTGTTTCTTCCATGTCTGAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTTGCTGGCGGAGATGCCCGGGGC
AAGAAGCTGCCTAACAACCCTGCCTTTGGCTTTGTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACCAAGGAGAAGCCTGGGCTCTCG
TCGAGGGACTGCAACCACCTGGGTGCCCTGGCCTGCCAGGACCCCCCAGGGAGCCAGAAGCTGCCCTTCCTCCTCATCCTGGCCCCTCCC
CAGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCAGCCTGGTTTCCTGGTGAGGGTCCTGCAGTCATGGGCCCTGGGGGACCCCCAGG
GCAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCTTGGGCCAACTGTCCACCTCTCTTGTCCACTATTCTCTCCTTTCCACTTCTGTC
TTCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAGGACAACTGCAGTCCAATCCACCAGCTCTCCCTGCCCCTGTGTCTTATTTCAGA

>8490_8490_10_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000371356_SOGA1_chr20_35445872_ENST00000279034_length(amino acids)=1170AA_BP=3
MGRRLECNGVISAHCNLCFLGLSDSPASASRVAGITGIMDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSN
SSQRIPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSRPLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKG
KFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQR
TATHSLKKRGTRSLGKADKKTLVQEDSADLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRET
ELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMKDHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELE
DQNKLLLNELAKFRSEHELDVALSEDSCSVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGD
SAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAALVSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAIL
VRLSVLQQELNAFTRKADAVLGCSVKEQQESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRS
FRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLSQERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQE
KNMLVQESQQFKHNFLLLFMKLRWFLKRWRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADM
KVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAELKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQ
LQISERNWSQEKLQLVERLQGEKQQVEQQVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKS
VSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSSEPGDPEKDTKEKPGLSSRDCNHLGALACQDPPGSQKLPFLLILAPPQPPPIL

--------------------------------------------------------------
>8490_8490_11_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000395913_SOGA1_chr20_35445872_ENST00000237536_length(transcript)=13852nt_BP=894nt
CTTAAACGCGACTCAAGGCGTCGGGTTTGTTGTCAACCAATCACAAGGCAGCCTCGCTCGAGCGCAGGCCAATCGGCTTTCTAGCTAGAG
GGTTTAACTCCTATTTAAAAAGAAGAACCTTTGAATTCTAACGGCTGAGCTCTTGGAAGACTTGGGTCCTTGGGTCGCAGGTGGGAGCCG
ACGGGCATCATGGACCGATCTAAAGAAAACTGCATTTCAGGACCTGTTAAGGCTACAGCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTG
ACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTAAATAGTGGCCAGGCTCAGCGGGTCTTGTGTCCTTCAAATTCTTCCCAGCGCATT
CCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAGCCGGTTCAGAATCAGAAGCAGAAGCAATTGCAGGCAACCAGTGTACCTCATCCT
GTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCTGAAAATAATCCTGAGGAGGAACTGGCATCAAAA
CAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCTTTGGAAGACTTTGAAATTGGTCGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTT
TATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTGGCTCTTAAAGTGTTATTTAAAGCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAG
CTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGGCATCCTAATATTCTTAGACTGTATGGTTATTTCCATGATGCTACCAGAGTCTAC
CTAATTCTGGAATATGCACCACTTGGAACAGTTTATAGAGAACTTCAGAAACTTTCAAAGTTTGATGAGCAGAGAACTGCTACTCATTCC
TTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCCGATAAGAAGACTTTGGTGCAGGAGGACAGTGCAGACCTGAAGTGCCAGTTGCAC
TTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAGCTCACTAAGCTTGCCAAGGAGAATGACAGCATGAAGGAGGAGCTGCTGAAGTAC
CGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCAGCCGAGGAGCTGGCCGATGCCCCCCACTCGCGGGAGACCGAGCTGAAGGTGCAC
CTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGCCGCCGCATCGTGGAGCTGGAGGTGGAGAACCGAGGCCTGCGGGCTGAGATGGAC
GACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCTGAGGCACGCCTGGCCTTCTCCGCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTG
GCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAGGAGGCCGAGCTGCTGCGGCGCTCCTCTGCCGAGCTCGAGGACCAGAACAAGCTG
CTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCACGAGCTGGACGTGGCGCTGTCGGAGGACAGTTGTTCTGTGCTCAGCGAACCTTCA
CAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGCGAGCTCAGCGGCAAGGTCAAGAAGCTGCAGTACGAGAACCGCGTGCTCCTCTCC
AACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGTACGCGGCCCATGCTGGAGACGGACGCCGAGGCCGGGGACTCTGCCCAGTGTGTG
CCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCGGTCCGACTCTGCAGAGCCAGGGAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAG
GCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTGGCTGATGCCAATGGCTTCACGGCTGGCCTCCGGCTGTGTCTGGACAACGAGTGT
GCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGCGAGGGCCCCAGGGACACCAAGCTCATCCATGCCATCCTGGTGCGCCTGAGCGTG
CTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCAGATGCAGTCCTCGGGTGCTCTGTCAAGGAACAGCAGGAGTCCTTCTCATCACTG
CCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATTCTTCTGGCAAAAGACCTTGGCTCAGACTTTCAGCCACCTGACTTCAGGGACCTG
CCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGCACTGGTGACTTGGACTCTAAGCCCGACCCCAGCCGGAGCTTCAGGCCTTACCGA
GCTGAAGACAATGATTCCTATGCCTCTGAGATCAAGGAGCTGCAGCTGGTGCTGGCTGAGGCCCACGACAGCCTCCGGGGCTTGCAAGAG
CAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAGGCCGACAATTTCAACCAGAAAATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCG
CTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGCCTCCAGCGGAGGCTGGAGCAGAAATTCTGGAGCCAGGAGAAGAACATGCTGGTG
CAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTGCTCTTCATGAAGCTCAGGTGGTTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTG
CCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGCATGAAGGAGCTGTACTTGCTGATGGAGGAAGAGGAGATAAACGCTCAGCATTCT
GATAACAAGGCCTGCACGGGGGACAGCTGGACCCAGAACACGCCCAATGAGTACATCAAGACACTGGCCGACATGAAGGTGACGCTGAAG
GAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGTCTGACGGAGCTTCAGCAACAGTTTGCCAAGGCCAAGGCTACCTGGGAGACAGAG
CGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAGCTGAAGACAGGGAAGGGGGCCGGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCC
CTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTAGCTGAGTCCTACAGCGCTGTCATGGAGCTGACTCGGCAGCTGCAGATCAGTGAG
CGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAGCGGCTGCAGGGTGAGAAGCAGCAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAAC
CGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGGGTCCTGAAGCACTCGGAGCTGGAGAAGCAGGACAACAGCTGGAAGGAGACACGC
AGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAAGTTGAGCTTGGAGGAAATGGTTTAAAGAGAACCAAATCTGTTTCTTCCATGTCT
GAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTTGCTGGCGGAGATGCCCGGGGCAAGAAGCTGCCTAACAACCCTGCCTTTGGCTTT
GTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACCAAGGAGAAGCCTGGGCTCTCGTCGAGGGACTGCAACCACCTGGGTGCCCTGGCC
TGCCAGGACCCCCCAGGGAGGCAGATGCAGCGCAGCTACACGGCTCCTGACAAGACGGGCATCCGAGTCTACTATAGTCCCCCGGTGGCC
CGGCGCCTCGGAGTCCCTGTGGTTCATGACAAAGAGGGCAAGATCATTATCGAGCCCGGCTTCCTCTTCACCACAGCCAAGCCCAAAGAG
TCGGCCGAGGCTGATGGGCTGGCTGAGAGCTCCTATGGTCGGTGGCTCTGCAACTTCTCACGGCAGCGCCTGGACGGAGGCTCAGCGGGC
AGCCCCTCGGCGGCCGGGCCTGGCTTCCCAGCGGCCCTGCATGACTTTGAGATGTCAGGCAACATGAGTGATGACATGAAGGAGATCACC
AACTGTGTGCGCCAGGCCATGCGCTCCGGCTCACTGGAGAGGAAAGTGAAGAGCACATCCAGCCAGACGGTGGGCCTGGCCAGTGTGGGC
ACACAGACCATCCGCACGGTCAGCGTGGGCCTGCAGACCGACCCACCCCGCAGCAGCCTCCATGGCAAGGCCTGGTCACCCCGCAGCTCT
TCGCTCGTGTCTGTGCGCAGCAAGCAGATCTCCTCCTCCCTGGACAAGGTCCATTCGCGCATCGAGCGGCCCTGCTGCTCCCCCAAGTAT
GGCTCACCAAAGCTCCAGAGGCGGTCTGTGTCCAAGCTGGACAGCAGCAAGGACCGCAGCCTGTGGAACCTGCACCAGGGCAAGCAGAAC
GGCTCGGCCTGGGCCCGCTCCACCACCACGCGGGACAGCCCTGTATTGAGAAACATCAACGATGGACTCTCCAGCCTCTTCAGTGTGGTG
GAGCACTCAGGGAGCACGGAGTCTGTCTGGAAACTAGGCATGTCTGAGACGCGGGCCAAGCCCGAGCCTCCCAAGTACGGCATTGTGCAG
GAATTCTTCCGTAATGTGTGTGGCCGGGCACCGAGCCCCACCTCATCAGCAGGAGAGGAGGGCACCAAGAAGCCAGAGCCCCTCTCCCCA
GCCAGCTACCATCAGCCAGAGGGTGTGGCCAGGATCCTGAACAAGAAGGCAGCCAAGTTGGGCAGCAGTGAGGAGGTCAGACTCACCATG
CTCCCCCAGGTGGGGAAGGATGGTGTCCTCCGGGACGGAGATGGAGCCGTGGTCCTTCCCAATGAGGACGCTGTTTGTGACTGTAGTACC
CAGTCTCTCACCTCCTGCTTCGCCCGATCGTCCCGCTCTGCCATCCGCCACTCTCCTTCCAAGTGCAGGCTGCACCCTTCAGAGTCCAGC
TGGGGTGGGGAGGAGAGGGCACTCCCCCCCAGCGAGTGACAGAGCAGCCAAGCTCCCCGCCTCAACCAGCCCAGCCCCTGGATAGCAGAA
GGGAACCAGCAGAGACGAGACGAGGTGAGGCGAGGGGCTGTGTCCTCAGCATTGCCTGGCCCTGGAGGGACAGCAGTGATGCCACTGCCA
GAATGCAGCTTTCACATCAAGGTAAAGCCGGGTCTCCTGCTGGCCCCTGGGTGGTGAGCTTCGACTTCCCAGGGGAAGGCAGTGAGTGGG
AGAGAGACCAAACCTGGGCTTCCCAAGCATCCACTGAGAGATCTGTCAAGAGCCGATCCCTGGGTCCTAAGAGAGAGCCTTGCCTGGTTC
TGCCCATGCCACCCTCTTGGAAGAGCCCAAGAAGGATACATGTCTGGCCATGCCTTTGGGGAAAAGGAGTCGGAGAGATGTTTCCTGCTG
ACCATCCACCCCTTCATTTGGGAGGAGACACTGCTGAGAAGAACAGGCTTTGCTCTAGGGCTCCATGTTTGGTTCCTGGTGGAGCCCTGT
TGGGCATCATCACCATCACCTCCTTCTCTCCACCACCTCCTCCTCCCAGCCCCACTGCTCTAGATTGCTGGGACACTAGGGAGTATGATA
GGGCAGTAGCCAGGGCCATTGCTTAGTGTCCTGGAGCCCTGGATCTCCCTGCCCATAGCCTGGATGCAGCAAGAGCTGGGAGGCGAAGTG
GAAACATGCAGGGCTCAGGGTTGGGGAGTGATTGCAATTGCCTTCCTTGCCAAAGTGACTTGGGGCCCCAACGTTCCCAGCAGACCCCTT
GAGGACAGAAATAGGTAGAGTCAGTCTCAAGACCTGGTGCATAGATAAATGCCTAAATACACTGCCTTGATCTCAAGTGATCTCAGAGGC
CTCTTTCCCTGGCACCCTGAGAGGCAGCAGGCACTACATCTCCACTGTGTTTACATCCTGCAGCTGGTCGGGGGGCAAAGATATTCCCAG
TAAGAGATTCTTGGTTGGCCAGGTCAGGCCCAGGAGAACACCAAGAGGCCAGAGCCCAGGACACAGCAGTAGACTGGGGCCTGGAAACAC
GTATCTTGCCTAGATTGTTTATTTGAATTTTTCCTACTATAAATATTTAAGGTGGTTTACTTTATTTTAATAATTTAATTTACCCCAAAG
TCCCTAAGGTAATTTATTGGAGGTTGAAACATGCATTCTTGCCACTGGGACAACATGAGGCCTCTAACAGCACGGGCAGGCATGGGGTCC
CCTGGGTGGACGAGGCCGCTTGGCAGCCAGGTTTGGAGACCTGGCCTCCTGGTCAGCTTTGGAGGGCCCCTCAACAGAGCTGGAGCCCTG
CACCCCAACACGGCTGGCCATGTGGCCTCAGAACACTACTTATTACTCAATGCCTGGTACTTGGCCAGCCCCAGCGGTCAGTCTATAAAT
ACTCACTGACAAGGTGGAGGGCTGGACGGCCATCACCACTCCCCAGACGTTCTCCATTGCCTGTCTCATTTCCCTCCCTCCCTCTGACAC
CTTTCTTCATGAGTCGAACGTGGATTACTAAAGCTCTATTAAGAGTGTGGAGATCCCTCCAAGTTTCCCAAATGAGAACTCACAGGAAAA
CAGGACTGAACTTTGAGAATGTTGTTTATCGCAGCTTTGCACATAAACCTGAGTGTCTCCCAGCCTGCCTCGGTTCTCACCAGCCTGCCA
GCCTTTTCACCAGCCTCTCTCCTTAGCCTTATGGCCTTTCACGGCTCTTCTCCCTGCCCCAGCTCTGCTGCCCGCCCTTCCTCACGTCCC
CTGTGAGCTGCCTGAGCCATTGGTTGGATTTCGATGTGGCTCATTGCAGCATGTGGGGCAGCGCCTCCCATGGCCTCGCCTTGGTGCCGG
TGAACCCCTTTTGGTTGCACACATGCTCCCCACACACACATAGACATCAGCCTTCCTGTAACTGACTGGGGACCCAGAGTGGAAACACCA
GGATGGATCAGCTTGTCTGCAGAATTGCCCATCAGGAAGACCAAAAGCCAGTAGCTTTGCTGATCTGCCCCCAGGACTCTGGAGGCGCCC
CTGCACTCCCACCTCCCACCTGCCAGTTCCCAGACCCACCCATTCGGGATCACCTGGACCAGTTACCCCAAGTCCTGCATCTCCCTTCCC
TGCAGGCTGAACACCAGGGTCATGCCAGTCCCGCCAGCCGCCTCCTCCATGCCCCAGTGACTGGTGTGGGCAGAGCAGGCAGCCAGTGGA
GCTGTGGGCCAGTTCCGCTCTTGGATGCTGCTGCTCTCACCCATGAGGTCAGGGGGGCCCTCCAAGGTTATCTCCAGGTGAGGGGATTCA
CATCAGGCCACAAGCCACCAGAGGCCTTCTGCCACCTCCCAGAGCGACAGCCAGGGAGGCTGCGTACTCAGCCTCGGGGAGAAATCCCCG
TGGGACCTGAGCCCCAAGACCTACGGACCACTCAGCCTTACCATCGTACCGTCCAGGATTGTCCTTGCCATCTTTGTTGTCTCAGCCAGA
CCTTGGTTTTCAGTAAAGCCCCAGTTTCTACTTCCTGCATGCCACTGTGCAAGGCCACTCATCACTGTTCCTGCAGAAGCCTCTGGACGT
GGGGCTGGATGGGGTTGAAAATGTTACATGTAAATATTGGTTTGGTTCGGTTTTTAGCATTTTACTTGGTAACTGGTTGTTTTCTTTTTT
GGGGTGGGGGGATTGGTTTGTAAAAATTCTCTACTCTTTTGGAATGTGATTTCTAAGTTTGTTGGTTTCTTCAAATGCCTTTTAAGTCTT
GGTAACATTCCCAAAGCAGAAAACTGCCTGACCCACAGTGGGGATTCCCTGGAGAATTGGGGTCCCAAGAAGGAATGCTGCCCTTCTCGA
ACCCGTTCTCCCCCTTCCTCCTGCCTCTCTGCCTTTTACTGCTATTCCCTTCTTCTCCTCCTTTATCCTTCTTTCTGTTTTCCCCATCTC
CACTCTCTCTTCAACCAAAGTCCCAAGGAACCCTCGGGGCTCAATCCCCCATAGACCACTTGGCTTGGGTCCATGGGGTTGGCATCAGTT
GGTTGGCGGAAATGGGGGACCAGTTGGCATGATGGCCCTAAACTGGGAAACCTCATGTTTCTTATGTCTCACCTCTTTCCAGAGCCAAAT
CAGCCCCTTTTGGAATGATGACTTCATTGGAATGCAAATCAAGTCATTTTGGTGCATCAGTGGCTCTTAGGCCTGCACACACGAGACATC
AGAATCCAATCCTCTGACCCTGTGCCAGCCCTTTCCCCCAGTTTATTTCCCACCAAAGGCTGACCTCTAAGAGGTCTTGCTTTCTATGAA
CTCAAGATGGGTCCCACCTCTAGGTGTCCCCAGGTGCACTCTTCTACCGGTTGGCTTCCGATGTGACAAGGCCAAGGGCCCAAAGACTTG
ACCCTCTTACACCCTTGCTGACATGGTTCCATCATGTCCACCCGCATGCACTTTTATGGTTTCATCACCCAGCCTCTTCTCCTCTGGCCC
ACCCAGCGTCCAGGCTCTTTCTCCCTCTCCCCTCCTATCTAGAATGTCCCCTGCTTCTAGCCTCACCAGACCCCCCAAGCTCCCACTACT
TCTTCCATAATAATAGTAATAACAATGGTTATCATCATCCCCTGCACATCCCGCCTAAAGCACTTTACTATATAGAAAACGTTTCCCCTG
GCCGGGCATGGTGGCTCACGCCTGAAATCCCAGCTCTTTGGGAGGCTGAGGCGAGCGGATCAGTTGAGGTCAGGAGTTCAACGCCAGCCT
GGCCAACGTGGTGAATCCCTGTCTGTACTAAAAGTACAAAAAATTAGCTGAGCATGGTGGTGCGTGCCTGTAATCCCAGCTACTCGGGAG
GCTGAGGTGGGAGAATCGCTTGAGCCCAGGAGGCGGAGGTTGCAGGAGCAGAGATTGCGCCACTGCACTCCAGCCTGGATGACAGAGTGA
GACCCAATCTCAAAAAAGAAATCGTTTCCCACCCCACATCTCCTTCAGACCTCTCAGGGACAACTCTGGGAGGCAGCCTTGGCAGGACAT
GGGTTAGTGCGCCCATTTTGCTGTGAGGAAACTGAGGTACAGGTCTCATCCCAGAGCATGAGAAGTCACTGAGTTTAGATGAGAACTTGG
GTCCAACTCTGTCCTGTTTGCTGTGCAAATCCGCTGCCCTGCTGGGGGCTTTTGGTGGGTCCAGAATACCCAGAATATGCTGCTGGCCAA
CCCAGGCATAAAACAAGTCCATTCTAGATCACTGAGCCTTGTGTATTCCAGAGGGTGATCTGAGGTCCCCATTCAGCAGAATTCTCTGAG
GGCATGTTCAGAATGTAGATTCCTGGGCCCCACCTTGAATTTGCATGTTTAACAAACTCTCCTGGGGTTGAGGGGTGGGTGCAGTGGTCA
CACCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGTGGATCACTTGAGCTCAGGAGTTCGAGCCTGGCCAATATGGTGAAACCCTGTC
TCTACTAAAAATGCAAAAATTAGCCAGGTGTGGTGGCACATGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCACTTGAA
CCTGGTGGGGAGCAGCGGTTGCAGTGAGCCGAGATTGTGCCATTGCGCTCTAGCCTGGGCAACGGAATGAGACTTGTCTCAAAAAAAATA
AAAAATAAAACCAGCCCTCCCCGGGGGATCTTAGGCACTATTGGCCACACCATTGGTGTTCGTGGCCCTGATTATTAGGCTATCTTTCTT
TTTTTAAGTTTTTTTAGATTTATTTTTTATTTTATTTATTTATTTATTTATTATTTATTATTATTATTTTTAGAGACAGGGGGTCTCCCT
ATGTTGCCCAGGCTGGTTTCAAACTCCTGGGCTCAAGTGATCTGCCCTCCTCAGCCTCCCAAAGTGCTGGGGTTACAGGCAGGCATCAGC
CACCGTGCCAGGTCATCTTCCTTTTTCTTTTTTTTTTGGAGACAGAGTCTTGCTCTGTCATTCAGGCTGGAGTACAGTGGCGCCATTTCA
GCTCACCGCAGCCTCCACCTCCCAGGTTCAAGCAGTTCTCCTGCCTCAACCTCCCGAGTAACTGGGATTACAGGTGTGTGCCACCACACC
CGACTAATTTTTTTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCACCCACC
TCGGCCTCCCAAAGTACTGGGATTATAGGCGTGAGCCACGGCATCCAGCCTCATCTTTCTTTTAACCAATAAACATGATGCTGTATCTTA
AAAAGAGCACTGAGCAGGGACTTAAGGGATCGAGTCCTCAACCAAACTGATTTAATTACTCAGGATTTTCAAAAAGCATCAGAGGCTATT
TACAATCTTAATCATAGGGGTTCAGTAAAATAAAAATAAGAAGTAAAAAAGCAAGAGAAATTATTCTGTAAATCTAACTGGTGTAATTCC
CATAATCATGCAATTAAGTTTTACTCTTGAGTTTCCTGACAGCCATTGGTAAAAAGAGAAACACATCAGGATTTATAATTTTTATCATCC
AATTATGGGAAGCAAGCATGTTGGCCCCAGGAGACGAACTCTTCTACTAATTTATAGCATGTATCCTTTCATCAAGGGCCACTGAGCCAG
TTGGTGAGTCAACGGGTGAACCTAAGATGCAAGGATGTTTTCCAGGTGACTATTTAAAAAAATAAATAATCCACCAACGTGATTGACCTT
GGCGAGATCATGTTTCTAGTCTATACCTCAGTTTCCCCATCTGTAAAGTGAGGATAATGTCCCACCCCATGTAACTGTGGTGAGGACCAA
CTGCAACACTGTGCCTGCGAGTCTCCTTGGAAAAGTGTAAGGTTCTACACAAATGGAAAGTGATCTGATCACACTCAGTGTCCCCAGCCC
AGCCTTTCAGTGCCCTGGCCCTGGGGTGGGGGACAATACTCTCCTCACCCCCTTCACTAGTCTTCATGAATAGCAAGGAGGCCATAACAT
AATTTGGTCTAAACCCCTTCCTTTTTAAAAGAATGATGGCAAAATGTGCATAACATTAAATTTACCATTTTAAAGATTTCTAAGTGTCTC
GAAGTACATTTGCAATGTGTAACTGCCACCTCCAGAACTTTTTCATCATCCTAAACTGAAACTCTGTACCCACTAAACAGTAACTCCCCA
CTCCCCCTGTCCCCAGTCCCTGGTAACCTCTATTCTACTTTTTTTTTTTCTCTGTGGATTGGCCTATTCTAGGTATCTCTTTTAAGTAGA
ATCATATAGTATTTGTCCTTTTGTGTCCAGCCCCCTCATTTTTTGAGATGAGGAATCAGGCCCAGAGAAGGCAGTGGCTCACCCAAGGTC
ACATCGCAAACCAGAGGCAGAGCCAAGACCAGAACCCAGGTTTCCTGACTCCTAACAGGTCTTGCATCCCCAGTAGTGAATGAGACACTG
CTTTGACTTTCTGTAATCTTGGTTTAGCCCCTTCCTTTCTCTGGGCTCAGTCTGCTTCCCACCACTCACCCATTCTTTCTTTATCATCTC
CCTCCAAAGCCTCTTGTCCTCCTGCCTCCTCTTCTCCTTGGCTGGTTCCTGCCAGCAAATGGAGTGGCCCTGGTCCATGCCAGGTTTTCC
TCTTCTGGGTCCGGAGCTCACTATAGTATTCAGCCCTCAGTCCTCCCAGGATGTTCTTCCCAGCACTGCTGCCTCGTGCGGATTTTCCCG
TAACCTCAGTAACTGGCTTCTTGTCCCCCTGCTTCCTACCAGGGAAGCCTTCCTGTCCGCGTCTGTGGTTTCCCAACTCTGATATTTGCT
CTCAAATGTGGTGGTGTCCTGGTTCTGTGTTTATTTATTTTGTGTTTTCTCACACACACAAAAAAAAGCCTCTTCCTCCAATGCATCAGG
AGGCACCAGCCCTGCCAGCCCTTCTCACTGGGCTCACCCTGCCCCAGCAACCCCCCGGTGCCACCCTTAAGCCACACTGTCTTCTCTGTA
AGCAGCCTGCCAGCAGCAGCCCCAGCACTTTGCAATGGGCGTGTGTGTGGTGGTGGGTGGGGGGGGCTTGGATCCCTCCTTTTTCCTCCC
CTGCCCTGCCCAGGCCCAGATGGCCTTGACTGTAAAGCAGGTGCTGCCTGACAGGTTCTTCTCTCCCTGTCTCTGGTCATTGATCCATCT
CTTTGTCCATTCAGTATCCAACCATCCTCTCCATTCTCCTCTGGACCTCACCACTCTCAGAGCTGCTTGTCCTGGCAGAATCTACAGTTC
ACCCCAACTCTATGCCTTACCCCTCCCAACCCAACAGCATTTGCAGTTTGCAAAATATACAGACCCAAGTCCTGAGGGGACTGAGGACAT
GATGCTGGGCCCAAGTCTCCTGCTCAGGGCTTCTCTCCAATGCCAGCCCTGCCACTCCTTCCTCACCCTCCTTGGAGCCTCCTCTGCTGC
TTGTCTATCCCAACGGCCCTGCTCCCCTCCCTTCCTGCCCTTCACCAGCTTTCTGGACACCATGCCCTGAGGAAGGGACCTTTGGTTTTC
TCTAAACATCTTTGAAGGGCTGAGGCAGTCAGGGCTGGCTGCCTTGTCACTCTTTATTTGGAAGCCACTCAAACCATTCCCAAGAAGAGG
GACCTCAGCTGGCAATCTGGAAACCTGGCCCAGGTCTGGGCAGATGTCTTCACTTCTCCTACCTTCCCAGTCTTGTGATCCTGTGATGAG
CACCAGGATGGCCCTGTGGTCCCTAGAGCACCCCTCATGCTGTAGGGTCCTGCAGCCCCATCCTTTCTCTACTGGGCCCTGGTATCCTGG
CTCCTCTCTCAGCTCTGCCACTGATCTCTGTGCCTTAGTTTACTTCTCTGCACGGGGGACTCACCCCAAGACCATTTCCAGCAGCTTCCC
AGGTGATGTGGTGCCCCAAGGCTGGGCTTTGCCAGCTGTGGCCCAGCTCCTTAGTGCTGCCCAGGAGACACCAGGCTGCTCAGAATGAGG
TGACTGCGGGCACCATTCTCAGCCAGTGGTTCTTGTATTGCATTCCAGCAGCAGGAATATCACCTGGGAACTTGATAGAAGTGCAGATTA
GCAGCCCCACCCAAGACCCACTGAATTAGAGCTTGTGGAGTGGGGCCCTACAAGCTGGGGTTTTAAGGAGCCCTCCAAATGATTCTGACG
CATAAGAATATGCCAACTGCTGATCTGGGCTAGCCATTAGTAGAGCCTGGGGAGGGACTGGGACTGGCTAGGCCAAGAACAGGTGGAAAA
CACCAGCCTTATCTGGACTCCTGAGATTGGGAACCACCACCAACAAAAACCAACCCTATAGTCGCTCCTCTTGGAAGAGGAAGAGAAGTT
GAAGGGCCTGGAGAAAGCACACATTGTTTGTTTCCCTGCTCCTGCTCACCTCTCTCACTTGTCTTGGTTTTACAAAAGGCTGTGTGGATG
GTGCCAGCCAGGGAGGGGGTGGGAGTCCTGGGGAGGCAGGAGGCAGAAGACCCTGACTGTTTCTCCCTTGGGAACCTCACCATAGGCCAG
ATAGCGCCTCTTCAAACTGAAAGAAATCTTAACTCCACAAAGAAAGCATCCTAAATCCCCAGTTCCTCCTCCTCCCAACCCCAGGGATAC
CTTGTAGACAGTGCCAAAAAACAGCTCCAACCCCCAGCAGCTGGGAAGAGAGCCAGAAGCTGCCCTTCCTCCTCATCCTGGCCCCTCCCC
AGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCAGCCTGGTTTCCTGGTGAGGGTCCTGCAGTCATGGGCCCTGGGGGACCCCCAGGG
CAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCTTGGGCCAACTGTCCACCTCTCTTGTCCACTATTCTCTCCTTTCCACTTCTGTCT
TCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAGGACAACTGCAGTCCAATCCACCAGCTCTCCCTGCCCCTGTGTCTTATTTCAGAC

>8490_8490_11_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000395913_SOGA1_chr20_35445872_ENST00000237536_length(amino acids)=1549AA_BP=245
MGRRWEPTGIMDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSNSSQRIPLQAQKLVSSHKPVQNQKQKQLQ
ATSVPHPVSRPLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLE
KAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATHSLKKRGTRSLGKADKKTLVQEDSA
DLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENR
GLRAEMDDMKDHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELAKFRSEHELDVALSEDSC
SVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEV
LPGLREQAALVSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELNAFTRKADAVLGCSVKEQ
QESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHD
SLRGLQEQLSQERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQFKHNFLLLFMKLRWFLKR
WRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKA
KATWETERAELKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQEKLQLVERLQGEKQQVEQ
QVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLP
NNPAFGFVSSEPGDPEKDTKEKPGLSSRDCNHLGALACQDPPGRQMQRSYTAPDKTGIRVYYSPPVARRLGVPVVHDKEGKIIIEPGFLF
TTAKPKESAEADGLAESSYGRWLCNFSRQRLDGGSAGSPSAAGPGFPAALHDFEMSGNMSDDMKEITNCVRQAMRSGSLERKVKSTSSQT
VGLASVGTQTIRTVSVGLQTDPPRSSLHGKAWSPRSSSLVSVRSKQISSSLDKVHSRIERPCCSPKYGSPKLQRRSVSKLDSSKDRSLWN
LHQGKQNGSAWARSTTTRDSPVLRNINDGLSSLFSVVEHSGSTESVWKLGMSETRAKPEPPKYGIVQEFFRNVCGRAPSPTSSAGEEGTK
KPEPLSPASYHQPEGVARILNKKAAKLGSSEEVRLTMLPQVGKDGVLRDGDGAVVLPNEDAVCDCSTQSLTSCFARSSRSAIRHSPSKCR

--------------------------------------------------------------
>8490_8490_12_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000395913_SOGA1_chr20_35445872_ENST00000279034_length(transcript)=3913nt_BP=894nt
CTTAAACGCGACTCAAGGCGTCGGGTTTGTTGTCAACCAATCACAAGGCAGCCTCGCTCGAGCGCAGGCCAATCGGCTTTCTAGCTAGAG
GGTTTAACTCCTATTTAAAAAGAAGAACCTTTGAATTCTAACGGCTGAGCTCTTGGAAGACTTGGGTCCTTGGGTCGCAGGTGGGAGCCG
ACGGGCATCATGGACCGATCTAAAGAAAACTGCATTTCAGGACCTGTTAAGGCTACAGCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTG
ACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTAAATAGTGGCCAGGCTCAGCGGGTCTTGTGTCCTTCAAATTCTTCCCAGCGCATT
CCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAGCCGGTTCAGAATCAGAAGCAGAAGCAATTGCAGGCAACCAGTGTACCTCATCCT
GTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCTGAAAATAATCCTGAGGAGGAACTGGCATCAAAA
CAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCTTTGGAAGACTTTGAAATTGGTCGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTT
TATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTGGCTCTTAAAGTGTTATTTAAAGCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAG
CTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGGCATCCTAATATTCTTAGACTGTATGGTTATTTCCATGATGCTACCAGAGTCTAC
CTAATTCTGGAATATGCACCACTTGGAACAGTTTATAGAGAACTTCAGAAACTTTCAAAGTTTGATGAGCAGAGAACTGCTACTCATTCC
TTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCCGATAAGAAGACTTTGGTGCAGGAGGACAGTGCAGACCTGAAGTGCCAGTTGCAC
TTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAGCTCACTAAGCTTGCCAAGGAGAATGACAGCATGAAGGAGGAGCTGCTGAAGTAC
CGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCAGCCGAGGAGCTGGCCGATGCCCCCCACTCGCGGGAGACCGAGCTGAAGGTGCAC
CTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGCCGCCGCATCGTGGAGCTGGAGGTGGAGAACCGAGGCCTGCGGGCTGAGATGGAC
GACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCTGAGGCACGCCTGGCCTTCTCCGCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTG
GCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAGGAGGCCGAGCTGCTGCGGCGCTCCTCTGCCGAGCTCGAGGACCAGAACAAGCTG
CTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCACGAGCTGGACGTGGCGCTGTCGGAGGACAGTTGTTCTGTGCTCAGCGAACCTTCA
CAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGCGAGCTCAGCGGCAAGGTCAAGAAGCTGCAGTACGAGAACCGCGTGCTCCTCTCC
AACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGTACGCGGCCCATGCTGGAGACGGACGCCGAGGCCGGGGACTCTGCCCAGTGTGTG
CCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCGGTCCGACTCTGCAGAGCCAGGGAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAG
GCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTGGCTGATGCCAATGGCTTCACGGCTGGCCTCCGGCTGTGTCTGGACAACGAGTGT
GCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGCGAGGGCCCCAGGGACACCAAGCTCATCCATGCCATCCTGGTGCGCCTGAGCGTG
CTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCAGATGCAGTCCTCGGGTGCTCTGTCAAGGAACAGCAGGAGTCCTTCTCATCACTG
CCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATTCTTCTGGCAAAAGACCTTGGCTCAGACTTTCAGCCACCTGACTTCAGGGACCTG
CCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGCACTGGTGACTTGGACTCTAAGCCCGACCCCAGCCGGAGCTTCAGGCCTTACCGA
GCTGAAGACAATGATTCCTATGCCTCTGAGATCAAGGAGCTGCAGCTGGTGCTGGCTGAGGCCCACGACAGCCTCCGGGGCTTGCAAGAG
CAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAGGCCGACAATTTCAACCAGAAAATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCG
CTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGCCTCCAGCGGAGGCTGGAGCAGAAATTCTGGAGCCAGGAGAAGAACATGCTGGTG
CAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTGCTCTTCATGAAGCTCAGGTGGTTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTG
CCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGCATGAAGGAGCTGTACTTGCTGATGGAGGAAGAGGAGATAAACGCTCAGCATTCT
GATAACAAGGCCTGCACGGGGGACAGCTGGACCCAGAACACGCCCAATGAGTACATCAAGACACTGGCCGACATGAAGGTGACGCTGAAG
GAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGTCTGACGGAGCTTCAGCAACAGTTTGCCAAGGCCAAGGCTACCTGGGAGACAGAG
CGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAGCTGAAGACAGGGAAGGGGGCCGGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCC
CTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTAGCTGAGTCCTACAGCGCTGTCATGGAGCTGACTCGGCAGCTGCAGATCAGTGAG
CGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAGCGGCTGCAGGGTGAGAAGCAGCAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAAC
CGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGGGTCCTGAAGCACTCGGAGCTGGAGAAGCAGGACAACAGCTGGAAGGAGACACGC
AGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAAGTTGAGCTTGGAGGAAATGGTTTAAAGAGAACCAAATCTGTTTCTTCCATGTCT
GAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTTGCTGGCGGAGATGCCCGGGGCAAGAAGCTGCCTAACAACCCTGCCTTTGGCTTT
GTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACCAAGGAGAAGCCTGGGCTCTCGTCGAGGGACTGCAACCACCTGGGTGCCCTGGCC
TGCCAGGACCCCCCAGGGAGCCAGAAGCTGCCCTTCCTCCTCATCCTGGCCCCTCCCCAGCCCCCGCCAATACTGTGAACCCCCTTCCCA
CTCAGCCTGGTTTCCTGGTGAGGGTCCTGCAGTCATGGGCCCTGGGGGACCCCCAGGGCAAGGCCCATGGGAGGGAAGGGACCAAGGGCA
TCCTTGGGCCAACTGTCCACCTCTCTTGTCCACTATTCTCTCCTTTCCACTTCTGTCTTCAAAAGGCTCCTTCCTAGGATGGATCGGGTG
CTAGGACAACTGCAGTCCAATCCACCAGCTCTCCCTGCCCCTGTGTCTTATTTCAGACATGAGAATAACTGTACAGTGTAAACTTATAAA

>8490_8490_12_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000395913_SOGA1_chr20_35445872_ENST00000279034_length(amino acids)=1142AA_BP=245
MGRRWEPTGIMDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSNSSQRIPLQAQKLVSSHKPVQNQKQKQLQ
ATSVPHPVSRPLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLE
KAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATHSLKKRGTRSLGKADKKTLVQEDSA
DLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENR
GLRAEMDDMKDHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELAKFRSEHELDVALSEDSC
SVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEV
LPGLREQAALVSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELNAFTRKADAVLGCSVKEQ
QESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHD
SLRGLQEQLSQERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQFKHNFLLLFMKLRWFLKR
WRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKA
KATWETERAELKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQEKLQLVERLQGEKQQVEQ
QVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLP

--------------------------------------------------------------
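The sequence records above share a header layout that packs the fusion name, both partner genes with their chromosome, breakpoint coordinate and Ensembl transcript, the sequence length, and a BP value into one underscore-delimited line. A minimal Python sketch for splitting such a header into fields follows. The pattern is inferred only from the headers shown on this page and is not a documented FusionGDB2 format. The field names (`record_id`, `hgene_bp`, and so on) are hypothetical labels chosen for illustration, and reading `BP` as the breakpoint position within the transcript or protein is an assumption.

```python
import re

# Pattern inferred from the header lines on this page (an assumption,
# not an official format specification).
HEADER_RE = re.compile(
    r"^>(?P<record_id>\d+_\d+_\d+)_"          # leading numeric fields, meaning not stated on the page
    r"(?P<fusion>[^_]+)_"                      # e.g. AURKA-SOGA1
    r"(?P<hgene>[^_]+)_(?P<hgene_chr>chr[^_]+)_(?P<hgene_bp>\d+)_(?P<hgene_enst>ENST\d+)_"
    r"(?P<tgene>[^_]+)_(?P<tgene_chr>chr[^_]+)_(?P<tgene_bp>\d+)_(?P<tgene_enst>ENST\d+)_"
    r"length\((?P<kind>transcript|amino acids)\)=(?P<length>\d+)(?P<unit>nt|AA)_"
    r"BP=(?P<bp>\d+)(?:nt)?$"                  # 'nt' suffix appears only on transcript headers
)


def parse_header(line):
    """Return the header fields as a dict, or None if the line does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        return None
    fields = match.groupdict()
    # Convert the numeric fields from strings to integers.
    for key in ("hgene_bp", "tgene_bp", "length", "bp"):
        fields[key] = int(fields[key])
    return fields


if __name__ == "__main__":
    example = (
        ">8490_8490_12_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000395913_"
        "SOGA1_chr20_35445872_ENST00000279034_length(transcript)=3913nt_BP=894nt"
    )
    for name, value in parse_header(example).items():
        print(f"{name}: {value}")
```

Returning `None` for non-matching lines lets the same function be used to skip sequence lines and separators when scanning the page text line by line.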


Fusion Gene PPI Analysis for AURKA-SOGA1


See the ChiPPI (Chimeric Protein-Protein Interactions) page for the chimeric PPIs.


Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
Hgene | Hgene's interactors | Tgene | Tgene's interactors


- Retained PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Still interaction with


- Lost PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with


- Retained PPIs, but lost function due to frame-shift fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with



Related Drugs for AURKA-SOGA1


Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
Partner | Gene | UniProtAcc | DrugBank ID | Drug name | Drug activity | Drug type | Drug status


Related Diseases for AURKA-SOGA1


Diseases associated with fusion partners.
(DisGeNet 4.0)
Partner | Gene | Disease ID | Disease name | # pubmeds | Source
Hgene | AURKA | C0002938 | Aneuploidy | 1 | CTD_human
Hgene | AURKA | C0006142 | Malignant neoplasm of breast | 1 | CTD_human
Hgene | AURKA | C0009402 | Colorectal Carcinoma | 1 | CTD_human
Hgene | AURKA | C0009404 | Colorectal Neoplasms | 1 | CTD_human
Hgene | AURKA | C0022665 | Kidney Neoplasm | 1 | CTD_human
Hgene | AURKA | C0024668 | Mammary Neoplasms, Experimental | 1 | CTD_human
Hgene | AURKA | C0025202 | melanoma | 1 | CTD_human
Hgene | AURKA | C0027819 | Neuroblastoma | 1 | CTD_human
Hgene | AURKA | C0033578 | Prostatic Neoplasms | 1 | CTD_human
Hgene | AURKA | C0376358 | Malignant neoplasm of prostate | 1 | CTD_human
Hgene | AURKA | C0678222 | Breast Carcinoma | 1 | CTD_human
Hgene | AURKA | C0740457 | Malignant neoplasm of kidney | 1 | CTD_human
Hgene | AURKA | C1257806 | Chromosomal Instability | 1 | CTD_human
Hgene | AURKA | C1257931 | Mammary Neoplasms, Human | 1 | CTD_human
Hgene | AURKA | C1458155 | Mammary Neoplasms | 1 | CTD_human
Hgene | AURKA | C2239176 | Liver carcinoma | 1 | CTD_human
Hgene | AURKA | C4704874 | Mammary Carcinoma, Human | 1 | CTD_human
Hgene | AURKA | C4721453 | Peripheral Nervous System Diseases | 1 | CTD_human