FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:AURKA-SOGA1 (FusionGDB2 ID:8490)

Fusion Gene Summary for AURKA-SOGA1

check button Fusion gene summary
Fusion gene informationFusion gene name: AURKA-SOGA1
Fusion gene ID: 8490
HgeneTgene
Gene symbol

AURKA

SOGA1

Gene ID

6790

140710

Gene nameaurora kinase Asuppressor of glucose, autophagy associated 1
SynonymsAIK|ARK1|AURA|BTAK|PPP1R47|STK15|STK6|STK7C20orf117|KIAA0889|SOGA
Cytomap

20q13.2

20q11.23

Type of geneprotein-codingprotein-coding
Descriptionaurora kinase Aaurora 2aurora/IPL1-like kinaseaurora/IPL1-related kinase 1breast tumor-amplified kinaseprotein phosphatase 1, regulatory subunit 47serine/threonine protein kinase 15serine/threonine-protein kinase 6serine/threonine-protein kinase aprotein SOGA1SOGA family member 1suppressor of glucose by autophagysuppressor of glucose from autophagysuppressor of glucose, autophagy-associated protein 1
Modification date2020032920200313
UniProtAcc

O14965

O94964

Ensembl transtripts involved in fusion geneENST00000312783, ENST00000347343, 
ENST00000371356, ENST00000395907, 
ENST00000395909, ENST00000395911, 
ENST00000395913, ENST00000395914, 
ENST00000395915, 
ENST00000357779, 
ENST00000456801, ENST00000237536, 
ENST00000279034, 
Fusion gene scores* DoF score6 X 8 X 3=14410 X 10 X 4=400
# samples 812
** MAII scorelog2(8/144*10)=-0.84799690655495
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(12/400*10)=-1.73696559416621
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: AURKA [Title/Abstract] AND SOGA1 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointAURKA(54956489)-SOGA1(35445872), # samples:2
Anticipated loss of major functional domain due to fusion event.AURKA-SOGA1 seems lost the major protein functional domain in Hgene partner, which is a epigenetic factor due to the frame-shifted ORF.
AURKA-SOGA1 seems lost the major protein functional domain in Hgene partner, which is a essential gene due to the frame-shifted ORF.
AURKA-SOGA1 seems lost the major protein functional domain in Hgene partner, which is a IUPHAR drug target due to the frame-shifted ORF.
AURKA-SOGA1 seems lost the major protein functional domain in Hgene partner, which is a kinase due to the frame-shifted ORF.
AURKA-SOGA1 seems lost the major protein functional domain in Tgene partner, which is a essential gene due to the frame-shifted ORF.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
HgeneAURKA

GO:0006468

protein phosphorylation

21600873|21820309

HgeneAURKA

GO:0009611

response to wounding

19435814

HgeneAURKA

GO:0032091

negative regulation of protein binding

21820309

HgeneAURKA

GO:0097421

liver regeneration

19435814


check buttonFusion gene breakpoints across AURKA (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across SOGA1 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4OVTCGA-61-1728-01AAURKAchr20

54956489

-SOGA1chr20

35445872

-
ChimerDB4OVTCGA-61-1728AURKAchr20

54956488

-SOGA1chr20

35445872

-


Top

Fusion Gene ORF analysis for AURKA-SOGA1

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
5CDS-intronENST00000312783ENST00000357779AURKAchr20

54956489

-SOGA1chr20

35445872

-
5CDS-intronENST00000312783ENST00000357779AURKAchr20

54956488

-SOGA1chr20

35445872

-
5CDS-intronENST00000312783ENST00000456801AURKAchr20

54956489

-SOGA1chr20

35445872

-
5CDS-intronENST00000312783ENST00000456801AURKAchr20

54956488

-SOGA1chr20

35445872

-
5CDS-intronENST00000347343ENST00000357779AURKAchr20

54956489

-SOGA1chr20

35445872

-
5CDS-intronENST00000347343ENST00000357779AURKAchr20

54956488

-SOGA1chr20

35445872

-
5CDS-intronENST00000347343ENST00000456801AURKAchr20

54956489

-SOGA1chr20

35445872

-
5CDS-intronENST00000347343ENST00000456801AURKAchr20

54956488

-SOGA1chr20

35445872

-
5CDS-intronENST00000371356ENST00000357779AURKAchr20

54956489

-SOGA1chr20

35445872

-
5CDS-intronENST00000371356ENST00000357779AURKAchr20

54956488

-SOGA1chr20

35445872

-
5CDS-intronENST00000371356ENST00000456801AURKAchr20

54956489

-SOGA1chr20

35445872

-
5CDS-intronENST00000371356ENST00000456801AURKAchr20

54956488

-SOGA1chr20

35445872

-
5CDS-intronENST00000395907ENST00000357779AURKAchr20

54956489

-SOGA1chr20

35445872

-
5CDS-intronENST00000395907ENST00000357779AURKAchr20

54956488

-SOGA1chr20

35445872

-
5CDS-intronENST00000395907ENST00000456801AURKAchr20

54956489

-SOGA1chr20

35445872

-
5CDS-intronENST00000395907ENST00000456801AURKAchr20

54956488

-SOGA1chr20

35445872

-
5CDS-intronENST00000395909ENST00000357779AURKAchr20

54956489

-SOGA1chr20

35445872

-
5CDS-intronENST00000395909ENST00000357779AURKAchr20

54956488

-SOGA1chr20

35445872

-
5CDS-intronENST00000395909ENST00000456801AURKAchr20

54956489

-SOGA1chr20

35445872

-
5CDS-intronENST00000395909ENST00000456801AURKAchr20

54956488

-SOGA1chr20

35445872

-
5CDS-intronENST00000395911ENST00000357779AURKAchr20

54956489

-SOGA1chr20

35445872

-
5CDS-intronENST00000395911ENST00000357779AURKAchr20

54956488

-SOGA1chr20

35445872

-
5CDS-intronENST00000395911ENST00000456801AURKAchr20

54956489

-SOGA1chr20

35445872

-
5CDS-intronENST00000395911ENST00000456801AURKAchr20

54956488

-SOGA1chr20

35445872

-
5CDS-intronENST00000395913ENST00000357779AURKAchr20

54956489

-SOGA1chr20

35445872

-
5CDS-intronENST00000395913ENST00000357779AURKAchr20

54956488

-SOGA1chr20

35445872

-
5CDS-intronENST00000395913ENST00000456801AURKAchr20

54956489

-SOGA1chr20

35445872

-
5CDS-intronENST00000395913ENST00000456801AURKAchr20

54956488

-SOGA1chr20

35445872

-
5CDS-intronENST00000395914ENST00000357779AURKAchr20

54956489

-SOGA1chr20

35445872

-
5CDS-intronENST00000395914ENST00000357779AURKAchr20

54956488

-SOGA1chr20

35445872

-
5CDS-intronENST00000395914ENST00000456801AURKAchr20

54956489

-SOGA1chr20

35445872

-
5CDS-intronENST00000395914ENST00000456801AURKAchr20

54956488

-SOGA1chr20

35445872

-
5CDS-intronENST00000395915ENST00000357779AURKAchr20

54956489

-SOGA1chr20

35445872

-
5CDS-intronENST00000395915ENST00000357779AURKAchr20

54956488

-SOGA1chr20

35445872

-
5CDS-intronENST00000395915ENST00000456801AURKAchr20

54956489

-SOGA1chr20

35445872

-
5CDS-intronENST00000395915ENST00000456801AURKAchr20

54956488

-SOGA1chr20

35445872

-
Frame-shiftENST00000347343ENST00000237536AURKAchr20

54956489

-SOGA1chr20

35445872

-
Frame-shiftENST00000347343ENST00000237536AURKAchr20

54956488

-SOGA1chr20

35445872

-
Frame-shiftENST00000347343ENST00000279034AURKAchr20

54956489

-SOGA1chr20

35445872

-
Frame-shiftENST00000347343ENST00000279034AURKAchr20

54956488

-SOGA1chr20

35445872

-
Frame-shiftENST00000395907ENST00000237536AURKAchr20

54956489

-SOGA1chr20

35445872

-
Frame-shiftENST00000395907ENST00000237536AURKAchr20

54956488

-SOGA1chr20

35445872

-
Frame-shiftENST00000395907ENST00000279034AURKAchr20

54956489

-SOGA1chr20

35445872

-
Frame-shiftENST00000395907ENST00000279034AURKAchr20

54956488

-SOGA1chr20

35445872

-
Frame-shiftENST00000395909ENST00000237536AURKAchr20

54956489

-SOGA1chr20

35445872

-
Frame-shiftENST00000395909ENST00000237536AURKAchr20

54956488

-SOGA1chr20

35445872

-
Frame-shiftENST00000395909ENST00000279034AURKAchr20

54956489

-SOGA1chr20

35445872

-
Frame-shiftENST00000395909ENST00000279034AURKAchr20

54956488

-SOGA1chr20

35445872

-
Frame-shiftENST00000395911ENST00000237536AURKAchr20

54956489

-SOGA1chr20

35445872

-
Frame-shiftENST00000395911ENST00000237536AURKAchr20

54956488

-SOGA1chr20

35445872

-
Frame-shiftENST00000395911ENST00000279034AURKAchr20

54956489

-SOGA1chr20

35445872

-
Frame-shiftENST00000395911ENST00000279034AURKAchr20

54956488

-SOGA1chr20

35445872

-
Frame-shiftENST00000395914ENST00000237536AURKAchr20

54956489

-SOGA1chr20

35445872

-
Frame-shiftENST00000395914ENST00000237536AURKAchr20

54956488

-SOGA1chr20

35445872

-
Frame-shiftENST00000395914ENST00000279034AURKAchr20

54956489

-SOGA1chr20

35445872

-
Frame-shiftENST00000395914ENST00000279034AURKAchr20

54956488

-SOGA1chr20

35445872

-
Frame-shiftENST00000395915ENST00000237536AURKAchr20

54956489

-SOGA1chr20

35445872

-
Frame-shiftENST00000395915ENST00000237536AURKAchr20

54956488

-SOGA1chr20

35445872

-
Frame-shiftENST00000395915ENST00000279034AURKAchr20

54956489

-SOGA1chr20

35445872

-
Frame-shiftENST00000395915ENST00000279034AURKAchr20

54956488

-SOGA1chr20

35445872

-
In-frameENST00000312783ENST00000237536AURKAchr20

54956489

-SOGA1chr20

35445872

-
In-frameENST00000312783ENST00000237536AURKAchr20

54956488

-SOGA1chr20

35445872

-
In-frameENST00000312783ENST00000279034AURKAchr20

54956489

-SOGA1chr20

35445872

-
In-frameENST00000312783ENST00000279034AURKAchr20

54956488

-SOGA1chr20

35445872

-
In-frameENST00000371356ENST00000237536AURKAchr20

54956489

-SOGA1chr20

35445872

-
In-frameENST00000371356ENST00000237536AURKAchr20

54956488

-SOGA1chr20

35445872

-
In-frameENST00000371356ENST00000279034AURKAchr20

54956489

-SOGA1chr20

35445872

-
In-frameENST00000371356ENST00000279034AURKAchr20

54956488

-SOGA1chr20

35445872

-
In-frameENST00000395913ENST00000237536AURKAchr20

54956489

-SOGA1chr20

35445872

-
In-frameENST00000395913ENST00000237536AURKAchr20

54956488

-SOGA1chr20

35445872

-
In-frameENST00000395913ENST00000279034AURKAchr20

54956489

-SOGA1chr20

35445872

-
In-frameENST00000395913ENST00000279034AURKAchr20

54956488

-SOGA1chr20

35445872

-

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000371356AURKAchr2054956489-ENST00000237536SOGA1chr2035445872-137958371847511577
ENST00000371356AURKAchr2054956489-ENST00000279034SOGA1chr2035445872-38568371835301170
ENST00000312783AURKAchr2054956489-ENST00000237536SOGA1chr2035445872-1390694824348621539
ENST00000312783AURKAchr2054956489-ENST00000279034SOGA1chr2035445872-396794824336411132
ENST00000395913AURKAchr2054956489-ENST00000237536SOGA1chr2035445872-1385289415948081549
ENST00000395913AURKAchr2054956489-ENST00000279034SOGA1chr2035445872-391389415935871142
ENST00000371356AURKAchr2054956488-ENST00000237536SOGA1chr2035445872-137958371847511577
ENST00000371356AURKAchr2054956488-ENST00000279034SOGA1chr2035445872-38568371835301170
ENST00000312783AURKAchr2054956488-ENST00000237536SOGA1chr2035445872-1390694824348621539
ENST00000312783AURKAchr2054956488-ENST00000279034SOGA1chr2035445872-396794824336411132
ENST00000395913AURKAchr2054956488-ENST00000237536SOGA1chr2035445872-1385289415948081549
ENST00000395913AURKAchr2054956488-ENST00000279034SOGA1chr2035445872-391389415935871142

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000371356ENST00000237536AURKAchr2054956489-SOGA1chr2035445872-0.0009886440.9990113
ENST00000371356ENST00000279034AURKAchr2054956489-SOGA1chr2035445872-0.0083886760.9916113
ENST00000312783ENST00000237536AURKAchr2054956489-SOGA1chr2035445872-0.001144840.9988551
ENST00000312783ENST00000279034AURKAchr2054956489-SOGA1chr2035445872-0.0098925890.9901074
ENST00000395913ENST00000237536AURKAchr2054956489-SOGA1chr2035445872-0.0010568320.99894315
ENST00000395913ENST00000279034AURKAchr2054956489-SOGA1chr2035445872-0.0090108210.99098915
ENST00000371356ENST00000237536AURKAchr2054956488-SOGA1chr2035445872-0.0009886440.9990113
ENST00000371356ENST00000279034AURKAchr2054956488-SOGA1chr2035445872-0.0083886760.9916113
ENST00000312783ENST00000237536AURKAchr2054956488-SOGA1chr2035445872-0.001144840.9988551
ENST00000312783ENST00000279034AURKAchr2054956488-SOGA1chr2035445872-0.0098925890.9901074
ENST00000395913ENST00000237536AURKAchr2054956488-SOGA1chr2035445872-0.0010568320.99894315
ENST00000395913ENST00000279034AURKAchr2054956488-SOGA1chr2035445872-0.0090108210.99098915

Top

Fusion Genomic Features for AURKA-SOGA1


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.

Top

Fusion Protein Features for AURKA-SOGA1


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr20:54956489/chr20:35445872)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
AURKA

O14965

SOGA1

O94964

FUNCTION: Mitotic serine/threonine kinase that contributes to the regulation of cell cycle progression (PubMed:26246606, PubMed:12390251, PubMed:18615013, PubMed:11039908, PubMed:17125279, PubMed:17360485). Associates with the centrosome and the spindle microtubules during mitosis and plays a critical role in various mitotic events including the establishment of mitotic spindle, centrosome duplication, centrosome separation as well as maturation, chromosomal alignment, spindle assembly checkpoint, and cytokinesis (PubMed:26246606, PubMed:14523000). Required for normal spindle positioning during mitosis and for the localization of NUMA1 and DCTN1 to the cell cortex during metaphase (PubMed:27335426). Required for initial activation of CDK1 at centrosomes (PubMed:13678582, PubMed:15128871). Phosphorylates numerous target proteins, including ARHGEF2, BORA, BRCA1, CDC25B, DLGP5, HDAC6, KIF2A, LATS2, NDEL1, PARD3, PPP1R2, PLK1, RASSF1, TACC3, p53/TP53 and TPX2 (PubMed:18056443, PubMed:15128871, PubMed:14702041, PubMed:11551964, PubMed:15147269, PubMed:15987997, PubMed:17604723, PubMed:18615013). Regulates KIF2A tubulin depolymerase activity (PubMed:19351716). Important for microtubule formation and/or stabilization (PubMed:18056443). Required for normal axon formation (PubMed:19812038). Plays a role in microtubule remodeling during neurite extension (PubMed:19668197). Also acts as a key regulatory component of the p53/TP53 pathway, and particularly the checkpoint-response pathways critical for oncogenic transformation of cells, by phosphorylating and destabilizing p53/TP53 (PubMed:14702041). Phosphorylates its own inhibitors, the protein phosphatase type 1 (PP1) isoforms, to inhibit their activity (PubMed:11551964). Necessary for proper cilia disassembly prior to mitosis (PubMed:17604723, PubMed:20643351). Regulates protein levels of the anti-apoptosis protein BIRC5 by suppressing the expression of the SCF(FBXL7) E3 ubiquitin-protein ligase substrate adapter FBXL7 through the phosphorylation of the transcription factor FOXP1 (PubMed:28218735). {ECO:0000269|PubMed:11039908, ECO:0000269|PubMed:11551964, ECO:0000269|PubMed:12390251, ECO:0000269|PubMed:13678582, ECO:0000269|PubMed:14523000, ECO:0000269|PubMed:14702041, ECO:0000269|PubMed:15128871, ECO:0000269|PubMed:15147269, ECO:0000269|PubMed:15987997, ECO:0000269|PubMed:17125279, ECO:0000269|PubMed:17360485, ECO:0000269|PubMed:17604723, ECO:0000269|PubMed:18056443, ECO:0000269|PubMed:18615013, ECO:0000269|PubMed:19351716, ECO:0000269|PubMed:19668197, ECO:0000269|PubMed:19812038, ECO:0000269|PubMed:20643351, ECO:0000269|PubMed:26246606, ECO:0000269|PubMed:27335426, ECO:0000269|PubMed:28218735}.FUNCTION: Regulates autophagy by playing a role in the reduction of glucose production in an adiponectin- and insulin-dependent manner. {ECO:0000250}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneAURKAchr20:54956488chr20:35445872ENST00000312783-710211_213235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956488chr20:35445872ENST00000347343-69211_213235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956488chr20:35445872ENST00000371356-710211_213235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956488chr20:35445872ENST00000395909-811211_213235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956488chr20:35445872ENST00000395911-710211_213235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956488chr20:35445872ENST00000395913-69211_213235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956488chr20:35445872ENST00000395914-710211_213235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956488chr20:35445872ENST00000395915-69211_213235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956489chr20:35445872ENST00000312783-710211_213235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956489chr20:35445872ENST00000347343-69211_213235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956489chr20:35445872ENST00000371356-710211_213235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956489chr20:35445872ENST00000395909-811211_213235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956489chr20:35445872ENST00000395911-710211_213235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956489chr20:35445872ENST00000395913-69211_213235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956489chr20:35445872ENST00000395914-710211_213235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956489chr20:35445872ENST00000395915-69211_213235404.0Nucleotide bindingATP

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneAURKAchr20:54956488chr20:35445872ENST00000312783-710133_383235404.0DomainProtein kinase
HgeneAURKAchr20:54956488chr20:35445872ENST00000347343-69133_383235404.0DomainProtein kinase
HgeneAURKAchr20:54956488chr20:35445872ENST00000371356-710133_383235404.0DomainProtein kinase
HgeneAURKAchr20:54956488chr20:35445872ENST00000395909-811133_383235404.0DomainProtein kinase
HgeneAURKAchr20:54956488chr20:35445872ENST00000395911-710133_383235404.0DomainProtein kinase
HgeneAURKAchr20:54956488chr20:35445872ENST00000395913-69133_383235404.0DomainProtein kinase
HgeneAURKAchr20:54956488chr20:35445872ENST00000395914-710133_383235404.0DomainProtein kinase
HgeneAURKAchr20:54956488chr20:35445872ENST00000395915-69133_383235404.0DomainProtein kinase
HgeneAURKAchr20:54956489chr20:35445872ENST00000312783-710133_383235404.0DomainProtein kinase
HgeneAURKAchr20:54956489chr20:35445872ENST00000347343-69133_383235404.0DomainProtein kinase
HgeneAURKAchr20:54956489chr20:35445872ENST00000371356-710133_383235404.0DomainProtein kinase
HgeneAURKAchr20:54956489chr20:35445872ENST00000395909-811133_383235404.0DomainProtein kinase
HgeneAURKAchr20:54956489chr20:35445872ENST00000395911-710133_383235404.0DomainProtein kinase
HgeneAURKAchr20:54956489chr20:35445872ENST00000395913-69133_383235404.0DomainProtein kinase
HgeneAURKAchr20:54956489chr20:35445872ENST00000395914-710133_383235404.0DomainProtein kinase
HgeneAURKAchr20:54956489chr20:35445872ENST00000395915-69133_383235404.0DomainProtein kinase
HgeneAURKAchr20:54956488chr20:35445872ENST00000312783-710260_261235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956488chr20:35445872ENST00000347343-69260_261235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956488chr20:35445872ENST00000371356-710260_261235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956488chr20:35445872ENST00000395909-811260_261235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956488chr20:35445872ENST00000395911-710260_261235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956488chr20:35445872ENST00000395913-69260_261235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956488chr20:35445872ENST00000395914-710260_261235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956488chr20:35445872ENST00000395915-69260_261235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956489chr20:35445872ENST00000312783-710260_261235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956489chr20:35445872ENST00000347343-69260_261235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956489chr20:35445872ENST00000371356-710260_261235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956489chr20:35445872ENST00000395909-811260_261235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956489chr20:35445872ENST00000395911-710260_261235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956489chr20:35445872ENST00000395913-69260_261235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956489chr20:35445872ENST00000395914-710260_261235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956489chr20:35445872ENST00000395915-69260_261235404.0Nucleotide bindingATP
HgeneAURKAchr20:54956488chr20:35445872ENST00000312783-710280_293235404.0RegionActivation segment
HgeneAURKAchr20:54956488chr20:35445872ENST00000347343-69280_293235404.0RegionActivation segment
HgeneAURKAchr20:54956488chr20:35445872ENST00000371356-710280_293235404.0RegionActivation segment
HgeneAURKAchr20:54956488chr20:35445872ENST00000395909-811280_293235404.0RegionActivation segment
HgeneAURKAchr20:54956488chr20:35445872ENST00000395911-710280_293235404.0RegionActivation segment
HgeneAURKAchr20:54956488chr20:35445872ENST00000395913-69280_293235404.0RegionActivation segment
HgeneAURKAchr20:54956488chr20:35445872ENST00000395914-710280_293235404.0RegionActivation segment
HgeneAURKAchr20:54956488chr20:35445872ENST00000395915-69280_293235404.0RegionActivation segment
HgeneAURKAchr20:54956489chr20:35445872ENST00000312783-710280_293235404.0RegionActivation segment
HgeneAURKAchr20:54956489chr20:35445872ENST00000347343-69280_293235404.0RegionActivation segment
HgeneAURKAchr20:54956489chr20:35445872ENST00000371356-710280_293235404.0RegionActivation segment
HgeneAURKAchr20:54956489chr20:35445872ENST00000395909-811280_293235404.0RegionActivation segment
HgeneAURKAchr20:54956489chr20:35445872ENST00000395911-710280_293235404.0RegionActivation segment
HgeneAURKAchr20:54956489chr20:35445872ENST00000395913-69280_293235404.0RegionActivation segment
HgeneAURKAchr20:54956489chr20:35445872ENST00000395914-710280_293235404.0RegionActivation segment
HgeneAURKAchr20:54956489chr20:35445872ENST00000395915-69280_293235404.0RegionActivation segment


Top

Fusion Gene Sequence for AURKA-SOGA1


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>8490_8490_1_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000312783_SOGA1_chr20_35445872_ENST00000237536_length(transcript)=13906nt_BP=948nt
ACAAGGCAGCCTCGCTCGAGCGCAGGCCAATCGGCTTTCTAGCTAGAGGGTTTAACTCCTATTTAAAAAGAAGAACCTTTGAATTCTAAC
GGCTGAGCTCTTGGAAGACTTGGGTCCTTGGGTCGCAGGGTCTCACTCCATTGCCCAGGCCAGAGTGCGGGGATATTTGATAAGAAACTT
CAGTGAAGGCCGGGCGCGGTGGCTCATGCCCGTAATCCCAGCATTTTCGGAGGCCGAGGCATCATGGACCGATCTAAAGAAAACTGCATT
TCAGGACCTGTTAAGGCTACAGCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTGACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTA
AATAGTGGCCAGGCTCAGCGGGTCTTGTGTCCTTCAAATTCTTCCCAGCGCATTCCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAG
CCGGTTCAGAATCAGAAGCAGAAGCAATTGCAGGCAACCAGTGTACCTCATCCTGTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAG
CAGCCCCTGCCATCGGCACCTGAAAATAATCCTGAGGAGGAACTGGCATCAAAACAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCT
TTGGAAGACTTTGAAATTGGTCGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTTTATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTG
GCTCTTAAAGTGTTATTTAAAGCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAGCTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGG
CATCCTAATATTCTTAGACTGTATGGTTATTTCCATGATGCTACCAGAGTCTACCTAATTCTGGAATATGCACCACTTGGAACAGTTTAT
AGAGAACTTCAGAAACTTTCAAAGTTTGATGAGCAGAGAACTGCTACTCATTCCTTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCC
GATAAGAAGACTTTGGTGCAGGAGGACAGTGCAGACCTGAAGTGCCAGTTGCACTTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAG
CTCACTAAGCTTGCCAAGGAGAATGACAGCATGAAGGAGGAGCTGCTGAAGTACCGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCA
GCCGAGGAGCTGGCCGATGCCCCCCACTCGCGGGAGACCGAGCTGAAGGTGCACCTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGC
CGCCGCATCGTGGAGCTGGAGGTGGAGAACCGAGGCCTGCGGGCTGAGATGGACGACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCT
GAGGCACGCCTGGCCTTCTCCGCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTGGCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAG
GAGGCCGAGCTGCTGCGGCGCTCCTCTGCCGAGCTCGAGGACCAGAACAAGCTGCTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCAC
GAGCTGGACGTGGCGCTGTCGGAGGACAGTTGTTCTGTGCTCAGCGAACCTTCACAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGC
GAGCTCAGCGGCAAGGTCAAGAAGCTGCAGTACGAGAACCGCGTGCTCCTCTCCAACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGT
ACGCGGCCCATGCTGGAGACGGACGCCGAGGCCGGGGACTCTGCCCAGTGTGTGCCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCG
GTCCGACTCTGCAGAGCCAGGGAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAGGCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTG
GCTGATGCCAATGGCTTCACGGCTGGCCTCCGGCTGTGTCTGGACAACGAGTGTGCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGC
GAGGGCCCCAGGGACACCAAGCTCATCCATGCCATCCTGGTGCGCCTGAGCGTGCTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCA
GATGCAGTCCTCGGGTGCTCTGTCAAGGAACAGCAGGAGTCCTTCTCATCACTGCCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATT
CTTCTGGCAAAAGACCTTGGCTCAGACTTTCAGCCACCTGACTTCAGGGACCTGCCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGC
ACTGGTGACTTGGACTCTAAGCCCGACCCCAGCCGGAGCTTCAGGCCTTACCGAGCTGAAGACAATGATTCCTATGCCTCTGAGATCAAG
GAGCTGCAGCTGGTGCTGGCTGAGGCCCACGACAGCCTCCGGGGCTTGCAAGAGCAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAG
GCCGACAATTTCAACCAGAAAATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCGCTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGC
CTCCAGCGGAGGCTGGAGCAGAAATTCTGGAGCCAGGAGAAGAACATGCTGGTGCAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTG
CTCTTCATGAAGCTCAGGTGGTTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTGCCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGC
ATGAAGGAGCTGTACTTGCTGATGGAGGAAGAGGAGATAAACGCTCAGCATTCTGATAACAAGGCCTGCACGGGGGACAGCTGGACCCAG
AACACGCCCAATGAGTACATCAAGACACTGGCCGACATGAAGGTGACGCTGAAGGAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGT
CTGACGGAGCTTCAGCAACAGTTTGCCAAGGCCAAGGCTACCTGGGAGACAGAGCGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAG
CTGAAGACAGGGAAGGGGGCCGGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCCCTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTA
GCTGAGTCCTACAGCGCTGTCATGGAGCTGACTCGGCAGCTGCAGATCAGTGAGCGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAG
CGGCTGCAGGGTGAGAAGCAGCAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAACCGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGG
GTCCTGAAGCACTCGGAGCTGGAGAAGCAGGACAACAGCTGGAAGGAGACACGCAGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAA
GTTGAGCTTGGAGGAAATGGTTTAAAGAGAACCAAATCTGTTTCTTCCATGTCTGAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTT
GCTGGCGGAGATGCCCGGGGCAAGAAGCTGCCTAACAACCCTGCCTTTGGCTTTGTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACC
AAGGAGAAGCCTGGGCTCTCGTCGAGGGACTGCAACCACCTGGGTGCCCTGGCCTGCCAGGACCCCCCAGGGAGGCAGATGCAGCGCAGC
TACACGGCTCCTGACAAGACGGGCATCCGAGTCTACTATAGTCCCCCGGTGGCCCGGCGCCTCGGAGTCCCTGTGGTTCATGACAAAGAG
GGCAAGATCATTATCGAGCCCGGCTTCCTCTTCACCACAGCCAAGCCCAAAGAGTCGGCCGAGGCTGATGGGCTGGCTGAGAGCTCCTAT
GGTCGGTGGCTCTGCAACTTCTCACGGCAGCGCCTGGACGGAGGCTCAGCGGGCAGCCCCTCGGCGGCCGGGCCTGGCTTCCCAGCGGCC
CTGCATGACTTTGAGATGTCAGGCAACATGAGTGATGACATGAAGGAGATCACCAACTGTGTGCGCCAGGCCATGCGCTCCGGCTCACTG
GAGAGGAAAGTGAAGAGCACATCCAGCCAGACGGTGGGCCTGGCCAGTGTGGGCACACAGACCATCCGCACGGTCAGCGTGGGCCTGCAG
ACCGACCCACCCCGCAGCAGCCTCCATGGCAAGGCCTGGTCACCCCGCAGCTCTTCGCTCGTGTCTGTGCGCAGCAAGCAGATCTCCTCC
TCCCTGGACAAGGTCCATTCGCGCATCGAGCGGCCCTGCTGCTCCCCCAAGTATGGCTCACCAAAGCTCCAGAGGCGGTCTGTGTCCAAG
CTGGACAGCAGCAAGGACCGCAGCCTGTGGAACCTGCACCAGGGCAAGCAGAACGGCTCGGCCTGGGCCCGCTCCACCACCACGCGGGAC
AGCCCTGTATTGAGAAACATCAACGATGGACTCTCCAGCCTCTTCAGTGTGGTGGAGCACTCAGGGAGCACGGAGTCTGTCTGGAAACTA
GGCATGTCTGAGACGCGGGCCAAGCCCGAGCCTCCCAAGTACGGCATTGTGCAGGAATTCTTCCGTAATGTGTGTGGCCGGGCACCGAGC
CCCACCTCATCAGCAGGAGAGGAGGGCACCAAGAAGCCAGAGCCCCTCTCCCCAGCCAGCTACCATCAGCCAGAGGGTGTGGCCAGGATC
CTGAACAAGAAGGCAGCCAAGTTGGGCAGCAGTGAGGAGGTCAGACTCACCATGCTCCCCCAGGTGGGGAAGGATGGTGTCCTCCGGGAC
GGAGATGGAGCCGTGGTCCTTCCCAATGAGGACGCTGTTTGTGACTGTAGTACCCAGTCTCTCACCTCCTGCTTCGCCCGATCGTCCCGC
TCTGCCATCCGCCACTCTCCTTCCAAGTGCAGGCTGCACCCTTCAGAGTCCAGCTGGGGTGGGGAGGAGAGGGCACTCCCCCCCAGCGAG
TGACAGAGCAGCCAAGCTCCCCGCCTCAACCAGCCCAGCCCCTGGATAGCAGAAGGGAACCAGCAGAGACGAGACGAGGTGAGGCGAGGG
GCTGTGTCCTCAGCATTGCCTGGCCCTGGAGGGACAGCAGTGATGCCACTGCCAGAATGCAGCTTTCACATCAAGGTAAAGCCGGGTCTC
CTGCTGGCCCCTGGGTGGTGAGCTTCGACTTCCCAGGGGAAGGCAGTGAGTGGGAGAGAGACCAAACCTGGGCTTCCCAAGCATCCACTG
AGAGATCTGTCAAGAGCCGATCCCTGGGTCCTAAGAGAGAGCCTTGCCTGGTTCTGCCCATGCCACCCTCTTGGAAGAGCCCAAGAAGGA
TACATGTCTGGCCATGCCTTTGGGGAAAAGGAGTCGGAGAGATGTTTCCTGCTGACCATCCACCCCTTCATTTGGGAGGAGACACTGCTG
AGAAGAACAGGCTTTGCTCTAGGGCTCCATGTTTGGTTCCTGGTGGAGCCCTGTTGGGCATCATCACCATCACCTCCTTCTCTCCACCAC
CTCCTCCTCCCAGCCCCACTGCTCTAGATTGCTGGGACACTAGGGAGTATGATAGGGCAGTAGCCAGGGCCATTGCTTAGTGTCCTGGAG
CCCTGGATCTCCCTGCCCATAGCCTGGATGCAGCAAGAGCTGGGAGGCGAAGTGGAAACATGCAGGGCTCAGGGTTGGGGAGTGATTGCA
ATTGCCTTCCTTGCCAAAGTGACTTGGGGCCCCAACGTTCCCAGCAGACCCCTTGAGGACAGAAATAGGTAGAGTCAGTCTCAAGACCTG
GTGCATAGATAAATGCCTAAATACACTGCCTTGATCTCAAGTGATCTCAGAGGCCTCTTTCCCTGGCACCCTGAGAGGCAGCAGGCACTA
CATCTCCACTGTGTTTACATCCTGCAGCTGGTCGGGGGGCAAAGATATTCCCAGTAAGAGATTCTTGGTTGGCCAGGTCAGGCCCAGGAG
AACACCAAGAGGCCAGAGCCCAGGACACAGCAGTAGACTGGGGCCTGGAAACACGTATCTTGCCTAGATTGTTTATTTGAATTTTTCCTA
CTATAAATATTTAAGGTGGTTTACTTTATTTTAATAATTTAATTTACCCCAAAGTCCCTAAGGTAATTTATTGGAGGTTGAAACATGCAT
TCTTGCCACTGGGACAACATGAGGCCTCTAACAGCACGGGCAGGCATGGGGTCCCCTGGGTGGACGAGGCCGCTTGGCAGCCAGGTTTGG
AGACCTGGCCTCCTGGTCAGCTTTGGAGGGCCCCTCAACAGAGCTGGAGCCCTGCACCCCAACACGGCTGGCCATGTGGCCTCAGAACAC
TACTTATTACTCAATGCCTGGTACTTGGCCAGCCCCAGCGGTCAGTCTATAAATACTCACTGACAAGGTGGAGGGCTGGACGGCCATCAC
CACTCCCCAGACGTTCTCCATTGCCTGTCTCATTTCCCTCCCTCCCTCTGACACCTTTCTTCATGAGTCGAACGTGGATTACTAAAGCTC
TATTAAGAGTGTGGAGATCCCTCCAAGTTTCCCAAATGAGAACTCACAGGAAAACAGGACTGAACTTTGAGAATGTTGTTTATCGCAGCT
TTGCACATAAACCTGAGTGTCTCCCAGCCTGCCTCGGTTCTCACCAGCCTGCCAGCCTTTTCACCAGCCTCTCTCCTTAGCCTTATGGCC
TTTCACGGCTCTTCTCCCTGCCCCAGCTCTGCTGCCCGCCCTTCCTCACGTCCCCTGTGAGCTGCCTGAGCCATTGGTTGGATTTCGATG
TGGCTCATTGCAGCATGTGGGGCAGCGCCTCCCATGGCCTCGCCTTGGTGCCGGTGAACCCCTTTTGGTTGCACACATGCTCCCCACACA
CACATAGACATCAGCCTTCCTGTAACTGACTGGGGACCCAGAGTGGAAACACCAGGATGGATCAGCTTGTCTGCAGAATTGCCCATCAGG
AAGACCAAAAGCCAGTAGCTTTGCTGATCTGCCCCCAGGACTCTGGAGGCGCCCCTGCACTCCCACCTCCCACCTGCCAGTTCCCAGACC
CACCCATTCGGGATCACCTGGACCAGTTACCCCAAGTCCTGCATCTCCCTTCCCTGCAGGCTGAACACCAGGGTCATGCCAGTCCCGCCA
GCCGCCTCCTCCATGCCCCAGTGACTGGTGTGGGCAGAGCAGGCAGCCAGTGGAGCTGTGGGCCAGTTCCGCTCTTGGATGCTGCTGCTC
TCACCCATGAGGTCAGGGGGGCCCTCCAAGGTTATCTCCAGGTGAGGGGATTCACATCAGGCCACAAGCCACCAGAGGCCTTCTGCCACC
TCCCAGAGCGACAGCCAGGGAGGCTGCGTACTCAGCCTCGGGGAGAAATCCCCGTGGGACCTGAGCCCCAAGACCTACGGACCACTCAGC
CTTACCATCGTACCGTCCAGGATTGTCCTTGCCATCTTTGTTGTCTCAGCCAGACCTTGGTTTTCAGTAAAGCCCCAGTTTCTACTTCCT
GCATGCCACTGTGCAAGGCCACTCATCACTGTTCCTGCAGAAGCCTCTGGACGTGGGGCTGGATGGGGTTGAAAATGTTACATGTAAATA
TTGGTTTGGTTCGGTTTTTAGCATTTTACTTGGTAACTGGTTGTTTTCTTTTTTGGGGTGGGGGGATTGGTTTGTAAAAATTCTCTACTC
TTTTGGAATGTGATTTCTAAGTTTGTTGGTTTCTTCAAATGCCTTTTAAGTCTTGGTAACATTCCCAAAGCAGAAAACTGCCTGACCCAC
AGTGGGGATTCCCTGGAGAATTGGGGTCCCAAGAAGGAATGCTGCCCTTCTCGAACCCGTTCTCCCCCTTCCTCCTGCCTCTCTGCCTTT
TACTGCTATTCCCTTCTTCTCCTCCTTTATCCTTCTTTCTGTTTTCCCCATCTCCACTCTCTCTTCAACCAAAGTCCCAAGGAACCCTCG
GGGCTCAATCCCCCATAGACCACTTGGCTTGGGTCCATGGGGTTGGCATCAGTTGGTTGGCGGAAATGGGGGACCAGTTGGCATGATGGC
CCTAAACTGGGAAACCTCATGTTTCTTATGTCTCACCTCTTTCCAGAGCCAAATCAGCCCCTTTTGGAATGATGACTTCATTGGAATGCA
AATCAAGTCATTTTGGTGCATCAGTGGCTCTTAGGCCTGCACACACGAGACATCAGAATCCAATCCTCTGACCCTGTGCCAGCCCTTTCC
CCCAGTTTATTTCCCACCAAAGGCTGACCTCTAAGAGGTCTTGCTTTCTATGAACTCAAGATGGGTCCCACCTCTAGGTGTCCCCAGGTG
CACTCTTCTACCGGTTGGCTTCCGATGTGACAAGGCCAAGGGCCCAAAGACTTGACCCTCTTACACCCTTGCTGACATGGTTCCATCATG
TCCACCCGCATGCACTTTTATGGTTTCATCACCCAGCCTCTTCTCCTCTGGCCCACCCAGCGTCCAGGCTCTTTCTCCCTCTCCCCTCCT
ATCTAGAATGTCCCCTGCTTCTAGCCTCACCAGACCCCCCAAGCTCCCACTACTTCTTCCATAATAATAGTAATAACAATGGTTATCATC
ATCCCCTGCACATCCCGCCTAAAGCACTTTACTATATAGAAAACGTTTCCCCTGGCCGGGCATGGTGGCTCACGCCTGAAATCCCAGCTC
TTTGGGAGGCTGAGGCGAGCGGATCAGTTGAGGTCAGGAGTTCAACGCCAGCCTGGCCAACGTGGTGAATCCCTGTCTGTACTAAAAGTA
CAAAAAATTAGCTGAGCATGGTGGTGCGTGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGTGGGAGAATCGCTTGAGCCCAGGAGGCGG
AGGTTGCAGGAGCAGAGATTGCGCCACTGCACTCCAGCCTGGATGACAGAGTGAGACCCAATCTCAAAAAAGAAATCGTTTCCCACCCCA
CATCTCCTTCAGACCTCTCAGGGACAACTCTGGGAGGCAGCCTTGGCAGGACATGGGTTAGTGCGCCCATTTTGCTGTGAGGAAACTGAG
GTACAGGTCTCATCCCAGAGCATGAGAAGTCACTGAGTTTAGATGAGAACTTGGGTCCAACTCTGTCCTGTTTGCTGTGCAAATCCGCTG
CCCTGCTGGGGGCTTTTGGTGGGTCCAGAATACCCAGAATATGCTGCTGGCCAACCCAGGCATAAAACAAGTCCATTCTAGATCACTGAG
CCTTGTGTATTCCAGAGGGTGATCTGAGGTCCCCATTCAGCAGAATTCTCTGAGGGCATGTTCAGAATGTAGATTCCTGGGCCCCACCTT
GAATTTGCATGTTTAACAAACTCTCCTGGGGTTGAGGGGTGGGTGCAGTGGTCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGG
GTGGATCACTTGAGCTCAGGAGTTCGAGCCTGGCCAATATGGTGAAACCCTGTCTCTACTAAAAATGCAAAAATTAGCCAGGTGTGGTGG
CACATGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGTGGGGAGCAGCGGTTGCAGTGAGCCGAGATT
GTGCCATTGCGCTCTAGCCTGGGCAACGGAATGAGACTTGTCTCAAAAAAAATAAAAAATAAAACCAGCCCTCCCCGGGGGATCTTAGGC
ACTATTGGCCACACCATTGGTGTTCGTGGCCCTGATTATTAGGCTATCTTTCTTTTTTTAAGTTTTTTTAGATTTATTTTTTATTTTATT
TATTTATTTATTTATTATTTATTATTATTATTTTTAGAGACAGGGGGTCTCCCTATGTTGCCCAGGCTGGTTTCAAACTCCTGGGCTCAA
GTGATCTGCCCTCCTCAGCCTCCCAAAGTGCTGGGGTTACAGGCAGGCATCAGCCACCGTGCCAGGTCATCTTCCTTTTTCTTTTTTTTT
TGGAGACAGAGTCTTGCTCTGTCATTCAGGCTGGAGTACAGTGGCGCCATTTCAGCTCACCGCAGCCTCCACCTCCCAGGTTCAAGCAGT
TCTCCTGCCTCAACCTCCCGAGTAACTGGGATTACAGGTGTGTGCCACCACACCCGACTAATTTTTTTATTTTTAGTAGAGACGGGGTTT
CACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCACCCACCTCGGCCTCCCAAAGTACTGGGATTATAGGCGTGAGC
CACGGCATCCAGCCTCATCTTTCTTTTAACCAATAAACATGATGCTGTATCTTAAAAAGAGCACTGAGCAGGGACTTAAGGGATCGAGTC
CTCAACCAAACTGATTTAATTACTCAGGATTTTCAAAAAGCATCAGAGGCTATTTACAATCTTAATCATAGGGGTTCAGTAAAATAAAAA
TAAGAAGTAAAAAAGCAAGAGAAATTATTCTGTAAATCTAACTGGTGTAATTCCCATAATCATGCAATTAAGTTTTACTCTTGAGTTTCC
TGACAGCCATTGGTAAAAAGAGAAACACATCAGGATTTATAATTTTTATCATCCAATTATGGGAAGCAAGCATGTTGGCCCCAGGAGACG
AACTCTTCTACTAATTTATAGCATGTATCCTTTCATCAAGGGCCACTGAGCCAGTTGGTGAGTCAACGGGTGAACCTAAGATGCAAGGAT
GTTTTCCAGGTGACTATTTAAAAAAATAAATAATCCACCAACGTGATTGACCTTGGCGAGATCATGTTTCTAGTCTATACCTCAGTTTCC
CCATCTGTAAAGTGAGGATAATGTCCCACCCCATGTAACTGTGGTGAGGACCAACTGCAACACTGTGCCTGCGAGTCTCCTTGGAAAAGT
GTAAGGTTCTACACAAATGGAAAGTGATCTGATCACACTCAGTGTCCCCAGCCCAGCCTTTCAGTGCCCTGGCCCTGGGGTGGGGGACAA
TACTCTCCTCACCCCCTTCACTAGTCTTCATGAATAGCAAGGAGGCCATAACATAATTTGGTCTAAACCCCTTCCTTTTTAAAAGAATGA
TGGCAAAATGTGCATAACATTAAATTTACCATTTTAAAGATTTCTAAGTGTCTCGAAGTACATTTGCAATGTGTAACTGCCACCTCCAGA
ACTTTTTCATCATCCTAAACTGAAACTCTGTACCCACTAAACAGTAACTCCCCACTCCCCCTGTCCCCAGTCCCTGGTAACCTCTATTCT
ACTTTTTTTTTTTCTCTGTGGATTGGCCTATTCTAGGTATCTCTTTTAAGTAGAATCATATAGTATTTGTCCTTTTGTGTCCAGCCCCCT
CATTTTTTGAGATGAGGAATCAGGCCCAGAGAAGGCAGTGGCTCACCCAAGGTCACATCGCAAACCAGAGGCAGAGCCAAGACCAGAACC
CAGGTTTCCTGACTCCTAACAGGTCTTGCATCCCCAGTAGTGAATGAGACACTGCTTTGACTTTCTGTAATCTTGGTTTAGCCCCTTCCT
TTCTCTGGGCTCAGTCTGCTTCCCACCACTCACCCATTCTTTCTTTATCATCTCCCTCCAAAGCCTCTTGTCCTCCTGCCTCCTCTTCTC
CTTGGCTGGTTCCTGCCAGCAAATGGAGTGGCCCTGGTCCATGCCAGGTTTTCCTCTTCTGGGTCCGGAGCTCACTATAGTATTCAGCCC
TCAGTCCTCCCAGGATGTTCTTCCCAGCACTGCTGCCTCGTGCGGATTTTCCCGTAACCTCAGTAACTGGCTTCTTGTCCCCCTGCTTCC
TACCAGGGAAGCCTTCCTGTCCGCGTCTGTGGTTTCCCAACTCTGATATTTGCTCTCAAATGTGGTGGTGTCCTGGTTCTGTGTTTATTT
ATTTTGTGTTTTCTCACACACACAAAAAAAAGCCTCTTCCTCCAATGCATCAGGAGGCACCAGCCCTGCCAGCCCTTCTCACTGGGCTCA
CCCTGCCCCAGCAACCCCCCGGTGCCACCCTTAAGCCACACTGTCTTCTCTGTAAGCAGCCTGCCAGCAGCAGCCCCAGCACTTTGCAAT
GGGCGTGTGTGTGGTGGTGGGTGGGGGGGGCTTGGATCCCTCCTTTTTCCTCCCCTGCCCTGCCCAGGCCCAGATGGCCTTGACTGTAAA
GCAGGTGCTGCCTGACAGGTTCTTCTCTCCCTGTCTCTGGTCATTGATCCATCTCTTTGTCCATTCAGTATCCAACCATCCTCTCCATTC
TCCTCTGGACCTCACCACTCTCAGAGCTGCTTGTCCTGGCAGAATCTACAGTTCACCCCAACTCTATGCCTTACCCCTCCCAACCCAACA
GCATTTGCAGTTTGCAAAATATACAGACCCAAGTCCTGAGGGGACTGAGGACATGATGCTGGGCCCAAGTCTCCTGCTCAGGGCTTCTCT
CCAATGCCAGCCCTGCCACTCCTTCCTCACCCTCCTTGGAGCCTCCTCTGCTGCTTGTCTATCCCAACGGCCCTGCTCCCCTCCCTTCCT
GCCCTTCACCAGCTTTCTGGACACCATGCCCTGAGGAAGGGACCTTTGGTTTTCTCTAAACATCTTTGAAGGGCTGAGGCAGTCAGGGCT
GGCTGCCTTGTCACTCTTTATTTGGAAGCCACTCAAACCATTCCCAAGAAGAGGGACCTCAGCTGGCAATCTGGAAACCTGGCCCAGGTC
TGGGCAGATGTCTTCACTTCTCCTACCTTCCCAGTCTTGTGATCCTGTGATGAGCACCAGGATGGCCCTGTGGTCCCTAGAGCACCCCTC
ATGCTGTAGGGTCCTGCAGCCCCATCCTTTCTCTACTGGGCCCTGGTATCCTGGCTCCTCTCTCAGCTCTGCCACTGATCTCTGTGCCTT
AGTTTACTTCTCTGCACGGGGGACTCACCCCAAGACCATTTCCAGCAGCTTCCCAGGTGATGTGGTGCCCCAAGGCTGGGCTTTGCCAGC
TGTGGCCCAGCTCCTTAGTGCTGCCCAGGAGACACCAGGCTGCTCAGAATGAGGTGACTGCGGGCACCATTCTCAGCCAGTGGTTCTTGT
ATTGCATTCCAGCAGCAGGAATATCACCTGGGAACTTGATAGAAGTGCAGATTAGCAGCCCCACCCAAGACCCACTGAATTAGAGCTTGT
GGAGTGGGGCCCTACAAGCTGGGGTTTTAAGGAGCCCTCCAAATGATTCTGACGCATAAGAATATGCCAACTGCTGATCTGGGCTAGCCA
TTAGTAGAGCCTGGGGAGGGACTGGGACTGGCTAGGCCAAGAACAGGTGGAAAACACCAGCCTTATCTGGACTCCTGAGATTGGGAACCA
CCACCAACAAAAACCAACCCTATAGTCGCTCCTCTTGGAAGAGGAAGAGAAGTTGAAGGGCCTGGAGAAAGCACACATTGTTTGTTTCCC
TGCTCCTGCTCACCTCTCTCACTTGTCTTGGTTTTACAAAAGGCTGTGTGGATGGTGCCAGCCAGGGAGGGGGTGGGAGTCCTGGGGAGG
CAGGAGGCAGAAGACCCTGACTGTTTCTCCCTTGGGAACCTCACCATAGGCCAGATAGCGCCTCTTCAAACTGAAAGAAATCTTAACTCC
ACAAAGAAAGCATCCTAAATCCCCAGTTCCTCCTCCTCCCAACCCCAGGGATACCTTGTAGACAGTGCCAAAAAACAGCTCCAACCCCCA
GCAGCTGGGAAGAGAGCCAGAAGCTGCCCTTCCTCCTCATCCTGGCCCCTCCCCAGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCA
GCCTGGTTTCCTGGTGAGGGTCCTGCAGTCATGGGCCCTGGGGGACCCCCAGGGCAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCT
TGGGCCAACTGTCCACCTCTCTTGTCCACTATTCTCTCCTTTCCACTTCTGTCTTCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAG
GACAACTGCAGTCCAATCCACCAGCTCTCCCTGCCCCTGTGTCTTATTTCAGACATGAGAATAACTGTACAGTGTAAACTTATAAAGCGT

>8490_8490_1_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000312783_SOGA1_chr20_35445872_ENST00000237536_length(amino acids)=1539AA_BP=235
MDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSNSSQRIPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSR
PLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRR
EVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATHSLKKRGTRSLGKADKKTLVQEDSADLKCQLHFAK
EESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMK
DHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELAKFRSEHELDVALSEDSCSVLSEPSQEE
LAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAAL
VSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELNAFTRKADAVLGCSVKEQQESFSSLPPL
GSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLS
QERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQFKHNFLLLFMKLRWFLKRWRQGKVLPSE
GDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAE
LKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQEKLQLVERLQGEKQQVEQQVKELQNRLS
QLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSS
EPGDPEKDTKEKPGLSSRDCNHLGALACQDPPGRQMQRSYTAPDKTGIRVYYSPPVARRLGVPVVHDKEGKIIIEPGFLFTTAKPKESAE
ADGLAESSYGRWLCNFSRQRLDGGSAGSPSAAGPGFPAALHDFEMSGNMSDDMKEITNCVRQAMRSGSLERKVKSTSSQTVGLASVGTQT
IRTVSVGLQTDPPRSSLHGKAWSPRSSSLVSVRSKQISSSLDKVHSRIERPCCSPKYGSPKLQRRSVSKLDSSKDRSLWNLHQGKQNGSA
WARSTTTRDSPVLRNINDGLSSLFSVVEHSGSTESVWKLGMSETRAKPEPPKYGIVQEFFRNVCGRAPSPTSSAGEEGTKKPEPLSPASY
HQPEGVARILNKKAAKLGSSEEVRLTMLPQVGKDGVLRDGDGAVVLPNEDAVCDCSTQSLTSCFARSSRSAIRHSPSKCRLHPSESSWGG

--------------------------------------------------------------
>8490_8490_2_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000312783_SOGA1_chr20_35445872_ENST00000279034_length(transcript)=3967nt_BP=948nt
ACAAGGCAGCCTCGCTCGAGCGCAGGCCAATCGGCTTTCTAGCTAGAGGGTTTAACTCCTATTTAAAAAGAAGAACCTTTGAATTCTAAC
GGCTGAGCTCTTGGAAGACTTGGGTCCTTGGGTCGCAGGGTCTCACTCCATTGCCCAGGCCAGAGTGCGGGGATATTTGATAAGAAACTT
CAGTGAAGGCCGGGCGCGGTGGCTCATGCCCGTAATCCCAGCATTTTCGGAGGCCGAGGCATCATGGACCGATCTAAAGAAAACTGCATT
TCAGGACCTGTTAAGGCTACAGCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTGACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTA
AATAGTGGCCAGGCTCAGCGGGTCTTGTGTCCTTCAAATTCTTCCCAGCGCATTCCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAG
CCGGTTCAGAATCAGAAGCAGAAGCAATTGCAGGCAACCAGTGTACCTCATCCTGTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAG
CAGCCCCTGCCATCGGCACCTGAAAATAATCCTGAGGAGGAACTGGCATCAAAACAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCT
TTGGAAGACTTTGAAATTGGTCGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTTTATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTG
GCTCTTAAAGTGTTATTTAAAGCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAGCTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGG
CATCCTAATATTCTTAGACTGTATGGTTATTTCCATGATGCTACCAGAGTCTACCTAATTCTGGAATATGCACCACTTGGAACAGTTTAT
AGAGAACTTCAGAAACTTTCAAAGTTTGATGAGCAGAGAACTGCTACTCATTCCTTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCC
GATAAGAAGACTTTGGTGCAGGAGGACAGTGCAGACCTGAAGTGCCAGTTGCACTTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAG
CTCACTAAGCTTGCCAAGGAGAATGACAGCATGAAGGAGGAGCTGCTGAAGTACCGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCA
GCCGAGGAGCTGGCCGATGCCCCCCACTCGCGGGAGACCGAGCTGAAGGTGCACCTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGC
CGCCGCATCGTGGAGCTGGAGGTGGAGAACCGAGGCCTGCGGGCTGAGATGGACGACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCT
GAGGCACGCCTGGCCTTCTCCGCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTGGCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAG
GAGGCCGAGCTGCTGCGGCGCTCCTCTGCCGAGCTCGAGGACCAGAACAAGCTGCTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCAC
GAGCTGGACGTGGCGCTGTCGGAGGACAGTTGTTCTGTGCTCAGCGAACCTTCACAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGC
GAGCTCAGCGGCAAGGTCAAGAAGCTGCAGTACGAGAACCGCGTGCTCCTCTCCAACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGT
ACGCGGCCCATGCTGGAGACGGACGCCGAGGCCGGGGACTCTGCCCAGTGTGTGCCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCG
GTCCGACTCTGCAGAGCCAGGGAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAGGCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTG
GCTGATGCCAATGGCTTCACGGCTGGCCTCCGGCTGTGTCTGGACAACGAGTGTGCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGC
GAGGGCCCCAGGGACACCAAGCTCATCCATGCCATCCTGGTGCGCCTGAGCGTGCTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCA
GATGCAGTCCTCGGGTGCTCTGTCAAGGAACAGCAGGAGTCCTTCTCATCACTGCCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATT
CTTCTGGCAAAAGACCTTGGCTCAGACTTTCAGCCACCTGACTTCAGGGACCTGCCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGC
ACTGGTGACTTGGACTCTAAGCCCGACCCCAGCCGGAGCTTCAGGCCTTACCGAGCTGAAGACAATGATTCCTATGCCTCTGAGATCAAG
GAGCTGCAGCTGGTGCTGGCTGAGGCCCACGACAGCCTCCGGGGCTTGCAAGAGCAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAG
GCCGACAATTTCAACCAGAAAATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCGCTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGC
CTCCAGCGGAGGCTGGAGCAGAAATTCTGGAGCCAGGAGAAGAACATGCTGGTGCAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTG
CTCTTCATGAAGCTCAGGTGGTTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTGCCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGC
ATGAAGGAGCTGTACTTGCTGATGGAGGAAGAGGAGATAAACGCTCAGCATTCTGATAACAAGGCCTGCACGGGGGACAGCTGGACCCAG
AACACGCCCAATGAGTACATCAAGACACTGGCCGACATGAAGGTGACGCTGAAGGAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGT
CTGACGGAGCTTCAGCAACAGTTTGCCAAGGCCAAGGCTACCTGGGAGACAGAGCGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAG
CTGAAGACAGGGAAGGGGGCCGGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCCCTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTA
GCTGAGTCCTACAGCGCTGTCATGGAGCTGACTCGGCAGCTGCAGATCAGTGAGCGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAG
CGGCTGCAGGGTGAGAAGCAGCAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAACCGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGG
GTCCTGAAGCACTCGGAGCTGGAGAAGCAGGACAACAGCTGGAAGGAGACACGCAGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAA
GTTGAGCTTGGAGGAAATGGTTTAAAGAGAACCAAATCTGTTTCTTCCATGTCTGAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTT
GCTGGCGGAGATGCCCGGGGCAAGAAGCTGCCTAACAACCCTGCCTTTGGCTTTGTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACC
AAGGAGAAGCCTGGGCTCTCGTCGAGGGACTGCAACCACCTGGGTGCCCTGGCCTGCCAGGACCCCCCAGGGAGCCAGAAGCTGCCCTTC
CTCCTCATCCTGGCCCCTCCCCAGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCAGCCTGGTTTCCTGGTGAGGGTCCTGCAGTCAT
GGGCCCTGGGGGACCCCCAGGGCAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCTTGGGCCAACTGTCCACCTCTCTTGTCCACTAT
TCTCTCCTTTCCACTTCTGTCTTCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAGGACAACTGCAGTCCAATCCACCAGCTCTCCCT
GCCCCTGTGTCTTATTTCAGACATGAGAATAACTGTACAGTGTAAACTTATAAAGCGTTTTTAATGGTTGTAGATTGGAAATAAAGTATG

>8490_8490_2_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000312783_SOGA1_chr20_35445872_ENST00000279034_length(amino acids)=1132AA_BP=235
MDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSNSSQRIPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSR
PLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRR
EVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATHSLKKRGTRSLGKADKKTLVQEDSADLKCQLHFAK
EESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMK
DHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELAKFRSEHELDVALSEDSCSVLSEPSQEE
LAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAAL
VSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELNAFTRKADAVLGCSVKEQQESFSSLPPL
GSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLS
QERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQFKHNFLLLFMKLRWFLKRWRQGKVLPSE
GDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAE
LKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQEKLQLVERLQGEKQQVEQQVKELQNRLS
QLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSS

--------------------------------------------------------------
>8490_8490_3_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000371356_SOGA1_chr20_35445872_ENST00000237536_length(transcript)=13795nt_BP=837nt
CTTGGAAGACTTGGGTCCTTGGGTCGCAGGCTGGAGTGCAATGGTGTGATCTCAGCTCACTGCAACCTCTGCTTCCTGGGTTTAAGTGAT
TCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCATCATGGACCGATCTAAAGAAAACTGCATTTCAGGACCTGTTAAGGCTACA
GCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTGACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTAAATAGTGGCCAGGCTCAGCGG
GTCTTGTGTCCTTCAAATTCTTCCCAGCGCATTCCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAGCCGGTTCAGAATCAGAAGCAG
AAGCAATTGCAGGCAACCAGTGTACCTCATCCTGTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCT
GAAAATAATCCTGAGGAGGAACTGGCATCAAAACAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCTTTGGAAGACTTTGAAATTGGT
CGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTTTATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTGGCTCTTAAAGTGTTATTTAAA
GCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAGCTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGGCATCCTAATATTCTTAGACTG
TATGGTTATTTCCATGATGCTACCAGAGTCTACCTAATTCTGGAATATGCACCACTTGGAACAGTTTATAGAGAACTTCAGAAACTTTCA
AAGTTTGATGAGCAGAGAACTGCTACTCATTCCTTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCCGATAAGAAGACTTTGGTGCAG
GAGGACAGTGCAGACCTGAAGTGCCAGTTGCACTTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAGCTCACTAAGCTTGCCAAGGAG
AATGACAGCATGAAGGAGGAGCTGCTGAAGTACCGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCAGCCGAGGAGCTGGCCGATGCC
CCCCACTCGCGGGAGACCGAGCTGAAGGTGCACCTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGCCGCCGCATCGTGGAGCTGGAG
GTGGAGAACCGAGGCCTGCGGGCTGAGATGGACGACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCTGAGGCACGCCTGGCCTTCTCC
GCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTGGCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAGGAGGCCGAGCTGCTGCGGCGC
TCCTCTGCCGAGCTCGAGGACCAGAACAAGCTGCTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCACGAGCTGGACGTGGCGCTGTCG
GAGGACAGTTGTTCTGTGCTCAGCGAACCTTCACAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGCGAGCTCAGCGGCAAGGTCAAG
AAGCTGCAGTACGAGAACCGCGTGCTCCTCTCCAACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGTACGCGGCCCATGCTGGAGACG
GACGCCGAGGCCGGGGACTCTGCCCAGTGTGTGCCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCGGTCCGACTCTGCAGAGCCAGG
GAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAGGCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTGGCTGATGCCAATGGCTTCACG
GCTGGCCTCCGGCTGTGTCTGGACAACGAGTGTGCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGCGAGGGCCCCAGGGACACCAAG
CTCATCCATGCCATCCTGGTGCGCCTGAGCGTGCTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCAGATGCAGTCCTCGGGTGCTCT
GTCAAGGAACAGCAGGAGTCCTTCTCATCACTGCCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATTCTTCTGGCAAAAGACCTTGGC
TCAGACTTTCAGCCACCTGACTTCAGGGACCTGCCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGCACTGGTGACTTGGACTCTAAG
CCCGACCCCAGCCGGAGCTTCAGGCCTTACCGAGCTGAAGACAATGATTCCTATGCCTCTGAGATCAAGGAGCTGCAGCTGGTGCTGGCT
GAGGCCCACGACAGCCTCCGGGGCTTGCAAGAGCAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAGGCCGACAATTTCAACCAGAAA
ATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCGCTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGCCTCCAGCGGAGGCTGGAGCAG
AAATTCTGGAGCCAGGAGAAGAACATGCTGGTGCAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTGCTCTTCATGAAGCTCAGGTGG
TTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTGCCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGCATGAAGGAGCTGTACTTGCTG
ATGGAGGAAGAGGAGATAAACGCTCAGCATTCTGATAACAAGGCCTGCACGGGGGACAGCTGGACCCAGAACACGCCCAATGAGTACATC
AAGACACTGGCCGACATGAAGGTGACGCTGAAGGAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGTCTGACGGAGCTTCAGCAACAG
TTTGCCAAGGCCAAGGCTACCTGGGAGACAGAGCGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAGCTGAAGACAGGGAAGGGGGCC
GGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCCCTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTAGCTGAGTCCTACAGCGCTGTC
ATGGAGCTGACTCGGCAGCTGCAGATCAGTGAGCGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAGCGGCTGCAGGGTGAGAAGCAG
CAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAACCGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGGGTCCTGAAGCACTCGGAGCTG
GAGAAGCAGGACAACAGCTGGAAGGAGACACGCAGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAAGTTGAGCTTGGAGGAAATGGT
TTAAAGAGAACCAAATCTGTTTCTTCCATGTCTGAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTTGCTGGCGGAGATGCCCGGGGC
AAGAAGCTGCCTAACAACCCTGCCTTTGGCTTTGTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACCAAGGAGAAGCCTGGGCTCTCG
TCGAGGGACTGCAACCACCTGGGTGCCCTGGCCTGCCAGGACCCCCCAGGGAGGCAGATGCAGCGCAGCTACACGGCTCCTGACAAGACG
GGCATCCGAGTCTACTATAGTCCCCCGGTGGCCCGGCGCCTCGGAGTCCCTGTGGTTCATGACAAAGAGGGCAAGATCATTATCGAGCCC
GGCTTCCTCTTCACCACAGCCAAGCCCAAAGAGTCGGCCGAGGCTGATGGGCTGGCTGAGAGCTCCTATGGTCGGTGGCTCTGCAACTTC
TCACGGCAGCGCCTGGACGGAGGCTCAGCGGGCAGCCCCTCGGCGGCCGGGCCTGGCTTCCCAGCGGCCCTGCATGACTTTGAGATGTCA
GGCAACATGAGTGATGACATGAAGGAGATCACCAACTGTGTGCGCCAGGCCATGCGCTCCGGCTCACTGGAGAGGAAAGTGAAGAGCACA
TCCAGCCAGACGGTGGGCCTGGCCAGTGTGGGCACACAGACCATCCGCACGGTCAGCGTGGGCCTGCAGACCGACCCACCCCGCAGCAGC
CTCCATGGCAAGGCCTGGTCACCCCGCAGCTCTTCGCTCGTGTCTGTGCGCAGCAAGCAGATCTCCTCCTCCCTGGACAAGGTCCATTCG
CGCATCGAGCGGCCCTGCTGCTCCCCCAAGTATGGCTCACCAAAGCTCCAGAGGCGGTCTGTGTCCAAGCTGGACAGCAGCAAGGACCGC
AGCCTGTGGAACCTGCACCAGGGCAAGCAGAACGGCTCGGCCTGGGCCCGCTCCACCACCACGCGGGACAGCCCTGTATTGAGAAACATC
AACGATGGACTCTCCAGCCTCTTCAGTGTGGTGGAGCACTCAGGGAGCACGGAGTCTGTCTGGAAACTAGGCATGTCTGAGACGCGGGCC
AAGCCCGAGCCTCCCAAGTACGGCATTGTGCAGGAATTCTTCCGTAATGTGTGTGGCCGGGCACCGAGCCCCACCTCATCAGCAGGAGAG
GAGGGCACCAAGAAGCCAGAGCCCCTCTCCCCAGCCAGCTACCATCAGCCAGAGGGTGTGGCCAGGATCCTGAACAAGAAGGCAGCCAAG
TTGGGCAGCAGTGAGGAGGTCAGACTCACCATGCTCCCCCAGGTGGGGAAGGATGGTGTCCTCCGGGACGGAGATGGAGCCGTGGTCCTT
CCCAATGAGGACGCTGTTTGTGACTGTAGTACCCAGTCTCTCACCTCCTGCTTCGCCCGATCGTCCCGCTCTGCCATCCGCCACTCTCCT
TCCAAGTGCAGGCTGCACCCTTCAGAGTCCAGCTGGGGTGGGGAGGAGAGGGCACTCCCCCCCAGCGAGTGACAGAGCAGCCAAGCTCCC
CGCCTCAACCAGCCCAGCCCCTGGATAGCAGAAGGGAACCAGCAGAGACGAGACGAGGTGAGGCGAGGGGCTGTGTCCTCAGCATTGCCT
GGCCCTGGAGGGACAGCAGTGATGCCACTGCCAGAATGCAGCTTTCACATCAAGGTAAAGCCGGGTCTCCTGCTGGCCCCTGGGTGGTGA
GCTTCGACTTCCCAGGGGAAGGCAGTGAGTGGGAGAGAGACCAAACCTGGGCTTCCCAAGCATCCACTGAGAGATCTGTCAAGAGCCGAT
CCCTGGGTCCTAAGAGAGAGCCTTGCCTGGTTCTGCCCATGCCACCCTCTTGGAAGAGCCCAAGAAGGATACATGTCTGGCCATGCCTTT
GGGGAAAAGGAGTCGGAGAGATGTTTCCTGCTGACCATCCACCCCTTCATTTGGGAGGAGACACTGCTGAGAAGAACAGGCTTTGCTCTA
GGGCTCCATGTTTGGTTCCTGGTGGAGCCCTGTTGGGCATCATCACCATCACCTCCTTCTCTCCACCACCTCCTCCTCCCAGCCCCACTG
CTCTAGATTGCTGGGACACTAGGGAGTATGATAGGGCAGTAGCCAGGGCCATTGCTTAGTGTCCTGGAGCCCTGGATCTCCCTGCCCATA
GCCTGGATGCAGCAAGAGCTGGGAGGCGAAGTGGAAACATGCAGGGCTCAGGGTTGGGGAGTGATTGCAATTGCCTTCCTTGCCAAAGTG
ACTTGGGGCCCCAACGTTCCCAGCAGACCCCTTGAGGACAGAAATAGGTAGAGTCAGTCTCAAGACCTGGTGCATAGATAAATGCCTAAA
TACACTGCCTTGATCTCAAGTGATCTCAGAGGCCTCTTTCCCTGGCACCCTGAGAGGCAGCAGGCACTACATCTCCACTGTGTTTACATC
CTGCAGCTGGTCGGGGGGCAAAGATATTCCCAGTAAGAGATTCTTGGTTGGCCAGGTCAGGCCCAGGAGAACACCAAGAGGCCAGAGCCC
AGGACACAGCAGTAGACTGGGGCCTGGAAACACGTATCTTGCCTAGATTGTTTATTTGAATTTTTCCTACTATAAATATTTAAGGTGGTT
TACTTTATTTTAATAATTTAATTTACCCCAAAGTCCCTAAGGTAATTTATTGGAGGTTGAAACATGCATTCTTGCCACTGGGACAACATG
AGGCCTCTAACAGCACGGGCAGGCATGGGGTCCCCTGGGTGGACGAGGCCGCTTGGCAGCCAGGTTTGGAGACCTGGCCTCCTGGTCAGC
TTTGGAGGGCCCCTCAACAGAGCTGGAGCCCTGCACCCCAACACGGCTGGCCATGTGGCCTCAGAACACTACTTATTACTCAATGCCTGG
TACTTGGCCAGCCCCAGCGGTCAGTCTATAAATACTCACTGACAAGGTGGAGGGCTGGACGGCCATCACCACTCCCCAGACGTTCTCCAT
TGCCTGTCTCATTTCCCTCCCTCCCTCTGACACCTTTCTTCATGAGTCGAACGTGGATTACTAAAGCTCTATTAAGAGTGTGGAGATCCC
TCCAAGTTTCCCAAATGAGAACTCACAGGAAAACAGGACTGAACTTTGAGAATGTTGTTTATCGCAGCTTTGCACATAAACCTGAGTGTC
TCCCAGCCTGCCTCGGTTCTCACCAGCCTGCCAGCCTTTTCACCAGCCTCTCTCCTTAGCCTTATGGCCTTTCACGGCTCTTCTCCCTGC
CCCAGCTCTGCTGCCCGCCCTTCCTCACGTCCCCTGTGAGCTGCCTGAGCCATTGGTTGGATTTCGATGTGGCTCATTGCAGCATGTGGG
GCAGCGCCTCCCATGGCCTCGCCTTGGTGCCGGTGAACCCCTTTTGGTTGCACACATGCTCCCCACACACACATAGACATCAGCCTTCCT
GTAACTGACTGGGGACCCAGAGTGGAAACACCAGGATGGATCAGCTTGTCTGCAGAATTGCCCATCAGGAAGACCAAAAGCCAGTAGCTT
TGCTGATCTGCCCCCAGGACTCTGGAGGCGCCCCTGCACTCCCACCTCCCACCTGCCAGTTCCCAGACCCACCCATTCGGGATCACCTGG
ACCAGTTACCCCAAGTCCTGCATCTCCCTTCCCTGCAGGCTGAACACCAGGGTCATGCCAGTCCCGCCAGCCGCCTCCTCCATGCCCCAG
TGACTGGTGTGGGCAGAGCAGGCAGCCAGTGGAGCTGTGGGCCAGTTCCGCTCTTGGATGCTGCTGCTCTCACCCATGAGGTCAGGGGGG
CCCTCCAAGGTTATCTCCAGGTGAGGGGATTCACATCAGGCCACAAGCCACCAGAGGCCTTCTGCCACCTCCCAGAGCGACAGCCAGGGA
GGCTGCGTACTCAGCCTCGGGGAGAAATCCCCGTGGGACCTGAGCCCCAAGACCTACGGACCACTCAGCCTTACCATCGTACCGTCCAGG
ATTGTCCTTGCCATCTTTGTTGTCTCAGCCAGACCTTGGTTTTCAGTAAAGCCCCAGTTTCTACTTCCTGCATGCCACTGTGCAAGGCCA
CTCATCACTGTTCCTGCAGAAGCCTCTGGACGTGGGGCTGGATGGGGTTGAAAATGTTACATGTAAATATTGGTTTGGTTCGGTTTTTAG
CATTTTACTTGGTAACTGGTTGTTTTCTTTTTTGGGGTGGGGGGATTGGTTTGTAAAAATTCTCTACTCTTTTGGAATGTGATTTCTAAG
TTTGTTGGTTTCTTCAAATGCCTTTTAAGTCTTGGTAACATTCCCAAAGCAGAAAACTGCCTGACCCACAGTGGGGATTCCCTGGAGAAT
TGGGGTCCCAAGAAGGAATGCTGCCCTTCTCGAACCCGTTCTCCCCCTTCCTCCTGCCTCTCTGCCTTTTACTGCTATTCCCTTCTTCTC
CTCCTTTATCCTTCTTTCTGTTTTCCCCATCTCCACTCTCTCTTCAACCAAAGTCCCAAGGAACCCTCGGGGCTCAATCCCCCATAGACC
ACTTGGCTTGGGTCCATGGGGTTGGCATCAGTTGGTTGGCGGAAATGGGGGACCAGTTGGCATGATGGCCCTAAACTGGGAAACCTCATG
TTTCTTATGTCTCACCTCTTTCCAGAGCCAAATCAGCCCCTTTTGGAATGATGACTTCATTGGAATGCAAATCAAGTCATTTTGGTGCAT
CAGTGGCTCTTAGGCCTGCACACACGAGACATCAGAATCCAATCCTCTGACCCTGTGCCAGCCCTTTCCCCCAGTTTATTTCCCACCAAA
GGCTGACCTCTAAGAGGTCTTGCTTTCTATGAACTCAAGATGGGTCCCACCTCTAGGTGTCCCCAGGTGCACTCTTCTACCGGTTGGCTT
CCGATGTGACAAGGCCAAGGGCCCAAAGACTTGACCCTCTTACACCCTTGCTGACATGGTTCCATCATGTCCACCCGCATGCACTTTTAT
GGTTTCATCACCCAGCCTCTTCTCCTCTGGCCCACCCAGCGTCCAGGCTCTTTCTCCCTCTCCCCTCCTATCTAGAATGTCCCCTGCTTC
TAGCCTCACCAGACCCCCCAAGCTCCCACTACTTCTTCCATAATAATAGTAATAACAATGGTTATCATCATCCCCTGCACATCCCGCCTA
AAGCACTTTACTATATAGAAAACGTTTCCCCTGGCCGGGCATGGTGGCTCACGCCTGAAATCCCAGCTCTTTGGGAGGCTGAGGCGAGCG
GATCAGTTGAGGTCAGGAGTTCAACGCCAGCCTGGCCAACGTGGTGAATCCCTGTCTGTACTAAAAGTACAAAAAATTAGCTGAGCATGG
TGGTGCGTGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGTGGGAGAATCGCTTGAGCCCAGGAGGCGGAGGTTGCAGGAGCAGAGATTG
CGCCACTGCACTCCAGCCTGGATGACAGAGTGAGACCCAATCTCAAAAAAGAAATCGTTTCCCACCCCACATCTCCTTCAGACCTCTCAG
GGACAACTCTGGGAGGCAGCCTTGGCAGGACATGGGTTAGTGCGCCCATTTTGCTGTGAGGAAACTGAGGTACAGGTCTCATCCCAGAGC
ATGAGAAGTCACTGAGTTTAGATGAGAACTTGGGTCCAACTCTGTCCTGTTTGCTGTGCAAATCCGCTGCCCTGCTGGGGGCTTTTGGTG
GGTCCAGAATACCCAGAATATGCTGCTGGCCAACCCAGGCATAAAACAAGTCCATTCTAGATCACTGAGCCTTGTGTATTCCAGAGGGTG
ATCTGAGGTCCCCATTCAGCAGAATTCTCTGAGGGCATGTTCAGAATGTAGATTCCTGGGCCCCACCTTGAATTTGCATGTTTAACAAAC
TCTCCTGGGGTTGAGGGGTGGGTGCAGTGGTCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGTGGATCACTTGAGCTCAGGA
GTTCGAGCCTGGCCAATATGGTGAAACCCTGTCTCTACTAAAAATGCAAAAATTAGCCAGGTGTGGTGGCACATGCCTGTAATCCCAGCT
ACTTGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGTGGGGAGCAGCGGTTGCAGTGAGCCGAGATTGTGCCATTGCGCTCTAGCCTG
GGCAACGGAATGAGACTTGTCTCAAAAAAAATAAAAAATAAAACCAGCCCTCCCCGGGGGATCTTAGGCACTATTGGCCACACCATTGGT
GTTCGTGGCCCTGATTATTAGGCTATCTTTCTTTTTTTAAGTTTTTTTAGATTTATTTTTTATTTTATTTATTTATTTATTTATTATTTA
TTATTATTATTTTTAGAGACAGGGGGTCTCCCTATGTTGCCCAGGCTGGTTTCAAACTCCTGGGCTCAAGTGATCTGCCCTCCTCAGCCT
CCCAAAGTGCTGGGGTTACAGGCAGGCATCAGCCACCGTGCCAGGTCATCTTCCTTTTTCTTTTTTTTTTGGAGACAGAGTCTTGCTCTG
TCATTCAGGCTGGAGTACAGTGGCGCCATTTCAGCTCACCGCAGCCTCCACCTCCCAGGTTCAAGCAGTTCTCCTGCCTCAACCTCCCGA
GTAACTGGGATTACAGGTGTGTGCCACCACACCCGACTAATTTTTTTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGGT
CTTGAACTCCTGACCTCAGGTGATCCACCCACCTCGGCCTCCCAAAGTACTGGGATTATAGGCGTGAGCCACGGCATCCAGCCTCATCTT
TCTTTTAACCAATAAACATGATGCTGTATCTTAAAAAGAGCACTGAGCAGGGACTTAAGGGATCGAGTCCTCAACCAAACTGATTTAATT
ACTCAGGATTTTCAAAAAGCATCAGAGGCTATTTACAATCTTAATCATAGGGGTTCAGTAAAATAAAAATAAGAAGTAAAAAAGCAAGAG
AAATTATTCTGTAAATCTAACTGGTGTAATTCCCATAATCATGCAATTAAGTTTTACTCTTGAGTTTCCTGACAGCCATTGGTAAAAAGA
GAAACACATCAGGATTTATAATTTTTATCATCCAATTATGGGAAGCAAGCATGTTGGCCCCAGGAGACGAACTCTTCTACTAATTTATAG
CATGTATCCTTTCATCAAGGGCCACTGAGCCAGTTGGTGAGTCAACGGGTGAACCTAAGATGCAAGGATGTTTTCCAGGTGACTATTTAA
AAAAATAAATAATCCACCAACGTGATTGACCTTGGCGAGATCATGTTTCTAGTCTATACCTCAGTTTCCCCATCTGTAAAGTGAGGATAA
TGTCCCACCCCATGTAACTGTGGTGAGGACCAACTGCAACACTGTGCCTGCGAGTCTCCTTGGAAAAGTGTAAGGTTCTACACAAATGGA
AAGTGATCTGATCACACTCAGTGTCCCCAGCCCAGCCTTTCAGTGCCCTGGCCCTGGGGTGGGGGACAATACTCTCCTCACCCCCTTCAC
TAGTCTTCATGAATAGCAAGGAGGCCATAACATAATTTGGTCTAAACCCCTTCCTTTTTAAAAGAATGATGGCAAAATGTGCATAACATT
AAATTTACCATTTTAAAGATTTCTAAGTGTCTCGAAGTACATTTGCAATGTGTAACTGCCACCTCCAGAACTTTTTCATCATCCTAAACT
GAAACTCTGTACCCACTAAACAGTAACTCCCCACTCCCCCTGTCCCCAGTCCCTGGTAACCTCTATTCTACTTTTTTTTTTTCTCTGTGG
ATTGGCCTATTCTAGGTATCTCTTTTAAGTAGAATCATATAGTATTTGTCCTTTTGTGTCCAGCCCCCTCATTTTTTGAGATGAGGAATC
AGGCCCAGAGAAGGCAGTGGCTCACCCAAGGTCACATCGCAAACCAGAGGCAGAGCCAAGACCAGAACCCAGGTTTCCTGACTCCTAACA
GGTCTTGCATCCCCAGTAGTGAATGAGACACTGCTTTGACTTTCTGTAATCTTGGTTTAGCCCCTTCCTTTCTCTGGGCTCAGTCTGCTT
CCCACCACTCACCCATTCTTTCTTTATCATCTCCCTCCAAAGCCTCTTGTCCTCCTGCCTCCTCTTCTCCTTGGCTGGTTCCTGCCAGCA
AATGGAGTGGCCCTGGTCCATGCCAGGTTTTCCTCTTCTGGGTCCGGAGCTCACTATAGTATTCAGCCCTCAGTCCTCCCAGGATGTTCT
TCCCAGCACTGCTGCCTCGTGCGGATTTTCCCGTAACCTCAGTAACTGGCTTCTTGTCCCCCTGCTTCCTACCAGGGAAGCCTTCCTGTC
CGCGTCTGTGGTTTCCCAACTCTGATATTTGCTCTCAAATGTGGTGGTGTCCTGGTTCTGTGTTTATTTATTTTGTGTTTTCTCACACAC
ACAAAAAAAAGCCTCTTCCTCCAATGCATCAGGAGGCACCAGCCCTGCCAGCCCTTCTCACTGGGCTCACCCTGCCCCAGCAACCCCCCG
GTGCCACCCTTAAGCCACACTGTCTTCTCTGTAAGCAGCCTGCCAGCAGCAGCCCCAGCACTTTGCAATGGGCGTGTGTGTGGTGGTGGG
TGGGGGGGGCTTGGATCCCTCCTTTTTCCTCCCCTGCCCTGCCCAGGCCCAGATGGCCTTGACTGTAAAGCAGGTGCTGCCTGACAGGTT
CTTCTCTCCCTGTCTCTGGTCATTGATCCATCTCTTTGTCCATTCAGTATCCAACCATCCTCTCCATTCTCCTCTGGACCTCACCACTCT
CAGAGCTGCTTGTCCTGGCAGAATCTACAGTTCACCCCAACTCTATGCCTTACCCCTCCCAACCCAACAGCATTTGCAGTTTGCAAAATA
TACAGACCCAAGTCCTGAGGGGACTGAGGACATGATGCTGGGCCCAAGTCTCCTGCTCAGGGCTTCTCTCCAATGCCAGCCCTGCCACTC
CTTCCTCACCCTCCTTGGAGCCTCCTCTGCTGCTTGTCTATCCCAACGGCCCTGCTCCCCTCCCTTCCTGCCCTTCACCAGCTTTCTGGA
CACCATGCCCTGAGGAAGGGACCTTTGGTTTTCTCTAAACATCTTTGAAGGGCTGAGGCAGTCAGGGCTGGCTGCCTTGTCACTCTTTAT
TTGGAAGCCACTCAAACCATTCCCAAGAAGAGGGACCTCAGCTGGCAATCTGGAAACCTGGCCCAGGTCTGGGCAGATGTCTTCACTTCT
CCTACCTTCCCAGTCTTGTGATCCTGTGATGAGCACCAGGATGGCCCTGTGGTCCCTAGAGCACCCCTCATGCTGTAGGGTCCTGCAGCC
CCATCCTTTCTCTACTGGGCCCTGGTATCCTGGCTCCTCTCTCAGCTCTGCCACTGATCTCTGTGCCTTAGTTTACTTCTCTGCACGGGG
GACTCACCCCAAGACCATTTCCAGCAGCTTCCCAGGTGATGTGGTGCCCCAAGGCTGGGCTTTGCCAGCTGTGGCCCAGCTCCTTAGTGC
TGCCCAGGAGACACCAGGCTGCTCAGAATGAGGTGACTGCGGGCACCATTCTCAGCCAGTGGTTCTTGTATTGCATTCCAGCAGCAGGAA
TATCACCTGGGAACTTGATAGAAGTGCAGATTAGCAGCCCCACCCAAGACCCACTGAATTAGAGCTTGTGGAGTGGGGCCCTACAAGCTG
GGGTTTTAAGGAGCCCTCCAAATGATTCTGACGCATAAGAATATGCCAACTGCTGATCTGGGCTAGCCATTAGTAGAGCCTGGGGAGGGA
CTGGGACTGGCTAGGCCAAGAACAGGTGGAAAACACCAGCCTTATCTGGACTCCTGAGATTGGGAACCACCACCAACAAAAACCAACCCT
ATAGTCGCTCCTCTTGGAAGAGGAAGAGAAGTTGAAGGGCCTGGAGAAAGCACACATTGTTTGTTTCCCTGCTCCTGCTCACCTCTCTCA
CTTGTCTTGGTTTTACAAAAGGCTGTGTGGATGGTGCCAGCCAGGGAGGGGGTGGGAGTCCTGGGGAGGCAGGAGGCAGAAGACCCTGAC
TGTTTCTCCCTTGGGAACCTCACCATAGGCCAGATAGCGCCTCTTCAAACTGAAAGAAATCTTAACTCCACAAAGAAAGCATCCTAAATC
CCCAGTTCCTCCTCCTCCCAACCCCAGGGATACCTTGTAGACAGTGCCAAAAAACAGCTCCAACCCCCAGCAGCTGGGAAGAGAGCCAGA
AGCTGCCCTTCCTCCTCATCCTGGCCCCTCCCCAGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCAGCCTGGTTTCCTGGTGAGGGT
CCTGCAGTCATGGGCCCTGGGGGACCCCCAGGGCAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCTTGGGCCAACTGTCCACCTCTC
TTGTCCACTATTCTCTCCTTTCCACTTCTGTCTTCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAGGACAACTGCAGTCCAATCCAC
CAGCTCTCCCTGCCCCTGTGTCTTATTTCAGACATGAGAATAACTGTACAGTGTAAACTTATAAAGCGTTTTTAATGGTTGTAGATTGGA

>8490_8490_3_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000371356_SOGA1_chr20_35445872_ENST00000237536_length(amino acids)=1577AA_BP=3
MGRRLECNGVISAHCNLCFLGLSDSPASASRVAGITGIMDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSN
SSQRIPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSRPLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKG
KFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQR
TATHSLKKRGTRSLGKADKKTLVQEDSADLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRET
ELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMKDHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELE
DQNKLLLNELAKFRSEHELDVALSEDSCSVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGD
SAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAALVSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAIL
VRLSVLQQELNAFTRKADAVLGCSVKEQQESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRS
FRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLSQERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQE
KNMLVQESQQFKHNFLLLFMKLRWFLKRWRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADM
KVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAELKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQ
LQISERNWSQEKLQLVERLQGEKQQVEQQVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKS
VSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSSEPGDPEKDTKEKPGLSSRDCNHLGALACQDPPGRQMQRSYTAPDKTGIRVYY
SPPVARRLGVPVVHDKEGKIIIEPGFLFTTAKPKESAEADGLAESSYGRWLCNFSRQRLDGGSAGSPSAAGPGFPAALHDFEMSGNMSDD
MKEITNCVRQAMRSGSLERKVKSTSSQTVGLASVGTQTIRTVSVGLQTDPPRSSLHGKAWSPRSSSLVSVRSKQISSSLDKVHSRIERPC
CSPKYGSPKLQRRSVSKLDSSKDRSLWNLHQGKQNGSAWARSTTTRDSPVLRNINDGLSSLFSVVEHSGSTESVWKLGMSETRAKPEPPK
YGIVQEFFRNVCGRAPSPTSSAGEEGTKKPEPLSPASYHQPEGVARILNKKAAKLGSSEEVRLTMLPQVGKDGVLRDGDGAVVLPNEDAV

--------------------------------------------------------------
>8490_8490_4_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000371356_SOGA1_chr20_35445872_ENST00000279034_length(transcript)=3856nt_BP=837nt
CTTGGAAGACTTGGGTCCTTGGGTCGCAGGCTGGAGTGCAATGGTGTGATCTCAGCTCACTGCAACCTCTGCTTCCTGGGTTTAAGTGAT
TCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCATCATGGACCGATCTAAAGAAAACTGCATTTCAGGACCTGTTAAGGCTACA
GCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTGACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTAAATAGTGGCCAGGCTCAGCGG
GTCTTGTGTCCTTCAAATTCTTCCCAGCGCATTCCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAGCCGGTTCAGAATCAGAAGCAG
AAGCAATTGCAGGCAACCAGTGTACCTCATCCTGTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCT
GAAAATAATCCTGAGGAGGAACTGGCATCAAAACAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCTTTGGAAGACTTTGAAATTGGT
CGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTTTATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTGGCTCTTAAAGTGTTATTTAAA
GCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAGCTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGGCATCCTAATATTCTTAGACTG
TATGGTTATTTCCATGATGCTACCAGAGTCTACCTAATTCTGGAATATGCACCACTTGGAACAGTTTATAGAGAACTTCAGAAACTTTCA
AAGTTTGATGAGCAGAGAACTGCTACTCATTCCTTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCCGATAAGAAGACTTTGGTGCAG
GAGGACAGTGCAGACCTGAAGTGCCAGTTGCACTTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAGCTCACTAAGCTTGCCAAGGAG
AATGACAGCATGAAGGAGGAGCTGCTGAAGTACCGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCAGCCGAGGAGCTGGCCGATGCC
CCCCACTCGCGGGAGACCGAGCTGAAGGTGCACCTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGCCGCCGCATCGTGGAGCTGGAG
GTGGAGAACCGAGGCCTGCGGGCTGAGATGGACGACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCTGAGGCACGCCTGGCCTTCTCC
GCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTGGCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAGGAGGCCGAGCTGCTGCGGCGC
TCCTCTGCCGAGCTCGAGGACCAGAACAAGCTGCTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCACGAGCTGGACGTGGCGCTGTCG
GAGGACAGTTGTTCTGTGCTCAGCGAACCTTCACAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGCGAGCTCAGCGGCAAGGTCAAG
AAGCTGCAGTACGAGAACCGCGTGCTCCTCTCCAACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGTACGCGGCCCATGCTGGAGACG
GACGCCGAGGCCGGGGACTCTGCCCAGTGTGTGCCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCGGTCCGACTCTGCAGAGCCAGG
GAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAGGCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTGGCTGATGCCAATGGCTTCACG
GCTGGCCTCCGGCTGTGTCTGGACAACGAGTGTGCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGCGAGGGCCCCAGGGACACCAAG
CTCATCCATGCCATCCTGGTGCGCCTGAGCGTGCTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCAGATGCAGTCCTCGGGTGCTCT
GTCAAGGAACAGCAGGAGTCCTTCTCATCACTGCCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATTCTTCTGGCAAAAGACCTTGGC
TCAGACTTTCAGCCACCTGACTTCAGGGACCTGCCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGCACTGGTGACTTGGACTCTAAG
CCCGACCCCAGCCGGAGCTTCAGGCCTTACCGAGCTGAAGACAATGATTCCTATGCCTCTGAGATCAAGGAGCTGCAGCTGGTGCTGGCT
GAGGCCCACGACAGCCTCCGGGGCTTGCAAGAGCAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAGGCCGACAATTTCAACCAGAAA
ATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCGCTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGCCTCCAGCGGAGGCTGGAGCAG
AAATTCTGGAGCCAGGAGAAGAACATGCTGGTGCAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTGCTCTTCATGAAGCTCAGGTGG
TTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTGCCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGCATGAAGGAGCTGTACTTGCTG
ATGGAGGAAGAGGAGATAAACGCTCAGCATTCTGATAACAAGGCCTGCACGGGGGACAGCTGGACCCAGAACACGCCCAATGAGTACATC
AAGACACTGGCCGACATGAAGGTGACGCTGAAGGAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGTCTGACGGAGCTTCAGCAACAG
TTTGCCAAGGCCAAGGCTACCTGGGAGACAGAGCGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAGCTGAAGACAGGGAAGGGGGCC
GGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCCCTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTAGCTGAGTCCTACAGCGCTGTC
ATGGAGCTGACTCGGCAGCTGCAGATCAGTGAGCGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAGCGGCTGCAGGGTGAGAAGCAG
CAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAACCGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGGGTCCTGAAGCACTCGGAGCTG
GAGAAGCAGGACAACAGCTGGAAGGAGACACGCAGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAAGTTGAGCTTGGAGGAAATGGT
TTAAAGAGAACCAAATCTGTTTCTTCCATGTCTGAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTTGCTGGCGGAGATGCCCGGGGC
AAGAAGCTGCCTAACAACCCTGCCTTTGGCTTTGTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACCAAGGAGAAGCCTGGGCTCTCG
TCGAGGGACTGCAACCACCTGGGTGCCCTGGCCTGCCAGGACCCCCCAGGGAGCCAGAAGCTGCCCTTCCTCCTCATCCTGGCCCCTCCC
CAGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCAGCCTGGTTTCCTGGTGAGGGTCCTGCAGTCATGGGCCCTGGGGGACCCCCAGG
GCAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCTTGGGCCAACTGTCCACCTCTCTTGTCCACTATTCTCTCCTTTCCACTTCTGTC
TTCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAGGACAACTGCAGTCCAATCCACCAGCTCTCCCTGCCCCTGTGTCTTATTTCAGA

>8490_8490_4_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000371356_SOGA1_chr20_35445872_ENST00000279034_length(amino acids)=1170AA_BP=3
MGRRLECNGVISAHCNLCFLGLSDSPASASRVAGITGIMDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSN
SSQRIPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSRPLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKG
KFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQR
TATHSLKKRGTRSLGKADKKTLVQEDSADLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRET
ELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMKDHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELE
DQNKLLLNELAKFRSEHELDVALSEDSCSVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGD
SAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAALVSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAIL
VRLSVLQQELNAFTRKADAVLGCSVKEQQESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRS
FRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLSQERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQE
KNMLVQESQQFKHNFLLLFMKLRWFLKRWRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADM
KVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAELKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQ
LQISERNWSQEKLQLVERLQGEKQQVEQQVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKS
VSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSSEPGDPEKDTKEKPGLSSRDCNHLGALACQDPPGSQKLPFLLILAPPQPPPIL

--------------------------------------------------------------
>8490_8490_5_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000395913_SOGA1_chr20_35445872_ENST00000237536_length(transcript)=13852nt_BP=894nt
CTTAAACGCGACTCAAGGCGTCGGGTTTGTTGTCAACCAATCACAAGGCAGCCTCGCTCGAGCGCAGGCCAATCGGCTTTCTAGCTAGAG
GGTTTAACTCCTATTTAAAAAGAAGAACCTTTGAATTCTAACGGCTGAGCTCTTGGAAGACTTGGGTCCTTGGGTCGCAGGTGGGAGCCG
ACGGGCATCATGGACCGATCTAAAGAAAACTGCATTTCAGGACCTGTTAAGGCTACAGCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTG
ACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTAAATAGTGGCCAGGCTCAGCGGGTCTTGTGTCCTTCAAATTCTTCCCAGCGCATT
CCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAGCCGGTTCAGAATCAGAAGCAGAAGCAATTGCAGGCAACCAGTGTACCTCATCCT
GTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCTGAAAATAATCCTGAGGAGGAACTGGCATCAAAA
CAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCTTTGGAAGACTTTGAAATTGGTCGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTT
TATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTGGCTCTTAAAGTGTTATTTAAAGCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAG
CTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGGCATCCTAATATTCTTAGACTGTATGGTTATTTCCATGATGCTACCAGAGTCTAC
CTAATTCTGGAATATGCACCACTTGGAACAGTTTATAGAGAACTTCAGAAACTTTCAAAGTTTGATGAGCAGAGAACTGCTACTCATTCC
TTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCCGATAAGAAGACTTTGGTGCAGGAGGACAGTGCAGACCTGAAGTGCCAGTTGCAC
TTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAGCTCACTAAGCTTGCCAAGGAGAATGACAGCATGAAGGAGGAGCTGCTGAAGTAC
CGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCAGCCGAGGAGCTGGCCGATGCCCCCCACTCGCGGGAGACCGAGCTGAAGGTGCAC
CTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGCCGCCGCATCGTGGAGCTGGAGGTGGAGAACCGAGGCCTGCGGGCTGAGATGGAC
GACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCTGAGGCACGCCTGGCCTTCTCCGCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTG
GCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAGGAGGCCGAGCTGCTGCGGCGCTCCTCTGCCGAGCTCGAGGACCAGAACAAGCTG
CTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCACGAGCTGGACGTGGCGCTGTCGGAGGACAGTTGTTCTGTGCTCAGCGAACCTTCA
CAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGCGAGCTCAGCGGCAAGGTCAAGAAGCTGCAGTACGAGAACCGCGTGCTCCTCTCC
AACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGTACGCGGCCCATGCTGGAGACGGACGCCGAGGCCGGGGACTCTGCCCAGTGTGTG
CCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCGGTCCGACTCTGCAGAGCCAGGGAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAG
GCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTGGCTGATGCCAATGGCTTCACGGCTGGCCTCCGGCTGTGTCTGGACAACGAGTGT
GCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGCGAGGGCCCCAGGGACACCAAGCTCATCCATGCCATCCTGGTGCGCCTGAGCGTG
CTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCAGATGCAGTCCTCGGGTGCTCTGTCAAGGAACAGCAGGAGTCCTTCTCATCACTG
CCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATTCTTCTGGCAAAAGACCTTGGCTCAGACTTTCAGCCACCTGACTTCAGGGACCTG
CCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGCACTGGTGACTTGGACTCTAAGCCCGACCCCAGCCGGAGCTTCAGGCCTTACCGA
GCTGAAGACAATGATTCCTATGCCTCTGAGATCAAGGAGCTGCAGCTGGTGCTGGCTGAGGCCCACGACAGCCTCCGGGGCTTGCAAGAG
CAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAGGCCGACAATTTCAACCAGAAAATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCG
CTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGCCTCCAGCGGAGGCTGGAGCAGAAATTCTGGAGCCAGGAGAAGAACATGCTGGTG
CAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTGCTCTTCATGAAGCTCAGGTGGTTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTG
CCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGCATGAAGGAGCTGTACTTGCTGATGGAGGAAGAGGAGATAAACGCTCAGCATTCT
GATAACAAGGCCTGCACGGGGGACAGCTGGACCCAGAACACGCCCAATGAGTACATCAAGACACTGGCCGACATGAAGGTGACGCTGAAG
GAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGTCTGACGGAGCTTCAGCAACAGTTTGCCAAGGCCAAGGCTACCTGGGAGACAGAG
CGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAGCTGAAGACAGGGAAGGGGGCCGGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCC
CTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTAGCTGAGTCCTACAGCGCTGTCATGGAGCTGACTCGGCAGCTGCAGATCAGTGAG
CGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAGCGGCTGCAGGGTGAGAAGCAGCAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAAC
CGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGGGTCCTGAAGCACTCGGAGCTGGAGAAGCAGGACAACAGCTGGAAGGAGACACGC
AGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAAGTTGAGCTTGGAGGAAATGGTTTAAAGAGAACCAAATCTGTTTCTTCCATGTCT
GAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTTGCTGGCGGAGATGCCCGGGGCAAGAAGCTGCCTAACAACCCTGCCTTTGGCTTT
GTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACCAAGGAGAAGCCTGGGCTCTCGTCGAGGGACTGCAACCACCTGGGTGCCCTGGCC
TGCCAGGACCCCCCAGGGAGGCAGATGCAGCGCAGCTACACGGCTCCTGACAAGACGGGCATCCGAGTCTACTATAGTCCCCCGGTGGCC
CGGCGCCTCGGAGTCCCTGTGGTTCATGACAAAGAGGGCAAGATCATTATCGAGCCCGGCTTCCTCTTCACCACAGCCAAGCCCAAAGAG
TCGGCCGAGGCTGATGGGCTGGCTGAGAGCTCCTATGGTCGGTGGCTCTGCAACTTCTCACGGCAGCGCCTGGACGGAGGCTCAGCGGGC
AGCCCCTCGGCGGCCGGGCCTGGCTTCCCAGCGGCCCTGCATGACTTTGAGATGTCAGGCAACATGAGTGATGACATGAAGGAGATCACC
AACTGTGTGCGCCAGGCCATGCGCTCCGGCTCACTGGAGAGGAAAGTGAAGAGCACATCCAGCCAGACGGTGGGCCTGGCCAGTGTGGGC
ACACAGACCATCCGCACGGTCAGCGTGGGCCTGCAGACCGACCCACCCCGCAGCAGCCTCCATGGCAAGGCCTGGTCACCCCGCAGCTCT
TCGCTCGTGTCTGTGCGCAGCAAGCAGATCTCCTCCTCCCTGGACAAGGTCCATTCGCGCATCGAGCGGCCCTGCTGCTCCCCCAAGTAT
GGCTCACCAAAGCTCCAGAGGCGGTCTGTGTCCAAGCTGGACAGCAGCAAGGACCGCAGCCTGTGGAACCTGCACCAGGGCAAGCAGAAC
GGCTCGGCCTGGGCCCGCTCCACCACCACGCGGGACAGCCCTGTATTGAGAAACATCAACGATGGACTCTCCAGCCTCTTCAGTGTGGTG
GAGCACTCAGGGAGCACGGAGTCTGTCTGGAAACTAGGCATGTCTGAGACGCGGGCCAAGCCCGAGCCTCCCAAGTACGGCATTGTGCAG
GAATTCTTCCGTAATGTGTGTGGCCGGGCACCGAGCCCCACCTCATCAGCAGGAGAGGAGGGCACCAAGAAGCCAGAGCCCCTCTCCCCA
GCCAGCTACCATCAGCCAGAGGGTGTGGCCAGGATCCTGAACAAGAAGGCAGCCAAGTTGGGCAGCAGTGAGGAGGTCAGACTCACCATG
CTCCCCCAGGTGGGGAAGGATGGTGTCCTCCGGGACGGAGATGGAGCCGTGGTCCTTCCCAATGAGGACGCTGTTTGTGACTGTAGTACC
CAGTCTCTCACCTCCTGCTTCGCCCGATCGTCCCGCTCTGCCATCCGCCACTCTCCTTCCAAGTGCAGGCTGCACCCTTCAGAGTCCAGC
TGGGGTGGGGAGGAGAGGGCACTCCCCCCCAGCGAGTGACAGAGCAGCCAAGCTCCCCGCCTCAACCAGCCCAGCCCCTGGATAGCAGAA
GGGAACCAGCAGAGACGAGACGAGGTGAGGCGAGGGGCTGTGTCCTCAGCATTGCCTGGCCCTGGAGGGACAGCAGTGATGCCACTGCCA
GAATGCAGCTTTCACATCAAGGTAAAGCCGGGTCTCCTGCTGGCCCCTGGGTGGTGAGCTTCGACTTCCCAGGGGAAGGCAGTGAGTGGG
AGAGAGACCAAACCTGGGCTTCCCAAGCATCCACTGAGAGATCTGTCAAGAGCCGATCCCTGGGTCCTAAGAGAGAGCCTTGCCTGGTTC
TGCCCATGCCACCCTCTTGGAAGAGCCCAAGAAGGATACATGTCTGGCCATGCCTTTGGGGAAAAGGAGTCGGAGAGATGTTTCCTGCTG
ACCATCCACCCCTTCATTTGGGAGGAGACACTGCTGAGAAGAACAGGCTTTGCTCTAGGGCTCCATGTTTGGTTCCTGGTGGAGCCCTGT
TGGGCATCATCACCATCACCTCCTTCTCTCCACCACCTCCTCCTCCCAGCCCCACTGCTCTAGATTGCTGGGACACTAGGGAGTATGATA
GGGCAGTAGCCAGGGCCATTGCTTAGTGTCCTGGAGCCCTGGATCTCCCTGCCCATAGCCTGGATGCAGCAAGAGCTGGGAGGCGAAGTG
GAAACATGCAGGGCTCAGGGTTGGGGAGTGATTGCAATTGCCTTCCTTGCCAAAGTGACTTGGGGCCCCAACGTTCCCAGCAGACCCCTT
GAGGACAGAAATAGGTAGAGTCAGTCTCAAGACCTGGTGCATAGATAAATGCCTAAATACACTGCCTTGATCTCAAGTGATCTCAGAGGC
CTCTTTCCCTGGCACCCTGAGAGGCAGCAGGCACTACATCTCCACTGTGTTTACATCCTGCAGCTGGTCGGGGGGCAAAGATATTCCCAG
TAAGAGATTCTTGGTTGGCCAGGTCAGGCCCAGGAGAACACCAAGAGGCCAGAGCCCAGGACACAGCAGTAGACTGGGGCCTGGAAACAC
GTATCTTGCCTAGATTGTTTATTTGAATTTTTCCTACTATAAATATTTAAGGTGGTTTACTTTATTTTAATAATTTAATTTACCCCAAAG
TCCCTAAGGTAATTTATTGGAGGTTGAAACATGCATTCTTGCCACTGGGACAACATGAGGCCTCTAACAGCACGGGCAGGCATGGGGTCC
CCTGGGTGGACGAGGCCGCTTGGCAGCCAGGTTTGGAGACCTGGCCTCCTGGTCAGCTTTGGAGGGCCCCTCAACAGAGCTGGAGCCCTG
CACCCCAACACGGCTGGCCATGTGGCCTCAGAACACTACTTATTACTCAATGCCTGGTACTTGGCCAGCCCCAGCGGTCAGTCTATAAAT
ACTCACTGACAAGGTGGAGGGCTGGACGGCCATCACCACTCCCCAGACGTTCTCCATTGCCTGTCTCATTTCCCTCCCTCCCTCTGACAC
CTTTCTTCATGAGTCGAACGTGGATTACTAAAGCTCTATTAAGAGTGTGGAGATCCCTCCAAGTTTCCCAAATGAGAACTCACAGGAAAA
CAGGACTGAACTTTGAGAATGTTGTTTATCGCAGCTTTGCACATAAACCTGAGTGTCTCCCAGCCTGCCTCGGTTCTCACCAGCCTGCCA
GCCTTTTCACCAGCCTCTCTCCTTAGCCTTATGGCCTTTCACGGCTCTTCTCCCTGCCCCAGCTCTGCTGCCCGCCCTTCCTCACGTCCC
CTGTGAGCTGCCTGAGCCATTGGTTGGATTTCGATGTGGCTCATTGCAGCATGTGGGGCAGCGCCTCCCATGGCCTCGCCTTGGTGCCGG
TGAACCCCTTTTGGTTGCACACATGCTCCCCACACACACATAGACATCAGCCTTCCTGTAACTGACTGGGGACCCAGAGTGGAAACACCA
GGATGGATCAGCTTGTCTGCAGAATTGCCCATCAGGAAGACCAAAAGCCAGTAGCTTTGCTGATCTGCCCCCAGGACTCTGGAGGCGCCC
CTGCACTCCCACCTCCCACCTGCCAGTTCCCAGACCCACCCATTCGGGATCACCTGGACCAGTTACCCCAAGTCCTGCATCTCCCTTCCC
TGCAGGCTGAACACCAGGGTCATGCCAGTCCCGCCAGCCGCCTCCTCCATGCCCCAGTGACTGGTGTGGGCAGAGCAGGCAGCCAGTGGA
GCTGTGGGCCAGTTCCGCTCTTGGATGCTGCTGCTCTCACCCATGAGGTCAGGGGGGCCCTCCAAGGTTATCTCCAGGTGAGGGGATTCA
CATCAGGCCACAAGCCACCAGAGGCCTTCTGCCACCTCCCAGAGCGACAGCCAGGGAGGCTGCGTACTCAGCCTCGGGGAGAAATCCCCG
TGGGACCTGAGCCCCAAGACCTACGGACCACTCAGCCTTACCATCGTACCGTCCAGGATTGTCCTTGCCATCTTTGTTGTCTCAGCCAGA
CCTTGGTTTTCAGTAAAGCCCCAGTTTCTACTTCCTGCATGCCACTGTGCAAGGCCACTCATCACTGTTCCTGCAGAAGCCTCTGGACGT
GGGGCTGGATGGGGTTGAAAATGTTACATGTAAATATTGGTTTGGTTCGGTTTTTAGCATTTTACTTGGTAACTGGTTGTTTTCTTTTTT
GGGGTGGGGGGATTGGTTTGTAAAAATTCTCTACTCTTTTGGAATGTGATTTCTAAGTTTGTTGGTTTCTTCAAATGCCTTTTAAGTCTT
GGTAACATTCCCAAAGCAGAAAACTGCCTGACCCACAGTGGGGATTCCCTGGAGAATTGGGGTCCCAAGAAGGAATGCTGCCCTTCTCGA
ACCCGTTCTCCCCCTTCCTCCTGCCTCTCTGCCTTTTACTGCTATTCCCTTCTTCTCCTCCTTTATCCTTCTTTCTGTTTTCCCCATCTC
CACTCTCTCTTCAACCAAAGTCCCAAGGAACCCTCGGGGCTCAATCCCCCATAGACCACTTGGCTTGGGTCCATGGGGTTGGCATCAGTT
GGTTGGCGGAAATGGGGGACCAGTTGGCATGATGGCCCTAAACTGGGAAACCTCATGTTTCTTATGTCTCACCTCTTTCCAGAGCCAAAT
CAGCCCCTTTTGGAATGATGACTTCATTGGAATGCAAATCAAGTCATTTTGGTGCATCAGTGGCTCTTAGGCCTGCACACACGAGACATC
AGAATCCAATCCTCTGACCCTGTGCCAGCCCTTTCCCCCAGTTTATTTCCCACCAAAGGCTGACCTCTAAGAGGTCTTGCTTTCTATGAA
CTCAAGATGGGTCCCACCTCTAGGTGTCCCCAGGTGCACTCTTCTACCGGTTGGCTTCCGATGTGACAAGGCCAAGGGCCCAAAGACTTG
ACCCTCTTACACCCTTGCTGACATGGTTCCATCATGTCCACCCGCATGCACTTTTATGGTTTCATCACCCAGCCTCTTCTCCTCTGGCCC
ACCCAGCGTCCAGGCTCTTTCTCCCTCTCCCCTCCTATCTAGAATGTCCCCTGCTTCTAGCCTCACCAGACCCCCCAAGCTCCCACTACT
TCTTCCATAATAATAGTAATAACAATGGTTATCATCATCCCCTGCACATCCCGCCTAAAGCACTTTACTATATAGAAAACGTTTCCCCTG
GCCGGGCATGGTGGCTCACGCCTGAAATCCCAGCTCTTTGGGAGGCTGAGGCGAGCGGATCAGTTGAGGTCAGGAGTTCAACGCCAGCCT
GGCCAACGTGGTGAATCCCTGTCTGTACTAAAAGTACAAAAAATTAGCTGAGCATGGTGGTGCGTGCCTGTAATCCCAGCTACTCGGGAG
GCTGAGGTGGGAGAATCGCTTGAGCCCAGGAGGCGGAGGTTGCAGGAGCAGAGATTGCGCCACTGCACTCCAGCCTGGATGACAGAGTGA
GACCCAATCTCAAAAAAGAAATCGTTTCCCACCCCACATCTCCTTCAGACCTCTCAGGGACAACTCTGGGAGGCAGCCTTGGCAGGACAT
GGGTTAGTGCGCCCATTTTGCTGTGAGGAAACTGAGGTACAGGTCTCATCCCAGAGCATGAGAAGTCACTGAGTTTAGATGAGAACTTGG
GTCCAACTCTGTCCTGTTTGCTGTGCAAATCCGCTGCCCTGCTGGGGGCTTTTGGTGGGTCCAGAATACCCAGAATATGCTGCTGGCCAA
CCCAGGCATAAAACAAGTCCATTCTAGATCACTGAGCCTTGTGTATTCCAGAGGGTGATCTGAGGTCCCCATTCAGCAGAATTCTCTGAG
GGCATGTTCAGAATGTAGATTCCTGGGCCCCACCTTGAATTTGCATGTTTAACAAACTCTCCTGGGGTTGAGGGGTGGGTGCAGTGGTCA
CACCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGTGGATCACTTGAGCTCAGGAGTTCGAGCCTGGCCAATATGGTGAAACCCTGTC
TCTACTAAAAATGCAAAAATTAGCCAGGTGTGGTGGCACATGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCACTTGAA
CCTGGTGGGGAGCAGCGGTTGCAGTGAGCCGAGATTGTGCCATTGCGCTCTAGCCTGGGCAACGGAATGAGACTTGTCTCAAAAAAAATA
AAAAATAAAACCAGCCCTCCCCGGGGGATCTTAGGCACTATTGGCCACACCATTGGTGTTCGTGGCCCTGATTATTAGGCTATCTTTCTT
TTTTTAAGTTTTTTTAGATTTATTTTTTATTTTATTTATTTATTTATTTATTATTTATTATTATTATTTTTAGAGACAGGGGGTCTCCCT
ATGTTGCCCAGGCTGGTTTCAAACTCCTGGGCTCAAGTGATCTGCCCTCCTCAGCCTCCCAAAGTGCTGGGGTTACAGGCAGGCATCAGC
CACCGTGCCAGGTCATCTTCCTTTTTCTTTTTTTTTTGGAGACAGAGTCTTGCTCTGTCATTCAGGCTGGAGTACAGTGGCGCCATTTCA
GCTCACCGCAGCCTCCACCTCCCAGGTTCAAGCAGTTCTCCTGCCTCAACCTCCCGAGTAACTGGGATTACAGGTGTGTGCCACCACACC
CGACTAATTTTTTTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCACCCACC
TCGGCCTCCCAAAGTACTGGGATTATAGGCGTGAGCCACGGCATCCAGCCTCATCTTTCTTTTAACCAATAAACATGATGCTGTATCTTA
AAAAGAGCACTGAGCAGGGACTTAAGGGATCGAGTCCTCAACCAAACTGATTTAATTACTCAGGATTTTCAAAAAGCATCAGAGGCTATT
TACAATCTTAATCATAGGGGTTCAGTAAAATAAAAATAAGAAGTAAAAAAGCAAGAGAAATTATTCTGTAAATCTAACTGGTGTAATTCC
CATAATCATGCAATTAAGTTTTACTCTTGAGTTTCCTGACAGCCATTGGTAAAAAGAGAAACACATCAGGATTTATAATTTTTATCATCC
AATTATGGGAAGCAAGCATGTTGGCCCCAGGAGACGAACTCTTCTACTAATTTATAGCATGTATCCTTTCATCAAGGGCCACTGAGCCAG
TTGGTGAGTCAACGGGTGAACCTAAGATGCAAGGATGTTTTCCAGGTGACTATTTAAAAAAATAAATAATCCACCAACGTGATTGACCTT
GGCGAGATCATGTTTCTAGTCTATACCTCAGTTTCCCCATCTGTAAAGTGAGGATAATGTCCCACCCCATGTAACTGTGGTGAGGACCAA
CTGCAACACTGTGCCTGCGAGTCTCCTTGGAAAAGTGTAAGGTTCTACACAAATGGAAAGTGATCTGATCACACTCAGTGTCCCCAGCCC
AGCCTTTCAGTGCCCTGGCCCTGGGGTGGGGGACAATACTCTCCTCACCCCCTTCACTAGTCTTCATGAATAGCAAGGAGGCCATAACAT
AATTTGGTCTAAACCCCTTCCTTTTTAAAAGAATGATGGCAAAATGTGCATAACATTAAATTTACCATTTTAAAGATTTCTAAGTGTCTC
GAAGTACATTTGCAATGTGTAACTGCCACCTCCAGAACTTTTTCATCATCCTAAACTGAAACTCTGTACCCACTAAACAGTAACTCCCCA
CTCCCCCTGTCCCCAGTCCCTGGTAACCTCTATTCTACTTTTTTTTTTTCTCTGTGGATTGGCCTATTCTAGGTATCTCTTTTAAGTAGA
ATCATATAGTATTTGTCCTTTTGTGTCCAGCCCCCTCATTTTTTGAGATGAGGAATCAGGCCCAGAGAAGGCAGTGGCTCACCCAAGGTC
ACATCGCAAACCAGAGGCAGAGCCAAGACCAGAACCCAGGTTTCCTGACTCCTAACAGGTCTTGCATCCCCAGTAGTGAATGAGACACTG
CTTTGACTTTCTGTAATCTTGGTTTAGCCCCTTCCTTTCTCTGGGCTCAGTCTGCTTCCCACCACTCACCCATTCTTTCTTTATCATCTC
CCTCCAAAGCCTCTTGTCCTCCTGCCTCCTCTTCTCCTTGGCTGGTTCCTGCCAGCAAATGGAGTGGCCCTGGTCCATGCCAGGTTTTCC
TCTTCTGGGTCCGGAGCTCACTATAGTATTCAGCCCTCAGTCCTCCCAGGATGTTCTTCCCAGCACTGCTGCCTCGTGCGGATTTTCCCG
TAACCTCAGTAACTGGCTTCTTGTCCCCCTGCTTCCTACCAGGGAAGCCTTCCTGTCCGCGTCTGTGGTTTCCCAACTCTGATATTTGCT
CTCAAATGTGGTGGTGTCCTGGTTCTGTGTTTATTTATTTTGTGTTTTCTCACACACACAAAAAAAAGCCTCTTCCTCCAATGCATCAGG
AGGCACCAGCCCTGCCAGCCCTTCTCACTGGGCTCACCCTGCCCCAGCAACCCCCCGGTGCCACCCTTAAGCCACACTGTCTTCTCTGTA
AGCAGCCTGCCAGCAGCAGCCCCAGCACTTTGCAATGGGCGTGTGTGTGGTGGTGGGTGGGGGGGGCTTGGATCCCTCCTTTTTCCTCCC
CTGCCCTGCCCAGGCCCAGATGGCCTTGACTGTAAAGCAGGTGCTGCCTGACAGGTTCTTCTCTCCCTGTCTCTGGTCATTGATCCATCT
CTTTGTCCATTCAGTATCCAACCATCCTCTCCATTCTCCTCTGGACCTCACCACTCTCAGAGCTGCTTGTCCTGGCAGAATCTACAGTTC
ACCCCAACTCTATGCCTTACCCCTCCCAACCCAACAGCATTTGCAGTTTGCAAAATATACAGACCCAAGTCCTGAGGGGACTGAGGACAT
GATGCTGGGCCCAAGTCTCCTGCTCAGGGCTTCTCTCCAATGCCAGCCCTGCCACTCCTTCCTCACCCTCCTTGGAGCCTCCTCTGCTGC
TTGTCTATCCCAACGGCCCTGCTCCCCTCCCTTCCTGCCCTTCACCAGCTTTCTGGACACCATGCCCTGAGGAAGGGACCTTTGGTTTTC
TCTAAACATCTTTGAAGGGCTGAGGCAGTCAGGGCTGGCTGCCTTGTCACTCTTTATTTGGAAGCCACTCAAACCATTCCCAAGAAGAGG
GACCTCAGCTGGCAATCTGGAAACCTGGCCCAGGTCTGGGCAGATGTCTTCACTTCTCCTACCTTCCCAGTCTTGTGATCCTGTGATGAG
CACCAGGATGGCCCTGTGGTCCCTAGAGCACCCCTCATGCTGTAGGGTCCTGCAGCCCCATCCTTTCTCTACTGGGCCCTGGTATCCTGG
CTCCTCTCTCAGCTCTGCCACTGATCTCTGTGCCTTAGTTTACTTCTCTGCACGGGGGACTCACCCCAAGACCATTTCCAGCAGCTTCCC
AGGTGATGTGGTGCCCCAAGGCTGGGCTTTGCCAGCTGTGGCCCAGCTCCTTAGTGCTGCCCAGGAGACACCAGGCTGCTCAGAATGAGG
TGACTGCGGGCACCATTCTCAGCCAGTGGTTCTTGTATTGCATTCCAGCAGCAGGAATATCACCTGGGAACTTGATAGAAGTGCAGATTA
GCAGCCCCACCCAAGACCCACTGAATTAGAGCTTGTGGAGTGGGGCCCTACAAGCTGGGGTTTTAAGGAGCCCTCCAAATGATTCTGACG
CATAAGAATATGCCAACTGCTGATCTGGGCTAGCCATTAGTAGAGCCTGGGGAGGGACTGGGACTGGCTAGGCCAAGAACAGGTGGAAAA
CACCAGCCTTATCTGGACTCCTGAGATTGGGAACCACCACCAACAAAAACCAACCCTATAGTCGCTCCTCTTGGAAGAGGAAGAGAAGTT
GAAGGGCCTGGAGAAAGCACACATTGTTTGTTTCCCTGCTCCTGCTCACCTCTCTCACTTGTCTTGGTTTTACAAAAGGCTGTGTGGATG
GTGCCAGCCAGGGAGGGGGTGGGAGTCCTGGGGAGGCAGGAGGCAGAAGACCCTGACTGTTTCTCCCTTGGGAACCTCACCATAGGCCAG
ATAGCGCCTCTTCAAACTGAAAGAAATCTTAACTCCACAAAGAAAGCATCCTAAATCCCCAGTTCCTCCTCCTCCCAACCCCAGGGATAC
CTTGTAGACAGTGCCAAAAAACAGCTCCAACCCCCAGCAGCTGGGAAGAGAGCCAGAAGCTGCCCTTCCTCCTCATCCTGGCCCCTCCCC
AGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCAGCCTGGTTTCCTGGTGAGGGTCCTGCAGTCATGGGCCCTGGGGGACCCCCAGGG
CAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCTTGGGCCAACTGTCCACCTCTCTTGTCCACTATTCTCTCCTTTCCACTTCTGTCT
TCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAGGACAACTGCAGTCCAATCCACCAGCTCTCCCTGCCCCTGTGTCTTATTTCAGAC

>8490_8490_5_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000395913_SOGA1_chr20_35445872_ENST00000237536_length(amino acids)=1549AA_BP=245
MGRRWEPTGIMDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSNSSQRIPLQAQKLVSSHKPVQNQKQKQLQ
ATSVPHPVSRPLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLE
KAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATHSLKKRGTRSLGKADKKTLVQEDSA
DLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENR
GLRAEMDDMKDHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELAKFRSEHELDVALSEDSC
SVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEV
LPGLREQAALVSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELNAFTRKADAVLGCSVKEQ
QESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHD
SLRGLQEQLSQERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQFKHNFLLLFMKLRWFLKR
WRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKA
KATWETERAELKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQEKLQLVERLQGEKQQVEQ
QVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLP
NNPAFGFVSSEPGDPEKDTKEKPGLSSRDCNHLGALACQDPPGRQMQRSYTAPDKTGIRVYYSPPVARRLGVPVVHDKEGKIIIEPGFLF
TTAKPKESAEADGLAESSYGRWLCNFSRQRLDGGSAGSPSAAGPGFPAALHDFEMSGNMSDDMKEITNCVRQAMRSGSLERKVKSTSSQT
VGLASVGTQTIRTVSVGLQTDPPRSSLHGKAWSPRSSSLVSVRSKQISSSLDKVHSRIERPCCSPKYGSPKLQRRSVSKLDSSKDRSLWN
LHQGKQNGSAWARSTTTRDSPVLRNINDGLSSLFSVVEHSGSTESVWKLGMSETRAKPEPPKYGIVQEFFRNVCGRAPSPTSSAGEEGTK
KPEPLSPASYHQPEGVARILNKKAAKLGSSEEVRLTMLPQVGKDGVLRDGDGAVVLPNEDAVCDCSTQSLTSCFARSSRSAIRHSPSKCR

--------------------------------------------------------------
>8490_8490_6_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000395913_SOGA1_chr20_35445872_ENST00000279034_length(transcript)=3913nt_BP=894nt
CTTAAACGCGACTCAAGGCGTCGGGTTTGTTGTCAACCAATCACAAGGCAGCCTCGCTCGAGCGCAGGCCAATCGGCTTTCTAGCTAGAG
GGTTTAACTCCTATTTAAAAAGAAGAACCTTTGAATTCTAACGGCTGAGCTCTTGGAAGACTTGGGTCCTTGGGTCGCAGGTGGGAGCCG
ACGGGCATCATGGACCGATCTAAAGAAAACTGCATTTCAGGACCTGTTAAGGCTACAGCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTG
ACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTAAATAGTGGCCAGGCTCAGCGGGTCTTGTGTCCTTCAAATTCTTCCCAGCGCATT
CCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAGCCGGTTCAGAATCAGAAGCAGAAGCAATTGCAGGCAACCAGTGTACCTCATCCT
GTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCTGAAAATAATCCTGAGGAGGAACTGGCATCAAAA
CAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCTTTGGAAGACTTTGAAATTGGTCGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTT
TATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTGGCTCTTAAAGTGTTATTTAAAGCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAG
CTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGGCATCCTAATATTCTTAGACTGTATGGTTATTTCCATGATGCTACCAGAGTCTAC
CTAATTCTGGAATATGCACCACTTGGAACAGTTTATAGAGAACTTCAGAAACTTTCAAAGTTTGATGAGCAGAGAACTGCTACTCATTCC
TTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCCGATAAGAAGACTTTGGTGCAGGAGGACAGTGCAGACCTGAAGTGCCAGTTGCAC
TTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAGCTCACTAAGCTTGCCAAGGAGAATGACAGCATGAAGGAGGAGCTGCTGAAGTAC
CGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCAGCCGAGGAGCTGGCCGATGCCCCCCACTCGCGGGAGACCGAGCTGAAGGTGCAC
CTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGCCGCCGCATCGTGGAGCTGGAGGTGGAGAACCGAGGCCTGCGGGCTGAGATGGAC
GACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCTGAGGCACGCCTGGCCTTCTCCGCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTG
GCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAGGAGGCCGAGCTGCTGCGGCGCTCCTCTGCCGAGCTCGAGGACCAGAACAAGCTG
CTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCACGAGCTGGACGTGGCGCTGTCGGAGGACAGTTGTTCTGTGCTCAGCGAACCTTCA
CAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGCGAGCTCAGCGGCAAGGTCAAGAAGCTGCAGTACGAGAACCGCGTGCTCCTCTCC
AACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGTACGCGGCCCATGCTGGAGACGGACGCCGAGGCCGGGGACTCTGCCCAGTGTGTG
CCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCGGTCCGACTCTGCAGAGCCAGGGAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAG
GCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTGGCTGATGCCAATGGCTTCACGGCTGGCCTCCGGCTGTGTCTGGACAACGAGTGT
GCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGCGAGGGCCCCAGGGACACCAAGCTCATCCATGCCATCCTGGTGCGCCTGAGCGTG
CTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCAGATGCAGTCCTCGGGTGCTCTGTCAAGGAACAGCAGGAGTCCTTCTCATCACTG
CCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATTCTTCTGGCAAAAGACCTTGGCTCAGACTTTCAGCCACCTGACTTCAGGGACCTG
CCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGCACTGGTGACTTGGACTCTAAGCCCGACCCCAGCCGGAGCTTCAGGCCTTACCGA
GCTGAAGACAATGATTCCTATGCCTCTGAGATCAAGGAGCTGCAGCTGGTGCTGGCTGAGGCCCACGACAGCCTCCGGGGCTTGCAAGAG
CAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAGGCCGACAATTTCAACCAGAAAATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCG
CTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGCCTCCAGCGGAGGCTGGAGCAGAAATTCTGGAGCCAGGAGAAGAACATGCTGGTG
CAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTGCTCTTCATGAAGCTCAGGTGGTTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTG
CCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGCATGAAGGAGCTGTACTTGCTGATGGAGGAAGAGGAGATAAACGCTCAGCATTCT
GATAACAAGGCCTGCACGGGGGACAGCTGGACCCAGAACACGCCCAATGAGTACATCAAGACACTGGCCGACATGAAGGTGACGCTGAAG
GAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGTCTGACGGAGCTTCAGCAACAGTTTGCCAAGGCCAAGGCTACCTGGGAGACAGAG
CGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAGCTGAAGACAGGGAAGGGGGCCGGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCC
CTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTAGCTGAGTCCTACAGCGCTGTCATGGAGCTGACTCGGCAGCTGCAGATCAGTGAG
CGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAGCGGCTGCAGGGTGAGAAGCAGCAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAAC
CGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGGGTCCTGAAGCACTCGGAGCTGGAGAAGCAGGACAACAGCTGGAAGGAGACACGC
AGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAAGTTGAGCTTGGAGGAAATGGTTTAAAGAGAACCAAATCTGTTTCTTCCATGTCT
GAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTTGCTGGCGGAGATGCCCGGGGCAAGAAGCTGCCTAACAACCCTGCCTTTGGCTTT
GTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACCAAGGAGAAGCCTGGGCTCTCGTCGAGGGACTGCAACCACCTGGGTGCCCTGGCC
TGCCAGGACCCCCCAGGGAGCCAGAAGCTGCCCTTCCTCCTCATCCTGGCCCCTCCCCAGCCCCCGCCAATACTGTGAACCCCCTTCCCA
CTCAGCCTGGTTTCCTGGTGAGGGTCCTGCAGTCATGGGCCCTGGGGGACCCCCAGGGCAAGGCCCATGGGAGGGAAGGGACCAAGGGCA
TCCTTGGGCCAACTGTCCACCTCTCTTGTCCACTATTCTCTCCTTTCCACTTCTGTCTTCAAAAGGCTCCTTCCTAGGATGGATCGGGTG
CTAGGACAACTGCAGTCCAATCCACCAGCTCTCCCTGCCCCTGTGTCTTATTTCAGACATGAGAATAACTGTACAGTGTAAACTTATAAA

>8490_8490_6_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000395913_SOGA1_chr20_35445872_ENST00000279034_length(amino acids)=1142AA_BP=245
MGRRWEPTGIMDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSNSSQRIPLQAQKLVSSHKPVQNQKQKQLQ
ATSVPHPVSRPLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLE
KAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATHSLKKRGTRSLGKADKKTLVQEDSA
DLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENR
GLRAEMDDMKDHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELAKFRSEHELDVALSEDSC
SVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEV
LPGLREQAALVSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELNAFTRKADAVLGCSVKEQ
QESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHD
SLRGLQEQLSQERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQFKHNFLLLFMKLRWFLKR
WRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKA
KATWETERAELKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQEKLQLVERLQGEKQQVEQ
QVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLP

--------------------------------------------------------------
>8490_8490_7_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000312783_SOGA1_chr20_35445872_ENST00000237536_length(transcript)=13906nt_BP=948nt
ACAAGGCAGCCTCGCTCGAGCGCAGGCCAATCGGCTTTCTAGCTAGAGGGTTTAACTCCTATTTAAAAAGAAGAACCTTTGAATTCTAAC
GGCTGAGCTCTTGGAAGACTTGGGTCCTTGGGTCGCAGGGTCTCACTCCATTGCCCAGGCCAGAGTGCGGGGATATTTGATAAGAAACTT
CAGTGAAGGCCGGGCGCGGTGGCTCATGCCCGTAATCCCAGCATTTTCGGAGGCCGAGGCATCATGGACCGATCTAAAGAAAACTGCATT
TCAGGACCTGTTAAGGCTACAGCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTGACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTA
AATAGTGGCCAGGCTCAGCGGGTCTTGTGTCCTTCAAATTCTTCCCAGCGCATTCCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAG
CCGGTTCAGAATCAGAAGCAGAAGCAATTGCAGGCAACCAGTGTACCTCATCCTGTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAG
CAGCCCCTGCCATCGGCACCTGAAAATAATCCTGAGGAGGAACTGGCATCAAAACAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCT
TTGGAAGACTTTGAAATTGGTCGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTTTATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTG
GCTCTTAAAGTGTTATTTAAAGCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAGCTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGG
CATCCTAATATTCTTAGACTGTATGGTTATTTCCATGATGCTACCAGAGTCTACCTAATTCTGGAATATGCACCACTTGGAACAGTTTAT
AGAGAACTTCAGAAACTTTCAAAGTTTGATGAGCAGAGAACTGCTACTCATTCCTTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCC
GATAAGAAGACTTTGGTGCAGGAGGACAGTGCAGACCTGAAGTGCCAGTTGCACTTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAG
CTCACTAAGCTTGCCAAGGAGAATGACAGCATGAAGGAGGAGCTGCTGAAGTACCGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCA
GCCGAGGAGCTGGCCGATGCCCCCCACTCGCGGGAGACCGAGCTGAAGGTGCACCTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGC
CGCCGCATCGTGGAGCTGGAGGTGGAGAACCGAGGCCTGCGGGCTGAGATGGACGACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCT
GAGGCACGCCTGGCCTTCTCCGCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTGGCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAG
GAGGCCGAGCTGCTGCGGCGCTCCTCTGCCGAGCTCGAGGACCAGAACAAGCTGCTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCAC
GAGCTGGACGTGGCGCTGTCGGAGGACAGTTGTTCTGTGCTCAGCGAACCTTCACAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGC
GAGCTCAGCGGCAAGGTCAAGAAGCTGCAGTACGAGAACCGCGTGCTCCTCTCCAACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGT
ACGCGGCCCATGCTGGAGACGGACGCCGAGGCCGGGGACTCTGCCCAGTGTGTGCCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCG
GTCCGACTCTGCAGAGCCAGGGAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAGGCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTG
GCTGATGCCAATGGCTTCACGGCTGGCCTCCGGCTGTGTCTGGACAACGAGTGTGCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGC
GAGGGCCCCAGGGACACCAAGCTCATCCATGCCATCCTGGTGCGCCTGAGCGTGCTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCA
GATGCAGTCCTCGGGTGCTCTGTCAAGGAACAGCAGGAGTCCTTCTCATCACTGCCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATT
CTTCTGGCAAAAGACCTTGGCTCAGACTTTCAGCCACCTGACTTCAGGGACCTGCCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGC
ACTGGTGACTTGGACTCTAAGCCCGACCCCAGCCGGAGCTTCAGGCCTTACCGAGCTGAAGACAATGATTCCTATGCCTCTGAGATCAAG
GAGCTGCAGCTGGTGCTGGCTGAGGCCCACGACAGCCTCCGGGGCTTGCAAGAGCAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAG
GCCGACAATTTCAACCAGAAAATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCGCTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGC
CTCCAGCGGAGGCTGGAGCAGAAATTCTGGAGCCAGGAGAAGAACATGCTGGTGCAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTG
CTCTTCATGAAGCTCAGGTGGTTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTGCCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGC
ATGAAGGAGCTGTACTTGCTGATGGAGGAAGAGGAGATAAACGCTCAGCATTCTGATAACAAGGCCTGCACGGGGGACAGCTGGACCCAG
AACACGCCCAATGAGTACATCAAGACACTGGCCGACATGAAGGTGACGCTGAAGGAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGT
CTGACGGAGCTTCAGCAACAGTTTGCCAAGGCCAAGGCTACCTGGGAGACAGAGCGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAG
CTGAAGACAGGGAAGGGGGCCGGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCCCTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTA
GCTGAGTCCTACAGCGCTGTCATGGAGCTGACTCGGCAGCTGCAGATCAGTGAGCGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAG
CGGCTGCAGGGTGAGAAGCAGCAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAACCGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGG
GTCCTGAAGCACTCGGAGCTGGAGAAGCAGGACAACAGCTGGAAGGAGACACGCAGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAA
GTTGAGCTTGGAGGAAATGGTTTAAAGAGAACCAAATCTGTTTCTTCCATGTCTGAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTT
GCTGGCGGAGATGCCCGGGGCAAGAAGCTGCCTAACAACCCTGCCTTTGGCTTTGTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACC
AAGGAGAAGCCTGGGCTCTCGTCGAGGGACTGCAACCACCTGGGTGCCCTGGCCTGCCAGGACCCCCCAGGGAGGCAGATGCAGCGCAGC
TACACGGCTCCTGACAAGACGGGCATCCGAGTCTACTATAGTCCCCCGGTGGCCCGGCGCCTCGGAGTCCCTGTGGTTCATGACAAAGAG
GGCAAGATCATTATCGAGCCCGGCTTCCTCTTCACCACAGCCAAGCCCAAAGAGTCGGCCGAGGCTGATGGGCTGGCTGAGAGCTCCTAT
GGTCGGTGGCTCTGCAACTTCTCACGGCAGCGCCTGGACGGAGGCTCAGCGGGCAGCCCCTCGGCGGCCGGGCCTGGCTTCCCAGCGGCC
CTGCATGACTTTGAGATGTCAGGCAACATGAGTGATGACATGAAGGAGATCACCAACTGTGTGCGCCAGGCCATGCGCTCCGGCTCACTG
GAGAGGAAAGTGAAGAGCACATCCAGCCAGACGGTGGGCCTGGCCAGTGTGGGCACACAGACCATCCGCACGGTCAGCGTGGGCCTGCAG
ACCGACCCACCCCGCAGCAGCCTCCATGGCAAGGCCTGGTCACCCCGCAGCTCTTCGCTCGTGTCTGTGCGCAGCAAGCAGATCTCCTCC
TCCCTGGACAAGGTCCATTCGCGCATCGAGCGGCCCTGCTGCTCCCCCAAGTATGGCTCACCAAAGCTCCAGAGGCGGTCTGTGTCCAAG
CTGGACAGCAGCAAGGACCGCAGCCTGTGGAACCTGCACCAGGGCAAGCAGAACGGCTCGGCCTGGGCCCGCTCCACCACCACGCGGGAC
AGCCCTGTATTGAGAAACATCAACGATGGACTCTCCAGCCTCTTCAGTGTGGTGGAGCACTCAGGGAGCACGGAGTCTGTCTGGAAACTA
GGCATGTCTGAGACGCGGGCCAAGCCCGAGCCTCCCAAGTACGGCATTGTGCAGGAATTCTTCCGTAATGTGTGTGGCCGGGCACCGAGC
CCCACCTCATCAGCAGGAGAGGAGGGCACCAAGAAGCCAGAGCCCCTCTCCCCAGCCAGCTACCATCAGCCAGAGGGTGTGGCCAGGATC
CTGAACAAGAAGGCAGCCAAGTTGGGCAGCAGTGAGGAGGTCAGACTCACCATGCTCCCCCAGGTGGGGAAGGATGGTGTCCTCCGGGAC
GGAGATGGAGCCGTGGTCCTTCCCAATGAGGACGCTGTTTGTGACTGTAGTACCCAGTCTCTCACCTCCTGCTTCGCCCGATCGTCCCGC
TCTGCCATCCGCCACTCTCCTTCCAAGTGCAGGCTGCACCCTTCAGAGTCCAGCTGGGGTGGGGAGGAGAGGGCACTCCCCCCCAGCGAG
TGACAGAGCAGCCAAGCTCCCCGCCTCAACCAGCCCAGCCCCTGGATAGCAGAAGGGAACCAGCAGAGACGAGACGAGGTGAGGCGAGGG
GCTGTGTCCTCAGCATTGCCTGGCCCTGGAGGGACAGCAGTGATGCCACTGCCAGAATGCAGCTTTCACATCAAGGTAAAGCCGGGTCTC
CTGCTGGCCCCTGGGTGGTGAGCTTCGACTTCCCAGGGGAAGGCAGTGAGTGGGAGAGAGACCAAACCTGGGCTTCCCAAGCATCCACTG
AGAGATCTGTCAAGAGCCGATCCCTGGGTCCTAAGAGAGAGCCTTGCCTGGTTCTGCCCATGCCACCCTCTTGGAAGAGCCCAAGAAGGA
TACATGTCTGGCCATGCCTTTGGGGAAAAGGAGTCGGAGAGATGTTTCCTGCTGACCATCCACCCCTTCATTTGGGAGGAGACACTGCTG
AGAAGAACAGGCTTTGCTCTAGGGCTCCATGTTTGGTTCCTGGTGGAGCCCTGTTGGGCATCATCACCATCACCTCCTTCTCTCCACCAC
CTCCTCCTCCCAGCCCCACTGCTCTAGATTGCTGGGACACTAGGGAGTATGATAGGGCAGTAGCCAGGGCCATTGCTTAGTGTCCTGGAG
CCCTGGATCTCCCTGCCCATAGCCTGGATGCAGCAAGAGCTGGGAGGCGAAGTGGAAACATGCAGGGCTCAGGGTTGGGGAGTGATTGCA
ATTGCCTTCCTTGCCAAAGTGACTTGGGGCCCCAACGTTCCCAGCAGACCCCTTGAGGACAGAAATAGGTAGAGTCAGTCTCAAGACCTG
GTGCATAGATAAATGCCTAAATACACTGCCTTGATCTCAAGTGATCTCAGAGGCCTCTTTCCCTGGCACCCTGAGAGGCAGCAGGCACTA
CATCTCCACTGTGTTTACATCCTGCAGCTGGTCGGGGGGCAAAGATATTCCCAGTAAGAGATTCTTGGTTGGCCAGGTCAGGCCCAGGAG
AACACCAAGAGGCCAGAGCCCAGGACACAGCAGTAGACTGGGGCCTGGAAACACGTATCTTGCCTAGATTGTTTATTTGAATTTTTCCTA
CTATAAATATTTAAGGTGGTTTACTTTATTTTAATAATTTAATTTACCCCAAAGTCCCTAAGGTAATTTATTGGAGGTTGAAACATGCAT
TCTTGCCACTGGGACAACATGAGGCCTCTAACAGCACGGGCAGGCATGGGGTCCCCTGGGTGGACGAGGCCGCTTGGCAGCCAGGTTTGG
AGACCTGGCCTCCTGGTCAGCTTTGGAGGGCCCCTCAACAGAGCTGGAGCCCTGCACCCCAACACGGCTGGCCATGTGGCCTCAGAACAC
TACTTATTACTCAATGCCTGGTACTTGGCCAGCCCCAGCGGTCAGTCTATAAATACTCACTGACAAGGTGGAGGGCTGGACGGCCATCAC
CACTCCCCAGACGTTCTCCATTGCCTGTCTCATTTCCCTCCCTCCCTCTGACACCTTTCTTCATGAGTCGAACGTGGATTACTAAAGCTC
TATTAAGAGTGTGGAGATCCCTCCAAGTTTCCCAAATGAGAACTCACAGGAAAACAGGACTGAACTTTGAGAATGTTGTTTATCGCAGCT
TTGCACATAAACCTGAGTGTCTCCCAGCCTGCCTCGGTTCTCACCAGCCTGCCAGCCTTTTCACCAGCCTCTCTCCTTAGCCTTATGGCC
TTTCACGGCTCTTCTCCCTGCCCCAGCTCTGCTGCCCGCCCTTCCTCACGTCCCCTGTGAGCTGCCTGAGCCATTGGTTGGATTTCGATG
TGGCTCATTGCAGCATGTGGGGCAGCGCCTCCCATGGCCTCGCCTTGGTGCCGGTGAACCCCTTTTGGTTGCACACATGCTCCCCACACA
CACATAGACATCAGCCTTCCTGTAACTGACTGGGGACCCAGAGTGGAAACACCAGGATGGATCAGCTTGTCTGCAGAATTGCCCATCAGG
AAGACCAAAAGCCAGTAGCTTTGCTGATCTGCCCCCAGGACTCTGGAGGCGCCCCTGCACTCCCACCTCCCACCTGCCAGTTCCCAGACC
CACCCATTCGGGATCACCTGGACCAGTTACCCCAAGTCCTGCATCTCCCTTCCCTGCAGGCTGAACACCAGGGTCATGCCAGTCCCGCCA
GCCGCCTCCTCCATGCCCCAGTGACTGGTGTGGGCAGAGCAGGCAGCCAGTGGAGCTGTGGGCCAGTTCCGCTCTTGGATGCTGCTGCTC
TCACCCATGAGGTCAGGGGGGCCCTCCAAGGTTATCTCCAGGTGAGGGGATTCACATCAGGCCACAAGCCACCAGAGGCCTTCTGCCACC
TCCCAGAGCGACAGCCAGGGAGGCTGCGTACTCAGCCTCGGGGAGAAATCCCCGTGGGACCTGAGCCCCAAGACCTACGGACCACTCAGC
CTTACCATCGTACCGTCCAGGATTGTCCTTGCCATCTTTGTTGTCTCAGCCAGACCTTGGTTTTCAGTAAAGCCCCAGTTTCTACTTCCT
GCATGCCACTGTGCAAGGCCACTCATCACTGTTCCTGCAGAAGCCTCTGGACGTGGGGCTGGATGGGGTTGAAAATGTTACATGTAAATA
TTGGTTTGGTTCGGTTTTTAGCATTTTACTTGGTAACTGGTTGTTTTCTTTTTTGGGGTGGGGGGATTGGTTTGTAAAAATTCTCTACTC
TTTTGGAATGTGATTTCTAAGTTTGTTGGTTTCTTCAAATGCCTTTTAAGTCTTGGTAACATTCCCAAAGCAGAAAACTGCCTGACCCAC
AGTGGGGATTCCCTGGAGAATTGGGGTCCCAAGAAGGAATGCTGCCCTTCTCGAACCCGTTCTCCCCCTTCCTCCTGCCTCTCTGCCTTT
TACTGCTATTCCCTTCTTCTCCTCCTTTATCCTTCTTTCTGTTTTCCCCATCTCCACTCTCTCTTCAACCAAAGTCCCAAGGAACCCTCG
GGGCTCAATCCCCCATAGACCACTTGGCTTGGGTCCATGGGGTTGGCATCAGTTGGTTGGCGGAAATGGGGGACCAGTTGGCATGATGGC
CCTAAACTGGGAAACCTCATGTTTCTTATGTCTCACCTCTTTCCAGAGCCAAATCAGCCCCTTTTGGAATGATGACTTCATTGGAATGCA
AATCAAGTCATTTTGGTGCATCAGTGGCTCTTAGGCCTGCACACACGAGACATCAGAATCCAATCCTCTGACCCTGTGCCAGCCCTTTCC
CCCAGTTTATTTCCCACCAAAGGCTGACCTCTAAGAGGTCTTGCTTTCTATGAACTCAAGATGGGTCCCACCTCTAGGTGTCCCCAGGTG
CACTCTTCTACCGGTTGGCTTCCGATGTGACAAGGCCAAGGGCCCAAAGACTTGACCCTCTTACACCCTTGCTGACATGGTTCCATCATG
TCCACCCGCATGCACTTTTATGGTTTCATCACCCAGCCTCTTCTCCTCTGGCCCACCCAGCGTCCAGGCTCTTTCTCCCTCTCCCCTCCT
ATCTAGAATGTCCCCTGCTTCTAGCCTCACCAGACCCCCCAAGCTCCCACTACTTCTTCCATAATAATAGTAATAACAATGGTTATCATC
ATCCCCTGCACATCCCGCCTAAAGCACTTTACTATATAGAAAACGTTTCCCCTGGCCGGGCATGGTGGCTCACGCCTGAAATCCCAGCTC
TTTGGGAGGCTGAGGCGAGCGGATCAGTTGAGGTCAGGAGTTCAACGCCAGCCTGGCCAACGTGGTGAATCCCTGTCTGTACTAAAAGTA
CAAAAAATTAGCTGAGCATGGTGGTGCGTGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGTGGGAGAATCGCTTGAGCCCAGGAGGCGG
AGGTTGCAGGAGCAGAGATTGCGCCACTGCACTCCAGCCTGGATGACAGAGTGAGACCCAATCTCAAAAAAGAAATCGTTTCCCACCCCA
CATCTCCTTCAGACCTCTCAGGGACAACTCTGGGAGGCAGCCTTGGCAGGACATGGGTTAGTGCGCCCATTTTGCTGTGAGGAAACTGAG
GTACAGGTCTCATCCCAGAGCATGAGAAGTCACTGAGTTTAGATGAGAACTTGGGTCCAACTCTGTCCTGTTTGCTGTGCAAATCCGCTG
CCCTGCTGGGGGCTTTTGGTGGGTCCAGAATACCCAGAATATGCTGCTGGCCAACCCAGGCATAAAACAAGTCCATTCTAGATCACTGAG
CCTTGTGTATTCCAGAGGGTGATCTGAGGTCCCCATTCAGCAGAATTCTCTGAGGGCATGTTCAGAATGTAGATTCCTGGGCCCCACCTT
GAATTTGCATGTTTAACAAACTCTCCTGGGGTTGAGGGGTGGGTGCAGTGGTCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGG
GTGGATCACTTGAGCTCAGGAGTTCGAGCCTGGCCAATATGGTGAAACCCTGTCTCTACTAAAAATGCAAAAATTAGCCAGGTGTGGTGG
CACATGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGTGGGGAGCAGCGGTTGCAGTGAGCCGAGATT
GTGCCATTGCGCTCTAGCCTGGGCAACGGAATGAGACTTGTCTCAAAAAAAATAAAAAATAAAACCAGCCCTCCCCGGGGGATCTTAGGC
ACTATTGGCCACACCATTGGTGTTCGTGGCCCTGATTATTAGGCTATCTTTCTTTTTTTAAGTTTTTTTAGATTTATTTTTTATTTTATT
TATTTATTTATTTATTATTTATTATTATTATTTTTAGAGACAGGGGGTCTCCCTATGTTGCCCAGGCTGGTTTCAAACTCCTGGGCTCAA
GTGATCTGCCCTCCTCAGCCTCCCAAAGTGCTGGGGTTACAGGCAGGCATCAGCCACCGTGCCAGGTCATCTTCCTTTTTCTTTTTTTTT
TGGAGACAGAGTCTTGCTCTGTCATTCAGGCTGGAGTACAGTGGCGCCATTTCAGCTCACCGCAGCCTCCACCTCCCAGGTTCAAGCAGT
TCTCCTGCCTCAACCTCCCGAGTAACTGGGATTACAGGTGTGTGCCACCACACCCGACTAATTTTTTTATTTTTAGTAGAGACGGGGTTT
CACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCACCCACCTCGGCCTCCCAAAGTACTGGGATTATAGGCGTGAGC
CACGGCATCCAGCCTCATCTTTCTTTTAACCAATAAACATGATGCTGTATCTTAAAAAGAGCACTGAGCAGGGACTTAAGGGATCGAGTC
CTCAACCAAACTGATTTAATTACTCAGGATTTTCAAAAAGCATCAGAGGCTATTTACAATCTTAATCATAGGGGTTCAGTAAAATAAAAA
TAAGAAGTAAAAAAGCAAGAGAAATTATTCTGTAAATCTAACTGGTGTAATTCCCATAATCATGCAATTAAGTTTTACTCTTGAGTTTCC
TGACAGCCATTGGTAAAAAGAGAAACACATCAGGATTTATAATTTTTATCATCCAATTATGGGAAGCAAGCATGTTGGCCCCAGGAGACG
AACTCTTCTACTAATTTATAGCATGTATCCTTTCATCAAGGGCCACTGAGCCAGTTGGTGAGTCAACGGGTGAACCTAAGATGCAAGGAT
GTTTTCCAGGTGACTATTTAAAAAAATAAATAATCCACCAACGTGATTGACCTTGGCGAGATCATGTTTCTAGTCTATACCTCAGTTTCC
CCATCTGTAAAGTGAGGATAATGTCCCACCCCATGTAACTGTGGTGAGGACCAACTGCAACACTGTGCCTGCGAGTCTCCTTGGAAAAGT
GTAAGGTTCTACACAAATGGAAAGTGATCTGATCACACTCAGTGTCCCCAGCCCAGCCTTTCAGTGCCCTGGCCCTGGGGTGGGGGACAA
TACTCTCCTCACCCCCTTCACTAGTCTTCATGAATAGCAAGGAGGCCATAACATAATTTGGTCTAAACCCCTTCCTTTTTAAAAGAATGA
TGGCAAAATGTGCATAACATTAAATTTACCATTTTAAAGATTTCTAAGTGTCTCGAAGTACATTTGCAATGTGTAACTGCCACCTCCAGA
ACTTTTTCATCATCCTAAACTGAAACTCTGTACCCACTAAACAGTAACTCCCCACTCCCCCTGTCCCCAGTCCCTGGTAACCTCTATTCT
ACTTTTTTTTTTTCTCTGTGGATTGGCCTATTCTAGGTATCTCTTTTAAGTAGAATCATATAGTATTTGTCCTTTTGTGTCCAGCCCCCT
CATTTTTTGAGATGAGGAATCAGGCCCAGAGAAGGCAGTGGCTCACCCAAGGTCACATCGCAAACCAGAGGCAGAGCCAAGACCAGAACC
CAGGTTTCCTGACTCCTAACAGGTCTTGCATCCCCAGTAGTGAATGAGACACTGCTTTGACTTTCTGTAATCTTGGTTTAGCCCCTTCCT
TTCTCTGGGCTCAGTCTGCTTCCCACCACTCACCCATTCTTTCTTTATCATCTCCCTCCAAAGCCTCTTGTCCTCCTGCCTCCTCTTCTC
CTTGGCTGGTTCCTGCCAGCAAATGGAGTGGCCCTGGTCCATGCCAGGTTTTCCTCTTCTGGGTCCGGAGCTCACTATAGTATTCAGCCC
TCAGTCCTCCCAGGATGTTCTTCCCAGCACTGCTGCCTCGTGCGGATTTTCCCGTAACCTCAGTAACTGGCTTCTTGTCCCCCTGCTTCC
TACCAGGGAAGCCTTCCTGTCCGCGTCTGTGGTTTCCCAACTCTGATATTTGCTCTCAAATGTGGTGGTGTCCTGGTTCTGTGTTTATTT
ATTTTGTGTTTTCTCACACACACAAAAAAAAGCCTCTTCCTCCAATGCATCAGGAGGCACCAGCCCTGCCAGCCCTTCTCACTGGGCTCA
CCCTGCCCCAGCAACCCCCCGGTGCCACCCTTAAGCCACACTGTCTTCTCTGTAAGCAGCCTGCCAGCAGCAGCCCCAGCACTTTGCAAT
GGGCGTGTGTGTGGTGGTGGGTGGGGGGGGCTTGGATCCCTCCTTTTTCCTCCCCTGCCCTGCCCAGGCCCAGATGGCCTTGACTGTAAA
GCAGGTGCTGCCTGACAGGTTCTTCTCTCCCTGTCTCTGGTCATTGATCCATCTCTTTGTCCATTCAGTATCCAACCATCCTCTCCATTC
TCCTCTGGACCTCACCACTCTCAGAGCTGCTTGTCCTGGCAGAATCTACAGTTCACCCCAACTCTATGCCTTACCCCTCCCAACCCAACA
GCATTTGCAGTTTGCAAAATATACAGACCCAAGTCCTGAGGGGACTGAGGACATGATGCTGGGCCCAAGTCTCCTGCTCAGGGCTTCTCT
CCAATGCCAGCCCTGCCACTCCTTCCTCACCCTCCTTGGAGCCTCCTCTGCTGCTTGTCTATCCCAACGGCCCTGCTCCCCTCCCTTCCT
GCCCTTCACCAGCTTTCTGGACACCATGCCCTGAGGAAGGGACCTTTGGTTTTCTCTAAACATCTTTGAAGGGCTGAGGCAGTCAGGGCT
GGCTGCCTTGTCACTCTTTATTTGGAAGCCACTCAAACCATTCCCAAGAAGAGGGACCTCAGCTGGCAATCTGGAAACCTGGCCCAGGTC
TGGGCAGATGTCTTCACTTCTCCTACCTTCCCAGTCTTGTGATCCTGTGATGAGCACCAGGATGGCCCTGTGGTCCCTAGAGCACCCCTC
ATGCTGTAGGGTCCTGCAGCCCCATCCTTTCTCTACTGGGCCCTGGTATCCTGGCTCCTCTCTCAGCTCTGCCACTGATCTCTGTGCCTT
AGTTTACTTCTCTGCACGGGGGACTCACCCCAAGACCATTTCCAGCAGCTTCCCAGGTGATGTGGTGCCCCAAGGCTGGGCTTTGCCAGC
TGTGGCCCAGCTCCTTAGTGCTGCCCAGGAGACACCAGGCTGCTCAGAATGAGGTGACTGCGGGCACCATTCTCAGCCAGTGGTTCTTGT
ATTGCATTCCAGCAGCAGGAATATCACCTGGGAACTTGATAGAAGTGCAGATTAGCAGCCCCACCCAAGACCCACTGAATTAGAGCTTGT
GGAGTGGGGCCCTACAAGCTGGGGTTTTAAGGAGCCCTCCAAATGATTCTGACGCATAAGAATATGCCAACTGCTGATCTGGGCTAGCCA
TTAGTAGAGCCTGGGGAGGGACTGGGACTGGCTAGGCCAAGAACAGGTGGAAAACACCAGCCTTATCTGGACTCCTGAGATTGGGAACCA
CCACCAACAAAAACCAACCCTATAGTCGCTCCTCTTGGAAGAGGAAGAGAAGTTGAAGGGCCTGGAGAAAGCACACATTGTTTGTTTCCC
TGCTCCTGCTCACCTCTCTCACTTGTCTTGGTTTTACAAAAGGCTGTGTGGATGGTGCCAGCCAGGGAGGGGGTGGGAGTCCTGGGGAGG
CAGGAGGCAGAAGACCCTGACTGTTTCTCCCTTGGGAACCTCACCATAGGCCAGATAGCGCCTCTTCAAACTGAAAGAAATCTTAACTCC
ACAAAGAAAGCATCCTAAATCCCCAGTTCCTCCTCCTCCCAACCCCAGGGATACCTTGTAGACAGTGCCAAAAAACAGCTCCAACCCCCA
GCAGCTGGGAAGAGAGCCAGAAGCTGCCCTTCCTCCTCATCCTGGCCCCTCCCCAGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCA
GCCTGGTTTCCTGGTGAGGGTCCTGCAGTCATGGGCCCTGGGGGACCCCCAGGGCAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCT
TGGGCCAACTGTCCACCTCTCTTGTCCACTATTCTCTCCTTTCCACTTCTGTCTTCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAG
GACAACTGCAGTCCAATCCACCAGCTCTCCCTGCCCCTGTGTCTTATTTCAGACATGAGAATAACTGTACAGTGTAAACTTATAAAGCGT

>8490_8490_7_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000312783_SOGA1_chr20_35445872_ENST00000237536_length(amino acids)=1539AA_BP=235
MDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSNSSQRIPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSR
PLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRR
EVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATHSLKKRGTRSLGKADKKTLVQEDSADLKCQLHFAK
EESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMK
DHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELAKFRSEHELDVALSEDSCSVLSEPSQEE
LAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAAL
VSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELNAFTRKADAVLGCSVKEQQESFSSLPPL
GSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLS
QERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQFKHNFLLLFMKLRWFLKRWRQGKVLPSE
GDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAE
LKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQEKLQLVERLQGEKQQVEQQVKELQNRLS
QLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSS
EPGDPEKDTKEKPGLSSRDCNHLGALACQDPPGRQMQRSYTAPDKTGIRVYYSPPVARRLGVPVVHDKEGKIIIEPGFLFTTAKPKESAE
ADGLAESSYGRWLCNFSRQRLDGGSAGSPSAAGPGFPAALHDFEMSGNMSDDMKEITNCVRQAMRSGSLERKVKSTSSQTVGLASVGTQT
IRTVSVGLQTDPPRSSLHGKAWSPRSSSLVSVRSKQISSSLDKVHSRIERPCCSPKYGSPKLQRRSVSKLDSSKDRSLWNLHQGKQNGSA
WARSTTTRDSPVLRNINDGLSSLFSVVEHSGSTESVWKLGMSETRAKPEPPKYGIVQEFFRNVCGRAPSPTSSAGEEGTKKPEPLSPASY
HQPEGVARILNKKAAKLGSSEEVRLTMLPQVGKDGVLRDGDGAVVLPNEDAVCDCSTQSLTSCFARSSRSAIRHSPSKCRLHPSESSWGG

--------------------------------------------------------------
>8490_8490_8_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000312783_SOGA1_chr20_35445872_ENST00000279034_length(transcript)=3967nt_BP=948nt
ACAAGGCAGCCTCGCTCGAGCGCAGGCCAATCGGCTTTCTAGCTAGAGGGTTTAACTCCTATTTAAAAAGAAGAACCTTTGAATTCTAAC
GGCTGAGCTCTTGGAAGACTTGGGTCCTTGGGTCGCAGGGTCTCACTCCATTGCCCAGGCCAGAGTGCGGGGATATTTGATAAGAAACTT
CAGTGAAGGCCGGGCGCGGTGGCTCATGCCCGTAATCCCAGCATTTTCGGAGGCCGAGGCATCATGGACCGATCTAAAGAAAACTGCATT
TCAGGACCTGTTAAGGCTACAGCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTGACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTA
AATAGTGGCCAGGCTCAGCGGGTCTTGTGTCCTTCAAATTCTTCCCAGCGCATTCCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAG
CCGGTTCAGAATCAGAAGCAGAAGCAATTGCAGGCAACCAGTGTACCTCATCCTGTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAG
CAGCCCCTGCCATCGGCACCTGAAAATAATCCTGAGGAGGAACTGGCATCAAAACAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCT
TTGGAAGACTTTGAAATTGGTCGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTTTATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTG
GCTCTTAAAGTGTTATTTAAAGCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAGCTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGG
CATCCTAATATTCTTAGACTGTATGGTTATTTCCATGATGCTACCAGAGTCTACCTAATTCTGGAATATGCACCACTTGGAACAGTTTAT
AGAGAACTTCAGAAACTTTCAAAGTTTGATGAGCAGAGAACTGCTACTCATTCCTTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCC
GATAAGAAGACTTTGGTGCAGGAGGACAGTGCAGACCTGAAGTGCCAGTTGCACTTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAG
CTCACTAAGCTTGCCAAGGAGAATGACAGCATGAAGGAGGAGCTGCTGAAGTACCGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCA
GCCGAGGAGCTGGCCGATGCCCCCCACTCGCGGGAGACCGAGCTGAAGGTGCACCTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGC
CGCCGCATCGTGGAGCTGGAGGTGGAGAACCGAGGCCTGCGGGCTGAGATGGACGACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCT
GAGGCACGCCTGGCCTTCTCCGCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTGGCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAG
GAGGCCGAGCTGCTGCGGCGCTCCTCTGCCGAGCTCGAGGACCAGAACAAGCTGCTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCAC
GAGCTGGACGTGGCGCTGTCGGAGGACAGTTGTTCTGTGCTCAGCGAACCTTCACAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGC
GAGCTCAGCGGCAAGGTCAAGAAGCTGCAGTACGAGAACCGCGTGCTCCTCTCCAACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGT
ACGCGGCCCATGCTGGAGACGGACGCCGAGGCCGGGGACTCTGCCCAGTGTGTGCCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCG
GTCCGACTCTGCAGAGCCAGGGAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAGGCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTG
GCTGATGCCAATGGCTTCACGGCTGGCCTCCGGCTGTGTCTGGACAACGAGTGTGCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGC
GAGGGCCCCAGGGACACCAAGCTCATCCATGCCATCCTGGTGCGCCTGAGCGTGCTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCA
GATGCAGTCCTCGGGTGCTCTGTCAAGGAACAGCAGGAGTCCTTCTCATCACTGCCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATT
CTTCTGGCAAAAGACCTTGGCTCAGACTTTCAGCCACCTGACTTCAGGGACCTGCCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGC
ACTGGTGACTTGGACTCTAAGCCCGACCCCAGCCGGAGCTTCAGGCCTTACCGAGCTGAAGACAATGATTCCTATGCCTCTGAGATCAAG
GAGCTGCAGCTGGTGCTGGCTGAGGCCCACGACAGCCTCCGGGGCTTGCAAGAGCAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAG
GCCGACAATTTCAACCAGAAAATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCGCTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGC
CTCCAGCGGAGGCTGGAGCAGAAATTCTGGAGCCAGGAGAAGAACATGCTGGTGCAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTG
CTCTTCATGAAGCTCAGGTGGTTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTGCCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGC
ATGAAGGAGCTGTACTTGCTGATGGAGGAAGAGGAGATAAACGCTCAGCATTCTGATAACAAGGCCTGCACGGGGGACAGCTGGACCCAG
AACACGCCCAATGAGTACATCAAGACACTGGCCGACATGAAGGTGACGCTGAAGGAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGT
CTGACGGAGCTTCAGCAACAGTTTGCCAAGGCCAAGGCTACCTGGGAGACAGAGCGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAG
CTGAAGACAGGGAAGGGGGCCGGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCCCTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTA
GCTGAGTCCTACAGCGCTGTCATGGAGCTGACTCGGCAGCTGCAGATCAGTGAGCGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAG
CGGCTGCAGGGTGAGAAGCAGCAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAACCGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGG
GTCCTGAAGCACTCGGAGCTGGAGAAGCAGGACAACAGCTGGAAGGAGACACGCAGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAA
GTTGAGCTTGGAGGAAATGGTTTAAAGAGAACCAAATCTGTTTCTTCCATGTCTGAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTT
GCTGGCGGAGATGCCCGGGGCAAGAAGCTGCCTAACAACCCTGCCTTTGGCTTTGTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACC
AAGGAGAAGCCTGGGCTCTCGTCGAGGGACTGCAACCACCTGGGTGCCCTGGCCTGCCAGGACCCCCCAGGGAGCCAGAAGCTGCCCTTC
CTCCTCATCCTGGCCCCTCCCCAGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCAGCCTGGTTTCCTGGTGAGGGTCCTGCAGTCAT
GGGCCCTGGGGGACCCCCAGGGCAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCTTGGGCCAACTGTCCACCTCTCTTGTCCACTAT
TCTCTCCTTTCCACTTCTGTCTTCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAGGACAACTGCAGTCCAATCCACCAGCTCTCCCT
GCCCCTGTGTCTTATTTCAGACATGAGAATAACTGTACAGTGTAAACTTATAAAGCGTTTTTAATGGTTGTAGATTGGAAATAAAGTATG

>8490_8490_8_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000312783_SOGA1_chr20_35445872_ENST00000279034_length(amino acids)=1132AA_BP=235
MDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSNSSQRIPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSR
PLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRR
EVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATHSLKKRGTRSLGKADKKTLVQEDSADLKCQLHFAK
EESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMK
DHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELAKFRSEHELDVALSEDSCSVLSEPSQEE
LAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAAL
VSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELNAFTRKADAVLGCSVKEQQESFSSLPPL
GSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLS
QERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQFKHNFLLLFMKLRWFLKRWRQGKVLPSE
GDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAE
LKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQEKLQLVERLQGEKQQVEQQVKELQNRLS
QLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSS

--------------------------------------------------------------
>8490_8490_9_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000371356_SOGA1_chr20_35445872_ENST00000237536_length(transcript)=13795nt_BP=837nt
CTTGGAAGACTTGGGTCCTTGGGTCGCAGGCTGGAGTGCAATGGTGTGATCTCAGCTCACTGCAACCTCTGCTTCCTGGGTTTAAGTGAT
TCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCATCATGGACCGATCTAAAGAAAACTGCATTTCAGGACCTGTTAAGGCTACA
GCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTGACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTAAATAGTGGCCAGGCTCAGCGG
GTCTTGTGTCCTTCAAATTCTTCCCAGCGCATTCCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAGCCGGTTCAGAATCAGAAGCAG
AAGCAATTGCAGGCAACCAGTGTACCTCATCCTGTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCT
GAAAATAATCCTGAGGAGGAACTGGCATCAAAACAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCTTTGGAAGACTTTGAAATTGGT
CGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTTTATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTGGCTCTTAAAGTGTTATTTAAA
GCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAGCTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGGCATCCTAATATTCTTAGACTG
TATGGTTATTTCCATGATGCTACCAGAGTCTACCTAATTCTGGAATATGCACCACTTGGAACAGTTTATAGAGAACTTCAGAAACTTTCA
AAGTTTGATGAGCAGAGAACTGCTACTCATTCCTTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCCGATAAGAAGACTTTGGTGCAG
GAGGACAGTGCAGACCTGAAGTGCCAGTTGCACTTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAGCTCACTAAGCTTGCCAAGGAG
AATGACAGCATGAAGGAGGAGCTGCTGAAGTACCGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCAGCCGAGGAGCTGGCCGATGCC
CCCCACTCGCGGGAGACCGAGCTGAAGGTGCACCTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGCCGCCGCATCGTGGAGCTGGAG
GTGGAGAACCGAGGCCTGCGGGCTGAGATGGACGACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCTGAGGCACGCCTGGCCTTCTCC
GCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTGGCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAGGAGGCCGAGCTGCTGCGGCGC
TCCTCTGCCGAGCTCGAGGACCAGAACAAGCTGCTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCACGAGCTGGACGTGGCGCTGTCG
GAGGACAGTTGTTCTGTGCTCAGCGAACCTTCACAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGCGAGCTCAGCGGCAAGGTCAAG
AAGCTGCAGTACGAGAACCGCGTGCTCCTCTCCAACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGTACGCGGCCCATGCTGGAGACG
GACGCCGAGGCCGGGGACTCTGCCCAGTGTGTGCCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCGGTCCGACTCTGCAGAGCCAGG
GAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAGGCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTGGCTGATGCCAATGGCTTCACG
GCTGGCCTCCGGCTGTGTCTGGACAACGAGTGTGCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGCGAGGGCCCCAGGGACACCAAG
CTCATCCATGCCATCCTGGTGCGCCTGAGCGTGCTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCAGATGCAGTCCTCGGGTGCTCT
GTCAAGGAACAGCAGGAGTCCTTCTCATCACTGCCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATTCTTCTGGCAAAAGACCTTGGC
TCAGACTTTCAGCCACCTGACTTCAGGGACCTGCCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGCACTGGTGACTTGGACTCTAAG
CCCGACCCCAGCCGGAGCTTCAGGCCTTACCGAGCTGAAGACAATGATTCCTATGCCTCTGAGATCAAGGAGCTGCAGCTGGTGCTGGCT
GAGGCCCACGACAGCCTCCGGGGCTTGCAAGAGCAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAGGCCGACAATTTCAACCAGAAA
ATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCGCTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGCCTCCAGCGGAGGCTGGAGCAG
AAATTCTGGAGCCAGGAGAAGAACATGCTGGTGCAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTGCTCTTCATGAAGCTCAGGTGG
TTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTGCCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGCATGAAGGAGCTGTACTTGCTG
ATGGAGGAAGAGGAGATAAACGCTCAGCATTCTGATAACAAGGCCTGCACGGGGGACAGCTGGACCCAGAACACGCCCAATGAGTACATC
AAGACACTGGCCGACATGAAGGTGACGCTGAAGGAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGTCTGACGGAGCTTCAGCAACAG
TTTGCCAAGGCCAAGGCTACCTGGGAGACAGAGCGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAGCTGAAGACAGGGAAGGGGGCC
GGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCCCTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTAGCTGAGTCCTACAGCGCTGTC
ATGGAGCTGACTCGGCAGCTGCAGATCAGTGAGCGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAGCGGCTGCAGGGTGAGAAGCAG
CAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAACCGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGGGTCCTGAAGCACTCGGAGCTG
GAGAAGCAGGACAACAGCTGGAAGGAGACACGCAGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAAGTTGAGCTTGGAGGAAATGGT
TTAAAGAGAACCAAATCTGTTTCTTCCATGTCTGAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTTGCTGGCGGAGATGCCCGGGGC
AAGAAGCTGCCTAACAACCCTGCCTTTGGCTTTGTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACCAAGGAGAAGCCTGGGCTCTCG
TCGAGGGACTGCAACCACCTGGGTGCCCTGGCCTGCCAGGACCCCCCAGGGAGGCAGATGCAGCGCAGCTACACGGCTCCTGACAAGACG
GGCATCCGAGTCTACTATAGTCCCCCGGTGGCCCGGCGCCTCGGAGTCCCTGTGGTTCATGACAAAGAGGGCAAGATCATTATCGAGCCC
GGCTTCCTCTTCACCACAGCCAAGCCCAAAGAGTCGGCCGAGGCTGATGGGCTGGCTGAGAGCTCCTATGGTCGGTGGCTCTGCAACTTC
TCACGGCAGCGCCTGGACGGAGGCTCAGCGGGCAGCCCCTCGGCGGCCGGGCCTGGCTTCCCAGCGGCCCTGCATGACTTTGAGATGTCA
GGCAACATGAGTGATGACATGAAGGAGATCACCAACTGTGTGCGCCAGGCCATGCGCTCCGGCTCACTGGAGAGGAAAGTGAAGAGCACA
TCCAGCCAGACGGTGGGCCTGGCCAGTGTGGGCACACAGACCATCCGCACGGTCAGCGTGGGCCTGCAGACCGACCCACCCCGCAGCAGC
CTCCATGGCAAGGCCTGGTCACCCCGCAGCTCTTCGCTCGTGTCTGTGCGCAGCAAGCAGATCTCCTCCTCCCTGGACAAGGTCCATTCG
CGCATCGAGCGGCCCTGCTGCTCCCCCAAGTATGGCTCACCAAAGCTCCAGAGGCGGTCTGTGTCCAAGCTGGACAGCAGCAAGGACCGC
AGCCTGTGGAACCTGCACCAGGGCAAGCAGAACGGCTCGGCCTGGGCCCGCTCCACCACCACGCGGGACAGCCCTGTATTGAGAAACATC
AACGATGGACTCTCCAGCCTCTTCAGTGTGGTGGAGCACTCAGGGAGCACGGAGTCTGTCTGGAAACTAGGCATGTCTGAGACGCGGGCC
AAGCCCGAGCCTCCCAAGTACGGCATTGTGCAGGAATTCTTCCGTAATGTGTGTGGCCGGGCACCGAGCCCCACCTCATCAGCAGGAGAG
GAGGGCACCAAGAAGCCAGAGCCCCTCTCCCCAGCCAGCTACCATCAGCCAGAGGGTGTGGCCAGGATCCTGAACAAGAAGGCAGCCAAG
TTGGGCAGCAGTGAGGAGGTCAGACTCACCATGCTCCCCCAGGTGGGGAAGGATGGTGTCCTCCGGGACGGAGATGGAGCCGTGGTCCTT
CCCAATGAGGACGCTGTTTGTGACTGTAGTACCCAGTCTCTCACCTCCTGCTTCGCCCGATCGTCCCGCTCTGCCATCCGCCACTCTCCT
TCCAAGTGCAGGCTGCACCCTTCAGAGTCCAGCTGGGGTGGGGAGGAGAGGGCACTCCCCCCCAGCGAGTGACAGAGCAGCCAAGCTCCC
CGCCTCAACCAGCCCAGCCCCTGGATAGCAGAAGGGAACCAGCAGAGACGAGACGAGGTGAGGCGAGGGGCTGTGTCCTCAGCATTGCCT
GGCCCTGGAGGGACAGCAGTGATGCCACTGCCAGAATGCAGCTTTCACATCAAGGTAAAGCCGGGTCTCCTGCTGGCCCCTGGGTGGTGA
GCTTCGACTTCCCAGGGGAAGGCAGTGAGTGGGAGAGAGACCAAACCTGGGCTTCCCAAGCATCCACTGAGAGATCTGTCAAGAGCCGAT
CCCTGGGTCCTAAGAGAGAGCCTTGCCTGGTTCTGCCCATGCCACCCTCTTGGAAGAGCCCAAGAAGGATACATGTCTGGCCATGCCTTT
GGGGAAAAGGAGTCGGAGAGATGTTTCCTGCTGACCATCCACCCCTTCATTTGGGAGGAGACACTGCTGAGAAGAACAGGCTTTGCTCTA
GGGCTCCATGTTTGGTTCCTGGTGGAGCCCTGTTGGGCATCATCACCATCACCTCCTTCTCTCCACCACCTCCTCCTCCCAGCCCCACTG
CTCTAGATTGCTGGGACACTAGGGAGTATGATAGGGCAGTAGCCAGGGCCATTGCTTAGTGTCCTGGAGCCCTGGATCTCCCTGCCCATA
GCCTGGATGCAGCAAGAGCTGGGAGGCGAAGTGGAAACATGCAGGGCTCAGGGTTGGGGAGTGATTGCAATTGCCTTCCTTGCCAAAGTG
ACTTGGGGCCCCAACGTTCCCAGCAGACCCCTTGAGGACAGAAATAGGTAGAGTCAGTCTCAAGACCTGGTGCATAGATAAATGCCTAAA
TACACTGCCTTGATCTCAAGTGATCTCAGAGGCCTCTTTCCCTGGCACCCTGAGAGGCAGCAGGCACTACATCTCCACTGTGTTTACATC
CTGCAGCTGGTCGGGGGGCAAAGATATTCCCAGTAAGAGATTCTTGGTTGGCCAGGTCAGGCCCAGGAGAACACCAAGAGGCCAGAGCCC
AGGACACAGCAGTAGACTGGGGCCTGGAAACACGTATCTTGCCTAGATTGTTTATTTGAATTTTTCCTACTATAAATATTTAAGGTGGTT
TACTTTATTTTAATAATTTAATTTACCCCAAAGTCCCTAAGGTAATTTATTGGAGGTTGAAACATGCATTCTTGCCACTGGGACAACATG
AGGCCTCTAACAGCACGGGCAGGCATGGGGTCCCCTGGGTGGACGAGGCCGCTTGGCAGCCAGGTTTGGAGACCTGGCCTCCTGGTCAGC
TTTGGAGGGCCCCTCAACAGAGCTGGAGCCCTGCACCCCAACACGGCTGGCCATGTGGCCTCAGAACACTACTTATTACTCAATGCCTGG
TACTTGGCCAGCCCCAGCGGTCAGTCTATAAATACTCACTGACAAGGTGGAGGGCTGGACGGCCATCACCACTCCCCAGACGTTCTCCAT
TGCCTGTCTCATTTCCCTCCCTCCCTCTGACACCTTTCTTCATGAGTCGAACGTGGATTACTAAAGCTCTATTAAGAGTGTGGAGATCCC
TCCAAGTTTCCCAAATGAGAACTCACAGGAAAACAGGACTGAACTTTGAGAATGTTGTTTATCGCAGCTTTGCACATAAACCTGAGTGTC
TCCCAGCCTGCCTCGGTTCTCACCAGCCTGCCAGCCTTTTCACCAGCCTCTCTCCTTAGCCTTATGGCCTTTCACGGCTCTTCTCCCTGC
CCCAGCTCTGCTGCCCGCCCTTCCTCACGTCCCCTGTGAGCTGCCTGAGCCATTGGTTGGATTTCGATGTGGCTCATTGCAGCATGTGGG
GCAGCGCCTCCCATGGCCTCGCCTTGGTGCCGGTGAACCCCTTTTGGTTGCACACATGCTCCCCACACACACATAGACATCAGCCTTCCT
GTAACTGACTGGGGACCCAGAGTGGAAACACCAGGATGGATCAGCTTGTCTGCAGAATTGCCCATCAGGAAGACCAAAAGCCAGTAGCTT
TGCTGATCTGCCCCCAGGACTCTGGAGGCGCCCCTGCACTCCCACCTCCCACCTGCCAGTTCCCAGACCCACCCATTCGGGATCACCTGG
ACCAGTTACCCCAAGTCCTGCATCTCCCTTCCCTGCAGGCTGAACACCAGGGTCATGCCAGTCCCGCCAGCCGCCTCCTCCATGCCCCAG
TGACTGGTGTGGGCAGAGCAGGCAGCCAGTGGAGCTGTGGGCCAGTTCCGCTCTTGGATGCTGCTGCTCTCACCCATGAGGTCAGGGGGG
CCCTCCAAGGTTATCTCCAGGTGAGGGGATTCACATCAGGCCACAAGCCACCAGAGGCCTTCTGCCACCTCCCAGAGCGACAGCCAGGGA
GGCTGCGTACTCAGCCTCGGGGAGAAATCCCCGTGGGACCTGAGCCCCAAGACCTACGGACCACTCAGCCTTACCATCGTACCGTCCAGG
ATTGTCCTTGCCATCTTTGTTGTCTCAGCCAGACCTTGGTTTTCAGTAAAGCCCCAGTTTCTACTTCCTGCATGCCACTGTGCAAGGCCA
CTCATCACTGTTCCTGCAGAAGCCTCTGGACGTGGGGCTGGATGGGGTTGAAAATGTTACATGTAAATATTGGTTTGGTTCGGTTTTTAG
CATTTTACTTGGTAACTGGTTGTTTTCTTTTTTGGGGTGGGGGGATTGGTTTGTAAAAATTCTCTACTCTTTTGGAATGTGATTTCTAAG
TTTGTTGGTTTCTTCAAATGCCTTTTAAGTCTTGGTAACATTCCCAAAGCAGAAAACTGCCTGACCCACAGTGGGGATTCCCTGGAGAAT
TGGGGTCCCAAGAAGGAATGCTGCCCTTCTCGAACCCGTTCTCCCCCTTCCTCCTGCCTCTCTGCCTTTTACTGCTATTCCCTTCTTCTC
CTCCTTTATCCTTCTTTCTGTTTTCCCCATCTCCACTCTCTCTTCAACCAAAGTCCCAAGGAACCCTCGGGGCTCAATCCCCCATAGACC
ACTTGGCTTGGGTCCATGGGGTTGGCATCAGTTGGTTGGCGGAAATGGGGGACCAGTTGGCATGATGGCCCTAAACTGGGAAACCTCATG
TTTCTTATGTCTCACCTCTTTCCAGAGCCAAATCAGCCCCTTTTGGAATGATGACTTCATTGGAATGCAAATCAAGTCATTTTGGTGCAT
CAGTGGCTCTTAGGCCTGCACACACGAGACATCAGAATCCAATCCTCTGACCCTGTGCCAGCCCTTTCCCCCAGTTTATTTCCCACCAAA
GGCTGACCTCTAAGAGGTCTTGCTTTCTATGAACTCAAGATGGGTCCCACCTCTAGGTGTCCCCAGGTGCACTCTTCTACCGGTTGGCTT
CCGATGTGACAAGGCCAAGGGCCCAAAGACTTGACCCTCTTACACCCTTGCTGACATGGTTCCATCATGTCCACCCGCATGCACTTTTAT
GGTTTCATCACCCAGCCTCTTCTCCTCTGGCCCACCCAGCGTCCAGGCTCTTTCTCCCTCTCCCCTCCTATCTAGAATGTCCCCTGCTTC
TAGCCTCACCAGACCCCCCAAGCTCCCACTACTTCTTCCATAATAATAGTAATAACAATGGTTATCATCATCCCCTGCACATCCCGCCTA
AAGCACTTTACTATATAGAAAACGTTTCCCCTGGCCGGGCATGGTGGCTCACGCCTGAAATCCCAGCTCTTTGGGAGGCTGAGGCGAGCG
GATCAGTTGAGGTCAGGAGTTCAACGCCAGCCTGGCCAACGTGGTGAATCCCTGTCTGTACTAAAAGTACAAAAAATTAGCTGAGCATGG
TGGTGCGTGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGTGGGAGAATCGCTTGAGCCCAGGAGGCGGAGGTTGCAGGAGCAGAGATTG
CGCCACTGCACTCCAGCCTGGATGACAGAGTGAGACCCAATCTCAAAAAAGAAATCGTTTCCCACCCCACATCTCCTTCAGACCTCTCAG
GGACAACTCTGGGAGGCAGCCTTGGCAGGACATGGGTTAGTGCGCCCATTTTGCTGTGAGGAAACTGAGGTACAGGTCTCATCCCAGAGC
ATGAGAAGTCACTGAGTTTAGATGAGAACTTGGGTCCAACTCTGTCCTGTTTGCTGTGCAAATCCGCTGCCCTGCTGGGGGCTTTTGGTG
GGTCCAGAATACCCAGAATATGCTGCTGGCCAACCCAGGCATAAAACAAGTCCATTCTAGATCACTGAGCCTTGTGTATTCCAGAGGGTG
ATCTGAGGTCCCCATTCAGCAGAATTCTCTGAGGGCATGTTCAGAATGTAGATTCCTGGGCCCCACCTTGAATTTGCATGTTTAACAAAC
TCTCCTGGGGTTGAGGGGTGGGTGCAGTGGTCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGTGGATCACTTGAGCTCAGGA
GTTCGAGCCTGGCCAATATGGTGAAACCCTGTCTCTACTAAAAATGCAAAAATTAGCCAGGTGTGGTGGCACATGCCTGTAATCCCAGCT
ACTTGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGTGGGGAGCAGCGGTTGCAGTGAGCCGAGATTGTGCCATTGCGCTCTAGCCTG
GGCAACGGAATGAGACTTGTCTCAAAAAAAATAAAAAATAAAACCAGCCCTCCCCGGGGGATCTTAGGCACTATTGGCCACACCATTGGT
GTTCGTGGCCCTGATTATTAGGCTATCTTTCTTTTTTTAAGTTTTTTTAGATTTATTTTTTATTTTATTTATTTATTTATTTATTATTTA
TTATTATTATTTTTAGAGACAGGGGGTCTCCCTATGTTGCCCAGGCTGGTTTCAAACTCCTGGGCTCAAGTGATCTGCCCTCCTCAGCCT
CCCAAAGTGCTGGGGTTACAGGCAGGCATCAGCCACCGTGCCAGGTCATCTTCCTTTTTCTTTTTTTTTTGGAGACAGAGTCTTGCTCTG
TCATTCAGGCTGGAGTACAGTGGCGCCATTTCAGCTCACCGCAGCCTCCACCTCCCAGGTTCAAGCAGTTCTCCTGCCTCAACCTCCCGA
GTAACTGGGATTACAGGTGTGTGCCACCACACCCGACTAATTTTTTTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGGT
CTTGAACTCCTGACCTCAGGTGATCCACCCACCTCGGCCTCCCAAAGTACTGGGATTATAGGCGTGAGCCACGGCATCCAGCCTCATCTT
TCTTTTAACCAATAAACATGATGCTGTATCTTAAAAAGAGCACTGAGCAGGGACTTAAGGGATCGAGTCCTCAACCAAACTGATTTAATT
ACTCAGGATTTTCAAAAAGCATCAGAGGCTATTTACAATCTTAATCATAGGGGTTCAGTAAAATAAAAATAAGAAGTAAAAAAGCAAGAG
AAATTATTCTGTAAATCTAACTGGTGTAATTCCCATAATCATGCAATTAAGTTTTACTCTTGAGTTTCCTGACAGCCATTGGTAAAAAGA
GAAACACATCAGGATTTATAATTTTTATCATCCAATTATGGGAAGCAAGCATGTTGGCCCCAGGAGACGAACTCTTCTACTAATTTATAG
CATGTATCCTTTCATCAAGGGCCACTGAGCCAGTTGGTGAGTCAACGGGTGAACCTAAGATGCAAGGATGTTTTCCAGGTGACTATTTAA
AAAAATAAATAATCCACCAACGTGATTGACCTTGGCGAGATCATGTTTCTAGTCTATACCTCAGTTTCCCCATCTGTAAAGTGAGGATAA
TGTCCCACCCCATGTAACTGTGGTGAGGACCAACTGCAACACTGTGCCTGCGAGTCTCCTTGGAAAAGTGTAAGGTTCTACACAAATGGA
AAGTGATCTGATCACACTCAGTGTCCCCAGCCCAGCCTTTCAGTGCCCTGGCCCTGGGGTGGGGGACAATACTCTCCTCACCCCCTTCAC
TAGTCTTCATGAATAGCAAGGAGGCCATAACATAATTTGGTCTAAACCCCTTCCTTTTTAAAAGAATGATGGCAAAATGTGCATAACATT
AAATTTACCATTTTAAAGATTTCTAAGTGTCTCGAAGTACATTTGCAATGTGTAACTGCCACCTCCAGAACTTTTTCATCATCCTAAACT
GAAACTCTGTACCCACTAAACAGTAACTCCCCACTCCCCCTGTCCCCAGTCCCTGGTAACCTCTATTCTACTTTTTTTTTTTCTCTGTGG
ATTGGCCTATTCTAGGTATCTCTTTTAAGTAGAATCATATAGTATTTGTCCTTTTGTGTCCAGCCCCCTCATTTTTTGAGATGAGGAATC
AGGCCCAGAGAAGGCAGTGGCTCACCCAAGGTCACATCGCAAACCAGAGGCAGAGCCAAGACCAGAACCCAGGTTTCCTGACTCCTAACA
GGTCTTGCATCCCCAGTAGTGAATGAGACACTGCTTTGACTTTCTGTAATCTTGGTTTAGCCCCTTCCTTTCTCTGGGCTCAGTCTGCTT
CCCACCACTCACCCATTCTTTCTTTATCATCTCCCTCCAAAGCCTCTTGTCCTCCTGCCTCCTCTTCTCCTTGGCTGGTTCCTGCCAGCA
AATGGAGTGGCCCTGGTCCATGCCAGGTTTTCCTCTTCTGGGTCCGGAGCTCACTATAGTATTCAGCCCTCAGTCCTCCCAGGATGTTCT
TCCCAGCACTGCTGCCTCGTGCGGATTTTCCCGTAACCTCAGTAACTGGCTTCTTGTCCCCCTGCTTCCTACCAGGGAAGCCTTCCTGTC
CGCGTCTGTGGTTTCCCAACTCTGATATTTGCTCTCAAATGTGGTGGTGTCCTGGTTCTGTGTTTATTTATTTTGTGTTTTCTCACACAC
ACAAAAAAAAGCCTCTTCCTCCAATGCATCAGGAGGCACCAGCCCTGCCAGCCCTTCTCACTGGGCTCACCCTGCCCCAGCAACCCCCCG
GTGCCACCCTTAAGCCACACTGTCTTCTCTGTAAGCAGCCTGCCAGCAGCAGCCCCAGCACTTTGCAATGGGCGTGTGTGTGGTGGTGGG
TGGGGGGGGCTTGGATCCCTCCTTTTTCCTCCCCTGCCCTGCCCAGGCCCAGATGGCCTTGACTGTAAAGCAGGTGCTGCCTGACAGGTT
CTTCTCTCCCTGTCTCTGGTCATTGATCCATCTCTTTGTCCATTCAGTATCCAACCATCCTCTCCATTCTCCTCTGGACCTCACCACTCT
CAGAGCTGCTTGTCCTGGCAGAATCTACAGTTCACCCCAACTCTATGCCTTACCCCTCCCAACCCAACAGCATTTGCAGTTTGCAAAATA
TACAGACCCAAGTCCTGAGGGGACTGAGGACATGATGCTGGGCCCAAGTCTCCTGCTCAGGGCTTCTCTCCAATGCCAGCCCTGCCACTC
CTTCCTCACCCTCCTTGGAGCCTCCTCTGCTGCTTGTCTATCCCAACGGCCCTGCTCCCCTCCCTTCCTGCCCTTCACCAGCTTTCTGGA
CACCATGCCCTGAGGAAGGGACCTTTGGTTTTCTCTAAACATCTTTGAAGGGCTGAGGCAGTCAGGGCTGGCTGCCTTGTCACTCTTTAT
TTGGAAGCCACTCAAACCATTCCCAAGAAGAGGGACCTCAGCTGGCAATCTGGAAACCTGGCCCAGGTCTGGGCAGATGTCTTCACTTCT
CCTACCTTCCCAGTCTTGTGATCCTGTGATGAGCACCAGGATGGCCCTGTGGTCCCTAGAGCACCCCTCATGCTGTAGGGTCCTGCAGCC
CCATCCTTTCTCTACTGGGCCCTGGTATCCTGGCTCCTCTCTCAGCTCTGCCACTGATCTCTGTGCCTTAGTTTACTTCTCTGCACGGGG
GACTCACCCCAAGACCATTTCCAGCAGCTTCCCAGGTGATGTGGTGCCCCAAGGCTGGGCTTTGCCAGCTGTGGCCCAGCTCCTTAGTGC
TGCCCAGGAGACACCAGGCTGCTCAGAATGAGGTGACTGCGGGCACCATTCTCAGCCAGTGGTTCTTGTATTGCATTCCAGCAGCAGGAA
TATCACCTGGGAACTTGATAGAAGTGCAGATTAGCAGCCCCACCCAAGACCCACTGAATTAGAGCTTGTGGAGTGGGGCCCTACAAGCTG
GGGTTTTAAGGAGCCCTCCAAATGATTCTGACGCATAAGAATATGCCAACTGCTGATCTGGGCTAGCCATTAGTAGAGCCTGGGGAGGGA
CTGGGACTGGCTAGGCCAAGAACAGGTGGAAAACACCAGCCTTATCTGGACTCCTGAGATTGGGAACCACCACCAACAAAAACCAACCCT
ATAGTCGCTCCTCTTGGAAGAGGAAGAGAAGTTGAAGGGCCTGGAGAAAGCACACATTGTTTGTTTCCCTGCTCCTGCTCACCTCTCTCA
CTTGTCTTGGTTTTACAAAAGGCTGTGTGGATGGTGCCAGCCAGGGAGGGGGTGGGAGTCCTGGGGAGGCAGGAGGCAGAAGACCCTGAC
TGTTTCTCCCTTGGGAACCTCACCATAGGCCAGATAGCGCCTCTTCAAACTGAAAGAAATCTTAACTCCACAAAGAAAGCATCCTAAATC
CCCAGTTCCTCCTCCTCCCAACCCCAGGGATACCTTGTAGACAGTGCCAAAAAACAGCTCCAACCCCCAGCAGCTGGGAAGAGAGCCAGA
AGCTGCCCTTCCTCCTCATCCTGGCCCCTCCCCAGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCAGCCTGGTTTCCTGGTGAGGGT
CCTGCAGTCATGGGCCCTGGGGGACCCCCAGGGCAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCTTGGGCCAACTGTCCACCTCTC
TTGTCCACTATTCTCTCCTTTCCACTTCTGTCTTCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAGGACAACTGCAGTCCAATCCAC
CAGCTCTCCCTGCCCCTGTGTCTTATTTCAGACATGAGAATAACTGTACAGTGTAAACTTATAAAGCGTTTTTAATGGTTGTAGATTGGA

>8490_8490_9_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000371356_SOGA1_chr20_35445872_ENST00000237536_length(amino acids)=1577AA_BP=3
MGRRLECNGVISAHCNLCFLGLSDSPASASRVAGITGIMDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSN
SSQRIPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSRPLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKG
KFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQR
TATHSLKKRGTRSLGKADKKTLVQEDSADLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRET
ELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMKDHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELE
DQNKLLLNELAKFRSEHELDVALSEDSCSVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGD
SAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAALVSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAIL
VRLSVLQQELNAFTRKADAVLGCSVKEQQESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRS
FRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLSQERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQE
KNMLVQESQQFKHNFLLLFMKLRWFLKRWRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADM
KVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAELKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQ
LQISERNWSQEKLQLVERLQGEKQQVEQQVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKS
VSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSSEPGDPEKDTKEKPGLSSRDCNHLGALACQDPPGRQMQRSYTAPDKTGIRVYY
SPPVARRLGVPVVHDKEGKIIIEPGFLFTTAKPKESAEADGLAESSYGRWLCNFSRQRLDGGSAGSPSAAGPGFPAALHDFEMSGNMSDD
MKEITNCVRQAMRSGSLERKVKSTSSQTVGLASVGTQTIRTVSVGLQTDPPRSSLHGKAWSPRSSSLVSVRSKQISSSLDKVHSRIERPC
CSPKYGSPKLQRRSVSKLDSSKDRSLWNLHQGKQNGSAWARSTTTRDSPVLRNINDGLSSLFSVVEHSGSTESVWKLGMSETRAKPEPPK
YGIVQEFFRNVCGRAPSPTSSAGEEGTKKPEPLSPASYHQPEGVARILNKKAAKLGSSEEVRLTMLPQVGKDGVLRDGDGAVVLPNEDAV

--------------------------------------------------------------
>8490_8490_10_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000371356_SOGA1_chr20_35445872_ENST00000279034_length(transcript)=3856nt_BP=837nt
CTTGGAAGACTTGGGTCCTTGGGTCGCAGGCTGGAGTGCAATGGTGTGATCTCAGCTCACTGCAACCTCTGCTTCCTGGGTTTAAGTGAT
TCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCATCATGGACCGATCTAAAGAAAACTGCATTTCAGGACCTGTTAAGGCTACA
GCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTGACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTAAATAGTGGCCAGGCTCAGCGG
GTCTTGTGTCCTTCAAATTCTTCCCAGCGCATTCCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAGCCGGTTCAGAATCAGAAGCAG
AAGCAATTGCAGGCAACCAGTGTACCTCATCCTGTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCT
GAAAATAATCCTGAGGAGGAACTGGCATCAAAACAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCTTTGGAAGACTTTGAAATTGGT
CGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTTTATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTGGCTCTTAAAGTGTTATTTAAA
GCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAGCTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGGCATCCTAATATTCTTAGACTG
TATGGTTATTTCCATGATGCTACCAGAGTCTACCTAATTCTGGAATATGCACCACTTGGAACAGTTTATAGAGAACTTCAGAAACTTTCA
AAGTTTGATGAGCAGAGAACTGCTACTCATTCCTTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCCGATAAGAAGACTTTGGTGCAG
GAGGACAGTGCAGACCTGAAGTGCCAGTTGCACTTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAGCTCACTAAGCTTGCCAAGGAG
AATGACAGCATGAAGGAGGAGCTGCTGAAGTACCGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCAGCCGAGGAGCTGGCCGATGCC
CCCCACTCGCGGGAGACCGAGCTGAAGGTGCACCTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGCCGCCGCATCGTGGAGCTGGAG
GTGGAGAACCGAGGCCTGCGGGCTGAGATGGACGACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCTGAGGCACGCCTGGCCTTCTCC
GCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTGGCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAGGAGGCCGAGCTGCTGCGGCGC
TCCTCTGCCGAGCTCGAGGACCAGAACAAGCTGCTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCACGAGCTGGACGTGGCGCTGTCG
GAGGACAGTTGTTCTGTGCTCAGCGAACCTTCACAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGCGAGCTCAGCGGCAAGGTCAAG
AAGCTGCAGTACGAGAACCGCGTGCTCCTCTCCAACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGTACGCGGCCCATGCTGGAGACG
GACGCCGAGGCCGGGGACTCTGCCCAGTGTGTGCCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCGGTCCGACTCTGCAGAGCCAGG
GAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAGGCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTGGCTGATGCCAATGGCTTCACG
GCTGGCCTCCGGCTGTGTCTGGACAACGAGTGTGCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGCGAGGGCCCCAGGGACACCAAG
CTCATCCATGCCATCCTGGTGCGCCTGAGCGTGCTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCAGATGCAGTCCTCGGGTGCTCT
GTCAAGGAACAGCAGGAGTCCTTCTCATCACTGCCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATTCTTCTGGCAAAAGACCTTGGC
TCAGACTTTCAGCCACCTGACTTCAGGGACCTGCCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGCACTGGTGACTTGGACTCTAAG
CCCGACCCCAGCCGGAGCTTCAGGCCTTACCGAGCTGAAGACAATGATTCCTATGCCTCTGAGATCAAGGAGCTGCAGCTGGTGCTGGCT
GAGGCCCACGACAGCCTCCGGGGCTTGCAAGAGCAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAGGCCGACAATTTCAACCAGAAA
ATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCGCTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGCCTCCAGCGGAGGCTGGAGCAG
AAATTCTGGAGCCAGGAGAAGAACATGCTGGTGCAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTGCTCTTCATGAAGCTCAGGTGG
TTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTGCCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGCATGAAGGAGCTGTACTTGCTG
ATGGAGGAAGAGGAGATAAACGCTCAGCATTCTGATAACAAGGCCTGCACGGGGGACAGCTGGACCCAGAACACGCCCAATGAGTACATC
AAGACACTGGCCGACATGAAGGTGACGCTGAAGGAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGTCTGACGGAGCTTCAGCAACAG
TTTGCCAAGGCCAAGGCTACCTGGGAGACAGAGCGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAGCTGAAGACAGGGAAGGGGGCC
GGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCCCTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTAGCTGAGTCCTACAGCGCTGTC
ATGGAGCTGACTCGGCAGCTGCAGATCAGTGAGCGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAGCGGCTGCAGGGTGAGAAGCAG
CAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAACCGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGGGTCCTGAAGCACTCGGAGCTG
GAGAAGCAGGACAACAGCTGGAAGGAGACACGCAGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAAGTTGAGCTTGGAGGAAATGGT
TTAAAGAGAACCAAATCTGTTTCTTCCATGTCTGAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTTGCTGGCGGAGATGCCCGGGGC
AAGAAGCTGCCTAACAACCCTGCCTTTGGCTTTGTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACCAAGGAGAAGCCTGGGCTCTCG
TCGAGGGACTGCAACCACCTGGGTGCCCTGGCCTGCCAGGACCCCCCAGGGAGCCAGAAGCTGCCCTTCCTCCTCATCCTGGCCCCTCCC
CAGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCAGCCTGGTTTCCTGGTGAGGGTCCTGCAGTCATGGGCCCTGGGGGACCCCCAGG
GCAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCTTGGGCCAACTGTCCACCTCTCTTGTCCACTATTCTCTCCTTTCCACTTCTGTC
TTCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAGGACAACTGCAGTCCAATCCACCAGCTCTCCCTGCCCCTGTGTCTTATTTCAGA

>8490_8490_10_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000371356_SOGA1_chr20_35445872_ENST00000279034_length(amino acids)=1170AA_BP=3
MGRRLECNGVISAHCNLCFLGLSDSPASASRVAGITGIMDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSN
SSQRIPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSRPLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKG
KFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQR
TATHSLKKRGTRSLGKADKKTLVQEDSADLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRET
ELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMKDHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELE
DQNKLLLNELAKFRSEHELDVALSEDSCSVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGD
SAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAALVSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAIL
VRLSVLQQELNAFTRKADAVLGCSVKEQQESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRS
FRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLSQERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQE
KNMLVQESQQFKHNFLLLFMKLRWFLKRWRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADM
KVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAELKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQ
LQISERNWSQEKLQLVERLQGEKQQVEQQVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKS
VSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSSEPGDPEKDTKEKPGLSSRDCNHLGALACQDPPGSQKLPFLLILAPPQPPPIL

--------------------------------------------------------------
>8490_8490_11_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000395913_SOGA1_chr20_35445872_ENST00000237536_length(transcript)=13852nt_BP=894nt
CTTAAACGCGACTCAAGGCGTCGGGTTTGTTGTCAACCAATCACAAGGCAGCCTCGCTCGAGCGCAGGCCAATCGGCTTTCTAGCTAGAG
GGTTTAACTCCTATTTAAAAAGAAGAACCTTTGAATTCTAACGGCTGAGCTCTTGGAAGACTTGGGTCCTTGGGTCGCAGGTGGGAGCCG
ACGGGCATCATGGACCGATCTAAAGAAAACTGCATTTCAGGACCTGTTAAGGCTACAGCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTG
ACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTAAATAGTGGCCAGGCTCAGCGGGTCTTGTGTCCTTCAAATTCTTCCCAGCGCATT
CCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAGCCGGTTCAGAATCAGAAGCAGAAGCAATTGCAGGCAACCAGTGTACCTCATCCT
GTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCTGAAAATAATCCTGAGGAGGAACTGGCATCAAAA
CAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCTTTGGAAGACTTTGAAATTGGTCGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTT
TATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTGGCTCTTAAAGTGTTATTTAAAGCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAG
CTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGGCATCCTAATATTCTTAGACTGTATGGTTATTTCCATGATGCTACCAGAGTCTAC
CTAATTCTGGAATATGCACCACTTGGAACAGTTTATAGAGAACTTCAGAAACTTTCAAAGTTTGATGAGCAGAGAACTGCTACTCATTCC
TTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCCGATAAGAAGACTTTGGTGCAGGAGGACAGTGCAGACCTGAAGTGCCAGTTGCAC
TTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAGCTCACTAAGCTTGCCAAGGAGAATGACAGCATGAAGGAGGAGCTGCTGAAGTAC
CGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCAGCCGAGGAGCTGGCCGATGCCCCCCACTCGCGGGAGACCGAGCTGAAGGTGCAC
CTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGCCGCCGCATCGTGGAGCTGGAGGTGGAGAACCGAGGCCTGCGGGCTGAGATGGAC
GACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCTGAGGCACGCCTGGCCTTCTCCGCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTG
GCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAGGAGGCCGAGCTGCTGCGGCGCTCCTCTGCCGAGCTCGAGGACCAGAACAAGCTG
CTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCACGAGCTGGACGTGGCGCTGTCGGAGGACAGTTGTTCTGTGCTCAGCGAACCTTCA
CAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGCGAGCTCAGCGGCAAGGTCAAGAAGCTGCAGTACGAGAACCGCGTGCTCCTCTCC
AACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGTACGCGGCCCATGCTGGAGACGGACGCCGAGGCCGGGGACTCTGCCCAGTGTGTG
CCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCGGTCCGACTCTGCAGAGCCAGGGAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAG
GCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTGGCTGATGCCAATGGCTTCACGGCTGGCCTCCGGCTGTGTCTGGACAACGAGTGT
GCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGCGAGGGCCCCAGGGACACCAAGCTCATCCATGCCATCCTGGTGCGCCTGAGCGTG
CTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCAGATGCAGTCCTCGGGTGCTCTGTCAAGGAACAGCAGGAGTCCTTCTCATCACTG
CCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATTCTTCTGGCAAAAGACCTTGGCTCAGACTTTCAGCCACCTGACTTCAGGGACCTG
CCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGCACTGGTGACTTGGACTCTAAGCCCGACCCCAGCCGGAGCTTCAGGCCTTACCGA
GCTGAAGACAATGATTCCTATGCCTCTGAGATCAAGGAGCTGCAGCTGGTGCTGGCTGAGGCCCACGACAGCCTCCGGGGCTTGCAAGAG
CAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAGGCCGACAATTTCAACCAGAAAATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCG
CTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGCCTCCAGCGGAGGCTGGAGCAGAAATTCTGGAGCCAGGAGAAGAACATGCTGGTG
CAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTGCTCTTCATGAAGCTCAGGTGGTTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTG
CCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGCATGAAGGAGCTGTACTTGCTGATGGAGGAAGAGGAGATAAACGCTCAGCATTCT
GATAACAAGGCCTGCACGGGGGACAGCTGGACCCAGAACACGCCCAATGAGTACATCAAGACACTGGCCGACATGAAGGTGACGCTGAAG
GAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGTCTGACGGAGCTTCAGCAACAGTTTGCCAAGGCCAAGGCTACCTGGGAGACAGAG
CGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAGCTGAAGACAGGGAAGGGGGCCGGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCC
CTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTAGCTGAGTCCTACAGCGCTGTCATGGAGCTGACTCGGCAGCTGCAGATCAGTGAG
CGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAGCGGCTGCAGGGTGAGAAGCAGCAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAAC
CGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGGGTCCTGAAGCACTCGGAGCTGGAGAAGCAGGACAACAGCTGGAAGGAGACACGC
AGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAAGTTGAGCTTGGAGGAAATGGTTTAAAGAGAACCAAATCTGTTTCTTCCATGTCT
GAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTTGCTGGCGGAGATGCCCGGGGCAAGAAGCTGCCTAACAACCCTGCCTTTGGCTTT
GTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACCAAGGAGAAGCCTGGGCTCTCGTCGAGGGACTGCAACCACCTGGGTGCCCTGGCC
TGCCAGGACCCCCCAGGGAGGCAGATGCAGCGCAGCTACACGGCTCCTGACAAGACGGGCATCCGAGTCTACTATAGTCCCCCGGTGGCC
CGGCGCCTCGGAGTCCCTGTGGTTCATGACAAAGAGGGCAAGATCATTATCGAGCCCGGCTTCCTCTTCACCACAGCCAAGCCCAAAGAG
TCGGCCGAGGCTGATGGGCTGGCTGAGAGCTCCTATGGTCGGTGGCTCTGCAACTTCTCACGGCAGCGCCTGGACGGAGGCTCAGCGGGC
AGCCCCTCGGCGGCCGGGCCTGGCTTCCCAGCGGCCCTGCATGACTTTGAGATGTCAGGCAACATGAGTGATGACATGAAGGAGATCACC
AACTGTGTGCGCCAGGCCATGCGCTCCGGCTCACTGGAGAGGAAAGTGAAGAGCACATCCAGCCAGACGGTGGGCCTGGCCAGTGTGGGC
ACACAGACCATCCGCACGGTCAGCGTGGGCCTGCAGACCGACCCACCCCGCAGCAGCCTCCATGGCAAGGCCTGGTCACCCCGCAGCTCT
TCGCTCGTGTCTGTGCGCAGCAAGCAGATCTCCTCCTCCCTGGACAAGGTCCATTCGCGCATCGAGCGGCCCTGCTGCTCCCCCAAGTAT
GGCTCACCAAAGCTCCAGAGGCGGTCTGTGTCCAAGCTGGACAGCAGCAAGGACCGCAGCCTGTGGAACCTGCACCAGGGCAAGCAGAAC
GGCTCGGCCTGGGCCCGCTCCACCACCACGCGGGACAGCCCTGTATTGAGAAACATCAACGATGGACTCTCCAGCCTCTTCAGTGTGGTG
GAGCACTCAGGGAGCACGGAGTCTGTCTGGAAACTAGGCATGTCTGAGACGCGGGCCAAGCCCGAGCCTCCCAAGTACGGCATTGTGCAG
GAATTCTTCCGTAATGTGTGTGGCCGGGCACCGAGCCCCACCTCATCAGCAGGAGAGGAGGGCACCAAGAAGCCAGAGCCCCTCTCCCCA
GCCAGCTACCATCAGCCAGAGGGTGTGGCCAGGATCCTGAACAAGAAGGCAGCCAAGTTGGGCAGCAGTGAGGAGGTCAGACTCACCATG
CTCCCCCAGGTGGGGAAGGATGGTGTCCTCCGGGACGGAGATGGAGCCGTGGTCCTTCCCAATGAGGACGCTGTTTGTGACTGTAGTACC
CAGTCTCTCACCTCCTGCTTCGCCCGATCGTCCCGCTCTGCCATCCGCCACTCTCCTTCCAAGTGCAGGCTGCACCCTTCAGAGTCCAGC
TGGGGTGGGGAGGAGAGGGCACTCCCCCCCAGCGAGTGACAGAGCAGCCAAGCTCCCCGCCTCAACCAGCCCAGCCCCTGGATAGCAGAA
GGGAACCAGCAGAGACGAGACGAGGTGAGGCGAGGGGCTGTGTCCTCAGCATTGCCTGGCCCTGGAGGGACAGCAGTGATGCCACTGCCA
GAATGCAGCTTTCACATCAAGGTAAAGCCGGGTCTCCTGCTGGCCCCTGGGTGGTGAGCTTCGACTTCCCAGGGGAAGGCAGTGAGTGGG
AGAGAGACCAAACCTGGGCTTCCCAAGCATCCACTGAGAGATCTGTCAAGAGCCGATCCCTGGGTCCTAAGAGAGAGCCTTGCCTGGTTC
TGCCCATGCCACCCTCTTGGAAGAGCCCAAGAAGGATACATGTCTGGCCATGCCTTTGGGGAAAAGGAGTCGGAGAGATGTTTCCTGCTG
ACCATCCACCCCTTCATTTGGGAGGAGACACTGCTGAGAAGAACAGGCTTTGCTCTAGGGCTCCATGTTTGGTTCCTGGTGGAGCCCTGT
TGGGCATCATCACCATCACCTCCTTCTCTCCACCACCTCCTCCTCCCAGCCCCACTGCTCTAGATTGCTGGGACACTAGGGAGTATGATA
GGGCAGTAGCCAGGGCCATTGCTTAGTGTCCTGGAGCCCTGGATCTCCCTGCCCATAGCCTGGATGCAGCAAGAGCTGGGAGGCGAAGTG
GAAACATGCAGGGCTCAGGGTTGGGGAGTGATTGCAATTGCCTTCCTTGCCAAAGTGACTTGGGGCCCCAACGTTCCCAGCAGACCCCTT
GAGGACAGAAATAGGTAGAGTCAGTCTCAAGACCTGGTGCATAGATAAATGCCTAAATACACTGCCTTGATCTCAAGTGATCTCAGAGGC
CTCTTTCCCTGGCACCCTGAGAGGCAGCAGGCACTACATCTCCACTGTGTTTACATCCTGCAGCTGGTCGGGGGGCAAAGATATTCCCAG
TAAGAGATTCTTGGTTGGCCAGGTCAGGCCCAGGAGAACACCAAGAGGCCAGAGCCCAGGACACAGCAGTAGACTGGGGCCTGGAAACAC
GTATCTTGCCTAGATTGTTTATTTGAATTTTTCCTACTATAAATATTTAAGGTGGTTTACTTTATTTTAATAATTTAATTTACCCCAAAG
TCCCTAAGGTAATTTATTGGAGGTTGAAACATGCATTCTTGCCACTGGGACAACATGAGGCCTCTAACAGCACGGGCAGGCATGGGGTCC
CCTGGGTGGACGAGGCCGCTTGGCAGCCAGGTTTGGAGACCTGGCCTCCTGGTCAGCTTTGGAGGGCCCCTCAACAGAGCTGGAGCCCTG
CACCCCAACACGGCTGGCCATGTGGCCTCAGAACACTACTTATTACTCAATGCCTGGTACTTGGCCAGCCCCAGCGGTCAGTCTATAAAT
ACTCACTGACAAGGTGGAGGGCTGGACGGCCATCACCACTCCCCAGACGTTCTCCATTGCCTGTCTCATTTCCCTCCCTCCCTCTGACAC
CTTTCTTCATGAGTCGAACGTGGATTACTAAAGCTCTATTAAGAGTGTGGAGATCCCTCCAAGTTTCCCAAATGAGAACTCACAGGAAAA
CAGGACTGAACTTTGAGAATGTTGTTTATCGCAGCTTTGCACATAAACCTGAGTGTCTCCCAGCCTGCCTCGGTTCTCACCAGCCTGCCA
GCCTTTTCACCAGCCTCTCTCCTTAGCCTTATGGCCTTTCACGGCTCTTCTCCCTGCCCCAGCTCTGCTGCCCGCCCTTCCTCACGTCCC
CTGTGAGCTGCCTGAGCCATTGGTTGGATTTCGATGTGGCTCATTGCAGCATGTGGGGCAGCGCCTCCCATGGCCTCGCCTTGGTGCCGG
TGAACCCCTTTTGGTTGCACACATGCTCCCCACACACACATAGACATCAGCCTTCCTGTAACTGACTGGGGACCCAGAGTGGAAACACCA
GGATGGATCAGCTTGTCTGCAGAATTGCCCATCAGGAAGACCAAAAGCCAGTAGCTTTGCTGATCTGCCCCCAGGACTCTGGAGGCGCCC
CTGCACTCCCACCTCCCACCTGCCAGTTCCCAGACCCACCCATTCGGGATCACCTGGACCAGTTACCCCAAGTCCTGCATCTCCCTTCCC
TGCAGGCTGAACACCAGGGTCATGCCAGTCCCGCCAGCCGCCTCCTCCATGCCCCAGTGACTGGTGTGGGCAGAGCAGGCAGCCAGTGGA
GCTGTGGGCCAGTTCCGCTCTTGGATGCTGCTGCTCTCACCCATGAGGTCAGGGGGGCCCTCCAAGGTTATCTCCAGGTGAGGGGATTCA
CATCAGGCCACAAGCCACCAGAGGCCTTCTGCCACCTCCCAGAGCGACAGCCAGGGAGGCTGCGTACTCAGCCTCGGGGAGAAATCCCCG
TGGGACCTGAGCCCCAAGACCTACGGACCACTCAGCCTTACCATCGTACCGTCCAGGATTGTCCTTGCCATCTTTGTTGTCTCAGCCAGA
CCTTGGTTTTCAGTAAAGCCCCAGTTTCTACTTCCTGCATGCCACTGTGCAAGGCCACTCATCACTGTTCCTGCAGAAGCCTCTGGACGT
GGGGCTGGATGGGGTTGAAAATGTTACATGTAAATATTGGTTTGGTTCGGTTTTTAGCATTTTACTTGGTAACTGGTTGTTTTCTTTTTT
GGGGTGGGGGGATTGGTTTGTAAAAATTCTCTACTCTTTTGGAATGTGATTTCTAAGTTTGTTGGTTTCTTCAAATGCCTTTTAAGTCTT
GGTAACATTCCCAAAGCAGAAAACTGCCTGACCCACAGTGGGGATTCCCTGGAGAATTGGGGTCCCAAGAAGGAATGCTGCCCTTCTCGA
ACCCGTTCTCCCCCTTCCTCCTGCCTCTCTGCCTTTTACTGCTATTCCCTTCTTCTCCTCCTTTATCCTTCTTTCTGTTTTCCCCATCTC
CACTCTCTCTTCAACCAAAGTCCCAAGGAACCCTCGGGGCTCAATCCCCCATAGACCACTTGGCTTGGGTCCATGGGGTTGGCATCAGTT
GGTTGGCGGAAATGGGGGACCAGTTGGCATGATGGCCCTAAACTGGGAAACCTCATGTTTCTTATGTCTCACCTCTTTCCAGAGCCAAAT
CAGCCCCTTTTGGAATGATGACTTCATTGGAATGCAAATCAAGTCATTTTGGTGCATCAGTGGCTCTTAGGCCTGCACACACGAGACATC
AGAATCCAATCCTCTGACCCTGTGCCAGCCCTTTCCCCCAGTTTATTTCCCACCAAAGGCTGACCTCTAAGAGGTCTTGCTTTCTATGAA
CTCAAGATGGGTCCCACCTCTAGGTGTCCCCAGGTGCACTCTTCTACCGGTTGGCTTCCGATGTGACAAGGCCAAGGGCCCAAAGACTTG
ACCCTCTTACACCCTTGCTGACATGGTTCCATCATGTCCACCCGCATGCACTTTTATGGTTTCATCACCCAGCCTCTTCTCCTCTGGCCC
ACCCAGCGTCCAGGCTCTTTCTCCCTCTCCCCTCCTATCTAGAATGTCCCCTGCTTCTAGCCTCACCAGACCCCCCAAGCTCCCACTACT
TCTTCCATAATAATAGTAATAACAATGGTTATCATCATCCCCTGCACATCCCGCCTAAAGCACTTTACTATATAGAAAACGTTTCCCCTG
GCCGGGCATGGTGGCTCACGCCTGAAATCCCAGCTCTTTGGGAGGCTGAGGCGAGCGGATCAGTTGAGGTCAGGAGTTCAACGCCAGCCT
GGCCAACGTGGTGAATCCCTGTCTGTACTAAAAGTACAAAAAATTAGCTGAGCATGGTGGTGCGTGCCTGTAATCCCAGCTACTCGGGAG
GCTGAGGTGGGAGAATCGCTTGAGCCCAGGAGGCGGAGGTTGCAGGAGCAGAGATTGCGCCACTGCACTCCAGCCTGGATGACAGAGTGA
GACCCAATCTCAAAAAAGAAATCGTTTCCCACCCCACATCTCCTTCAGACCTCTCAGGGACAACTCTGGGAGGCAGCCTTGGCAGGACAT
GGGTTAGTGCGCCCATTTTGCTGTGAGGAAACTGAGGTACAGGTCTCATCCCAGAGCATGAGAAGTCACTGAGTTTAGATGAGAACTTGG
GTCCAACTCTGTCCTGTTTGCTGTGCAAATCCGCTGCCCTGCTGGGGGCTTTTGGTGGGTCCAGAATACCCAGAATATGCTGCTGGCCAA
CCCAGGCATAAAACAAGTCCATTCTAGATCACTGAGCCTTGTGTATTCCAGAGGGTGATCTGAGGTCCCCATTCAGCAGAATTCTCTGAG
GGCATGTTCAGAATGTAGATTCCTGGGCCCCACCTTGAATTTGCATGTTTAACAAACTCTCCTGGGGTTGAGGGGTGGGTGCAGTGGTCA
CACCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGTGGATCACTTGAGCTCAGGAGTTCGAGCCTGGCCAATATGGTGAAACCCTGTC
TCTACTAAAAATGCAAAAATTAGCCAGGTGTGGTGGCACATGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCACTTGAA
CCTGGTGGGGAGCAGCGGTTGCAGTGAGCCGAGATTGTGCCATTGCGCTCTAGCCTGGGCAACGGAATGAGACTTGTCTCAAAAAAAATA
AAAAATAAAACCAGCCCTCCCCGGGGGATCTTAGGCACTATTGGCCACACCATTGGTGTTCGTGGCCCTGATTATTAGGCTATCTTTCTT
TTTTTAAGTTTTTTTAGATTTATTTTTTATTTTATTTATTTATTTATTTATTATTTATTATTATTATTTTTAGAGACAGGGGGTCTCCCT
ATGTTGCCCAGGCTGGTTTCAAACTCCTGGGCTCAAGTGATCTGCCCTCCTCAGCCTCCCAAAGTGCTGGGGTTACAGGCAGGCATCAGC
CACCGTGCCAGGTCATCTTCCTTTTTCTTTTTTTTTTGGAGACAGAGTCTTGCTCTGTCATTCAGGCTGGAGTACAGTGGCGCCATTTCA
GCTCACCGCAGCCTCCACCTCCCAGGTTCAAGCAGTTCTCCTGCCTCAACCTCCCGAGTAACTGGGATTACAGGTGTGTGCCACCACACC
CGACTAATTTTTTTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCACCCACC
TCGGCCTCCCAAAGTACTGGGATTATAGGCGTGAGCCACGGCATCCAGCCTCATCTTTCTTTTAACCAATAAACATGATGCTGTATCTTA
AAAAGAGCACTGAGCAGGGACTTAAGGGATCGAGTCCTCAACCAAACTGATTTAATTACTCAGGATTTTCAAAAAGCATCAGAGGCTATT
TACAATCTTAATCATAGGGGTTCAGTAAAATAAAAATAAGAAGTAAAAAAGCAAGAGAAATTATTCTGTAAATCTAACTGGTGTAATTCC
CATAATCATGCAATTAAGTTTTACTCTTGAGTTTCCTGACAGCCATTGGTAAAAAGAGAAACACATCAGGATTTATAATTTTTATCATCC
AATTATGGGAAGCAAGCATGTTGGCCCCAGGAGACGAACTCTTCTACTAATTTATAGCATGTATCCTTTCATCAAGGGCCACTGAGCCAG
TTGGTGAGTCAACGGGTGAACCTAAGATGCAAGGATGTTTTCCAGGTGACTATTTAAAAAAATAAATAATCCACCAACGTGATTGACCTT
GGCGAGATCATGTTTCTAGTCTATACCTCAGTTTCCCCATCTGTAAAGTGAGGATAATGTCCCACCCCATGTAACTGTGGTGAGGACCAA
CTGCAACACTGTGCCTGCGAGTCTCCTTGGAAAAGTGTAAGGTTCTACACAAATGGAAAGTGATCTGATCACACTCAGTGTCCCCAGCCC
AGCCTTTCAGTGCCCTGGCCCTGGGGTGGGGGACAATACTCTCCTCACCCCCTTCACTAGTCTTCATGAATAGCAAGGAGGCCATAACAT
AATTTGGTCTAAACCCCTTCCTTTTTAAAAGAATGATGGCAAAATGTGCATAACATTAAATTTACCATTTTAAAGATTTCTAAGTGTCTC
GAAGTACATTTGCAATGTGTAACTGCCACCTCCAGAACTTTTTCATCATCCTAAACTGAAACTCTGTACCCACTAAACAGTAACTCCCCA
CTCCCCCTGTCCCCAGTCCCTGGTAACCTCTATTCTACTTTTTTTTTTTCTCTGTGGATTGGCCTATTCTAGGTATCTCTTTTAAGTAGA
ATCATATAGTATTTGTCCTTTTGTGTCCAGCCCCCTCATTTTTTGAGATGAGGAATCAGGCCCAGAGAAGGCAGTGGCTCACCCAAGGTC
ACATCGCAAACCAGAGGCAGAGCCAAGACCAGAACCCAGGTTTCCTGACTCCTAACAGGTCTTGCATCCCCAGTAGTGAATGAGACACTG
CTTTGACTTTCTGTAATCTTGGTTTAGCCCCTTCCTTTCTCTGGGCTCAGTCTGCTTCCCACCACTCACCCATTCTTTCTTTATCATCTC
CCTCCAAAGCCTCTTGTCCTCCTGCCTCCTCTTCTCCTTGGCTGGTTCCTGCCAGCAAATGGAGTGGCCCTGGTCCATGCCAGGTTTTCC
TCTTCTGGGTCCGGAGCTCACTATAGTATTCAGCCCTCAGTCCTCCCAGGATGTTCTTCCCAGCACTGCTGCCTCGTGCGGATTTTCCCG
TAACCTCAGTAACTGGCTTCTTGTCCCCCTGCTTCCTACCAGGGAAGCCTTCCTGTCCGCGTCTGTGGTTTCCCAACTCTGATATTTGCT
CTCAAATGTGGTGGTGTCCTGGTTCTGTGTTTATTTATTTTGTGTTTTCTCACACACACAAAAAAAAGCCTCTTCCTCCAATGCATCAGG
AGGCACCAGCCCTGCCAGCCCTTCTCACTGGGCTCACCCTGCCCCAGCAACCCCCCGGTGCCACCCTTAAGCCACACTGTCTTCTCTGTA
AGCAGCCTGCCAGCAGCAGCCCCAGCACTTTGCAATGGGCGTGTGTGTGGTGGTGGGTGGGGGGGGCTTGGATCCCTCCTTTTTCCTCCC
CTGCCCTGCCCAGGCCCAGATGGCCTTGACTGTAAAGCAGGTGCTGCCTGACAGGTTCTTCTCTCCCTGTCTCTGGTCATTGATCCATCT
CTTTGTCCATTCAGTATCCAACCATCCTCTCCATTCTCCTCTGGACCTCACCACTCTCAGAGCTGCTTGTCCTGGCAGAATCTACAGTTC
ACCCCAACTCTATGCCTTACCCCTCCCAACCCAACAGCATTTGCAGTTTGCAAAATATACAGACCCAAGTCCTGAGGGGACTGAGGACAT
GATGCTGGGCCCAAGTCTCCTGCTCAGGGCTTCTCTCCAATGCCAGCCCTGCCACTCCTTCCTCACCCTCCTTGGAGCCTCCTCTGCTGC
TTGTCTATCCCAACGGCCCTGCTCCCCTCCCTTCCTGCCCTTCACCAGCTTTCTGGACACCATGCCCTGAGGAAGGGACCTTTGGTTTTC
TCTAAACATCTTTGAAGGGCTGAGGCAGTCAGGGCTGGCTGCCTTGTCACTCTTTATTTGGAAGCCACTCAAACCATTCCCAAGAAGAGG
GACCTCAGCTGGCAATCTGGAAACCTGGCCCAGGTCTGGGCAGATGTCTTCACTTCTCCTACCTTCCCAGTCTTGTGATCCTGTGATGAG
CACCAGGATGGCCCTGTGGTCCCTAGAGCACCCCTCATGCTGTAGGGTCCTGCAGCCCCATCCTTTCTCTACTGGGCCCTGGTATCCTGG
CTCCTCTCTCAGCTCTGCCACTGATCTCTGTGCCTTAGTTTACTTCTCTGCACGGGGGACTCACCCCAAGACCATTTCCAGCAGCTTCCC
AGGTGATGTGGTGCCCCAAGGCTGGGCTTTGCCAGCTGTGGCCCAGCTCCTTAGTGCTGCCCAGGAGACACCAGGCTGCTCAGAATGAGG
TGACTGCGGGCACCATTCTCAGCCAGTGGTTCTTGTATTGCATTCCAGCAGCAGGAATATCACCTGGGAACTTGATAGAAGTGCAGATTA
GCAGCCCCACCCAAGACCCACTGAATTAGAGCTTGTGGAGTGGGGCCCTACAAGCTGGGGTTTTAAGGAGCCCTCCAAATGATTCTGACG
CATAAGAATATGCCAACTGCTGATCTGGGCTAGCCATTAGTAGAGCCTGGGGAGGGACTGGGACTGGCTAGGCCAAGAACAGGTGGAAAA
CACCAGCCTTATCTGGACTCCTGAGATTGGGAACCACCACCAACAAAAACCAACCCTATAGTCGCTCCTCTTGGAAGAGGAAGAGAAGTT
GAAGGGCCTGGAGAAAGCACACATTGTTTGTTTCCCTGCTCCTGCTCACCTCTCTCACTTGTCTTGGTTTTACAAAAGGCTGTGTGGATG
GTGCCAGCCAGGGAGGGGGTGGGAGTCCTGGGGAGGCAGGAGGCAGAAGACCCTGACTGTTTCTCCCTTGGGAACCTCACCATAGGCCAG
ATAGCGCCTCTTCAAACTGAAAGAAATCTTAACTCCACAAAGAAAGCATCCTAAATCCCCAGTTCCTCCTCCTCCCAACCCCAGGGATAC
CTTGTAGACAGTGCCAAAAAACAGCTCCAACCCCCAGCAGCTGGGAAGAGAGCCAGAAGCTGCCCTTCCTCCTCATCCTGGCCCCTCCCC
AGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCAGCCTGGTTTCCTGGTGAGGGTCCTGCAGTCATGGGCCCTGGGGGACCCCCAGGG
CAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCTTGGGCCAACTGTCCACCTCTCTTGTCCACTATTCTCTCCTTTCCACTTCTGTCT
TCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAGGACAACTGCAGTCCAATCCACCAGCTCTCCCTGCCCCTGTGTCTTATTTCAGAC

>8490_8490_11_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000395913_SOGA1_chr20_35445872_ENST00000237536_length(amino acids)=1549AA_BP=245
MGRRWEPTGIMDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSNSSQRIPLQAQKLVSSHKPVQNQKQKQLQ
ATSVPHPVSRPLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLE
KAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATHSLKKRGTRSLGKADKKTLVQEDSA
DLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENR
GLRAEMDDMKDHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELAKFRSEHELDVALSEDSC
SVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEV
LPGLREQAALVSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELNAFTRKADAVLGCSVKEQ
QESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHD
SLRGLQEQLSQERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQFKHNFLLLFMKLRWFLKR
WRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKA
KATWETERAELKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQEKLQLVERLQGEKQQVEQ
QVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLP
NNPAFGFVSSEPGDPEKDTKEKPGLSSRDCNHLGALACQDPPGRQMQRSYTAPDKTGIRVYYSPPVARRLGVPVVHDKEGKIIIEPGFLF
TTAKPKESAEADGLAESSYGRWLCNFSRQRLDGGSAGSPSAAGPGFPAALHDFEMSGNMSDDMKEITNCVRQAMRSGSLERKVKSTSSQT
VGLASVGTQTIRTVSVGLQTDPPRSSLHGKAWSPRSSSLVSVRSKQISSSLDKVHSRIERPCCSPKYGSPKLQRRSVSKLDSSKDRSLWN
LHQGKQNGSAWARSTTTRDSPVLRNINDGLSSLFSVVEHSGSTESVWKLGMSETRAKPEPPKYGIVQEFFRNVCGRAPSPTSSAGEEGTK
KPEPLSPASYHQPEGVARILNKKAAKLGSSEEVRLTMLPQVGKDGVLRDGDGAVVLPNEDAVCDCSTQSLTSCFARSSRSAIRHSPSKCR

--------------------------------------------------------------
>8490_8490_12_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000395913_SOGA1_chr20_35445872_ENST00000279034_length(transcript)=3913nt_BP=894nt
CTTAAACGCGACTCAAGGCGTCGGGTTTGTTGTCAACCAATCACAAGGCAGCCTCGCTCGAGCGCAGGCCAATCGGCTTTCTAGCTAGAG
GGTTTAACTCCTATTTAAAAAGAAGAACCTTTGAATTCTAACGGCTGAGCTCTTGGAAGACTTGGGTCCTTGGGTCGCAGGTGGGAGCCG
ACGGGCATCATGGACCGATCTAAAGAAAACTGCATTTCAGGACCTGTTAAGGCTACAGCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTG
ACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTAAATAGTGGCCAGGCTCAGCGGGTCTTGTGTCCTTCAAATTCTTCCCAGCGCATT
CCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAGCCGGTTCAGAATCAGAAGCAGAAGCAATTGCAGGCAACCAGTGTACCTCATCCT
GTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCTGAAAATAATCCTGAGGAGGAACTGGCATCAAAA
CAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCTTTGGAAGACTTTGAAATTGGTCGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTT
TATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTGGCTCTTAAAGTGTTATTTAAAGCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAG
CTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGGCATCCTAATATTCTTAGACTGTATGGTTATTTCCATGATGCTACCAGAGTCTAC
CTAATTCTGGAATATGCACCACTTGGAACAGTTTATAGAGAACTTCAGAAACTTTCAAAGTTTGATGAGCAGAGAACTGCTACTCATTCC
TTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCCGATAAGAAGACTTTGGTGCAGGAGGACAGTGCAGACCTGAAGTGCCAGTTGCAC
TTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAGCTCACTAAGCTTGCCAAGGAGAATGACAGCATGAAGGAGGAGCTGCTGAAGTAC
CGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCAGCCGAGGAGCTGGCCGATGCCCCCCACTCGCGGGAGACCGAGCTGAAGGTGCAC
CTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGCCGCCGCATCGTGGAGCTGGAGGTGGAGAACCGAGGCCTGCGGGCTGAGATGGAC
GACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCTGAGGCACGCCTGGCCTTCTCCGCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTG
GCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAGGAGGCCGAGCTGCTGCGGCGCTCCTCTGCCGAGCTCGAGGACCAGAACAAGCTG
CTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCACGAGCTGGACGTGGCGCTGTCGGAGGACAGTTGTTCTGTGCTCAGCGAACCTTCA
CAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGCGAGCTCAGCGGCAAGGTCAAGAAGCTGCAGTACGAGAACCGCGTGCTCCTCTCC
AACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGTACGCGGCCCATGCTGGAGACGGACGCCGAGGCCGGGGACTCTGCCCAGTGTGTG
CCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCGGTCCGACTCTGCAGAGCCAGGGAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAG
GCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTGGCTGATGCCAATGGCTTCACGGCTGGCCTCCGGCTGTGTCTGGACAACGAGTGT
GCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGCGAGGGCCCCAGGGACACCAAGCTCATCCATGCCATCCTGGTGCGCCTGAGCGTG
CTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCAGATGCAGTCCTCGGGTGCTCTGTCAAGGAACAGCAGGAGTCCTTCTCATCACTG
CCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATTCTTCTGGCAAAAGACCTTGGCTCAGACTTTCAGCCACCTGACTTCAGGGACCTG
CCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGCACTGGTGACTTGGACTCTAAGCCCGACCCCAGCCGGAGCTTCAGGCCTTACCGA
GCTGAAGACAATGATTCCTATGCCTCTGAGATCAAGGAGCTGCAGCTGGTGCTGGCTGAGGCCCACGACAGCCTCCGGGGCTTGCAAGAG
CAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAGGCCGACAATTTCAACCAGAAAATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCG
CTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGCCTCCAGCGGAGGCTGGAGCAGAAATTCTGGAGCCAGGAGAAGAACATGCTGGTG
CAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTGCTCTTCATGAAGCTCAGGTGGTTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTG
CCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGCATGAAGGAGCTGTACTTGCTGATGGAGGAAGAGGAGATAAACGCTCAGCATTCT
GATAACAAGGCCTGCACGGGGGACAGCTGGACCCAGAACACGCCCAATGAGTACATCAAGACACTGGCCGACATGAAGGTGACGCTGAAG
GAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGTCTGACGGAGCTTCAGCAACAGTTTGCCAAGGCCAAGGCTACCTGGGAGACAGAG
CGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAGCTGAAGACAGGGAAGGGGGCCGGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCC
CTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTAGCTGAGTCCTACAGCGCTGTCATGGAGCTGACTCGGCAGCTGCAGATCAGTGAG
CGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAGCGGCTGCAGGGTGAGAAGCAGCAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAAC
CGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGGGTCCTGAAGCACTCGGAGCTGGAGAAGCAGGACAACAGCTGGAAGGAGACACGC
AGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAAGTTGAGCTTGGAGGAAATGGTTTAAAGAGAACCAAATCTGTTTCTTCCATGTCT
GAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTTGCTGGCGGAGATGCCCGGGGCAAGAAGCTGCCTAACAACCCTGCCTTTGGCTTT
GTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACCAAGGAGAAGCCTGGGCTCTCGTCGAGGGACTGCAACCACCTGGGTGCCCTGGCC
TGCCAGGACCCCCCAGGGAGCCAGAAGCTGCCCTTCCTCCTCATCCTGGCCCCTCCCCAGCCCCCGCCAATACTGTGAACCCCCTTCCCA
CTCAGCCTGGTTTCCTGGTGAGGGTCCTGCAGTCATGGGCCCTGGGGGACCCCCAGGGCAAGGCCCATGGGAGGGAAGGGACCAAGGGCA
TCCTTGGGCCAACTGTCCACCTCTCTTGTCCACTATTCTCTCCTTTCCACTTCTGTCTTCAAAAGGCTCCTTCCTAGGATGGATCGGGTG
CTAGGACAACTGCAGTCCAATCCACCAGCTCTCCCTGCCCCTGTGTCTTATTTCAGACATGAGAATAACTGTACAGTGTAAACTTATAAA

>8490_8490_12_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000395913_SOGA1_chr20_35445872_ENST00000279034_length(amino acids)=1142AA_BP=245
MGRRWEPTGIMDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSNSSQRIPLQAQKLVSSHKPVQNQKQKQLQ
ATSVPHPVSRPLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLE
KAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATHSLKKRGTRSLGKADKKTLVQEDSA
DLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENR
GLRAEMDDMKDHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELAKFRSEHELDVALSEDSC
SVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEV
LPGLREQAALVSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELNAFTRKADAVLGCSVKEQ
QESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHD
SLRGLQEQLSQERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQFKHNFLLLFMKLRWFLKR
WRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKA
KATWETERAELKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQEKLQLVERLQGEKQQVEQ
QVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLP

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for AURKA-SOGA1


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for AURKA-SOGA1


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for AURKA-SOGA1


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource