|
Fusion Gene Summary | |
Fusion Gene ORF analysis | |
Fusion Genomic Features | |
Fusion Protein Features | |
Fusion Gene Sequence | |
Fusion Gene PPI analysis | |
Related Drugs | |
Related Diseases |
Fusion gene:AURKA-SOGA1 (FusionGDB2 ID:HG6790TG140710) |
Fusion Gene Summary for AURKA-SOGA1 |
Fusion gene summary |
Fusion gene information | Fusion gene name: AURKA-SOGA1 | Fusion gene ID: hg6790tg140710 | Hgene | Tgene | Gene symbol | AURKA | SOGA1 | Gene ID | 6790 | 140710 |
Gene name | aurora kinase A | suppressor of glucose, autophagy associated 1 | |
Synonyms | AIK|ARK1|AURA|BTAK|PPP1R47|STK15|STK6|STK7 | C20orf117|KIAA0889|SOGA | |
Cytomap | ('AURKA')('SOGA1') 20q13.2 | 20q11.23 | |
Type of gene | protein-coding | protein-coding | |
Description | aurora kinase Aaurora 2aurora/IPL1-like kinaseaurora/IPL1-related kinase 1breast tumor-amplified kinaseprotein phosphatase 1, regulatory subunit 47serine/threonine protein kinase 15serine/threonine-protein kinase 6serine/threonine-protein kinase a | protein SOGA1SOGA family member 1suppressor of glucose by autophagysuppressor of glucose from autophagysuppressor of glucose, autophagy-associated protein 1 | |
Modification date | 20200329 | 20200313 | |
UniProtAcc | . | . | |
Ensembl transtripts involved in fusion gene | ENST00000312783, ENST00000347343, ENST00000371356, ENST00000395907, ENST00000395909, ENST00000395911, ENST00000395913, ENST00000395914, ENST00000395915, | ||
Fusion gene scores | * DoF score | 6 X 8 X 3=144 | 10 X 10 X 4=400 |
# samples | 8 | 12 | |
** MAII score | log2(8/144*10)=-0.84799690655495 possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs). DoF>8 and MAII<0 | log2(12/400*10)=-1.73696559416621 possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs). DoF>8 and MAII<0 | |
Context | PubMed: AURKA [Title/Abstract] AND SOGA1 [Title/Abstract] AND fusion [Title/Abstract] | ||
Most frequent breakpoint | AURKA(54956489)-SOGA1(35445872), # samples:2 | ||
Anticipated loss of major functional domain due to fusion event. | AURKA-SOGA1 seems lost the major protein functional domain in Hgene partner, which is a CGC by not retaining the major functional domain in the partially deleted in-frame ORF. AURKA-SOGA1 seems lost the major protein functional domain in Hgene partner, which is a essential gene by not retaining the major functional domain in the partially deleted in-frame ORF. AURKA-SOGA1 seems lost the major protein functional domain in Hgene partner, which is a epigenetic factor due to the frame-shifted ORF. AURKA-SOGA1 seems lost the major protein functional domain in Hgene partner, which is a essential gene due to the frame-shifted ORF. AURKA-SOGA1 seems lost the major protein functional domain in Hgene partner, which is a IUPHAR drug target due to the frame-shifted ORF. AURKA-SOGA1 seems lost the major protein functional domain in Hgene partner, which is a kinase due to the frame-shifted ORF. AURKA-SOGA1 seems lost the major protein functional domain in Tgene partner, which is a essential gene due to the frame-shifted ORF. |
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types ** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10) |
Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez |
Partner | Gene | GO ID | GO term | PubMed ID |
Hgene | AURKA | GO:0006468 | protein phosphorylation | 21600873|21820309 |
Hgene | AURKA | GO:0009611 | response to wounding | 19435814 |
Hgene | AURKA | GO:0032091 | negative regulation of protein binding | 21820309 |
Hgene | AURKA | GO:0097421 | liver regeneration | 19435814 |
Fusion gene breakpoints across AURKA (5'-gene) * Click on the image to open the UCSC genome browser with custom track showing this image in a new window. |
Fusion gene breakpoints across SOGA1 (3'-gene) * Click on the image to open the UCSC genome browser with custom track showing this image in a new window. |
Fusion gene information * All genome coordinats were lifted-over on hg19. * Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser. |
Source | Disease | Sample | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand |
ChimerDB4 | OV | TCGA-61-1728-01A | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
ChimerDB4 | OV | TCGA-61-1728 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
Top |
Fusion Gene ORF analysis for AURKA-SOGA1 |
Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure. * Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser. |
ORF | Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand |
5CDS-intron | ENST00000312783 | ENST00000357779 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000312783 | ENST00000357779 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000312783 | ENST00000456801 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000312783 | ENST00000456801 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000347343 | ENST00000357779 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000347343 | ENST00000357779 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000347343 | ENST00000456801 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000347343 | ENST00000456801 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000371356 | ENST00000357779 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000371356 | ENST00000357779 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000371356 | ENST00000456801 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000371356 | ENST00000456801 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000395907 | ENST00000357779 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000395907 | ENST00000357779 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000395907 | ENST00000456801 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000395907 | ENST00000456801 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000395909 | ENST00000357779 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000395909 | ENST00000357779 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000395909 | ENST00000456801 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000395909 | ENST00000456801 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000395911 | ENST00000357779 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000395911 | ENST00000357779 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000395911 | ENST00000456801 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000395911 | ENST00000456801 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000395913 | ENST00000357779 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000395913 | ENST00000357779 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000395913 | ENST00000456801 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000395913 | ENST00000456801 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000395914 | ENST00000357779 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000395914 | ENST00000357779 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000395914 | ENST00000456801 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000395914 | ENST00000456801 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000395915 | ENST00000357779 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000395915 | ENST00000357779 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000395915 | ENST00000456801 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
5CDS-intron | ENST00000395915 | ENST00000456801 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
Frame-shift | ENST00000347343 | ENST00000237536 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
Frame-shift | ENST00000347343 | ENST00000237536 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
Frame-shift | ENST00000347343 | ENST00000279034 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
Frame-shift | ENST00000347343 | ENST00000279034 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
Frame-shift | ENST00000395907 | ENST00000237536 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
Frame-shift | ENST00000395907 | ENST00000237536 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
Frame-shift | ENST00000395907 | ENST00000279034 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
Frame-shift | ENST00000395907 | ENST00000279034 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
Frame-shift | ENST00000395909 | ENST00000237536 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
Frame-shift | ENST00000395909 | ENST00000237536 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
Frame-shift | ENST00000395909 | ENST00000279034 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
Frame-shift | ENST00000395909 | ENST00000279034 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
Frame-shift | ENST00000395911 | ENST00000237536 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
Frame-shift | ENST00000395911 | ENST00000237536 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
Frame-shift | ENST00000395911 | ENST00000279034 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
Frame-shift | ENST00000395911 | ENST00000279034 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
Frame-shift | ENST00000395914 | ENST00000237536 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
Frame-shift | ENST00000395914 | ENST00000237536 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
Frame-shift | ENST00000395914 | ENST00000279034 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
Frame-shift | ENST00000395914 | ENST00000279034 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
Frame-shift | ENST00000395915 | ENST00000237536 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
Frame-shift | ENST00000395915 | ENST00000237536 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
Frame-shift | ENST00000395915 | ENST00000279034 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
Frame-shift | ENST00000395915 | ENST00000279034 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
In-frame | ENST00000312783 | ENST00000237536 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
In-frame | ENST00000312783 | ENST00000237536 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
In-frame | ENST00000312783 | ENST00000279034 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
In-frame | ENST00000312783 | ENST00000279034 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
In-frame | ENST00000371356 | ENST00000237536 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
In-frame | ENST00000371356 | ENST00000237536 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
In-frame | ENST00000371356 | ENST00000279034 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
In-frame | ENST00000371356 | ENST00000279034 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
In-frame | ENST00000395913 | ENST00000237536 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
In-frame | ENST00000395913 | ENST00000237536 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
In-frame | ENST00000395913 | ENST00000279034 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - |
In-frame | ENST00000395913 | ENST00000279034 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - |
ORFfinder result based on the fusion transcript sequence of in-frame fusion genes. |
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | Seq length (transcript) | BP loci (transcript) | Predicted start (transcript) | Predicted stop (transcript) | Seq length (amino acids) |
ENST00000371356 | AURKA | chr20 | 54956489 | - | ENST00000237536 | SOGA1 | chr20 | 35445872 | - | 13795 | 837 | 18 | 4751 | 1577 |
ENST00000371356 | AURKA | chr20 | 54956489 | - | ENST00000279034 | SOGA1 | chr20 | 35445872 | - | 3856 | 837 | 18 | 3530 | 1170 |
ENST00000312783 | AURKA | chr20 | 54956489 | - | ENST00000237536 | SOGA1 | chr20 | 35445872 | - | 13906 | 948 | 243 | 4862 | 1539 |
ENST00000312783 | AURKA | chr20 | 54956489 | - | ENST00000279034 | SOGA1 | chr20 | 35445872 | - | 3967 | 948 | 243 | 3641 | 1132 |
ENST00000395913 | AURKA | chr20 | 54956489 | - | ENST00000237536 | SOGA1 | chr20 | 35445872 | - | 13852 | 894 | 159 | 4808 | 1549 |
ENST00000395913 | AURKA | chr20 | 54956489 | - | ENST00000279034 | SOGA1 | chr20 | 35445872 | - | 3913 | 894 | 159 | 3587 | 1142 |
ENST00000371356 | AURKA | chr20 | 54956488 | - | ENST00000237536 | SOGA1 | chr20 | 35445872 | - | 13795 | 837 | 18 | 4751 | 1577 |
ENST00000371356 | AURKA | chr20 | 54956488 | - | ENST00000279034 | SOGA1 | chr20 | 35445872 | - | 3856 | 837 | 18 | 3530 | 1170 |
ENST00000312783 | AURKA | chr20 | 54956488 | - | ENST00000237536 | SOGA1 | chr20 | 35445872 | - | 13906 | 948 | 243 | 4862 | 1539 |
ENST00000312783 | AURKA | chr20 | 54956488 | - | ENST00000279034 | SOGA1 | chr20 | 35445872 | - | 3967 | 948 | 243 | 3641 | 1132 |
ENST00000395913 | AURKA | chr20 | 54956488 | - | ENST00000237536 | SOGA1 | chr20 | 35445872 | - | 13852 | 894 | 159 | 4808 | 1549 |
ENST00000395913 | AURKA | chr20 | 54956488 | - | ENST00000279034 | SOGA1 | chr20 | 35445872 | - | 3913 | 894 | 159 | 3587 | 1142 |
DeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated. |
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | No-coding score | Coding score |
ENST00000371356 | ENST00000237536 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - | 0.000988644 | 0.9990113 |
ENST00000371356 | ENST00000279034 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - | 0.008388676 | 0.9916113 |
ENST00000312783 | ENST00000237536 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - | 0.00114484 | 0.9988551 |
ENST00000312783 | ENST00000279034 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - | 0.009892589 | 0.9901074 |
ENST00000395913 | ENST00000237536 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - | 0.001056832 | 0.99894315 |
ENST00000395913 | ENST00000279034 | AURKA | chr20 | 54956489 | - | SOGA1 | chr20 | 35445872 | - | 0.009010821 | 0.99098915 |
ENST00000371356 | ENST00000237536 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - | 0.000988644 | 0.9990113 |
ENST00000371356 | ENST00000279034 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - | 0.008388676 | 0.9916113 |
ENST00000312783 | ENST00000237536 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - | 0.00114484 | 0.9988551 |
ENST00000312783 | ENST00000279034 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - | 0.009892589 | 0.9901074 |
ENST00000395913 | ENST00000237536 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - | 0.001056832 | 0.99894315 |
ENST00000395913 | ENST00000279034 | AURKA | chr20 | 54956488 | - | SOGA1 | chr20 | 35445872 | - | 0.009010821 | 0.99098915 |
Top |
Fusion Genomic Features for AURKA-SOGA1 |
FusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints. |
Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | 1-p | p (fusion gene breakpoint) |
Distribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page. |
Top |
Fusion Protein Features for AURKA-SOGA1 |
Four levels of functional features of fusion genes Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr20:54956489/chr20:35445872) - FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels. - How to search 1. Put your fusion gene symbol. 2. Press the tab key until there will be shown the breakpoint information filled. 4. Go down and press 'Search' tab twice. 4. Go down to have the hyperlink of the search result. 5. Click the hyperlink. 6. See the FGviewer result for your fusion gene. |
Main function of each fusion partner protein. (from UniProt) |
Hgene | Tgene |
. | . |
FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}. | FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}. |
Retention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at * Minus value of BPloci means that the break pointn is located before the CDS. |
- In-frame and retained protein feature among the 13 regional features. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000312783 | - | 7 | 10 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000347343 | - | 6 | 9 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000371356 | - | 7 | 10 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395909 | - | 8 | 11 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395911 | - | 7 | 10 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395913 | - | 6 | 9 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395914 | - | 7 | 10 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395915 | - | 6 | 9 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000312783 | - | 7 | 10 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000347343 | - | 6 | 9 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000371356 | - | 7 | 10 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395909 | - | 8 | 11 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395911 | - | 7 | 10 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395913 | - | 6 | 9 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395914 | - | 7 | 10 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395915 | - | 6 | 9 | 211_213 | 235 | 404.0 | Nucleotide binding | ATP |
- In-frame and not-retained protein feature among the 13 regional features. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000312783 | - | 7 | 10 | 133_383 | 235 | 404.0 | Domain | Protein kinase |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000347343 | - | 6 | 9 | 133_383 | 235 | 404.0 | Domain | Protein kinase |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000371356 | - | 7 | 10 | 133_383 | 235 | 404.0 | Domain | Protein kinase |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395909 | - | 8 | 11 | 133_383 | 235 | 404.0 | Domain | Protein kinase |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395911 | - | 7 | 10 | 133_383 | 235 | 404.0 | Domain | Protein kinase |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395913 | - | 6 | 9 | 133_383 | 235 | 404.0 | Domain | Protein kinase |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395914 | - | 7 | 10 | 133_383 | 235 | 404.0 | Domain | Protein kinase |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395915 | - | 6 | 9 | 133_383 | 235 | 404.0 | Domain | Protein kinase |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000312783 | - | 7 | 10 | 133_383 | 235 | 404.0 | Domain | Protein kinase |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000347343 | - | 6 | 9 | 133_383 | 235 | 404.0 | Domain | Protein kinase |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000371356 | - | 7 | 10 | 133_383 | 235 | 404.0 | Domain | Protein kinase |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395909 | - | 8 | 11 | 133_383 | 235 | 404.0 | Domain | Protein kinase |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395911 | - | 7 | 10 | 133_383 | 235 | 404.0 | Domain | Protein kinase |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395913 | - | 6 | 9 | 133_383 | 235 | 404.0 | Domain | Protein kinase |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395914 | - | 7 | 10 | 133_383 | 235 | 404.0 | Domain | Protein kinase |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395915 | - | 6 | 9 | 133_383 | 235 | 404.0 | Domain | Protein kinase |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000312783 | - | 7 | 10 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000347343 | - | 6 | 9 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000371356 | - | 7 | 10 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395909 | - | 8 | 11 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395911 | - | 7 | 10 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395913 | - | 6 | 9 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395914 | - | 7 | 10 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395915 | - | 6 | 9 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000312783 | - | 7 | 10 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000347343 | - | 6 | 9 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000371356 | - | 7 | 10 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395909 | - | 8 | 11 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395911 | - | 7 | 10 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395913 | - | 6 | 9 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395914 | - | 7 | 10 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395915 | - | 6 | 9 | 260_261 | 235 | 404.0 | Nucleotide binding | ATP |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000312783 | - | 7 | 10 | 280_293 | 235 | 404.0 | Region | Activation segment |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000347343 | - | 6 | 9 | 280_293 | 235 | 404.0 | Region | Activation segment |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000371356 | - | 7 | 10 | 280_293 | 235 | 404.0 | Region | Activation segment |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395909 | - | 8 | 11 | 280_293 | 235 | 404.0 | Region | Activation segment |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395911 | - | 7 | 10 | 280_293 | 235 | 404.0 | Region | Activation segment |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395913 | - | 6 | 9 | 280_293 | 235 | 404.0 | Region | Activation segment |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395914 | - | 7 | 10 | 280_293 | 235 | 404.0 | Region | Activation segment |
Hgene | AURKA | chr20:54956488 | chr20:35445872 | ENST00000395915 | - | 6 | 9 | 280_293 | 235 | 404.0 | Region | Activation segment |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000312783 | - | 7 | 10 | 280_293 | 235 | 404.0 | Region | Activation segment |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000347343 | - | 6 | 9 | 280_293 | 235 | 404.0 | Region | Activation segment |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000371356 | - | 7 | 10 | 280_293 | 235 | 404.0 | Region | Activation segment |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395909 | - | 8 | 11 | 280_293 | 235 | 404.0 | Region | Activation segment |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395911 | - | 7 | 10 | 280_293 | 235 | 404.0 | Region | Activation segment |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395913 | - | 6 | 9 | 280_293 | 235 | 404.0 | Region | Activation segment |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395914 | - | 7 | 10 | 280_293 | 235 | 404.0 | Region | Activation segment |
Hgene | AURKA | chr20:54956489 | chr20:35445872 | ENST00000395915 | - | 6 | 9 | 280_293 | 235 | 404.0 | Region | Activation segment |
Top |
Fusion Gene Sequence for AURKA-SOGA1 |
For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones. |
>8490_8490_1_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000312783_SOGA1_chr20_35445872_ENST00000237536_length(transcript)=13906nt_BP=948nt ACAAGGCAGCCTCGCTCGAGCGCAGGCCAATCGGCTTTCTAGCTAGAGGGTTTAACTCCTATTTAAAAAGAAGAACCTTTGAATTCTAAC GGCTGAGCTCTTGGAAGACTTGGGTCCTTGGGTCGCAGGGTCTCACTCCATTGCCCAGGCCAGAGTGCGGGGATATTTGATAAGAAACTT CAGTGAAGGCCGGGCGCGGTGGCTCATGCCCGTAATCCCAGCATTTTCGGAGGCCGAGGCATCATGGACCGATCTAAAGAAAACTGCATT TCAGGACCTGTTAAGGCTACAGCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTGACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTA AATAGTGGCCAGGCTCAGCGGGTCTTGTGTCCTTCAAATTCTTCCCAGCGCATTCCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAG CCGGTTCAGAATCAGAAGCAGAAGCAATTGCAGGCAACCAGTGTACCTCATCCTGTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAG CAGCCCCTGCCATCGGCACCTGAAAATAATCCTGAGGAGGAACTGGCATCAAAACAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCT TTGGAAGACTTTGAAATTGGTCGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTTTATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTG GCTCTTAAAGTGTTATTTAAAGCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAGCTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGG CATCCTAATATTCTTAGACTGTATGGTTATTTCCATGATGCTACCAGAGTCTACCTAATTCTGGAATATGCACCACTTGGAACAGTTTAT AGAGAACTTCAGAAACTTTCAAAGTTTGATGAGCAGAGAACTGCTACTCATTCCTTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCC GATAAGAAGACTTTGGTGCAGGAGGACAGTGCAGACCTGAAGTGCCAGTTGCACTTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAG CTCACTAAGCTTGCCAAGGAGAATGACAGCATGAAGGAGGAGCTGCTGAAGTACCGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCA GCCGAGGAGCTGGCCGATGCCCCCCACTCGCGGGAGACCGAGCTGAAGGTGCACCTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGC CGCCGCATCGTGGAGCTGGAGGTGGAGAACCGAGGCCTGCGGGCTGAGATGGACGACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCT GAGGCACGCCTGGCCTTCTCCGCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTGGCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAG GAGGCCGAGCTGCTGCGGCGCTCCTCTGCCGAGCTCGAGGACCAGAACAAGCTGCTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCAC GAGCTGGACGTGGCGCTGTCGGAGGACAGTTGTTCTGTGCTCAGCGAACCTTCACAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGC GAGCTCAGCGGCAAGGTCAAGAAGCTGCAGTACGAGAACCGCGTGCTCCTCTCCAACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGT ACGCGGCCCATGCTGGAGACGGACGCCGAGGCCGGGGACTCTGCCCAGTGTGTGCCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCG GTCCGACTCTGCAGAGCCAGGGAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAGGCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTG GCTGATGCCAATGGCTTCACGGCTGGCCTCCGGCTGTGTCTGGACAACGAGTGTGCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGC GAGGGCCCCAGGGACACCAAGCTCATCCATGCCATCCTGGTGCGCCTGAGCGTGCTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCA GATGCAGTCCTCGGGTGCTCTGTCAAGGAACAGCAGGAGTCCTTCTCATCACTGCCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATT CTTCTGGCAAAAGACCTTGGCTCAGACTTTCAGCCACCTGACTTCAGGGACCTGCCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGC ACTGGTGACTTGGACTCTAAGCCCGACCCCAGCCGGAGCTTCAGGCCTTACCGAGCTGAAGACAATGATTCCTATGCCTCTGAGATCAAG GAGCTGCAGCTGGTGCTGGCTGAGGCCCACGACAGCCTCCGGGGCTTGCAAGAGCAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAG GCCGACAATTTCAACCAGAAAATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCGCTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGC CTCCAGCGGAGGCTGGAGCAGAAATTCTGGAGCCAGGAGAAGAACATGCTGGTGCAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTG CTCTTCATGAAGCTCAGGTGGTTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTGCCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGC ATGAAGGAGCTGTACTTGCTGATGGAGGAAGAGGAGATAAACGCTCAGCATTCTGATAACAAGGCCTGCACGGGGGACAGCTGGACCCAG AACACGCCCAATGAGTACATCAAGACACTGGCCGACATGAAGGTGACGCTGAAGGAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGT CTGACGGAGCTTCAGCAACAGTTTGCCAAGGCCAAGGCTACCTGGGAGACAGAGCGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAG CTGAAGACAGGGAAGGGGGCCGGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCCCTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTA GCTGAGTCCTACAGCGCTGTCATGGAGCTGACTCGGCAGCTGCAGATCAGTGAGCGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAG CGGCTGCAGGGTGAGAAGCAGCAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAACCGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGG GTCCTGAAGCACTCGGAGCTGGAGAAGCAGGACAACAGCTGGAAGGAGACACGCAGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAA GTTGAGCTTGGAGGAAATGGTTTAAAGAGAACCAAATCTGTTTCTTCCATGTCTGAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTT GCTGGCGGAGATGCCCGGGGCAAGAAGCTGCCTAACAACCCTGCCTTTGGCTTTGTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACC AAGGAGAAGCCTGGGCTCTCGTCGAGGGACTGCAACCACCTGGGTGCCCTGGCCTGCCAGGACCCCCCAGGGAGGCAGATGCAGCGCAGC TACACGGCTCCTGACAAGACGGGCATCCGAGTCTACTATAGTCCCCCGGTGGCCCGGCGCCTCGGAGTCCCTGTGGTTCATGACAAAGAG GGCAAGATCATTATCGAGCCCGGCTTCCTCTTCACCACAGCCAAGCCCAAAGAGTCGGCCGAGGCTGATGGGCTGGCTGAGAGCTCCTAT GGTCGGTGGCTCTGCAACTTCTCACGGCAGCGCCTGGACGGAGGCTCAGCGGGCAGCCCCTCGGCGGCCGGGCCTGGCTTCCCAGCGGCC CTGCATGACTTTGAGATGTCAGGCAACATGAGTGATGACATGAAGGAGATCACCAACTGTGTGCGCCAGGCCATGCGCTCCGGCTCACTG GAGAGGAAAGTGAAGAGCACATCCAGCCAGACGGTGGGCCTGGCCAGTGTGGGCACACAGACCATCCGCACGGTCAGCGTGGGCCTGCAG ACCGACCCACCCCGCAGCAGCCTCCATGGCAAGGCCTGGTCACCCCGCAGCTCTTCGCTCGTGTCTGTGCGCAGCAAGCAGATCTCCTCC TCCCTGGACAAGGTCCATTCGCGCATCGAGCGGCCCTGCTGCTCCCCCAAGTATGGCTCACCAAAGCTCCAGAGGCGGTCTGTGTCCAAG CTGGACAGCAGCAAGGACCGCAGCCTGTGGAACCTGCACCAGGGCAAGCAGAACGGCTCGGCCTGGGCCCGCTCCACCACCACGCGGGAC AGCCCTGTATTGAGAAACATCAACGATGGACTCTCCAGCCTCTTCAGTGTGGTGGAGCACTCAGGGAGCACGGAGTCTGTCTGGAAACTA GGCATGTCTGAGACGCGGGCCAAGCCCGAGCCTCCCAAGTACGGCATTGTGCAGGAATTCTTCCGTAATGTGTGTGGCCGGGCACCGAGC CCCACCTCATCAGCAGGAGAGGAGGGCACCAAGAAGCCAGAGCCCCTCTCCCCAGCCAGCTACCATCAGCCAGAGGGTGTGGCCAGGATC CTGAACAAGAAGGCAGCCAAGTTGGGCAGCAGTGAGGAGGTCAGACTCACCATGCTCCCCCAGGTGGGGAAGGATGGTGTCCTCCGGGAC GGAGATGGAGCCGTGGTCCTTCCCAATGAGGACGCTGTTTGTGACTGTAGTACCCAGTCTCTCACCTCCTGCTTCGCCCGATCGTCCCGC TCTGCCATCCGCCACTCTCCTTCCAAGTGCAGGCTGCACCCTTCAGAGTCCAGCTGGGGTGGGGAGGAGAGGGCACTCCCCCCCAGCGAG TGACAGAGCAGCCAAGCTCCCCGCCTCAACCAGCCCAGCCCCTGGATAGCAGAAGGGAACCAGCAGAGACGAGACGAGGTGAGGCGAGGG GCTGTGTCCTCAGCATTGCCTGGCCCTGGAGGGACAGCAGTGATGCCACTGCCAGAATGCAGCTTTCACATCAAGGTAAAGCCGGGTCTC CTGCTGGCCCCTGGGTGGTGAGCTTCGACTTCCCAGGGGAAGGCAGTGAGTGGGAGAGAGACCAAACCTGGGCTTCCCAAGCATCCACTG AGAGATCTGTCAAGAGCCGATCCCTGGGTCCTAAGAGAGAGCCTTGCCTGGTTCTGCCCATGCCACCCTCTTGGAAGAGCCCAAGAAGGA TACATGTCTGGCCATGCCTTTGGGGAAAAGGAGTCGGAGAGATGTTTCCTGCTGACCATCCACCCCTTCATTTGGGAGGAGACACTGCTG AGAAGAACAGGCTTTGCTCTAGGGCTCCATGTTTGGTTCCTGGTGGAGCCCTGTTGGGCATCATCACCATCACCTCCTTCTCTCCACCAC CTCCTCCTCCCAGCCCCACTGCTCTAGATTGCTGGGACACTAGGGAGTATGATAGGGCAGTAGCCAGGGCCATTGCTTAGTGTCCTGGAG CCCTGGATCTCCCTGCCCATAGCCTGGATGCAGCAAGAGCTGGGAGGCGAAGTGGAAACATGCAGGGCTCAGGGTTGGGGAGTGATTGCA ATTGCCTTCCTTGCCAAAGTGACTTGGGGCCCCAACGTTCCCAGCAGACCCCTTGAGGACAGAAATAGGTAGAGTCAGTCTCAAGACCTG GTGCATAGATAAATGCCTAAATACACTGCCTTGATCTCAAGTGATCTCAGAGGCCTCTTTCCCTGGCACCCTGAGAGGCAGCAGGCACTA CATCTCCACTGTGTTTACATCCTGCAGCTGGTCGGGGGGCAAAGATATTCCCAGTAAGAGATTCTTGGTTGGCCAGGTCAGGCCCAGGAG AACACCAAGAGGCCAGAGCCCAGGACACAGCAGTAGACTGGGGCCTGGAAACACGTATCTTGCCTAGATTGTTTATTTGAATTTTTCCTA CTATAAATATTTAAGGTGGTTTACTTTATTTTAATAATTTAATTTACCCCAAAGTCCCTAAGGTAATTTATTGGAGGTTGAAACATGCAT TCTTGCCACTGGGACAACATGAGGCCTCTAACAGCACGGGCAGGCATGGGGTCCCCTGGGTGGACGAGGCCGCTTGGCAGCCAGGTTTGG AGACCTGGCCTCCTGGTCAGCTTTGGAGGGCCCCTCAACAGAGCTGGAGCCCTGCACCCCAACACGGCTGGCCATGTGGCCTCAGAACAC TACTTATTACTCAATGCCTGGTACTTGGCCAGCCCCAGCGGTCAGTCTATAAATACTCACTGACAAGGTGGAGGGCTGGACGGCCATCAC CACTCCCCAGACGTTCTCCATTGCCTGTCTCATTTCCCTCCCTCCCTCTGACACCTTTCTTCATGAGTCGAACGTGGATTACTAAAGCTC TATTAAGAGTGTGGAGATCCCTCCAAGTTTCCCAAATGAGAACTCACAGGAAAACAGGACTGAACTTTGAGAATGTTGTTTATCGCAGCT TTGCACATAAACCTGAGTGTCTCCCAGCCTGCCTCGGTTCTCACCAGCCTGCCAGCCTTTTCACCAGCCTCTCTCCTTAGCCTTATGGCC TTTCACGGCTCTTCTCCCTGCCCCAGCTCTGCTGCCCGCCCTTCCTCACGTCCCCTGTGAGCTGCCTGAGCCATTGGTTGGATTTCGATG TGGCTCATTGCAGCATGTGGGGCAGCGCCTCCCATGGCCTCGCCTTGGTGCCGGTGAACCCCTTTTGGTTGCACACATGCTCCCCACACA CACATAGACATCAGCCTTCCTGTAACTGACTGGGGACCCAGAGTGGAAACACCAGGATGGATCAGCTTGTCTGCAGAATTGCCCATCAGG AAGACCAAAAGCCAGTAGCTTTGCTGATCTGCCCCCAGGACTCTGGAGGCGCCCCTGCACTCCCACCTCCCACCTGCCAGTTCCCAGACC CACCCATTCGGGATCACCTGGACCAGTTACCCCAAGTCCTGCATCTCCCTTCCCTGCAGGCTGAACACCAGGGTCATGCCAGTCCCGCCA GCCGCCTCCTCCATGCCCCAGTGACTGGTGTGGGCAGAGCAGGCAGCCAGTGGAGCTGTGGGCCAGTTCCGCTCTTGGATGCTGCTGCTC TCACCCATGAGGTCAGGGGGGCCCTCCAAGGTTATCTCCAGGTGAGGGGATTCACATCAGGCCACAAGCCACCAGAGGCCTTCTGCCACC TCCCAGAGCGACAGCCAGGGAGGCTGCGTACTCAGCCTCGGGGAGAAATCCCCGTGGGACCTGAGCCCCAAGACCTACGGACCACTCAGC CTTACCATCGTACCGTCCAGGATTGTCCTTGCCATCTTTGTTGTCTCAGCCAGACCTTGGTTTTCAGTAAAGCCCCAGTTTCTACTTCCT GCATGCCACTGTGCAAGGCCACTCATCACTGTTCCTGCAGAAGCCTCTGGACGTGGGGCTGGATGGGGTTGAAAATGTTACATGTAAATA TTGGTTTGGTTCGGTTTTTAGCATTTTACTTGGTAACTGGTTGTTTTCTTTTTTGGGGTGGGGGGATTGGTTTGTAAAAATTCTCTACTC TTTTGGAATGTGATTTCTAAGTTTGTTGGTTTCTTCAAATGCCTTTTAAGTCTTGGTAACATTCCCAAAGCAGAAAACTGCCTGACCCAC AGTGGGGATTCCCTGGAGAATTGGGGTCCCAAGAAGGAATGCTGCCCTTCTCGAACCCGTTCTCCCCCTTCCTCCTGCCTCTCTGCCTTT TACTGCTATTCCCTTCTTCTCCTCCTTTATCCTTCTTTCTGTTTTCCCCATCTCCACTCTCTCTTCAACCAAAGTCCCAAGGAACCCTCG GGGCTCAATCCCCCATAGACCACTTGGCTTGGGTCCATGGGGTTGGCATCAGTTGGTTGGCGGAAATGGGGGACCAGTTGGCATGATGGC CCTAAACTGGGAAACCTCATGTTTCTTATGTCTCACCTCTTTCCAGAGCCAAATCAGCCCCTTTTGGAATGATGACTTCATTGGAATGCA AATCAAGTCATTTTGGTGCATCAGTGGCTCTTAGGCCTGCACACACGAGACATCAGAATCCAATCCTCTGACCCTGTGCCAGCCCTTTCC CCCAGTTTATTTCCCACCAAAGGCTGACCTCTAAGAGGTCTTGCTTTCTATGAACTCAAGATGGGTCCCACCTCTAGGTGTCCCCAGGTG CACTCTTCTACCGGTTGGCTTCCGATGTGACAAGGCCAAGGGCCCAAAGACTTGACCCTCTTACACCCTTGCTGACATGGTTCCATCATG TCCACCCGCATGCACTTTTATGGTTTCATCACCCAGCCTCTTCTCCTCTGGCCCACCCAGCGTCCAGGCTCTTTCTCCCTCTCCCCTCCT ATCTAGAATGTCCCCTGCTTCTAGCCTCACCAGACCCCCCAAGCTCCCACTACTTCTTCCATAATAATAGTAATAACAATGGTTATCATC ATCCCCTGCACATCCCGCCTAAAGCACTTTACTATATAGAAAACGTTTCCCCTGGCCGGGCATGGTGGCTCACGCCTGAAATCCCAGCTC TTTGGGAGGCTGAGGCGAGCGGATCAGTTGAGGTCAGGAGTTCAACGCCAGCCTGGCCAACGTGGTGAATCCCTGTCTGTACTAAAAGTA CAAAAAATTAGCTGAGCATGGTGGTGCGTGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGTGGGAGAATCGCTTGAGCCCAGGAGGCGG AGGTTGCAGGAGCAGAGATTGCGCCACTGCACTCCAGCCTGGATGACAGAGTGAGACCCAATCTCAAAAAAGAAATCGTTTCCCACCCCA CATCTCCTTCAGACCTCTCAGGGACAACTCTGGGAGGCAGCCTTGGCAGGACATGGGTTAGTGCGCCCATTTTGCTGTGAGGAAACTGAG GTACAGGTCTCATCCCAGAGCATGAGAAGTCACTGAGTTTAGATGAGAACTTGGGTCCAACTCTGTCCTGTTTGCTGTGCAAATCCGCTG CCCTGCTGGGGGCTTTTGGTGGGTCCAGAATACCCAGAATATGCTGCTGGCCAACCCAGGCATAAAACAAGTCCATTCTAGATCACTGAG CCTTGTGTATTCCAGAGGGTGATCTGAGGTCCCCATTCAGCAGAATTCTCTGAGGGCATGTTCAGAATGTAGATTCCTGGGCCCCACCTT GAATTTGCATGTTTAACAAACTCTCCTGGGGTTGAGGGGTGGGTGCAGTGGTCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGG GTGGATCACTTGAGCTCAGGAGTTCGAGCCTGGCCAATATGGTGAAACCCTGTCTCTACTAAAAATGCAAAAATTAGCCAGGTGTGGTGG CACATGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGTGGGGAGCAGCGGTTGCAGTGAGCCGAGATT GTGCCATTGCGCTCTAGCCTGGGCAACGGAATGAGACTTGTCTCAAAAAAAATAAAAAATAAAACCAGCCCTCCCCGGGGGATCTTAGGC ACTATTGGCCACACCATTGGTGTTCGTGGCCCTGATTATTAGGCTATCTTTCTTTTTTTAAGTTTTTTTAGATTTATTTTTTATTTTATT TATTTATTTATTTATTATTTATTATTATTATTTTTAGAGACAGGGGGTCTCCCTATGTTGCCCAGGCTGGTTTCAAACTCCTGGGCTCAA GTGATCTGCCCTCCTCAGCCTCCCAAAGTGCTGGGGTTACAGGCAGGCATCAGCCACCGTGCCAGGTCATCTTCCTTTTTCTTTTTTTTT TGGAGACAGAGTCTTGCTCTGTCATTCAGGCTGGAGTACAGTGGCGCCATTTCAGCTCACCGCAGCCTCCACCTCCCAGGTTCAAGCAGT TCTCCTGCCTCAACCTCCCGAGTAACTGGGATTACAGGTGTGTGCCACCACACCCGACTAATTTTTTTATTTTTAGTAGAGACGGGGTTT CACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCACCCACCTCGGCCTCCCAAAGTACTGGGATTATAGGCGTGAGC CACGGCATCCAGCCTCATCTTTCTTTTAACCAATAAACATGATGCTGTATCTTAAAAAGAGCACTGAGCAGGGACTTAAGGGATCGAGTC CTCAACCAAACTGATTTAATTACTCAGGATTTTCAAAAAGCATCAGAGGCTATTTACAATCTTAATCATAGGGGTTCAGTAAAATAAAAA TAAGAAGTAAAAAAGCAAGAGAAATTATTCTGTAAATCTAACTGGTGTAATTCCCATAATCATGCAATTAAGTTTTACTCTTGAGTTTCC TGACAGCCATTGGTAAAAAGAGAAACACATCAGGATTTATAATTTTTATCATCCAATTATGGGAAGCAAGCATGTTGGCCCCAGGAGACG AACTCTTCTACTAATTTATAGCATGTATCCTTTCATCAAGGGCCACTGAGCCAGTTGGTGAGTCAACGGGTGAACCTAAGATGCAAGGAT GTTTTCCAGGTGACTATTTAAAAAAATAAATAATCCACCAACGTGATTGACCTTGGCGAGATCATGTTTCTAGTCTATACCTCAGTTTCC CCATCTGTAAAGTGAGGATAATGTCCCACCCCATGTAACTGTGGTGAGGACCAACTGCAACACTGTGCCTGCGAGTCTCCTTGGAAAAGT GTAAGGTTCTACACAAATGGAAAGTGATCTGATCACACTCAGTGTCCCCAGCCCAGCCTTTCAGTGCCCTGGCCCTGGGGTGGGGGACAA TACTCTCCTCACCCCCTTCACTAGTCTTCATGAATAGCAAGGAGGCCATAACATAATTTGGTCTAAACCCCTTCCTTTTTAAAAGAATGA TGGCAAAATGTGCATAACATTAAATTTACCATTTTAAAGATTTCTAAGTGTCTCGAAGTACATTTGCAATGTGTAACTGCCACCTCCAGA ACTTTTTCATCATCCTAAACTGAAACTCTGTACCCACTAAACAGTAACTCCCCACTCCCCCTGTCCCCAGTCCCTGGTAACCTCTATTCT ACTTTTTTTTTTTCTCTGTGGATTGGCCTATTCTAGGTATCTCTTTTAAGTAGAATCATATAGTATTTGTCCTTTTGTGTCCAGCCCCCT CATTTTTTGAGATGAGGAATCAGGCCCAGAGAAGGCAGTGGCTCACCCAAGGTCACATCGCAAACCAGAGGCAGAGCCAAGACCAGAACC CAGGTTTCCTGACTCCTAACAGGTCTTGCATCCCCAGTAGTGAATGAGACACTGCTTTGACTTTCTGTAATCTTGGTTTAGCCCCTTCCT TTCTCTGGGCTCAGTCTGCTTCCCACCACTCACCCATTCTTTCTTTATCATCTCCCTCCAAAGCCTCTTGTCCTCCTGCCTCCTCTTCTC CTTGGCTGGTTCCTGCCAGCAAATGGAGTGGCCCTGGTCCATGCCAGGTTTTCCTCTTCTGGGTCCGGAGCTCACTATAGTATTCAGCCC TCAGTCCTCCCAGGATGTTCTTCCCAGCACTGCTGCCTCGTGCGGATTTTCCCGTAACCTCAGTAACTGGCTTCTTGTCCCCCTGCTTCC TACCAGGGAAGCCTTCCTGTCCGCGTCTGTGGTTTCCCAACTCTGATATTTGCTCTCAAATGTGGTGGTGTCCTGGTTCTGTGTTTATTT ATTTTGTGTTTTCTCACACACACAAAAAAAAGCCTCTTCCTCCAATGCATCAGGAGGCACCAGCCCTGCCAGCCCTTCTCACTGGGCTCA CCCTGCCCCAGCAACCCCCCGGTGCCACCCTTAAGCCACACTGTCTTCTCTGTAAGCAGCCTGCCAGCAGCAGCCCCAGCACTTTGCAAT GGGCGTGTGTGTGGTGGTGGGTGGGGGGGGCTTGGATCCCTCCTTTTTCCTCCCCTGCCCTGCCCAGGCCCAGATGGCCTTGACTGTAAA GCAGGTGCTGCCTGACAGGTTCTTCTCTCCCTGTCTCTGGTCATTGATCCATCTCTTTGTCCATTCAGTATCCAACCATCCTCTCCATTC TCCTCTGGACCTCACCACTCTCAGAGCTGCTTGTCCTGGCAGAATCTACAGTTCACCCCAACTCTATGCCTTACCCCTCCCAACCCAACA GCATTTGCAGTTTGCAAAATATACAGACCCAAGTCCTGAGGGGACTGAGGACATGATGCTGGGCCCAAGTCTCCTGCTCAGGGCTTCTCT CCAATGCCAGCCCTGCCACTCCTTCCTCACCCTCCTTGGAGCCTCCTCTGCTGCTTGTCTATCCCAACGGCCCTGCTCCCCTCCCTTCCT GCCCTTCACCAGCTTTCTGGACACCATGCCCTGAGGAAGGGACCTTTGGTTTTCTCTAAACATCTTTGAAGGGCTGAGGCAGTCAGGGCT GGCTGCCTTGTCACTCTTTATTTGGAAGCCACTCAAACCATTCCCAAGAAGAGGGACCTCAGCTGGCAATCTGGAAACCTGGCCCAGGTC TGGGCAGATGTCTTCACTTCTCCTACCTTCCCAGTCTTGTGATCCTGTGATGAGCACCAGGATGGCCCTGTGGTCCCTAGAGCACCCCTC ATGCTGTAGGGTCCTGCAGCCCCATCCTTTCTCTACTGGGCCCTGGTATCCTGGCTCCTCTCTCAGCTCTGCCACTGATCTCTGTGCCTT AGTTTACTTCTCTGCACGGGGGACTCACCCCAAGACCATTTCCAGCAGCTTCCCAGGTGATGTGGTGCCCCAAGGCTGGGCTTTGCCAGC TGTGGCCCAGCTCCTTAGTGCTGCCCAGGAGACACCAGGCTGCTCAGAATGAGGTGACTGCGGGCACCATTCTCAGCCAGTGGTTCTTGT ATTGCATTCCAGCAGCAGGAATATCACCTGGGAACTTGATAGAAGTGCAGATTAGCAGCCCCACCCAAGACCCACTGAATTAGAGCTTGT GGAGTGGGGCCCTACAAGCTGGGGTTTTAAGGAGCCCTCCAAATGATTCTGACGCATAAGAATATGCCAACTGCTGATCTGGGCTAGCCA TTAGTAGAGCCTGGGGAGGGACTGGGACTGGCTAGGCCAAGAACAGGTGGAAAACACCAGCCTTATCTGGACTCCTGAGATTGGGAACCA CCACCAACAAAAACCAACCCTATAGTCGCTCCTCTTGGAAGAGGAAGAGAAGTTGAAGGGCCTGGAGAAAGCACACATTGTTTGTTTCCC TGCTCCTGCTCACCTCTCTCACTTGTCTTGGTTTTACAAAAGGCTGTGTGGATGGTGCCAGCCAGGGAGGGGGTGGGAGTCCTGGGGAGG CAGGAGGCAGAAGACCCTGACTGTTTCTCCCTTGGGAACCTCACCATAGGCCAGATAGCGCCTCTTCAAACTGAAAGAAATCTTAACTCC ACAAAGAAAGCATCCTAAATCCCCAGTTCCTCCTCCTCCCAACCCCAGGGATACCTTGTAGACAGTGCCAAAAAACAGCTCCAACCCCCA GCAGCTGGGAAGAGAGCCAGAAGCTGCCCTTCCTCCTCATCCTGGCCCCTCCCCAGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCA GCCTGGTTTCCTGGTGAGGGTCCTGCAGTCATGGGCCCTGGGGGACCCCCAGGGCAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCT TGGGCCAACTGTCCACCTCTCTTGTCCACTATTCTCTCCTTTCCACTTCTGTCTTCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAG GACAACTGCAGTCCAATCCACCAGCTCTCCCTGCCCCTGTGTCTTATTTCAGACATGAGAATAACTGTACAGTGTAAACTTATAAAGCGT >8490_8490_1_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000312783_SOGA1_chr20_35445872_ENST00000237536_length(amino acids)=1539AA_BP=235 MDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSNSSQRIPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSR PLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRR EVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATHSLKKRGTRSLGKADKKTLVQEDSADLKCQLHFAK EESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMK DHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELAKFRSEHELDVALSEDSCSVLSEPSQEE LAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAAL VSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELNAFTRKADAVLGCSVKEQQESFSSLPPL GSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLS QERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQFKHNFLLLFMKLRWFLKRWRQGKVLPSE GDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAE LKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQEKLQLVERLQGEKQQVEQQVKELQNRLS QLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSS EPGDPEKDTKEKPGLSSRDCNHLGALACQDPPGRQMQRSYTAPDKTGIRVYYSPPVARRLGVPVVHDKEGKIIIEPGFLFTTAKPKESAE ADGLAESSYGRWLCNFSRQRLDGGSAGSPSAAGPGFPAALHDFEMSGNMSDDMKEITNCVRQAMRSGSLERKVKSTSSQTVGLASVGTQT IRTVSVGLQTDPPRSSLHGKAWSPRSSSLVSVRSKQISSSLDKVHSRIERPCCSPKYGSPKLQRRSVSKLDSSKDRSLWNLHQGKQNGSA WARSTTTRDSPVLRNINDGLSSLFSVVEHSGSTESVWKLGMSETRAKPEPPKYGIVQEFFRNVCGRAPSPTSSAGEEGTKKPEPLSPASY HQPEGVARILNKKAAKLGSSEEVRLTMLPQVGKDGVLRDGDGAVVLPNEDAVCDCSTQSLTSCFARSSRSAIRHSPSKCRLHPSESSWGG -------------------------------------------------------------- >8490_8490_2_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000312783_SOGA1_chr20_35445872_ENST00000279034_length(transcript)=3967nt_BP=948nt ACAAGGCAGCCTCGCTCGAGCGCAGGCCAATCGGCTTTCTAGCTAGAGGGTTTAACTCCTATTTAAAAAGAAGAACCTTTGAATTCTAAC GGCTGAGCTCTTGGAAGACTTGGGTCCTTGGGTCGCAGGGTCTCACTCCATTGCCCAGGCCAGAGTGCGGGGATATTTGATAAGAAACTT CAGTGAAGGCCGGGCGCGGTGGCTCATGCCCGTAATCCCAGCATTTTCGGAGGCCGAGGCATCATGGACCGATCTAAAGAAAACTGCATT TCAGGACCTGTTAAGGCTACAGCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTGACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTA AATAGTGGCCAGGCTCAGCGGGTCTTGTGTCCTTCAAATTCTTCCCAGCGCATTCCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAG CCGGTTCAGAATCAGAAGCAGAAGCAATTGCAGGCAACCAGTGTACCTCATCCTGTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAG CAGCCCCTGCCATCGGCACCTGAAAATAATCCTGAGGAGGAACTGGCATCAAAACAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCT TTGGAAGACTTTGAAATTGGTCGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTTTATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTG GCTCTTAAAGTGTTATTTAAAGCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAGCTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGG CATCCTAATATTCTTAGACTGTATGGTTATTTCCATGATGCTACCAGAGTCTACCTAATTCTGGAATATGCACCACTTGGAACAGTTTAT AGAGAACTTCAGAAACTTTCAAAGTTTGATGAGCAGAGAACTGCTACTCATTCCTTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCC GATAAGAAGACTTTGGTGCAGGAGGACAGTGCAGACCTGAAGTGCCAGTTGCACTTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAG CTCACTAAGCTTGCCAAGGAGAATGACAGCATGAAGGAGGAGCTGCTGAAGTACCGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCA GCCGAGGAGCTGGCCGATGCCCCCCACTCGCGGGAGACCGAGCTGAAGGTGCACCTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGC CGCCGCATCGTGGAGCTGGAGGTGGAGAACCGAGGCCTGCGGGCTGAGATGGACGACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCT GAGGCACGCCTGGCCTTCTCCGCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTGGCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAG GAGGCCGAGCTGCTGCGGCGCTCCTCTGCCGAGCTCGAGGACCAGAACAAGCTGCTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCAC GAGCTGGACGTGGCGCTGTCGGAGGACAGTTGTTCTGTGCTCAGCGAACCTTCACAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGC GAGCTCAGCGGCAAGGTCAAGAAGCTGCAGTACGAGAACCGCGTGCTCCTCTCCAACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGT ACGCGGCCCATGCTGGAGACGGACGCCGAGGCCGGGGACTCTGCCCAGTGTGTGCCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCG GTCCGACTCTGCAGAGCCAGGGAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAGGCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTG GCTGATGCCAATGGCTTCACGGCTGGCCTCCGGCTGTGTCTGGACAACGAGTGTGCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGC GAGGGCCCCAGGGACACCAAGCTCATCCATGCCATCCTGGTGCGCCTGAGCGTGCTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCA GATGCAGTCCTCGGGTGCTCTGTCAAGGAACAGCAGGAGTCCTTCTCATCACTGCCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATT CTTCTGGCAAAAGACCTTGGCTCAGACTTTCAGCCACCTGACTTCAGGGACCTGCCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGC ACTGGTGACTTGGACTCTAAGCCCGACCCCAGCCGGAGCTTCAGGCCTTACCGAGCTGAAGACAATGATTCCTATGCCTCTGAGATCAAG GAGCTGCAGCTGGTGCTGGCTGAGGCCCACGACAGCCTCCGGGGCTTGCAAGAGCAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAG GCCGACAATTTCAACCAGAAAATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCGCTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGC CTCCAGCGGAGGCTGGAGCAGAAATTCTGGAGCCAGGAGAAGAACATGCTGGTGCAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTG CTCTTCATGAAGCTCAGGTGGTTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTGCCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGC ATGAAGGAGCTGTACTTGCTGATGGAGGAAGAGGAGATAAACGCTCAGCATTCTGATAACAAGGCCTGCACGGGGGACAGCTGGACCCAG AACACGCCCAATGAGTACATCAAGACACTGGCCGACATGAAGGTGACGCTGAAGGAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGT CTGACGGAGCTTCAGCAACAGTTTGCCAAGGCCAAGGCTACCTGGGAGACAGAGCGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAG CTGAAGACAGGGAAGGGGGCCGGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCCCTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTA GCTGAGTCCTACAGCGCTGTCATGGAGCTGACTCGGCAGCTGCAGATCAGTGAGCGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAG CGGCTGCAGGGTGAGAAGCAGCAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAACCGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGG GTCCTGAAGCACTCGGAGCTGGAGAAGCAGGACAACAGCTGGAAGGAGACACGCAGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAA GTTGAGCTTGGAGGAAATGGTTTAAAGAGAACCAAATCTGTTTCTTCCATGTCTGAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTT GCTGGCGGAGATGCCCGGGGCAAGAAGCTGCCTAACAACCCTGCCTTTGGCTTTGTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACC AAGGAGAAGCCTGGGCTCTCGTCGAGGGACTGCAACCACCTGGGTGCCCTGGCCTGCCAGGACCCCCCAGGGAGCCAGAAGCTGCCCTTC CTCCTCATCCTGGCCCCTCCCCAGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCAGCCTGGTTTCCTGGTGAGGGTCCTGCAGTCAT GGGCCCTGGGGGACCCCCAGGGCAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCTTGGGCCAACTGTCCACCTCTCTTGTCCACTAT TCTCTCCTTTCCACTTCTGTCTTCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAGGACAACTGCAGTCCAATCCACCAGCTCTCCCT GCCCCTGTGTCTTATTTCAGACATGAGAATAACTGTACAGTGTAAACTTATAAAGCGTTTTTAATGGTTGTAGATTGGAAATAAAGTATG >8490_8490_2_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000312783_SOGA1_chr20_35445872_ENST00000279034_length(amino acids)=1132AA_BP=235 MDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSNSSQRIPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSR PLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRR EVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATHSLKKRGTRSLGKADKKTLVQEDSADLKCQLHFAK EESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMK DHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELAKFRSEHELDVALSEDSCSVLSEPSQEE LAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAAL VSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELNAFTRKADAVLGCSVKEQQESFSSLPPL GSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLS QERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQFKHNFLLLFMKLRWFLKRWRQGKVLPSE GDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAE LKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQEKLQLVERLQGEKQQVEQQVKELQNRLS QLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSS -------------------------------------------------------------- >8490_8490_3_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000371356_SOGA1_chr20_35445872_ENST00000237536_length(transcript)=13795nt_BP=837nt CTTGGAAGACTTGGGTCCTTGGGTCGCAGGCTGGAGTGCAATGGTGTGATCTCAGCTCACTGCAACCTCTGCTTCCTGGGTTTAAGTGAT TCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCATCATGGACCGATCTAAAGAAAACTGCATTTCAGGACCTGTTAAGGCTACA GCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTGACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTAAATAGTGGCCAGGCTCAGCGG GTCTTGTGTCCTTCAAATTCTTCCCAGCGCATTCCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAGCCGGTTCAGAATCAGAAGCAG AAGCAATTGCAGGCAACCAGTGTACCTCATCCTGTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCT GAAAATAATCCTGAGGAGGAACTGGCATCAAAACAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCTTTGGAAGACTTTGAAATTGGT CGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTTTATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTGGCTCTTAAAGTGTTATTTAAA GCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAGCTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGGCATCCTAATATTCTTAGACTG TATGGTTATTTCCATGATGCTACCAGAGTCTACCTAATTCTGGAATATGCACCACTTGGAACAGTTTATAGAGAACTTCAGAAACTTTCA AAGTTTGATGAGCAGAGAACTGCTACTCATTCCTTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCCGATAAGAAGACTTTGGTGCAG GAGGACAGTGCAGACCTGAAGTGCCAGTTGCACTTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAGCTCACTAAGCTTGCCAAGGAG AATGACAGCATGAAGGAGGAGCTGCTGAAGTACCGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCAGCCGAGGAGCTGGCCGATGCC CCCCACTCGCGGGAGACCGAGCTGAAGGTGCACCTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGCCGCCGCATCGTGGAGCTGGAG GTGGAGAACCGAGGCCTGCGGGCTGAGATGGACGACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCTGAGGCACGCCTGGCCTTCTCC GCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTGGCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAGGAGGCCGAGCTGCTGCGGCGC TCCTCTGCCGAGCTCGAGGACCAGAACAAGCTGCTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCACGAGCTGGACGTGGCGCTGTCG GAGGACAGTTGTTCTGTGCTCAGCGAACCTTCACAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGCGAGCTCAGCGGCAAGGTCAAG AAGCTGCAGTACGAGAACCGCGTGCTCCTCTCCAACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGTACGCGGCCCATGCTGGAGACG GACGCCGAGGCCGGGGACTCTGCCCAGTGTGTGCCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCGGTCCGACTCTGCAGAGCCAGG GAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAGGCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTGGCTGATGCCAATGGCTTCACG GCTGGCCTCCGGCTGTGTCTGGACAACGAGTGTGCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGCGAGGGCCCCAGGGACACCAAG CTCATCCATGCCATCCTGGTGCGCCTGAGCGTGCTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCAGATGCAGTCCTCGGGTGCTCT GTCAAGGAACAGCAGGAGTCCTTCTCATCACTGCCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATTCTTCTGGCAAAAGACCTTGGC TCAGACTTTCAGCCACCTGACTTCAGGGACCTGCCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGCACTGGTGACTTGGACTCTAAG CCCGACCCCAGCCGGAGCTTCAGGCCTTACCGAGCTGAAGACAATGATTCCTATGCCTCTGAGATCAAGGAGCTGCAGCTGGTGCTGGCT GAGGCCCACGACAGCCTCCGGGGCTTGCAAGAGCAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAGGCCGACAATTTCAACCAGAAA ATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCGCTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGCCTCCAGCGGAGGCTGGAGCAG AAATTCTGGAGCCAGGAGAAGAACATGCTGGTGCAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTGCTCTTCATGAAGCTCAGGTGG TTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTGCCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGCATGAAGGAGCTGTACTTGCTG ATGGAGGAAGAGGAGATAAACGCTCAGCATTCTGATAACAAGGCCTGCACGGGGGACAGCTGGACCCAGAACACGCCCAATGAGTACATC AAGACACTGGCCGACATGAAGGTGACGCTGAAGGAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGTCTGACGGAGCTTCAGCAACAG TTTGCCAAGGCCAAGGCTACCTGGGAGACAGAGCGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAGCTGAAGACAGGGAAGGGGGCC GGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCCCTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTAGCTGAGTCCTACAGCGCTGTC ATGGAGCTGACTCGGCAGCTGCAGATCAGTGAGCGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAGCGGCTGCAGGGTGAGAAGCAG CAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAACCGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGGGTCCTGAAGCACTCGGAGCTG GAGAAGCAGGACAACAGCTGGAAGGAGACACGCAGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAAGTTGAGCTTGGAGGAAATGGT TTAAAGAGAACCAAATCTGTTTCTTCCATGTCTGAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTTGCTGGCGGAGATGCCCGGGGC AAGAAGCTGCCTAACAACCCTGCCTTTGGCTTTGTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACCAAGGAGAAGCCTGGGCTCTCG TCGAGGGACTGCAACCACCTGGGTGCCCTGGCCTGCCAGGACCCCCCAGGGAGGCAGATGCAGCGCAGCTACACGGCTCCTGACAAGACG GGCATCCGAGTCTACTATAGTCCCCCGGTGGCCCGGCGCCTCGGAGTCCCTGTGGTTCATGACAAAGAGGGCAAGATCATTATCGAGCCC GGCTTCCTCTTCACCACAGCCAAGCCCAAAGAGTCGGCCGAGGCTGATGGGCTGGCTGAGAGCTCCTATGGTCGGTGGCTCTGCAACTTC TCACGGCAGCGCCTGGACGGAGGCTCAGCGGGCAGCCCCTCGGCGGCCGGGCCTGGCTTCCCAGCGGCCCTGCATGACTTTGAGATGTCA GGCAACATGAGTGATGACATGAAGGAGATCACCAACTGTGTGCGCCAGGCCATGCGCTCCGGCTCACTGGAGAGGAAAGTGAAGAGCACA TCCAGCCAGACGGTGGGCCTGGCCAGTGTGGGCACACAGACCATCCGCACGGTCAGCGTGGGCCTGCAGACCGACCCACCCCGCAGCAGC CTCCATGGCAAGGCCTGGTCACCCCGCAGCTCTTCGCTCGTGTCTGTGCGCAGCAAGCAGATCTCCTCCTCCCTGGACAAGGTCCATTCG CGCATCGAGCGGCCCTGCTGCTCCCCCAAGTATGGCTCACCAAAGCTCCAGAGGCGGTCTGTGTCCAAGCTGGACAGCAGCAAGGACCGC AGCCTGTGGAACCTGCACCAGGGCAAGCAGAACGGCTCGGCCTGGGCCCGCTCCACCACCACGCGGGACAGCCCTGTATTGAGAAACATC AACGATGGACTCTCCAGCCTCTTCAGTGTGGTGGAGCACTCAGGGAGCACGGAGTCTGTCTGGAAACTAGGCATGTCTGAGACGCGGGCC AAGCCCGAGCCTCCCAAGTACGGCATTGTGCAGGAATTCTTCCGTAATGTGTGTGGCCGGGCACCGAGCCCCACCTCATCAGCAGGAGAG GAGGGCACCAAGAAGCCAGAGCCCCTCTCCCCAGCCAGCTACCATCAGCCAGAGGGTGTGGCCAGGATCCTGAACAAGAAGGCAGCCAAG TTGGGCAGCAGTGAGGAGGTCAGACTCACCATGCTCCCCCAGGTGGGGAAGGATGGTGTCCTCCGGGACGGAGATGGAGCCGTGGTCCTT CCCAATGAGGACGCTGTTTGTGACTGTAGTACCCAGTCTCTCACCTCCTGCTTCGCCCGATCGTCCCGCTCTGCCATCCGCCACTCTCCT TCCAAGTGCAGGCTGCACCCTTCAGAGTCCAGCTGGGGTGGGGAGGAGAGGGCACTCCCCCCCAGCGAGTGACAGAGCAGCCAAGCTCCC CGCCTCAACCAGCCCAGCCCCTGGATAGCAGAAGGGAACCAGCAGAGACGAGACGAGGTGAGGCGAGGGGCTGTGTCCTCAGCATTGCCT GGCCCTGGAGGGACAGCAGTGATGCCACTGCCAGAATGCAGCTTTCACATCAAGGTAAAGCCGGGTCTCCTGCTGGCCCCTGGGTGGTGA GCTTCGACTTCCCAGGGGAAGGCAGTGAGTGGGAGAGAGACCAAACCTGGGCTTCCCAAGCATCCACTGAGAGATCTGTCAAGAGCCGAT CCCTGGGTCCTAAGAGAGAGCCTTGCCTGGTTCTGCCCATGCCACCCTCTTGGAAGAGCCCAAGAAGGATACATGTCTGGCCATGCCTTT GGGGAAAAGGAGTCGGAGAGATGTTTCCTGCTGACCATCCACCCCTTCATTTGGGAGGAGACACTGCTGAGAAGAACAGGCTTTGCTCTA GGGCTCCATGTTTGGTTCCTGGTGGAGCCCTGTTGGGCATCATCACCATCACCTCCTTCTCTCCACCACCTCCTCCTCCCAGCCCCACTG CTCTAGATTGCTGGGACACTAGGGAGTATGATAGGGCAGTAGCCAGGGCCATTGCTTAGTGTCCTGGAGCCCTGGATCTCCCTGCCCATA GCCTGGATGCAGCAAGAGCTGGGAGGCGAAGTGGAAACATGCAGGGCTCAGGGTTGGGGAGTGATTGCAATTGCCTTCCTTGCCAAAGTG ACTTGGGGCCCCAACGTTCCCAGCAGACCCCTTGAGGACAGAAATAGGTAGAGTCAGTCTCAAGACCTGGTGCATAGATAAATGCCTAAA TACACTGCCTTGATCTCAAGTGATCTCAGAGGCCTCTTTCCCTGGCACCCTGAGAGGCAGCAGGCACTACATCTCCACTGTGTTTACATC CTGCAGCTGGTCGGGGGGCAAAGATATTCCCAGTAAGAGATTCTTGGTTGGCCAGGTCAGGCCCAGGAGAACACCAAGAGGCCAGAGCCC AGGACACAGCAGTAGACTGGGGCCTGGAAACACGTATCTTGCCTAGATTGTTTATTTGAATTTTTCCTACTATAAATATTTAAGGTGGTT TACTTTATTTTAATAATTTAATTTACCCCAAAGTCCCTAAGGTAATTTATTGGAGGTTGAAACATGCATTCTTGCCACTGGGACAACATG AGGCCTCTAACAGCACGGGCAGGCATGGGGTCCCCTGGGTGGACGAGGCCGCTTGGCAGCCAGGTTTGGAGACCTGGCCTCCTGGTCAGC TTTGGAGGGCCCCTCAACAGAGCTGGAGCCCTGCACCCCAACACGGCTGGCCATGTGGCCTCAGAACACTACTTATTACTCAATGCCTGG TACTTGGCCAGCCCCAGCGGTCAGTCTATAAATACTCACTGACAAGGTGGAGGGCTGGACGGCCATCACCACTCCCCAGACGTTCTCCAT TGCCTGTCTCATTTCCCTCCCTCCCTCTGACACCTTTCTTCATGAGTCGAACGTGGATTACTAAAGCTCTATTAAGAGTGTGGAGATCCC TCCAAGTTTCCCAAATGAGAACTCACAGGAAAACAGGACTGAACTTTGAGAATGTTGTTTATCGCAGCTTTGCACATAAACCTGAGTGTC TCCCAGCCTGCCTCGGTTCTCACCAGCCTGCCAGCCTTTTCACCAGCCTCTCTCCTTAGCCTTATGGCCTTTCACGGCTCTTCTCCCTGC CCCAGCTCTGCTGCCCGCCCTTCCTCACGTCCCCTGTGAGCTGCCTGAGCCATTGGTTGGATTTCGATGTGGCTCATTGCAGCATGTGGG GCAGCGCCTCCCATGGCCTCGCCTTGGTGCCGGTGAACCCCTTTTGGTTGCACACATGCTCCCCACACACACATAGACATCAGCCTTCCT GTAACTGACTGGGGACCCAGAGTGGAAACACCAGGATGGATCAGCTTGTCTGCAGAATTGCCCATCAGGAAGACCAAAAGCCAGTAGCTT TGCTGATCTGCCCCCAGGACTCTGGAGGCGCCCCTGCACTCCCACCTCCCACCTGCCAGTTCCCAGACCCACCCATTCGGGATCACCTGG ACCAGTTACCCCAAGTCCTGCATCTCCCTTCCCTGCAGGCTGAACACCAGGGTCATGCCAGTCCCGCCAGCCGCCTCCTCCATGCCCCAG TGACTGGTGTGGGCAGAGCAGGCAGCCAGTGGAGCTGTGGGCCAGTTCCGCTCTTGGATGCTGCTGCTCTCACCCATGAGGTCAGGGGGG CCCTCCAAGGTTATCTCCAGGTGAGGGGATTCACATCAGGCCACAAGCCACCAGAGGCCTTCTGCCACCTCCCAGAGCGACAGCCAGGGA GGCTGCGTACTCAGCCTCGGGGAGAAATCCCCGTGGGACCTGAGCCCCAAGACCTACGGACCACTCAGCCTTACCATCGTACCGTCCAGG ATTGTCCTTGCCATCTTTGTTGTCTCAGCCAGACCTTGGTTTTCAGTAAAGCCCCAGTTTCTACTTCCTGCATGCCACTGTGCAAGGCCA CTCATCACTGTTCCTGCAGAAGCCTCTGGACGTGGGGCTGGATGGGGTTGAAAATGTTACATGTAAATATTGGTTTGGTTCGGTTTTTAG CATTTTACTTGGTAACTGGTTGTTTTCTTTTTTGGGGTGGGGGGATTGGTTTGTAAAAATTCTCTACTCTTTTGGAATGTGATTTCTAAG TTTGTTGGTTTCTTCAAATGCCTTTTAAGTCTTGGTAACATTCCCAAAGCAGAAAACTGCCTGACCCACAGTGGGGATTCCCTGGAGAAT TGGGGTCCCAAGAAGGAATGCTGCCCTTCTCGAACCCGTTCTCCCCCTTCCTCCTGCCTCTCTGCCTTTTACTGCTATTCCCTTCTTCTC CTCCTTTATCCTTCTTTCTGTTTTCCCCATCTCCACTCTCTCTTCAACCAAAGTCCCAAGGAACCCTCGGGGCTCAATCCCCCATAGACC ACTTGGCTTGGGTCCATGGGGTTGGCATCAGTTGGTTGGCGGAAATGGGGGACCAGTTGGCATGATGGCCCTAAACTGGGAAACCTCATG TTTCTTATGTCTCACCTCTTTCCAGAGCCAAATCAGCCCCTTTTGGAATGATGACTTCATTGGAATGCAAATCAAGTCATTTTGGTGCAT CAGTGGCTCTTAGGCCTGCACACACGAGACATCAGAATCCAATCCTCTGACCCTGTGCCAGCCCTTTCCCCCAGTTTATTTCCCACCAAA GGCTGACCTCTAAGAGGTCTTGCTTTCTATGAACTCAAGATGGGTCCCACCTCTAGGTGTCCCCAGGTGCACTCTTCTACCGGTTGGCTT CCGATGTGACAAGGCCAAGGGCCCAAAGACTTGACCCTCTTACACCCTTGCTGACATGGTTCCATCATGTCCACCCGCATGCACTTTTAT GGTTTCATCACCCAGCCTCTTCTCCTCTGGCCCACCCAGCGTCCAGGCTCTTTCTCCCTCTCCCCTCCTATCTAGAATGTCCCCTGCTTC TAGCCTCACCAGACCCCCCAAGCTCCCACTACTTCTTCCATAATAATAGTAATAACAATGGTTATCATCATCCCCTGCACATCCCGCCTA AAGCACTTTACTATATAGAAAACGTTTCCCCTGGCCGGGCATGGTGGCTCACGCCTGAAATCCCAGCTCTTTGGGAGGCTGAGGCGAGCG GATCAGTTGAGGTCAGGAGTTCAACGCCAGCCTGGCCAACGTGGTGAATCCCTGTCTGTACTAAAAGTACAAAAAATTAGCTGAGCATGG TGGTGCGTGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGTGGGAGAATCGCTTGAGCCCAGGAGGCGGAGGTTGCAGGAGCAGAGATTG CGCCACTGCACTCCAGCCTGGATGACAGAGTGAGACCCAATCTCAAAAAAGAAATCGTTTCCCACCCCACATCTCCTTCAGACCTCTCAG GGACAACTCTGGGAGGCAGCCTTGGCAGGACATGGGTTAGTGCGCCCATTTTGCTGTGAGGAAACTGAGGTACAGGTCTCATCCCAGAGC ATGAGAAGTCACTGAGTTTAGATGAGAACTTGGGTCCAACTCTGTCCTGTTTGCTGTGCAAATCCGCTGCCCTGCTGGGGGCTTTTGGTG GGTCCAGAATACCCAGAATATGCTGCTGGCCAACCCAGGCATAAAACAAGTCCATTCTAGATCACTGAGCCTTGTGTATTCCAGAGGGTG ATCTGAGGTCCCCATTCAGCAGAATTCTCTGAGGGCATGTTCAGAATGTAGATTCCTGGGCCCCACCTTGAATTTGCATGTTTAACAAAC TCTCCTGGGGTTGAGGGGTGGGTGCAGTGGTCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGTGGATCACTTGAGCTCAGGA GTTCGAGCCTGGCCAATATGGTGAAACCCTGTCTCTACTAAAAATGCAAAAATTAGCCAGGTGTGGTGGCACATGCCTGTAATCCCAGCT ACTTGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGTGGGGAGCAGCGGTTGCAGTGAGCCGAGATTGTGCCATTGCGCTCTAGCCTG GGCAACGGAATGAGACTTGTCTCAAAAAAAATAAAAAATAAAACCAGCCCTCCCCGGGGGATCTTAGGCACTATTGGCCACACCATTGGT GTTCGTGGCCCTGATTATTAGGCTATCTTTCTTTTTTTAAGTTTTTTTAGATTTATTTTTTATTTTATTTATTTATTTATTTATTATTTA TTATTATTATTTTTAGAGACAGGGGGTCTCCCTATGTTGCCCAGGCTGGTTTCAAACTCCTGGGCTCAAGTGATCTGCCCTCCTCAGCCT CCCAAAGTGCTGGGGTTACAGGCAGGCATCAGCCACCGTGCCAGGTCATCTTCCTTTTTCTTTTTTTTTTGGAGACAGAGTCTTGCTCTG TCATTCAGGCTGGAGTACAGTGGCGCCATTTCAGCTCACCGCAGCCTCCACCTCCCAGGTTCAAGCAGTTCTCCTGCCTCAACCTCCCGA GTAACTGGGATTACAGGTGTGTGCCACCACACCCGACTAATTTTTTTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGGT CTTGAACTCCTGACCTCAGGTGATCCACCCACCTCGGCCTCCCAAAGTACTGGGATTATAGGCGTGAGCCACGGCATCCAGCCTCATCTT TCTTTTAACCAATAAACATGATGCTGTATCTTAAAAAGAGCACTGAGCAGGGACTTAAGGGATCGAGTCCTCAACCAAACTGATTTAATT ACTCAGGATTTTCAAAAAGCATCAGAGGCTATTTACAATCTTAATCATAGGGGTTCAGTAAAATAAAAATAAGAAGTAAAAAAGCAAGAG AAATTATTCTGTAAATCTAACTGGTGTAATTCCCATAATCATGCAATTAAGTTTTACTCTTGAGTTTCCTGACAGCCATTGGTAAAAAGA GAAACACATCAGGATTTATAATTTTTATCATCCAATTATGGGAAGCAAGCATGTTGGCCCCAGGAGACGAACTCTTCTACTAATTTATAG CATGTATCCTTTCATCAAGGGCCACTGAGCCAGTTGGTGAGTCAACGGGTGAACCTAAGATGCAAGGATGTTTTCCAGGTGACTATTTAA AAAAATAAATAATCCACCAACGTGATTGACCTTGGCGAGATCATGTTTCTAGTCTATACCTCAGTTTCCCCATCTGTAAAGTGAGGATAA TGTCCCACCCCATGTAACTGTGGTGAGGACCAACTGCAACACTGTGCCTGCGAGTCTCCTTGGAAAAGTGTAAGGTTCTACACAAATGGA AAGTGATCTGATCACACTCAGTGTCCCCAGCCCAGCCTTTCAGTGCCCTGGCCCTGGGGTGGGGGACAATACTCTCCTCACCCCCTTCAC TAGTCTTCATGAATAGCAAGGAGGCCATAACATAATTTGGTCTAAACCCCTTCCTTTTTAAAAGAATGATGGCAAAATGTGCATAACATT AAATTTACCATTTTAAAGATTTCTAAGTGTCTCGAAGTACATTTGCAATGTGTAACTGCCACCTCCAGAACTTTTTCATCATCCTAAACT GAAACTCTGTACCCACTAAACAGTAACTCCCCACTCCCCCTGTCCCCAGTCCCTGGTAACCTCTATTCTACTTTTTTTTTTTCTCTGTGG ATTGGCCTATTCTAGGTATCTCTTTTAAGTAGAATCATATAGTATTTGTCCTTTTGTGTCCAGCCCCCTCATTTTTTGAGATGAGGAATC AGGCCCAGAGAAGGCAGTGGCTCACCCAAGGTCACATCGCAAACCAGAGGCAGAGCCAAGACCAGAACCCAGGTTTCCTGACTCCTAACA GGTCTTGCATCCCCAGTAGTGAATGAGACACTGCTTTGACTTTCTGTAATCTTGGTTTAGCCCCTTCCTTTCTCTGGGCTCAGTCTGCTT CCCACCACTCACCCATTCTTTCTTTATCATCTCCCTCCAAAGCCTCTTGTCCTCCTGCCTCCTCTTCTCCTTGGCTGGTTCCTGCCAGCA AATGGAGTGGCCCTGGTCCATGCCAGGTTTTCCTCTTCTGGGTCCGGAGCTCACTATAGTATTCAGCCCTCAGTCCTCCCAGGATGTTCT TCCCAGCACTGCTGCCTCGTGCGGATTTTCCCGTAACCTCAGTAACTGGCTTCTTGTCCCCCTGCTTCCTACCAGGGAAGCCTTCCTGTC CGCGTCTGTGGTTTCCCAACTCTGATATTTGCTCTCAAATGTGGTGGTGTCCTGGTTCTGTGTTTATTTATTTTGTGTTTTCTCACACAC ACAAAAAAAAGCCTCTTCCTCCAATGCATCAGGAGGCACCAGCCCTGCCAGCCCTTCTCACTGGGCTCACCCTGCCCCAGCAACCCCCCG GTGCCACCCTTAAGCCACACTGTCTTCTCTGTAAGCAGCCTGCCAGCAGCAGCCCCAGCACTTTGCAATGGGCGTGTGTGTGGTGGTGGG TGGGGGGGGCTTGGATCCCTCCTTTTTCCTCCCCTGCCCTGCCCAGGCCCAGATGGCCTTGACTGTAAAGCAGGTGCTGCCTGACAGGTT CTTCTCTCCCTGTCTCTGGTCATTGATCCATCTCTTTGTCCATTCAGTATCCAACCATCCTCTCCATTCTCCTCTGGACCTCACCACTCT CAGAGCTGCTTGTCCTGGCAGAATCTACAGTTCACCCCAACTCTATGCCTTACCCCTCCCAACCCAACAGCATTTGCAGTTTGCAAAATA TACAGACCCAAGTCCTGAGGGGACTGAGGACATGATGCTGGGCCCAAGTCTCCTGCTCAGGGCTTCTCTCCAATGCCAGCCCTGCCACTC CTTCCTCACCCTCCTTGGAGCCTCCTCTGCTGCTTGTCTATCCCAACGGCCCTGCTCCCCTCCCTTCCTGCCCTTCACCAGCTTTCTGGA CACCATGCCCTGAGGAAGGGACCTTTGGTTTTCTCTAAACATCTTTGAAGGGCTGAGGCAGTCAGGGCTGGCTGCCTTGTCACTCTTTAT TTGGAAGCCACTCAAACCATTCCCAAGAAGAGGGACCTCAGCTGGCAATCTGGAAACCTGGCCCAGGTCTGGGCAGATGTCTTCACTTCT CCTACCTTCCCAGTCTTGTGATCCTGTGATGAGCACCAGGATGGCCCTGTGGTCCCTAGAGCACCCCTCATGCTGTAGGGTCCTGCAGCC CCATCCTTTCTCTACTGGGCCCTGGTATCCTGGCTCCTCTCTCAGCTCTGCCACTGATCTCTGTGCCTTAGTTTACTTCTCTGCACGGGG GACTCACCCCAAGACCATTTCCAGCAGCTTCCCAGGTGATGTGGTGCCCCAAGGCTGGGCTTTGCCAGCTGTGGCCCAGCTCCTTAGTGC TGCCCAGGAGACACCAGGCTGCTCAGAATGAGGTGACTGCGGGCACCATTCTCAGCCAGTGGTTCTTGTATTGCATTCCAGCAGCAGGAA TATCACCTGGGAACTTGATAGAAGTGCAGATTAGCAGCCCCACCCAAGACCCACTGAATTAGAGCTTGTGGAGTGGGGCCCTACAAGCTG GGGTTTTAAGGAGCCCTCCAAATGATTCTGACGCATAAGAATATGCCAACTGCTGATCTGGGCTAGCCATTAGTAGAGCCTGGGGAGGGA CTGGGACTGGCTAGGCCAAGAACAGGTGGAAAACACCAGCCTTATCTGGACTCCTGAGATTGGGAACCACCACCAACAAAAACCAACCCT ATAGTCGCTCCTCTTGGAAGAGGAAGAGAAGTTGAAGGGCCTGGAGAAAGCACACATTGTTTGTTTCCCTGCTCCTGCTCACCTCTCTCA CTTGTCTTGGTTTTACAAAAGGCTGTGTGGATGGTGCCAGCCAGGGAGGGGGTGGGAGTCCTGGGGAGGCAGGAGGCAGAAGACCCTGAC TGTTTCTCCCTTGGGAACCTCACCATAGGCCAGATAGCGCCTCTTCAAACTGAAAGAAATCTTAACTCCACAAAGAAAGCATCCTAAATC CCCAGTTCCTCCTCCTCCCAACCCCAGGGATACCTTGTAGACAGTGCCAAAAAACAGCTCCAACCCCCAGCAGCTGGGAAGAGAGCCAGA AGCTGCCCTTCCTCCTCATCCTGGCCCCTCCCCAGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCAGCCTGGTTTCCTGGTGAGGGT CCTGCAGTCATGGGCCCTGGGGGACCCCCAGGGCAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCTTGGGCCAACTGTCCACCTCTC TTGTCCACTATTCTCTCCTTTCCACTTCTGTCTTCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAGGACAACTGCAGTCCAATCCAC CAGCTCTCCCTGCCCCTGTGTCTTATTTCAGACATGAGAATAACTGTACAGTGTAAACTTATAAAGCGTTTTTAATGGTTGTAGATTGGA >8490_8490_3_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000371356_SOGA1_chr20_35445872_ENST00000237536_length(amino acids)=1577AA_BP=3 MGRRLECNGVISAHCNLCFLGLSDSPASASRVAGITGIMDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSN SSQRIPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSRPLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKG KFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQR TATHSLKKRGTRSLGKADKKTLVQEDSADLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRET ELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMKDHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELE DQNKLLLNELAKFRSEHELDVALSEDSCSVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGD SAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAALVSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAIL VRLSVLQQELNAFTRKADAVLGCSVKEQQESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRS FRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLSQERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQE KNMLVQESQQFKHNFLLLFMKLRWFLKRWRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADM KVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAELKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQ LQISERNWSQEKLQLVERLQGEKQQVEQQVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKS VSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSSEPGDPEKDTKEKPGLSSRDCNHLGALACQDPPGRQMQRSYTAPDKTGIRVYY SPPVARRLGVPVVHDKEGKIIIEPGFLFTTAKPKESAEADGLAESSYGRWLCNFSRQRLDGGSAGSPSAAGPGFPAALHDFEMSGNMSDD MKEITNCVRQAMRSGSLERKVKSTSSQTVGLASVGTQTIRTVSVGLQTDPPRSSLHGKAWSPRSSSLVSVRSKQISSSLDKVHSRIERPC CSPKYGSPKLQRRSVSKLDSSKDRSLWNLHQGKQNGSAWARSTTTRDSPVLRNINDGLSSLFSVVEHSGSTESVWKLGMSETRAKPEPPK YGIVQEFFRNVCGRAPSPTSSAGEEGTKKPEPLSPASYHQPEGVARILNKKAAKLGSSEEVRLTMLPQVGKDGVLRDGDGAVVLPNEDAV -------------------------------------------------------------- >8490_8490_4_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000371356_SOGA1_chr20_35445872_ENST00000279034_length(transcript)=3856nt_BP=837nt CTTGGAAGACTTGGGTCCTTGGGTCGCAGGCTGGAGTGCAATGGTGTGATCTCAGCTCACTGCAACCTCTGCTTCCTGGGTTTAAGTGAT TCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCATCATGGACCGATCTAAAGAAAACTGCATTTCAGGACCTGTTAAGGCTACA GCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTGACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTAAATAGTGGCCAGGCTCAGCGG GTCTTGTGTCCTTCAAATTCTTCCCAGCGCATTCCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAGCCGGTTCAGAATCAGAAGCAG AAGCAATTGCAGGCAACCAGTGTACCTCATCCTGTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCT GAAAATAATCCTGAGGAGGAACTGGCATCAAAACAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCTTTGGAAGACTTTGAAATTGGT CGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTTTATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTGGCTCTTAAAGTGTTATTTAAA GCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAGCTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGGCATCCTAATATTCTTAGACTG TATGGTTATTTCCATGATGCTACCAGAGTCTACCTAATTCTGGAATATGCACCACTTGGAACAGTTTATAGAGAACTTCAGAAACTTTCA AAGTTTGATGAGCAGAGAACTGCTACTCATTCCTTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCCGATAAGAAGACTTTGGTGCAG GAGGACAGTGCAGACCTGAAGTGCCAGTTGCACTTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAGCTCACTAAGCTTGCCAAGGAG AATGACAGCATGAAGGAGGAGCTGCTGAAGTACCGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCAGCCGAGGAGCTGGCCGATGCC CCCCACTCGCGGGAGACCGAGCTGAAGGTGCACCTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGCCGCCGCATCGTGGAGCTGGAG GTGGAGAACCGAGGCCTGCGGGCTGAGATGGACGACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCTGAGGCACGCCTGGCCTTCTCC GCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTGGCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAGGAGGCCGAGCTGCTGCGGCGC TCCTCTGCCGAGCTCGAGGACCAGAACAAGCTGCTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCACGAGCTGGACGTGGCGCTGTCG GAGGACAGTTGTTCTGTGCTCAGCGAACCTTCACAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGCGAGCTCAGCGGCAAGGTCAAG AAGCTGCAGTACGAGAACCGCGTGCTCCTCTCCAACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGTACGCGGCCCATGCTGGAGACG GACGCCGAGGCCGGGGACTCTGCCCAGTGTGTGCCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCGGTCCGACTCTGCAGAGCCAGG GAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAGGCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTGGCTGATGCCAATGGCTTCACG GCTGGCCTCCGGCTGTGTCTGGACAACGAGTGTGCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGCGAGGGCCCCAGGGACACCAAG CTCATCCATGCCATCCTGGTGCGCCTGAGCGTGCTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCAGATGCAGTCCTCGGGTGCTCT GTCAAGGAACAGCAGGAGTCCTTCTCATCACTGCCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATTCTTCTGGCAAAAGACCTTGGC TCAGACTTTCAGCCACCTGACTTCAGGGACCTGCCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGCACTGGTGACTTGGACTCTAAG CCCGACCCCAGCCGGAGCTTCAGGCCTTACCGAGCTGAAGACAATGATTCCTATGCCTCTGAGATCAAGGAGCTGCAGCTGGTGCTGGCT GAGGCCCACGACAGCCTCCGGGGCTTGCAAGAGCAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAGGCCGACAATTTCAACCAGAAA ATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCGCTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGCCTCCAGCGGAGGCTGGAGCAG AAATTCTGGAGCCAGGAGAAGAACATGCTGGTGCAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTGCTCTTCATGAAGCTCAGGTGG TTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTGCCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGCATGAAGGAGCTGTACTTGCTG ATGGAGGAAGAGGAGATAAACGCTCAGCATTCTGATAACAAGGCCTGCACGGGGGACAGCTGGACCCAGAACACGCCCAATGAGTACATC AAGACACTGGCCGACATGAAGGTGACGCTGAAGGAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGTCTGACGGAGCTTCAGCAACAG TTTGCCAAGGCCAAGGCTACCTGGGAGACAGAGCGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAGCTGAAGACAGGGAAGGGGGCC GGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCCCTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTAGCTGAGTCCTACAGCGCTGTC ATGGAGCTGACTCGGCAGCTGCAGATCAGTGAGCGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAGCGGCTGCAGGGTGAGAAGCAG CAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAACCGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGGGTCCTGAAGCACTCGGAGCTG GAGAAGCAGGACAACAGCTGGAAGGAGACACGCAGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAAGTTGAGCTTGGAGGAAATGGT TTAAAGAGAACCAAATCTGTTTCTTCCATGTCTGAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTTGCTGGCGGAGATGCCCGGGGC AAGAAGCTGCCTAACAACCCTGCCTTTGGCTTTGTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACCAAGGAGAAGCCTGGGCTCTCG TCGAGGGACTGCAACCACCTGGGTGCCCTGGCCTGCCAGGACCCCCCAGGGAGCCAGAAGCTGCCCTTCCTCCTCATCCTGGCCCCTCCC CAGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCAGCCTGGTTTCCTGGTGAGGGTCCTGCAGTCATGGGCCCTGGGGGACCCCCAGG GCAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCTTGGGCCAACTGTCCACCTCTCTTGTCCACTATTCTCTCCTTTCCACTTCTGTC TTCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAGGACAACTGCAGTCCAATCCACCAGCTCTCCCTGCCCCTGTGTCTTATTTCAGA >8490_8490_4_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000371356_SOGA1_chr20_35445872_ENST00000279034_length(amino acids)=1170AA_BP=3 MGRRLECNGVISAHCNLCFLGLSDSPASASRVAGITGIMDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSN SSQRIPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSRPLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKG KFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQR TATHSLKKRGTRSLGKADKKTLVQEDSADLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRET ELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMKDHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELE DQNKLLLNELAKFRSEHELDVALSEDSCSVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGD SAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAALVSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAIL VRLSVLQQELNAFTRKADAVLGCSVKEQQESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRS FRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLSQERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQE KNMLVQESQQFKHNFLLLFMKLRWFLKRWRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADM KVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAELKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQ LQISERNWSQEKLQLVERLQGEKQQVEQQVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKS VSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSSEPGDPEKDTKEKPGLSSRDCNHLGALACQDPPGSQKLPFLLILAPPQPPPIL -------------------------------------------------------------- >8490_8490_5_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000395913_SOGA1_chr20_35445872_ENST00000237536_length(transcript)=13852nt_BP=894nt CTTAAACGCGACTCAAGGCGTCGGGTTTGTTGTCAACCAATCACAAGGCAGCCTCGCTCGAGCGCAGGCCAATCGGCTTTCTAGCTAGAG GGTTTAACTCCTATTTAAAAAGAAGAACCTTTGAATTCTAACGGCTGAGCTCTTGGAAGACTTGGGTCCTTGGGTCGCAGGTGGGAGCCG ACGGGCATCATGGACCGATCTAAAGAAAACTGCATTTCAGGACCTGTTAAGGCTACAGCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTG ACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTAAATAGTGGCCAGGCTCAGCGGGTCTTGTGTCCTTCAAATTCTTCCCAGCGCATT CCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAGCCGGTTCAGAATCAGAAGCAGAAGCAATTGCAGGCAACCAGTGTACCTCATCCT GTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCTGAAAATAATCCTGAGGAGGAACTGGCATCAAAA CAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCTTTGGAAGACTTTGAAATTGGTCGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTT TATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTGGCTCTTAAAGTGTTATTTAAAGCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAG CTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGGCATCCTAATATTCTTAGACTGTATGGTTATTTCCATGATGCTACCAGAGTCTAC CTAATTCTGGAATATGCACCACTTGGAACAGTTTATAGAGAACTTCAGAAACTTTCAAAGTTTGATGAGCAGAGAACTGCTACTCATTCC TTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCCGATAAGAAGACTTTGGTGCAGGAGGACAGTGCAGACCTGAAGTGCCAGTTGCAC TTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAGCTCACTAAGCTTGCCAAGGAGAATGACAGCATGAAGGAGGAGCTGCTGAAGTAC CGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCAGCCGAGGAGCTGGCCGATGCCCCCCACTCGCGGGAGACCGAGCTGAAGGTGCAC CTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGCCGCCGCATCGTGGAGCTGGAGGTGGAGAACCGAGGCCTGCGGGCTGAGATGGAC GACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCTGAGGCACGCCTGGCCTTCTCCGCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTG GCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAGGAGGCCGAGCTGCTGCGGCGCTCCTCTGCCGAGCTCGAGGACCAGAACAAGCTG CTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCACGAGCTGGACGTGGCGCTGTCGGAGGACAGTTGTTCTGTGCTCAGCGAACCTTCA CAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGCGAGCTCAGCGGCAAGGTCAAGAAGCTGCAGTACGAGAACCGCGTGCTCCTCTCC AACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGTACGCGGCCCATGCTGGAGACGGACGCCGAGGCCGGGGACTCTGCCCAGTGTGTG CCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCGGTCCGACTCTGCAGAGCCAGGGAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAG GCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTGGCTGATGCCAATGGCTTCACGGCTGGCCTCCGGCTGTGTCTGGACAACGAGTGT GCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGCGAGGGCCCCAGGGACACCAAGCTCATCCATGCCATCCTGGTGCGCCTGAGCGTG CTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCAGATGCAGTCCTCGGGTGCTCTGTCAAGGAACAGCAGGAGTCCTTCTCATCACTG CCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATTCTTCTGGCAAAAGACCTTGGCTCAGACTTTCAGCCACCTGACTTCAGGGACCTG CCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGCACTGGTGACTTGGACTCTAAGCCCGACCCCAGCCGGAGCTTCAGGCCTTACCGA GCTGAAGACAATGATTCCTATGCCTCTGAGATCAAGGAGCTGCAGCTGGTGCTGGCTGAGGCCCACGACAGCCTCCGGGGCTTGCAAGAG CAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAGGCCGACAATTTCAACCAGAAAATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCG CTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGCCTCCAGCGGAGGCTGGAGCAGAAATTCTGGAGCCAGGAGAAGAACATGCTGGTG CAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTGCTCTTCATGAAGCTCAGGTGGTTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTG CCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGCATGAAGGAGCTGTACTTGCTGATGGAGGAAGAGGAGATAAACGCTCAGCATTCT GATAACAAGGCCTGCACGGGGGACAGCTGGACCCAGAACACGCCCAATGAGTACATCAAGACACTGGCCGACATGAAGGTGACGCTGAAG GAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGTCTGACGGAGCTTCAGCAACAGTTTGCCAAGGCCAAGGCTACCTGGGAGACAGAG CGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAGCTGAAGACAGGGAAGGGGGCCGGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCC CTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTAGCTGAGTCCTACAGCGCTGTCATGGAGCTGACTCGGCAGCTGCAGATCAGTGAG CGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAGCGGCTGCAGGGTGAGAAGCAGCAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAAC CGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGGGTCCTGAAGCACTCGGAGCTGGAGAAGCAGGACAACAGCTGGAAGGAGACACGC AGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAAGTTGAGCTTGGAGGAAATGGTTTAAAGAGAACCAAATCTGTTTCTTCCATGTCT GAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTTGCTGGCGGAGATGCCCGGGGCAAGAAGCTGCCTAACAACCCTGCCTTTGGCTTT GTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACCAAGGAGAAGCCTGGGCTCTCGTCGAGGGACTGCAACCACCTGGGTGCCCTGGCC TGCCAGGACCCCCCAGGGAGGCAGATGCAGCGCAGCTACACGGCTCCTGACAAGACGGGCATCCGAGTCTACTATAGTCCCCCGGTGGCC CGGCGCCTCGGAGTCCCTGTGGTTCATGACAAAGAGGGCAAGATCATTATCGAGCCCGGCTTCCTCTTCACCACAGCCAAGCCCAAAGAG TCGGCCGAGGCTGATGGGCTGGCTGAGAGCTCCTATGGTCGGTGGCTCTGCAACTTCTCACGGCAGCGCCTGGACGGAGGCTCAGCGGGC AGCCCCTCGGCGGCCGGGCCTGGCTTCCCAGCGGCCCTGCATGACTTTGAGATGTCAGGCAACATGAGTGATGACATGAAGGAGATCACC AACTGTGTGCGCCAGGCCATGCGCTCCGGCTCACTGGAGAGGAAAGTGAAGAGCACATCCAGCCAGACGGTGGGCCTGGCCAGTGTGGGC ACACAGACCATCCGCACGGTCAGCGTGGGCCTGCAGACCGACCCACCCCGCAGCAGCCTCCATGGCAAGGCCTGGTCACCCCGCAGCTCT TCGCTCGTGTCTGTGCGCAGCAAGCAGATCTCCTCCTCCCTGGACAAGGTCCATTCGCGCATCGAGCGGCCCTGCTGCTCCCCCAAGTAT GGCTCACCAAAGCTCCAGAGGCGGTCTGTGTCCAAGCTGGACAGCAGCAAGGACCGCAGCCTGTGGAACCTGCACCAGGGCAAGCAGAAC GGCTCGGCCTGGGCCCGCTCCACCACCACGCGGGACAGCCCTGTATTGAGAAACATCAACGATGGACTCTCCAGCCTCTTCAGTGTGGTG GAGCACTCAGGGAGCACGGAGTCTGTCTGGAAACTAGGCATGTCTGAGACGCGGGCCAAGCCCGAGCCTCCCAAGTACGGCATTGTGCAG GAATTCTTCCGTAATGTGTGTGGCCGGGCACCGAGCCCCACCTCATCAGCAGGAGAGGAGGGCACCAAGAAGCCAGAGCCCCTCTCCCCA GCCAGCTACCATCAGCCAGAGGGTGTGGCCAGGATCCTGAACAAGAAGGCAGCCAAGTTGGGCAGCAGTGAGGAGGTCAGACTCACCATG CTCCCCCAGGTGGGGAAGGATGGTGTCCTCCGGGACGGAGATGGAGCCGTGGTCCTTCCCAATGAGGACGCTGTTTGTGACTGTAGTACC CAGTCTCTCACCTCCTGCTTCGCCCGATCGTCCCGCTCTGCCATCCGCCACTCTCCTTCCAAGTGCAGGCTGCACCCTTCAGAGTCCAGC TGGGGTGGGGAGGAGAGGGCACTCCCCCCCAGCGAGTGACAGAGCAGCCAAGCTCCCCGCCTCAACCAGCCCAGCCCCTGGATAGCAGAA GGGAACCAGCAGAGACGAGACGAGGTGAGGCGAGGGGCTGTGTCCTCAGCATTGCCTGGCCCTGGAGGGACAGCAGTGATGCCACTGCCA GAATGCAGCTTTCACATCAAGGTAAAGCCGGGTCTCCTGCTGGCCCCTGGGTGGTGAGCTTCGACTTCCCAGGGGAAGGCAGTGAGTGGG AGAGAGACCAAACCTGGGCTTCCCAAGCATCCACTGAGAGATCTGTCAAGAGCCGATCCCTGGGTCCTAAGAGAGAGCCTTGCCTGGTTC TGCCCATGCCACCCTCTTGGAAGAGCCCAAGAAGGATACATGTCTGGCCATGCCTTTGGGGAAAAGGAGTCGGAGAGATGTTTCCTGCTG ACCATCCACCCCTTCATTTGGGAGGAGACACTGCTGAGAAGAACAGGCTTTGCTCTAGGGCTCCATGTTTGGTTCCTGGTGGAGCCCTGT TGGGCATCATCACCATCACCTCCTTCTCTCCACCACCTCCTCCTCCCAGCCCCACTGCTCTAGATTGCTGGGACACTAGGGAGTATGATA GGGCAGTAGCCAGGGCCATTGCTTAGTGTCCTGGAGCCCTGGATCTCCCTGCCCATAGCCTGGATGCAGCAAGAGCTGGGAGGCGAAGTG GAAACATGCAGGGCTCAGGGTTGGGGAGTGATTGCAATTGCCTTCCTTGCCAAAGTGACTTGGGGCCCCAACGTTCCCAGCAGACCCCTT GAGGACAGAAATAGGTAGAGTCAGTCTCAAGACCTGGTGCATAGATAAATGCCTAAATACACTGCCTTGATCTCAAGTGATCTCAGAGGC CTCTTTCCCTGGCACCCTGAGAGGCAGCAGGCACTACATCTCCACTGTGTTTACATCCTGCAGCTGGTCGGGGGGCAAAGATATTCCCAG TAAGAGATTCTTGGTTGGCCAGGTCAGGCCCAGGAGAACACCAAGAGGCCAGAGCCCAGGACACAGCAGTAGACTGGGGCCTGGAAACAC GTATCTTGCCTAGATTGTTTATTTGAATTTTTCCTACTATAAATATTTAAGGTGGTTTACTTTATTTTAATAATTTAATTTACCCCAAAG TCCCTAAGGTAATTTATTGGAGGTTGAAACATGCATTCTTGCCACTGGGACAACATGAGGCCTCTAACAGCACGGGCAGGCATGGGGTCC CCTGGGTGGACGAGGCCGCTTGGCAGCCAGGTTTGGAGACCTGGCCTCCTGGTCAGCTTTGGAGGGCCCCTCAACAGAGCTGGAGCCCTG CACCCCAACACGGCTGGCCATGTGGCCTCAGAACACTACTTATTACTCAATGCCTGGTACTTGGCCAGCCCCAGCGGTCAGTCTATAAAT ACTCACTGACAAGGTGGAGGGCTGGACGGCCATCACCACTCCCCAGACGTTCTCCATTGCCTGTCTCATTTCCCTCCCTCCCTCTGACAC CTTTCTTCATGAGTCGAACGTGGATTACTAAAGCTCTATTAAGAGTGTGGAGATCCCTCCAAGTTTCCCAAATGAGAACTCACAGGAAAA CAGGACTGAACTTTGAGAATGTTGTTTATCGCAGCTTTGCACATAAACCTGAGTGTCTCCCAGCCTGCCTCGGTTCTCACCAGCCTGCCA GCCTTTTCACCAGCCTCTCTCCTTAGCCTTATGGCCTTTCACGGCTCTTCTCCCTGCCCCAGCTCTGCTGCCCGCCCTTCCTCACGTCCC CTGTGAGCTGCCTGAGCCATTGGTTGGATTTCGATGTGGCTCATTGCAGCATGTGGGGCAGCGCCTCCCATGGCCTCGCCTTGGTGCCGG TGAACCCCTTTTGGTTGCACACATGCTCCCCACACACACATAGACATCAGCCTTCCTGTAACTGACTGGGGACCCAGAGTGGAAACACCA GGATGGATCAGCTTGTCTGCAGAATTGCCCATCAGGAAGACCAAAAGCCAGTAGCTTTGCTGATCTGCCCCCAGGACTCTGGAGGCGCCC CTGCACTCCCACCTCCCACCTGCCAGTTCCCAGACCCACCCATTCGGGATCACCTGGACCAGTTACCCCAAGTCCTGCATCTCCCTTCCC TGCAGGCTGAACACCAGGGTCATGCCAGTCCCGCCAGCCGCCTCCTCCATGCCCCAGTGACTGGTGTGGGCAGAGCAGGCAGCCAGTGGA GCTGTGGGCCAGTTCCGCTCTTGGATGCTGCTGCTCTCACCCATGAGGTCAGGGGGGCCCTCCAAGGTTATCTCCAGGTGAGGGGATTCA CATCAGGCCACAAGCCACCAGAGGCCTTCTGCCACCTCCCAGAGCGACAGCCAGGGAGGCTGCGTACTCAGCCTCGGGGAGAAATCCCCG TGGGACCTGAGCCCCAAGACCTACGGACCACTCAGCCTTACCATCGTACCGTCCAGGATTGTCCTTGCCATCTTTGTTGTCTCAGCCAGA CCTTGGTTTTCAGTAAAGCCCCAGTTTCTACTTCCTGCATGCCACTGTGCAAGGCCACTCATCACTGTTCCTGCAGAAGCCTCTGGACGT GGGGCTGGATGGGGTTGAAAATGTTACATGTAAATATTGGTTTGGTTCGGTTTTTAGCATTTTACTTGGTAACTGGTTGTTTTCTTTTTT GGGGTGGGGGGATTGGTTTGTAAAAATTCTCTACTCTTTTGGAATGTGATTTCTAAGTTTGTTGGTTTCTTCAAATGCCTTTTAAGTCTT GGTAACATTCCCAAAGCAGAAAACTGCCTGACCCACAGTGGGGATTCCCTGGAGAATTGGGGTCCCAAGAAGGAATGCTGCCCTTCTCGA ACCCGTTCTCCCCCTTCCTCCTGCCTCTCTGCCTTTTACTGCTATTCCCTTCTTCTCCTCCTTTATCCTTCTTTCTGTTTTCCCCATCTC CACTCTCTCTTCAACCAAAGTCCCAAGGAACCCTCGGGGCTCAATCCCCCATAGACCACTTGGCTTGGGTCCATGGGGTTGGCATCAGTT GGTTGGCGGAAATGGGGGACCAGTTGGCATGATGGCCCTAAACTGGGAAACCTCATGTTTCTTATGTCTCACCTCTTTCCAGAGCCAAAT CAGCCCCTTTTGGAATGATGACTTCATTGGAATGCAAATCAAGTCATTTTGGTGCATCAGTGGCTCTTAGGCCTGCACACACGAGACATC AGAATCCAATCCTCTGACCCTGTGCCAGCCCTTTCCCCCAGTTTATTTCCCACCAAAGGCTGACCTCTAAGAGGTCTTGCTTTCTATGAA CTCAAGATGGGTCCCACCTCTAGGTGTCCCCAGGTGCACTCTTCTACCGGTTGGCTTCCGATGTGACAAGGCCAAGGGCCCAAAGACTTG ACCCTCTTACACCCTTGCTGACATGGTTCCATCATGTCCACCCGCATGCACTTTTATGGTTTCATCACCCAGCCTCTTCTCCTCTGGCCC ACCCAGCGTCCAGGCTCTTTCTCCCTCTCCCCTCCTATCTAGAATGTCCCCTGCTTCTAGCCTCACCAGACCCCCCAAGCTCCCACTACT TCTTCCATAATAATAGTAATAACAATGGTTATCATCATCCCCTGCACATCCCGCCTAAAGCACTTTACTATATAGAAAACGTTTCCCCTG GCCGGGCATGGTGGCTCACGCCTGAAATCCCAGCTCTTTGGGAGGCTGAGGCGAGCGGATCAGTTGAGGTCAGGAGTTCAACGCCAGCCT GGCCAACGTGGTGAATCCCTGTCTGTACTAAAAGTACAAAAAATTAGCTGAGCATGGTGGTGCGTGCCTGTAATCCCAGCTACTCGGGAG GCTGAGGTGGGAGAATCGCTTGAGCCCAGGAGGCGGAGGTTGCAGGAGCAGAGATTGCGCCACTGCACTCCAGCCTGGATGACAGAGTGA GACCCAATCTCAAAAAAGAAATCGTTTCCCACCCCACATCTCCTTCAGACCTCTCAGGGACAACTCTGGGAGGCAGCCTTGGCAGGACAT GGGTTAGTGCGCCCATTTTGCTGTGAGGAAACTGAGGTACAGGTCTCATCCCAGAGCATGAGAAGTCACTGAGTTTAGATGAGAACTTGG GTCCAACTCTGTCCTGTTTGCTGTGCAAATCCGCTGCCCTGCTGGGGGCTTTTGGTGGGTCCAGAATACCCAGAATATGCTGCTGGCCAA CCCAGGCATAAAACAAGTCCATTCTAGATCACTGAGCCTTGTGTATTCCAGAGGGTGATCTGAGGTCCCCATTCAGCAGAATTCTCTGAG GGCATGTTCAGAATGTAGATTCCTGGGCCCCACCTTGAATTTGCATGTTTAACAAACTCTCCTGGGGTTGAGGGGTGGGTGCAGTGGTCA CACCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGTGGATCACTTGAGCTCAGGAGTTCGAGCCTGGCCAATATGGTGAAACCCTGTC TCTACTAAAAATGCAAAAATTAGCCAGGTGTGGTGGCACATGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCACTTGAA CCTGGTGGGGAGCAGCGGTTGCAGTGAGCCGAGATTGTGCCATTGCGCTCTAGCCTGGGCAACGGAATGAGACTTGTCTCAAAAAAAATA AAAAATAAAACCAGCCCTCCCCGGGGGATCTTAGGCACTATTGGCCACACCATTGGTGTTCGTGGCCCTGATTATTAGGCTATCTTTCTT TTTTTAAGTTTTTTTAGATTTATTTTTTATTTTATTTATTTATTTATTTATTATTTATTATTATTATTTTTAGAGACAGGGGGTCTCCCT ATGTTGCCCAGGCTGGTTTCAAACTCCTGGGCTCAAGTGATCTGCCCTCCTCAGCCTCCCAAAGTGCTGGGGTTACAGGCAGGCATCAGC CACCGTGCCAGGTCATCTTCCTTTTTCTTTTTTTTTTGGAGACAGAGTCTTGCTCTGTCATTCAGGCTGGAGTACAGTGGCGCCATTTCA GCTCACCGCAGCCTCCACCTCCCAGGTTCAAGCAGTTCTCCTGCCTCAACCTCCCGAGTAACTGGGATTACAGGTGTGTGCCACCACACC CGACTAATTTTTTTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCACCCACC TCGGCCTCCCAAAGTACTGGGATTATAGGCGTGAGCCACGGCATCCAGCCTCATCTTTCTTTTAACCAATAAACATGATGCTGTATCTTA AAAAGAGCACTGAGCAGGGACTTAAGGGATCGAGTCCTCAACCAAACTGATTTAATTACTCAGGATTTTCAAAAAGCATCAGAGGCTATT TACAATCTTAATCATAGGGGTTCAGTAAAATAAAAATAAGAAGTAAAAAAGCAAGAGAAATTATTCTGTAAATCTAACTGGTGTAATTCC CATAATCATGCAATTAAGTTTTACTCTTGAGTTTCCTGACAGCCATTGGTAAAAAGAGAAACACATCAGGATTTATAATTTTTATCATCC AATTATGGGAAGCAAGCATGTTGGCCCCAGGAGACGAACTCTTCTACTAATTTATAGCATGTATCCTTTCATCAAGGGCCACTGAGCCAG TTGGTGAGTCAACGGGTGAACCTAAGATGCAAGGATGTTTTCCAGGTGACTATTTAAAAAAATAAATAATCCACCAACGTGATTGACCTT GGCGAGATCATGTTTCTAGTCTATACCTCAGTTTCCCCATCTGTAAAGTGAGGATAATGTCCCACCCCATGTAACTGTGGTGAGGACCAA CTGCAACACTGTGCCTGCGAGTCTCCTTGGAAAAGTGTAAGGTTCTACACAAATGGAAAGTGATCTGATCACACTCAGTGTCCCCAGCCC AGCCTTTCAGTGCCCTGGCCCTGGGGTGGGGGACAATACTCTCCTCACCCCCTTCACTAGTCTTCATGAATAGCAAGGAGGCCATAACAT AATTTGGTCTAAACCCCTTCCTTTTTAAAAGAATGATGGCAAAATGTGCATAACATTAAATTTACCATTTTAAAGATTTCTAAGTGTCTC GAAGTACATTTGCAATGTGTAACTGCCACCTCCAGAACTTTTTCATCATCCTAAACTGAAACTCTGTACCCACTAAACAGTAACTCCCCA CTCCCCCTGTCCCCAGTCCCTGGTAACCTCTATTCTACTTTTTTTTTTTCTCTGTGGATTGGCCTATTCTAGGTATCTCTTTTAAGTAGA ATCATATAGTATTTGTCCTTTTGTGTCCAGCCCCCTCATTTTTTGAGATGAGGAATCAGGCCCAGAGAAGGCAGTGGCTCACCCAAGGTC ACATCGCAAACCAGAGGCAGAGCCAAGACCAGAACCCAGGTTTCCTGACTCCTAACAGGTCTTGCATCCCCAGTAGTGAATGAGACACTG CTTTGACTTTCTGTAATCTTGGTTTAGCCCCTTCCTTTCTCTGGGCTCAGTCTGCTTCCCACCACTCACCCATTCTTTCTTTATCATCTC CCTCCAAAGCCTCTTGTCCTCCTGCCTCCTCTTCTCCTTGGCTGGTTCCTGCCAGCAAATGGAGTGGCCCTGGTCCATGCCAGGTTTTCC TCTTCTGGGTCCGGAGCTCACTATAGTATTCAGCCCTCAGTCCTCCCAGGATGTTCTTCCCAGCACTGCTGCCTCGTGCGGATTTTCCCG TAACCTCAGTAACTGGCTTCTTGTCCCCCTGCTTCCTACCAGGGAAGCCTTCCTGTCCGCGTCTGTGGTTTCCCAACTCTGATATTTGCT CTCAAATGTGGTGGTGTCCTGGTTCTGTGTTTATTTATTTTGTGTTTTCTCACACACACAAAAAAAAGCCTCTTCCTCCAATGCATCAGG AGGCACCAGCCCTGCCAGCCCTTCTCACTGGGCTCACCCTGCCCCAGCAACCCCCCGGTGCCACCCTTAAGCCACACTGTCTTCTCTGTA AGCAGCCTGCCAGCAGCAGCCCCAGCACTTTGCAATGGGCGTGTGTGTGGTGGTGGGTGGGGGGGGCTTGGATCCCTCCTTTTTCCTCCC CTGCCCTGCCCAGGCCCAGATGGCCTTGACTGTAAAGCAGGTGCTGCCTGACAGGTTCTTCTCTCCCTGTCTCTGGTCATTGATCCATCT CTTTGTCCATTCAGTATCCAACCATCCTCTCCATTCTCCTCTGGACCTCACCACTCTCAGAGCTGCTTGTCCTGGCAGAATCTACAGTTC ACCCCAACTCTATGCCTTACCCCTCCCAACCCAACAGCATTTGCAGTTTGCAAAATATACAGACCCAAGTCCTGAGGGGACTGAGGACAT GATGCTGGGCCCAAGTCTCCTGCTCAGGGCTTCTCTCCAATGCCAGCCCTGCCACTCCTTCCTCACCCTCCTTGGAGCCTCCTCTGCTGC TTGTCTATCCCAACGGCCCTGCTCCCCTCCCTTCCTGCCCTTCACCAGCTTTCTGGACACCATGCCCTGAGGAAGGGACCTTTGGTTTTC TCTAAACATCTTTGAAGGGCTGAGGCAGTCAGGGCTGGCTGCCTTGTCACTCTTTATTTGGAAGCCACTCAAACCATTCCCAAGAAGAGG GACCTCAGCTGGCAATCTGGAAACCTGGCCCAGGTCTGGGCAGATGTCTTCACTTCTCCTACCTTCCCAGTCTTGTGATCCTGTGATGAG CACCAGGATGGCCCTGTGGTCCCTAGAGCACCCCTCATGCTGTAGGGTCCTGCAGCCCCATCCTTTCTCTACTGGGCCCTGGTATCCTGG CTCCTCTCTCAGCTCTGCCACTGATCTCTGTGCCTTAGTTTACTTCTCTGCACGGGGGACTCACCCCAAGACCATTTCCAGCAGCTTCCC AGGTGATGTGGTGCCCCAAGGCTGGGCTTTGCCAGCTGTGGCCCAGCTCCTTAGTGCTGCCCAGGAGACACCAGGCTGCTCAGAATGAGG TGACTGCGGGCACCATTCTCAGCCAGTGGTTCTTGTATTGCATTCCAGCAGCAGGAATATCACCTGGGAACTTGATAGAAGTGCAGATTA GCAGCCCCACCCAAGACCCACTGAATTAGAGCTTGTGGAGTGGGGCCCTACAAGCTGGGGTTTTAAGGAGCCCTCCAAATGATTCTGACG CATAAGAATATGCCAACTGCTGATCTGGGCTAGCCATTAGTAGAGCCTGGGGAGGGACTGGGACTGGCTAGGCCAAGAACAGGTGGAAAA CACCAGCCTTATCTGGACTCCTGAGATTGGGAACCACCACCAACAAAAACCAACCCTATAGTCGCTCCTCTTGGAAGAGGAAGAGAAGTT GAAGGGCCTGGAGAAAGCACACATTGTTTGTTTCCCTGCTCCTGCTCACCTCTCTCACTTGTCTTGGTTTTACAAAAGGCTGTGTGGATG GTGCCAGCCAGGGAGGGGGTGGGAGTCCTGGGGAGGCAGGAGGCAGAAGACCCTGACTGTTTCTCCCTTGGGAACCTCACCATAGGCCAG ATAGCGCCTCTTCAAACTGAAAGAAATCTTAACTCCACAAAGAAAGCATCCTAAATCCCCAGTTCCTCCTCCTCCCAACCCCAGGGATAC CTTGTAGACAGTGCCAAAAAACAGCTCCAACCCCCAGCAGCTGGGAAGAGAGCCAGAAGCTGCCCTTCCTCCTCATCCTGGCCCCTCCCC AGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCAGCCTGGTTTCCTGGTGAGGGTCCTGCAGTCATGGGCCCTGGGGGACCCCCAGGG CAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCTTGGGCCAACTGTCCACCTCTCTTGTCCACTATTCTCTCCTTTCCACTTCTGTCT TCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAGGACAACTGCAGTCCAATCCACCAGCTCTCCCTGCCCCTGTGTCTTATTTCAGAC >8490_8490_5_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000395913_SOGA1_chr20_35445872_ENST00000237536_length(amino acids)=1549AA_BP=245 MGRRWEPTGIMDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSNSSQRIPLQAQKLVSSHKPVQNQKQKQLQ ATSVPHPVSRPLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLE KAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATHSLKKRGTRSLGKADKKTLVQEDSA DLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENR GLRAEMDDMKDHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELAKFRSEHELDVALSEDSC SVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEV LPGLREQAALVSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELNAFTRKADAVLGCSVKEQ QESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHD SLRGLQEQLSQERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQFKHNFLLLFMKLRWFLKR WRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKA KATWETERAELKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQEKLQLVERLQGEKQQVEQ QVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLP NNPAFGFVSSEPGDPEKDTKEKPGLSSRDCNHLGALACQDPPGRQMQRSYTAPDKTGIRVYYSPPVARRLGVPVVHDKEGKIIIEPGFLF TTAKPKESAEADGLAESSYGRWLCNFSRQRLDGGSAGSPSAAGPGFPAALHDFEMSGNMSDDMKEITNCVRQAMRSGSLERKVKSTSSQT VGLASVGTQTIRTVSVGLQTDPPRSSLHGKAWSPRSSSLVSVRSKQISSSLDKVHSRIERPCCSPKYGSPKLQRRSVSKLDSSKDRSLWN LHQGKQNGSAWARSTTTRDSPVLRNINDGLSSLFSVVEHSGSTESVWKLGMSETRAKPEPPKYGIVQEFFRNVCGRAPSPTSSAGEEGTK KPEPLSPASYHQPEGVARILNKKAAKLGSSEEVRLTMLPQVGKDGVLRDGDGAVVLPNEDAVCDCSTQSLTSCFARSSRSAIRHSPSKCR -------------------------------------------------------------- >8490_8490_6_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000395913_SOGA1_chr20_35445872_ENST00000279034_length(transcript)=3913nt_BP=894nt CTTAAACGCGACTCAAGGCGTCGGGTTTGTTGTCAACCAATCACAAGGCAGCCTCGCTCGAGCGCAGGCCAATCGGCTTTCTAGCTAGAG GGTTTAACTCCTATTTAAAAAGAAGAACCTTTGAATTCTAACGGCTGAGCTCTTGGAAGACTTGGGTCCTTGGGTCGCAGGTGGGAGCCG ACGGGCATCATGGACCGATCTAAAGAAAACTGCATTTCAGGACCTGTTAAGGCTACAGCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTG ACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTAAATAGTGGCCAGGCTCAGCGGGTCTTGTGTCCTTCAAATTCTTCCCAGCGCATT CCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAGCCGGTTCAGAATCAGAAGCAGAAGCAATTGCAGGCAACCAGTGTACCTCATCCT GTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCTGAAAATAATCCTGAGGAGGAACTGGCATCAAAA CAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCTTTGGAAGACTTTGAAATTGGTCGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTT TATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTGGCTCTTAAAGTGTTATTTAAAGCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAG CTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGGCATCCTAATATTCTTAGACTGTATGGTTATTTCCATGATGCTACCAGAGTCTAC CTAATTCTGGAATATGCACCACTTGGAACAGTTTATAGAGAACTTCAGAAACTTTCAAAGTTTGATGAGCAGAGAACTGCTACTCATTCC TTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCCGATAAGAAGACTTTGGTGCAGGAGGACAGTGCAGACCTGAAGTGCCAGTTGCAC TTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAGCTCACTAAGCTTGCCAAGGAGAATGACAGCATGAAGGAGGAGCTGCTGAAGTAC CGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCAGCCGAGGAGCTGGCCGATGCCCCCCACTCGCGGGAGACCGAGCTGAAGGTGCAC CTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGCCGCCGCATCGTGGAGCTGGAGGTGGAGAACCGAGGCCTGCGGGCTGAGATGGAC GACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCTGAGGCACGCCTGGCCTTCTCCGCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTG GCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAGGAGGCCGAGCTGCTGCGGCGCTCCTCTGCCGAGCTCGAGGACCAGAACAAGCTG CTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCACGAGCTGGACGTGGCGCTGTCGGAGGACAGTTGTTCTGTGCTCAGCGAACCTTCA CAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGCGAGCTCAGCGGCAAGGTCAAGAAGCTGCAGTACGAGAACCGCGTGCTCCTCTCC AACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGTACGCGGCCCATGCTGGAGACGGACGCCGAGGCCGGGGACTCTGCCCAGTGTGTG CCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCGGTCCGACTCTGCAGAGCCAGGGAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAG GCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTGGCTGATGCCAATGGCTTCACGGCTGGCCTCCGGCTGTGTCTGGACAACGAGTGT GCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGCGAGGGCCCCAGGGACACCAAGCTCATCCATGCCATCCTGGTGCGCCTGAGCGTG CTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCAGATGCAGTCCTCGGGTGCTCTGTCAAGGAACAGCAGGAGTCCTTCTCATCACTG CCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATTCTTCTGGCAAAAGACCTTGGCTCAGACTTTCAGCCACCTGACTTCAGGGACCTG CCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGCACTGGTGACTTGGACTCTAAGCCCGACCCCAGCCGGAGCTTCAGGCCTTACCGA GCTGAAGACAATGATTCCTATGCCTCTGAGATCAAGGAGCTGCAGCTGGTGCTGGCTGAGGCCCACGACAGCCTCCGGGGCTTGCAAGAG CAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAGGCCGACAATTTCAACCAGAAAATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCG CTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGCCTCCAGCGGAGGCTGGAGCAGAAATTCTGGAGCCAGGAGAAGAACATGCTGGTG CAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTGCTCTTCATGAAGCTCAGGTGGTTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTG CCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGCATGAAGGAGCTGTACTTGCTGATGGAGGAAGAGGAGATAAACGCTCAGCATTCT GATAACAAGGCCTGCACGGGGGACAGCTGGACCCAGAACACGCCCAATGAGTACATCAAGACACTGGCCGACATGAAGGTGACGCTGAAG GAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGTCTGACGGAGCTTCAGCAACAGTTTGCCAAGGCCAAGGCTACCTGGGAGACAGAG CGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAGCTGAAGACAGGGAAGGGGGCCGGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCC CTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTAGCTGAGTCCTACAGCGCTGTCATGGAGCTGACTCGGCAGCTGCAGATCAGTGAG CGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAGCGGCTGCAGGGTGAGAAGCAGCAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAAC CGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGGGTCCTGAAGCACTCGGAGCTGGAGAAGCAGGACAACAGCTGGAAGGAGACACGC AGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAAGTTGAGCTTGGAGGAAATGGTTTAAAGAGAACCAAATCTGTTTCTTCCATGTCT GAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTTGCTGGCGGAGATGCCCGGGGCAAGAAGCTGCCTAACAACCCTGCCTTTGGCTTT GTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACCAAGGAGAAGCCTGGGCTCTCGTCGAGGGACTGCAACCACCTGGGTGCCCTGGCC TGCCAGGACCCCCCAGGGAGCCAGAAGCTGCCCTTCCTCCTCATCCTGGCCCCTCCCCAGCCCCCGCCAATACTGTGAACCCCCTTCCCA CTCAGCCTGGTTTCCTGGTGAGGGTCCTGCAGTCATGGGCCCTGGGGGACCCCCAGGGCAAGGCCCATGGGAGGGAAGGGACCAAGGGCA TCCTTGGGCCAACTGTCCACCTCTCTTGTCCACTATTCTCTCCTTTCCACTTCTGTCTTCAAAAGGCTCCTTCCTAGGATGGATCGGGTG CTAGGACAACTGCAGTCCAATCCACCAGCTCTCCCTGCCCCTGTGTCTTATTTCAGACATGAGAATAACTGTACAGTGTAAACTTATAAA >8490_8490_6_AURKA-SOGA1_AURKA_chr20_54956488_ENST00000395913_SOGA1_chr20_35445872_ENST00000279034_length(amino acids)=1142AA_BP=245 MGRRWEPTGIMDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSNSSQRIPLQAQKLVSSHKPVQNQKQKQLQ ATSVPHPVSRPLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLE KAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATHSLKKRGTRSLGKADKKTLVQEDSA DLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENR GLRAEMDDMKDHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELAKFRSEHELDVALSEDSC SVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEV LPGLREQAALVSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELNAFTRKADAVLGCSVKEQ QESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHD SLRGLQEQLSQERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQFKHNFLLLFMKLRWFLKR WRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKA KATWETERAELKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQEKLQLVERLQGEKQQVEQ QVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLP -------------------------------------------------------------- >8490_8490_7_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000312783_SOGA1_chr20_35445872_ENST00000237536_length(transcript)=13906nt_BP=948nt ACAAGGCAGCCTCGCTCGAGCGCAGGCCAATCGGCTTTCTAGCTAGAGGGTTTAACTCCTATTTAAAAAGAAGAACCTTTGAATTCTAAC GGCTGAGCTCTTGGAAGACTTGGGTCCTTGGGTCGCAGGGTCTCACTCCATTGCCCAGGCCAGAGTGCGGGGATATTTGATAAGAAACTT CAGTGAAGGCCGGGCGCGGTGGCTCATGCCCGTAATCCCAGCATTTTCGGAGGCCGAGGCATCATGGACCGATCTAAAGAAAACTGCATT TCAGGACCTGTTAAGGCTACAGCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTGACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTA AATAGTGGCCAGGCTCAGCGGGTCTTGTGTCCTTCAAATTCTTCCCAGCGCATTCCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAG CCGGTTCAGAATCAGAAGCAGAAGCAATTGCAGGCAACCAGTGTACCTCATCCTGTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAG CAGCCCCTGCCATCGGCACCTGAAAATAATCCTGAGGAGGAACTGGCATCAAAACAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCT TTGGAAGACTTTGAAATTGGTCGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTTTATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTG GCTCTTAAAGTGTTATTTAAAGCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAGCTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGG CATCCTAATATTCTTAGACTGTATGGTTATTTCCATGATGCTACCAGAGTCTACCTAATTCTGGAATATGCACCACTTGGAACAGTTTAT AGAGAACTTCAGAAACTTTCAAAGTTTGATGAGCAGAGAACTGCTACTCATTCCTTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCC GATAAGAAGACTTTGGTGCAGGAGGACAGTGCAGACCTGAAGTGCCAGTTGCACTTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAG CTCACTAAGCTTGCCAAGGAGAATGACAGCATGAAGGAGGAGCTGCTGAAGTACCGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCA GCCGAGGAGCTGGCCGATGCCCCCCACTCGCGGGAGACCGAGCTGAAGGTGCACCTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGC CGCCGCATCGTGGAGCTGGAGGTGGAGAACCGAGGCCTGCGGGCTGAGATGGACGACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCT GAGGCACGCCTGGCCTTCTCCGCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTGGCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAG GAGGCCGAGCTGCTGCGGCGCTCCTCTGCCGAGCTCGAGGACCAGAACAAGCTGCTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCAC GAGCTGGACGTGGCGCTGTCGGAGGACAGTTGTTCTGTGCTCAGCGAACCTTCACAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGC GAGCTCAGCGGCAAGGTCAAGAAGCTGCAGTACGAGAACCGCGTGCTCCTCTCCAACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGT ACGCGGCCCATGCTGGAGACGGACGCCGAGGCCGGGGACTCTGCCCAGTGTGTGCCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCG GTCCGACTCTGCAGAGCCAGGGAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAGGCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTG GCTGATGCCAATGGCTTCACGGCTGGCCTCCGGCTGTGTCTGGACAACGAGTGTGCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGC GAGGGCCCCAGGGACACCAAGCTCATCCATGCCATCCTGGTGCGCCTGAGCGTGCTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCA GATGCAGTCCTCGGGTGCTCTGTCAAGGAACAGCAGGAGTCCTTCTCATCACTGCCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATT CTTCTGGCAAAAGACCTTGGCTCAGACTTTCAGCCACCTGACTTCAGGGACCTGCCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGC ACTGGTGACTTGGACTCTAAGCCCGACCCCAGCCGGAGCTTCAGGCCTTACCGAGCTGAAGACAATGATTCCTATGCCTCTGAGATCAAG GAGCTGCAGCTGGTGCTGGCTGAGGCCCACGACAGCCTCCGGGGCTTGCAAGAGCAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAG GCCGACAATTTCAACCAGAAAATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCGCTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGC CTCCAGCGGAGGCTGGAGCAGAAATTCTGGAGCCAGGAGAAGAACATGCTGGTGCAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTG CTCTTCATGAAGCTCAGGTGGTTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTGCCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGC ATGAAGGAGCTGTACTTGCTGATGGAGGAAGAGGAGATAAACGCTCAGCATTCTGATAACAAGGCCTGCACGGGGGACAGCTGGACCCAG AACACGCCCAATGAGTACATCAAGACACTGGCCGACATGAAGGTGACGCTGAAGGAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGT CTGACGGAGCTTCAGCAACAGTTTGCCAAGGCCAAGGCTACCTGGGAGACAGAGCGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAG CTGAAGACAGGGAAGGGGGCCGGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCCCTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTA GCTGAGTCCTACAGCGCTGTCATGGAGCTGACTCGGCAGCTGCAGATCAGTGAGCGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAG CGGCTGCAGGGTGAGAAGCAGCAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAACCGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGG GTCCTGAAGCACTCGGAGCTGGAGAAGCAGGACAACAGCTGGAAGGAGACACGCAGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAA GTTGAGCTTGGAGGAAATGGTTTAAAGAGAACCAAATCTGTTTCTTCCATGTCTGAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTT GCTGGCGGAGATGCCCGGGGCAAGAAGCTGCCTAACAACCCTGCCTTTGGCTTTGTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACC AAGGAGAAGCCTGGGCTCTCGTCGAGGGACTGCAACCACCTGGGTGCCCTGGCCTGCCAGGACCCCCCAGGGAGGCAGATGCAGCGCAGC TACACGGCTCCTGACAAGACGGGCATCCGAGTCTACTATAGTCCCCCGGTGGCCCGGCGCCTCGGAGTCCCTGTGGTTCATGACAAAGAG GGCAAGATCATTATCGAGCCCGGCTTCCTCTTCACCACAGCCAAGCCCAAAGAGTCGGCCGAGGCTGATGGGCTGGCTGAGAGCTCCTAT GGTCGGTGGCTCTGCAACTTCTCACGGCAGCGCCTGGACGGAGGCTCAGCGGGCAGCCCCTCGGCGGCCGGGCCTGGCTTCCCAGCGGCC CTGCATGACTTTGAGATGTCAGGCAACATGAGTGATGACATGAAGGAGATCACCAACTGTGTGCGCCAGGCCATGCGCTCCGGCTCACTG GAGAGGAAAGTGAAGAGCACATCCAGCCAGACGGTGGGCCTGGCCAGTGTGGGCACACAGACCATCCGCACGGTCAGCGTGGGCCTGCAG ACCGACCCACCCCGCAGCAGCCTCCATGGCAAGGCCTGGTCACCCCGCAGCTCTTCGCTCGTGTCTGTGCGCAGCAAGCAGATCTCCTCC TCCCTGGACAAGGTCCATTCGCGCATCGAGCGGCCCTGCTGCTCCCCCAAGTATGGCTCACCAAAGCTCCAGAGGCGGTCTGTGTCCAAG CTGGACAGCAGCAAGGACCGCAGCCTGTGGAACCTGCACCAGGGCAAGCAGAACGGCTCGGCCTGGGCCCGCTCCACCACCACGCGGGAC AGCCCTGTATTGAGAAACATCAACGATGGACTCTCCAGCCTCTTCAGTGTGGTGGAGCACTCAGGGAGCACGGAGTCTGTCTGGAAACTA GGCATGTCTGAGACGCGGGCCAAGCCCGAGCCTCCCAAGTACGGCATTGTGCAGGAATTCTTCCGTAATGTGTGTGGCCGGGCACCGAGC CCCACCTCATCAGCAGGAGAGGAGGGCACCAAGAAGCCAGAGCCCCTCTCCCCAGCCAGCTACCATCAGCCAGAGGGTGTGGCCAGGATC CTGAACAAGAAGGCAGCCAAGTTGGGCAGCAGTGAGGAGGTCAGACTCACCATGCTCCCCCAGGTGGGGAAGGATGGTGTCCTCCGGGAC GGAGATGGAGCCGTGGTCCTTCCCAATGAGGACGCTGTTTGTGACTGTAGTACCCAGTCTCTCACCTCCTGCTTCGCCCGATCGTCCCGC TCTGCCATCCGCCACTCTCCTTCCAAGTGCAGGCTGCACCCTTCAGAGTCCAGCTGGGGTGGGGAGGAGAGGGCACTCCCCCCCAGCGAG TGACAGAGCAGCCAAGCTCCCCGCCTCAACCAGCCCAGCCCCTGGATAGCAGAAGGGAACCAGCAGAGACGAGACGAGGTGAGGCGAGGG GCTGTGTCCTCAGCATTGCCTGGCCCTGGAGGGACAGCAGTGATGCCACTGCCAGAATGCAGCTTTCACATCAAGGTAAAGCCGGGTCTC CTGCTGGCCCCTGGGTGGTGAGCTTCGACTTCCCAGGGGAAGGCAGTGAGTGGGAGAGAGACCAAACCTGGGCTTCCCAAGCATCCACTG AGAGATCTGTCAAGAGCCGATCCCTGGGTCCTAAGAGAGAGCCTTGCCTGGTTCTGCCCATGCCACCCTCTTGGAAGAGCCCAAGAAGGA TACATGTCTGGCCATGCCTTTGGGGAAAAGGAGTCGGAGAGATGTTTCCTGCTGACCATCCACCCCTTCATTTGGGAGGAGACACTGCTG AGAAGAACAGGCTTTGCTCTAGGGCTCCATGTTTGGTTCCTGGTGGAGCCCTGTTGGGCATCATCACCATCACCTCCTTCTCTCCACCAC CTCCTCCTCCCAGCCCCACTGCTCTAGATTGCTGGGACACTAGGGAGTATGATAGGGCAGTAGCCAGGGCCATTGCTTAGTGTCCTGGAG CCCTGGATCTCCCTGCCCATAGCCTGGATGCAGCAAGAGCTGGGAGGCGAAGTGGAAACATGCAGGGCTCAGGGTTGGGGAGTGATTGCA ATTGCCTTCCTTGCCAAAGTGACTTGGGGCCCCAACGTTCCCAGCAGACCCCTTGAGGACAGAAATAGGTAGAGTCAGTCTCAAGACCTG GTGCATAGATAAATGCCTAAATACACTGCCTTGATCTCAAGTGATCTCAGAGGCCTCTTTCCCTGGCACCCTGAGAGGCAGCAGGCACTA CATCTCCACTGTGTTTACATCCTGCAGCTGGTCGGGGGGCAAAGATATTCCCAGTAAGAGATTCTTGGTTGGCCAGGTCAGGCCCAGGAG AACACCAAGAGGCCAGAGCCCAGGACACAGCAGTAGACTGGGGCCTGGAAACACGTATCTTGCCTAGATTGTTTATTTGAATTTTTCCTA CTATAAATATTTAAGGTGGTTTACTTTATTTTAATAATTTAATTTACCCCAAAGTCCCTAAGGTAATTTATTGGAGGTTGAAACATGCAT TCTTGCCACTGGGACAACATGAGGCCTCTAACAGCACGGGCAGGCATGGGGTCCCCTGGGTGGACGAGGCCGCTTGGCAGCCAGGTTTGG AGACCTGGCCTCCTGGTCAGCTTTGGAGGGCCCCTCAACAGAGCTGGAGCCCTGCACCCCAACACGGCTGGCCATGTGGCCTCAGAACAC TACTTATTACTCAATGCCTGGTACTTGGCCAGCCCCAGCGGTCAGTCTATAAATACTCACTGACAAGGTGGAGGGCTGGACGGCCATCAC CACTCCCCAGACGTTCTCCATTGCCTGTCTCATTTCCCTCCCTCCCTCTGACACCTTTCTTCATGAGTCGAACGTGGATTACTAAAGCTC TATTAAGAGTGTGGAGATCCCTCCAAGTTTCCCAAATGAGAACTCACAGGAAAACAGGACTGAACTTTGAGAATGTTGTTTATCGCAGCT TTGCACATAAACCTGAGTGTCTCCCAGCCTGCCTCGGTTCTCACCAGCCTGCCAGCCTTTTCACCAGCCTCTCTCCTTAGCCTTATGGCC TTTCACGGCTCTTCTCCCTGCCCCAGCTCTGCTGCCCGCCCTTCCTCACGTCCCCTGTGAGCTGCCTGAGCCATTGGTTGGATTTCGATG TGGCTCATTGCAGCATGTGGGGCAGCGCCTCCCATGGCCTCGCCTTGGTGCCGGTGAACCCCTTTTGGTTGCACACATGCTCCCCACACA CACATAGACATCAGCCTTCCTGTAACTGACTGGGGACCCAGAGTGGAAACACCAGGATGGATCAGCTTGTCTGCAGAATTGCCCATCAGG AAGACCAAAAGCCAGTAGCTTTGCTGATCTGCCCCCAGGACTCTGGAGGCGCCCCTGCACTCCCACCTCCCACCTGCCAGTTCCCAGACC CACCCATTCGGGATCACCTGGACCAGTTACCCCAAGTCCTGCATCTCCCTTCCCTGCAGGCTGAACACCAGGGTCATGCCAGTCCCGCCA GCCGCCTCCTCCATGCCCCAGTGACTGGTGTGGGCAGAGCAGGCAGCCAGTGGAGCTGTGGGCCAGTTCCGCTCTTGGATGCTGCTGCTC TCACCCATGAGGTCAGGGGGGCCCTCCAAGGTTATCTCCAGGTGAGGGGATTCACATCAGGCCACAAGCCACCAGAGGCCTTCTGCCACC TCCCAGAGCGACAGCCAGGGAGGCTGCGTACTCAGCCTCGGGGAGAAATCCCCGTGGGACCTGAGCCCCAAGACCTACGGACCACTCAGC CTTACCATCGTACCGTCCAGGATTGTCCTTGCCATCTTTGTTGTCTCAGCCAGACCTTGGTTTTCAGTAAAGCCCCAGTTTCTACTTCCT GCATGCCACTGTGCAAGGCCACTCATCACTGTTCCTGCAGAAGCCTCTGGACGTGGGGCTGGATGGGGTTGAAAATGTTACATGTAAATA TTGGTTTGGTTCGGTTTTTAGCATTTTACTTGGTAACTGGTTGTTTTCTTTTTTGGGGTGGGGGGATTGGTTTGTAAAAATTCTCTACTC TTTTGGAATGTGATTTCTAAGTTTGTTGGTTTCTTCAAATGCCTTTTAAGTCTTGGTAACATTCCCAAAGCAGAAAACTGCCTGACCCAC AGTGGGGATTCCCTGGAGAATTGGGGTCCCAAGAAGGAATGCTGCCCTTCTCGAACCCGTTCTCCCCCTTCCTCCTGCCTCTCTGCCTTT TACTGCTATTCCCTTCTTCTCCTCCTTTATCCTTCTTTCTGTTTTCCCCATCTCCACTCTCTCTTCAACCAAAGTCCCAAGGAACCCTCG GGGCTCAATCCCCCATAGACCACTTGGCTTGGGTCCATGGGGTTGGCATCAGTTGGTTGGCGGAAATGGGGGACCAGTTGGCATGATGGC CCTAAACTGGGAAACCTCATGTTTCTTATGTCTCACCTCTTTCCAGAGCCAAATCAGCCCCTTTTGGAATGATGACTTCATTGGAATGCA AATCAAGTCATTTTGGTGCATCAGTGGCTCTTAGGCCTGCACACACGAGACATCAGAATCCAATCCTCTGACCCTGTGCCAGCCCTTTCC CCCAGTTTATTTCCCACCAAAGGCTGACCTCTAAGAGGTCTTGCTTTCTATGAACTCAAGATGGGTCCCACCTCTAGGTGTCCCCAGGTG CACTCTTCTACCGGTTGGCTTCCGATGTGACAAGGCCAAGGGCCCAAAGACTTGACCCTCTTACACCCTTGCTGACATGGTTCCATCATG TCCACCCGCATGCACTTTTATGGTTTCATCACCCAGCCTCTTCTCCTCTGGCCCACCCAGCGTCCAGGCTCTTTCTCCCTCTCCCCTCCT ATCTAGAATGTCCCCTGCTTCTAGCCTCACCAGACCCCCCAAGCTCCCACTACTTCTTCCATAATAATAGTAATAACAATGGTTATCATC ATCCCCTGCACATCCCGCCTAAAGCACTTTACTATATAGAAAACGTTTCCCCTGGCCGGGCATGGTGGCTCACGCCTGAAATCCCAGCTC TTTGGGAGGCTGAGGCGAGCGGATCAGTTGAGGTCAGGAGTTCAACGCCAGCCTGGCCAACGTGGTGAATCCCTGTCTGTACTAAAAGTA CAAAAAATTAGCTGAGCATGGTGGTGCGTGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGTGGGAGAATCGCTTGAGCCCAGGAGGCGG AGGTTGCAGGAGCAGAGATTGCGCCACTGCACTCCAGCCTGGATGACAGAGTGAGACCCAATCTCAAAAAAGAAATCGTTTCCCACCCCA CATCTCCTTCAGACCTCTCAGGGACAACTCTGGGAGGCAGCCTTGGCAGGACATGGGTTAGTGCGCCCATTTTGCTGTGAGGAAACTGAG GTACAGGTCTCATCCCAGAGCATGAGAAGTCACTGAGTTTAGATGAGAACTTGGGTCCAACTCTGTCCTGTTTGCTGTGCAAATCCGCTG CCCTGCTGGGGGCTTTTGGTGGGTCCAGAATACCCAGAATATGCTGCTGGCCAACCCAGGCATAAAACAAGTCCATTCTAGATCACTGAG CCTTGTGTATTCCAGAGGGTGATCTGAGGTCCCCATTCAGCAGAATTCTCTGAGGGCATGTTCAGAATGTAGATTCCTGGGCCCCACCTT GAATTTGCATGTTTAACAAACTCTCCTGGGGTTGAGGGGTGGGTGCAGTGGTCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGG GTGGATCACTTGAGCTCAGGAGTTCGAGCCTGGCCAATATGGTGAAACCCTGTCTCTACTAAAAATGCAAAAATTAGCCAGGTGTGGTGG CACATGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGTGGGGAGCAGCGGTTGCAGTGAGCCGAGATT GTGCCATTGCGCTCTAGCCTGGGCAACGGAATGAGACTTGTCTCAAAAAAAATAAAAAATAAAACCAGCCCTCCCCGGGGGATCTTAGGC ACTATTGGCCACACCATTGGTGTTCGTGGCCCTGATTATTAGGCTATCTTTCTTTTTTTAAGTTTTTTTAGATTTATTTTTTATTTTATT TATTTATTTATTTATTATTTATTATTATTATTTTTAGAGACAGGGGGTCTCCCTATGTTGCCCAGGCTGGTTTCAAACTCCTGGGCTCAA GTGATCTGCCCTCCTCAGCCTCCCAAAGTGCTGGGGTTACAGGCAGGCATCAGCCACCGTGCCAGGTCATCTTCCTTTTTCTTTTTTTTT TGGAGACAGAGTCTTGCTCTGTCATTCAGGCTGGAGTACAGTGGCGCCATTTCAGCTCACCGCAGCCTCCACCTCCCAGGTTCAAGCAGT TCTCCTGCCTCAACCTCCCGAGTAACTGGGATTACAGGTGTGTGCCACCACACCCGACTAATTTTTTTATTTTTAGTAGAGACGGGGTTT CACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCACCCACCTCGGCCTCCCAAAGTACTGGGATTATAGGCGTGAGC CACGGCATCCAGCCTCATCTTTCTTTTAACCAATAAACATGATGCTGTATCTTAAAAAGAGCACTGAGCAGGGACTTAAGGGATCGAGTC CTCAACCAAACTGATTTAATTACTCAGGATTTTCAAAAAGCATCAGAGGCTATTTACAATCTTAATCATAGGGGTTCAGTAAAATAAAAA TAAGAAGTAAAAAAGCAAGAGAAATTATTCTGTAAATCTAACTGGTGTAATTCCCATAATCATGCAATTAAGTTTTACTCTTGAGTTTCC TGACAGCCATTGGTAAAAAGAGAAACACATCAGGATTTATAATTTTTATCATCCAATTATGGGAAGCAAGCATGTTGGCCCCAGGAGACG AACTCTTCTACTAATTTATAGCATGTATCCTTTCATCAAGGGCCACTGAGCCAGTTGGTGAGTCAACGGGTGAACCTAAGATGCAAGGAT GTTTTCCAGGTGACTATTTAAAAAAATAAATAATCCACCAACGTGATTGACCTTGGCGAGATCATGTTTCTAGTCTATACCTCAGTTTCC CCATCTGTAAAGTGAGGATAATGTCCCACCCCATGTAACTGTGGTGAGGACCAACTGCAACACTGTGCCTGCGAGTCTCCTTGGAAAAGT GTAAGGTTCTACACAAATGGAAAGTGATCTGATCACACTCAGTGTCCCCAGCCCAGCCTTTCAGTGCCCTGGCCCTGGGGTGGGGGACAA TACTCTCCTCACCCCCTTCACTAGTCTTCATGAATAGCAAGGAGGCCATAACATAATTTGGTCTAAACCCCTTCCTTTTTAAAAGAATGA TGGCAAAATGTGCATAACATTAAATTTACCATTTTAAAGATTTCTAAGTGTCTCGAAGTACATTTGCAATGTGTAACTGCCACCTCCAGA ACTTTTTCATCATCCTAAACTGAAACTCTGTACCCACTAAACAGTAACTCCCCACTCCCCCTGTCCCCAGTCCCTGGTAACCTCTATTCT ACTTTTTTTTTTTCTCTGTGGATTGGCCTATTCTAGGTATCTCTTTTAAGTAGAATCATATAGTATTTGTCCTTTTGTGTCCAGCCCCCT CATTTTTTGAGATGAGGAATCAGGCCCAGAGAAGGCAGTGGCTCACCCAAGGTCACATCGCAAACCAGAGGCAGAGCCAAGACCAGAACC CAGGTTTCCTGACTCCTAACAGGTCTTGCATCCCCAGTAGTGAATGAGACACTGCTTTGACTTTCTGTAATCTTGGTTTAGCCCCTTCCT TTCTCTGGGCTCAGTCTGCTTCCCACCACTCACCCATTCTTTCTTTATCATCTCCCTCCAAAGCCTCTTGTCCTCCTGCCTCCTCTTCTC CTTGGCTGGTTCCTGCCAGCAAATGGAGTGGCCCTGGTCCATGCCAGGTTTTCCTCTTCTGGGTCCGGAGCTCACTATAGTATTCAGCCC TCAGTCCTCCCAGGATGTTCTTCCCAGCACTGCTGCCTCGTGCGGATTTTCCCGTAACCTCAGTAACTGGCTTCTTGTCCCCCTGCTTCC TACCAGGGAAGCCTTCCTGTCCGCGTCTGTGGTTTCCCAACTCTGATATTTGCTCTCAAATGTGGTGGTGTCCTGGTTCTGTGTTTATTT ATTTTGTGTTTTCTCACACACACAAAAAAAAGCCTCTTCCTCCAATGCATCAGGAGGCACCAGCCCTGCCAGCCCTTCTCACTGGGCTCA CCCTGCCCCAGCAACCCCCCGGTGCCACCCTTAAGCCACACTGTCTTCTCTGTAAGCAGCCTGCCAGCAGCAGCCCCAGCACTTTGCAAT GGGCGTGTGTGTGGTGGTGGGTGGGGGGGGCTTGGATCCCTCCTTTTTCCTCCCCTGCCCTGCCCAGGCCCAGATGGCCTTGACTGTAAA GCAGGTGCTGCCTGACAGGTTCTTCTCTCCCTGTCTCTGGTCATTGATCCATCTCTTTGTCCATTCAGTATCCAACCATCCTCTCCATTC TCCTCTGGACCTCACCACTCTCAGAGCTGCTTGTCCTGGCAGAATCTACAGTTCACCCCAACTCTATGCCTTACCCCTCCCAACCCAACA GCATTTGCAGTTTGCAAAATATACAGACCCAAGTCCTGAGGGGACTGAGGACATGATGCTGGGCCCAAGTCTCCTGCTCAGGGCTTCTCT CCAATGCCAGCCCTGCCACTCCTTCCTCACCCTCCTTGGAGCCTCCTCTGCTGCTTGTCTATCCCAACGGCCCTGCTCCCCTCCCTTCCT GCCCTTCACCAGCTTTCTGGACACCATGCCCTGAGGAAGGGACCTTTGGTTTTCTCTAAACATCTTTGAAGGGCTGAGGCAGTCAGGGCT GGCTGCCTTGTCACTCTTTATTTGGAAGCCACTCAAACCATTCCCAAGAAGAGGGACCTCAGCTGGCAATCTGGAAACCTGGCCCAGGTC TGGGCAGATGTCTTCACTTCTCCTACCTTCCCAGTCTTGTGATCCTGTGATGAGCACCAGGATGGCCCTGTGGTCCCTAGAGCACCCCTC ATGCTGTAGGGTCCTGCAGCCCCATCCTTTCTCTACTGGGCCCTGGTATCCTGGCTCCTCTCTCAGCTCTGCCACTGATCTCTGTGCCTT AGTTTACTTCTCTGCACGGGGGACTCACCCCAAGACCATTTCCAGCAGCTTCCCAGGTGATGTGGTGCCCCAAGGCTGGGCTTTGCCAGC TGTGGCCCAGCTCCTTAGTGCTGCCCAGGAGACACCAGGCTGCTCAGAATGAGGTGACTGCGGGCACCATTCTCAGCCAGTGGTTCTTGT ATTGCATTCCAGCAGCAGGAATATCACCTGGGAACTTGATAGAAGTGCAGATTAGCAGCCCCACCCAAGACCCACTGAATTAGAGCTTGT GGAGTGGGGCCCTACAAGCTGGGGTTTTAAGGAGCCCTCCAAATGATTCTGACGCATAAGAATATGCCAACTGCTGATCTGGGCTAGCCA TTAGTAGAGCCTGGGGAGGGACTGGGACTGGCTAGGCCAAGAACAGGTGGAAAACACCAGCCTTATCTGGACTCCTGAGATTGGGAACCA CCACCAACAAAAACCAACCCTATAGTCGCTCCTCTTGGAAGAGGAAGAGAAGTTGAAGGGCCTGGAGAAAGCACACATTGTTTGTTTCCC TGCTCCTGCTCACCTCTCTCACTTGTCTTGGTTTTACAAAAGGCTGTGTGGATGGTGCCAGCCAGGGAGGGGGTGGGAGTCCTGGGGAGG CAGGAGGCAGAAGACCCTGACTGTTTCTCCCTTGGGAACCTCACCATAGGCCAGATAGCGCCTCTTCAAACTGAAAGAAATCTTAACTCC ACAAAGAAAGCATCCTAAATCCCCAGTTCCTCCTCCTCCCAACCCCAGGGATACCTTGTAGACAGTGCCAAAAAACAGCTCCAACCCCCA GCAGCTGGGAAGAGAGCCAGAAGCTGCCCTTCCTCCTCATCCTGGCCCCTCCCCAGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCA GCCTGGTTTCCTGGTGAGGGTCCTGCAGTCATGGGCCCTGGGGGACCCCCAGGGCAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCT TGGGCCAACTGTCCACCTCTCTTGTCCACTATTCTCTCCTTTCCACTTCTGTCTTCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAG GACAACTGCAGTCCAATCCACCAGCTCTCCCTGCCCCTGTGTCTTATTTCAGACATGAGAATAACTGTACAGTGTAAACTTATAAAGCGT >8490_8490_7_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000312783_SOGA1_chr20_35445872_ENST00000237536_length(amino acids)=1539AA_BP=235 MDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSNSSQRIPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSR PLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRR EVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATHSLKKRGTRSLGKADKKTLVQEDSADLKCQLHFAK EESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMK DHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELAKFRSEHELDVALSEDSCSVLSEPSQEE LAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAAL VSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELNAFTRKADAVLGCSVKEQQESFSSLPPL GSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLS QERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQFKHNFLLLFMKLRWFLKRWRQGKVLPSE GDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAE LKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQEKLQLVERLQGEKQQVEQQVKELQNRLS QLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSS EPGDPEKDTKEKPGLSSRDCNHLGALACQDPPGRQMQRSYTAPDKTGIRVYYSPPVARRLGVPVVHDKEGKIIIEPGFLFTTAKPKESAE ADGLAESSYGRWLCNFSRQRLDGGSAGSPSAAGPGFPAALHDFEMSGNMSDDMKEITNCVRQAMRSGSLERKVKSTSSQTVGLASVGTQT IRTVSVGLQTDPPRSSLHGKAWSPRSSSLVSVRSKQISSSLDKVHSRIERPCCSPKYGSPKLQRRSVSKLDSSKDRSLWNLHQGKQNGSA WARSTTTRDSPVLRNINDGLSSLFSVVEHSGSTESVWKLGMSETRAKPEPPKYGIVQEFFRNVCGRAPSPTSSAGEEGTKKPEPLSPASY HQPEGVARILNKKAAKLGSSEEVRLTMLPQVGKDGVLRDGDGAVVLPNEDAVCDCSTQSLTSCFARSSRSAIRHSPSKCRLHPSESSWGG -------------------------------------------------------------- >8490_8490_8_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000312783_SOGA1_chr20_35445872_ENST00000279034_length(transcript)=3967nt_BP=948nt ACAAGGCAGCCTCGCTCGAGCGCAGGCCAATCGGCTTTCTAGCTAGAGGGTTTAACTCCTATTTAAAAAGAAGAACCTTTGAATTCTAAC GGCTGAGCTCTTGGAAGACTTGGGTCCTTGGGTCGCAGGGTCTCACTCCATTGCCCAGGCCAGAGTGCGGGGATATTTGATAAGAAACTT CAGTGAAGGCCGGGCGCGGTGGCTCATGCCCGTAATCCCAGCATTTTCGGAGGCCGAGGCATCATGGACCGATCTAAAGAAAACTGCATT TCAGGACCTGTTAAGGCTACAGCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTGACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTA AATAGTGGCCAGGCTCAGCGGGTCTTGTGTCCTTCAAATTCTTCCCAGCGCATTCCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAG CCGGTTCAGAATCAGAAGCAGAAGCAATTGCAGGCAACCAGTGTACCTCATCCTGTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAG CAGCCCCTGCCATCGGCACCTGAAAATAATCCTGAGGAGGAACTGGCATCAAAACAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCT TTGGAAGACTTTGAAATTGGTCGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTTTATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTG GCTCTTAAAGTGTTATTTAAAGCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAGCTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGG CATCCTAATATTCTTAGACTGTATGGTTATTTCCATGATGCTACCAGAGTCTACCTAATTCTGGAATATGCACCACTTGGAACAGTTTAT AGAGAACTTCAGAAACTTTCAAAGTTTGATGAGCAGAGAACTGCTACTCATTCCTTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCC GATAAGAAGACTTTGGTGCAGGAGGACAGTGCAGACCTGAAGTGCCAGTTGCACTTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAG CTCACTAAGCTTGCCAAGGAGAATGACAGCATGAAGGAGGAGCTGCTGAAGTACCGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCA GCCGAGGAGCTGGCCGATGCCCCCCACTCGCGGGAGACCGAGCTGAAGGTGCACCTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGC CGCCGCATCGTGGAGCTGGAGGTGGAGAACCGAGGCCTGCGGGCTGAGATGGACGACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCT GAGGCACGCCTGGCCTTCTCCGCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTGGCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAG GAGGCCGAGCTGCTGCGGCGCTCCTCTGCCGAGCTCGAGGACCAGAACAAGCTGCTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCAC GAGCTGGACGTGGCGCTGTCGGAGGACAGTTGTTCTGTGCTCAGCGAACCTTCACAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGC GAGCTCAGCGGCAAGGTCAAGAAGCTGCAGTACGAGAACCGCGTGCTCCTCTCCAACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGT ACGCGGCCCATGCTGGAGACGGACGCCGAGGCCGGGGACTCTGCCCAGTGTGTGCCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCG GTCCGACTCTGCAGAGCCAGGGAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAGGCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTG GCTGATGCCAATGGCTTCACGGCTGGCCTCCGGCTGTGTCTGGACAACGAGTGTGCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGC GAGGGCCCCAGGGACACCAAGCTCATCCATGCCATCCTGGTGCGCCTGAGCGTGCTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCA GATGCAGTCCTCGGGTGCTCTGTCAAGGAACAGCAGGAGTCCTTCTCATCACTGCCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATT CTTCTGGCAAAAGACCTTGGCTCAGACTTTCAGCCACCTGACTTCAGGGACCTGCCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGC ACTGGTGACTTGGACTCTAAGCCCGACCCCAGCCGGAGCTTCAGGCCTTACCGAGCTGAAGACAATGATTCCTATGCCTCTGAGATCAAG GAGCTGCAGCTGGTGCTGGCTGAGGCCCACGACAGCCTCCGGGGCTTGCAAGAGCAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAG GCCGACAATTTCAACCAGAAAATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCGCTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGC CTCCAGCGGAGGCTGGAGCAGAAATTCTGGAGCCAGGAGAAGAACATGCTGGTGCAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTG CTCTTCATGAAGCTCAGGTGGTTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTGCCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGC ATGAAGGAGCTGTACTTGCTGATGGAGGAAGAGGAGATAAACGCTCAGCATTCTGATAACAAGGCCTGCACGGGGGACAGCTGGACCCAG AACACGCCCAATGAGTACATCAAGACACTGGCCGACATGAAGGTGACGCTGAAGGAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGT CTGACGGAGCTTCAGCAACAGTTTGCCAAGGCCAAGGCTACCTGGGAGACAGAGCGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAG CTGAAGACAGGGAAGGGGGCCGGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCCCTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTA GCTGAGTCCTACAGCGCTGTCATGGAGCTGACTCGGCAGCTGCAGATCAGTGAGCGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAG CGGCTGCAGGGTGAGAAGCAGCAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAACCGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGG GTCCTGAAGCACTCGGAGCTGGAGAAGCAGGACAACAGCTGGAAGGAGACACGCAGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAA GTTGAGCTTGGAGGAAATGGTTTAAAGAGAACCAAATCTGTTTCTTCCATGTCTGAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTT GCTGGCGGAGATGCCCGGGGCAAGAAGCTGCCTAACAACCCTGCCTTTGGCTTTGTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACC AAGGAGAAGCCTGGGCTCTCGTCGAGGGACTGCAACCACCTGGGTGCCCTGGCCTGCCAGGACCCCCCAGGGAGCCAGAAGCTGCCCTTC CTCCTCATCCTGGCCCCTCCCCAGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCAGCCTGGTTTCCTGGTGAGGGTCCTGCAGTCAT GGGCCCTGGGGGACCCCCAGGGCAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCTTGGGCCAACTGTCCACCTCTCTTGTCCACTAT TCTCTCCTTTCCACTTCTGTCTTCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAGGACAACTGCAGTCCAATCCACCAGCTCTCCCT GCCCCTGTGTCTTATTTCAGACATGAGAATAACTGTACAGTGTAAACTTATAAAGCGTTTTTAATGGTTGTAGATTGGAAATAAAGTATG >8490_8490_8_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000312783_SOGA1_chr20_35445872_ENST00000279034_length(amino acids)=1132AA_BP=235 MDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSNSSQRIPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSR PLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRR EVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATHSLKKRGTRSLGKADKKTLVQEDSADLKCQLHFAK EESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMK DHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELAKFRSEHELDVALSEDSCSVLSEPSQEE LAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAAL VSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELNAFTRKADAVLGCSVKEQQESFSSLPPL GSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLS QERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQFKHNFLLLFMKLRWFLKRWRQGKVLPSE GDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAE LKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQEKLQLVERLQGEKQQVEQQVKELQNRLS QLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSS -------------------------------------------------------------- >8490_8490_9_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000371356_SOGA1_chr20_35445872_ENST00000237536_length(transcript)=13795nt_BP=837nt CTTGGAAGACTTGGGTCCTTGGGTCGCAGGCTGGAGTGCAATGGTGTGATCTCAGCTCACTGCAACCTCTGCTTCCTGGGTTTAAGTGAT TCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCATCATGGACCGATCTAAAGAAAACTGCATTTCAGGACCTGTTAAGGCTACA GCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTGACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTAAATAGTGGCCAGGCTCAGCGG GTCTTGTGTCCTTCAAATTCTTCCCAGCGCATTCCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAGCCGGTTCAGAATCAGAAGCAG AAGCAATTGCAGGCAACCAGTGTACCTCATCCTGTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCT GAAAATAATCCTGAGGAGGAACTGGCATCAAAACAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCTTTGGAAGACTTTGAAATTGGT CGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTTTATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTGGCTCTTAAAGTGTTATTTAAA GCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAGCTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGGCATCCTAATATTCTTAGACTG TATGGTTATTTCCATGATGCTACCAGAGTCTACCTAATTCTGGAATATGCACCACTTGGAACAGTTTATAGAGAACTTCAGAAACTTTCA AAGTTTGATGAGCAGAGAACTGCTACTCATTCCTTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCCGATAAGAAGACTTTGGTGCAG GAGGACAGTGCAGACCTGAAGTGCCAGTTGCACTTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAGCTCACTAAGCTTGCCAAGGAG AATGACAGCATGAAGGAGGAGCTGCTGAAGTACCGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCAGCCGAGGAGCTGGCCGATGCC CCCCACTCGCGGGAGACCGAGCTGAAGGTGCACCTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGCCGCCGCATCGTGGAGCTGGAG GTGGAGAACCGAGGCCTGCGGGCTGAGATGGACGACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCTGAGGCACGCCTGGCCTTCTCC GCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTGGCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAGGAGGCCGAGCTGCTGCGGCGC TCCTCTGCCGAGCTCGAGGACCAGAACAAGCTGCTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCACGAGCTGGACGTGGCGCTGTCG GAGGACAGTTGTTCTGTGCTCAGCGAACCTTCACAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGCGAGCTCAGCGGCAAGGTCAAG AAGCTGCAGTACGAGAACCGCGTGCTCCTCTCCAACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGTACGCGGCCCATGCTGGAGACG GACGCCGAGGCCGGGGACTCTGCCCAGTGTGTGCCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCGGTCCGACTCTGCAGAGCCAGG GAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAGGCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTGGCTGATGCCAATGGCTTCACG GCTGGCCTCCGGCTGTGTCTGGACAACGAGTGTGCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGCGAGGGCCCCAGGGACACCAAG CTCATCCATGCCATCCTGGTGCGCCTGAGCGTGCTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCAGATGCAGTCCTCGGGTGCTCT GTCAAGGAACAGCAGGAGTCCTTCTCATCACTGCCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATTCTTCTGGCAAAAGACCTTGGC TCAGACTTTCAGCCACCTGACTTCAGGGACCTGCCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGCACTGGTGACTTGGACTCTAAG CCCGACCCCAGCCGGAGCTTCAGGCCTTACCGAGCTGAAGACAATGATTCCTATGCCTCTGAGATCAAGGAGCTGCAGCTGGTGCTGGCT GAGGCCCACGACAGCCTCCGGGGCTTGCAAGAGCAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAGGCCGACAATTTCAACCAGAAA ATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCGCTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGCCTCCAGCGGAGGCTGGAGCAG AAATTCTGGAGCCAGGAGAAGAACATGCTGGTGCAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTGCTCTTCATGAAGCTCAGGTGG TTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTGCCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGCATGAAGGAGCTGTACTTGCTG ATGGAGGAAGAGGAGATAAACGCTCAGCATTCTGATAACAAGGCCTGCACGGGGGACAGCTGGACCCAGAACACGCCCAATGAGTACATC AAGACACTGGCCGACATGAAGGTGACGCTGAAGGAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGTCTGACGGAGCTTCAGCAACAG TTTGCCAAGGCCAAGGCTACCTGGGAGACAGAGCGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAGCTGAAGACAGGGAAGGGGGCC GGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCCCTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTAGCTGAGTCCTACAGCGCTGTC ATGGAGCTGACTCGGCAGCTGCAGATCAGTGAGCGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAGCGGCTGCAGGGTGAGAAGCAG CAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAACCGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGGGTCCTGAAGCACTCGGAGCTG GAGAAGCAGGACAACAGCTGGAAGGAGACACGCAGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAAGTTGAGCTTGGAGGAAATGGT TTAAAGAGAACCAAATCTGTTTCTTCCATGTCTGAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTTGCTGGCGGAGATGCCCGGGGC AAGAAGCTGCCTAACAACCCTGCCTTTGGCTTTGTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACCAAGGAGAAGCCTGGGCTCTCG TCGAGGGACTGCAACCACCTGGGTGCCCTGGCCTGCCAGGACCCCCCAGGGAGGCAGATGCAGCGCAGCTACACGGCTCCTGACAAGACG GGCATCCGAGTCTACTATAGTCCCCCGGTGGCCCGGCGCCTCGGAGTCCCTGTGGTTCATGACAAAGAGGGCAAGATCATTATCGAGCCC GGCTTCCTCTTCACCACAGCCAAGCCCAAAGAGTCGGCCGAGGCTGATGGGCTGGCTGAGAGCTCCTATGGTCGGTGGCTCTGCAACTTC TCACGGCAGCGCCTGGACGGAGGCTCAGCGGGCAGCCCCTCGGCGGCCGGGCCTGGCTTCCCAGCGGCCCTGCATGACTTTGAGATGTCA GGCAACATGAGTGATGACATGAAGGAGATCACCAACTGTGTGCGCCAGGCCATGCGCTCCGGCTCACTGGAGAGGAAAGTGAAGAGCACA TCCAGCCAGACGGTGGGCCTGGCCAGTGTGGGCACACAGACCATCCGCACGGTCAGCGTGGGCCTGCAGACCGACCCACCCCGCAGCAGC CTCCATGGCAAGGCCTGGTCACCCCGCAGCTCTTCGCTCGTGTCTGTGCGCAGCAAGCAGATCTCCTCCTCCCTGGACAAGGTCCATTCG CGCATCGAGCGGCCCTGCTGCTCCCCCAAGTATGGCTCACCAAAGCTCCAGAGGCGGTCTGTGTCCAAGCTGGACAGCAGCAAGGACCGC AGCCTGTGGAACCTGCACCAGGGCAAGCAGAACGGCTCGGCCTGGGCCCGCTCCACCACCACGCGGGACAGCCCTGTATTGAGAAACATC AACGATGGACTCTCCAGCCTCTTCAGTGTGGTGGAGCACTCAGGGAGCACGGAGTCTGTCTGGAAACTAGGCATGTCTGAGACGCGGGCC AAGCCCGAGCCTCCCAAGTACGGCATTGTGCAGGAATTCTTCCGTAATGTGTGTGGCCGGGCACCGAGCCCCACCTCATCAGCAGGAGAG GAGGGCACCAAGAAGCCAGAGCCCCTCTCCCCAGCCAGCTACCATCAGCCAGAGGGTGTGGCCAGGATCCTGAACAAGAAGGCAGCCAAG TTGGGCAGCAGTGAGGAGGTCAGACTCACCATGCTCCCCCAGGTGGGGAAGGATGGTGTCCTCCGGGACGGAGATGGAGCCGTGGTCCTT CCCAATGAGGACGCTGTTTGTGACTGTAGTACCCAGTCTCTCACCTCCTGCTTCGCCCGATCGTCCCGCTCTGCCATCCGCCACTCTCCT TCCAAGTGCAGGCTGCACCCTTCAGAGTCCAGCTGGGGTGGGGAGGAGAGGGCACTCCCCCCCAGCGAGTGACAGAGCAGCCAAGCTCCC CGCCTCAACCAGCCCAGCCCCTGGATAGCAGAAGGGAACCAGCAGAGACGAGACGAGGTGAGGCGAGGGGCTGTGTCCTCAGCATTGCCT GGCCCTGGAGGGACAGCAGTGATGCCACTGCCAGAATGCAGCTTTCACATCAAGGTAAAGCCGGGTCTCCTGCTGGCCCCTGGGTGGTGA GCTTCGACTTCCCAGGGGAAGGCAGTGAGTGGGAGAGAGACCAAACCTGGGCTTCCCAAGCATCCACTGAGAGATCTGTCAAGAGCCGAT CCCTGGGTCCTAAGAGAGAGCCTTGCCTGGTTCTGCCCATGCCACCCTCTTGGAAGAGCCCAAGAAGGATACATGTCTGGCCATGCCTTT GGGGAAAAGGAGTCGGAGAGATGTTTCCTGCTGACCATCCACCCCTTCATTTGGGAGGAGACACTGCTGAGAAGAACAGGCTTTGCTCTA GGGCTCCATGTTTGGTTCCTGGTGGAGCCCTGTTGGGCATCATCACCATCACCTCCTTCTCTCCACCACCTCCTCCTCCCAGCCCCACTG CTCTAGATTGCTGGGACACTAGGGAGTATGATAGGGCAGTAGCCAGGGCCATTGCTTAGTGTCCTGGAGCCCTGGATCTCCCTGCCCATA GCCTGGATGCAGCAAGAGCTGGGAGGCGAAGTGGAAACATGCAGGGCTCAGGGTTGGGGAGTGATTGCAATTGCCTTCCTTGCCAAAGTG ACTTGGGGCCCCAACGTTCCCAGCAGACCCCTTGAGGACAGAAATAGGTAGAGTCAGTCTCAAGACCTGGTGCATAGATAAATGCCTAAA TACACTGCCTTGATCTCAAGTGATCTCAGAGGCCTCTTTCCCTGGCACCCTGAGAGGCAGCAGGCACTACATCTCCACTGTGTTTACATC CTGCAGCTGGTCGGGGGGCAAAGATATTCCCAGTAAGAGATTCTTGGTTGGCCAGGTCAGGCCCAGGAGAACACCAAGAGGCCAGAGCCC AGGACACAGCAGTAGACTGGGGCCTGGAAACACGTATCTTGCCTAGATTGTTTATTTGAATTTTTCCTACTATAAATATTTAAGGTGGTT TACTTTATTTTAATAATTTAATTTACCCCAAAGTCCCTAAGGTAATTTATTGGAGGTTGAAACATGCATTCTTGCCACTGGGACAACATG AGGCCTCTAACAGCACGGGCAGGCATGGGGTCCCCTGGGTGGACGAGGCCGCTTGGCAGCCAGGTTTGGAGACCTGGCCTCCTGGTCAGC TTTGGAGGGCCCCTCAACAGAGCTGGAGCCCTGCACCCCAACACGGCTGGCCATGTGGCCTCAGAACACTACTTATTACTCAATGCCTGG TACTTGGCCAGCCCCAGCGGTCAGTCTATAAATACTCACTGACAAGGTGGAGGGCTGGACGGCCATCACCACTCCCCAGACGTTCTCCAT TGCCTGTCTCATTTCCCTCCCTCCCTCTGACACCTTTCTTCATGAGTCGAACGTGGATTACTAAAGCTCTATTAAGAGTGTGGAGATCCC TCCAAGTTTCCCAAATGAGAACTCACAGGAAAACAGGACTGAACTTTGAGAATGTTGTTTATCGCAGCTTTGCACATAAACCTGAGTGTC TCCCAGCCTGCCTCGGTTCTCACCAGCCTGCCAGCCTTTTCACCAGCCTCTCTCCTTAGCCTTATGGCCTTTCACGGCTCTTCTCCCTGC CCCAGCTCTGCTGCCCGCCCTTCCTCACGTCCCCTGTGAGCTGCCTGAGCCATTGGTTGGATTTCGATGTGGCTCATTGCAGCATGTGGG GCAGCGCCTCCCATGGCCTCGCCTTGGTGCCGGTGAACCCCTTTTGGTTGCACACATGCTCCCCACACACACATAGACATCAGCCTTCCT GTAACTGACTGGGGACCCAGAGTGGAAACACCAGGATGGATCAGCTTGTCTGCAGAATTGCCCATCAGGAAGACCAAAAGCCAGTAGCTT TGCTGATCTGCCCCCAGGACTCTGGAGGCGCCCCTGCACTCCCACCTCCCACCTGCCAGTTCCCAGACCCACCCATTCGGGATCACCTGG ACCAGTTACCCCAAGTCCTGCATCTCCCTTCCCTGCAGGCTGAACACCAGGGTCATGCCAGTCCCGCCAGCCGCCTCCTCCATGCCCCAG TGACTGGTGTGGGCAGAGCAGGCAGCCAGTGGAGCTGTGGGCCAGTTCCGCTCTTGGATGCTGCTGCTCTCACCCATGAGGTCAGGGGGG CCCTCCAAGGTTATCTCCAGGTGAGGGGATTCACATCAGGCCACAAGCCACCAGAGGCCTTCTGCCACCTCCCAGAGCGACAGCCAGGGA GGCTGCGTACTCAGCCTCGGGGAGAAATCCCCGTGGGACCTGAGCCCCAAGACCTACGGACCACTCAGCCTTACCATCGTACCGTCCAGG ATTGTCCTTGCCATCTTTGTTGTCTCAGCCAGACCTTGGTTTTCAGTAAAGCCCCAGTTTCTACTTCCTGCATGCCACTGTGCAAGGCCA CTCATCACTGTTCCTGCAGAAGCCTCTGGACGTGGGGCTGGATGGGGTTGAAAATGTTACATGTAAATATTGGTTTGGTTCGGTTTTTAG CATTTTACTTGGTAACTGGTTGTTTTCTTTTTTGGGGTGGGGGGATTGGTTTGTAAAAATTCTCTACTCTTTTGGAATGTGATTTCTAAG TTTGTTGGTTTCTTCAAATGCCTTTTAAGTCTTGGTAACATTCCCAAAGCAGAAAACTGCCTGACCCACAGTGGGGATTCCCTGGAGAAT TGGGGTCCCAAGAAGGAATGCTGCCCTTCTCGAACCCGTTCTCCCCCTTCCTCCTGCCTCTCTGCCTTTTACTGCTATTCCCTTCTTCTC CTCCTTTATCCTTCTTTCTGTTTTCCCCATCTCCACTCTCTCTTCAACCAAAGTCCCAAGGAACCCTCGGGGCTCAATCCCCCATAGACC ACTTGGCTTGGGTCCATGGGGTTGGCATCAGTTGGTTGGCGGAAATGGGGGACCAGTTGGCATGATGGCCCTAAACTGGGAAACCTCATG TTTCTTATGTCTCACCTCTTTCCAGAGCCAAATCAGCCCCTTTTGGAATGATGACTTCATTGGAATGCAAATCAAGTCATTTTGGTGCAT CAGTGGCTCTTAGGCCTGCACACACGAGACATCAGAATCCAATCCTCTGACCCTGTGCCAGCCCTTTCCCCCAGTTTATTTCCCACCAAA GGCTGACCTCTAAGAGGTCTTGCTTTCTATGAACTCAAGATGGGTCCCACCTCTAGGTGTCCCCAGGTGCACTCTTCTACCGGTTGGCTT CCGATGTGACAAGGCCAAGGGCCCAAAGACTTGACCCTCTTACACCCTTGCTGACATGGTTCCATCATGTCCACCCGCATGCACTTTTAT GGTTTCATCACCCAGCCTCTTCTCCTCTGGCCCACCCAGCGTCCAGGCTCTTTCTCCCTCTCCCCTCCTATCTAGAATGTCCCCTGCTTC TAGCCTCACCAGACCCCCCAAGCTCCCACTACTTCTTCCATAATAATAGTAATAACAATGGTTATCATCATCCCCTGCACATCCCGCCTA AAGCACTTTACTATATAGAAAACGTTTCCCCTGGCCGGGCATGGTGGCTCACGCCTGAAATCCCAGCTCTTTGGGAGGCTGAGGCGAGCG GATCAGTTGAGGTCAGGAGTTCAACGCCAGCCTGGCCAACGTGGTGAATCCCTGTCTGTACTAAAAGTACAAAAAATTAGCTGAGCATGG TGGTGCGTGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGTGGGAGAATCGCTTGAGCCCAGGAGGCGGAGGTTGCAGGAGCAGAGATTG CGCCACTGCACTCCAGCCTGGATGACAGAGTGAGACCCAATCTCAAAAAAGAAATCGTTTCCCACCCCACATCTCCTTCAGACCTCTCAG GGACAACTCTGGGAGGCAGCCTTGGCAGGACATGGGTTAGTGCGCCCATTTTGCTGTGAGGAAACTGAGGTACAGGTCTCATCCCAGAGC ATGAGAAGTCACTGAGTTTAGATGAGAACTTGGGTCCAACTCTGTCCTGTTTGCTGTGCAAATCCGCTGCCCTGCTGGGGGCTTTTGGTG GGTCCAGAATACCCAGAATATGCTGCTGGCCAACCCAGGCATAAAACAAGTCCATTCTAGATCACTGAGCCTTGTGTATTCCAGAGGGTG ATCTGAGGTCCCCATTCAGCAGAATTCTCTGAGGGCATGTTCAGAATGTAGATTCCTGGGCCCCACCTTGAATTTGCATGTTTAACAAAC TCTCCTGGGGTTGAGGGGTGGGTGCAGTGGTCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGTGGATCACTTGAGCTCAGGA GTTCGAGCCTGGCCAATATGGTGAAACCCTGTCTCTACTAAAAATGCAAAAATTAGCCAGGTGTGGTGGCACATGCCTGTAATCCCAGCT ACTTGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGTGGGGAGCAGCGGTTGCAGTGAGCCGAGATTGTGCCATTGCGCTCTAGCCTG GGCAACGGAATGAGACTTGTCTCAAAAAAAATAAAAAATAAAACCAGCCCTCCCCGGGGGATCTTAGGCACTATTGGCCACACCATTGGT GTTCGTGGCCCTGATTATTAGGCTATCTTTCTTTTTTTAAGTTTTTTTAGATTTATTTTTTATTTTATTTATTTATTTATTTATTATTTA TTATTATTATTTTTAGAGACAGGGGGTCTCCCTATGTTGCCCAGGCTGGTTTCAAACTCCTGGGCTCAAGTGATCTGCCCTCCTCAGCCT CCCAAAGTGCTGGGGTTACAGGCAGGCATCAGCCACCGTGCCAGGTCATCTTCCTTTTTCTTTTTTTTTTGGAGACAGAGTCTTGCTCTG TCATTCAGGCTGGAGTACAGTGGCGCCATTTCAGCTCACCGCAGCCTCCACCTCCCAGGTTCAAGCAGTTCTCCTGCCTCAACCTCCCGA GTAACTGGGATTACAGGTGTGTGCCACCACACCCGACTAATTTTTTTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGGT CTTGAACTCCTGACCTCAGGTGATCCACCCACCTCGGCCTCCCAAAGTACTGGGATTATAGGCGTGAGCCACGGCATCCAGCCTCATCTT TCTTTTAACCAATAAACATGATGCTGTATCTTAAAAAGAGCACTGAGCAGGGACTTAAGGGATCGAGTCCTCAACCAAACTGATTTAATT ACTCAGGATTTTCAAAAAGCATCAGAGGCTATTTACAATCTTAATCATAGGGGTTCAGTAAAATAAAAATAAGAAGTAAAAAAGCAAGAG AAATTATTCTGTAAATCTAACTGGTGTAATTCCCATAATCATGCAATTAAGTTTTACTCTTGAGTTTCCTGACAGCCATTGGTAAAAAGA GAAACACATCAGGATTTATAATTTTTATCATCCAATTATGGGAAGCAAGCATGTTGGCCCCAGGAGACGAACTCTTCTACTAATTTATAG CATGTATCCTTTCATCAAGGGCCACTGAGCCAGTTGGTGAGTCAACGGGTGAACCTAAGATGCAAGGATGTTTTCCAGGTGACTATTTAA AAAAATAAATAATCCACCAACGTGATTGACCTTGGCGAGATCATGTTTCTAGTCTATACCTCAGTTTCCCCATCTGTAAAGTGAGGATAA TGTCCCACCCCATGTAACTGTGGTGAGGACCAACTGCAACACTGTGCCTGCGAGTCTCCTTGGAAAAGTGTAAGGTTCTACACAAATGGA AAGTGATCTGATCACACTCAGTGTCCCCAGCCCAGCCTTTCAGTGCCCTGGCCCTGGGGTGGGGGACAATACTCTCCTCACCCCCTTCAC TAGTCTTCATGAATAGCAAGGAGGCCATAACATAATTTGGTCTAAACCCCTTCCTTTTTAAAAGAATGATGGCAAAATGTGCATAACATT AAATTTACCATTTTAAAGATTTCTAAGTGTCTCGAAGTACATTTGCAATGTGTAACTGCCACCTCCAGAACTTTTTCATCATCCTAAACT GAAACTCTGTACCCACTAAACAGTAACTCCCCACTCCCCCTGTCCCCAGTCCCTGGTAACCTCTATTCTACTTTTTTTTTTTCTCTGTGG ATTGGCCTATTCTAGGTATCTCTTTTAAGTAGAATCATATAGTATTTGTCCTTTTGTGTCCAGCCCCCTCATTTTTTGAGATGAGGAATC AGGCCCAGAGAAGGCAGTGGCTCACCCAAGGTCACATCGCAAACCAGAGGCAGAGCCAAGACCAGAACCCAGGTTTCCTGACTCCTAACA GGTCTTGCATCCCCAGTAGTGAATGAGACACTGCTTTGACTTTCTGTAATCTTGGTTTAGCCCCTTCCTTTCTCTGGGCTCAGTCTGCTT CCCACCACTCACCCATTCTTTCTTTATCATCTCCCTCCAAAGCCTCTTGTCCTCCTGCCTCCTCTTCTCCTTGGCTGGTTCCTGCCAGCA AATGGAGTGGCCCTGGTCCATGCCAGGTTTTCCTCTTCTGGGTCCGGAGCTCACTATAGTATTCAGCCCTCAGTCCTCCCAGGATGTTCT TCCCAGCACTGCTGCCTCGTGCGGATTTTCCCGTAACCTCAGTAACTGGCTTCTTGTCCCCCTGCTTCCTACCAGGGAAGCCTTCCTGTC CGCGTCTGTGGTTTCCCAACTCTGATATTTGCTCTCAAATGTGGTGGTGTCCTGGTTCTGTGTTTATTTATTTTGTGTTTTCTCACACAC ACAAAAAAAAGCCTCTTCCTCCAATGCATCAGGAGGCACCAGCCCTGCCAGCCCTTCTCACTGGGCTCACCCTGCCCCAGCAACCCCCCG GTGCCACCCTTAAGCCACACTGTCTTCTCTGTAAGCAGCCTGCCAGCAGCAGCCCCAGCACTTTGCAATGGGCGTGTGTGTGGTGGTGGG TGGGGGGGGCTTGGATCCCTCCTTTTTCCTCCCCTGCCCTGCCCAGGCCCAGATGGCCTTGACTGTAAAGCAGGTGCTGCCTGACAGGTT CTTCTCTCCCTGTCTCTGGTCATTGATCCATCTCTTTGTCCATTCAGTATCCAACCATCCTCTCCATTCTCCTCTGGACCTCACCACTCT CAGAGCTGCTTGTCCTGGCAGAATCTACAGTTCACCCCAACTCTATGCCTTACCCCTCCCAACCCAACAGCATTTGCAGTTTGCAAAATA TACAGACCCAAGTCCTGAGGGGACTGAGGACATGATGCTGGGCCCAAGTCTCCTGCTCAGGGCTTCTCTCCAATGCCAGCCCTGCCACTC CTTCCTCACCCTCCTTGGAGCCTCCTCTGCTGCTTGTCTATCCCAACGGCCCTGCTCCCCTCCCTTCCTGCCCTTCACCAGCTTTCTGGA CACCATGCCCTGAGGAAGGGACCTTTGGTTTTCTCTAAACATCTTTGAAGGGCTGAGGCAGTCAGGGCTGGCTGCCTTGTCACTCTTTAT TTGGAAGCCACTCAAACCATTCCCAAGAAGAGGGACCTCAGCTGGCAATCTGGAAACCTGGCCCAGGTCTGGGCAGATGTCTTCACTTCT CCTACCTTCCCAGTCTTGTGATCCTGTGATGAGCACCAGGATGGCCCTGTGGTCCCTAGAGCACCCCTCATGCTGTAGGGTCCTGCAGCC CCATCCTTTCTCTACTGGGCCCTGGTATCCTGGCTCCTCTCTCAGCTCTGCCACTGATCTCTGTGCCTTAGTTTACTTCTCTGCACGGGG GACTCACCCCAAGACCATTTCCAGCAGCTTCCCAGGTGATGTGGTGCCCCAAGGCTGGGCTTTGCCAGCTGTGGCCCAGCTCCTTAGTGC TGCCCAGGAGACACCAGGCTGCTCAGAATGAGGTGACTGCGGGCACCATTCTCAGCCAGTGGTTCTTGTATTGCATTCCAGCAGCAGGAA TATCACCTGGGAACTTGATAGAAGTGCAGATTAGCAGCCCCACCCAAGACCCACTGAATTAGAGCTTGTGGAGTGGGGCCCTACAAGCTG GGGTTTTAAGGAGCCCTCCAAATGATTCTGACGCATAAGAATATGCCAACTGCTGATCTGGGCTAGCCATTAGTAGAGCCTGGGGAGGGA CTGGGACTGGCTAGGCCAAGAACAGGTGGAAAACACCAGCCTTATCTGGACTCCTGAGATTGGGAACCACCACCAACAAAAACCAACCCT ATAGTCGCTCCTCTTGGAAGAGGAAGAGAAGTTGAAGGGCCTGGAGAAAGCACACATTGTTTGTTTCCCTGCTCCTGCTCACCTCTCTCA CTTGTCTTGGTTTTACAAAAGGCTGTGTGGATGGTGCCAGCCAGGGAGGGGGTGGGAGTCCTGGGGAGGCAGGAGGCAGAAGACCCTGAC TGTTTCTCCCTTGGGAACCTCACCATAGGCCAGATAGCGCCTCTTCAAACTGAAAGAAATCTTAACTCCACAAAGAAAGCATCCTAAATC CCCAGTTCCTCCTCCTCCCAACCCCAGGGATACCTTGTAGACAGTGCCAAAAAACAGCTCCAACCCCCAGCAGCTGGGAAGAGAGCCAGA AGCTGCCCTTCCTCCTCATCCTGGCCCCTCCCCAGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCAGCCTGGTTTCCTGGTGAGGGT CCTGCAGTCATGGGCCCTGGGGGACCCCCAGGGCAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCTTGGGCCAACTGTCCACCTCTC TTGTCCACTATTCTCTCCTTTCCACTTCTGTCTTCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAGGACAACTGCAGTCCAATCCAC CAGCTCTCCCTGCCCCTGTGTCTTATTTCAGACATGAGAATAACTGTACAGTGTAAACTTATAAAGCGTTTTTAATGGTTGTAGATTGGA >8490_8490_9_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000371356_SOGA1_chr20_35445872_ENST00000237536_length(amino acids)=1577AA_BP=3 MGRRLECNGVISAHCNLCFLGLSDSPASASRVAGITGIMDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSN SSQRIPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSRPLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKG KFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQR TATHSLKKRGTRSLGKADKKTLVQEDSADLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRET ELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMKDHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELE DQNKLLLNELAKFRSEHELDVALSEDSCSVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGD SAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAALVSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAIL VRLSVLQQELNAFTRKADAVLGCSVKEQQESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRS FRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLSQERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQE KNMLVQESQQFKHNFLLLFMKLRWFLKRWRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADM KVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAELKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQ LQISERNWSQEKLQLVERLQGEKQQVEQQVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKS VSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSSEPGDPEKDTKEKPGLSSRDCNHLGALACQDPPGRQMQRSYTAPDKTGIRVYY SPPVARRLGVPVVHDKEGKIIIEPGFLFTTAKPKESAEADGLAESSYGRWLCNFSRQRLDGGSAGSPSAAGPGFPAALHDFEMSGNMSDD MKEITNCVRQAMRSGSLERKVKSTSSQTVGLASVGTQTIRTVSVGLQTDPPRSSLHGKAWSPRSSSLVSVRSKQISSSLDKVHSRIERPC CSPKYGSPKLQRRSVSKLDSSKDRSLWNLHQGKQNGSAWARSTTTRDSPVLRNINDGLSSLFSVVEHSGSTESVWKLGMSETRAKPEPPK YGIVQEFFRNVCGRAPSPTSSAGEEGTKKPEPLSPASYHQPEGVARILNKKAAKLGSSEEVRLTMLPQVGKDGVLRDGDGAVVLPNEDAV -------------------------------------------------------------- >8490_8490_10_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000371356_SOGA1_chr20_35445872_ENST00000279034_length(transcript)=3856nt_BP=837nt CTTGGAAGACTTGGGTCCTTGGGTCGCAGGCTGGAGTGCAATGGTGTGATCTCAGCTCACTGCAACCTCTGCTTCCTGGGTTTAAGTGAT TCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCATCATGGACCGATCTAAAGAAAACTGCATTTCAGGACCTGTTAAGGCTACA GCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTGACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTAAATAGTGGCCAGGCTCAGCGG GTCTTGTGTCCTTCAAATTCTTCCCAGCGCATTCCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAGCCGGTTCAGAATCAGAAGCAG AAGCAATTGCAGGCAACCAGTGTACCTCATCCTGTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCT GAAAATAATCCTGAGGAGGAACTGGCATCAAAACAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCTTTGGAAGACTTTGAAATTGGT CGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTTTATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTGGCTCTTAAAGTGTTATTTAAA GCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAGCTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGGCATCCTAATATTCTTAGACTG TATGGTTATTTCCATGATGCTACCAGAGTCTACCTAATTCTGGAATATGCACCACTTGGAACAGTTTATAGAGAACTTCAGAAACTTTCA AAGTTTGATGAGCAGAGAACTGCTACTCATTCCTTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCCGATAAGAAGACTTTGGTGCAG GAGGACAGTGCAGACCTGAAGTGCCAGTTGCACTTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAGCTCACTAAGCTTGCCAAGGAG AATGACAGCATGAAGGAGGAGCTGCTGAAGTACCGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCAGCCGAGGAGCTGGCCGATGCC CCCCACTCGCGGGAGACCGAGCTGAAGGTGCACCTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGCCGCCGCATCGTGGAGCTGGAG GTGGAGAACCGAGGCCTGCGGGCTGAGATGGACGACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCTGAGGCACGCCTGGCCTTCTCC GCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTGGCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAGGAGGCCGAGCTGCTGCGGCGC TCCTCTGCCGAGCTCGAGGACCAGAACAAGCTGCTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCACGAGCTGGACGTGGCGCTGTCG GAGGACAGTTGTTCTGTGCTCAGCGAACCTTCACAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGCGAGCTCAGCGGCAAGGTCAAG AAGCTGCAGTACGAGAACCGCGTGCTCCTCTCCAACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGTACGCGGCCCATGCTGGAGACG GACGCCGAGGCCGGGGACTCTGCCCAGTGTGTGCCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCGGTCCGACTCTGCAGAGCCAGG GAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAGGCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTGGCTGATGCCAATGGCTTCACG GCTGGCCTCCGGCTGTGTCTGGACAACGAGTGTGCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGCGAGGGCCCCAGGGACACCAAG CTCATCCATGCCATCCTGGTGCGCCTGAGCGTGCTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCAGATGCAGTCCTCGGGTGCTCT GTCAAGGAACAGCAGGAGTCCTTCTCATCACTGCCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATTCTTCTGGCAAAAGACCTTGGC TCAGACTTTCAGCCACCTGACTTCAGGGACCTGCCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGCACTGGTGACTTGGACTCTAAG CCCGACCCCAGCCGGAGCTTCAGGCCTTACCGAGCTGAAGACAATGATTCCTATGCCTCTGAGATCAAGGAGCTGCAGCTGGTGCTGGCT GAGGCCCACGACAGCCTCCGGGGCTTGCAAGAGCAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAGGCCGACAATTTCAACCAGAAA ATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCGCTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGCCTCCAGCGGAGGCTGGAGCAG AAATTCTGGAGCCAGGAGAAGAACATGCTGGTGCAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTGCTCTTCATGAAGCTCAGGTGG TTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTGCCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGCATGAAGGAGCTGTACTTGCTG ATGGAGGAAGAGGAGATAAACGCTCAGCATTCTGATAACAAGGCCTGCACGGGGGACAGCTGGACCCAGAACACGCCCAATGAGTACATC AAGACACTGGCCGACATGAAGGTGACGCTGAAGGAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGTCTGACGGAGCTTCAGCAACAG TTTGCCAAGGCCAAGGCTACCTGGGAGACAGAGCGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAGCTGAAGACAGGGAAGGGGGCC GGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCCCTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTAGCTGAGTCCTACAGCGCTGTC ATGGAGCTGACTCGGCAGCTGCAGATCAGTGAGCGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAGCGGCTGCAGGGTGAGAAGCAG CAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAACCGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGGGTCCTGAAGCACTCGGAGCTG GAGAAGCAGGACAACAGCTGGAAGGAGACACGCAGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAAGTTGAGCTTGGAGGAAATGGT TTAAAGAGAACCAAATCTGTTTCTTCCATGTCTGAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTTGCTGGCGGAGATGCCCGGGGC AAGAAGCTGCCTAACAACCCTGCCTTTGGCTTTGTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACCAAGGAGAAGCCTGGGCTCTCG TCGAGGGACTGCAACCACCTGGGTGCCCTGGCCTGCCAGGACCCCCCAGGGAGCCAGAAGCTGCCCTTCCTCCTCATCCTGGCCCCTCCC CAGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCAGCCTGGTTTCCTGGTGAGGGTCCTGCAGTCATGGGCCCTGGGGGACCCCCAGG GCAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCTTGGGCCAACTGTCCACCTCTCTTGTCCACTATTCTCTCCTTTCCACTTCTGTC TTCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAGGACAACTGCAGTCCAATCCACCAGCTCTCCCTGCCCCTGTGTCTTATTTCAGA >8490_8490_10_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000371356_SOGA1_chr20_35445872_ENST00000279034_length(amino acids)=1170AA_BP=3 MGRRLECNGVISAHCNLCFLGLSDSPASASRVAGITGIMDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSN SSQRIPLQAQKLVSSHKPVQNQKQKQLQATSVPHPVSRPLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKG KFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQR TATHSLKKRGTRSLGKADKKTLVQEDSADLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRET ELKVHLKLVEEEANLLSRRIVELEVENRGLRAEMDDMKDHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELE DQNKLLLNELAKFRSEHELDVALSEDSCSVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGD SAQCVPAPLGETHESHAVRLCRAREAEVLPGLREQAALVSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAIL VRLSVLQQELNAFTRKADAVLGCSVKEQQESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRS FRPYRAEDNDSYASEIKELQLVLAEAHDSLRGLQEQLSQERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQE KNMLVQESQQFKHNFLLLFMKLRWFLKRWRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADM KVTLKELCWLLRDERRGLTELQQQFAKAKATWETERAELKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQ LQISERNWSQEKLQLVERLQGEKQQVEQQVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKS VSSMSEFESLLDCSPYLAGGDARGKKLPNNPAFGFVSSEPGDPEKDTKEKPGLSSRDCNHLGALACQDPPGSQKLPFLLILAPPQPPPIL -------------------------------------------------------------- >8490_8490_11_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000395913_SOGA1_chr20_35445872_ENST00000237536_length(transcript)=13852nt_BP=894nt CTTAAACGCGACTCAAGGCGTCGGGTTTGTTGTCAACCAATCACAAGGCAGCCTCGCTCGAGCGCAGGCCAATCGGCTTTCTAGCTAGAG GGTTTAACTCCTATTTAAAAAGAAGAACCTTTGAATTCTAACGGCTGAGCTCTTGGAAGACTTGGGTCCTTGGGTCGCAGGTGGGAGCCG ACGGGCATCATGGACCGATCTAAAGAAAACTGCATTTCAGGACCTGTTAAGGCTACAGCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTG ACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTAAATAGTGGCCAGGCTCAGCGGGTCTTGTGTCCTTCAAATTCTTCCCAGCGCATT CCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAGCCGGTTCAGAATCAGAAGCAGAAGCAATTGCAGGCAACCAGTGTACCTCATCCT GTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCTGAAAATAATCCTGAGGAGGAACTGGCATCAAAA CAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCTTTGGAAGACTTTGAAATTGGTCGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTT TATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTGGCTCTTAAAGTGTTATTTAAAGCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAG CTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGGCATCCTAATATTCTTAGACTGTATGGTTATTTCCATGATGCTACCAGAGTCTAC CTAATTCTGGAATATGCACCACTTGGAACAGTTTATAGAGAACTTCAGAAACTTTCAAAGTTTGATGAGCAGAGAACTGCTACTCATTCC TTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCCGATAAGAAGACTTTGGTGCAGGAGGACAGTGCAGACCTGAAGTGCCAGTTGCAC TTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAGCTCACTAAGCTTGCCAAGGAGAATGACAGCATGAAGGAGGAGCTGCTGAAGTAC CGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCAGCCGAGGAGCTGGCCGATGCCCCCCACTCGCGGGAGACCGAGCTGAAGGTGCAC CTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGCCGCCGCATCGTGGAGCTGGAGGTGGAGAACCGAGGCCTGCGGGCTGAGATGGAC GACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCTGAGGCACGCCTGGCCTTCTCCGCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTG GCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAGGAGGCCGAGCTGCTGCGGCGCTCCTCTGCCGAGCTCGAGGACCAGAACAAGCTG CTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCACGAGCTGGACGTGGCGCTGTCGGAGGACAGTTGTTCTGTGCTCAGCGAACCTTCA CAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGCGAGCTCAGCGGCAAGGTCAAGAAGCTGCAGTACGAGAACCGCGTGCTCCTCTCC AACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGTACGCGGCCCATGCTGGAGACGGACGCCGAGGCCGGGGACTCTGCCCAGTGTGTG CCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCGGTCCGACTCTGCAGAGCCAGGGAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAG GCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTGGCTGATGCCAATGGCTTCACGGCTGGCCTCCGGCTGTGTCTGGACAACGAGTGT GCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGCGAGGGCCCCAGGGACACCAAGCTCATCCATGCCATCCTGGTGCGCCTGAGCGTG CTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCAGATGCAGTCCTCGGGTGCTCTGTCAAGGAACAGCAGGAGTCCTTCTCATCACTG CCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATTCTTCTGGCAAAAGACCTTGGCTCAGACTTTCAGCCACCTGACTTCAGGGACCTG CCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGCACTGGTGACTTGGACTCTAAGCCCGACCCCAGCCGGAGCTTCAGGCCTTACCGA GCTGAAGACAATGATTCCTATGCCTCTGAGATCAAGGAGCTGCAGCTGGTGCTGGCTGAGGCCCACGACAGCCTCCGGGGCTTGCAAGAG CAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAGGCCGACAATTTCAACCAGAAAATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCG CTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGCCTCCAGCGGAGGCTGGAGCAGAAATTCTGGAGCCAGGAGAAGAACATGCTGGTG CAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTGCTCTTCATGAAGCTCAGGTGGTTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTG CCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGCATGAAGGAGCTGTACTTGCTGATGGAGGAAGAGGAGATAAACGCTCAGCATTCT GATAACAAGGCCTGCACGGGGGACAGCTGGACCCAGAACACGCCCAATGAGTACATCAAGACACTGGCCGACATGAAGGTGACGCTGAAG GAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGTCTGACGGAGCTTCAGCAACAGTTTGCCAAGGCCAAGGCTACCTGGGAGACAGAG CGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAGCTGAAGACAGGGAAGGGGGCCGGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCC CTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTAGCTGAGTCCTACAGCGCTGTCATGGAGCTGACTCGGCAGCTGCAGATCAGTGAG CGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAGCGGCTGCAGGGTGAGAAGCAGCAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAAC CGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGGGTCCTGAAGCACTCGGAGCTGGAGAAGCAGGACAACAGCTGGAAGGAGACACGC AGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAAGTTGAGCTTGGAGGAAATGGTTTAAAGAGAACCAAATCTGTTTCTTCCATGTCT GAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTTGCTGGCGGAGATGCCCGGGGCAAGAAGCTGCCTAACAACCCTGCCTTTGGCTTT GTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACCAAGGAGAAGCCTGGGCTCTCGTCGAGGGACTGCAACCACCTGGGTGCCCTGGCC TGCCAGGACCCCCCAGGGAGGCAGATGCAGCGCAGCTACACGGCTCCTGACAAGACGGGCATCCGAGTCTACTATAGTCCCCCGGTGGCC CGGCGCCTCGGAGTCCCTGTGGTTCATGACAAAGAGGGCAAGATCATTATCGAGCCCGGCTTCCTCTTCACCACAGCCAAGCCCAAAGAG TCGGCCGAGGCTGATGGGCTGGCTGAGAGCTCCTATGGTCGGTGGCTCTGCAACTTCTCACGGCAGCGCCTGGACGGAGGCTCAGCGGGC AGCCCCTCGGCGGCCGGGCCTGGCTTCCCAGCGGCCCTGCATGACTTTGAGATGTCAGGCAACATGAGTGATGACATGAAGGAGATCACC AACTGTGTGCGCCAGGCCATGCGCTCCGGCTCACTGGAGAGGAAAGTGAAGAGCACATCCAGCCAGACGGTGGGCCTGGCCAGTGTGGGC ACACAGACCATCCGCACGGTCAGCGTGGGCCTGCAGACCGACCCACCCCGCAGCAGCCTCCATGGCAAGGCCTGGTCACCCCGCAGCTCT TCGCTCGTGTCTGTGCGCAGCAAGCAGATCTCCTCCTCCCTGGACAAGGTCCATTCGCGCATCGAGCGGCCCTGCTGCTCCCCCAAGTAT GGCTCACCAAAGCTCCAGAGGCGGTCTGTGTCCAAGCTGGACAGCAGCAAGGACCGCAGCCTGTGGAACCTGCACCAGGGCAAGCAGAAC GGCTCGGCCTGGGCCCGCTCCACCACCACGCGGGACAGCCCTGTATTGAGAAACATCAACGATGGACTCTCCAGCCTCTTCAGTGTGGTG GAGCACTCAGGGAGCACGGAGTCTGTCTGGAAACTAGGCATGTCTGAGACGCGGGCCAAGCCCGAGCCTCCCAAGTACGGCATTGTGCAG GAATTCTTCCGTAATGTGTGTGGCCGGGCACCGAGCCCCACCTCATCAGCAGGAGAGGAGGGCACCAAGAAGCCAGAGCCCCTCTCCCCA GCCAGCTACCATCAGCCAGAGGGTGTGGCCAGGATCCTGAACAAGAAGGCAGCCAAGTTGGGCAGCAGTGAGGAGGTCAGACTCACCATG CTCCCCCAGGTGGGGAAGGATGGTGTCCTCCGGGACGGAGATGGAGCCGTGGTCCTTCCCAATGAGGACGCTGTTTGTGACTGTAGTACC CAGTCTCTCACCTCCTGCTTCGCCCGATCGTCCCGCTCTGCCATCCGCCACTCTCCTTCCAAGTGCAGGCTGCACCCTTCAGAGTCCAGC TGGGGTGGGGAGGAGAGGGCACTCCCCCCCAGCGAGTGACAGAGCAGCCAAGCTCCCCGCCTCAACCAGCCCAGCCCCTGGATAGCAGAA GGGAACCAGCAGAGACGAGACGAGGTGAGGCGAGGGGCTGTGTCCTCAGCATTGCCTGGCCCTGGAGGGACAGCAGTGATGCCACTGCCA GAATGCAGCTTTCACATCAAGGTAAAGCCGGGTCTCCTGCTGGCCCCTGGGTGGTGAGCTTCGACTTCCCAGGGGAAGGCAGTGAGTGGG AGAGAGACCAAACCTGGGCTTCCCAAGCATCCACTGAGAGATCTGTCAAGAGCCGATCCCTGGGTCCTAAGAGAGAGCCTTGCCTGGTTC TGCCCATGCCACCCTCTTGGAAGAGCCCAAGAAGGATACATGTCTGGCCATGCCTTTGGGGAAAAGGAGTCGGAGAGATGTTTCCTGCTG ACCATCCACCCCTTCATTTGGGAGGAGACACTGCTGAGAAGAACAGGCTTTGCTCTAGGGCTCCATGTTTGGTTCCTGGTGGAGCCCTGT TGGGCATCATCACCATCACCTCCTTCTCTCCACCACCTCCTCCTCCCAGCCCCACTGCTCTAGATTGCTGGGACACTAGGGAGTATGATA GGGCAGTAGCCAGGGCCATTGCTTAGTGTCCTGGAGCCCTGGATCTCCCTGCCCATAGCCTGGATGCAGCAAGAGCTGGGAGGCGAAGTG GAAACATGCAGGGCTCAGGGTTGGGGAGTGATTGCAATTGCCTTCCTTGCCAAAGTGACTTGGGGCCCCAACGTTCCCAGCAGACCCCTT GAGGACAGAAATAGGTAGAGTCAGTCTCAAGACCTGGTGCATAGATAAATGCCTAAATACACTGCCTTGATCTCAAGTGATCTCAGAGGC CTCTTTCCCTGGCACCCTGAGAGGCAGCAGGCACTACATCTCCACTGTGTTTACATCCTGCAGCTGGTCGGGGGGCAAAGATATTCCCAG TAAGAGATTCTTGGTTGGCCAGGTCAGGCCCAGGAGAACACCAAGAGGCCAGAGCCCAGGACACAGCAGTAGACTGGGGCCTGGAAACAC GTATCTTGCCTAGATTGTTTATTTGAATTTTTCCTACTATAAATATTTAAGGTGGTTTACTTTATTTTAATAATTTAATTTACCCCAAAG TCCCTAAGGTAATTTATTGGAGGTTGAAACATGCATTCTTGCCACTGGGACAACATGAGGCCTCTAACAGCACGGGCAGGCATGGGGTCC CCTGGGTGGACGAGGCCGCTTGGCAGCCAGGTTTGGAGACCTGGCCTCCTGGTCAGCTTTGGAGGGCCCCTCAACAGAGCTGGAGCCCTG CACCCCAACACGGCTGGCCATGTGGCCTCAGAACACTACTTATTACTCAATGCCTGGTACTTGGCCAGCCCCAGCGGTCAGTCTATAAAT ACTCACTGACAAGGTGGAGGGCTGGACGGCCATCACCACTCCCCAGACGTTCTCCATTGCCTGTCTCATTTCCCTCCCTCCCTCTGACAC CTTTCTTCATGAGTCGAACGTGGATTACTAAAGCTCTATTAAGAGTGTGGAGATCCCTCCAAGTTTCCCAAATGAGAACTCACAGGAAAA CAGGACTGAACTTTGAGAATGTTGTTTATCGCAGCTTTGCACATAAACCTGAGTGTCTCCCAGCCTGCCTCGGTTCTCACCAGCCTGCCA GCCTTTTCACCAGCCTCTCTCCTTAGCCTTATGGCCTTTCACGGCTCTTCTCCCTGCCCCAGCTCTGCTGCCCGCCCTTCCTCACGTCCC CTGTGAGCTGCCTGAGCCATTGGTTGGATTTCGATGTGGCTCATTGCAGCATGTGGGGCAGCGCCTCCCATGGCCTCGCCTTGGTGCCGG TGAACCCCTTTTGGTTGCACACATGCTCCCCACACACACATAGACATCAGCCTTCCTGTAACTGACTGGGGACCCAGAGTGGAAACACCA GGATGGATCAGCTTGTCTGCAGAATTGCCCATCAGGAAGACCAAAAGCCAGTAGCTTTGCTGATCTGCCCCCAGGACTCTGGAGGCGCCC CTGCACTCCCACCTCCCACCTGCCAGTTCCCAGACCCACCCATTCGGGATCACCTGGACCAGTTACCCCAAGTCCTGCATCTCCCTTCCC TGCAGGCTGAACACCAGGGTCATGCCAGTCCCGCCAGCCGCCTCCTCCATGCCCCAGTGACTGGTGTGGGCAGAGCAGGCAGCCAGTGGA GCTGTGGGCCAGTTCCGCTCTTGGATGCTGCTGCTCTCACCCATGAGGTCAGGGGGGCCCTCCAAGGTTATCTCCAGGTGAGGGGATTCA CATCAGGCCACAAGCCACCAGAGGCCTTCTGCCACCTCCCAGAGCGACAGCCAGGGAGGCTGCGTACTCAGCCTCGGGGAGAAATCCCCG TGGGACCTGAGCCCCAAGACCTACGGACCACTCAGCCTTACCATCGTACCGTCCAGGATTGTCCTTGCCATCTTTGTTGTCTCAGCCAGA CCTTGGTTTTCAGTAAAGCCCCAGTTTCTACTTCCTGCATGCCACTGTGCAAGGCCACTCATCACTGTTCCTGCAGAAGCCTCTGGACGT GGGGCTGGATGGGGTTGAAAATGTTACATGTAAATATTGGTTTGGTTCGGTTTTTAGCATTTTACTTGGTAACTGGTTGTTTTCTTTTTT GGGGTGGGGGGATTGGTTTGTAAAAATTCTCTACTCTTTTGGAATGTGATTTCTAAGTTTGTTGGTTTCTTCAAATGCCTTTTAAGTCTT GGTAACATTCCCAAAGCAGAAAACTGCCTGACCCACAGTGGGGATTCCCTGGAGAATTGGGGTCCCAAGAAGGAATGCTGCCCTTCTCGA ACCCGTTCTCCCCCTTCCTCCTGCCTCTCTGCCTTTTACTGCTATTCCCTTCTTCTCCTCCTTTATCCTTCTTTCTGTTTTCCCCATCTC CACTCTCTCTTCAACCAAAGTCCCAAGGAACCCTCGGGGCTCAATCCCCCATAGACCACTTGGCTTGGGTCCATGGGGTTGGCATCAGTT GGTTGGCGGAAATGGGGGACCAGTTGGCATGATGGCCCTAAACTGGGAAACCTCATGTTTCTTATGTCTCACCTCTTTCCAGAGCCAAAT CAGCCCCTTTTGGAATGATGACTTCATTGGAATGCAAATCAAGTCATTTTGGTGCATCAGTGGCTCTTAGGCCTGCACACACGAGACATC AGAATCCAATCCTCTGACCCTGTGCCAGCCCTTTCCCCCAGTTTATTTCCCACCAAAGGCTGACCTCTAAGAGGTCTTGCTTTCTATGAA CTCAAGATGGGTCCCACCTCTAGGTGTCCCCAGGTGCACTCTTCTACCGGTTGGCTTCCGATGTGACAAGGCCAAGGGCCCAAAGACTTG ACCCTCTTACACCCTTGCTGACATGGTTCCATCATGTCCACCCGCATGCACTTTTATGGTTTCATCACCCAGCCTCTTCTCCTCTGGCCC ACCCAGCGTCCAGGCTCTTTCTCCCTCTCCCCTCCTATCTAGAATGTCCCCTGCTTCTAGCCTCACCAGACCCCCCAAGCTCCCACTACT TCTTCCATAATAATAGTAATAACAATGGTTATCATCATCCCCTGCACATCCCGCCTAAAGCACTTTACTATATAGAAAACGTTTCCCCTG GCCGGGCATGGTGGCTCACGCCTGAAATCCCAGCTCTTTGGGAGGCTGAGGCGAGCGGATCAGTTGAGGTCAGGAGTTCAACGCCAGCCT GGCCAACGTGGTGAATCCCTGTCTGTACTAAAAGTACAAAAAATTAGCTGAGCATGGTGGTGCGTGCCTGTAATCCCAGCTACTCGGGAG GCTGAGGTGGGAGAATCGCTTGAGCCCAGGAGGCGGAGGTTGCAGGAGCAGAGATTGCGCCACTGCACTCCAGCCTGGATGACAGAGTGA GACCCAATCTCAAAAAAGAAATCGTTTCCCACCCCACATCTCCTTCAGACCTCTCAGGGACAACTCTGGGAGGCAGCCTTGGCAGGACAT GGGTTAGTGCGCCCATTTTGCTGTGAGGAAACTGAGGTACAGGTCTCATCCCAGAGCATGAGAAGTCACTGAGTTTAGATGAGAACTTGG GTCCAACTCTGTCCTGTTTGCTGTGCAAATCCGCTGCCCTGCTGGGGGCTTTTGGTGGGTCCAGAATACCCAGAATATGCTGCTGGCCAA CCCAGGCATAAAACAAGTCCATTCTAGATCACTGAGCCTTGTGTATTCCAGAGGGTGATCTGAGGTCCCCATTCAGCAGAATTCTCTGAG GGCATGTTCAGAATGTAGATTCCTGGGCCCCACCTTGAATTTGCATGTTTAACAAACTCTCCTGGGGTTGAGGGGTGGGTGCAGTGGTCA CACCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGTGGATCACTTGAGCTCAGGAGTTCGAGCCTGGCCAATATGGTGAAACCCTGTC TCTACTAAAAATGCAAAAATTAGCCAGGTGTGGTGGCACATGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCACTTGAA CCTGGTGGGGAGCAGCGGTTGCAGTGAGCCGAGATTGTGCCATTGCGCTCTAGCCTGGGCAACGGAATGAGACTTGTCTCAAAAAAAATA AAAAATAAAACCAGCCCTCCCCGGGGGATCTTAGGCACTATTGGCCACACCATTGGTGTTCGTGGCCCTGATTATTAGGCTATCTTTCTT TTTTTAAGTTTTTTTAGATTTATTTTTTATTTTATTTATTTATTTATTTATTATTTATTATTATTATTTTTAGAGACAGGGGGTCTCCCT ATGTTGCCCAGGCTGGTTTCAAACTCCTGGGCTCAAGTGATCTGCCCTCCTCAGCCTCCCAAAGTGCTGGGGTTACAGGCAGGCATCAGC CACCGTGCCAGGTCATCTTCCTTTTTCTTTTTTTTTTGGAGACAGAGTCTTGCTCTGTCATTCAGGCTGGAGTACAGTGGCGCCATTTCA GCTCACCGCAGCCTCCACCTCCCAGGTTCAAGCAGTTCTCCTGCCTCAACCTCCCGAGTAACTGGGATTACAGGTGTGTGCCACCACACC CGACTAATTTTTTTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCACCCACC TCGGCCTCCCAAAGTACTGGGATTATAGGCGTGAGCCACGGCATCCAGCCTCATCTTTCTTTTAACCAATAAACATGATGCTGTATCTTA AAAAGAGCACTGAGCAGGGACTTAAGGGATCGAGTCCTCAACCAAACTGATTTAATTACTCAGGATTTTCAAAAAGCATCAGAGGCTATT TACAATCTTAATCATAGGGGTTCAGTAAAATAAAAATAAGAAGTAAAAAAGCAAGAGAAATTATTCTGTAAATCTAACTGGTGTAATTCC CATAATCATGCAATTAAGTTTTACTCTTGAGTTTCCTGACAGCCATTGGTAAAAAGAGAAACACATCAGGATTTATAATTTTTATCATCC AATTATGGGAAGCAAGCATGTTGGCCCCAGGAGACGAACTCTTCTACTAATTTATAGCATGTATCCTTTCATCAAGGGCCACTGAGCCAG TTGGTGAGTCAACGGGTGAACCTAAGATGCAAGGATGTTTTCCAGGTGACTATTTAAAAAAATAAATAATCCACCAACGTGATTGACCTT GGCGAGATCATGTTTCTAGTCTATACCTCAGTTTCCCCATCTGTAAAGTGAGGATAATGTCCCACCCCATGTAACTGTGGTGAGGACCAA CTGCAACACTGTGCCTGCGAGTCTCCTTGGAAAAGTGTAAGGTTCTACACAAATGGAAAGTGATCTGATCACACTCAGTGTCCCCAGCCC AGCCTTTCAGTGCCCTGGCCCTGGGGTGGGGGACAATACTCTCCTCACCCCCTTCACTAGTCTTCATGAATAGCAAGGAGGCCATAACAT AATTTGGTCTAAACCCCTTCCTTTTTAAAAGAATGATGGCAAAATGTGCATAACATTAAATTTACCATTTTAAAGATTTCTAAGTGTCTC GAAGTACATTTGCAATGTGTAACTGCCACCTCCAGAACTTTTTCATCATCCTAAACTGAAACTCTGTACCCACTAAACAGTAACTCCCCA CTCCCCCTGTCCCCAGTCCCTGGTAACCTCTATTCTACTTTTTTTTTTTCTCTGTGGATTGGCCTATTCTAGGTATCTCTTTTAAGTAGA ATCATATAGTATTTGTCCTTTTGTGTCCAGCCCCCTCATTTTTTGAGATGAGGAATCAGGCCCAGAGAAGGCAGTGGCTCACCCAAGGTC ACATCGCAAACCAGAGGCAGAGCCAAGACCAGAACCCAGGTTTCCTGACTCCTAACAGGTCTTGCATCCCCAGTAGTGAATGAGACACTG CTTTGACTTTCTGTAATCTTGGTTTAGCCCCTTCCTTTCTCTGGGCTCAGTCTGCTTCCCACCACTCACCCATTCTTTCTTTATCATCTC CCTCCAAAGCCTCTTGTCCTCCTGCCTCCTCTTCTCCTTGGCTGGTTCCTGCCAGCAAATGGAGTGGCCCTGGTCCATGCCAGGTTTTCC TCTTCTGGGTCCGGAGCTCACTATAGTATTCAGCCCTCAGTCCTCCCAGGATGTTCTTCCCAGCACTGCTGCCTCGTGCGGATTTTCCCG TAACCTCAGTAACTGGCTTCTTGTCCCCCTGCTTCCTACCAGGGAAGCCTTCCTGTCCGCGTCTGTGGTTTCCCAACTCTGATATTTGCT CTCAAATGTGGTGGTGTCCTGGTTCTGTGTTTATTTATTTTGTGTTTTCTCACACACACAAAAAAAAGCCTCTTCCTCCAATGCATCAGG AGGCACCAGCCCTGCCAGCCCTTCTCACTGGGCTCACCCTGCCCCAGCAACCCCCCGGTGCCACCCTTAAGCCACACTGTCTTCTCTGTA AGCAGCCTGCCAGCAGCAGCCCCAGCACTTTGCAATGGGCGTGTGTGTGGTGGTGGGTGGGGGGGGCTTGGATCCCTCCTTTTTCCTCCC CTGCCCTGCCCAGGCCCAGATGGCCTTGACTGTAAAGCAGGTGCTGCCTGACAGGTTCTTCTCTCCCTGTCTCTGGTCATTGATCCATCT CTTTGTCCATTCAGTATCCAACCATCCTCTCCATTCTCCTCTGGACCTCACCACTCTCAGAGCTGCTTGTCCTGGCAGAATCTACAGTTC ACCCCAACTCTATGCCTTACCCCTCCCAACCCAACAGCATTTGCAGTTTGCAAAATATACAGACCCAAGTCCTGAGGGGACTGAGGACAT GATGCTGGGCCCAAGTCTCCTGCTCAGGGCTTCTCTCCAATGCCAGCCCTGCCACTCCTTCCTCACCCTCCTTGGAGCCTCCTCTGCTGC TTGTCTATCCCAACGGCCCTGCTCCCCTCCCTTCCTGCCCTTCACCAGCTTTCTGGACACCATGCCCTGAGGAAGGGACCTTTGGTTTTC TCTAAACATCTTTGAAGGGCTGAGGCAGTCAGGGCTGGCTGCCTTGTCACTCTTTATTTGGAAGCCACTCAAACCATTCCCAAGAAGAGG GACCTCAGCTGGCAATCTGGAAACCTGGCCCAGGTCTGGGCAGATGTCTTCACTTCTCCTACCTTCCCAGTCTTGTGATCCTGTGATGAG CACCAGGATGGCCCTGTGGTCCCTAGAGCACCCCTCATGCTGTAGGGTCCTGCAGCCCCATCCTTTCTCTACTGGGCCCTGGTATCCTGG CTCCTCTCTCAGCTCTGCCACTGATCTCTGTGCCTTAGTTTACTTCTCTGCACGGGGGACTCACCCCAAGACCATTTCCAGCAGCTTCCC AGGTGATGTGGTGCCCCAAGGCTGGGCTTTGCCAGCTGTGGCCCAGCTCCTTAGTGCTGCCCAGGAGACACCAGGCTGCTCAGAATGAGG TGACTGCGGGCACCATTCTCAGCCAGTGGTTCTTGTATTGCATTCCAGCAGCAGGAATATCACCTGGGAACTTGATAGAAGTGCAGATTA GCAGCCCCACCCAAGACCCACTGAATTAGAGCTTGTGGAGTGGGGCCCTACAAGCTGGGGTTTTAAGGAGCCCTCCAAATGATTCTGACG CATAAGAATATGCCAACTGCTGATCTGGGCTAGCCATTAGTAGAGCCTGGGGAGGGACTGGGACTGGCTAGGCCAAGAACAGGTGGAAAA CACCAGCCTTATCTGGACTCCTGAGATTGGGAACCACCACCAACAAAAACCAACCCTATAGTCGCTCCTCTTGGAAGAGGAAGAGAAGTT GAAGGGCCTGGAGAAAGCACACATTGTTTGTTTCCCTGCTCCTGCTCACCTCTCTCACTTGTCTTGGTTTTACAAAAGGCTGTGTGGATG GTGCCAGCCAGGGAGGGGGTGGGAGTCCTGGGGAGGCAGGAGGCAGAAGACCCTGACTGTTTCTCCCTTGGGAACCTCACCATAGGCCAG ATAGCGCCTCTTCAAACTGAAAGAAATCTTAACTCCACAAAGAAAGCATCCTAAATCCCCAGTTCCTCCTCCTCCCAACCCCAGGGATAC CTTGTAGACAGTGCCAAAAAACAGCTCCAACCCCCAGCAGCTGGGAAGAGAGCCAGAAGCTGCCCTTCCTCCTCATCCTGGCCCCTCCCC AGCCCCCGCCAATACTGTGAACCCCCTTCCCACTCAGCCTGGTTTCCTGGTGAGGGTCCTGCAGTCATGGGCCCTGGGGGACCCCCAGGG CAAGGCCCATGGGAGGGAAGGGACCAAGGGCATCCTTGGGCCAACTGTCCACCTCTCTTGTCCACTATTCTCTCCTTTCCACTTCTGTCT TCAAAAGGCTCCTTCCTAGGATGGATCGGGTGCTAGGACAACTGCAGTCCAATCCACCAGCTCTCCCTGCCCCTGTGTCTTATTTCAGAC >8490_8490_11_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000395913_SOGA1_chr20_35445872_ENST00000237536_length(amino acids)=1549AA_BP=245 MGRRWEPTGIMDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSNSSQRIPLQAQKLVSSHKPVQNQKQKQLQ ATSVPHPVSRPLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLE KAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATHSLKKRGTRSLGKADKKTLVQEDSA DLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENR GLRAEMDDMKDHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELAKFRSEHELDVALSEDSC SVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEV LPGLREQAALVSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELNAFTRKADAVLGCSVKEQ QESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHD SLRGLQEQLSQERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQFKHNFLLLFMKLRWFLKR WRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKA KATWETERAELKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQEKLQLVERLQGEKQQVEQ QVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLP NNPAFGFVSSEPGDPEKDTKEKPGLSSRDCNHLGALACQDPPGRQMQRSYTAPDKTGIRVYYSPPVARRLGVPVVHDKEGKIIIEPGFLF TTAKPKESAEADGLAESSYGRWLCNFSRQRLDGGSAGSPSAAGPGFPAALHDFEMSGNMSDDMKEITNCVRQAMRSGSLERKVKSTSSQT VGLASVGTQTIRTVSVGLQTDPPRSSLHGKAWSPRSSSLVSVRSKQISSSLDKVHSRIERPCCSPKYGSPKLQRRSVSKLDSSKDRSLWN LHQGKQNGSAWARSTTTRDSPVLRNINDGLSSLFSVVEHSGSTESVWKLGMSETRAKPEPPKYGIVQEFFRNVCGRAPSPTSSAGEEGTK KPEPLSPASYHQPEGVARILNKKAAKLGSSEEVRLTMLPQVGKDGVLRDGDGAVVLPNEDAVCDCSTQSLTSCFARSSRSAIRHSPSKCR -------------------------------------------------------------- >8490_8490_12_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000395913_SOGA1_chr20_35445872_ENST00000279034_length(transcript)=3913nt_BP=894nt CTTAAACGCGACTCAAGGCGTCGGGTTTGTTGTCAACCAATCACAAGGCAGCCTCGCTCGAGCGCAGGCCAATCGGCTTTCTAGCTAGAG GGTTTAACTCCTATTTAAAAAGAAGAACCTTTGAATTCTAACGGCTGAGCTCTTGGAAGACTTGGGTCCTTGGGTCGCAGGTGGGAGCCG ACGGGCATCATGGACCGATCTAAAGAAAACTGCATTTCAGGACCTGTTAAGGCTACAGCTCCAGTTGGAGGTCCAAAACGTGTTCTCGTG ACTCAGCAATTTCCTTGTCAGAATCCATTACCTGTAAATAGTGGCCAGGCTCAGCGGGTCTTGTGTCCTTCAAATTCTTCCCAGCGCATT CCTTTGCAAGCACAAAAGCTTGTCTCCAGTCACAAGCCGGTTCAGAATCAGAAGCAGAAGCAATTGCAGGCAACCAGTGTACCTCATCCT GTCTCCAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCTGAAAATAATCCTGAGGAGGAACTGGCATCAAAA CAGAAAAATGAAGAATCAAAAAAGAGGCAGTGGGCTTTGGAAGACTTTGAAATTGGTCGCCCTCTGGGTAAAGGAAAGTTTGGTAATGTT TATTTGGCAAGAGAAAAGCAAAGCAAGTTTATTCTGGCTCTTAAAGTGTTATTTAAAGCTCAGCTGGAGAAAGCCGGAGTGGAGCATCAG CTCAGAAGAGAAGTAGAAATACAGTCCCACCTTCGGCATCCTAATATTCTTAGACTGTATGGTTATTTCCATGATGCTACCAGAGTCTAC CTAATTCTGGAATATGCACCACTTGGAACAGTTTATAGAGAACTTCAGAAACTTTCAAAGTTTGATGAGCAGAGAACTGCTACTCATTCC TTGAAGAAAAGAGGAACCCGCTCCCTGGGGAAGGCCGATAAGAAGACTTTGGTGCAGGAGGACAGTGCAGACCTGAAGTGCCAGTTGCAC TTTGCAAAGGAGGAGTCAGCCCTCATGTGCAAGAAGCTCACTAAGCTTGCCAAGGAGAATGACAGCATGAAGGAGGAGCTGCTGAAGTAC CGCTCGCTCTATGGGGACCTGGACAGCGCGCTGTCAGCCGAGGAGCTGGCCGATGCCCCCCACTCGCGGGAGACCGAGCTGAAGGTGCAC CTGAAGCTGGTGGAGGAGGAAGCCAACCTGCTGAGCCGCCGCATCGTGGAGCTGGAGGTGGAGAACCGAGGCCTGCGGGCTGAGATGGAC GACATGAAGGATCATGGAGGTGGCTGTGGGGGTCCTGAGGCACGCCTGGCCTTCTCCGCGCTGGGTGGCGGAGAGTGCGGGGAGAGCTTG GCAGAGCTGCGGCGACACCTGCAGTTTGTCGAAGAGGAGGCCGAGCTGCTGCGGCGCTCCTCTGCCGAGCTCGAGGACCAGAACAAGCTG CTGCTGAACGAGCTGGCCAAGTTCCGCTCGGAGCACGAGCTGGACGTGGCGCTGTCGGAGGACAGTTGTTCTGTGCTCAGCGAACCTTCA CAGGAGGAGCTGGCGGCCGCCAAGCTGCAGATCGGCGAGCTCAGCGGCAAGGTCAAGAAGCTGCAGTACGAGAACCGCGTGCTCCTCTCC AACCTCCAGCGCTGTGACCTCGCCTCCTGCCAGAGTACGCGGCCCATGCTGGAGACGGACGCCGAGGCCGGGGACTCTGCCCAGTGTGTG CCTGCTCCCCTGGGCGAGACACACGAGTCCCATGCGGTCCGACTCTGCAGAGCCAGGGAGGCCGAGGTGCTGCCTGGGCTGAGAGAGCAG GCCGCCCTGGTCAGTAAGGCCATCGATGTCCTGGTGGCTGATGCCAATGGCTTCACGGCTGGCCTCCGGCTGTGTCTGGACAACGAGTGT GCTGACTTCCGGCTGCATGAGGCCCCCGACAACAGCGAGGGCCCCAGGGACACCAAGCTCATCCATGCCATCCTGGTGCGCCTGAGCGTG CTGCAGCAGGAGCTGAATGCCTTCACGCGGAAGGCAGATGCAGTCCTCGGGTGCTCTGTCAAGGAACAGCAGGAGTCCTTCTCATCACTG CCCCCCTTGGGCTCCCAGGGGCTCTCTAAGGAGATTCTTCTGGCAAAAGACCTTGGCTCAGACTTTCAGCCACCTGACTTCAGGGACCTG CCGGAATGGGAGCCCAGGATCCGAGAGGCTTTCCGCACTGGTGACTTGGACTCTAAGCCCGACCCCAGCCGGAGCTTCAGGCCTTACCGA GCTGAAGACAATGATTCCTATGCCTCTGAGATCAAGGAGCTGCAGCTGGTGCTGGCTGAGGCCCACGACAGCCTCCGGGGCTTGCAAGAG CAGCTCTCCCAGGAGCGGCAGCTACGAAAGGAGGAGGCCGACAATTTCAACCAGAAAATGGTCCAGCTGAAGGAGGACCAGCAGAGGGCG CTCCTGAGGCGGGAGTTTGAGCTGCAGAGTCTGAGCCTCCAGCGGAGGCTGGAGCAGAAATTCTGGAGCCAGGAGAAGAACATGCTGGTG CAGGAGTCCCAGCAATTCAAGCACAACTTCCTGCTGCTCTTCATGAAGCTCAGGTGGTTCCTCAAGCGCTGGCGGCAGGGCAAGGTTTTG CCCAGCGAAGGGGATGACTTCCTCGAGGTGAACAGCATGAAGGAGCTGTACTTGCTGATGGAGGAAGAGGAGATAAACGCTCAGCATTCT GATAACAAGGCCTGCACGGGGGACAGCTGGACCCAGAACACGCCCAATGAGTACATCAAGACACTGGCCGACATGAAGGTGACGCTGAAG GAGCTGTGCTGGCTGCTCCGGGATGAACGCCGTGGTCTGACGGAGCTTCAGCAACAGTTTGCCAAGGCCAAGGCTACCTGGGAGACAGAG CGGGCAGAGCTCAAGGGCCATACCTCCCAGATGGAGCTGAAGACAGGGAAGGGGGCCGGGGAGCGGGCAGGGCCCGACTGGAAGGCAGCC CTACAGCGGGAGCGTGAGGAGCAGCAGCACCTCCTAGCTGAGTCCTACAGCGCTGTCATGGAGCTGACTCGGCAGCTGCAGATCAGTGAG CGCAACTGGAGCCAGGAAAAGCTGCAGCTGGTGGAGCGGCTGCAGGGTGAGAAGCAGCAGGTGGAGCAGCAGGTGAAGGAGCTGCAGAAC CGCCTAAGCCAGCTGCAGAAGGCTGCCGACCCCTGGGTCCTGAAGCACTCGGAGCTGGAGAAGCAGGACAACAGCTGGAAGGAGACACGC AGTGAGAAGATCCACGACAAGGAGGCTGTTTCCGAAGTTGAGCTTGGAGGAAATGGTTTAAAGAGAACCAAATCTGTTTCTTCCATGTCT GAGTTTGAAAGTTTGCTCGACTGTTCCCCTTACCTTGCTGGCGGAGATGCCCGGGGCAAGAAGCTGCCTAACAACCCTGCCTTTGGCTTT GTGAGCTCCGAGCCAGGGGATCCAGAGAAAGACACCAAGGAGAAGCCTGGGCTCTCGTCGAGGGACTGCAACCACCTGGGTGCCCTGGCC TGCCAGGACCCCCCAGGGAGCCAGAAGCTGCCCTTCCTCCTCATCCTGGCCCCTCCCCAGCCCCCGCCAATACTGTGAACCCCCTTCCCA CTCAGCCTGGTTTCCTGGTGAGGGTCCTGCAGTCATGGGCCCTGGGGGACCCCCAGGGCAAGGCCCATGGGAGGGAAGGGACCAAGGGCA TCCTTGGGCCAACTGTCCACCTCTCTTGTCCACTATTCTCTCCTTTCCACTTCTGTCTTCAAAAGGCTCCTTCCTAGGATGGATCGGGTG CTAGGACAACTGCAGTCCAATCCACCAGCTCTCCCTGCCCCTGTGTCTTATTTCAGACATGAGAATAACTGTACAGTGTAAACTTATAAA >8490_8490_12_AURKA-SOGA1_AURKA_chr20_54956489_ENST00000395913_SOGA1_chr20_35445872_ENST00000279034_length(amino acids)=1142AA_BP=245 MGRRWEPTGIMDRSKENCISGPVKATAPVGGPKRVLVTQQFPCQNPLPVNSGQAQRVLCPSNSSQRIPLQAQKLVSSHKPVQNQKQKQLQ ATSVPHPVSRPLNNTQKSKQPLPSAPENNPEEELASKQKNEESKKRQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLE KAGVEHQLRREVEIQSHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATHSLKKRGTRSLGKADKKTLVQEDSA DLKCQLHFAKEESALMCKKLTKLAKENDSMKEELLKYRSLYGDLDSALSAEELADAPHSRETELKVHLKLVEEEANLLSRRIVELEVENR GLRAEMDDMKDHGGGCGGPEARLAFSALGGGECGESLAELRRHLQFVEEEAELLRRSSAELEDQNKLLLNELAKFRSEHELDVALSEDSC SVLSEPSQEELAAAKLQIGELSGKVKKLQYENRVLLSNLQRCDLASCQSTRPMLETDAEAGDSAQCVPAPLGETHESHAVRLCRAREAEV LPGLREQAALVSKAIDVLVADANGFTAGLRLCLDNECADFRLHEAPDNSEGPRDTKLIHAILVRLSVLQQELNAFTRKADAVLGCSVKEQ QESFSSLPPLGSQGLSKEILLAKDLGSDFQPPDFRDLPEWEPRIREAFRTGDLDSKPDPSRSFRPYRAEDNDSYASEIKELQLVLAEAHD SLRGLQEQLSQERQLRKEEADNFNQKMVQLKEDQQRALLRREFELQSLSLQRRLEQKFWSQEKNMLVQESQQFKHNFLLLFMKLRWFLKR WRQGKVLPSEGDDFLEVNSMKELYLLMEEEEINAQHSDNKACTGDSWTQNTPNEYIKTLADMKVTLKELCWLLRDERRGLTELQQQFAKA KATWETERAELKGHTSQMELKTGKGAGERAGPDWKAALQREREEQQHLLAESYSAVMELTRQLQISERNWSQEKLQLVERLQGEKQQVEQ QVKELQNRLSQLQKAADPWVLKHSELEKQDNSWKETRSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARGKKLP -------------------------------------------------------------- |
Top |
Fusion Gene PPI Analysis for AURKA-SOGA1 |
Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in |
Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160) |
Hgene | Hgene's interactors | Tgene | Tgene's interactors |
- Retained PPIs in in-frame fusion. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Still interaction with |
- Lost PPIs in in-frame fusion. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with |
- Retained PPIs, but lost function due to frame-shift fusion. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with |
Top |
Related Drugs for AURKA-SOGA1 |
Drugs targeting genes involved in this fusion gene. (DrugBank Version 5.1.8 2021-05-08) |
Partner | Gene | UniProtAcc | DrugBank ID | Drug name | Drug activity | Drug type | Drug status |
Top |
Related Diseases for AURKA-SOGA1 |
Diseases associated with fusion partners. (DisGeNet 4.0) |
Partner | Gene | Disease ID | Disease name | # pubmeds | Source |
Hgene | AURKA | C0002938 | Aneuploidy | 1 | CTD_human |
Hgene | AURKA | C0006142 | Malignant neoplasm of breast | 1 | CTD_human |
Hgene | AURKA | C0009402 | Colorectal Carcinoma | 1 | CTD_human |
Hgene | AURKA | C0009404 | Colorectal Neoplasms | 1 | CTD_human |
Hgene | AURKA | C0022665 | Kidney Neoplasm | 1 | CTD_human |
Hgene | AURKA | C0024668 | Mammary Neoplasms, Experimental | 1 | CTD_human |
Hgene | AURKA | C0025202 | melanoma | 1 | CTD_human |
Hgene | AURKA | C0027819 | Neuroblastoma | 1 | CTD_human |
Hgene | AURKA | C0033578 | Prostatic Neoplasms | 1 | CTD_human |
Hgene | AURKA | C0376358 | Malignant neoplasm of prostate | 1 | CTD_human |
Hgene | AURKA | C0678222 | Breast Carcinoma | 1 | CTD_human |
Hgene | AURKA | C0740457 | Malignant neoplasm of kidney | 1 | CTD_human |
Hgene | AURKA | C1257806 | Chromosomal Instability | 1 | CTD_human |
Hgene | AURKA | C1257931 | Mammary Neoplasms, Human | 1 | CTD_human |
Hgene | AURKA | C1458155 | Mammary Neoplasms | 1 | CTD_human |
Hgene | AURKA | C2239176 | Liver carcinoma | 1 | CTD_human |
Hgene | AURKA | C4704874 | Mammary Carcinoma, Human | 1 | CTD_human |
Hgene | AURKA | C4721453 | Peripheral Nervous System Diseases | 1 | CTD_human |