FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:SPATA7-NAA50 (FusionGDB2 ID:85589)

Fusion Gene Summary for SPATA7-NAA50

check button Fusion gene summary
Fusion gene informationFusion gene name: SPATA7-NAA50
Fusion gene ID: 85589
HgeneTgene
Gene symbol

SPATA7

NAA50

Gene ID

55812

80218

Gene namespermatogenesis associated 7N-alpha-acetyltransferase 50, NatE catalytic subunit
SynonymsHEL-S-296|HSD-3.1|HSD3|LCA3MAK3|NAT13|NAT13P|NAT5|NAT5P|SAN|hNaa50p
Cytomap

14q31.3

3q13.31

Type of geneprotein-codingprotein-coding
Descriptionspermatogenesis-associated protein 7epididymis secretory protein Li 296epididymis secretory sperm binding proteinspermatogenesis-associated protein HSD3N-alpha-acetyltransferase 50N-acetyltransferase 13 (GCN5-related)N-acetyltransferase 5N-acetyltransferase san homologN-epsilon-acetyltransferase 50natE catalytic subunit
Modification date2020032820200313
UniProtAcc.

Q9GZZ1

Ensembl transtripts involved in fusion geneENST00000045347, ENST00000356583, 
ENST00000393545, ENST00000556553, 
ENST00000554102, 
ENST00000467022, 
ENST00000493454, ENST00000493900, 
ENST00000497255, ENST00000497525, 
ENST00000240922, ENST00000477813, 
Fusion gene scores* DoF score3 X 3 X 3=279 X 5 X 5=225
# samples 39
** MAII scorelog2(3/27*10)=0.15200309344505
effective Gene in Pan-Cancer Fusion Genes (eGinPCFGs).
DoF>8 and MAII>0
log2(9/225*10)=-1.32192809488736
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: SPATA7 [Title/Abstract] AND NAA50 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointSPATA7(88897569)-NAA50(113442939), # samples:1
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
TgeneNAA50

GO:0006474

N-terminal protein amino acid acetylation

21900231|22311970|27484799

TgeneNAA50

GO:0034087

establishment of mitotic sister chromatid cohesion

27422821

TgeneNAA50

GO:0071962

mitotic sister chromatid cohesion, centromeric

17502424


check buttonFusion gene breakpoints across SPATA7 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across NAA50 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4STADTCGA-FP-8210-01ASPATA7chr14

88897569

+NAA50chr3

113442939

-


Top

Fusion Gene ORF analysis for SPATA7-NAA50

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
5CDS-5UTRENST00000045347ENST00000467022SPATA7chr14

88897569

+NAA50chr3

113442939

-
5CDS-5UTRENST00000045347ENST00000493454SPATA7chr14

88897569

+NAA50chr3

113442939

-
5CDS-5UTRENST00000045347ENST00000493900SPATA7chr14

88897569

+NAA50chr3

113442939

-
5CDS-5UTRENST00000045347ENST00000497255SPATA7chr14

88897569

+NAA50chr3

113442939

-
5CDS-5UTRENST00000045347ENST00000497525SPATA7chr14

88897569

+NAA50chr3

113442939

-
5CDS-5UTRENST00000356583ENST00000467022SPATA7chr14

88897569

+NAA50chr3

113442939

-
5CDS-5UTRENST00000356583ENST00000493454SPATA7chr14

88897569

+NAA50chr3

113442939

-
5CDS-5UTRENST00000356583ENST00000493900SPATA7chr14

88897569

+NAA50chr3

113442939

-
5CDS-5UTRENST00000356583ENST00000497255SPATA7chr14

88897569

+NAA50chr3

113442939

-
5CDS-5UTRENST00000356583ENST00000497525SPATA7chr14

88897569

+NAA50chr3

113442939

-
5CDS-5UTRENST00000393545ENST00000467022SPATA7chr14

88897569

+NAA50chr3

113442939

-
5CDS-5UTRENST00000393545ENST00000493454SPATA7chr14

88897569

+NAA50chr3

113442939

-
5CDS-5UTRENST00000393545ENST00000493900SPATA7chr14

88897569

+NAA50chr3

113442939

-
5CDS-5UTRENST00000393545ENST00000497255SPATA7chr14

88897569

+NAA50chr3

113442939

-
5CDS-5UTRENST00000393545ENST00000497525SPATA7chr14

88897569

+NAA50chr3

113442939

-
5CDS-5UTRENST00000556553ENST00000467022SPATA7chr14

88897569

+NAA50chr3

113442939

-
5CDS-5UTRENST00000556553ENST00000493454SPATA7chr14

88897569

+NAA50chr3

113442939

-
5CDS-5UTRENST00000556553ENST00000493900SPATA7chr14

88897569

+NAA50chr3

113442939

-
5CDS-5UTRENST00000556553ENST00000497255SPATA7chr14

88897569

+NAA50chr3

113442939

-
5CDS-5UTRENST00000556553ENST00000497525SPATA7chr14

88897569

+NAA50chr3

113442939

-
In-frameENST00000045347ENST00000240922SPATA7chr14

88897569

+NAA50chr3

113442939

-
In-frameENST00000045347ENST00000477813SPATA7chr14

88897569

+NAA50chr3

113442939

-
In-frameENST00000356583ENST00000240922SPATA7chr14

88897569

+NAA50chr3

113442939

-
In-frameENST00000356583ENST00000477813SPATA7chr14

88897569

+NAA50chr3

113442939

-
In-frameENST00000393545ENST00000240922SPATA7chr14

88897569

+NAA50chr3

113442939

-
In-frameENST00000393545ENST00000477813SPATA7chr14

88897569

+NAA50chr3

113442939

-
In-frameENST00000556553ENST00000240922SPATA7chr14

88897569

+NAA50chr3

113442939

-
In-frameENST00000556553ENST00000477813SPATA7chr14

88897569

+NAA50chr3

113442939

-
intron-3CDSENST00000554102ENST00000240922SPATA7chr14

88897569

+NAA50chr3

113442939

-
intron-3CDSENST00000554102ENST00000477813SPATA7chr14

88897569

+NAA50chr3

113442939

-
intron-5UTRENST00000554102ENST00000467022SPATA7chr14

88897569

+NAA50chr3

113442939

-
intron-5UTRENST00000554102ENST00000493454SPATA7chr14

88897569

+NAA50chr3

113442939

-
intron-5UTRENST00000554102ENST00000493900SPATA7chr14

88897569

+NAA50chr3

113442939

-
intron-5UTRENST00000554102ENST00000497255SPATA7chr14

88897569

+NAA50chr3

113442939

-
intron-5UTRENST00000554102ENST00000497525SPATA7chr14

88897569

+NAA50chr3

113442939

-

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000556553SPATA7chr1488897569+ENST00000240922NAA50chr3113442939-734715455592046495
ENST00000556553SPATA7chr1488897569+ENST00000477813NAA50chr3113442939-337215455591926455
ENST00000393545SPATA7chr1488897569+ENST00000240922NAA50chr3113442939-717313712891872527
ENST00000393545SPATA7chr1488897569+ENST00000477813NAA50chr3113442939-319813712891752487
ENST00000356583SPATA7chr1488897569+ENST00000240922NAA50chr3113442939-696311611751662495
ENST00000356583SPATA7chr1488897569+ENST00000477813NAA50chr3113442939-298811611751542455
ENST00000045347SPATA7chr1488897569+ENST00000240922NAA50chr3113442939-6884108201583527
ENST00000045347SPATA7chr1488897569+ENST00000477813NAA50chr3113442939-2909108201463487

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000556553ENST00000240922SPATA7chr1488897569+NAA50chr3113442939-0.0002151560.9997849
ENST00000556553ENST00000477813SPATA7chr1488897569+NAA50chr3113442939-0.0002846260.9997154
ENST00000393545ENST00000240922SPATA7chr1488897569+NAA50chr3113442939-0.0002592320.9997408
ENST00000393545ENST00000477813SPATA7chr1488897569+NAA50chr3113442939-0.0003543890.9996456
ENST00000356583ENST00000240922SPATA7chr1488897569+NAA50chr3113442939-0.0001957980.9998042
ENST00000356583ENST00000477813SPATA7chr1488897569+NAA50chr3113442939-0.0002356740.9997644
ENST00000045347ENST00000240922SPATA7chr1488897569+NAA50chr3113442939-0.000234360.9997657
ENST00000045347ENST00000477813SPATA7chr1488897569+NAA50chr3113442939-0.0003027630.9996972

Top

Fusion Genomic Features for SPATA7-NAA50


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.

Top

Fusion Protein Features for SPATA7-NAA50


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr14:88897569/chr3:113442939)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
.NAA50

Q9GZZ1

FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.FUNCTION: N-alpha-acetyltransferase that acetylates the N-terminus of proteins that retain their initiating methionine (PubMed:19744929, PubMed:22311970, PubMed:21900231, PubMed:27484799). Has a broad substrate specificity: able to acetylate the initiator methionine of most peptides, except for those with a proline in second position (PubMed:27484799). Also displays N-epsilon-acetyltransferase activity by mediating acetylation of the side chain of specific lysines on proteins (PubMed:19744929). Autoacetylates in vivo (PubMed:19744929). The relevance of N-epsilon-acetyltransferase activity is however unclear: able to acetylate H4 in vitro, but this result has not been confirmed in vivo (PubMed:19744929). Component of a N-alpha-acetyltransferase complex containing NAA10 and NAA15, but NAA50 does not influence the acetyltransferase activity of NAA10: this multiprotein complex probably constitutes the major contributor for N-terminal acetylation at the ribosome exit tunnel, with NAA10 acetylating all amino termini that are devoid of methionine and NAA50 acetylating other peptides (PubMed:16507339, PubMed:27484799). Required for sister chromatid cohesion during mitosis by promoting binding of CDCA5/sororin to cohesin: may act by counteracting the function of NAA10 (PubMed:17502424, PubMed:27422821). {ECO:0000269|PubMed:16507339, ECO:0000269|PubMed:17502424, ECO:0000269|PubMed:19744929, ECO:0000269|PubMed:21900231, ECO:0000269|PubMed:22311970, ECO:0000269|PubMed:27422821, ECO:0000269|PubMed:27484799}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
TgeneNAA50chr14:88897569chr3:113442939ENST00000240922056_1552170.0DomainN-acetyltransferase
TgeneNAA50chr14:88897569chr3:113442939ENST0000024092205117_1262170.0RegionCoenzyme A binding
TgeneNAA50chr14:88897569chr3:113442939ENST0000024092205138_1412170.0RegionSubstrate binding
TgeneNAA50chr14:88897569chr3:113442939ENST000002409220577_902170.0RegionAcetyl-CoA binding

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note


Top

Fusion Gene Sequence for SPATA7-NAA50


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>85589_85589_1_SPATA7-NAA50_SPATA7_chr14_88897569_ENST00000045347_NAA50_chr3_113442939_ENST00000240922_length(transcript)=6884nt_BP=1082nt
ATGGATGGCAGCCGGAGAGTCAGAGCAACCTCTGTCCTTCCCAGATATGGTCCACCGTGCCTATTTAAAGGACACTTGAGCACCAAAAGT
AATGCTTTTTGCACTGACTCCTCTTCTCTCAGACTAAGCACTCTCCAGCTGGTCAAGAATCACATGGCTGTTCACTATAATAAAATCCTT
TCAGCCAAAGCTGCAGTAGACTGCTCGGTTCCAGTAAGCGTGAGTACCAGCATAAAGTATGCAGACCAACAACGAAGAGAGAAACTCAAA
AAGGAATTAGCACAATGTGAAAAAGAGTTCAAATTAACTAAAACTGCAATGCGAGCCAATTATAAAAATAATTCCAAGTCACTTTTTAAT
ACCTTACAAAAGCCCTCAGGCGAACCGCAAATTGAGGATGACATGTTAAAAGAAGAAATGAATGGATTTTCATCCTTTGCAAGGTCACTA
GTACCCTCTTCAGAGAGACTACACCTAAGTCTACATAAATCCAGTAAAGTCATCACAAATGGTCCTGAGAAGAACTCCAGTTCCTCCCCG
TCCAGTGTGGATTATGCAGCCTCCGGGCCCCGGAAACTGAGCTCTGGAGCCCTGTATGGCAGAAGGCCCAGAAGCACATTCCCAAATTCC
CACCGGTTTCAGTTAGTCATTTCGAAAGCACCCAGTGGGGATCTTTTGGATAAACATTCTGAACTCTTTTCTAACAAACAATTGCCATTC
ACTCCTCGCACTTTAAAAACAGAAGCAAAATCTTTCCTGTCACAGTATCGCTATTATACACCTGCCAAAAGAAAAAAGGATTTTACAGAT
CAACGGATAGAAGCTGAAACCCAGACTGAATTAAGCTTTAAATCTGAGTTGGGGACAGCTGAGACTAAAAACATGACAGATTCAGAAATG
AACATAAAGCAGGCATCTAATTGTGTGACATATGATGCCAAAGAAAAAATAGCTCCTTTACCTTTAGAAGGGCATGACTCAACATGGGAT
GAGATTAAGGATGATGCTCTTCAGCATTCCTCACCAAGGGCAATGTGTCAGTATTCCCTGAAGCCCCCTTCAACTCGTAAAATCTACTCT
GATAGCCGGATCGAGCTGGGAGATGTGACACCACACAATATTAAACAGTTGAAAAGATTGAATCAGGTCATCTTTCCAGTCAGCTACAAT
GACAAGTTCTACAAGGATGTGCTGGAGGTTGGCGAGCTAGCAAAACTTGCCTATTTCAATGATATTGCTGTAGGTGCAGTATGCTGTAGG
GTGGATCATTCACAGAATCAGAAGAGACTTTACATCATGACACTAGGATGTCTGGCACCTTACCGAAGGCTAGGAATAGGAACTAAAATG
TTAAATCATGTCTTAAACATCTGTGAAAAAGATGGTACTTTTGACAACATTTATCTGCATGTCCAGATCAGCAATGAGTCGGCAATTGAC
TTCTACAGGAAGTTTGGCTTTGAGATTATTGAGACAAAGAAGAACTACTATAAGAGGATAGAGCCCGCAGATGCTCATGTGCTGCAGAAA
AACCTCAAAGTTCCTTCTGGTCAGAATGCAGATGTGCAAAAGACAGACAACTGAACAAATTACAAATGAACTTTCTTGCACTTGCTTGTC
GCCAAATAAAAGAGAGGCCCATTGATTCCTCCCCCACCCCAACACTTTTCTTTTAAAGCTTTTCTCCCTCCTTGTTCTTGTTTTTCTTTC
TTCCTTTCCTTTTCTCTGAGAGTTTTAATACTTTCAAGGACTTTAAAAAAATAATCATGTTTGAATTGTTTTCTCTTATTTTTGTGAGGT
GGTTTGAAGGAAGGACAAGGTAGATCTGTTTAGTTTTGCAGTTGAAGTTAGATGGTCCTAAACATTTAATTGTCAAATAATTTCAAATTT
AATGTCCTGCTTTCACATTGAAGGGCAGAGCCTACAAAACATTGTATATTTCAAAAGACAAAAAGAAGCAGCAGCAGTATCTTGTTCTCT
AATTCATAGACAAGTTGAGTGTGTTTGTGGTACTTTGGGTTTTTAAACACTTTGGGATACTAATCCCTAGACATTGCCTTCACTCCACCT
TTAGTCCTTCTGAGCACTCTCTCGGGAGTTGGAACATTGTTATCCTTGTAAGAAATACTAAGCTTATGTTGATTTTTAAGTAATTATATC
TTCTCTTCTTGCTGGTGGGTGGGGCAGTTTGGTTTAGTGTTATACTTTGGTCTAAGTATTTGAGTTAAACTGCTTTTTTGCTAATGAGTG
GGCTGGTTGTTAGCAGGTTTGTTTTTCCTGCTGTTGATTGTTACTAGTGGCATTAACTTTTAGAATTTGGGCTGGTGAGATTAATTTTTT
TTAATATCCCAGCTAGAGATATGGCCTTTAACTGACCTAAAGAGGTGTGTTGTGATTTAATTTTTTCCCGTTCCTTTTTCTTCAGTAAAC
CCAACAATAGTCTAACCTTAAAAATTGAGTTGATGTCCTTATAGGTCACTACCCCTAAATAAACCTGAAGCAGGTGTTTTCTCTTGGACA
TACTAAAAAATACCTAAAAGGAAGCTTAGATGGGCTGTGACACAAAAAATTCAATTACTGTCATCTAATGCCAGCTGTTAAAAGTGTGGC
CACTGAGCATTTGATTTTATAGGAAAAAATAGTATTTTTGAGAATAACATAGCTGTGCTATTGCACATCTGTTGGAGGACATCCCAGATT
TGCTTATACTCAGTGCCTGTGATATTGAGTTTAAGGATTTGAGGCAGGGGTAATTATTAAACATATTGCTTCTATTCTTGGAAAAATAGA
AGTGTAAAATGTTAATAATACAAATGTCACTGTGACCTCCTCCACTGAGAGGACTGGTTTATGCCAGATCATTTTCCGGCACACACGGAG
TGGCTTTGACAGATTGATAACTTTGTAAGATGGGAGACATCTGAAATATTCATGTTTTCCTTTTGTAGTCCCATCTCCACTATTTAGAAA
TGTTCTCAGACTTTAAAATAATGCACAGGGCTTGAGCTTTCTGTCATTTGACTTTAAAAGGAAGTTTCATTCATATTTATCCTCTTATGT
AAAATTGCGGTATAAAGTCTCATTTCCAAATATGTTAAATGACAAAATTATTTTATAAAATGTTTATGCACACTTTATAACCTTAAGTTT
TTATTTGAGAATGTGAAAGTACAAAGTGCAGTAGACTTCAACAATCTTGAGTGCCAAGAATAATACAGAAAAAGAAGACAGTTGATGAAT
GAGTTTATAGGGTTCTAATCTTAAGATGGTAAAAATGTAGAAAGACCTTGCTGGTTTTTTGGGGGTATTCGTTTCTTAAACAATCCAAAT
CTAAGCTTAGAAGAAAAGTTTAGCGTTAAGCACCTTTATCTTCATGAATAAGCTTCAGCTTGCTCTTGGCAAGAGAAGAGTGCTTGAGTT
ACAGAAGGCATAAGTAGTTTGAAGAATGCAGCAGCCTTTTTGTAAACTTCCCAGATATCAAAATAGACTTTGATATATAAATGGTTTTCT
GAGATGACACTGCCTCTATTTCTATAACCATTTCACCTGGACTATCTAATCAGTCCTATGAATGTATCCCTAAATGTGGTTATTGAAAAC
CTAATAGCTGCCTCATGACAAGTACATGTTATTTAAGGAGGAAAAAATATTAAATTTTGAATTGAGTGTGTAGGCTCCCTATCATTATAT
ATAGAGTTTCTTTTTCCACGGTAGTCAGTGACTTAACCTGAATTGTAAATGTTTGTAAAGGGTTAATTGTCCTACATCAAACTTAGTTAA
ATAATTCCATCCACTTATGGAGGAGGAGGAGAATGTGGAAGAGGTAAAAAGCTGGGCACAAGTTCATATGCCTATGAGTCAGTAAAGACT
GAAGTAATGTCCTATGTTGAGCTGGTTATTTTGATATATGATAATAATTATCTTTGAAGTAGAACAATTCTGTTAACTGGAAAATCACAG
GATATATCCATCATATTTTTCAGGACAGATAGTTTTTACTGTGGGGCAAATAGGTTAAAATTACACTATGTTAGTTGCATTTAGGTTTTA
AAGCAAAGAATCTGTAGAGAAATCTATGCAATATATAGTTTGTCCAGATTAGCTTTCATTTGGGGAATGAAGTTCTGAAATATCTAAAGC
AGTTTACTCATCAATTGAAAAGTCCTCCAAAAAGAGAACTATTGGGAAACCATGGTGTGGTGGTGGAAAAGAAAAGCTCCCTCAGTTTTT
TGGAGGGAATAACTTAAAAAAATACTTAAATGGCTAAGTTTACTTGGTGCAGTTAAGAATTAAACTTGTCAATTTTAACATTGCTGTTAC
ATCTGAAATAAACTTATGTGATGTTCTGGTAGTGATCTGTGATATCTGTAAATGTCAAAACTGTATTGTTGAATTCTGCAGCCAGCAACC
GTACATACTCATTGTGTAGTGTTTCCCTTACTGCTTTTATTTACTTTCACATTTAAGATCTAATTTTAAAATCATTAAAATAGGCCAAGT
GTAATTAAGGCATCCTAATTCAGATCTTCTGTGACTTGCTAGGTACAGTGCCCAATATCTACAATTCAGTTCCAATGAGAGGGAAAAAGT
AAGATCCAAGGAACTGTCTTGCTGCTGTATTTTTAATGTAATTATTAGAAATACTTGACACTTTAGGCTGCAATCTAGTTAGTAATGTTT
ATTGGCTACAGACAACATATTCAGGTGTTTTGTTTTTCTTCCTTGTAAGGAAGGAGTACGGTAGTTCAATGAGTTGGGAATTGGCTTTTA
AGCTTTTTATCAAACATGAATTTAAGATTTTTTCTTTAACAGAATTCAGTATTTAATATTTCTAGTCAAAATTGTTATAATTAGATTTTT
GCTTTTTTCATTTAAGAAGCAATGAGTGATCCTTGTCTAGTGTGTCTCAGTTATTACTGTAGTTTAAAACAACAAACAAGTATTACCTTC
CACAGTTCTGGATCAGGAATCTAGGAGCTGCTTAACTGGGTGGTTCTGTCTTGAGATCTCAGAAGGTTGCAGTCAAGCTGTTAGCCAGGA
CTGTACCATCTTGAAGCTTGACAAGGCTAGAGGAGCCATTTCCAAGATGTCTTGCTTACGTGGCTCTTGGGCTTCTGTTCCTCACTGTGT
GGGTATCACACACAGTGCCTGATGCTCACGTGGCTCTTGGGCTTCAGTTCCTTGGGCACCAGTTGCCTGGTGGACAACTGGTATGTGGCT
TCCCCTAGAGCAGGTGATATAAGAAAGTGAGCAGAAACCATACTTTTTTATGACGTAGCCTCAGGTGACATTATTTCTGCCATATTCCAT
GAGTCACACAGACCAACCATTGAATCAGCTTGGTATAGTGTGGGAGGGGACTGCACAAGGATGTGAATACTGGGAGGTGGATATCATTGG
GCCCTCTAGGAGACTGGCTATCACTGCTCAGAGAAGATGCTGGGCTTCGTTGACTCCCAAGGTCAGGGTAGTTTTGGTTAGGTTGTGTTT
TATTAGTCTCTAAAAGGAGTAGTTTTCTAAATGAGGGTTATAAAGCACTGCACTTTACAGTTATTCGGGATATAAAAGAAATAGTGGGTC
TAAAGGCTGTGCTCTTGTGGTTGGGTTTTCAGTGGGGAGGGGAGACTAGTCTGTCTCAGACTGATGCTGTTCTCACTGATTTGAATATTT
CTGGGAAATTGGATACTACAGTCACAAATAGGAACAGTAAGCCTATAGAAGTTTTTCAGGGAGTAAATATTTTCTATGGCAGTGTCCTGA
ATTGGTTCTCCCTTGCAAGACTGAACTGTAGGAACTTAGTCCTAGTTTATGATACAGCCAAGTAACATAGTCACATGGGAAAAAGAGCTT
GAGTCAGATTTCTTAATGTGTGTTGTTAACTTGGTAGAACATTGAGAATTATTTAAGTCAGAGAACGATCTGTTACTGGGGCAGAAATTC
TCAACCTTTTCAGTTCTCCAAAATTTAAGATACTTGATTTCTTAGGTAAAATGTTTTTGTTTTTGTTTTGGAGACAGAGTCTCGCTCTGT
CGCCCAGGCTGGAGTGCAGTGGCGCGATCTTGGCTCACTGCAAACTCCGCCTCCCAGATTCAAGCAATTCTGCCTGAGCCTCCCAAGTAG
CTGCGACTAGAAAGCGCATGCCACCACGCCTGGCTAATTTTTTGTATTTTAGTAGAGATGGGGGTTTCACCGTGTTGCCCAGGCTGGTCT
CAAACTCCTGAGCTTAGGCAATCCTCCTGGGGCAGCCTCCCAAAGTGCTAGGATTACAGGCGAGCCATGGCGCCTGGCCAGTAAAATGTT
TTCTATCTAGAATGAATCAAGGTATTTTCCTTGCTCAGTAGCTTCTAGAATAAGAAAAAAATAGCAGCAAGATCTGATTCAGAAATAGTT
GGGAGCAGAAAGTTAATATGAAGGAGTTGCTACTTGTTAACAGCCTAGAGTTGAGATCTAGAAGAATTATTACCTTTTTAAATTTGTGAT
GAAAGCTTAAATCCAGCATTTGGGAAGTTACTCTATTGGCTGAACTATTTTGGAGTTTGTAAGCTTTGTATTAGATATTCCTGATTTAAC
TGAAACTAATTTGCCACATAGCTTTAATTTCATCCCAGTTTTACTTGTTTTACTGTCCTCAAAAACTCAAGACATCTGAACTCAAAGGAT
CTAAGCAGTATAAATTAAAGCACATGTTGAATCACTGTAGCTTTCGTAGGACATCTGATTATAATGCTTTTCTTTGTTTCTTTGTCCATA
CTGAACTTGTCTAGTTTATCTTTGAGAAACATTTGCTAGTATCAAAAATCTTCTGTAAAGATTTGAACAATCTTGAATTCTCCTTGTCAC

>85589_85589_1_SPATA7-NAA50_SPATA7_chr14_88897569_ENST00000045347_NAA50_chr3_113442939_ENST00000240922_length(amino acids)=527AA_BP=357
MDGSRRVRATSVLPRYGPPCLFKGHLSTKSNAFCTDSSSLRLSTLQLVKNHMAVHYNKILSAKAAVDCSVPVSVSTSIKYADQQRREKLK
KELAQCEKEFKLTKTAMRANYKNNSKSLFNTLQKPSGEPQIEDDMLKEEMNGFSSFARSLVPSSERLHLSLHKSSKVITNGPEKNSSSSP
SSVDYAASGPRKLSSGALYGRRPRSTFPNSHRFQLVISKAPSGDLLDKHSELFSNKQLPFTPRTLKTEAKSFLSQYRYYTPAKRKKDFTD
QRIEAETQTELSFKSELGTAETKNMTDSEMNIKQASNCVTYDAKEKIAPLPLEGHDSTWDEIKDDALQHSSPRAMCQYSLKPPSTRKIYS
DSRIELGDVTPHNIKQLKRLNQVIFPVSYNDKFYKDVLEVGELAKLAYFNDIAVGAVCCRVDHSQNQKRLYIMTLGCLAPYRRLGIGTKM

--------------------------------------------------------------
>85589_85589_2_SPATA7-NAA50_SPATA7_chr14_88897569_ENST00000045347_NAA50_chr3_113442939_ENST00000477813_length(transcript)=2909nt_BP=1082nt
ATGGATGGCAGCCGGAGAGTCAGAGCAACCTCTGTCCTTCCCAGATATGGTCCACCGTGCCTATTTAAAGGACACTTGAGCACCAAAAGT
AATGCTTTTTGCACTGACTCCTCTTCTCTCAGACTAAGCACTCTCCAGCTGGTCAAGAATCACATGGCTGTTCACTATAATAAAATCCTT
TCAGCCAAAGCTGCAGTAGACTGCTCGGTTCCAGTAAGCGTGAGTACCAGCATAAAGTATGCAGACCAACAACGAAGAGAGAAACTCAAA
AAGGAATTAGCACAATGTGAAAAAGAGTTCAAATTAACTAAAACTGCAATGCGAGCCAATTATAAAAATAATTCCAAGTCACTTTTTAAT
ACCTTACAAAAGCCCTCAGGCGAACCGCAAATTGAGGATGACATGTTAAAAGAAGAAATGAATGGATTTTCATCCTTTGCAAGGTCACTA
GTACCCTCTTCAGAGAGACTACACCTAAGTCTACATAAATCCAGTAAAGTCATCACAAATGGTCCTGAGAAGAACTCCAGTTCCTCCCCG
TCCAGTGTGGATTATGCAGCCTCCGGGCCCCGGAAACTGAGCTCTGGAGCCCTGTATGGCAGAAGGCCCAGAAGCACATTCCCAAATTCC
CACCGGTTTCAGTTAGTCATTTCGAAAGCACCCAGTGGGGATCTTTTGGATAAACATTCTGAACTCTTTTCTAACAAACAATTGCCATTC
ACTCCTCGCACTTTAAAAACAGAAGCAAAATCTTTCCTGTCACAGTATCGCTATTATACACCTGCCAAAAGAAAAAAGGATTTTACAGAT
CAACGGATAGAAGCTGAAACCCAGACTGAATTAAGCTTTAAATCTGAGTTGGGGACAGCTGAGACTAAAAACATGACAGATTCAGAAATG
AACATAAAGCAGGCATCTAATTGTGTGACATATGATGCCAAAGAAAAAATAGCTCCTTTACCTTTAGAAGGGCATGACTCAACATGGGAT
GAGATTAAGGATGATGCTCTTCAGCATTCCTCACCAAGGGCAATGTGTCAGTATTCCCTGAAGCCCCCTTCAACTCGTAAAATCTACTCT
GATAGCCGGATCGAGCTGGGAGATGTGACACCACACAATATTAAACAGTTGAAAAGATTGAATCAGGTCATCTTTCCAGTCAGCTACAAT
GACAAGTTCTACAAGGATGTGCTGGAGGTTGGCGAGCTAGCAAAACTTGGAACTAAAATGTTAAATCATGTCTTAAACATCTGTGAAAAA
GATGGTACTTTTGACAACATTTATCTGCATGTCCAGATCAGCAATGAGTCGGCAATTGACTTCTACAGGAAGTTTGGCTTTGAGATTATT
GAGACAAAGAAGAACTACTATAAGAGGATAGAGCCCGCAGATGCTCATGTGCTGCAGAAAAACCTCAAAGTTCCTTCTGGTCAGAATGCA
GATGTGCAAAAGACAGACAACTGAACAAATTACAAATGAACTTTCTTGCACTTGCTTGTCGCCAAATAAAAGAGAGGCCCATTGATTCCT
CCCCCACCCCAACACTTTTCTTTTAAAGCTTTTCTCCCTCCTTGTTCTTGTTTTTCTTTCTTCCTTTCCTTTTCTCTGAGAGTTTTAATA
CTTTCAAGGACTTTAAAAAAATAATCATGTTTGAATTGTTTTCTCTTATTTTTGTGAGGTGGTTTGAAGGAAGGACAAGGTAGATCTGTT
TAGTTTTGCAGTTGAAGTTAGATGGTCCTAAACATTTAATTGTCAAATAATTTCAAATTTAATGTCCTGCTTTCACATTGAAGGGCAGAG
CCTACAAAACATTGTATATTTCAAAAGACAAAAAGAAGCAGCAGCAGTATCTTGTTCTCTAATTCATAGACAAGTTGAGTGTGTTTGTGG
TACTTTGGGTTTTTAAACACTTTGGGATACTAATCCCTAGACATTGCCTTCACTCCACCTTTAGTCCTTCTGAGCACTCTCTCGGGAGTT
GGAACATTGTTATCCTTGTAAGAAATACTAAGCTTATGTTGATTTTTAAGTAATTATATCTTCTCTTCTTGCTGGTGGGTGGGGCAGTTT
GGTTTAGTGTTATACTTTGGTCTAAGTATTTGAGTTAAACTGCTTTTTTGCTAATGAGTGGGCTGGTTGTTAGCAGGTTTGTTTTTCCTG
CTGTTGATTGTTACTAGTGGCATTAACTTTTAGAATTTGGGCTGGTGAGATTAATTTTTTTTAATATCCCAGCTAGAGATATGGCCTTTA
ACTGACCTAAAGAGGTGTGTTGTGATTTAATTTTTTCCCGTTCCTTTTTCTTCAGTAAACCCAACAATAGTCTAACCTTAAAAATTGAGT
TGATGTCCTTATAGGTCACTACCCCTAAATAAACCTGAAGCAGGTGTTTTCTCTTGGACATACTAAAAAATACCTAAAAGGAAGCTTAGA
TGGGCTGTGACACAAAAAATTCAATTACTGTCATCTAATGCCAGCTGTTAAAAGTGTGGCCACTGAGCATTTGATTTTATAGGAAAAAAT
AGTATTTTTGAGAATAACATAGCTGTGCTATTGCACATCTGTTGGAGGACATCCCAGATTTGCTTATACTCAGTGCCTGTGATATTGAGT
TTAAGGATTTGAGGCAGGGGTAATTATTAAACATATTGCTTCTATTCTTGGAAAAATAGAAGTGTAAAATGTTAATAATACAAATGTCAC
TGTGACCTCCTCCACTGAGAGGACTGGTTTATGCCAGATCATTTTCCGGCACACACGGAGTGGCTTTGACAGATTGATAACTTTGTAAGA
TGGGAGACATCTGAAATATTCATGTTTTCCTTTTGTAGTCCCATCTCCACTATTTAGAAATGTTCTCAGACTTTAAAATAATGCACAGGG

>85589_85589_2_SPATA7-NAA50_SPATA7_chr14_88897569_ENST00000045347_NAA50_chr3_113442939_ENST00000477813_length(amino acids)=487AA_BP=357
MDGSRRVRATSVLPRYGPPCLFKGHLSTKSNAFCTDSSSLRLSTLQLVKNHMAVHYNKILSAKAAVDCSVPVSVSTSIKYADQQRREKLK
KELAQCEKEFKLTKTAMRANYKNNSKSLFNTLQKPSGEPQIEDDMLKEEMNGFSSFARSLVPSSERLHLSLHKSSKVITNGPEKNSSSSP
SSVDYAASGPRKLSSGALYGRRPRSTFPNSHRFQLVISKAPSGDLLDKHSELFSNKQLPFTPRTLKTEAKSFLSQYRYYTPAKRKKDFTD
QRIEAETQTELSFKSELGTAETKNMTDSEMNIKQASNCVTYDAKEKIAPLPLEGHDSTWDEIKDDALQHSSPRAMCQYSLKPPSTRKIYS
DSRIELGDVTPHNIKQLKRLNQVIFPVSYNDKFYKDVLEVGELAKLGTKMLNHVLNICEKDGTFDNIYLHVQISNESAIDFYRKFGFEII

--------------------------------------------------------------
>85589_85589_3_SPATA7-NAA50_SPATA7_chr14_88897569_ENST00000356583_NAA50_chr3_113442939_ENST00000240922_length(transcript)=6963nt_BP=1161nt
ACAATAGCGACTCACTGGACCCAGCCCTTAGCAACGGCCTGGCAACGGTTTCCCTGCTGCTGCAGCCCCCGTCGGCTCCTCTTTTCCAGT
CCTCCACTGCCGGGGCTGGGCCCGGCCGCGGGAAGGACCGAAGGGGATACAGCGTGTCCCTGCGGCGGCTGCAAGAGGACTAAGCATGGA
TGGCAGCCGGAGAGTCAGAGCAACCTCTGTCCTTCCCAGATATGGTCCACCGTGCCTATTTAAAGGACACTTGAGCACCAAAAGTAATGC
TGCAGTAGACTGCTCGGTTCCAGTAAGCGTGAGTACCAGCATAAAGTATGCAGACCAACAACGAAGAGAGAAACTCAAAAAGGAATTAGC
ACAATGTGAAAAAGAGTTCAAATTAACTAAAACTGCAATGCGAGCCAATTATAAAAATAATTCCAAGTCACTTTTTAATACCTTACAAAA
GCCCTCAGGCGAACCGCAAATTGAGGATGACATGTTAAAAGAAGAAATGAATGGATTTTCATCCTTTGCAAGGTCACTAGTACCCTCTTC
AGAGAGACTACACCTAAGTCTACATAAATCCAGTAAAGTCATCACAAATGGTCCTGAGAAGAACTCCAGTTCCTCCCCGTCCAGTGTGGA
TTATGCAGCCTCCGGGCCCCGGAAACTGAGCTCTGGAGCCCTGTATGGCAGAAGGCCCAGAAGCACATTCCCAAATTCCCACCGGTTTCA
GTTAGTCATTTCGAAAGCACCCAGTGGGGATCTTTTGGATAAACATTCTGAACTCTTTTCTAACAAACAATTGCCATTCACTCCTCGCAC
TTTAAAAACAGAAGCAAAATCTTTCCTGTCACAGTATCGCTATTATACACCTGCCAAAAGAAAAAAGGATTTTACAGATCAACGGATAGA
AGCTGAAACCCAGACTGAATTAAGCTTTAAATCTGAGTTGGGGACAGCTGAGACTAAAAACATGACAGATTCAGAAATGAACATAAAGCA
GGCATCTAATTGTGTGACATATGATGCCAAAGAAAAAATAGCTCCTTTACCTTTAGAAGGGCATGACTCAACATGGGATGAGATTAAGGA
TGATGCTCTTCAGCATTCCTCACCAAGGGCAATGTGTCAGTATTCCCTGAAGCCCCCTTCAACTCGTAAAATCTACTCTGATAGCCGGAT
CGAGCTGGGAGATGTGACACCACACAATATTAAACAGTTGAAAAGATTGAATCAGGTCATCTTTCCAGTCAGCTACAATGACAAGTTCTA
CAAGGATGTGCTGGAGGTTGGCGAGCTAGCAAAACTTGCCTATTTCAATGATATTGCTGTAGGTGCAGTATGCTGTAGGGTGGATCATTC
ACAGAATCAGAAGAGACTTTACATCATGACACTAGGATGTCTGGCACCTTACCGAAGGCTAGGAATAGGAACTAAAATGTTAAATCATGT
CTTAAACATCTGTGAAAAAGATGGTACTTTTGACAACATTTATCTGCATGTCCAGATCAGCAATGAGTCGGCAATTGACTTCTACAGGAA
GTTTGGCTTTGAGATTATTGAGACAAAGAAGAACTACTATAAGAGGATAGAGCCCGCAGATGCTCATGTGCTGCAGAAAAACCTCAAAGT
TCCTTCTGGTCAGAATGCAGATGTGCAAAAGACAGACAACTGAACAAATTACAAATGAACTTTCTTGCACTTGCTTGTCGCCAAATAAAA
GAGAGGCCCATTGATTCCTCCCCCACCCCAACACTTTTCTTTTAAAGCTTTTCTCCCTCCTTGTTCTTGTTTTTCTTTCTTCCTTTCCTT
TTCTCTGAGAGTTTTAATACTTTCAAGGACTTTAAAAAAATAATCATGTTTGAATTGTTTTCTCTTATTTTTGTGAGGTGGTTTGAAGGA
AGGACAAGGTAGATCTGTTTAGTTTTGCAGTTGAAGTTAGATGGTCCTAAACATTTAATTGTCAAATAATTTCAAATTTAATGTCCTGCT
TTCACATTGAAGGGCAGAGCCTACAAAACATTGTATATTTCAAAAGACAAAAAGAAGCAGCAGCAGTATCTTGTTCTCTAATTCATAGAC
AAGTTGAGTGTGTTTGTGGTACTTTGGGTTTTTAAACACTTTGGGATACTAATCCCTAGACATTGCCTTCACTCCACCTTTAGTCCTTCT
GAGCACTCTCTCGGGAGTTGGAACATTGTTATCCTTGTAAGAAATACTAAGCTTATGTTGATTTTTAAGTAATTATATCTTCTCTTCTTG
CTGGTGGGTGGGGCAGTTTGGTTTAGTGTTATACTTTGGTCTAAGTATTTGAGTTAAACTGCTTTTTTGCTAATGAGTGGGCTGGTTGTT
AGCAGGTTTGTTTTTCCTGCTGTTGATTGTTACTAGTGGCATTAACTTTTAGAATTTGGGCTGGTGAGATTAATTTTTTTTAATATCCCA
GCTAGAGATATGGCCTTTAACTGACCTAAAGAGGTGTGTTGTGATTTAATTTTTTCCCGTTCCTTTTTCTTCAGTAAACCCAACAATAGT
CTAACCTTAAAAATTGAGTTGATGTCCTTATAGGTCACTACCCCTAAATAAACCTGAAGCAGGTGTTTTCTCTTGGACATACTAAAAAAT
ACCTAAAAGGAAGCTTAGATGGGCTGTGACACAAAAAATTCAATTACTGTCATCTAATGCCAGCTGTTAAAAGTGTGGCCACTGAGCATT
TGATTTTATAGGAAAAAATAGTATTTTTGAGAATAACATAGCTGTGCTATTGCACATCTGTTGGAGGACATCCCAGATTTGCTTATACTC
AGTGCCTGTGATATTGAGTTTAAGGATTTGAGGCAGGGGTAATTATTAAACATATTGCTTCTATTCTTGGAAAAATAGAAGTGTAAAATG
TTAATAATACAAATGTCACTGTGACCTCCTCCACTGAGAGGACTGGTTTATGCCAGATCATTTTCCGGCACACACGGAGTGGCTTTGACA
GATTGATAACTTTGTAAGATGGGAGACATCTGAAATATTCATGTTTTCCTTTTGTAGTCCCATCTCCACTATTTAGAAATGTTCTCAGAC
TTTAAAATAATGCACAGGGCTTGAGCTTTCTGTCATTTGACTTTAAAAGGAAGTTTCATTCATATTTATCCTCTTATGTAAAATTGCGGT
ATAAAGTCTCATTTCCAAATATGTTAAATGACAAAATTATTTTATAAAATGTTTATGCACACTTTATAACCTTAAGTTTTTATTTGAGAA
TGTGAAAGTACAAAGTGCAGTAGACTTCAACAATCTTGAGTGCCAAGAATAATACAGAAAAAGAAGACAGTTGATGAATGAGTTTATAGG
GTTCTAATCTTAAGATGGTAAAAATGTAGAAAGACCTTGCTGGTTTTTTGGGGGTATTCGTTTCTTAAACAATCCAAATCTAAGCTTAGA
AGAAAAGTTTAGCGTTAAGCACCTTTATCTTCATGAATAAGCTTCAGCTTGCTCTTGGCAAGAGAAGAGTGCTTGAGTTACAGAAGGCAT
AAGTAGTTTGAAGAATGCAGCAGCCTTTTTGTAAACTTCCCAGATATCAAAATAGACTTTGATATATAAATGGTTTTCTGAGATGACACT
GCCTCTATTTCTATAACCATTTCACCTGGACTATCTAATCAGTCCTATGAATGTATCCCTAAATGTGGTTATTGAAAACCTAATAGCTGC
CTCATGACAAGTACATGTTATTTAAGGAGGAAAAAATATTAAATTTTGAATTGAGTGTGTAGGCTCCCTATCATTATATATAGAGTTTCT
TTTTCCACGGTAGTCAGTGACTTAACCTGAATTGTAAATGTTTGTAAAGGGTTAATTGTCCTACATCAAACTTAGTTAAATAATTCCATC
CACTTATGGAGGAGGAGGAGAATGTGGAAGAGGTAAAAAGCTGGGCACAAGTTCATATGCCTATGAGTCAGTAAAGACTGAAGTAATGTC
CTATGTTGAGCTGGTTATTTTGATATATGATAATAATTATCTTTGAAGTAGAACAATTCTGTTAACTGGAAAATCACAGGATATATCCAT
CATATTTTTCAGGACAGATAGTTTTTACTGTGGGGCAAATAGGTTAAAATTACACTATGTTAGTTGCATTTAGGTTTTAAAGCAAAGAAT
CTGTAGAGAAATCTATGCAATATATAGTTTGTCCAGATTAGCTTTCATTTGGGGAATGAAGTTCTGAAATATCTAAAGCAGTTTACTCAT
CAATTGAAAAGTCCTCCAAAAAGAGAACTATTGGGAAACCATGGTGTGGTGGTGGAAAAGAAAAGCTCCCTCAGTTTTTTGGAGGGAATA
ACTTAAAAAAATACTTAAATGGCTAAGTTTACTTGGTGCAGTTAAGAATTAAACTTGTCAATTTTAACATTGCTGTTACATCTGAAATAA
ACTTATGTGATGTTCTGGTAGTGATCTGTGATATCTGTAAATGTCAAAACTGTATTGTTGAATTCTGCAGCCAGCAACCGTACATACTCA
TTGTGTAGTGTTTCCCTTACTGCTTTTATTTACTTTCACATTTAAGATCTAATTTTAAAATCATTAAAATAGGCCAAGTGTAATTAAGGC
ATCCTAATTCAGATCTTCTGTGACTTGCTAGGTACAGTGCCCAATATCTACAATTCAGTTCCAATGAGAGGGAAAAAGTAAGATCCAAGG
AACTGTCTTGCTGCTGTATTTTTAATGTAATTATTAGAAATACTTGACACTTTAGGCTGCAATCTAGTTAGTAATGTTTATTGGCTACAG
ACAACATATTCAGGTGTTTTGTTTTTCTTCCTTGTAAGGAAGGAGTACGGTAGTTCAATGAGTTGGGAATTGGCTTTTAAGCTTTTTATC
AAACATGAATTTAAGATTTTTTCTTTAACAGAATTCAGTATTTAATATTTCTAGTCAAAATTGTTATAATTAGATTTTTGCTTTTTTCAT
TTAAGAAGCAATGAGTGATCCTTGTCTAGTGTGTCTCAGTTATTACTGTAGTTTAAAACAACAAACAAGTATTACCTTCCACAGTTCTGG
ATCAGGAATCTAGGAGCTGCTTAACTGGGTGGTTCTGTCTTGAGATCTCAGAAGGTTGCAGTCAAGCTGTTAGCCAGGACTGTACCATCT
TGAAGCTTGACAAGGCTAGAGGAGCCATTTCCAAGATGTCTTGCTTACGTGGCTCTTGGGCTTCTGTTCCTCACTGTGTGGGTATCACAC
ACAGTGCCTGATGCTCACGTGGCTCTTGGGCTTCAGTTCCTTGGGCACCAGTTGCCTGGTGGACAACTGGTATGTGGCTTCCCCTAGAGC
AGGTGATATAAGAAAGTGAGCAGAAACCATACTTTTTTATGACGTAGCCTCAGGTGACATTATTTCTGCCATATTCCATGAGTCACACAG
ACCAACCATTGAATCAGCTTGGTATAGTGTGGGAGGGGACTGCACAAGGATGTGAATACTGGGAGGTGGATATCATTGGGCCCTCTAGGA
GACTGGCTATCACTGCTCAGAGAAGATGCTGGGCTTCGTTGACTCCCAAGGTCAGGGTAGTTTTGGTTAGGTTGTGTTTTATTAGTCTCT
AAAAGGAGTAGTTTTCTAAATGAGGGTTATAAAGCACTGCACTTTACAGTTATTCGGGATATAAAAGAAATAGTGGGTCTAAAGGCTGTG
CTCTTGTGGTTGGGTTTTCAGTGGGGAGGGGAGACTAGTCTGTCTCAGACTGATGCTGTTCTCACTGATTTGAATATTTCTGGGAAATTG
GATACTACAGTCACAAATAGGAACAGTAAGCCTATAGAAGTTTTTCAGGGAGTAAATATTTTCTATGGCAGTGTCCTGAATTGGTTCTCC
CTTGCAAGACTGAACTGTAGGAACTTAGTCCTAGTTTATGATACAGCCAAGTAACATAGTCACATGGGAAAAAGAGCTTGAGTCAGATTT
CTTAATGTGTGTTGTTAACTTGGTAGAACATTGAGAATTATTTAAGTCAGAGAACGATCTGTTACTGGGGCAGAAATTCTCAACCTTTTC
AGTTCTCCAAAATTTAAGATACTTGATTTCTTAGGTAAAATGTTTTTGTTTTTGTTTTGGAGACAGAGTCTCGCTCTGTCGCCCAGGCTG
GAGTGCAGTGGCGCGATCTTGGCTCACTGCAAACTCCGCCTCCCAGATTCAAGCAATTCTGCCTGAGCCTCCCAAGTAGCTGCGACTAGA
AAGCGCATGCCACCACGCCTGGCTAATTTTTTGTATTTTAGTAGAGATGGGGGTTTCACCGTGTTGCCCAGGCTGGTCTCAAACTCCTGA
GCTTAGGCAATCCTCCTGGGGCAGCCTCCCAAAGTGCTAGGATTACAGGCGAGCCATGGCGCCTGGCCAGTAAAATGTTTTCTATCTAGA
ATGAATCAAGGTATTTTCCTTGCTCAGTAGCTTCTAGAATAAGAAAAAAATAGCAGCAAGATCTGATTCAGAAATAGTTGGGAGCAGAAA
GTTAATATGAAGGAGTTGCTACTTGTTAACAGCCTAGAGTTGAGATCTAGAAGAATTATTACCTTTTTAAATTTGTGATGAAAGCTTAAA
TCCAGCATTTGGGAAGTTACTCTATTGGCTGAACTATTTTGGAGTTTGTAAGCTTTGTATTAGATATTCCTGATTTAACTGAAACTAATT
TGCCACATAGCTTTAATTTCATCCCAGTTTTACTTGTTTTACTGTCCTCAAAAACTCAAGACATCTGAACTCAAAGGATCTAAGCAGTAT
AAATTAAAGCACATGTTGAATCACTGTAGCTTTCGTAGGACATCTGATTATAATGCTTTTCTTTGTTTCTTTGTCCATACTGAACTTGTC
TAGTTTATCTTTGAGAAACATTTGCTAGTATCAAAAATCTTCTGTAAAGATTTGAACAATCTTGAATTCTCCTTGTCACTAGCCATCTCT

>85589_85589_3_SPATA7-NAA50_SPATA7_chr14_88897569_ENST00000356583_NAA50_chr3_113442939_ENST00000240922_length(amino acids)=495AA_BP=325
MDGSRRVRATSVLPRYGPPCLFKGHLSTKSNAAVDCSVPVSVSTSIKYADQQRREKLKKELAQCEKEFKLTKTAMRANYKNNSKSLFNTL
QKPSGEPQIEDDMLKEEMNGFSSFARSLVPSSERLHLSLHKSSKVITNGPEKNSSSSPSSVDYAASGPRKLSSGALYGRRPRSTFPNSHR
FQLVISKAPSGDLLDKHSELFSNKQLPFTPRTLKTEAKSFLSQYRYYTPAKRKKDFTDQRIEAETQTELSFKSELGTAETKNMTDSEMNI
KQASNCVTYDAKEKIAPLPLEGHDSTWDEIKDDALQHSSPRAMCQYSLKPPSTRKIYSDSRIELGDVTPHNIKQLKRLNQVIFPVSYNDK
FYKDVLEVGELAKLAYFNDIAVGAVCCRVDHSQNQKRLYIMTLGCLAPYRRLGIGTKMLNHVLNICEKDGTFDNIYLHVQISNESAIDFY

--------------------------------------------------------------
>85589_85589_4_SPATA7-NAA50_SPATA7_chr14_88897569_ENST00000356583_NAA50_chr3_113442939_ENST00000477813_length(transcript)=2988nt_BP=1161nt
ACAATAGCGACTCACTGGACCCAGCCCTTAGCAACGGCCTGGCAACGGTTTCCCTGCTGCTGCAGCCCCCGTCGGCTCCTCTTTTCCAGT
CCTCCACTGCCGGGGCTGGGCCCGGCCGCGGGAAGGACCGAAGGGGATACAGCGTGTCCCTGCGGCGGCTGCAAGAGGACTAAGCATGGA
TGGCAGCCGGAGAGTCAGAGCAACCTCTGTCCTTCCCAGATATGGTCCACCGTGCCTATTTAAAGGACACTTGAGCACCAAAAGTAATGC
TGCAGTAGACTGCTCGGTTCCAGTAAGCGTGAGTACCAGCATAAAGTATGCAGACCAACAACGAAGAGAGAAACTCAAAAAGGAATTAGC
ACAATGTGAAAAAGAGTTCAAATTAACTAAAACTGCAATGCGAGCCAATTATAAAAATAATTCCAAGTCACTTTTTAATACCTTACAAAA
GCCCTCAGGCGAACCGCAAATTGAGGATGACATGTTAAAAGAAGAAATGAATGGATTTTCATCCTTTGCAAGGTCACTAGTACCCTCTTC
AGAGAGACTACACCTAAGTCTACATAAATCCAGTAAAGTCATCACAAATGGTCCTGAGAAGAACTCCAGTTCCTCCCCGTCCAGTGTGGA
TTATGCAGCCTCCGGGCCCCGGAAACTGAGCTCTGGAGCCCTGTATGGCAGAAGGCCCAGAAGCACATTCCCAAATTCCCACCGGTTTCA
GTTAGTCATTTCGAAAGCACCCAGTGGGGATCTTTTGGATAAACATTCTGAACTCTTTTCTAACAAACAATTGCCATTCACTCCTCGCAC
TTTAAAAACAGAAGCAAAATCTTTCCTGTCACAGTATCGCTATTATACACCTGCCAAAAGAAAAAAGGATTTTACAGATCAACGGATAGA
AGCTGAAACCCAGACTGAATTAAGCTTTAAATCTGAGTTGGGGACAGCTGAGACTAAAAACATGACAGATTCAGAAATGAACATAAAGCA
GGCATCTAATTGTGTGACATATGATGCCAAAGAAAAAATAGCTCCTTTACCTTTAGAAGGGCATGACTCAACATGGGATGAGATTAAGGA
TGATGCTCTTCAGCATTCCTCACCAAGGGCAATGTGTCAGTATTCCCTGAAGCCCCCTTCAACTCGTAAAATCTACTCTGATAGCCGGAT
CGAGCTGGGAGATGTGACACCACACAATATTAAACAGTTGAAAAGATTGAATCAGGTCATCTTTCCAGTCAGCTACAATGACAAGTTCTA
CAAGGATGTGCTGGAGGTTGGCGAGCTAGCAAAACTTGGAACTAAAATGTTAAATCATGTCTTAAACATCTGTGAAAAAGATGGTACTTT
TGACAACATTTATCTGCATGTCCAGATCAGCAATGAGTCGGCAATTGACTTCTACAGGAAGTTTGGCTTTGAGATTATTGAGACAAAGAA
GAACTACTATAAGAGGATAGAGCCCGCAGATGCTCATGTGCTGCAGAAAAACCTCAAAGTTCCTTCTGGTCAGAATGCAGATGTGCAAAA
GACAGACAACTGAACAAATTACAAATGAACTTTCTTGCACTTGCTTGTCGCCAAATAAAAGAGAGGCCCATTGATTCCTCCCCCACCCCA
ACACTTTTCTTTTAAAGCTTTTCTCCCTCCTTGTTCTTGTTTTTCTTTCTTCCTTTCCTTTTCTCTGAGAGTTTTAATACTTTCAAGGAC
TTTAAAAAAATAATCATGTTTGAATTGTTTTCTCTTATTTTTGTGAGGTGGTTTGAAGGAAGGACAAGGTAGATCTGTTTAGTTTTGCAG
TTGAAGTTAGATGGTCCTAAACATTTAATTGTCAAATAATTTCAAATTTAATGTCCTGCTTTCACATTGAAGGGCAGAGCCTACAAAACA
TTGTATATTTCAAAAGACAAAAAGAAGCAGCAGCAGTATCTTGTTCTCTAATTCATAGACAAGTTGAGTGTGTTTGTGGTACTTTGGGTT
TTTAAACACTTTGGGATACTAATCCCTAGACATTGCCTTCACTCCACCTTTAGTCCTTCTGAGCACTCTCTCGGGAGTTGGAACATTGTT
ATCCTTGTAAGAAATACTAAGCTTATGTTGATTTTTAAGTAATTATATCTTCTCTTCTTGCTGGTGGGTGGGGCAGTTTGGTTTAGTGTT
ATACTTTGGTCTAAGTATTTGAGTTAAACTGCTTTTTTGCTAATGAGTGGGCTGGTTGTTAGCAGGTTTGTTTTTCCTGCTGTTGATTGT
TACTAGTGGCATTAACTTTTAGAATTTGGGCTGGTGAGATTAATTTTTTTTAATATCCCAGCTAGAGATATGGCCTTTAACTGACCTAAA
GAGGTGTGTTGTGATTTAATTTTTTCCCGTTCCTTTTTCTTCAGTAAACCCAACAATAGTCTAACCTTAAAAATTGAGTTGATGTCCTTA
TAGGTCACTACCCCTAAATAAACCTGAAGCAGGTGTTTTCTCTTGGACATACTAAAAAATACCTAAAAGGAAGCTTAGATGGGCTGTGAC
ACAAAAAATTCAATTACTGTCATCTAATGCCAGCTGTTAAAAGTGTGGCCACTGAGCATTTGATTTTATAGGAAAAAATAGTATTTTTGA
GAATAACATAGCTGTGCTATTGCACATCTGTTGGAGGACATCCCAGATTTGCTTATACTCAGTGCCTGTGATATTGAGTTTAAGGATTTG
AGGCAGGGGTAATTATTAAACATATTGCTTCTATTCTTGGAAAAATAGAAGTGTAAAATGTTAATAATACAAATGTCACTGTGACCTCCT
CCACTGAGAGGACTGGTTTATGCCAGATCATTTTCCGGCACACACGGAGTGGCTTTGACAGATTGATAACTTTGTAAGATGGGAGACATC
TGAAATATTCATGTTTTCCTTTTGTAGTCCCATCTCCACTATTTAGAAATGTTCTCAGACTTTAAAATAATGCACAGGGCTTGAGCTTTC

>85589_85589_4_SPATA7-NAA50_SPATA7_chr14_88897569_ENST00000356583_NAA50_chr3_113442939_ENST00000477813_length(amino acids)=455AA_BP=325
MDGSRRVRATSVLPRYGPPCLFKGHLSTKSNAAVDCSVPVSVSTSIKYADQQRREKLKKELAQCEKEFKLTKTAMRANYKNNSKSLFNTL
QKPSGEPQIEDDMLKEEMNGFSSFARSLVPSSERLHLSLHKSSKVITNGPEKNSSSSPSSVDYAASGPRKLSSGALYGRRPRSTFPNSHR
FQLVISKAPSGDLLDKHSELFSNKQLPFTPRTLKTEAKSFLSQYRYYTPAKRKKDFTDQRIEAETQTELSFKSELGTAETKNMTDSEMNI
KQASNCVTYDAKEKIAPLPLEGHDSTWDEIKDDALQHSSPRAMCQYSLKPPSTRKIYSDSRIELGDVTPHNIKQLKRLNQVIFPVSYNDK
FYKDVLEVGELAKLGTKMLNHVLNICEKDGTFDNIYLHVQISNESAIDFYRKFGFEIIETKKNYYKRIEPADAHVLQKNLKVPSGQNADV

--------------------------------------------------------------
>85589_85589_5_SPATA7-NAA50_SPATA7_chr14_88897569_ENST00000393545_NAA50_chr3_113442939_ENST00000240922_length(transcript)=7173nt_BP=1371nt
GACTGGGGCTCGATGCCCCTTCCCCGCCAGGTCTCTCCGTCCGCAACTGTCCTCCTAGTACCGGGTATCCCGCAGGCGGGGCTGCGGAAC
GCACGTCCCCTGCGCCGTGACGTCACAATAGCGACTCACTGGACCCAGCCCTTAGCAACGGCCTGGCAACGGTTTCCCTGCTGCTGCAGC
CCCCGTCGGCTCCTCTTTTCCAGTCCTCCACTGCCGGGGCTGGGCCCGGCCGCGGGAAGGACCGAAGGGGATACAGCGTGTCCCTGCGGC
GGCTGCAAGAGGACTAAGCATGGATGGCAGCCGGAGAGTCAGAGCAACCTCTGTCCTTCCCAGATATGGTCCACCGTGCCTATTTAAAGG
ACACTTGAGCACCAAAAGTAATGCTTTTTGCACTGACTCCTCTTCTCTCAGACTAAGCACTCTCCAGCTGGTCAAGAATCACATGGCTGT
TCACTATAATAAAATCCTTTCAGCCAAAGCTGCAGTAGACTGCTCGGTTCCAGTAAGCGTGAGTACCAGCATAAAGTATGCAGACCAACA
ACGAAGAGAGAAACTCAAAAAGGAATTAGCACAATGTGAAAAAGAGTTCAAATTAACTAAAACTGCAATGCGAGCCAATTATAAAAATAA
TTCCAAGTCACTTTTTAATACCTTACAAAAGCCCTCAGGCGAACCGCAAATTGAGGATGACATGTTAAAAGAAGAAATGAATGGATTTTC
ATCCTTTGCAAGGTCACTAGTACCCTCTTCAGAGAGACTACACCTAAGTCTACATAAATCCAGTAAAGTCATCACAAATGGTCCTGAGAA
GAACTCCAGTTCCTCCCCGTCCAGTGTGGATTATGCAGCCTCCGGGCCCCGGAAACTGAGCTCTGGAGCCCTGTATGGCAGAAGGCCCAG
AAGCACATTCCCAAATTCCCACCGGTTTCAGTTAGTCATTTCGAAAGCACCCAGTGGGGATCTTTTGGATAAACATTCTGAACTCTTTTC
TAACAAACAATTGCCATTCACTCCTCGCACTTTAAAAACAGAAGCAAAATCTTTCCTGTCACAGTATCGCTATTATACACCTGCCAAAAG
AAAAAAGGATTTTACAGATCAACGGATAGAAGCTGAAACCCAGACTGAATTAAGCTTTAAATCTGAGTTGGGGACAGCTGAGACTAAAAA
CATGACAGATTCAGAAATGAACATAAAGCAGGCATCTAATTGTGTGACATATGATGCCAAAGAAAAAATAGCTCCTTTACCTTTAGAAGG
GCATGACTCAACATGGGATGAGATTAAGGATGATGCTCTTCAGCATTCCTCACCAAGGGCAATGTGTCAGTATTCCCTGAAGCCCCCTTC
AACTCGTAAAATCTACTCTGATAGCCGGATCGAGCTGGGAGATGTGACACCACACAATATTAAACAGTTGAAAAGATTGAATCAGGTCAT
CTTTCCAGTCAGCTACAATGACAAGTTCTACAAGGATGTGCTGGAGGTTGGCGAGCTAGCAAAACTTGCCTATTTCAATGATATTGCTGT
AGGTGCAGTATGCTGTAGGGTGGATCATTCACAGAATCAGAAGAGACTTTACATCATGACACTAGGATGTCTGGCACCTTACCGAAGGCT
AGGAATAGGAACTAAAATGTTAAATCATGTCTTAAACATCTGTGAAAAAGATGGTACTTTTGACAACATTTATCTGCATGTCCAGATCAG
CAATGAGTCGGCAATTGACTTCTACAGGAAGTTTGGCTTTGAGATTATTGAGACAAAGAAGAACTACTATAAGAGGATAGAGCCCGCAGA
TGCTCATGTGCTGCAGAAAAACCTCAAAGTTCCTTCTGGTCAGAATGCAGATGTGCAAAAGACAGACAACTGAACAAATTACAAATGAAC
TTTCTTGCACTTGCTTGTCGCCAAATAAAAGAGAGGCCCATTGATTCCTCCCCCACCCCAACACTTTTCTTTTAAAGCTTTTCTCCCTCC
TTGTTCTTGTTTTTCTTTCTTCCTTTCCTTTTCTCTGAGAGTTTTAATACTTTCAAGGACTTTAAAAAAATAATCATGTTTGAATTGTTT
TCTCTTATTTTTGTGAGGTGGTTTGAAGGAAGGACAAGGTAGATCTGTTTAGTTTTGCAGTTGAAGTTAGATGGTCCTAAACATTTAATT
GTCAAATAATTTCAAATTTAATGTCCTGCTTTCACATTGAAGGGCAGAGCCTACAAAACATTGTATATTTCAAAAGACAAAAAGAAGCAG
CAGCAGTATCTTGTTCTCTAATTCATAGACAAGTTGAGTGTGTTTGTGGTACTTTGGGTTTTTAAACACTTTGGGATACTAATCCCTAGA
CATTGCCTTCACTCCACCTTTAGTCCTTCTGAGCACTCTCTCGGGAGTTGGAACATTGTTATCCTTGTAAGAAATACTAAGCTTATGTTG
ATTTTTAAGTAATTATATCTTCTCTTCTTGCTGGTGGGTGGGGCAGTTTGGTTTAGTGTTATACTTTGGTCTAAGTATTTGAGTTAAACT
GCTTTTTTGCTAATGAGTGGGCTGGTTGTTAGCAGGTTTGTTTTTCCTGCTGTTGATTGTTACTAGTGGCATTAACTTTTAGAATTTGGG
CTGGTGAGATTAATTTTTTTTAATATCCCAGCTAGAGATATGGCCTTTAACTGACCTAAAGAGGTGTGTTGTGATTTAATTTTTTCCCGT
TCCTTTTTCTTCAGTAAACCCAACAATAGTCTAACCTTAAAAATTGAGTTGATGTCCTTATAGGTCACTACCCCTAAATAAACCTGAAGC
AGGTGTTTTCTCTTGGACATACTAAAAAATACCTAAAAGGAAGCTTAGATGGGCTGTGACACAAAAAATTCAATTACTGTCATCTAATGC
CAGCTGTTAAAAGTGTGGCCACTGAGCATTTGATTTTATAGGAAAAAATAGTATTTTTGAGAATAACATAGCTGTGCTATTGCACATCTG
TTGGAGGACATCCCAGATTTGCTTATACTCAGTGCCTGTGATATTGAGTTTAAGGATTTGAGGCAGGGGTAATTATTAAACATATTGCTT
CTATTCTTGGAAAAATAGAAGTGTAAAATGTTAATAATACAAATGTCACTGTGACCTCCTCCACTGAGAGGACTGGTTTATGCCAGATCA
TTTTCCGGCACACACGGAGTGGCTTTGACAGATTGATAACTTTGTAAGATGGGAGACATCTGAAATATTCATGTTTTCCTTTTGTAGTCC
CATCTCCACTATTTAGAAATGTTCTCAGACTTTAAAATAATGCACAGGGCTTGAGCTTTCTGTCATTTGACTTTAAAAGGAAGTTTCATT
CATATTTATCCTCTTATGTAAAATTGCGGTATAAAGTCTCATTTCCAAATATGTTAAATGACAAAATTATTTTATAAAATGTTTATGCAC
ACTTTATAACCTTAAGTTTTTATTTGAGAATGTGAAAGTACAAAGTGCAGTAGACTTCAACAATCTTGAGTGCCAAGAATAATACAGAAA
AAGAAGACAGTTGATGAATGAGTTTATAGGGTTCTAATCTTAAGATGGTAAAAATGTAGAAAGACCTTGCTGGTTTTTTGGGGGTATTCG
TTTCTTAAACAATCCAAATCTAAGCTTAGAAGAAAAGTTTAGCGTTAAGCACCTTTATCTTCATGAATAAGCTTCAGCTTGCTCTTGGCA
AGAGAAGAGTGCTTGAGTTACAGAAGGCATAAGTAGTTTGAAGAATGCAGCAGCCTTTTTGTAAACTTCCCAGATATCAAAATAGACTTT
GATATATAAATGGTTTTCTGAGATGACACTGCCTCTATTTCTATAACCATTTCACCTGGACTATCTAATCAGTCCTATGAATGTATCCCT
AAATGTGGTTATTGAAAACCTAATAGCTGCCTCATGACAAGTACATGTTATTTAAGGAGGAAAAAATATTAAATTTTGAATTGAGTGTGT
AGGCTCCCTATCATTATATATAGAGTTTCTTTTTCCACGGTAGTCAGTGACTTAACCTGAATTGTAAATGTTTGTAAAGGGTTAATTGTC
CTACATCAAACTTAGTTAAATAATTCCATCCACTTATGGAGGAGGAGGAGAATGTGGAAGAGGTAAAAAGCTGGGCACAAGTTCATATGC
CTATGAGTCAGTAAAGACTGAAGTAATGTCCTATGTTGAGCTGGTTATTTTGATATATGATAATAATTATCTTTGAAGTAGAACAATTCT
GTTAACTGGAAAATCACAGGATATATCCATCATATTTTTCAGGACAGATAGTTTTTACTGTGGGGCAAATAGGTTAAAATTACACTATGT
TAGTTGCATTTAGGTTTTAAAGCAAAGAATCTGTAGAGAAATCTATGCAATATATAGTTTGTCCAGATTAGCTTTCATTTGGGGAATGAA
GTTCTGAAATATCTAAAGCAGTTTACTCATCAATTGAAAAGTCCTCCAAAAAGAGAACTATTGGGAAACCATGGTGTGGTGGTGGAAAAG
AAAAGCTCCCTCAGTTTTTTGGAGGGAATAACTTAAAAAAATACTTAAATGGCTAAGTTTACTTGGTGCAGTTAAGAATTAAACTTGTCA
ATTTTAACATTGCTGTTACATCTGAAATAAACTTATGTGATGTTCTGGTAGTGATCTGTGATATCTGTAAATGTCAAAACTGTATTGTTG
AATTCTGCAGCCAGCAACCGTACATACTCATTGTGTAGTGTTTCCCTTACTGCTTTTATTTACTTTCACATTTAAGATCTAATTTTAAAA
TCATTAAAATAGGCCAAGTGTAATTAAGGCATCCTAATTCAGATCTTCTGTGACTTGCTAGGTACAGTGCCCAATATCTACAATTCAGTT
CCAATGAGAGGGAAAAAGTAAGATCCAAGGAACTGTCTTGCTGCTGTATTTTTAATGTAATTATTAGAAATACTTGACACTTTAGGCTGC
AATCTAGTTAGTAATGTTTATTGGCTACAGACAACATATTCAGGTGTTTTGTTTTTCTTCCTTGTAAGGAAGGAGTACGGTAGTTCAATG
AGTTGGGAATTGGCTTTTAAGCTTTTTATCAAACATGAATTTAAGATTTTTTCTTTAACAGAATTCAGTATTTAATATTTCTAGTCAAAA
TTGTTATAATTAGATTTTTGCTTTTTTCATTTAAGAAGCAATGAGTGATCCTTGTCTAGTGTGTCTCAGTTATTACTGTAGTTTAAAACA
ACAAACAAGTATTACCTTCCACAGTTCTGGATCAGGAATCTAGGAGCTGCTTAACTGGGTGGTTCTGTCTTGAGATCTCAGAAGGTTGCA
GTCAAGCTGTTAGCCAGGACTGTACCATCTTGAAGCTTGACAAGGCTAGAGGAGCCATTTCCAAGATGTCTTGCTTACGTGGCTCTTGGG
CTTCTGTTCCTCACTGTGTGGGTATCACACACAGTGCCTGATGCTCACGTGGCTCTTGGGCTTCAGTTCCTTGGGCACCAGTTGCCTGGT
GGACAACTGGTATGTGGCTTCCCCTAGAGCAGGTGATATAAGAAAGTGAGCAGAAACCATACTTTTTTATGACGTAGCCTCAGGTGACAT
TATTTCTGCCATATTCCATGAGTCACACAGACCAACCATTGAATCAGCTTGGTATAGTGTGGGAGGGGACTGCACAAGGATGTGAATACT
GGGAGGTGGATATCATTGGGCCCTCTAGGAGACTGGCTATCACTGCTCAGAGAAGATGCTGGGCTTCGTTGACTCCCAAGGTCAGGGTAG
TTTTGGTTAGGTTGTGTTTTATTAGTCTCTAAAAGGAGTAGTTTTCTAAATGAGGGTTATAAAGCACTGCACTTTACAGTTATTCGGGAT
ATAAAAGAAATAGTGGGTCTAAAGGCTGTGCTCTTGTGGTTGGGTTTTCAGTGGGGAGGGGAGACTAGTCTGTCTCAGACTGATGCTGTT
CTCACTGATTTGAATATTTCTGGGAAATTGGATACTACAGTCACAAATAGGAACAGTAAGCCTATAGAAGTTTTTCAGGGAGTAAATATT
TTCTATGGCAGTGTCCTGAATTGGTTCTCCCTTGCAAGACTGAACTGTAGGAACTTAGTCCTAGTTTATGATACAGCCAAGTAACATAGT
CACATGGGAAAAAGAGCTTGAGTCAGATTTCTTAATGTGTGTTGTTAACTTGGTAGAACATTGAGAATTATTTAAGTCAGAGAACGATCT
GTTACTGGGGCAGAAATTCTCAACCTTTTCAGTTCTCCAAAATTTAAGATACTTGATTTCTTAGGTAAAATGTTTTTGTTTTTGTTTTGG
AGACAGAGTCTCGCTCTGTCGCCCAGGCTGGAGTGCAGTGGCGCGATCTTGGCTCACTGCAAACTCCGCCTCCCAGATTCAAGCAATTCT
GCCTGAGCCTCCCAAGTAGCTGCGACTAGAAAGCGCATGCCACCACGCCTGGCTAATTTTTTGTATTTTAGTAGAGATGGGGGTTTCACC
GTGTTGCCCAGGCTGGTCTCAAACTCCTGAGCTTAGGCAATCCTCCTGGGGCAGCCTCCCAAAGTGCTAGGATTACAGGCGAGCCATGGC
GCCTGGCCAGTAAAATGTTTTCTATCTAGAATGAATCAAGGTATTTTCCTTGCTCAGTAGCTTCTAGAATAAGAAAAAAATAGCAGCAAG
ATCTGATTCAGAAATAGTTGGGAGCAGAAAGTTAATATGAAGGAGTTGCTACTTGTTAACAGCCTAGAGTTGAGATCTAGAAGAATTATT
ACCTTTTTAAATTTGTGATGAAAGCTTAAATCCAGCATTTGGGAAGTTACTCTATTGGCTGAACTATTTTGGAGTTTGTAAGCTTTGTAT
TAGATATTCCTGATTTAACTGAAACTAATTTGCCACATAGCTTTAATTTCATCCCAGTTTTACTTGTTTTACTGTCCTCAAAAACTCAAG
ACATCTGAACTCAAAGGATCTAAGCAGTATAAATTAAAGCACATGTTGAATCACTGTAGCTTTCGTAGGACATCTGATTATAATGCTTTT
CTTTGTTTCTTTGTCCATACTGAACTTGTCTAGTTTATCTTTGAGAAACATTTGCTAGTATCAAAAATCTTCTGTAAAGATTTGAACAAT

>85589_85589_5_SPATA7-NAA50_SPATA7_chr14_88897569_ENST00000393545_NAA50_chr3_113442939_ENST00000240922_length(amino acids)=527AA_BP=357
MDGSRRVRATSVLPRYGPPCLFKGHLSTKSNAFCTDSSSLRLSTLQLVKNHMAVHYNKILSAKAAVDCSVPVSVSTSIKYADQQRREKLK
KELAQCEKEFKLTKTAMRANYKNNSKSLFNTLQKPSGEPQIEDDMLKEEMNGFSSFARSLVPSSERLHLSLHKSSKVITNGPEKNSSSSP
SSVDYAASGPRKLSSGALYGRRPRSTFPNSHRFQLVISKAPSGDLLDKHSELFSNKQLPFTPRTLKTEAKSFLSQYRYYTPAKRKKDFTD
QRIEAETQTELSFKSELGTAETKNMTDSEMNIKQASNCVTYDAKEKIAPLPLEGHDSTWDEIKDDALQHSSPRAMCQYSLKPPSTRKIYS
DSRIELGDVTPHNIKQLKRLNQVIFPVSYNDKFYKDVLEVGELAKLAYFNDIAVGAVCCRVDHSQNQKRLYIMTLGCLAPYRRLGIGTKM

--------------------------------------------------------------
>85589_85589_6_SPATA7-NAA50_SPATA7_chr14_88897569_ENST00000393545_NAA50_chr3_113442939_ENST00000477813_length(transcript)=3198nt_BP=1371nt
GACTGGGGCTCGATGCCCCTTCCCCGCCAGGTCTCTCCGTCCGCAACTGTCCTCCTAGTACCGGGTATCCCGCAGGCGGGGCTGCGGAAC
GCACGTCCCCTGCGCCGTGACGTCACAATAGCGACTCACTGGACCCAGCCCTTAGCAACGGCCTGGCAACGGTTTCCCTGCTGCTGCAGC
CCCCGTCGGCTCCTCTTTTCCAGTCCTCCACTGCCGGGGCTGGGCCCGGCCGCGGGAAGGACCGAAGGGGATACAGCGTGTCCCTGCGGC
GGCTGCAAGAGGACTAAGCATGGATGGCAGCCGGAGAGTCAGAGCAACCTCTGTCCTTCCCAGATATGGTCCACCGTGCCTATTTAAAGG
ACACTTGAGCACCAAAAGTAATGCTTTTTGCACTGACTCCTCTTCTCTCAGACTAAGCACTCTCCAGCTGGTCAAGAATCACATGGCTGT
TCACTATAATAAAATCCTTTCAGCCAAAGCTGCAGTAGACTGCTCGGTTCCAGTAAGCGTGAGTACCAGCATAAAGTATGCAGACCAACA
ACGAAGAGAGAAACTCAAAAAGGAATTAGCACAATGTGAAAAAGAGTTCAAATTAACTAAAACTGCAATGCGAGCCAATTATAAAAATAA
TTCCAAGTCACTTTTTAATACCTTACAAAAGCCCTCAGGCGAACCGCAAATTGAGGATGACATGTTAAAAGAAGAAATGAATGGATTTTC
ATCCTTTGCAAGGTCACTAGTACCCTCTTCAGAGAGACTACACCTAAGTCTACATAAATCCAGTAAAGTCATCACAAATGGTCCTGAGAA
GAACTCCAGTTCCTCCCCGTCCAGTGTGGATTATGCAGCCTCCGGGCCCCGGAAACTGAGCTCTGGAGCCCTGTATGGCAGAAGGCCCAG
AAGCACATTCCCAAATTCCCACCGGTTTCAGTTAGTCATTTCGAAAGCACCCAGTGGGGATCTTTTGGATAAACATTCTGAACTCTTTTC
TAACAAACAATTGCCATTCACTCCTCGCACTTTAAAAACAGAAGCAAAATCTTTCCTGTCACAGTATCGCTATTATACACCTGCCAAAAG
AAAAAAGGATTTTACAGATCAACGGATAGAAGCTGAAACCCAGACTGAATTAAGCTTTAAATCTGAGTTGGGGACAGCTGAGACTAAAAA
CATGACAGATTCAGAAATGAACATAAAGCAGGCATCTAATTGTGTGACATATGATGCCAAAGAAAAAATAGCTCCTTTACCTTTAGAAGG
GCATGACTCAACATGGGATGAGATTAAGGATGATGCTCTTCAGCATTCCTCACCAAGGGCAATGTGTCAGTATTCCCTGAAGCCCCCTTC
AACTCGTAAAATCTACTCTGATAGCCGGATCGAGCTGGGAGATGTGACACCACACAATATTAAACAGTTGAAAAGATTGAATCAGGTCAT
CTTTCCAGTCAGCTACAATGACAAGTTCTACAAGGATGTGCTGGAGGTTGGCGAGCTAGCAAAACTTGGAACTAAAATGTTAAATCATGT
CTTAAACATCTGTGAAAAAGATGGTACTTTTGACAACATTTATCTGCATGTCCAGATCAGCAATGAGTCGGCAATTGACTTCTACAGGAA
GTTTGGCTTTGAGATTATTGAGACAAAGAAGAACTACTATAAGAGGATAGAGCCCGCAGATGCTCATGTGCTGCAGAAAAACCTCAAAGT
TCCTTCTGGTCAGAATGCAGATGTGCAAAAGACAGACAACTGAACAAATTACAAATGAACTTTCTTGCACTTGCTTGTCGCCAAATAAAA
GAGAGGCCCATTGATTCCTCCCCCACCCCAACACTTTTCTTTTAAAGCTTTTCTCCCTCCTTGTTCTTGTTTTTCTTTCTTCCTTTCCTT
TTCTCTGAGAGTTTTAATACTTTCAAGGACTTTAAAAAAATAATCATGTTTGAATTGTTTTCTCTTATTTTTGTGAGGTGGTTTGAAGGA
AGGACAAGGTAGATCTGTTTAGTTTTGCAGTTGAAGTTAGATGGTCCTAAACATTTAATTGTCAAATAATTTCAAATTTAATGTCCTGCT
TTCACATTGAAGGGCAGAGCCTACAAAACATTGTATATTTCAAAAGACAAAAAGAAGCAGCAGCAGTATCTTGTTCTCTAATTCATAGAC
AAGTTGAGTGTGTTTGTGGTACTTTGGGTTTTTAAACACTTTGGGATACTAATCCCTAGACATTGCCTTCACTCCACCTTTAGTCCTTCT
GAGCACTCTCTCGGGAGTTGGAACATTGTTATCCTTGTAAGAAATACTAAGCTTATGTTGATTTTTAAGTAATTATATCTTCTCTTCTTG
CTGGTGGGTGGGGCAGTTTGGTTTAGTGTTATACTTTGGTCTAAGTATTTGAGTTAAACTGCTTTTTTGCTAATGAGTGGGCTGGTTGTT
AGCAGGTTTGTTTTTCCTGCTGTTGATTGTTACTAGTGGCATTAACTTTTAGAATTTGGGCTGGTGAGATTAATTTTTTTTAATATCCCA
GCTAGAGATATGGCCTTTAACTGACCTAAAGAGGTGTGTTGTGATTTAATTTTTTCCCGTTCCTTTTTCTTCAGTAAACCCAACAATAGT
CTAACCTTAAAAATTGAGTTGATGTCCTTATAGGTCACTACCCCTAAATAAACCTGAAGCAGGTGTTTTCTCTTGGACATACTAAAAAAT
ACCTAAAAGGAAGCTTAGATGGGCTGTGACACAAAAAATTCAATTACTGTCATCTAATGCCAGCTGTTAAAAGTGTGGCCACTGAGCATT
TGATTTTATAGGAAAAAATAGTATTTTTGAGAATAACATAGCTGTGCTATTGCACATCTGTTGGAGGACATCCCAGATTTGCTTATACTC
AGTGCCTGTGATATTGAGTTTAAGGATTTGAGGCAGGGGTAATTATTAAACATATTGCTTCTATTCTTGGAAAAATAGAAGTGTAAAATG
TTAATAATACAAATGTCACTGTGACCTCCTCCACTGAGAGGACTGGTTTATGCCAGATCATTTTCCGGCACACACGGAGTGGCTTTGACA
GATTGATAACTTTGTAAGATGGGAGACATCTGAAATATTCATGTTTTCCTTTTGTAGTCCCATCTCCACTATTTAGAAATGTTCTCAGAC

>85589_85589_6_SPATA7-NAA50_SPATA7_chr14_88897569_ENST00000393545_NAA50_chr3_113442939_ENST00000477813_length(amino acids)=487AA_BP=357
MDGSRRVRATSVLPRYGPPCLFKGHLSTKSNAFCTDSSSLRLSTLQLVKNHMAVHYNKILSAKAAVDCSVPVSVSTSIKYADQQRREKLK
KELAQCEKEFKLTKTAMRANYKNNSKSLFNTLQKPSGEPQIEDDMLKEEMNGFSSFARSLVPSSERLHLSLHKSSKVITNGPEKNSSSSP
SSVDYAASGPRKLSSGALYGRRPRSTFPNSHRFQLVISKAPSGDLLDKHSELFSNKQLPFTPRTLKTEAKSFLSQYRYYTPAKRKKDFTD
QRIEAETQTELSFKSELGTAETKNMTDSEMNIKQASNCVTYDAKEKIAPLPLEGHDSTWDEIKDDALQHSSPRAMCQYSLKPPSTRKIYS
DSRIELGDVTPHNIKQLKRLNQVIFPVSYNDKFYKDVLEVGELAKLGTKMLNHVLNICEKDGTFDNIYLHVQISNESAIDFYRKFGFEII

--------------------------------------------------------------
>85589_85589_7_SPATA7-NAA50_SPATA7_chr14_88897569_ENST00000556553_NAA50_chr3_113442939_ENST00000240922_length(transcript)=7347nt_BP=1545nt
AACAGTTCATAGGGACAGCACCCCCACAACAGAGAATTATACAACCAGCCCAAAATGTCAATGGACATTTGGGCTGAGAAGTCTTACATG
ACAGAATGCTTCCTACCCTGATCTGACGTTATTTCCAAGGATTCAAAGGGGACGTGGATGCCAGATGAGTCCACCTCCTCTCCCTGCTCT
TCCCGCAGGGACAGAGCCTGCCAGGTGCTGGTGACTCGGATCCTGACAGGAGGTCGCTCTGCGGAAGACTGGCAGGATCCCAGAGCAACA
GACTGGGGCTCGATGCCCCTTCCCCGCCAGGTCTCTCCGTCCGCAACTGTCCTCCTAGTACCGGGTATCCCGCAGGCGGGGCTGCGGAAC
GCACGTCCCCTGCGCCGTGACGTCACAATAGCGACTCACTGGACCCAGCCCTTAGCAACGGCCTGGCAACGGTTTCCCTGCTGCTGCAGC
CCCCGTCGGCTCCTCTTTTCCAGTCCTCCACTGCCGGGGCTGGGCCCGGCCGCGGGAAGGACCGAAGGGGATACAGCGTGTCCCTGCGGC
GGCTGCAAGAGGACTAAGCATGGATGGCAGCCGGAGAGTCAGAGCAACCTCTGTCCTTCCCAGATATGGTCCACCGTGCCTATTTAAAGG
ACACTTGAGCACCAAAAGTAATGCTGCAGTAGACTGCTCGGTTCCAGTAAGCGTGAGTACCAGCATAAAGTATGCAGACCAACAACGAAG
AGAGAAACTCAAAAAGGAATTAGCACAATGTGAAAAAGAGTTCAAATTAACTAAAACTGCAATGCGAGCCAATTATAAAAATAATTCCAA
GTCACTTTTTAATACCTTACAAAAGCCCTCAGGCGAACCGCAAATTGAGGATGACATGTTAAAAGAAGAAATGAATGGATTTTCATCCTT
TGCAAGGTCACTAGTACCCTCTTCAGAGAGACTACACCTAAGTCTACATAAATCCAGTAAAGTCATCACAAATGGTCCTGAGAAGAACTC
CAGTTCCTCCCCGTCCAGTGTGGATTATGCAGCCTCCGGGCCCCGGAAACTGAGCTCTGGAGCCCTGTATGGCAGAAGGCCCAGAAGCAC
ATTCCCAAATTCCCACCGGTTTCAGTTAGTCATTTCGAAAGCACCCAGTGGGGATCTTTTGGATAAACATTCTGAACTCTTTTCTAACAA
ACAATTGCCATTCACTCCTCGCACTTTAAAAACAGAAGCAAAATCTTTCCTGTCACAGTATCGCTATTATACACCTGCCAAAAGAAAAAA
GGATTTTACAGATCAACGGATAGAAGCTGAAACCCAGACTGAATTAAGCTTTAAATCTGAGTTGGGGACAGCTGAGACTAAAAACATGAC
AGATTCAGAAATGAACATAAAGCAGGCATCTAATTGTGTGACATATGATGCCAAAGAAAAAATAGCTCCTTTACCTTTAGAAGGGCATGA
CTCAACATGGGATGAGATTAAGGATGATGCTCTTCAGCATTCCTCACCAAGGGCAATGTGTCAGTATTCCCTGAAGCCCCCTTCAACTCG
TAAAATCTACTCTGATAGCCGGATCGAGCTGGGAGATGTGACACCACACAATATTAAACAGTTGAAAAGATTGAATCAGGTCATCTTTCC
AGTCAGCTACAATGACAAGTTCTACAAGGATGTGCTGGAGGTTGGCGAGCTAGCAAAACTTGCCTATTTCAATGATATTGCTGTAGGTGC
AGTATGCTGTAGGGTGGATCATTCACAGAATCAGAAGAGACTTTACATCATGACACTAGGATGTCTGGCACCTTACCGAAGGCTAGGAAT
AGGAACTAAAATGTTAAATCATGTCTTAAACATCTGTGAAAAAGATGGTACTTTTGACAACATTTATCTGCATGTCCAGATCAGCAATGA
GTCGGCAATTGACTTCTACAGGAAGTTTGGCTTTGAGATTATTGAGACAAAGAAGAACTACTATAAGAGGATAGAGCCCGCAGATGCTCA
TGTGCTGCAGAAAAACCTCAAAGTTCCTTCTGGTCAGAATGCAGATGTGCAAAAGACAGACAACTGAACAAATTACAAATGAACTTTCTT
GCACTTGCTTGTCGCCAAATAAAAGAGAGGCCCATTGATTCCTCCCCCACCCCAACACTTTTCTTTTAAAGCTTTTCTCCCTCCTTGTTC
TTGTTTTTCTTTCTTCCTTTCCTTTTCTCTGAGAGTTTTAATACTTTCAAGGACTTTAAAAAAATAATCATGTTTGAATTGTTTTCTCTT
ATTTTTGTGAGGTGGTTTGAAGGAAGGACAAGGTAGATCTGTTTAGTTTTGCAGTTGAAGTTAGATGGTCCTAAACATTTAATTGTCAAA
TAATTTCAAATTTAATGTCCTGCTTTCACATTGAAGGGCAGAGCCTACAAAACATTGTATATTTCAAAAGACAAAAAGAAGCAGCAGCAG
TATCTTGTTCTCTAATTCATAGACAAGTTGAGTGTGTTTGTGGTACTTTGGGTTTTTAAACACTTTGGGATACTAATCCCTAGACATTGC
CTTCACTCCACCTTTAGTCCTTCTGAGCACTCTCTCGGGAGTTGGAACATTGTTATCCTTGTAAGAAATACTAAGCTTATGTTGATTTTT
AAGTAATTATATCTTCTCTTCTTGCTGGTGGGTGGGGCAGTTTGGTTTAGTGTTATACTTTGGTCTAAGTATTTGAGTTAAACTGCTTTT
TTGCTAATGAGTGGGCTGGTTGTTAGCAGGTTTGTTTTTCCTGCTGTTGATTGTTACTAGTGGCATTAACTTTTAGAATTTGGGCTGGTG
AGATTAATTTTTTTTAATATCCCAGCTAGAGATATGGCCTTTAACTGACCTAAAGAGGTGTGTTGTGATTTAATTTTTTCCCGTTCCTTT
TTCTTCAGTAAACCCAACAATAGTCTAACCTTAAAAATTGAGTTGATGTCCTTATAGGTCACTACCCCTAAATAAACCTGAAGCAGGTGT
TTTCTCTTGGACATACTAAAAAATACCTAAAAGGAAGCTTAGATGGGCTGTGACACAAAAAATTCAATTACTGTCATCTAATGCCAGCTG
TTAAAAGTGTGGCCACTGAGCATTTGATTTTATAGGAAAAAATAGTATTTTTGAGAATAACATAGCTGTGCTATTGCACATCTGTTGGAG
GACATCCCAGATTTGCTTATACTCAGTGCCTGTGATATTGAGTTTAAGGATTTGAGGCAGGGGTAATTATTAAACATATTGCTTCTATTC
TTGGAAAAATAGAAGTGTAAAATGTTAATAATACAAATGTCACTGTGACCTCCTCCACTGAGAGGACTGGTTTATGCCAGATCATTTTCC
GGCACACACGGAGTGGCTTTGACAGATTGATAACTTTGTAAGATGGGAGACATCTGAAATATTCATGTTTTCCTTTTGTAGTCCCATCTC
CACTATTTAGAAATGTTCTCAGACTTTAAAATAATGCACAGGGCTTGAGCTTTCTGTCATTTGACTTTAAAAGGAAGTTTCATTCATATT
TATCCTCTTATGTAAAATTGCGGTATAAAGTCTCATTTCCAAATATGTTAAATGACAAAATTATTTTATAAAATGTTTATGCACACTTTA
TAACCTTAAGTTTTTATTTGAGAATGTGAAAGTACAAAGTGCAGTAGACTTCAACAATCTTGAGTGCCAAGAATAATACAGAAAAAGAAG
ACAGTTGATGAATGAGTTTATAGGGTTCTAATCTTAAGATGGTAAAAATGTAGAAAGACCTTGCTGGTTTTTTGGGGGTATTCGTTTCTT
AAACAATCCAAATCTAAGCTTAGAAGAAAAGTTTAGCGTTAAGCACCTTTATCTTCATGAATAAGCTTCAGCTTGCTCTTGGCAAGAGAA
GAGTGCTTGAGTTACAGAAGGCATAAGTAGTTTGAAGAATGCAGCAGCCTTTTTGTAAACTTCCCAGATATCAAAATAGACTTTGATATA
TAAATGGTTTTCTGAGATGACACTGCCTCTATTTCTATAACCATTTCACCTGGACTATCTAATCAGTCCTATGAATGTATCCCTAAATGT
GGTTATTGAAAACCTAATAGCTGCCTCATGACAAGTACATGTTATTTAAGGAGGAAAAAATATTAAATTTTGAATTGAGTGTGTAGGCTC
CCTATCATTATATATAGAGTTTCTTTTTCCACGGTAGTCAGTGACTTAACCTGAATTGTAAATGTTTGTAAAGGGTTAATTGTCCTACAT
CAAACTTAGTTAAATAATTCCATCCACTTATGGAGGAGGAGGAGAATGTGGAAGAGGTAAAAAGCTGGGCACAAGTTCATATGCCTATGA
GTCAGTAAAGACTGAAGTAATGTCCTATGTTGAGCTGGTTATTTTGATATATGATAATAATTATCTTTGAAGTAGAACAATTCTGTTAAC
TGGAAAATCACAGGATATATCCATCATATTTTTCAGGACAGATAGTTTTTACTGTGGGGCAAATAGGTTAAAATTACACTATGTTAGTTG
CATTTAGGTTTTAAAGCAAAGAATCTGTAGAGAAATCTATGCAATATATAGTTTGTCCAGATTAGCTTTCATTTGGGGAATGAAGTTCTG
AAATATCTAAAGCAGTTTACTCATCAATTGAAAAGTCCTCCAAAAAGAGAACTATTGGGAAACCATGGTGTGGTGGTGGAAAAGAAAAGC
TCCCTCAGTTTTTTGGAGGGAATAACTTAAAAAAATACTTAAATGGCTAAGTTTACTTGGTGCAGTTAAGAATTAAACTTGTCAATTTTA
ACATTGCTGTTACATCTGAAATAAACTTATGTGATGTTCTGGTAGTGATCTGTGATATCTGTAAATGTCAAAACTGTATTGTTGAATTCT
GCAGCCAGCAACCGTACATACTCATTGTGTAGTGTTTCCCTTACTGCTTTTATTTACTTTCACATTTAAGATCTAATTTTAAAATCATTA
AAATAGGCCAAGTGTAATTAAGGCATCCTAATTCAGATCTTCTGTGACTTGCTAGGTACAGTGCCCAATATCTACAATTCAGTTCCAATG
AGAGGGAAAAAGTAAGATCCAAGGAACTGTCTTGCTGCTGTATTTTTAATGTAATTATTAGAAATACTTGACACTTTAGGCTGCAATCTA
GTTAGTAATGTTTATTGGCTACAGACAACATATTCAGGTGTTTTGTTTTTCTTCCTTGTAAGGAAGGAGTACGGTAGTTCAATGAGTTGG
GAATTGGCTTTTAAGCTTTTTATCAAACATGAATTTAAGATTTTTTCTTTAACAGAATTCAGTATTTAATATTTCTAGTCAAAATTGTTA
TAATTAGATTTTTGCTTTTTTCATTTAAGAAGCAATGAGTGATCCTTGTCTAGTGTGTCTCAGTTATTACTGTAGTTTAAAACAACAAAC
AAGTATTACCTTCCACAGTTCTGGATCAGGAATCTAGGAGCTGCTTAACTGGGTGGTTCTGTCTTGAGATCTCAGAAGGTTGCAGTCAAG
CTGTTAGCCAGGACTGTACCATCTTGAAGCTTGACAAGGCTAGAGGAGCCATTTCCAAGATGTCTTGCTTACGTGGCTCTTGGGCTTCTG
TTCCTCACTGTGTGGGTATCACACACAGTGCCTGATGCTCACGTGGCTCTTGGGCTTCAGTTCCTTGGGCACCAGTTGCCTGGTGGACAA
CTGGTATGTGGCTTCCCCTAGAGCAGGTGATATAAGAAAGTGAGCAGAAACCATACTTTTTTATGACGTAGCCTCAGGTGACATTATTTC
TGCCATATTCCATGAGTCACACAGACCAACCATTGAATCAGCTTGGTATAGTGTGGGAGGGGACTGCACAAGGATGTGAATACTGGGAGG
TGGATATCATTGGGCCCTCTAGGAGACTGGCTATCACTGCTCAGAGAAGATGCTGGGCTTCGTTGACTCCCAAGGTCAGGGTAGTTTTGG
TTAGGTTGTGTTTTATTAGTCTCTAAAAGGAGTAGTTTTCTAAATGAGGGTTATAAAGCACTGCACTTTACAGTTATTCGGGATATAAAA
GAAATAGTGGGTCTAAAGGCTGTGCTCTTGTGGTTGGGTTTTCAGTGGGGAGGGGAGACTAGTCTGTCTCAGACTGATGCTGTTCTCACT
GATTTGAATATTTCTGGGAAATTGGATACTACAGTCACAAATAGGAACAGTAAGCCTATAGAAGTTTTTCAGGGAGTAAATATTTTCTAT
GGCAGTGTCCTGAATTGGTTCTCCCTTGCAAGACTGAACTGTAGGAACTTAGTCCTAGTTTATGATACAGCCAAGTAACATAGTCACATG
GGAAAAAGAGCTTGAGTCAGATTTCTTAATGTGTGTTGTTAACTTGGTAGAACATTGAGAATTATTTAAGTCAGAGAACGATCTGTTACT
GGGGCAGAAATTCTCAACCTTTTCAGTTCTCCAAAATTTAAGATACTTGATTTCTTAGGTAAAATGTTTTTGTTTTTGTTTTGGAGACAG
AGTCTCGCTCTGTCGCCCAGGCTGGAGTGCAGTGGCGCGATCTTGGCTCACTGCAAACTCCGCCTCCCAGATTCAAGCAATTCTGCCTGA
GCCTCCCAAGTAGCTGCGACTAGAAAGCGCATGCCACCACGCCTGGCTAATTTTTTGTATTTTAGTAGAGATGGGGGTTTCACCGTGTTG
CCCAGGCTGGTCTCAAACTCCTGAGCTTAGGCAATCCTCCTGGGGCAGCCTCCCAAAGTGCTAGGATTACAGGCGAGCCATGGCGCCTGG
CCAGTAAAATGTTTTCTATCTAGAATGAATCAAGGTATTTTCCTTGCTCAGTAGCTTCTAGAATAAGAAAAAAATAGCAGCAAGATCTGA
TTCAGAAATAGTTGGGAGCAGAAAGTTAATATGAAGGAGTTGCTACTTGTTAACAGCCTAGAGTTGAGATCTAGAAGAATTATTACCTTT
TTAAATTTGTGATGAAAGCTTAAATCCAGCATTTGGGAAGTTACTCTATTGGCTGAACTATTTTGGAGTTTGTAAGCTTTGTATTAGATA
TTCCTGATTTAACTGAAACTAATTTGCCACATAGCTTTAATTTCATCCCAGTTTTACTTGTTTTACTGTCCTCAAAAACTCAAGACATCT
GAACTCAAAGGATCTAAGCAGTATAAATTAAAGCACATGTTGAATCACTGTAGCTTTCGTAGGACATCTGATTATAATGCTTTTCTTTGT
TTCTTTGTCCATACTGAACTTGTCTAGTTTATCTTTGAGAAACATTTGCTAGTATCAAAAATCTTCTGTAAAGATTTGAACAATCTTGAA

>85589_85589_7_SPATA7-NAA50_SPATA7_chr14_88897569_ENST00000556553_NAA50_chr3_113442939_ENST00000240922_length(amino acids)=495AA_BP=325
MDGSRRVRATSVLPRYGPPCLFKGHLSTKSNAAVDCSVPVSVSTSIKYADQQRREKLKKELAQCEKEFKLTKTAMRANYKNNSKSLFNTL
QKPSGEPQIEDDMLKEEMNGFSSFARSLVPSSERLHLSLHKSSKVITNGPEKNSSSSPSSVDYAASGPRKLSSGALYGRRPRSTFPNSHR
FQLVISKAPSGDLLDKHSELFSNKQLPFTPRTLKTEAKSFLSQYRYYTPAKRKKDFTDQRIEAETQTELSFKSELGTAETKNMTDSEMNI
KQASNCVTYDAKEKIAPLPLEGHDSTWDEIKDDALQHSSPRAMCQYSLKPPSTRKIYSDSRIELGDVTPHNIKQLKRLNQVIFPVSYNDK
FYKDVLEVGELAKLAYFNDIAVGAVCCRVDHSQNQKRLYIMTLGCLAPYRRLGIGTKMLNHVLNICEKDGTFDNIYLHVQISNESAIDFY

--------------------------------------------------------------
>85589_85589_8_SPATA7-NAA50_SPATA7_chr14_88897569_ENST00000556553_NAA50_chr3_113442939_ENST00000477813_length(transcript)=3372nt_BP=1545nt
AACAGTTCATAGGGACAGCACCCCCACAACAGAGAATTATACAACCAGCCCAAAATGTCAATGGACATTTGGGCTGAGAAGTCTTACATG
ACAGAATGCTTCCTACCCTGATCTGACGTTATTTCCAAGGATTCAAAGGGGACGTGGATGCCAGATGAGTCCACCTCCTCTCCCTGCTCT
TCCCGCAGGGACAGAGCCTGCCAGGTGCTGGTGACTCGGATCCTGACAGGAGGTCGCTCTGCGGAAGACTGGCAGGATCCCAGAGCAACA
GACTGGGGCTCGATGCCCCTTCCCCGCCAGGTCTCTCCGTCCGCAACTGTCCTCCTAGTACCGGGTATCCCGCAGGCGGGGCTGCGGAAC
GCACGTCCCCTGCGCCGTGACGTCACAATAGCGACTCACTGGACCCAGCCCTTAGCAACGGCCTGGCAACGGTTTCCCTGCTGCTGCAGC
CCCCGTCGGCTCCTCTTTTCCAGTCCTCCACTGCCGGGGCTGGGCCCGGCCGCGGGAAGGACCGAAGGGGATACAGCGTGTCCCTGCGGC
GGCTGCAAGAGGACTAAGCATGGATGGCAGCCGGAGAGTCAGAGCAACCTCTGTCCTTCCCAGATATGGTCCACCGTGCCTATTTAAAGG
ACACTTGAGCACCAAAAGTAATGCTGCAGTAGACTGCTCGGTTCCAGTAAGCGTGAGTACCAGCATAAAGTATGCAGACCAACAACGAAG
AGAGAAACTCAAAAAGGAATTAGCACAATGTGAAAAAGAGTTCAAATTAACTAAAACTGCAATGCGAGCCAATTATAAAAATAATTCCAA
GTCACTTTTTAATACCTTACAAAAGCCCTCAGGCGAACCGCAAATTGAGGATGACATGTTAAAAGAAGAAATGAATGGATTTTCATCCTT
TGCAAGGTCACTAGTACCCTCTTCAGAGAGACTACACCTAAGTCTACATAAATCCAGTAAAGTCATCACAAATGGTCCTGAGAAGAACTC
CAGTTCCTCCCCGTCCAGTGTGGATTATGCAGCCTCCGGGCCCCGGAAACTGAGCTCTGGAGCCCTGTATGGCAGAAGGCCCAGAAGCAC
ATTCCCAAATTCCCACCGGTTTCAGTTAGTCATTTCGAAAGCACCCAGTGGGGATCTTTTGGATAAACATTCTGAACTCTTTTCTAACAA
ACAATTGCCATTCACTCCTCGCACTTTAAAAACAGAAGCAAAATCTTTCCTGTCACAGTATCGCTATTATACACCTGCCAAAAGAAAAAA
GGATTTTACAGATCAACGGATAGAAGCTGAAACCCAGACTGAATTAAGCTTTAAATCTGAGTTGGGGACAGCTGAGACTAAAAACATGAC
AGATTCAGAAATGAACATAAAGCAGGCATCTAATTGTGTGACATATGATGCCAAAGAAAAAATAGCTCCTTTACCTTTAGAAGGGCATGA
CTCAACATGGGATGAGATTAAGGATGATGCTCTTCAGCATTCCTCACCAAGGGCAATGTGTCAGTATTCCCTGAAGCCCCCTTCAACTCG
TAAAATCTACTCTGATAGCCGGATCGAGCTGGGAGATGTGACACCACACAATATTAAACAGTTGAAAAGATTGAATCAGGTCATCTTTCC
AGTCAGCTACAATGACAAGTTCTACAAGGATGTGCTGGAGGTTGGCGAGCTAGCAAAACTTGGAACTAAAATGTTAAATCATGTCTTAAA
CATCTGTGAAAAAGATGGTACTTTTGACAACATTTATCTGCATGTCCAGATCAGCAATGAGTCGGCAATTGACTTCTACAGGAAGTTTGG
CTTTGAGATTATTGAGACAAAGAAGAACTACTATAAGAGGATAGAGCCCGCAGATGCTCATGTGCTGCAGAAAAACCTCAAAGTTCCTTC
TGGTCAGAATGCAGATGTGCAAAAGACAGACAACTGAACAAATTACAAATGAACTTTCTTGCACTTGCTTGTCGCCAAATAAAAGAGAGG
CCCATTGATTCCTCCCCCACCCCAACACTTTTCTTTTAAAGCTTTTCTCCCTCCTTGTTCTTGTTTTTCTTTCTTCCTTTCCTTTTCTCT
GAGAGTTTTAATACTTTCAAGGACTTTAAAAAAATAATCATGTTTGAATTGTTTTCTCTTATTTTTGTGAGGTGGTTTGAAGGAAGGACA
AGGTAGATCTGTTTAGTTTTGCAGTTGAAGTTAGATGGTCCTAAACATTTAATTGTCAAATAATTTCAAATTTAATGTCCTGCTTTCACA
TTGAAGGGCAGAGCCTACAAAACATTGTATATTTCAAAAGACAAAAAGAAGCAGCAGCAGTATCTTGTTCTCTAATTCATAGACAAGTTG
AGTGTGTTTGTGGTACTTTGGGTTTTTAAACACTTTGGGATACTAATCCCTAGACATTGCCTTCACTCCACCTTTAGTCCTTCTGAGCAC
TCTCTCGGGAGTTGGAACATTGTTATCCTTGTAAGAAATACTAAGCTTATGTTGATTTTTAAGTAATTATATCTTCTCTTCTTGCTGGTG
GGTGGGGCAGTTTGGTTTAGTGTTATACTTTGGTCTAAGTATTTGAGTTAAACTGCTTTTTTGCTAATGAGTGGGCTGGTTGTTAGCAGG
TTTGTTTTTCCTGCTGTTGATTGTTACTAGTGGCATTAACTTTTAGAATTTGGGCTGGTGAGATTAATTTTTTTTAATATCCCAGCTAGA
GATATGGCCTTTAACTGACCTAAAGAGGTGTGTTGTGATTTAATTTTTTCCCGTTCCTTTTTCTTCAGTAAACCCAACAATAGTCTAACC
TTAAAAATTGAGTTGATGTCCTTATAGGTCACTACCCCTAAATAAACCTGAAGCAGGTGTTTTCTCTTGGACATACTAAAAAATACCTAA
AAGGAAGCTTAGATGGGCTGTGACACAAAAAATTCAATTACTGTCATCTAATGCCAGCTGTTAAAAGTGTGGCCACTGAGCATTTGATTT
TATAGGAAAAAATAGTATTTTTGAGAATAACATAGCTGTGCTATTGCACATCTGTTGGAGGACATCCCAGATTTGCTTATACTCAGTGCC
TGTGATATTGAGTTTAAGGATTTGAGGCAGGGGTAATTATTAAACATATTGCTTCTATTCTTGGAAAAATAGAAGTGTAAAATGTTAATA
ATACAAATGTCACTGTGACCTCCTCCACTGAGAGGACTGGTTTATGCCAGATCATTTTCCGGCACACACGGAGTGGCTTTGACAGATTGA
TAACTTTGTAAGATGGGAGACATCTGAAATATTCATGTTTTCCTTTTGTAGTCCCATCTCCACTATTTAGAAATGTTCTCAGACTTTAAA

>85589_85589_8_SPATA7-NAA50_SPATA7_chr14_88897569_ENST00000556553_NAA50_chr3_113442939_ENST00000477813_length(amino acids)=455AA_BP=325
MDGSRRVRATSVLPRYGPPCLFKGHLSTKSNAAVDCSVPVSVSTSIKYADQQRREKLKKELAQCEKEFKLTKTAMRANYKNNSKSLFNTL
QKPSGEPQIEDDMLKEEMNGFSSFARSLVPSSERLHLSLHKSSKVITNGPEKNSSSSPSSVDYAASGPRKLSSGALYGRRPRSTFPNSHR
FQLVISKAPSGDLLDKHSELFSNKQLPFTPRTLKTEAKSFLSQYRYYTPAKRKKDFTDQRIEAETQTELSFKSELGTAETKNMTDSEMNI
KQASNCVTYDAKEKIAPLPLEGHDSTWDEIKDDALQHSSPRAMCQYSLKPPSTRKIYSDSRIELGDVTPHNIKQLKRLNQVIFPVSYNDK
FYKDVLEVGELAKLGTKMLNHVLNICEKDGTFDNIYLHVQISNESAIDFYRKFGFEIIETKKNYYKRIEPADAHVLQKNLKVPSGQNADV

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for SPATA7-NAA50


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for SPATA7-NAA50


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for SPATA7-NAA50


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource