FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:HSPD1-NRK (FusionGDB2 ID:37989)

Fusion Gene Summary for HSPD1-NRK

check button Fusion gene summary
Fusion gene informationFusion gene name: HSPD1-NRK
Fusion gene ID: 37989
HgeneTgene
Gene symbol

HSPD1

NRK

Gene ID

3329

203447

Gene nameheat shock protein family D (Hsp60) member 1Nik related kinase
SynonymsCPN60|GROEL|HLD4|HSP-60|HSP60|HSP65|HuCHA60|SPG13NESK
Cytomap

2q33.1

Xq22.3

Type of geneprotein-codingprotein-coding
Description60 kDa heat shock protein, mitochondrial60 kDa chaperoninP60 lymphocyte proteinchaperonin 60epididymis secretory sperm binding proteinheat shock 60kDa protein 1 (chaperonin)heat shock protein 65mitochondrial matrix protein P1short heat shock protenik-related protein kinase
Modification date2020031520200313
UniProtAcc

P10809

Q7Z2Y5

Ensembl transtripts involved in fusion geneENST00000345042, ENST00000388968, 
ENST00000544407, 
ENST00000536164, 
ENST00000540278, ENST00000243300, 
ENST00000428173, 
Fusion gene scores* DoF score7 X 7 X 3=1472 X 2 X 1=4
# samples 72
** MAII scorelog2(7/147*10)=-1.0703893278914
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(2/4*10)=2.32192809488736
Context

PubMed: HSPD1 [Title/Abstract] AND NRK [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointHSPD1(198358883)-NRK(105139466), # samples:1
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
HgeneHSPD1

GO:0002368

B cell cytokine production

16148103

HgeneHSPD1

GO:0002755

MyD88-dependent toll-like receptor signaling pathway

16148103

HgeneHSPD1

GO:0002842

positive regulation of T cell mediated immune response to tumor cell

10663613

HgeneHSPD1

GO:0006919

activation of cysteine-type endopeptidase activity involved in apoptotic process

17823127

HgeneHSPD1

GO:0006986

response to unfolded protein

11050098

HgeneHSPD1

GO:0032727

positive regulation of interferon-alpha production

17164250

HgeneHSPD1

GO:0032729

positive regulation of interferon-gamma production

17164250

HgeneHSPD1

GO:0032733

positive regulation of interleukin-10 production

16148103

HgeneHSPD1

GO:0032735

positive regulation of interleukin-12 production

17164250

HgeneHSPD1

GO:0032755

positive regulation of interleukin-6 production

16148103

HgeneHSPD1

GO:0042026

protein refolding

11050098

HgeneHSPD1

GO:0042100

B cell proliferation

16148103

HgeneHSPD1

GO:0042110

T cell activation

15371451|17164250|18256040

HgeneHSPD1

GO:0042113

B cell activation

16148103

HgeneHSPD1

GO:0043032

positive regulation of macrophage activation

17164250

HgeneHSPD1

GO:0044406

adhesion of symbiont to host

20633027

HgeneHSPD1

GO:0048291

isotype switching to IgG isotypes

16148103

HgeneHSPD1

GO:0050870

positive regulation of T cell activation

16148103|17164250


check buttonFusion gene breakpoints across HSPD1 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across NRK (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChiTaRS5.0N/ACD657301HSPD1chr2

198358883

-NRKchrX

105139466

+


Top

Fusion Gene ORF analysis for HSPD1-NRK

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
5CDS-intronENST00000345042ENST00000536164HSPD1chr2

198358883

-NRKchrX

105139466

+
5CDS-intronENST00000345042ENST00000540278HSPD1chr2

198358883

-NRKchrX

105139466

+
5CDS-intronENST00000388968ENST00000536164HSPD1chr2

198358883

-NRKchrX

105139466

+
5CDS-intronENST00000388968ENST00000540278HSPD1chr2

198358883

-NRKchrX

105139466

+
In-frameENST00000345042ENST00000243300HSPD1chr2

198358883

-NRKchrX

105139466

+
In-frameENST00000345042ENST00000428173HSPD1chr2

198358883

-NRKchrX

105139466

+
In-frameENST00000388968ENST00000243300HSPD1chr2

198358883

-NRKchrX

105139466

+
In-frameENST00000388968ENST00000428173HSPD1chr2

198358883

-NRKchrX

105139466

+
intron-3CDSENST00000544407ENST00000243300HSPD1chr2

198358883

-NRKchrX

105139466

+
intron-3CDSENST00000544407ENST00000428173HSPD1chr2

198358883

-NRKchrX

105139466

+
intron-intronENST00000544407ENST00000536164HSPD1chr2

198358883

-NRKchrX

105139466

+
intron-intronENST00000544407ENST00000540278HSPD1chr2

198358883

-NRKchrX

105139466

+

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000388968HSPD1chr2198358883-ENST00000428173NRKchrX105139466+819996898751891400
ENST00000388968HSPD1chr2198358883-ENST00000243300NRKchrX105139466+819796898751861399
ENST00000345042HSPD1chr2198358883-ENST00000428173NRKchrX105139466+804881783650381400
ENST00000345042HSPD1chr2198358883-ENST00000243300NRKchrX105139466+804681783650351399

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000388968ENST00000428173HSPD1chr2198358883-NRKchrX105139466+0.0020754970.99792445
ENST00000388968ENST00000243300HSPD1chr2198358883-NRKchrX105139466+0.0020859130.99791414
ENST00000345042ENST00000428173HSPD1chr2198358883-NRKchrX105139466+0.0017641820.9982358
ENST00000345042ENST00000243300HSPD1chr2198358883-NRKchrX105139466+0.0017733590.9982267

Top

Fusion Genomic Features for HSPD1-NRK


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.

Top

Fusion Protein Features for HSPD1-NRK


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr2:198358883/chrX:105139466)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
HSPD1

P10809

NRK

Q7Z2Y5

FUNCTION: Chaperonin implicated in mitochondrial protein import and macromolecular assembly. Together with Hsp10, facilitates the correct folding of imported proteins. May also prevent misfolding and promote the refolding and proper assembly of unfolded polypeptides generated under stress conditions in the mitochondrial matrix (PubMed:1346131, PubMed:11422376). The functional units of these chaperonins consist of heptameric rings of the large subunit Hsp60, which function as a back-to-back double ring. In a cyclic reaction, Hsp60 ring complexes bind one unfolded substrate protein per ring, followed by the binding of ATP and association with 2 heptameric rings of the co-chaperonin Hsp10. This leads to sequestration of the substrate protein in the inner cavity of Hsp60 where, for a certain period of time, it can fold undisturbed by other cell components. Synchronous hydrolysis of ATP in all Hsp60 subunits results in the dissociation of the chaperonin rings and the release of ADP and the folded substrate protein (Probable). {ECO:0000269|PubMed:11422376, ECO:0000269|PubMed:1346131, ECO:0000305|PubMed:25918392}.FUNCTION: May phosphorylate cofilin-1 and induce actin polymerization through this process, during the late stages of embryogenesis. Involved in the TNF-alpha-induced signaling pathway (By similarity). {ECO:0000250}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneHSPD1chr2:198358883chrX:105139466ENST00000345042-612111_115233574.0Nucleotide bindingATP
HgeneHSPD1chr2:198358883chrX:105139466ENST00000388968-612111_115233574.0Nucleotide bindingATP
TgeneNRKchr2:198358883chrX:105139466ENST00000243300029725_75901583.0Coiled coilOntology_term=ECO:0000255
TgeneNRKchr2:198358883chrX:105139466ENST0000053616407725_7590189.0Coiled coilOntology_term=ECO:0000255
TgeneNRKchr2:198358883chrX:105139466ENST00000243300029402_57001583.0Compositional biasNote=Gln-rich
TgeneNRKchr2:198358883chrX:105139466ENST00000243300029917_97901583.0Compositional biasNote=Asp-rich
TgeneNRKchr2:198358883chrX:105139466ENST0000053616407402_5700189.0Compositional biasNote=Gln-rich
TgeneNRKchr2:198358883chrX:105139466ENST0000053616407917_9790189.0Compositional biasNote=Asp-rich
TgeneNRKchr2:198358883chrX:105139466ENST000002433000291209_155201583.0DomainCNH
TgeneNRKchr2:198358883chrX:105139466ENST0000024330002925_31301583.0DomainProtein kinase
TgeneNRKchr2:198358883chrX:105139466ENST00000536164071209_15520189.0DomainCNH
TgeneNRKchr2:198358883chrX:105139466ENST000005361640725_3130189.0DomainProtein kinase
TgeneNRKchr2:198358883chrX:105139466ENST0000024330002931_3901583.0Nucleotide bindingATP
TgeneNRKchr2:198358883chrX:105139466ENST000005361640731_390189.0Nucleotide bindingATP

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note


Top

Fusion Gene Sequence for HSPD1-NRK


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>37989_37989_1_HSPD1-NRK_HSPD1_chr2_198358883_ENST00000345042_NRK_chrX_105139466_ENST00000243300_length(transcript)=8046nt_BP=817nt
GCCCCGACGCGCACCGCGATTCGCCCCAAGGGCCCTGCGCAGGACGCTGACGCGAAGACTCGGAGGCGGAAGAAAAAAGGAGCTGTTTCT
AGGCTTTTCTAGGCGCCCAGCCGAGAAATGCTTCGGTTACCCACAGTCTTTCGCCAGATGAGACCGGTGTCCAGGGTACTGGCTCCTCAT
CTCACTCGGGCTTATGCCAAAGATGTAAAATTTGGTGCAGATGCCCGAGCCTTAATGCTTCAAGGTGTAGACCTTTTAGCCGATGCTGTG
GCCGTTACAATGGGGCCAAAGGGAAGAACAGTGATTATTGAGCAGAGTTGGGGAAGTCCCAAAGTAACAAAAGATGGTGTGACTGTTGCA
AAGTCAATTGACTTAAAAGATAAATACAAAAACATTGGAGCTAAACTTGTTCAAGATGTTGCCAATAACACAAATGAAGAAGCTGGGGAT
GGCACTACCACTGCTACTGTACTGGCACGCTCTATAGCCAAGGAAGGCTTCGAGAAGATTAGCAAAGGTGCTAATCCAGTGGAAATCAGG
AGAGGTGTGATGTTAGCTGTTGATGCTGTAATTGCTGAACTTAAAAAGCAGTCTAAACCTGTGACCACCCCTGAAGAAATTGCACAGGTT
GCTACGATTTCTGCAAACGGAGACAAAGAAATTGGCAATATCATCTCTGATGCAATGAAAAAAGTTGGAAGAAAGGGTGTCATCACAGTA
AAGGATGGAAAAACACTGAATGATGAATTAGAAATTATTGAAGGCATGAAGTTTGATCGAGGCTATATTTCTCCATACTTTATTAATACA
TCAAAAGCATCAAAGGTCAGAATGTGCTGCTGACTCATAATGCTGAAGTAAAACTGGTTGATTTTGGAGTGAGTGCCCAGGTGAGCAGAA
CTAATGGAAGAAGGAATAGTTTCATTGGGACACCATACTGGATGGCACCTGAGGTGATTGACTGTGATGAGGACCCAAGACGCTCCTATG
ATTACAGAAGTGATGTGTGGTCTGTGGGAATTACTGCCATTGAAATGGCTGAAGGAGCCCCTCCTCTGTGTAACCTTCAACCCTTGGAAG
CTCTCTTCGTTATTTTGCGGGAATCTGCTCCCACAGTCAAATCCAGCGGATGGTCCCGTAAGTTCCACAATTTCATGGAAAAGTGTACGA
TAAAAAATTTCCTGTTTCGTCCTACTTCTGCAAACATGCTTCAACACCCATTTGTTCGGGATATAAAAAATGAACGACATGTTGTTGAGT
CATTAACAAGGCATCTTACTGGAATCATTAAAAAAAGACAGAAAAAAGGAATACCTTTGATCTTTGAAAGAGAAGAAGCTATTAAGGAAC
AGTACACCGTGAGAAGATTCAGAGGACCCTCTTGCACTCACGAGCTTCTGAGATTGCCAACCAGCAGCAGATGCAGACCACTTAGAGTCC
TGCATGGGGAACCCTCTCAGCCAAGGTGGCTACCTGATCGAGAAGAGCCACAGGTCCAGGCACTTCAGCAGCTACAGGGAGCAGCCAGGG
TATTCATGCCACTGCAGGCTCTGGACAGTGCACCTAAGCCTCTAAAGGGGCAGGCTCAGGCACCTCAACGACTACAAGGGGCAGCTCGGG
TGTTCATGCCACTACAGGCTCAGGTGAAGGCTAAAGCCTCTAAACCTCTACAAATGCAGATTAAGGCACCTCCACGACTACGGAGGGCAG
CCAGGGTGCTCATGCCACTACAGGCACAGGTTAGGGCACCTAGGCTTCTGCAGGTACAGTCCCAGGTATCCAAAAAGCAGCAGGCCCAGA
CCCAGACATCAGAACCACAAGATTTGGACCAGGTACCAGAGGAATTTCAGGGTCAAGATCAGGTACCCGAACAACAAAGGCAGGGCCAGG
CCCCTGAACAACAGCAGAGGCACAACCAGGTGCCTGAACAAGAGCTGGAGCAGAACCAGGCACCTGAACAGCCAGAGGTACAGGAACAGG
CTGCCGAGCCTGCACAGGCAGAGACTGAGGCAGAGGAACCTGAGTCATTACGAGTAAATGCCCAGGTATTTCTGCCCCTGCTATCACAAG
ATCACCATGTGCTGTTGCCACTACATTTGGATACTCAGGTGCTCATTCCAGTAGAGGGGCAAACTGAAGGATCACCTCAGGCACAGGCTT
GGACACTAGAACCCCCACAGGCAATTGGCTCAGTTCAAGCACTGATAGAGGGACTATCAAGAGACTTGCTTCGGGCACCAAACTCAAATA
ACTCAAAGCCACTTGGTCCGTTGCAAACCCTGATGGAAAATCTGTCATCAAATAGGTTTTACTCACAACCAGAACAGGCACGGGAGAAAA
AATCAAAAGTTTCTACTCTGAGGCAAGCACTGGCAAAAAGACTATCACCAAAGAGGTTCAGGGCAAAGTCATCATGGAGACCTGAAAAGC
TTGAACTCTCGGATTTAGAAGCCCGCAGGCAAAGGCGCCAACGCAGATGGGAAGATATCTTTAATCAGCATGAGGAAGAATTGAGACAAG
TTGATAAAGACAAAGAAGATGAATCATCAGACAATGATGAAGTATTTCATTCGATTCAGGCTGAAGTCCAGATAGAGCCATTGAAGCCAT
ACATTTCAAATCCTAAAAAAATTGAGGTTCAAGAGAGATCTCCTTCTGTGCCTAACAACCAGGATCATGCACATCATGTCAAGTTCTCTT
CAAGCGTTCCTCAGCGGTCTCTTTTGGAACAAGCTCAGAAGCCCATTGACATCAGACAAAGGAGTTCGCAAAATCGTCAAAATTGGCTGG
CAGCATCAGAATCTTCTTCTGAGGAAGAAAGTCCTGTGACTGGAAGGAGGTCTCAGTCATCACCACCTTATTCTACTATTGATCAGAAGT
TGCTGGTTGACATCCATGTTCCAGATGGATTTAAAGTAGGAAAAATATCACCCCCTGTATACTTGACAAACGAATGGGTAGGCTATAATG
CACTCTCTGAAATCTTCCGGAATGATTGGTTAACTCCGGCACCTGTCATTCAGCCACCTGAAGAGGATGGTGATTATGTTGAACTCTATG
ATGCCAGTGCTGATACTGATGGTGATGATGATGATGAGTCTAATGATACTTTTGAAGATACCTATGATCATGCCAATGGCAATGATGACT
TGGATAACCAGGTTGATCAGGCTAATGATGTTTGTAAAGACCATGATGATGACAACAATAAGTTTGTTGATGATGTAAATAATAATTATT
ATGAGGCGCCTAGTTGTCCAAGGGCAAGCTATGGCAGAGATGGAAGCTGCAAGCAAGATGGTTATGATGGAAGTCGTGGAAAAGAGGAAG
CCTACAGAGGCTATGGAAGCCATACAGCCAATAGAAGCCATGGAGGAAGTGCAGCCAGTGAGGACAATGCAGCCATTGGAGATCAGGAAG
AACATGCAGCCAATATAGGCAGTGAAAGAAGAGGCAGTGAGGGTGATGGAGGTAAGGGAGTCGTTCGAACCAGTGAAGAGAGTGGAGCCC
TTGGACTCAATGGAGAAGAAAATTGCTCAGAGACAGATGGTCCAGGATTGAAGAGACCTGCGTCTCAGGACTTTGAATATCTACAGGAGG
AGCCAGGTGGTGGAAATGAGGCCTCAAATGCCATTGACTCAGGTGCTGCACCGTCAGCACCTGATCATGAGAGTGACAATAAGGACATAT
CAGAATCATCAACACAATCAGATTTTTCTGCCAATCACTCATCTCCTTCCAAAGGTTCTGGGATGTCTGCTGATGCTAACTTTGCCAGTG
CCATCTTATACGCTGGATTCGTAGAAGTACCTGAGGAATCACCTAAGCAACCCTCTGAAGTCAATGTTAACCCACTCTATGTCTCTCCTG
CATGTAAAAAACCACTAATCCACATGTATGAAAAGGAGTTCACTTCTGAGATCTGCTGTGGTTCTTTGTGGGGAGTCAATTTGCTGTTGG
GAACCCGATCTAATCTATATCTGATGGACAGAAGTGGAAAGGCTGACATTACTAAACTTATAAGGCGAAGACCATTCCGCCAGATTCAAG
TCTTAGAGCCACTCAATTTGCTGATTACCATCTCAGGTCATAAGAACAGACTTCGGGTGTATCATCTGACCTGGTTGAGGAACAAGATTT
TGAATAATGATCCAGAAAGTAAAAGAAGGCAAGAAGAAATGCTGAAGACAGAGGAAGCCTGCAAAGCTATTGATAAGTTAACAGGCTGTG
AACACTTCAGTGTCCTCCAACATGAAGAAACAACATATATTGCAATTGCTTTGAAATCATCAATTCACCTTTATGCATGGGCACCAAAGT
CCTTTGATGAAAGCACTGCTATTAAAGTATGCATTGATCAATCAGCAGACTCTGAAGGAGACTACATGTCCTATCAAGCCTATATACGAA
TACTGGCAAAAATACAGGCAGCTGATCCAGTGAACCGGTTTAAGAGACCAGATGAGCTCCTTCATTTGCTGAAGCTCAAGGTATTTCCAA
CACTTGATCATAAGCCAGTGACAGTTGACCTGGCTATTGGTTCTGAAAAAAGACTAAAGATTTTCTTCAGCTCAGCAGATGGATATCACC
TCATCGATGCAGAATCTGAGGTTATGTCTGATGTGACCCTGCCAAAGAATCCCCTGGAAATCATTATACCACAGAATATCATCATTTTAC
CTGATTGCTTGGGAATTGGCATGATGCTCACCTTCAATGCTGAAGCCCTCTCTGTGGAAGCAAATGAACAACTCTTCAAGAAGATCCTTG
AAATGTGGAAAGACATACCATCTTCTATAGCTTTTGAATGTACACAGCGAACCACAGGATGGGGCCAAAAGGCCATTGAAGTGCGCTCTT
TGCAATCCAGGGTTCTGGAAAGTGAGCTGAAGCGCAGGTCAATTAAGAAGCTGAGATTCCTGTGCACCCGGGGTGACAAGCTGTTCTTTA
CCTCTACCCTGCGCAATCACCACAGCCGGGTTTACTTCATGACACTTGGAAAACTTGAAGAGCTCCAAAGCAATTATGATGTCTAAAAGT
TTCCAGTGATTTATTACCACATTATAAACATCATGTATAGGCAGTCTGCATCTTCAGATTTCAGAGATTAAATGAGTATTCAGTTTTATT
TTTAGTAAAGATTAAATCCAAAACTTTACTTTTAATGTAGCACAGAATAGTTTTAATGAGAAATGCAGCTTTATGTATAAAATTAACTAT
AGCAAGCTCTAGGTACTCCAATGGTGTACAATGTCTTTTGCACAAACTTTGTAACTTTTGTTACTGTGAATTCAAACATTACTCTTTGGA
CAGTTTGGACAGTATCTGTATTCAGATTTTACAACATGGAGTAAAGAAACCTGTTATGAATTAGATTACAAGCAGCCTTCAAAAGAATTG
GCACTGGGATAAGATTTTTCAGAAAAAGAAAAACATCGGCAAACTGTGTGTGATTTTTCCAAAGCTATATAAAGAACCAAAGGTTTAGTC
AAGAAACAAAAATCTTAAAGATTATTATAACCCAGACTAAGGTTGAACAACCTGCATGCCCAGAGAAAACTATGGCGACAAAGGGGAAAA
GGCCACCACTCGTTTTCTCACTGATTCATGCCAATTAAGCCTACAGTTAAAGACCAGTTTTGTTCTTTTCACCCATTTTTAAGCTGGTTT
TCTCCTGATAAGAAGAAAGGAAGAAAGCCCCAGACGCTTGGTTTTTCTCAGAACCCCCAAAAGATGTGCAATAGCTGTTGTTACAAACCA
CCAAATAATACAGTTGTGAGCCTGAATACAGGACTGAACTCCTATACACGTGTACTGTAGAATGAGTATTTTTTAATACCTTAAGGTAGG
CGTCAAATTCTACTCCCCAAAGCAGAGATAGATTGATTTATCAAAATTATTATCTGGCCAACAGTGTGACTATCAGACAGCATCAAATAT
TTGCCCAATCCAAGATTAGACTACACAAAAGCTTCCTTCCAGTATTAAACAAAAAGAATTAAACATAACTATGAAAAAACTTTGCTAATA
TCTGTGTTTTTCAGATTTCATTTTTTGTAAAATCAGAAATTAATCTAAACATATTCAGTGATAAGTTCATGTGTAACGACTTAATGTTAA
AGGTTAAAAAAAAGATTTCACAAAATATACAACTTTCACCATATATATAAGCCTGCAAAATTAGAGTAGTGAAAGTCATGCTAGTCCATC
ACCCAAATATGTTATAGACGCCATAGACAGGTGATGTTTGGTCACCTATGGTAACTGCTACCTGATGAAGAGCATAATTTCTGCATATCC
ATCCTCAATACCATGGTAAATTCTGGGGCAATAGAGAAGCAACAGAACTGCCACAAAGTATACCTCAATATAATTCCTCTAGTTCTGCTT
CTAAAATCTGAGGACAGTGCTAGTGGGAAAATAATTTTCAAACTACCTGGTTAACCAAAATACAAAAGCAGCTGACTATGTGTGATTTCA
TAATAGCACATTTCTTGACACTTAGTGCTAGAAATGAAGATTTGGATTTTCCTAACAACTTACATCAAGAATGTAGTGTAGCTCATTATT
GAGAATTTAGGAAAGCCTGAATCCATTAATTAAGGAAATAAATGTGACTCACATTTCTTTTACTGTGACACAATAATGTGATCCTAAAAC
TGGCTTATCCTTGAGTGTTTACAACTCAAACAACTTTTTGAATGCAGTAGTTTTTTTTTTTTAAAAACAAACTTTTATGTCAAATTTTTT
TTCTTAGAAGTAGTCTTCATTATTATAAATTTGTACACCAAAAGGCCATGGGGAACTTTGTGCAAGTACCTCATCGCTGAGCAAATGGAG
CTTGCTATGTTTTAATTTCAGAAAATTTCCTCATATACGTAGTGTGTAGAATCAAGTCTTTTAATAATTCATTTTTTCTTCATAATATTT
ACTCAAAGTTAAGCTTAAAAATAAGTTTTATCTTAAAATCATATTTGAAGACAGTAAGACAGTAAACTATTTTAGGAAGTCAACCCCCAT
TGCACTCTGTGGCAGTTATTCTGGTAAAAATAGGCAAAAGTGACCTGAATCTACAATGATGTCCCAAAGTAACCAAGTAAGAGAGATTGT
AAATGATAAACCGAGCTTTAAAGGATAAAGTGTTAATAAAGAAAGGAAGCTGGGCACATGTCAAAAAGGGAGATCGAAATGTTAGGTAAT
CATTTAGAAAGGACAGAAAATATTTAAAGTGGCTCATAGGTAATGAATATTTCTGACTTAGATGTAAATCCATCTGGAATCTTTACATCC
TTTGCCAGCTGAAACAAGAAAGTGAAGGGACAATGATATTTCATGGTCAGTTTATTTTGTAAGAGACAGAAGAAATTATATCTATACATT
ACCTTGTAGCAGCAGTACCTGGAAGCCCCAGCCCGTCACAGAAGTGTGGAGGGGGGCTCCTGACTAGACAATTTCCCTAGCCCTTGTGAT
TTGAAGCATGAAAGTTCTGGCAGGTTATGAGCAGCACTAGGGATAAAGTATGGTTTTATTTTGGTGTAATTTAGGTTTTTCAACAAAGCC
CTTGTCTAAAATAAAAGGCATTATTGGAAATATTTGAAAACTAGAAAATGATGGATAAAAGGGCTGATAAGAAAATTTCTGACTGTCAGT
AGAAGTGAGATAAGATCCTCAGAGGAAACAGTAAGAAGGGATAATCATTAAGATAGTAAAACAGGCAAAGCAGAATCACATGTGCACACA
CACATACACATGTAAACATTGGAATGCATAAGTTTTAATATTTTAGCGCTATCAGTTTCTAAATGCATTAATTACTAACTGCCCTCTCCC
AAGATTCATTTAGTTCAAACAGTATCCGTAAACTAGGAATAATGCCACATGCATTCAATGGGATCTTTTAAGTACTCTTCAGTTTGTTCC
AAGAAATGTGCCTACTGAAATCAAATTAATTTGTATTCAATGTGTACTTCAAGACTGCTAATTGTTTCATCTGAAAGCCTACAATGAATC

>37989_37989_1_HSPD1-NRK_HSPD1_chr2_198358883_ENST00000345042_NRK_chrX_105139466_ENST00000243300_length(amino acids)=1399AA_BP=
MLTHNAEVKLVDFGVSAQVSRTNGRRNSFIGTPYWMAPEVIDCDEDPRRSYDYRSDVWSVGITAIEMAEGAPPLCNLQPLEALFVILRES
APTVKSSGWSRKFHNFMEKCTIKNFLFRPTSANMLQHPFVRDIKNERHVVESLTRHLTGIIKKRQKKGIPLIFEREEAIKEQYTVRRFRG
PSCTHELLRLPTSSRCRPLRVLHGEPSQPRWLPDREEPQVQALQQLQGAARVFMPLQALDSAPKPLKGQAQAPQRLQGAARVFMPLQAQV
KAKASKPLQMQIKAPPRLRRAARVLMPLQAQVRAPRLLQVQSQVSKKQQAQTQTSEPQDLDQVPEEFQGQDQVPEQQRQGQAPEQQQRHN
QVPEQELEQNQAPEQPEVQEQAAEPAQAETEAEEPESLRVNAQVFLPLLSQDHHVLLPLHLDTQVLIPVEGQTEGSPQAQAWTLEPPQAI
GSVQALIEGLSRDLLRAPNSNNSKPLGPLQTLMENLSSNRFYSQPEQAREKKSKVSTLRQALAKRLSPKRFRAKSSWRPEKLELSDLEAR
RQRRQRRWEDIFNQHEEELRQVDKDKEDESSDNDEVFHSIQAEVQIEPLKPYISNPKKIEVQERSPSVPNNQDHAHHVKFSSSVPQRSLL
EQAQKPIDIRQRSSQNRQNWLAASESSSEEESPVTGRRSQSSPPYSTIDQKLLVDIHVPDGFKVGKISPPVYLTNEWVGYNALSEIFRND
WLTPAPVIQPPEEDGDYVELYDASADTDGDDDDESNDTFEDTYDHANGNDDLDNQVDQANDVCKDHDDDNNKFVDDVNNNYYEAPSCPRA
SYGRDGSCKQDGYDGSRGKEEAYRGYGSHTANRSHGGSAASEDNAAIGDQEEHAANIGSERRGSEGDGGKGVVRTSEESGALGLNGEENC
SETDGPGLKRPASQDFEYLQEEPGGGNEASNAIDSGAAPSAPDHESDNKDISESSTQSDFSANHSSPSKGSGMSADANFASAILYAGFVE
VPEESPKQPSEVNVNPLYVSPACKKPLIHMYEKEFTSEICCGSLWGVNLLLGTRSNLYLMDRSGKADITKLIRRRPFRQIQVLEPLNLLI
TISGHKNRLRVYHLTWLRNKILNNDPESKRRQEEMLKTEEACKAIDKLTGCEHFSVLQHEETTYIAIALKSSIHLYAWAPKSFDESTAIK
VCIDQSADSEGDYMSYQAYIRILAKIQAADPVNRFKRPDELLHLLKLKVFPTLDHKPVTVDLAIGSEKRLKIFFSSADGYHLIDAESEVM
SDVTLPKNPLEIIIPQNIIILPDCLGIGMMLTFNAEALSVEANEQLFKKILEMWKDIPSSIAFECTQRTTGWGQKAIEVRSLQSRVLESE

--------------------------------------------------------------
>37989_37989_2_HSPD1-NRK_HSPD1_chr2_198358883_ENST00000345042_NRK_chrX_105139466_ENST00000428173_length(transcript)=8048nt_BP=817nt
GCCCCGACGCGCACCGCGATTCGCCCCAAGGGCCCTGCGCAGGACGCTGACGCGAAGACTCGGAGGCGGAAGAAAAAAGGAGCTGTTTCT
AGGCTTTTCTAGGCGCCCAGCCGAGAAATGCTTCGGTTACCCACAGTCTTTCGCCAGATGAGACCGGTGTCCAGGGTACTGGCTCCTCAT
CTCACTCGGGCTTATGCCAAAGATGTAAAATTTGGTGCAGATGCCCGAGCCTTAATGCTTCAAGGTGTAGACCTTTTAGCCGATGCTGTG
GCCGTTACAATGGGGCCAAAGGGAAGAACAGTGATTATTGAGCAGAGTTGGGGAAGTCCCAAAGTAACAAAAGATGGTGTGACTGTTGCA
AAGTCAATTGACTTAAAAGATAAATACAAAAACATTGGAGCTAAACTTGTTCAAGATGTTGCCAATAACACAAATGAAGAAGCTGGGGAT
GGCACTACCACTGCTACTGTACTGGCACGCTCTATAGCCAAGGAAGGCTTCGAGAAGATTAGCAAAGGTGCTAATCCAGTGGAAATCAGG
AGAGGTGTGATGTTAGCTGTTGATGCTGTAATTGCTGAACTTAAAAAGCAGTCTAAACCTGTGACCACCCCTGAAGAAATTGCACAGGTT
GCTACGATTTCTGCAAACGGAGACAAAGAAATTGGCAATATCATCTCTGATGCAATGAAAAAAGTTGGAAGAAAGGGTGTCATCACAGTA
AAGGATGGAAAAACACTGAATGATGAATTAGAAATTATTGAAGGCATGAAGTTTGATCGAGGCTATATTTCTCCATACTTTATTAATACA
TCAAAAGCATCAAAGGTCAGAATGTGCTGCTGACTCATAATGCTGAAGTAAAACTGGTTGATTTTGGAGTGAGTGCCCAGGTGAGCAGAA
CTAATGGAAGAAGGAATAGTTTCATTGGGACACCATACTGGATGGCACCTGAGGTGATTGACTGTGATGAGGACCCAAGACGCTCCTATG
ATTACAGAAGTGATGTGTGGTCTGTGGGAATTACTGCCATTGAAATGGCTGAAGGAGCCCCTCCTCTGTGTAACCTTCAACCCTTGGAAG
CTCTCTTCGTTATTTTGCGGGAATCTGCTCCCACAGTCAAATCCAGCGGATGGTCCCGTAAGTTCCACAATTTCATGGAAAAGTGTACGA
TAAAAAATTTCCTGTTTCGTCCTACTTCTGCAAACATGCTTCAACACCCATTTGTTCGGGATATAAAAAATGAACGACATGTTGTTGAGT
CATTAACAAGGCATCTTACTGGAATCATTAAAAAAAGACAGAAAAAAAAAGGAATACCTTTGATCTTTGAAAGAGAAGAAGCTATTAAGG
AACAGTACACCGTGAGAAGATTCAGAGGACCCTCTTGCACTCACGAGCTTCTGAGATTGCCAACCAGCAGCAGATGCAGACCACTTAGAG
TCCTGCATGGGGAACCCTCTCAGCCAAGGTGGCTACCTGATCGAGAAGAGCCACAGGTCCAGGCACTTCAGCAGCTACAGGGAGCAGCCA
GGGTATTCATGCCACTGCAGGCTCTGGACAGTGCACCTAAGCCTCTAAAGGGGCAGGCTCAGGCACCTCAACGACTACAAGGGGCAGCTC
GGGTGTTCATGCCACTACAGGCTCAGGTGAAGGCTAAAGCCTCTAAACCTCTACAAATGCAGATTAAGGCACCTCCACGACTACGGAGGG
CAGCCAGGGTGCTCATGCCACTACAGGCACAGGTTAGGGCACCTAGGCTTCTGCAGGTACAGTCCCAGGTATCCAAAAAGCAGCAGGCCC
AGACCCAGACATCAGAACCACAAGATTTGGACCAGGTACCAGAGGAATTTCAGGGTCAAGATCAGGTACCCGAACAACAAAGGCAGGGCC
AGGCCCCTGAACAACAGCAGAGGCACAACCAGGTGCCTGAACAAGAGCTGGAGCAGAACCAGGCACCTGAACAGCCAGAGGTACAGGAAC
AGGCTGCCGAGCCTGCACAGGCAGAGACTGAGGCAGAGGAACCTGAGTCATTACGAGTAAATGCCCAGGTATTTCTGCCCCTGCTATCAC
AAGATCACCATGTGCTGTTGCCACTACATTTGGATACTCAGGTGCTCATTCCAGTAGAGGGGCAAACTGAAGGATCACCTCAGGCACAGG
CTTGGACACTAGAACCCCCACAGGCAATTGGCTCAGTTCAAGCACTGATAGAGGGACTATCAAGAGACTTGCTTCGGGCACCAAACTCAA
ATAACTCAAAGCCACTTGGTCCGTTGCAAACCCTGATGGAAAATCTGTCATCAAATAGGTTTTACTCACAACCAGAACAGGCACGGGAGA
AAAAATCAAAAGTTTCTACTCTGAGGCAAGCACTGGCAAAAAGACTATCACCAAAGAGGTTCAGGGCAAAGTCATCATGGAGACCTGAAA
AGCTTGAACTCTCGGATTTAGAAGCCCGCAGGCAAAGGCGCCAACGCAGATGGGAAGATATCTTTAATCAGCATGAGGAAGAATTGAGAC
AAGTTGATAAAGACAAAGAAGATGAATCATCAGACAATGATGAAGTATTTCATTCGATTCAGGCTGAAGTCCAGATAGAGCCATTGAAGC
CATACATTTCAAATCCTAAAAAAATTGAGGTTCAAGAGAGATCTCCTTCTGTGCCTAACAACCAGGATCATGCACATCATGTCAAGTTCT
CTTCAAGCGTTCCTCAGCGGTCTCTTTTGGAACAAGCTCAGAAGCCCATTGACATCAGACAAAGGAGTTCGCAAAATCGTCAAAATTGGC
TGGCAGCATCAGAATCTTCTTCTGAGGAAGAAAGTCCTGTGACTGGAAGGAGGTCTCAGTCATCACCACCTTATTCTACTATTGATCAGA
AGTTGCTGGTTGACATCCATGTTCCAGATGGATTTAAAGTAGGAAAAATATCACCCCCTGTATACTTGACAAACGAATGGGTAGGCTATA
ATGCACTCTCTGAAATCTTCCGGAATGATTGGTTAACTCCGGCACCTGTCATTCAGCCACCTGAAGAGGATGGTGATTATGTTGAACTCT
ATGATGCCAGTGCTGATACTGATGGTGATGATGATGATGAGTCTAATGATACTTTTGAAGATACCTATGATCATGCCAATGGCAATGATG
ACTTGGATAACCAGGTTGATCAGGCTAATGATGTTTGTAAAGACCATGATGATGACAACAATAAGTTTGTTGATGATGTAAATAATAATT
ATTATGAGGCGCCTAGTTGTCCAAGGGCAAGCTATGGCAGAGATGGAAGCTGCAAGCAAGATGGTTATGATGGAAGTCGTGGAAAAGAGG
AAGCCTACAGAGGCTATGGAAGCCATACAGCCAATAGAAGCCATGGAGGAAGTGCAGCCAGTGAGGACAATGCAGCCATTGGAGATCAGG
AAGAACATGCAGCCAATATAGGCAGTGAAAGAAGAGGCAGTGAGGGTGATGGAGGTAAGGGAGTCGTTCGAACCAGTGAAGAGAGTGGAG
CCCTTGGACTCAATGGAGAAGAAAATTGCTCAGAGACAGATGGTCCAGGATTGAAGAGACCTGCGTCTCAGGACTTTGAATATCTACAGG
AGGAGCCAGGTGGTGGAAATGAGGCCTCAAATGCCATTGACTCAGGTGCTGCACCGTCAGCACCTGATCATGAGAGTGACAATAAGGACA
TATCAGAATCATCAACACAATCAGATTTTTCTGCCAATCACTCATCTCCTTCCAAAGGTTCTGGGATGTCTGCTGATGCTAACTTTGCCA
GTGCCATCTTATACGCTGGATTCGTAGAAGTACCTGAGGAATCACCTAAGCAACCCTCTGAAGTCAATGTTAACCCACTCTATGTCTCTC
CTGCATGTAAAAAACCACTAATCCACATGTATGAAAAGGAGTTCACTTCTGAGATCTGCTGTGGTTCTTTGTGGGGAGTCAATTTGCTGT
TGGGAACCCGATCTAATCTATATCTGATGGACAGAAGTGGAAAGGCTGACATTACTAAACTTATAAGGCGAAGACCATTCCGCCAGATTC
AAGTCTTAGAGCCACTCAATTTGCTGATTACCATCTCAGGTCATAAGAACAGACTTCGGGTGTATCATCTGACCTGGTTGAGGAACAAGA
TTTTGAATAATGATCCAGAAAGTAAAAGAAGGCAAGAAGAAATGCTGAAGACAGAGGAAGCCTGCAAAGCTATTGATAAGTTAACAGGCT
GTGAACACTTCAGTGTCCTCCAACATGAAGAAACAACATATATTGCAATTGCTTTGAAATCATCAATTCACCTTTATGCATGGGCACCAA
AGTCCTTTGATGAAAGCACTGCTATTAAAGTATGCATTGATCAATCAGCAGACTCTGAAGGAGACTACATGTCCTATCAAGCCTATATAC
GAATACTGGCAAAAATACAGGCAGCTGATCCAGTGAACCGGTTTAAGAGACCAGATGAGCTCCTTCATTTGCTGAAGCTCAAGGTATTTC
CAACACTTGATCATAAGCCAGTGACAGTTGACCTGGCTATTGGTTCTGAAAAAAGACTAAAGATTTTCTTCAGCTCAGCAGATGGATATC
ACCTCATCGATGCAGAATCTGAGGTTATGTCTGATGTGACCCTGCCAAAGAATCCCCTGGAAATCATTATACCACAGAATATCATCATTT
TACCTGATTGCTTGGGAATTGGCATGATGCTCACCTTCAATGCTGAAGCCCTCTCTGTGGAAGCAAATGAACAACTCTTCAAGAAGATCC
TTGAAATGTGGAAAGACATACCATCTTCTATAGCTTTTGAATGTACACAGCGAACCACAGGATGGGGCCAAAAGGCCATTGAAGTGCGCT
CTTTGCAATCCAGGGTTCTGGAAAGTGAGCTGAAGCGCAGGTCAATTAAGAAGCTGAGATTCCTGTGCACCCGGGGTGACAAGCTGTTCT
TTACCTCTACCCTGCGCAATCACCACAGCCGGGTTTACTTCATGACACTTGGAAAACTTGAAGAGCTCCAAAGCAATTATGATGTCTAAA
AGTTTCCAGTGATTTATTACCACATTATAAACATCATGTATAGGCAGTCTGCATCTTCAGATTTCAGAGATTAAATGAGTATTCAGTTTT
ATTTTTAGTAAAGATTAAATCCAAAACTTTACTTTTAATGTAGCACAGAATAGTTTTAATGAGAAATGCAGCTTTATGTATAAAATTAAC
TATAGCAAGCTCTAGGTACTCCAATGGTGTACAATGTCTTTTGCACAAACTTTGTAACTTTTGTTACTGTGAATTCAAACATTACTCTTT
GGACAGTTTGGACAGTATCTGTATTCAGATTTTACAACATGGAGTAAAGAAACCTGTTATGAATTAGATTACAAGCAGCCTTCAAAAGAA
TTGGCACTGGGATAAGATTTTTCAGAAAAAGAAAAACATCGGCAAACTGTGTGTGATTTTTCCAAAGCTATATAAAGAACCAAAGGTTTA
GTCAAGAAACAAAAATCTTAAAGATTATTATAACCCAGACTAAGGTTGAACAACCTGCATGCCCAGAGAAAACTATGGCGACAAAGGGGA
AAAGGCCACCACTCGTTTTCTCACTGATTCATGCCAATTAAGCCTACAGTTAAAGACCAGTTTTGTTCTTTTCACCCATTTTTAAGCTGG
TTTTCTCCTGATAAGAAGAAAGGAAGAAAGCCCCAGACGCTTGGTTTTTCTCAGAACCCCCAAAAGATGTGCAATAGCTGTTGTTACAAA
CCACCAAATAATACAGTTGTGAGCCTGAATACAGGACTGAACTCCTATACACGTGTACTGTAGAATGAGTATTTTTTAATACCTTAAGGT
AGGCGTCAAATTCTACTCCCCAAAGCAGAGATAGATTGATTTATCAAAATTATTATCTGGCCAACAGTGTGACTATCAGACAGCATCAAA
TATTTGCCCAATCCAAGATTAGACTACACAAAAGCTTCCTTCCAGTATTAAACAAAAAGAATTAAACATAACTATGAAAAAACTTTGCTA
ATATCTGTGTTTTTCAGATTTCATTTTTTGTAAAATCAGAAATTAATCTAAACATATTCAGTGATAAGTTCATGTGTAACGACTTAATGT
TAAAGGTTAAAAAAAAGATTTCACAAAATATACAACTTTCACCATATATATAAGCCTGCAAAATTAGAGTAGTGAAAGTCATGCTAGTCC
ATCACCCAAATATGTTATAGACGCCATAGACAGGTGATGTTTGGTCACCTATGGTAACTGCTACCTGATGAAGAGCATAATTTCTGCATA
TCCATCCTCAATACCATGGTAAATTCTGGGGCAATAGAGAAGCAACAGAACTGCCACAAAGTATACCTCAATATAATTCCTCTAGTTCTG
CTTCTAAAATCTGAGGACAGTGCTAGTGGGAAAATAATTTTCAAACTACCTGGTTAACCAAAATACAAAAGCAGCTGACTATGTGTGATT
TCATAATAGCACATTTCTTGACACTTAGTGCTAGAAATGAAGATTTGGATTTTCCTAACAACTTACATCAAGAATGTAGTGTAGCTCATT
ATTGAGAATTTAGGAAAGCCTGAATCCATTAATTAAGGAAATAAATGTGACTCACATTTCTTTTACTGTGACACAATAATGTGATCCTAA
AACTGGCTTATCCTTGAGTGTTTACAACTCAAACAACTTTTTGAATGCAGTAGTTTTTTTTTTTTAAAAACAAACTTTTATGTCAAATTT
TTTTTCTTAGAAGTAGTCTTCATTATTATAAATTTGTACACCAAAAGGCCATGGGGAACTTTGTGCAAGTACCTCATCGCTGAGCAAATG
GAGCTTGCTATGTTTTAATTTCAGAAAATTTCCTCATATACGTAGTGTGTAGAATCAAGTCTTTTAATAATTCATTTTTTCTTCATAATA
TTTACTCAAAGTTAAGCTTAAAAATAAGTTTTATCTTAAAATCATATTTGAAGACAGTAAGACAGTAAACTATTTTAGGAAGTCAACCCC
CATTGCACTCTGTGGCAGTTATTCTGGTAAAAATAGGCAAAAGTGACCTGAATCTACAATGATGTCCCAAAGTAACCAAGTAAGAGAGAT
TGTAAATGATAAACCGAGCTTTAAAGGATAAAGTGTTAATAAAGAAAGGAAGCTGGGCACATGTCAAAAAGGGAGATCGAAATGTTAGGT
AATCATTTAGAAAGGACAGAAAATATTTAAAGTGGCTCATAGGTAATGAATATTTCTGACTTAGATGTAAATCCATCTGGAATCTTTACA
TCCTTTGCCAGCTGAAACAAGAAAGTGAAGGGACAATGATATTTCATGGTCAGTTTATTTTGTAAGAGACAGAAGAAATTATATCTATAC
ATTACCTTGTAGCAGCAGTACCTGGAAGCCCCAGCCCGTCACAGAAGTGTGGAGGGGGGCTCCTGACTAGACAATTTCCCTAGCCCTTGT
GATTTGAAGCATGAAAGTTCTGGCAGGTTATGAGCAGCACTAGGGATAAAGTATGGTTTTATTTTGGTGTAATTTAGGTTTTTCAACAAA
GCCCTTGTCTAAAATAAAAGGCATTATTGGAAATATTTGAAAACTAGAAAATGATGGATAAAAGGGCTGATAAGAAAATTTCTGACTGTC
AGTAGAAGTGAGATAAGATCCTCAGAGGAAACAGTAAGAAGGGATAATCATTAAGATAGTAAAACAGGCAAAGCAGAATCACATGTGCAC
ACACACATACACATGTAAACATTGGAATGCATAAGTTTTAATATTTTAGCGCTATCAGTTTCTAAATGCATTAATTACTAACTGCCCTCT
CCCAAGATTCATTTAGTTCAAACAGTATCCGTAAACTAGGAATAATGCCACATGCATTCAATGGGATCTTTTAAGTACTCTTCAGTTTGT
TCCAAGAAATGTGCCTACTGAAATCAAATTAATTTGTATTCAATGTGTACTTCAAGACTGCTAATTGTTTCATCTGAAAGCCTACAATGA

>37989_37989_2_HSPD1-NRK_HSPD1_chr2_198358883_ENST00000345042_NRK_chrX_105139466_ENST00000428173_length(amino acids)=1400AA_BP=
MLTHNAEVKLVDFGVSAQVSRTNGRRNSFIGTPYWMAPEVIDCDEDPRRSYDYRSDVWSVGITAIEMAEGAPPLCNLQPLEALFVILRES
APTVKSSGWSRKFHNFMEKCTIKNFLFRPTSANMLQHPFVRDIKNERHVVESLTRHLTGIIKKRQKKKGIPLIFEREEAIKEQYTVRRFR
GPSCTHELLRLPTSSRCRPLRVLHGEPSQPRWLPDREEPQVQALQQLQGAARVFMPLQALDSAPKPLKGQAQAPQRLQGAARVFMPLQAQ
VKAKASKPLQMQIKAPPRLRRAARVLMPLQAQVRAPRLLQVQSQVSKKQQAQTQTSEPQDLDQVPEEFQGQDQVPEQQRQGQAPEQQQRH
NQVPEQELEQNQAPEQPEVQEQAAEPAQAETEAEEPESLRVNAQVFLPLLSQDHHVLLPLHLDTQVLIPVEGQTEGSPQAQAWTLEPPQA
IGSVQALIEGLSRDLLRAPNSNNSKPLGPLQTLMENLSSNRFYSQPEQAREKKSKVSTLRQALAKRLSPKRFRAKSSWRPEKLELSDLEA
RRQRRQRRWEDIFNQHEEELRQVDKDKEDESSDNDEVFHSIQAEVQIEPLKPYISNPKKIEVQERSPSVPNNQDHAHHVKFSSSVPQRSL
LEQAQKPIDIRQRSSQNRQNWLAASESSSEEESPVTGRRSQSSPPYSTIDQKLLVDIHVPDGFKVGKISPPVYLTNEWVGYNALSEIFRN
DWLTPAPVIQPPEEDGDYVELYDASADTDGDDDDESNDTFEDTYDHANGNDDLDNQVDQANDVCKDHDDDNNKFVDDVNNNYYEAPSCPR
ASYGRDGSCKQDGYDGSRGKEEAYRGYGSHTANRSHGGSAASEDNAAIGDQEEHAANIGSERRGSEGDGGKGVVRTSEESGALGLNGEEN
CSETDGPGLKRPASQDFEYLQEEPGGGNEASNAIDSGAAPSAPDHESDNKDISESSTQSDFSANHSSPSKGSGMSADANFASAILYAGFV
EVPEESPKQPSEVNVNPLYVSPACKKPLIHMYEKEFTSEICCGSLWGVNLLLGTRSNLYLMDRSGKADITKLIRRRPFRQIQVLEPLNLL
ITISGHKNRLRVYHLTWLRNKILNNDPESKRRQEEMLKTEEACKAIDKLTGCEHFSVLQHEETTYIAIALKSSIHLYAWAPKSFDESTAI
KVCIDQSADSEGDYMSYQAYIRILAKIQAADPVNRFKRPDELLHLLKLKVFPTLDHKPVTVDLAIGSEKRLKIFFSSADGYHLIDAESEV
MSDVTLPKNPLEIIIPQNIIILPDCLGIGMMLTFNAEALSVEANEQLFKKILEMWKDIPSSIAFECTQRTTGWGQKAIEVRSLQSRVLES

--------------------------------------------------------------
>37989_37989_3_HSPD1-NRK_HSPD1_chr2_198358883_ENST00000388968_NRK_chrX_105139466_ENST00000243300_length(transcript)=8197nt_BP=968nt
AGACTAAGCCGGCCGGAGAGGGCTGAGCGCGCTAGCACACCCTGCGCGGGTAGGGAGGGGCGGGGCTCGCGCGCAGGGTGTGCAGATTGC
AGGGCCCGGGCTGACGGGAAGTGGGTGGGAGCTGCCTGCACACGCGGTGCCGCGGGGCGGGAGTAGAGGCGGAGGGAGGGGACACGGGCT
CATTGCGGTGTGCGCCCTGCACTCTGTCCCTCACTCGCCGCCGACGACCTGTCTCGCCGAGCGCACGCCTTGCCGCCGCCCCGCAGAAAT
GCTTCGGTTACCCACAGTCTTTCGCCAGATGAGACCGGTGTCCAGGGTACTGGCTCCTCATCTCACTCGGGCTTATGCCAAAGATGTAAA
ATTTGGTGCAGATGCCCGAGCCTTAATGCTTCAAGGTGTAGACCTTTTAGCCGATGCTGTGGCCGTTACAATGGGGCCAAAGGGAAGAAC
AGTGATTATTGAGCAGAGTTGGGGAAGTCCCAAAGTAACAAAAGATGGTGTGACTGTTGCAAAGTCAATTGACTTAAAAGATAAATACAA
AAACATTGGAGCTAAACTTGTTCAAGATGTTGCCAATAACACAAATGAAGAAGCTGGGGATGGCACTACCACTGCTACTGTACTGGCACG
CTCTATAGCCAAGGAAGGCTTCGAGAAGATTAGCAAAGGTGCTAATCCAGTGGAAATCAGGAGAGGTGTGATGTTAGCTGTTGATGCTGT
AATTGCTGAACTTAAAAAGCAGTCTAAACCTGTGACCACCCCTGAAGAAATTGCACAGGTTGCTACGATTTCTGCAAACGGAGACAAAGA
AATTGGCAATATCATCTCTGATGCAATGAAAAAAGTTGGAAGAAAGGGTGTCATCACAGTAAAGGATGGAAAAACACTGAATGATGAATT
AGAAATTATTGAAGGCATGAAGTTTGATCGAGGCTATATTTCTCCATACTTTATTAATACATCAAAAGCATCAAAGGTCAGAATGTGCTG
CTGACTCATAATGCTGAAGTAAAACTGGTTGATTTTGGAGTGAGTGCCCAGGTGAGCAGAACTAATGGAAGAAGGAATAGTTTCATTGGG
ACACCATACTGGATGGCACCTGAGGTGATTGACTGTGATGAGGACCCAAGACGCTCCTATGATTACAGAAGTGATGTGTGGTCTGTGGGA
ATTACTGCCATTGAAATGGCTGAAGGAGCCCCTCCTCTGTGTAACCTTCAACCCTTGGAAGCTCTCTTCGTTATTTTGCGGGAATCTGCT
CCCACAGTCAAATCCAGCGGATGGTCCCGTAAGTTCCACAATTTCATGGAAAAGTGTACGATAAAAAATTTCCTGTTTCGTCCTACTTCT
GCAAACATGCTTCAACACCCATTTGTTCGGGATATAAAAAATGAACGACATGTTGTTGAGTCATTAACAAGGCATCTTACTGGAATCATT
AAAAAAAGACAGAAAAAAGGAATACCTTTGATCTTTGAAAGAGAAGAAGCTATTAAGGAACAGTACACCGTGAGAAGATTCAGAGGACCC
TCTTGCACTCACGAGCTTCTGAGATTGCCAACCAGCAGCAGATGCAGACCACTTAGAGTCCTGCATGGGGAACCCTCTCAGCCAAGGTGG
CTACCTGATCGAGAAGAGCCACAGGTCCAGGCACTTCAGCAGCTACAGGGAGCAGCCAGGGTATTCATGCCACTGCAGGCTCTGGACAGT
GCACCTAAGCCTCTAAAGGGGCAGGCTCAGGCACCTCAACGACTACAAGGGGCAGCTCGGGTGTTCATGCCACTACAGGCTCAGGTGAAG
GCTAAAGCCTCTAAACCTCTACAAATGCAGATTAAGGCACCTCCACGACTACGGAGGGCAGCCAGGGTGCTCATGCCACTACAGGCACAG
GTTAGGGCACCTAGGCTTCTGCAGGTACAGTCCCAGGTATCCAAAAAGCAGCAGGCCCAGACCCAGACATCAGAACCACAAGATTTGGAC
CAGGTACCAGAGGAATTTCAGGGTCAAGATCAGGTACCCGAACAACAAAGGCAGGGCCAGGCCCCTGAACAACAGCAGAGGCACAACCAG
GTGCCTGAACAAGAGCTGGAGCAGAACCAGGCACCTGAACAGCCAGAGGTACAGGAACAGGCTGCCGAGCCTGCACAGGCAGAGACTGAG
GCAGAGGAACCTGAGTCATTACGAGTAAATGCCCAGGTATTTCTGCCCCTGCTATCACAAGATCACCATGTGCTGTTGCCACTACATTTG
GATACTCAGGTGCTCATTCCAGTAGAGGGGCAAACTGAAGGATCACCTCAGGCACAGGCTTGGACACTAGAACCCCCACAGGCAATTGGC
TCAGTTCAAGCACTGATAGAGGGACTATCAAGAGACTTGCTTCGGGCACCAAACTCAAATAACTCAAAGCCACTTGGTCCGTTGCAAACC
CTGATGGAAAATCTGTCATCAAATAGGTTTTACTCACAACCAGAACAGGCACGGGAGAAAAAATCAAAAGTTTCTACTCTGAGGCAAGCA
CTGGCAAAAAGACTATCACCAAAGAGGTTCAGGGCAAAGTCATCATGGAGACCTGAAAAGCTTGAACTCTCGGATTTAGAAGCCCGCAGG
CAAAGGCGCCAACGCAGATGGGAAGATATCTTTAATCAGCATGAGGAAGAATTGAGACAAGTTGATAAAGACAAAGAAGATGAATCATCA
GACAATGATGAAGTATTTCATTCGATTCAGGCTGAAGTCCAGATAGAGCCATTGAAGCCATACATTTCAAATCCTAAAAAAATTGAGGTT
CAAGAGAGATCTCCTTCTGTGCCTAACAACCAGGATCATGCACATCATGTCAAGTTCTCTTCAAGCGTTCCTCAGCGGTCTCTTTTGGAA
CAAGCTCAGAAGCCCATTGACATCAGACAAAGGAGTTCGCAAAATCGTCAAAATTGGCTGGCAGCATCAGAATCTTCTTCTGAGGAAGAA
AGTCCTGTGACTGGAAGGAGGTCTCAGTCATCACCACCTTATTCTACTATTGATCAGAAGTTGCTGGTTGACATCCATGTTCCAGATGGA
TTTAAAGTAGGAAAAATATCACCCCCTGTATACTTGACAAACGAATGGGTAGGCTATAATGCACTCTCTGAAATCTTCCGGAATGATTGG
TTAACTCCGGCACCTGTCATTCAGCCACCTGAAGAGGATGGTGATTATGTTGAACTCTATGATGCCAGTGCTGATACTGATGGTGATGAT
GATGATGAGTCTAATGATACTTTTGAAGATACCTATGATCATGCCAATGGCAATGATGACTTGGATAACCAGGTTGATCAGGCTAATGAT
GTTTGTAAAGACCATGATGATGACAACAATAAGTTTGTTGATGATGTAAATAATAATTATTATGAGGCGCCTAGTTGTCCAAGGGCAAGC
TATGGCAGAGATGGAAGCTGCAAGCAAGATGGTTATGATGGAAGTCGTGGAAAAGAGGAAGCCTACAGAGGCTATGGAAGCCATACAGCC
AATAGAAGCCATGGAGGAAGTGCAGCCAGTGAGGACAATGCAGCCATTGGAGATCAGGAAGAACATGCAGCCAATATAGGCAGTGAAAGA
AGAGGCAGTGAGGGTGATGGAGGTAAGGGAGTCGTTCGAACCAGTGAAGAGAGTGGAGCCCTTGGACTCAATGGAGAAGAAAATTGCTCA
GAGACAGATGGTCCAGGATTGAAGAGACCTGCGTCTCAGGACTTTGAATATCTACAGGAGGAGCCAGGTGGTGGAAATGAGGCCTCAAAT
GCCATTGACTCAGGTGCTGCACCGTCAGCACCTGATCATGAGAGTGACAATAAGGACATATCAGAATCATCAACACAATCAGATTTTTCT
GCCAATCACTCATCTCCTTCCAAAGGTTCTGGGATGTCTGCTGATGCTAACTTTGCCAGTGCCATCTTATACGCTGGATTCGTAGAAGTA
CCTGAGGAATCACCTAAGCAACCCTCTGAAGTCAATGTTAACCCACTCTATGTCTCTCCTGCATGTAAAAAACCACTAATCCACATGTAT
GAAAAGGAGTTCACTTCTGAGATCTGCTGTGGTTCTTTGTGGGGAGTCAATTTGCTGTTGGGAACCCGATCTAATCTATATCTGATGGAC
AGAAGTGGAAAGGCTGACATTACTAAACTTATAAGGCGAAGACCATTCCGCCAGATTCAAGTCTTAGAGCCACTCAATTTGCTGATTACC
ATCTCAGGTCATAAGAACAGACTTCGGGTGTATCATCTGACCTGGTTGAGGAACAAGATTTTGAATAATGATCCAGAAAGTAAAAGAAGG
CAAGAAGAAATGCTGAAGACAGAGGAAGCCTGCAAAGCTATTGATAAGTTAACAGGCTGTGAACACTTCAGTGTCCTCCAACATGAAGAA
ACAACATATATTGCAATTGCTTTGAAATCATCAATTCACCTTTATGCATGGGCACCAAAGTCCTTTGATGAAAGCACTGCTATTAAAGTA
TGCATTGATCAATCAGCAGACTCTGAAGGAGACTACATGTCCTATCAAGCCTATATACGAATACTGGCAAAAATACAGGCAGCTGATCCA
GTGAACCGGTTTAAGAGACCAGATGAGCTCCTTCATTTGCTGAAGCTCAAGGTATTTCCAACACTTGATCATAAGCCAGTGACAGTTGAC
CTGGCTATTGGTTCTGAAAAAAGACTAAAGATTTTCTTCAGCTCAGCAGATGGATATCACCTCATCGATGCAGAATCTGAGGTTATGTCT
GATGTGACCCTGCCAAAGAATCCCCTGGAAATCATTATACCACAGAATATCATCATTTTACCTGATTGCTTGGGAATTGGCATGATGCTC
ACCTTCAATGCTGAAGCCCTCTCTGTGGAAGCAAATGAACAACTCTTCAAGAAGATCCTTGAAATGTGGAAAGACATACCATCTTCTATA
GCTTTTGAATGTACACAGCGAACCACAGGATGGGGCCAAAAGGCCATTGAAGTGCGCTCTTTGCAATCCAGGGTTCTGGAAAGTGAGCTG
AAGCGCAGGTCAATTAAGAAGCTGAGATTCCTGTGCACCCGGGGTGACAAGCTGTTCTTTACCTCTACCCTGCGCAATCACCACAGCCGG
GTTTACTTCATGACACTTGGAAAACTTGAAGAGCTCCAAAGCAATTATGATGTCTAAAAGTTTCCAGTGATTTATTACCACATTATAAAC
ATCATGTATAGGCAGTCTGCATCTTCAGATTTCAGAGATTAAATGAGTATTCAGTTTTATTTTTAGTAAAGATTAAATCCAAAACTTTAC
TTTTAATGTAGCACAGAATAGTTTTAATGAGAAATGCAGCTTTATGTATAAAATTAACTATAGCAAGCTCTAGGTACTCCAATGGTGTAC
AATGTCTTTTGCACAAACTTTGTAACTTTTGTTACTGTGAATTCAAACATTACTCTTTGGACAGTTTGGACAGTATCTGTATTCAGATTT
TACAACATGGAGTAAAGAAACCTGTTATGAATTAGATTACAAGCAGCCTTCAAAAGAATTGGCACTGGGATAAGATTTTTCAGAAAAAGA
AAAACATCGGCAAACTGTGTGTGATTTTTCCAAAGCTATATAAAGAACCAAAGGTTTAGTCAAGAAACAAAAATCTTAAAGATTATTATA
ACCCAGACTAAGGTTGAACAACCTGCATGCCCAGAGAAAACTATGGCGACAAAGGGGAAAAGGCCACCACTCGTTTTCTCACTGATTCAT
GCCAATTAAGCCTACAGTTAAAGACCAGTTTTGTTCTTTTCACCCATTTTTAAGCTGGTTTTCTCCTGATAAGAAGAAAGGAAGAAAGCC
CCAGACGCTTGGTTTTTCTCAGAACCCCCAAAAGATGTGCAATAGCTGTTGTTACAAACCACCAAATAATACAGTTGTGAGCCTGAATAC
AGGACTGAACTCCTATACACGTGTACTGTAGAATGAGTATTTTTTAATACCTTAAGGTAGGCGTCAAATTCTACTCCCCAAAGCAGAGAT
AGATTGATTTATCAAAATTATTATCTGGCCAACAGTGTGACTATCAGACAGCATCAAATATTTGCCCAATCCAAGATTAGACTACACAAA
AGCTTCCTTCCAGTATTAAACAAAAAGAATTAAACATAACTATGAAAAAACTTTGCTAATATCTGTGTTTTTCAGATTTCATTTTTTGTA
AAATCAGAAATTAATCTAAACATATTCAGTGATAAGTTCATGTGTAACGACTTAATGTTAAAGGTTAAAAAAAAGATTTCACAAAATATA
CAACTTTCACCATATATATAAGCCTGCAAAATTAGAGTAGTGAAAGTCATGCTAGTCCATCACCCAAATATGTTATAGACGCCATAGACA
GGTGATGTTTGGTCACCTATGGTAACTGCTACCTGATGAAGAGCATAATTTCTGCATATCCATCCTCAATACCATGGTAAATTCTGGGGC
AATAGAGAAGCAACAGAACTGCCACAAAGTATACCTCAATATAATTCCTCTAGTTCTGCTTCTAAAATCTGAGGACAGTGCTAGTGGGAA
AATAATTTTCAAACTACCTGGTTAACCAAAATACAAAAGCAGCTGACTATGTGTGATTTCATAATAGCACATTTCTTGACACTTAGTGCT
AGAAATGAAGATTTGGATTTTCCTAACAACTTACATCAAGAATGTAGTGTAGCTCATTATTGAGAATTTAGGAAAGCCTGAATCCATTAA
TTAAGGAAATAAATGTGACTCACATTTCTTTTACTGTGACACAATAATGTGATCCTAAAACTGGCTTATCCTTGAGTGTTTACAACTCAA
ACAACTTTTTGAATGCAGTAGTTTTTTTTTTTTAAAAACAAACTTTTATGTCAAATTTTTTTTCTTAGAAGTAGTCTTCATTATTATAAA
TTTGTACACCAAAAGGCCATGGGGAACTTTGTGCAAGTACCTCATCGCTGAGCAAATGGAGCTTGCTATGTTTTAATTTCAGAAAATTTC
CTCATATACGTAGTGTGTAGAATCAAGTCTTTTAATAATTCATTTTTTCTTCATAATATTTACTCAAAGTTAAGCTTAAAAATAAGTTTT
ATCTTAAAATCATATTTGAAGACAGTAAGACAGTAAACTATTTTAGGAAGTCAACCCCCATTGCACTCTGTGGCAGTTATTCTGGTAAAA
ATAGGCAAAAGTGACCTGAATCTACAATGATGTCCCAAAGTAACCAAGTAAGAGAGATTGTAAATGATAAACCGAGCTTTAAAGGATAAA
GTGTTAATAAAGAAAGGAAGCTGGGCACATGTCAAAAAGGGAGATCGAAATGTTAGGTAATCATTTAGAAAGGACAGAAAATATTTAAAG
TGGCTCATAGGTAATGAATATTTCTGACTTAGATGTAAATCCATCTGGAATCTTTACATCCTTTGCCAGCTGAAACAAGAAAGTGAAGGG
ACAATGATATTTCATGGTCAGTTTATTTTGTAAGAGACAGAAGAAATTATATCTATACATTACCTTGTAGCAGCAGTACCTGGAAGCCCC
AGCCCGTCACAGAAGTGTGGAGGGGGGCTCCTGACTAGACAATTTCCCTAGCCCTTGTGATTTGAAGCATGAAAGTTCTGGCAGGTTATG
AGCAGCACTAGGGATAAAGTATGGTTTTATTTTGGTGTAATTTAGGTTTTTCAACAAAGCCCTTGTCTAAAATAAAAGGCATTATTGGAA
ATATTTGAAAACTAGAAAATGATGGATAAAAGGGCTGATAAGAAAATTTCTGACTGTCAGTAGAAGTGAGATAAGATCCTCAGAGGAAAC
AGTAAGAAGGGATAATCATTAAGATAGTAAAACAGGCAAAGCAGAATCACATGTGCACACACACATACACATGTAAACATTGGAATGCAT
AAGTTTTAATATTTTAGCGCTATCAGTTTCTAAATGCATTAATTACTAACTGCCCTCTCCCAAGATTCATTTAGTTCAAACAGTATCCGT
AAACTAGGAATAATGCCACATGCATTCAATGGGATCTTTTAAGTACTCTTCAGTTTGTTCCAAGAAATGTGCCTACTGAAATCAAATTAA
TTTGTATTCAATGTGTACTTCAAGACTGCTAATTGTTTCATCTGAAAGCCTACAATGAATCATTGTTCAACCTTGAAAAATAAAATTTTG

>37989_37989_3_HSPD1-NRK_HSPD1_chr2_198358883_ENST00000388968_NRK_chrX_105139466_ENST00000243300_length(amino acids)=1399AA_BP=
MLTHNAEVKLVDFGVSAQVSRTNGRRNSFIGTPYWMAPEVIDCDEDPRRSYDYRSDVWSVGITAIEMAEGAPPLCNLQPLEALFVILRES
APTVKSSGWSRKFHNFMEKCTIKNFLFRPTSANMLQHPFVRDIKNERHVVESLTRHLTGIIKKRQKKGIPLIFEREEAIKEQYTVRRFRG
PSCTHELLRLPTSSRCRPLRVLHGEPSQPRWLPDREEPQVQALQQLQGAARVFMPLQALDSAPKPLKGQAQAPQRLQGAARVFMPLQAQV
KAKASKPLQMQIKAPPRLRRAARVLMPLQAQVRAPRLLQVQSQVSKKQQAQTQTSEPQDLDQVPEEFQGQDQVPEQQRQGQAPEQQQRHN
QVPEQELEQNQAPEQPEVQEQAAEPAQAETEAEEPESLRVNAQVFLPLLSQDHHVLLPLHLDTQVLIPVEGQTEGSPQAQAWTLEPPQAI
GSVQALIEGLSRDLLRAPNSNNSKPLGPLQTLMENLSSNRFYSQPEQAREKKSKVSTLRQALAKRLSPKRFRAKSSWRPEKLELSDLEAR
RQRRQRRWEDIFNQHEEELRQVDKDKEDESSDNDEVFHSIQAEVQIEPLKPYISNPKKIEVQERSPSVPNNQDHAHHVKFSSSVPQRSLL
EQAQKPIDIRQRSSQNRQNWLAASESSSEEESPVTGRRSQSSPPYSTIDQKLLVDIHVPDGFKVGKISPPVYLTNEWVGYNALSEIFRND
WLTPAPVIQPPEEDGDYVELYDASADTDGDDDDESNDTFEDTYDHANGNDDLDNQVDQANDVCKDHDDDNNKFVDDVNNNYYEAPSCPRA
SYGRDGSCKQDGYDGSRGKEEAYRGYGSHTANRSHGGSAASEDNAAIGDQEEHAANIGSERRGSEGDGGKGVVRTSEESGALGLNGEENC
SETDGPGLKRPASQDFEYLQEEPGGGNEASNAIDSGAAPSAPDHESDNKDISESSTQSDFSANHSSPSKGSGMSADANFASAILYAGFVE
VPEESPKQPSEVNVNPLYVSPACKKPLIHMYEKEFTSEICCGSLWGVNLLLGTRSNLYLMDRSGKADITKLIRRRPFRQIQVLEPLNLLI
TISGHKNRLRVYHLTWLRNKILNNDPESKRRQEEMLKTEEACKAIDKLTGCEHFSVLQHEETTYIAIALKSSIHLYAWAPKSFDESTAIK
VCIDQSADSEGDYMSYQAYIRILAKIQAADPVNRFKRPDELLHLLKLKVFPTLDHKPVTVDLAIGSEKRLKIFFSSADGYHLIDAESEVM
SDVTLPKNPLEIIIPQNIIILPDCLGIGMMLTFNAEALSVEANEQLFKKILEMWKDIPSSIAFECTQRTTGWGQKAIEVRSLQSRVLESE

--------------------------------------------------------------
>37989_37989_4_HSPD1-NRK_HSPD1_chr2_198358883_ENST00000388968_NRK_chrX_105139466_ENST00000428173_length(transcript)=8199nt_BP=968nt
AGACTAAGCCGGCCGGAGAGGGCTGAGCGCGCTAGCACACCCTGCGCGGGTAGGGAGGGGCGGGGCTCGCGCGCAGGGTGTGCAGATTGC
AGGGCCCGGGCTGACGGGAAGTGGGTGGGAGCTGCCTGCACACGCGGTGCCGCGGGGCGGGAGTAGAGGCGGAGGGAGGGGACACGGGCT
CATTGCGGTGTGCGCCCTGCACTCTGTCCCTCACTCGCCGCCGACGACCTGTCTCGCCGAGCGCACGCCTTGCCGCCGCCCCGCAGAAAT
GCTTCGGTTACCCACAGTCTTTCGCCAGATGAGACCGGTGTCCAGGGTACTGGCTCCTCATCTCACTCGGGCTTATGCCAAAGATGTAAA
ATTTGGTGCAGATGCCCGAGCCTTAATGCTTCAAGGTGTAGACCTTTTAGCCGATGCTGTGGCCGTTACAATGGGGCCAAAGGGAAGAAC
AGTGATTATTGAGCAGAGTTGGGGAAGTCCCAAAGTAACAAAAGATGGTGTGACTGTTGCAAAGTCAATTGACTTAAAAGATAAATACAA
AAACATTGGAGCTAAACTTGTTCAAGATGTTGCCAATAACACAAATGAAGAAGCTGGGGATGGCACTACCACTGCTACTGTACTGGCACG
CTCTATAGCCAAGGAAGGCTTCGAGAAGATTAGCAAAGGTGCTAATCCAGTGGAAATCAGGAGAGGTGTGATGTTAGCTGTTGATGCTGT
AATTGCTGAACTTAAAAAGCAGTCTAAACCTGTGACCACCCCTGAAGAAATTGCACAGGTTGCTACGATTTCTGCAAACGGAGACAAAGA
AATTGGCAATATCATCTCTGATGCAATGAAAAAAGTTGGAAGAAAGGGTGTCATCACAGTAAAGGATGGAAAAACACTGAATGATGAATT
AGAAATTATTGAAGGCATGAAGTTTGATCGAGGCTATATTTCTCCATACTTTATTAATACATCAAAAGCATCAAAGGTCAGAATGTGCTG
CTGACTCATAATGCTGAAGTAAAACTGGTTGATTTTGGAGTGAGTGCCCAGGTGAGCAGAACTAATGGAAGAAGGAATAGTTTCATTGGG
ACACCATACTGGATGGCACCTGAGGTGATTGACTGTGATGAGGACCCAAGACGCTCCTATGATTACAGAAGTGATGTGTGGTCTGTGGGA
ATTACTGCCATTGAAATGGCTGAAGGAGCCCCTCCTCTGTGTAACCTTCAACCCTTGGAAGCTCTCTTCGTTATTTTGCGGGAATCTGCT
CCCACAGTCAAATCCAGCGGATGGTCCCGTAAGTTCCACAATTTCATGGAAAAGTGTACGATAAAAAATTTCCTGTTTCGTCCTACTTCT
GCAAACATGCTTCAACACCCATTTGTTCGGGATATAAAAAATGAACGACATGTTGTTGAGTCATTAACAAGGCATCTTACTGGAATCATT
AAAAAAAGACAGAAAAAAAAAGGAATACCTTTGATCTTTGAAAGAGAAGAAGCTATTAAGGAACAGTACACCGTGAGAAGATTCAGAGGA
CCCTCTTGCACTCACGAGCTTCTGAGATTGCCAACCAGCAGCAGATGCAGACCACTTAGAGTCCTGCATGGGGAACCCTCTCAGCCAAGG
TGGCTACCTGATCGAGAAGAGCCACAGGTCCAGGCACTTCAGCAGCTACAGGGAGCAGCCAGGGTATTCATGCCACTGCAGGCTCTGGAC
AGTGCACCTAAGCCTCTAAAGGGGCAGGCTCAGGCACCTCAACGACTACAAGGGGCAGCTCGGGTGTTCATGCCACTACAGGCTCAGGTG
AAGGCTAAAGCCTCTAAACCTCTACAAATGCAGATTAAGGCACCTCCACGACTACGGAGGGCAGCCAGGGTGCTCATGCCACTACAGGCA
CAGGTTAGGGCACCTAGGCTTCTGCAGGTACAGTCCCAGGTATCCAAAAAGCAGCAGGCCCAGACCCAGACATCAGAACCACAAGATTTG
GACCAGGTACCAGAGGAATTTCAGGGTCAAGATCAGGTACCCGAACAACAAAGGCAGGGCCAGGCCCCTGAACAACAGCAGAGGCACAAC
CAGGTGCCTGAACAAGAGCTGGAGCAGAACCAGGCACCTGAACAGCCAGAGGTACAGGAACAGGCTGCCGAGCCTGCACAGGCAGAGACT
GAGGCAGAGGAACCTGAGTCATTACGAGTAAATGCCCAGGTATTTCTGCCCCTGCTATCACAAGATCACCATGTGCTGTTGCCACTACAT
TTGGATACTCAGGTGCTCATTCCAGTAGAGGGGCAAACTGAAGGATCACCTCAGGCACAGGCTTGGACACTAGAACCCCCACAGGCAATT
GGCTCAGTTCAAGCACTGATAGAGGGACTATCAAGAGACTTGCTTCGGGCACCAAACTCAAATAACTCAAAGCCACTTGGTCCGTTGCAA
ACCCTGATGGAAAATCTGTCATCAAATAGGTTTTACTCACAACCAGAACAGGCACGGGAGAAAAAATCAAAAGTTTCTACTCTGAGGCAA
GCACTGGCAAAAAGACTATCACCAAAGAGGTTCAGGGCAAAGTCATCATGGAGACCTGAAAAGCTTGAACTCTCGGATTTAGAAGCCCGC
AGGCAAAGGCGCCAACGCAGATGGGAAGATATCTTTAATCAGCATGAGGAAGAATTGAGACAAGTTGATAAAGACAAAGAAGATGAATCA
TCAGACAATGATGAAGTATTTCATTCGATTCAGGCTGAAGTCCAGATAGAGCCATTGAAGCCATACATTTCAAATCCTAAAAAAATTGAG
GTTCAAGAGAGATCTCCTTCTGTGCCTAACAACCAGGATCATGCACATCATGTCAAGTTCTCTTCAAGCGTTCCTCAGCGGTCTCTTTTG
GAACAAGCTCAGAAGCCCATTGACATCAGACAAAGGAGTTCGCAAAATCGTCAAAATTGGCTGGCAGCATCAGAATCTTCTTCTGAGGAA
GAAAGTCCTGTGACTGGAAGGAGGTCTCAGTCATCACCACCTTATTCTACTATTGATCAGAAGTTGCTGGTTGACATCCATGTTCCAGAT
GGATTTAAAGTAGGAAAAATATCACCCCCTGTATACTTGACAAACGAATGGGTAGGCTATAATGCACTCTCTGAAATCTTCCGGAATGAT
TGGTTAACTCCGGCACCTGTCATTCAGCCACCTGAAGAGGATGGTGATTATGTTGAACTCTATGATGCCAGTGCTGATACTGATGGTGAT
GATGATGATGAGTCTAATGATACTTTTGAAGATACCTATGATCATGCCAATGGCAATGATGACTTGGATAACCAGGTTGATCAGGCTAAT
GATGTTTGTAAAGACCATGATGATGACAACAATAAGTTTGTTGATGATGTAAATAATAATTATTATGAGGCGCCTAGTTGTCCAAGGGCA
AGCTATGGCAGAGATGGAAGCTGCAAGCAAGATGGTTATGATGGAAGTCGTGGAAAAGAGGAAGCCTACAGAGGCTATGGAAGCCATACA
GCCAATAGAAGCCATGGAGGAAGTGCAGCCAGTGAGGACAATGCAGCCATTGGAGATCAGGAAGAACATGCAGCCAATATAGGCAGTGAA
AGAAGAGGCAGTGAGGGTGATGGAGGTAAGGGAGTCGTTCGAACCAGTGAAGAGAGTGGAGCCCTTGGACTCAATGGAGAAGAAAATTGC
TCAGAGACAGATGGTCCAGGATTGAAGAGACCTGCGTCTCAGGACTTTGAATATCTACAGGAGGAGCCAGGTGGTGGAAATGAGGCCTCA
AATGCCATTGACTCAGGTGCTGCACCGTCAGCACCTGATCATGAGAGTGACAATAAGGACATATCAGAATCATCAACACAATCAGATTTT
TCTGCCAATCACTCATCTCCTTCCAAAGGTTCTGGGATGTCTGCTGATGCTAACTTTGCCAGTGCCATCTTATACGCTGGATTCGTAGAA
GTACCTGAGGAATCACCTAAGCAACCCTCTGAAGTCAATGTTAACCCACTCTATGTCTCTCCTGCATGTAAAAAACCACTAATCCACATG
TATGAAAAGGAGTTCACTTCTGAGATCTGCTGTGGTTCTTTGTGGGGAGTCAATTTGCTGTTGGGAACCCGATCTAATCTATATCTGATG
GACAGAAGTGGAAAGGCTGACATTACTAAACTTATAAGGCGAAGACCATTCCGCCAGATTCAAGTCTTAGAGCCACTCAATTTGCTGATT
ACCATCTCAGGTCATAAGAACAGACTTCGGGTGTATCATCTGACCTGGTTGAGGAACAAGATTTTGAATAATGATCCAGAAAGTAAAAGA
AGGCAAGAAGAAATGCTGAAGACAGAGGAAGCCTGCAAAGCTATTGATAAGTTAACAGGCTGTGAACACTTCAGTGTCCTCCAACATGAA
GAAACAACATATATTGCAATTGCTTTGAAATCATCAATTCACCTTTATGCATGGGCACCAAAGTCCTTTGATGAAAGCACTGCTATTAAA
GTATGCATTGATCAATCAGCAGACTCTGAAGGAGACTACATGTCCTATCAAGCCTATATACGAATACTGGCAAAAATACAGGCAGCTGAT
CCAGTGAACCGGTTTAAGAGACCAGATGAGCTCCTTCATTTGCTGAAGCTCAAGGTATTTCCAACACTTGATCATAAGCCAGTGACAGTT
GACCTGGCTATTGGTTCTGAAAAAAGACTAAAGATTTTCTTCAGCTCAGCAGATGGATATCACCTCATCGATGCAGAATCTGAGGTTATG
TCTGATGTGACCCTGCCAAAGAATCCCCTGGAAATCATTATACCACAGAATATCATCATTTTACCTGATTGCTTGGGAATTGGCATGATG
CTCACCTTCAATGCTGAAGCCCTCTCTGTGGAAGCAAATGAACAACTCTTCAAGAAGATCCTTGAAATGTGGAAAGACATACCATCTTCT
ATAGCTTTTGAATGTACACAGCGAACCACAGGATGGGGCCAAAAGGCCATTGAAGTGCGCTCTTTGCAATCCAGGGTTCTGGAAAGTGAG
CTGAAGCGCAGGTCAATTAAGAAGCTGAGATTCCTGTGCACCCGGGGTGACAAGCTGTTCTTTACCTCTACCCTGCGCAATCACCACAGC
CGGGTTTACTTCATGACACTTGGAAAACTTGAAGAGCTCCAAAGCAATTATGATGTCTAAAAGTTTCCAGTGATTTATTACCACATTATA
AACATCATGTATAGGCAGTCTGCATCTTCAGATTTCAGAGATTAAATGAGTATTCAGTTTTATTTTTAGTAAAGATTAAATCCAAAACTT
TACTTTTAATGTAGCACAGAATAGTTTTAATGAGAAATGCAGCTTTATGTATAAAATTAACTATAGCAAGCTCTAGGTACTCCAATGGTG
TACAATGTCTTTTGCACAAACTTTGTAACTTTTGTTACTGTGAATTCAAACATTACTCTTTGGACAGTTTGGACAGTATCTGTATTCAGA
TTTTACAACATGGAGTAAAGAAACCTGTTATGAATTAGATTACAAGCAGCCTTCAAAAGAATTGGCACTGGGATAAGATTTTTCAGAAAA
AGAAAAACATCGGCAAACTGTGTGTGATTTTTCCAAAGCTATATAAAGAACCAAAGGTTTAGTCAAGAAACAAAAATCTTAAAGATTATT
ATAACCCAGACTAAGGTTGAACAACCTGCATGCCCAGAGAAAACTATGGCGACAAAGGGGAAAAGGCCACCACTCGTTTTCTCACTGATT
CATGCCAATTAAGCCTACAGTTAAAGACCAGTTTTGTTCTTTTCACCCATTTTTAAGCTGGTTTTCTCCTGATAAGAAGAAAGGAAGAAA
GCCCCAGACGCTTGGTTTTTCTCAGAACCCCCAAAAGATGTGCAATAGCTGTTGTTACAAACCACCAAATAATACAGTTGTGAGCCTGAA
TACAGGACTGAACTCCTATACACGTGTACTGTAGAATGAGTATTTTTTAATACCTTAAGGTAGGCGTCAAATTCTACTCCCCAAAGCAGA
GATAGATTGATTTATCAAAATTATTATCTGGCCAACAGTGTGACTATCAGACAGCATCAAATATTTGCCCAATCCAAGATTAGACTACAC
AAAAGCTTCCTTCCAGTATTAAACAAAAAGAATTAAACATAACTATGAAAAAACTTTGCTAATATCTGTGTTTTTCAGATTTCATTTTTT
GTAAAATCAGAAATTAATCTAAACATATTCAGTGATAAGTTCATGTGTAACGACTTAATGTTAAAGGTTAAAAAAAAGATTTCACAAAAT
ATACAACTTTCACCATATATATAAGCCTGCAAAATTAGAGTAGTGAAAGTCATGCTAGTCCATCACCCAAATATGTTATAGACGCCATAG
ACAGGTGATGTTTGGTCACCTATGGTAACTGCTACCTGATGAAGAGCATAATTTCTGCATATCCATCCTCAATACCATGGTAAATTCTGG
GGCAATAGAGAAGCAACAGAACTGCCACAAAGTATACCTCAATATAATTCCTCTAGTTCTGCTTCTAAAATCTGAGGACAGTGCTAGTGG
GAAAATAATTTTCAAACTACCTGGTTAACCAAAATACAAAAGCAGCTGACTATGTGTGATTTCATAATAGCACATTTCTTGACACTTAGT
GCTAGAAATGAAGATTTGGATTTTCCTAACAACTTACATCAAGAATGTAGTGTAGCTCATTATTGAGAATTTAGGAAAGCCTGAATCCAT
TAATTAAGGAAATAAATGTGACTCACATTTCTTTTACTGTGACACAATAATGTGATCCTAAAACTGGCTTATCCTTGAGTGTTTACAACT
CAAACAACTTTTTGAATGCAGTAGTTTTTTTTTTTTAAAAACAAACTTTTATGTCAAATTTTTTTTCTTAGAAGTAGTCTTCATTATTAT
AAATTTGTACACCAAAAGGCCATGGGGAACTTTGTGCAAGTACCTCATCGCTGAGCAAATGGAGCTTGCTATGTTTTAATTTCAGAAAAT
TTCCTCATATACGTAGTGTGTAGAATCAAGTCTTTTAATAATTCATTTTTTCTTCATAATATTTACTCAAAGTTAAGCTTAAAAATAAGT
TTTATCTTAAAATCATATTTGAAGACAGTAAGACAGTAAACTATTTTAGGAAGTCAACCCCCATTGCACTCTGTGGCAGTTATTCTGGTA
AAAATAGGCAAAAGTGACCTGAATCTACAATGATGTCCCAAAGTAACCAAGTAAGAGAGATTGTAAATGATAAACCGAGCTTTAAAGGAT
AAAGTGTTAATAAAGAAAGGAAGCTGGGCACATGTCAAAAAGGGAGATCGAAATGTTAGGTAATCATTTAGAAAGGACAGAAAATATTTA
AAGTGGCTCATAGGTAATGAATATTTCTGACTTAGATGTAAATCCATCTGGAATCTTTACATCCTTTGCCAGCTGAAACAAGAAAGTGAA
GGGACAATGATATTTCATGGTCAGTTTATTTTGTAAGAGACAGAAGAAATTATATCTATACATTACCTTGTAGCAGCAGTACCTGGAAGC
CCCAGCCCGTCACAGAAGTGTGGAGGGGGGCTCCTGACTAGACAATTTCCCTAGCCCTTGTGATTTGAAGCATGAAAGTTCTGGCAGGTT
ATGAGCAGCACTAGGGATAAAGTATGGTTTTATTTTGGTGTAATTTAGGTTTTTCAACAAAGCCCTTGTCTAAAATAAAAGGCATTATTG
GAAATATTTGAAAACTAGAAAATGATGGATAAAAGGGCTGATAAGAAAATTTCTGACTGTCAGTAGAAGTGAGATAAGATCCTCAGAGGA
AACAGTAAGAAGGGATAATCATTAAGATAGTAAAACAGGCAAAGCAGAATCACATGTGCACACACACATACACATGTAAACATTGGAATG
CATAAGTTTTAATATTTTAGCGCTATCAGTTTCTAAATGCATTAATTACTAACTGCCCTCTCCCAAGATTCATTTAGTTCAAACAGTATC
CGTAAACTAGGAATAATGCCACATGCATTCAATGGGATCTTTTAAGTACTCTTCAGTTTGTTCCAAGAAATGTGCCTACTGAAATCAAAT
TAATTTGTATTCAATGTGTACTTCAAGACTGCTAATTGTTTCATCTGAAAGCCTACAATGAATCATTGTTCAACCTTGAAAAATAAAATT

>37989_37989_4_HSPD1-NRK_HSPD1_chr2_198358883_ENST00000388968_NRK_chrX_105139466_ENST00000428173_length(amino acids)=1400AA_BP=
MLTHNAEVKLVDFGVSAQVSRTNGRRNSFIGTPYWMAPEVIDCDEDPRRSYDYRSDVWSVGITAIEMAEGAPPLCNLQPLEALFVILRES
APTVKSSGWSRKFHNFMEKCTIKNFLFRPTSANMLQHPFVRDIKNERHVVESLTRHLTGIIKKRQKKKGIPLIFEREEAIKEQYTVRRFR
GPSCTHELLRLPTSSRCRPLRVLHGEPSQPRWLPDREEPQVQALQQLQGAARVFMPLQALDSAPKPLKGQAQAPQRLQGAARVFMPLQAQ
VKAKASKPLQMQIKAPPRLRRAARVLMPLQAQVRAPRLLQVQSQVSKKQQAQTQTSEPQDLDQVPEEFQGQDQVPEQQRQGQAPEQQQRH
NQVPEQELEQNQAPEQPEVQEQAAEPAQAETEAEEPESLRVNAQVFLPLLSQDHHVLLPLHLDTQVLIPVEGQTEGSPQAQAWTLEPPQA
IGSVQALIEGLSRDLLRAPNSNNSKPLGPLQTLMENLSSNRFYSQPEQAREKKSKVSTLRQALAKRLSPKRFRAKSSWRPEKLELSDLEA
RRQRRQRRWEDIFNQHEEELRQVDKDKEDESSDNDEVFHSIQAEVQIEPLKPYISNPKKIEVQERSPSVPNNQDHAHHVKFSSSVPQRSL
LEQAQKPIDIRQRSSQNRQNWLAASESSSEEESPVTGRRSQSSPPYSTIDQKLLVDIHVPDGFKVGKISPPVYLTNEWVGYNALSEIFRN
DWLTPAPVIQPPEEDGDYVELYDASADTDGDDDDESNDTFEDTYDHANGNDDLDNQVDQANDVCKDHDDDNNKFVDDVNNNYYEAPSCPR
ASYGRDGSCKQDGYDGSRGKEEAYRGYGSHTANRSHGGSAASEDNAAIGDQEEHAANIGSERRGSEGDGGKGVVRTSEESGALGLNGEEN
CSETDGPGLKRPASQDFEYLQEEPGGGNEASNAIDSGAAPSAPDHESDNKDISESSTQSDFSANHSSPSKGSGMSADANFASAILYAGFV
EVPEESPKQPSEVNVNPLYVSPACKKPLIHMYEKEFTSEICCGSLWGVNLLLGTRSNLYLMDRSGKADITKLIRRRPFRQIQVLEPLNLL
ITISGHKNRLRVYHLTWLRNKILNNDPESKRRQEEMLKTEEACKAIDKLTGCEHFSVLQHEETTYIAIALKSSIHLYAWAPKSFDESTAI
KVCIDQSADSEGDYMSYQAYIRILAKIQAADPVNRFKRPDELLHLLKLKVFPTLDHKPVTVDLAIGSEKRLKIFFSSADGYHLIDAESEV
MSDVTLPKNPLEIIIPQNIIILPDCLGIGMMLTFNAEALSVEANEQLFKKILEMWKDIPSSIAFECTQRTTGWGQKAIEVRSLQSRVLES

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for HSPD1-NRK


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for HSPD1-NRK


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for HSPD1-NRK


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource