Fusion Gene Studies
in Kim Lab

FusionBase FusionGDB FusionGDB2 FusionPDB FusionNeoAntigen FusionAI FusionNW FGviewer Publication Contact
FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:ALPK1-NAA15 (FusionGDB2 ID:HG80216TG80155)

Fusion Gene Summary for ALPK1-NAA15

check button Fusion gene summary
Fusion gene informationFusion gene name: ALPK1-NAA15
Fusion gene ID: hg80216tg80155
HgeneTgene
Gene symbol

ALPK1

NAA15

Gene ID

80216

80155

Gene namealpha kinase 1N-alpha-acetyltransferase 15, NatA auxiliary subunit
Synonyms8430410J10Rik|LAKGa19|MRD50|NARG1|NAT1P|NATH|TBDN|TBDN100
Cytomap('ALPK1')('NAA15')

4q25

4q31.1

Type of geneprotein-codingprotein-coding
Descriptionalpha-protein kinase 1chromosome 4 kinaselymphocyte alpha-kinaselymphocyte alpha-protein kinaseN-alpha-acetyltransferase 15, NatA auxiliary subunitN-terminal acetyltransferaseNMDA receptor regulated 1NMDA receptor-regulated protein 1gastric cancer antigen Ga19protein tubedown-1transcriptional coactivator tubedown-100tubedown-1
Modification date2020032020200313
UniProtAcc..
Ensembl transtripts involved in fusion geneENST00000177648, ENST00000458497, 
ENST00000504176, ENST00000505912, 
Fusion gene scores* DoF score6 X 6 X 3=10814 X 11 X 8=1232
# samples 615
** MAII scorelog2(6/108*10)=-0.84799690655495
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(15/1232*10)=-3.03796785019902
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: ALPK1 [Title/Abstract] AND NAA15 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointALPK1(113362261)-NAA15(140291365), # samples:1
Anticipated loss of major functional domain due to fusion event.ALPK1-NAA15 seems lost the major protein functional domain in Hgene partner, which is a CGC by not retaining the major functional domain in the partially deleted in-frame ORF.
ALPK1-NAA15 seems lost the major protein functional domain in Hgene partner, which is a CGC by not retaining the major functional domain in the partially deleted in-frame ORF.
ALPK1-NAA15 seems lost the major protein functional domain in Hgene partner, which is a essential gene by not retaining the major functional domain in the partially deleted in-frame ORF.
ALPK1-NAA15 seems lost the major protein functional domain in Hgene partner, which is a essential gene by not retaining the major functional domain in the partially deleted in-frame ORF.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
HgeneALPK1

GO:0002753

cytoplasmic pattern recognition receptor signaling pathway

28222186|28877472|30111836

HgeneALPK1

GO:0043123

positive regulation of I-kappaB kinase/NF-kappaB signaling

28222186|28877472|30111836

HgeneALPK1

GO:0045087

innate immune response

28222186|28877472|30111836

TgeneNAA15

GO:0006474

N-terminal protein amino acid acetylation

15496142

TgeneNAA15

GO:0045893

positive regulation of transcription, DNA-templated

12145306


check buttonFusion gene breakpoints across ALPK1 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure
check buttonFusion gene breakpoints across NAA15 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4SKCMTCGA-W3-A825-06AALPK1chr4

113362261

-NAA15chr4

140291365

+


Top

Fusion Gene ORF analysis for ALPK1-NAA15

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
5CDS-intronENST00000177648ENST00000480277ALPK1chr4

113362261

-NAA15chr4

140291365

+
5CDS-intronENST00000177648ENST00000515576ALPK1chr4

113362261

-NAA15chr4

140291365

+
5CDS-intronENST00000458497ENST00000480277ALPK1chr4

113362261

-NAA15chr4

140291365

+
5CDS-intronENST00000458497ENST00000515576ALPK1chr4

113362261

-NAA15chr4

140291365

+
5CDS-intronENST00000504176ENST00000480277ALPK1chr4

113362261

-NAA15chr4

140291365

+
5CDS-intronENST00000504176ENST00000515576ALPK1chr4

113362261

-NAA15chr4

140291365

+
In-frameENST00000177648ENST00000296543ALPK1chr4

113362261

-NAA15chr4

140291365

+
In-frameENST00000177648ENST00000398947ALPK1chr4

113362261

-NAA15chr4

140291365

+
In-frameENST00000458497ENST00000296543ALPK1chr4

113362261

-NAA15chr4

140291365

+
In-frameENST00000458497ENST00000398947ALPK1chr4

113362261

-NAA15chr4

140291365

+
In-frameENST00000504176ENST00000296543ALPK1chr4

113362261

-NAA15chr4

140291365

+
In-frameENST00000504176ENST00000398947ALPK1chr4

113362261

-NAA15chr4

140291365

+
intron-3CDSENST00000505912ENST00000296543ALPK1chr4

113362261

-NAA15chr4

140291365

+
intron-3CDSENST00000505912ENST00000398947ALPK1chr4

113362261

-NAA15chr4

140291365

+
intron-intronENST00000505912ENST00000480277ALPK1chr4

113362261

-NAA15chr4

140291365

+
intron-intronENST00000505912ENST00000515576ALPK1chr4

113362261

-NAA15chr4

140291365

+

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000458497ALPK1chr4113362261-ENST00000296543NAA15chr4140291365+8152400627948531524
ENST00000458497ALPK1chr4113362261-ENST00000398947NAA15chr4140291365+8135400627948501523
ENST00000177648ALPK1chr4113362261-ENST00000296543NAA15chr4140291365+8073392712547741549
ENST00000177648ALPK1chr4113362261-ENST00000398947NAA15chr4140291365+8056392712547711548
ENST00000504176ALPK1chr4113362261-ENST00000296543NAA15chr4140291365+7908376221246091465
ENST00000504176ALPK1chr4113362261-ENST00000398947NAA15chr4140291365+7891376221246061464

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000458497ENST00000296543ALPK1chr4113362261-NAA15chr4140291365+0.0002017040.99979836
ENST00000458497ENST00000398947ALPK1chr4113362261-NAA15chr4140291365+0.0001802670.9998198
ENST00000177648ENST00000296543ALPK1chr4113362261-NAA15chr4140291365+0.0001866450.9998134
ENST00000177648ENST00000398947ALPK1chr4113362261-NAA15chr4140291365+0.0001669890.999833
ENST00000504176ENST00000296543ALPK1chr4113362261-NAA15chr4140291365+0.0001970650.999803
ENST00000504176ENST00000398947ALPK1chr4113362261-NAA15chr4140291365+0.0001751440.99982494

Top

Fusion Genomic Features for ALPK1-NAA15


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

Top

Fusion Protein Features for ALPK1-NAA15


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr4:113362261/chr4:140291365)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
..
FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneALPK1chr4:113362261chr4:140291365ENST00000177648-1516105_10912421245.0Compositional biasNote=Poly-Ala
HgeneALPK1chr4:113362261chr4:140291365ENST00000177648-1516921_96012421245.0Compositional biasNote=Ser-rich
HgeneALPK1chr4:113362261chr4:140291365ENST00000458497-1516105_10912421245.0Compositional biasNote=Poly-Ala
HgeneALPK1chr4:113362261chr4:140291365ENST00000458497-1516921_96012421245.0Compositional biasNote=Ser-rich
HgeneALPK1chr4:113362261chr4:140291365ENST00000504176-1415105_10911641167.0Compositional biasNote=Poly-Ala
HgeneALPK1chr4:113362261chr4:140291365ENST00000504176-1415921_96011641167.0Compositional biasNote=Ser-rich
HgeneALPK1chr4:113362261chr4:140291365ENST00000177648-15161017_123712421245.0DomainAlpha-type protein kinase
HgeneALPK1chr4:113362261chr4:140291365ENST00000458497-15161017_123712421245.0DomainAlpha-type protein kinase
HgeneALPK1chr4:113362261chr4:140291365ENST00000177648-1516150_15312421245.0RegionADP-D-glycero-beta-D-manno-heptose binding
HgeneALPK1chr4:113362261chr4:140291365ENST00000177648-1516236_23712421245.0RegionADP-D-glycero-beta-D-manno-heptose binding
HgeneALPK1chr4:113362261chr4:140291365ENST00000458497-1516150_15312421245.0RegionADP-D-glycero-beta-D-manno-heptose binding
HgeneALPK1chr4:113362261chr4:140291365ENST00000458497-1516236_23712421245.0RegionADP-D-glycero-beta-D-manno-heptose binding
HgeneALPK1chr4:113362261chr4:140291365ENST00000504176-1415150_15311641167.0RegionADP-D-glycero-beta-D-manno-heptose binding
HgeneALPK1chr4:113362261chr4:140291365ENST00000504176-1415236_23711641167.0RegionADP-D-glycero-beta-D-manno-heptose binding
TgeneNAA15chr4:113362261chr4:140291365ENST000002965431320629_632584867.0Compositional biasNote=Poly-Asp
TgeneNAA15chr4:113362261chr4:140291365ENST000002965431320612_629584867.0MotifBipartite nuclear localization signal
TgeneNAA15chr4:113362261chr4:140291365ENST000002965431320672_705584867.0RepeatNote=TPR 8

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneALPK1chr4:113362261chr4:140291365ENST00000504176-14151017_123711641167.0DomainAlpha-type protein kinase
TgeneNAA15chr4:113362261chr4:140291365ENST000002965431320148_184584867.0RepeatNote=TPR 3
TgeneNAA15chr4:113362261chr4:140291365ENST000002965431320224_257584867.0RepeatNote=TPR 4
TgeneNAA15chr4:113362261chr4:140291365ENST000002965431320374_407584867.0RepeatNote=TPR 5
TgeneNAA15chr4:113362261chr4:140291365ENST000002965431320409_441584867.0RepeatNote=TPR 6
TgeneNAA15chr4:113362261chr4:140291365ENST00000296543132046_79584867.0RepeatNote=TPR 1
TgeneNAA15chr4:113362261chr4:140291365ENST000002965431320485_518584867.0RepeatNote=TPR 7
TgeneNAA15chr4:113362261chr4:140291365ENST00000296543132080_113584867.0RepeatNote=TPR 2


Top

Fusion Gene Sequence for ALPK1-NAA15


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>4109_4109_1_ALPK1-NAA15_ALPK1_chr4_113362261_ENST00000177648_NAA15_chr4_140291365_ENST00000296543_length(transcript)=8073nt_BP=3927nt
TTTATGAGAAACAGTGTGTTTCAGAGAGGCTGTACCAGAATTAACTCTGCTCAGAGTTAGATTTGCTGGTCTTAAAGTACTTTTCCTCTT
TAAGATAAAAGAAGTTCTTCTAAATCAGGAATGGATTGAAATCTAATGAACCGAAACTTTGGGTAATTGATCACCCTAGACCCAGGGACA
CCCAATTCATCGTAATCATCATGAATAATCAAAAAGTGGTAGCTGTGCTACTGCAAGAGTGCAAGCAAGTGCTGGATCAGCTCTTGTTGG
AAGCGCCAGATGTGTCGGAAGAGGACAAGAGCGAGGACCAGCGCTGCAGAGCTTTACTCCCCAGCGAGTTAAGGACCCTGATCCAGGAGG
CAAAGGAAATGAAGTGGCCCTTCGTGCCTGAAAAGTGGCAGTACAAACAAGCCGTGGGCCCAGAGGACAAAACAAACCTGAAGGATGTGA
TTGGCGCCGGGTTGCAGCAGTTACTGGCGTCCCTGAGGGCCTCCATCCTCGCTCGGGACTGTGCGGCTGCGGCGGCTATTGTGTTCTTGG
TGGACCGGTTCCTGTATGGGCTCGACGTCTCTGGAAAACTTCTGCAGGTCGCCAAAGGTCTCCACAAGTTGCAGCCAGCCACGCCAATTG
CCCCGCAGGTGGTTATTCGCCAAGCCCGAATCTCCGTGAACTCAGGAAAACTTTTAAAAGCAGAGTATATTCTGAGCAGTCTAATAAGCA
ACAATGGAGCAACGGGTACCTGGCTGTACAGAAATGAAAGTGACAAGGTCCTGGTGCAGTCGGTCTGTATACAGATCAGAGGGCAGATTC
TGCAAAAGCTGGGGATGTGGTACGAAGCAGCAGAGTTAATATGGGCCTCCATTGTAGGATATTTGGCACTTCCTCAGCCGGATAAAAAGG
GCCTCTCCACGTCGCTAGGTATACTGGCAGACATCTTTGTTTCCATGAGCAAGAACGATTATGAAAAGTTTAAAAACAATCCACAAATTA
ATTTGAGCCTGCTGAAGGAGTTTGACCACCATTTGCTGTCCGCTGCAGAAGCCTGCAAGCTGGCAGCTGCCTTCAGTGCCTATACGCCGC
TCTTCGTGCTCACAGCTGTGAATATCCGTGGCACGTGTTTATTGTCCTACAGTAGTTCAAATGACTGTCCTCCAGAATTGAAAAACTTAC
ATCTGTGTGAAGCCAAAGAGGCCTTTGAGATTGGCCTCCTCACCAAGAGAGATGATGAGCCTGTTACTGGAAAACAGGAGCTTCACAGCT
TTGTCAAAGCTGCTTTCGGTCTCACCACAGTGCACAGAAGGCTCCATGGGGAGACAGGGACGGTCCATGCAGCAAGTCAGCTCTGTAAGG
AAGCAATGGGGAAGCTGTACAATTTCAGCACTTCCTCCAGAAGTCAGGACAGAGAAGCTCTGTCTCAAGAAGTTATGTCTGTGATTGCCC
AGGTGAAGGAACATTTACAAGTTCAAAGCTTCTCAAATGTAGATGACAGATCTTATGTTCCCGAGAGTTTCGAGTGCAGGTTGGATAAAC
TTATCTTGCATGGGCAAGGGGATTTCCAAAAAATCCTTGACACCTATTCACAGCACCATACTTCGGTGTGTGAAGTATTTGAAAGTGATT
GTGGAAACAACAAAAATGAACAGAAAGATGCAAAAACAGGAGTCTGCATCACTGCTCTAAAAACAGAAATAAAAAACATAGATACTGTGA
GTACTACTCAAGAAAAGCCACATTGTCAAAGAGACACAGGAATATCTTCCTCCCTAATGGGTAAGAATGTTCAGAGGGAACTCAGAAGGG
GAGGAAGGAGAAACTGGACCCATTCTGATGCATTTCGAGTCTCCTTGGATCAAGATGTGGAGACTGAGACTGAGCCATCGGACTACAGCA
ATGGTGAGGGAGCTGTTTTCAACAAGTCTCTGAGTGGCAGCCAGACTTCCAGTGCTTGGAGCAACTTATCAGGGTTTAGTTCCTCTGCAA
GCTGGGAGGAAGTGAATTATCACGTTGACGACAGGTCAGCCAGAAAAGAGCCTGGCAAAGAACATCTGGTGGACACTCAGTGTTCCACTG
CCTTGTCTGAGGAGCTAGAGAATGACAGGGAAGGCAGAGCTATGCATTCATTGCATTCACAGCTTCATGATCTCTCTCTTCAGGAACCCA
ACAATGACAATTTGGAGCCTTCTCAAAATCAGCCACAGCAACAGATGCCCTTGACACCCTTCTCGCCTCATAATACCCCAGGCATTTTCT
TGGCCCCTGGTGCAGGGCTTCTAGAAGGAGCTCCAGAAGGTATCCAGGAAGTCAGAAATATGGGACCCAGAAATACTTCTGCTCACTCCA
GACCCTCATATCGTTCTGCTTCTTGGTCTTCTGATTCTGGTAGGCCCAAGAATATGGGCACACATCCTTCAGTCCAAAAAGAAGAAGCCT
TTGAAATAATTGTTGAGTTTCCAGAAACCAACTGCGATGTCAAAGACAGGCAGGGGAAAGAGCAGGGAGAAGAAATTAGTGAAAGAGGCG
CAGGCCCTACATTTAAAGCTAGTCCCTCCTGGGTTGACCCAGAAGGAGAAACAGCAGAAAGCACTGAAGATGCACCCTTAGACTTTCACA
GGGTCCTGCACAATTCTCTGGGAAACATTTCCATGCTGCCATGTAGCTCCTTCACCCCTAATTGGCCTGTTCAAAATCCTGACTCCAGAA
AAAGTGGTGGCCCAGTCGCAGAGCAGGGCATCGACCCTGATGCCTCCACAGTGGATGAGGAGGGGCAACTGCTCGACAGCATGGATGTTC
CCTGCACAAATGGGCACGGCTCTCATAGACTGTGCATTCTGAGACAGCCGCCTGGTCAGAGGGCGGAGACCCCCAATTCCTCTGTAAGCG
GTAACATCCTCTTCCCTGTCCTCAGCGAGGACTGCACTACCACAGAGGAAGGAAATCAGCCTGGAAACATGCTAAACTGCAGCCAGAACT
CCAGCTCATCCTCAGTGTGGTGGCTGAAATCACCTGCATTTTCCAGTGGTTCTTCTGAGGGGGACAGCCCTTGGTCCTATCTGAATTCCA
GTGGGAGTTCTTGGGTTTCATTGCCGGGAAAGATGAGGAAAGAGATCCTTGAGGCTCGCACCTTGCAACCTGATGACTTTGAAAAGCTGT
TGGCAGGAGTGAGGCATGATTGGCTGTTTCAGAGACTAGAGAATACGGGGGTTTTTAAGCCCAGTCAACTCCACCGAGCACATAGTGCTC
TTTTGTTAAAATATTCAAAAAAATCTGAACTGTGGACGGCCCAGGAAACTATTGTCTATTTGGGGGACTACTTGACTGTGAAGAAAAAAG
GCAGACAAAGAAATGCTTTTTGGGTTCATCATCTTCATCAAGAAGAAATTCTGGGGAGGTATGTTGGGAAAGACTATAAGGAGCAGAAGG
GGCTCTGGCACCACTTCACTGATGTGGAGCGACAGATGACCGCACAGCACTATGTGACAGAATTTAACAAGAGACTCTATGAACAAAACA
TTCCCACCCAGATATTCTACATCCCATCCACAATACTACTGATTTTAGAGGACAAGACAATAAAGGGATGTATCAGTGTGGAGCCTTACA
TACTGGGAGAATTTGTAAAATTGTCAAATAACACGAAAGTGGTGAAAACAGAATACAAAGCCACAGAATATGGCTTGGCCTATGGCCATT
TTTCTTATGAGTTTTCTAATCATAGAGATGTTGTGGTCGATTTACAAGGTTGGGTAACCGGTAATGGAAAAGGACTCATCTACCTCACAG
ATCCCCAGATTCACTCCGTTGATCAGAAAGTTTTCACTACCAATTTTGGAAAGAGAGGAATTTTTTACTTCTTTAATAACCAGCATGTGG
AATGTAATGAAATCTGCCATCGTCTTTCTTTGACTAGACCTTCAATGGAGAAACCATCAAACATGTCTGACAAAGAGCTAAAGAAGCTAC
GTAATAAACAAAGAAGAGCTCAAAAGAAAGCCCAGATAGAAGAAGAGAAAAAAAATGCAGAAAAAGAAAAGCAGCAGAGAAATCAGAAAA
AGAAGAAGGATGATGATGATGAGGAGATAGGAGGTCCAAAAGAAGAACTTATTCCAGAGAAACTGGCCAAGGTTGAAACTCCATTGGAAG
AAGCTATTAAATTTTTAACACCGTTGAAGAACTTGGTGAAGAACAAGATAGAGACTCATCTTTTTGCCTTTGAGATTTACTTTAGGAAAG
AAAAGTTTCTTTTGATGCTACAATCAGTAAAGAGGGCATTTGCTATTGATTCTAGTCATCCCTGGCTTCATGAGTGTATGATTCGTCTCT
TTAATACTGCAGTGTGTGAAAGTAAAGATTTATCTGATACAGTTAGAACAGTATTAAAACAAGAAATGAATCGTCTTTTTGGAGCAACGA
ATCCAAAGAATTTTAATGAAACTTTTCTGAAAAGGAATTCTGATTCATTGCCACACAGATTATCAGCTGCCAAAATGGTATATTACTTAG
ATCCTTCTAGTCAGAAGCGAGCTATAGAGTTGGCAACAACACTTGATGAATCTCTCACTAACAGAAACCTCCAGACATGTATGGAGGTAT
TGGAAGCCTTGTATGATGGTAGCCTAGGAGACTGTAAAGAAGCTGCTGAAATTTATAGAGCAAATTGTCATAAGCTTTTCCCTTATGCTT
TGGCTTTCATGCCTCCTGGATATGAAGAGGATATGAAGATCACAGTTAATGGAGATAGTTCTGCAGAAGCTGAAGAACTGGCCAATGAAA
TTTGAACATCACTAAACAAGCAAATGGAATGACTTTGGACCATATCTAGTATATAATATTTTTGTCACGCACCTGCTGCATTGCTCTAAC
TTACACAGAATGAGAGGAGTAAATGTTCTTGCCTTCAAATAGTGTTTTACGTTTTTTATCCTGCTGAAAAAGTATATATAAAATATCTAA
CATTACAGGATAGAGGTTCAGTTTCTTAAAAAATTAAAGCTGCTAAAATTGAGTGGTTAAAAAAGATACCTTATCCTATTCCTCCCCACC
CACCCATGTTTTTAAACTAATTTATATAAAATCTGGAGGCTGTTACAGCTAACAAAGCAGGTGTGTGGCAGAAATATTACTTTAAATTTG
TCTTGTGAGATTTTACTATATCTCAGACAGCATAAATGCTGTTTTAGCACTGGATTCTTTCACTGAGCACAAAGAGTTGTTGGGGCTTTA
GCATCTGACTGATTTTGTTACGGGGTTGATTCTGACCATAGGAAGTATGCAATGTGAATCACTATTTACAGAGAAACCTACAACAGATGC
TTGATGTTGTAGAAACTGGGACATATAGATACCAAGCAAAATTATAAGAAACCTATAAGGTGTTCAATACGCTTGTGTTTCCAAAATTCA
CTGTACATGATCAGTTTGGTGTTCTTGTACCACAGTTTTTAACTGAAGGAACCAGTTGTAACAGTCTCAATTTTAACTAAAACTTGAAGA
ACTAAAACAACAATGCAAACCTTTCAGCATTGTTTGGCCAAACTTGTTAAAACTGTAATGCAAGAACCAAATGCACTGTGATGTGGCACC
AACTAATTAGCAAGCATGAATTTTTCACCCAAGAGTGAAAAAAGGAAAATCTACCATGGCTTGAAGTTAAAGAGCAGAACTCCTGACTAC
CATTCTATGACTGATCAAAAGACTAATAGTTAAAAACCTCAGCAGGCCTTGTTCACGATATGCAGAAAAAAAAGTGCTGCAGTTTAGATA
CCTCTGGAATTTTTCCACAGTGTCACAGGTTTGTAATACTTGAAGCCCTACATTTCTAAGAATATATTTCTTGCTCAGTTGTTTCAGGCA
AGCCCAAGACTTTGTAATTTTTAAAGGGCCCAAGATTTTTTTTTTTTTTTTTTTTTTCAAATAACAGACCAGCTTCTTTTTCTTGCAGTT
ACAGATGTAATTTCCTTTTTGTTGTCAAACATAAGGTACCAAATATGATGCAATAAATTGTTTTGAAAAACAGTTGTGTGAATATTTCAA
CTAATCTGTGTTGGGCTTCTGTGAAATACACAGGTGGAAACAGAGGTGCAAGCCAGAGGCAATGTAATATGCTGTAAGGCTAGTGCAGAT
GGGAGCTTTTTAGAAGGGGCTAAGTGCTGGTGTCAGGGAAATTCCATAATGAAGTAGAATGCTGCTCCTGCATTAAGATTTCATTGAGGG
CAAGGCTGGTGGCAGGTACTATGAATGTAATTCATAATTTAAAAGGAAAACTAAAAACTATTTTGATTTGGGAAAATGAGCCTTAATTTG
TTAAACCTATACACTGAGAACTAGCCTCAGGCTTAATATTCTCATTGCATTTGCAAGATCTGAGCAAATAAGATTAAGTAAAACAAATCA
ATTGTATATATAATTGACCTTTTTGTGGAACATGTAGTTTATAGAAAGTATACTCTAAAGGGAATTTGCCGAAGACCTTTTACTGATTGA
ACAGTTGTGCTACAATCAACTTTTCATAGTACATGACCTGCATTCCACATCTCAGTCTAACAGTTTAGTAGTGATGTAAAGAGAAGTACA
AACCGAACTCCAGTGCTTTGTTATGTTTTATTAACTGGCCCTGTCTCAGGAACATCTTAACAGATGGCAAAAAAACAAAAACTTTTTTTC
AACTCCTATGAGTGGCAACTGAAGTTCTTATTGTTGGGAAAGAACACTAGTCCTACCTCTGCCACTAATGAGGTGTTTGGAGGAGGTACC
AGCCATATAATAGGGGGTGTATGTGTGAATTTTGTTTAAACTCTACTGTATATTGAAATGAAATTCATTTATTTGTCTTGACAATGTTCA
AATGATGTAGATTGTCTTAGAATGAATATTCATAAGTACTCAGAACTCTTAAGATGCAGATGCCACCCGTGAGGAGCTAAATTCCTAATG
TGTATTGTATTCCAACCCAATTTTACTGGAACTATTGAATAAATCTTTTATTTTCTTTCAGGTTTACTTGATAGTGGATACATTGGTGTC
AGAGGACCTTGACTGGGTTCATTTTATGTCCAGACATCACCCCTGAAACACTGTACTGTACTATCTTGCTCTGAGTAGTATCAACTGGAT
CGTCTCATATTTGCCTCATTCATCCTATTAATTTCAAATGATACTGTGGGGGAAAAACAGGGTTAGAAAAACAAGTGGAAAAAATGGAGT
GATGTTGTAATCTAAACAAGTGCCTTATGTTTATTGCTAAGAACTGGTGTTACCAACCCTTTTGAGAAGAAAGGGTCTCTTGACCTGTAT
TAACATAGGAAAGTAAAGTTCTTTTTGTTTTTCTTTATATTCAGCTACTCTTGTCATTTCTCGTTGAAAAACTAAAATCTGACTAGGTTA
GTTTACTCAGCTTTAATTAGATAGTTGAGTCATATATTTTCAACATTTTTCTGTATCGTATTTATTATGCACAAAAATAAAGTGTGATCT
CTAATAGCATGGCTAAAGGTAATGCCAATATTAGTGAATAGCTTTGCTGTGGGCTTTATCAGGCCCTTGTTTTTCACAGTGTTGTTTGTA
CTCCATGGTGTATTGCTATTAGAGCTGTGAAATGAAAGCTGTGACTTTATGAAGAGATCAAAAAAAGTGTGGTCCAGATTCAGGTAGGTT
GTGGTAAGTTCAGGGAAGTCTTGTATGATTTCTAAGAATATTTCAGTTACCATGTATAGTATTTGGGAAAAGCTGTAATGTAAAATATTG
GACTTTGTTGCAAGATAGGTATATACTTGTGTCAGATAATTAAAGCCTTAAATTTTGAAATGAATCTTTGAGATTTCAAAGAAAGACTGA
CCTTTCAAATAGAAGGCGGTCTTTCTATTCTAGCTAATGCCCCACTTCTTTAAGTTATAAACAAAGTTTCATGATACCATTCTGCTATCA
TCTAAACTTTGCTGAACTCTACTGCCAACCTACATTAAAAACAAAGTCCCACGAAATGGCTGTGTTACACCATAGTGAGCAGAAACTTAA

>4109_4109_1_ALPK1-NAA15_ALPK1_chr4_113362261_ENST00000177648_NAA15_chr4_140291365_ENST00000296543_length(amino acids)=1549AA_BP=1268
MKSNEPKLWVIDHPRPRDTQFIVIIMNNQKVVAVLLQECKQVLDQLLLEAPDVSEEDKSEDQRCRALLPSELRTLIQEAKEMKWPFVPEK
WQYKQAVGPEDKTNLKDVIGAGLQQLLASLRASILARDCAAAAAIVFLVDRFLYGLDVSGKLLQVAKGLHKLQPATPIAPQVVIRQARIS
VNSGKLLKAEYILSSLISNNGATGTWLYRNESDKVLVQSVCIQIRGQILQKLGMWYEAAELIWASIVGYLALPQPDKKGLSTSLGILADI
FVSMSKNDYEKFKNNPQINLSLLKEFDHHLLSAAEACKLAAAFSAYTPLFVLTAVNIRGTCLLSYSSSNDCPPELKNLHLCEAKEAFEIG
LLTKRDDEPVTGKQELHSFVKAAFGLTTVHRRLHGETGTVHAASQLCKEAMGKLYNFSTSSRSQDREALSQEVMSVIAQVKEHLQVQSFS
NVDDRSYVPESFECRLDKLILHGQGDFQKILDTYSQHHTSVCEVFESDCGNNKNEQKDAKTGVCITALKTEIKNIDTVSTTQEKPHCQRD
TGISSSLMGKNVQRELRRGGRRNWTHSDAFRVSLDQDVETETEPSDYSNGEGAVFNKSLSGSQTSSAWSNLSGFSSSASWEEVNYHVDDR
SARKEPGKEHLVDTQCSTALSEELENDREGRAMHSLHSQLHDLSLQEPNNDNLEPSQNQPQQQMPLTPFSPHNTPGIFLAPGAGLLEGAP
EGIQEVRNMGPRNTSAHSRPSYRSASWSSDSGRPKNMGTHPSVQKEEAFEIIVEFPETNCDVKDRQGKEQGEEISERGAGPTFKASPSWV
DPEGETAESTEDAPLDFHRVLHNSLGNISMLPCSSFTPNWPVQNPDSRKSGGPVAEQGIDPDASTVDEEGQLLDSMDVPCTNGHGSHRLC
ILRQPPGQRAETPNSSVSGNILFPVLSEDCTTTEEGNQPGNMLNCSQNSSSSSVWWLKSPAFSSGSSEGDSPWSYLNSSGSSWVSLPGKM
RKEILEARTLQPDDFEKLLAGVRHDWLFQRLENTGVFKPSQLHRAHSALLLKYSKKSELWTAQETIVYLGDYLTVKKKGRQRNAFWVHHL
HQEEILGRYVGKDYKEQKGLWHHFTDVERQMTAQHYVTEFNKRLYEQNIPTQIFYIPSTILLILEDKTIKGCISVEPYILGEFVKLSNNT
KVVKTEYKATEYGLAYGHFSYEFSNHRDVVVDLQGWVTGNGKGLIYLTDPQIHSVDQKVFTTNFGKRGIFYFFNNQHVECNEICHRLSLT
RPSMEKPSNMSDKELKKLRNKQRRAQKKAQIEEEKKNAEKEKQQRNQKKKKDDDDEEIGGPKEELIPEKLAKVETPLEEAIKFLTPLKNL
VKNKIETHLFAFEIYFRKEKFLLMLQSVKRAFAIDSSHPWLHECMIRLFNTAVCESKDLSDTVRTVLKQEMNRLFGATNPKNFNETFLKR
NSDSLPHRLSAAKMVYYLDPSSQKRAIELATTLDESLTNRNLQTCMEVLEALYDGSLGDCKEAAEIYRANCHKLFPYALAFMPPGYEEDM

--------------------------------------------------------------
>4109_4109_2_ALPK1-NAA15_ALPK1_chr4_113362261_ENST00000177648_NAA15_chr4_140291365_ENST00000398947_length(transcript)=8056nt_BP=3927nt
TTTATGAGAAACAGTGTGTTTCAGAGAGGCTGTACCAGAATTAACTCTGCTCAGAGTTAGATTTGCTGGTCTTAAAGTACTTTTCCTCTT
TAAGATAAAAGAAGTTCTTCTAAATCAGGAATGGATTGAAATCTAATGAACCGAAACTTTGGGTAATTGATCACCCTAGACCCAGGGACA
CCCAATTCATCGTAATCATCATGAATAATCAAAAAGTGGTAGCTGTGCTACTGCAAGAGTGCAAGCAAGTGCTGGATCAGCTCTTGTTGG
AAGCGCCAGATGTGTCGGAAGAGGACAAGAGCGAGGACCAGCGCTGCAGAGCTTTACTCCCCAGCGAGTTAAGGACCCTGATCCAGGAGG
CAAAGGAAATGAAGTGGCCCTTCGTGCCTGAAAAGTGGCAGTACAAACAAGCCGTGGGCCCAGAGGACAAAACAAACCTGAAGGATGTGA
TTGGCGCCGGGTTGCAGCAGTTACTGGCGTCCCTGAGGGCCTCCATCCTCGCTCGGGACTGTGCGGCTGCGGCGGCTATTGTGTTCTTGG
TGGACCGGTTCCTGTATGGGCTCGACGTCTCTGGAAAACTTCTGCAGGTCGCCAAAGGTCTCCACAAGTTGCAGCCAGCCACGCCAATTG
CCCCGCAGGTGGTTATTCGCCAAGCCCGAATCTCCGTGAACTCAGGAAAACTTTTAAAAGCAGAGTATATTCTGAGCAGTCTAATAAGCA
ACAATGGAGCAACGGGTACCTGGCTGTACAGAAATGAAAGTGACAAGGTCCTGGTGCAGTCGGTCTGTATACAGATCAGAGGGCAGATTC
TGCAAAAGCTGGGGATGTGGTACGAAGCAGCAGAGTTAATATGGGCCTCCATTGTAGGATATTTGGCACTTCCTCAGCCGGATAAAAAGG
GCCTCTCCACGTCGCTAGGTATACTGGCAGACATCTTTGTTTCCATGAGCAAGAACGATTATGAAAAGTTTAAAAACAATCCACAAATTA
ATTTGAGCCTGCTGAAGGAGTTTGACCACCATTTGCTGTCCGCTGCAGAAGCCTGCAAGCTGGCAGCTGCCTTCAGTGCCTATACGCCGC
TCTTCGTGCTCACAGCTGTGAATATCCGTGGCACGTGTTTATTGTCCTACAGTAGTTCAAATGACTGTCCTCCAGAATTGAAAAACTTAC
ATCTGTGTGAAGCCAAAGAGGCCTTTGAGATTGGCCTCCTCACCAAGAGAGATGATGAGCCTGTTACTGGAAAACAGGAGCTTCACAGCT
TTGTCAAAGCTGCTTTCGGTCTCACCACAGTGCACAGAAGGCTCCATGGGGAGACAGGGACGGTCCATGCAGCAAGTCAGCTCTGTAAGG
AAGCAATGGGGAAGCTGTACAATTTCAGCACTTCCTCCAGAAGTCAGGACAGAGAAGCTCTGTCTCAAGAAGTTATGTCTGTGATTGCCC
AGGTGAAGGAACATTTACAAGTTCAAAGCTTCTCAAATGTAGATGACAGATCTTATGTTCCCGAGAGTTTCGAGTGCAGGTTGGATAAAC
TTATCTTGCATGGGCAAGGGGATTTCCAAAAAATCCTTGACACCTATTCACAGCACCATACTTCGGTGTGTGAAGTATTTGAAAGTGATT
GTGGAAACAACAAAAATGAACAGAAAGATGCAAAAACAGGAGTCTGCATCACTGCTCTAAAAACAGAAATAAAAAACATAGATACTGTGA
GTACTACTCAAGAAAAGCCACATTGTCAAAGAGACACAGGAATATCTTCCTCCCTAATGGGTAAGAATGTTCAGAGGGAACTCAGAAGGG
GAGGAAGGAGAAACTGGACCCATTCTGATGCATTTCGAGTCTCCTTGGATCAAGATGTGGAGACTGAGACTGAGCCATCGGACTACAGCA
ATGGTGAGGGAGCTGTTTTCAACAAGTCTCTGAGTGGCAGCCAGACTTCCAGTGCTTGGAGCAACTTATCAGGGTTTAGTTCCTCTGCAA
GCTGGGAGGAAGTGAATTATCACGTTGACGACAGGTCAGCCAGAAAAGAGCCTGGCAAAGAACATCTGGTGGACACTCAGTGTTCCACTG
CCTTGTCTGAGGAGCTAGAGAATGACAGGGAAGGCAGAGCTATGCATTCATTGCATTCACAGCTTCATGATCTCTCTCTTCAGGAACCCA
ACAATGACAATTTGGAGCCTTCTCAAAATCAGCCACAGCAACAGATGCCCTTGACACCCTTCTCGCCTCATAATACCCCAGGCATTTTCT
TGGCCCCTGGTGCAGGGCTTCTAGAAGGAGCTCCAGAAGGTATCCAGGAAGTCAGAAATATGGGACCCAGAAATACTTCTGCTCACTCCA
GACCCTCATATCGTTCTGCTTCTTGGTCTTCTGATTCTGGTAGGCCCAAGAATATGGGCACACATCCTTCAGTCCAAAAAGAAGAAGCCT
TTGAAATAATTGTTGAGTTTCCAGAAACCAACTGCGATGTCAAAGACAGGCAGGGGAAAGAGCAGGGAGAAGAAATTAGTGAAAGAGGCG
CAGGCCCTACATTTAAAGCTAGTCCCTCCTGGGTTGACCCAGAAGGAGAAACAGCAGAAAGCACTGAAGATGCACCCTTAGACTTTCACA
GGGTCCTGCACAATTCTCTGGGAAACATTTCCATGCTGCCATGTAGCTCCTTCACCCCTAATTGGCCTGTTCAAAATCCTGACTCCAGAA
AAAGTGGTGGCCCAGTCGCAGAGCAGGGCATCGACCCTGATGCCTCCACAGTGGATGAGGAGGGGCAACTGCTCGACAGCATGGATGTTC
CCTGCACAAATGGGCACGGCTCTCATAGACTGTGCATTCTGAGACAGCCGCCTGGTCAGAGGGCGGAGACCCCCAATTCCTCTGTAAGCG
GTAACATCCTCTTCCCTGTCCTCAGCGAGGACTGCACTACCACAGAGGAAGGAAATCAGCCTGGAAACATGCTAAACTGCAGCCAGAACT
CCAGCTCATCCTCAGTGTGGTGGCTGAAATCACCTGCATTTTCCAGTGGTTCTTCTGAGGGGGACAGCCCTTGGTCCTATCTGAATTCCA
GTGGGAGTTCTTGGGTTTCATTGCCGGGAAAGATGAGGAAAGAGATCCTTGAGGCTCGCACCTTGCAACCTGATGACTTTGAAAAGCTGT
TGGCAGGAGTGAGGCATGATTGGCTGTTTCAGAGACTAGAGAATACGGGGGTTTTTAAGCCCAGTCAACTCCACCGAGCACATAGTGCTC
TTTTGTTAAAATATTCAAAAAAATCTGAACTGTGGACGGCCCAGGAAACTATTGTCTATTTGGGGGACTACTTGACTGTGAAGAAAAAAG
GCAGACAAAGAAATGCTTTTTGGGTTCATCATCTTCATCAAGAAGAAATTCTGGGGAGGTATGTTGGGAAAGACTATAAGGAGCAGAAGG
GGCTCTGGCACCACTTCACTGATGTGGAGCGACAGATGACCGCACAGCACTATGTGACAGAATTTAACAAGAGACTCTATGAACAAAACA
TTCCCACCCAGATATTCTACATCCCATCCACAATACTACTGATTTTAGAGGACAAGACAATAAAGGGATGTATCAGTGTGGAGCCTTACA
TACTGGGAGAATTTGTAAAATTGTCAAATAACACGAAAGTGGTGAAAACAGAATACAAAGCCACAGAATATGGCTTGGCCTATGGCCATT
TTTCTTATGAGTTTTCTAATCATAGAGATGTTGTGGTCGATTTACAAGGTTGGGTAACCGGTAATGGAAAAGGACTCATCTACCTCACAG
ATCCCCAGATTCACTCCGTTGATCAGAAAGTTTTCACTACCAATTTTGGAAAGAGAGGAATTTTTTACTTCTTTAATAACCAGCATGTGG
AATGTAATGAAATCTGCCATCGTCTTTCTTTGACTAGACCTTCAATGGAGAAACCATCAAACATGTCTGACAAAGAGCTAAAGAAGCTAC
GTAATAAACAAAGAAGAGCTCAAAAGAAAGCCCAGATAGAAGAAGAGAAAAAAAATGCAGAAAAAGAAAAGCAGCAGAGAAATCAGAAAA
AGAAGAAGGATGATGATGATGAGGAGATAGGAGGTCCAAAAGAAGAACTTATTCCAGAGAAACTGGCCAAGGTTGAAACTCCATTGGAAG
AAGCTATTAAATTTTTAACACCGTTGAAGAACTTGGTGAAGAACAAGATAGAGACTCATCTTTTTGCCTTTGAGATTTACTTTAGGAAAG
AAAAGTTTCTTTTGATGCTACAATCAGTAAAGAGGGCATTTGCTATTGATTCTAGTCATCCCTGGCTTCATGAGTGTATGATTCGTCTCT
TTAATACTGTGTGTGAAAGTAAAGATTTATCTGATACAGTTAGAACAGTATTAAAACAAGAAATGAATCGTCTTTTTGGAGCAACGAATC
CAAAGAATTTTAATGAAACTTTTCTGAAAAGGAATTCTGATTCATTGCCACACAGATTATCAGCTGCCAAAATGGTATATTACTTAGATC
CTTCTAGTCAGAAGCGAGCTATAGAGTTGGCAACAACACTTGATGAATCTCTCACTAACAGAAACCTCCAGACATGTATGGAGGTATTGG
AAGCCTTGTATGATGGTAGCCTAGGAGACTGTAAAGAAGCTGCTGAAATTTATAGAGCAAATTGTCATAAGCTTTTCCCTTATGCTTTGG
CTTTCATGCCTCCTGGATATGAAGAGGATATGAAGATCACAGTTAATGGAGATAGTTCTGCAGAAGCTGAAGAACTGGCCAATGAAATTT
GAACATCACTAAACAAGCAAATGGAATGACTTTGGACCATATCTAGTATATAATATTTTTGTCACGCACCTGCTGCATTGCTCTAACTTA
CACAGAATGAGAGGAGTAAATGTTCTTGCCTTCAAATAGTGTTTTACGTTTTTTATCCTGCTGAAAAAGTATATATAAAATATCTAACAT
TACAGGATAGAGGTTCAGTTTCTTAAAAAATTAAAGCTGCTAAAATTGAGTGGTTAAAAAAGATACCTTATCCTATTCCTCCCCACCCAC
CCATGTTTTTAAACTAATTTATATAAAATCTGGAGGCTGTTACAGCTAACAAAGCAGGTGTGTGGCAGAAATATTACTTTAAATTTGTCT
TGTGAGATTTTACTATATCTCAGACAGCATAAATGCTGTTTTAGCACTGGATTCTTTCACTGAGCACAAAGAGTTGTTGGGGCTTTAGCA
TCTGACTGATTTTGTTACGGGGTTGATTCTGACCATAGGAAGTATGCAATGTGAATCACTATTTACAGAGAAACCTACAACAGATGCTTG
ATGTTGTAGAAACTGGGACATATAGATACCAAGCAAAATTATAAGAAACCTATAAGGTGTTCAATACGCTTGTGTTTCCAAAATTCACTG
TACATGATCAGTTTGGTGTTCTTGTACCACAGTTTTTAACTGAAGGAACCAGTTGTAACAGTCTCAATTTTAACTAAAACTTGAAGAACT
AAAACAACAATGCAAACCTTTCAGCATTGTTTGGCCAAACTTGTTAAAACTGTAATGCAAGAACCAAATGCACTGTGATGTGGCACCAAC
TAATTAGCAAGCATGAATTTTTCACCCAAGAGTGAAAAAAGGAAAATCTACCATGGCTTGAAGTTAAAGAGCAGAACTCCTGACTACCAT
TCTATGACTGATCAAAAGACTAATAGTTAAAAACCTCAGCAGGCCTTGTTCACGATATGCAGAAAAAAAAGTGCTGCAGTTTAGATACCT
CTGGAATTTTTCCACAGTGTCACAGGTTTGTAATACTTGAAGCCCTACATTTCTAAGAATATATTTCTTGCTCAGTTGTTTCAGGCAAGC
CCAAGACTTTGTAATTTTTAAAGGGCCCAAGATTTTTTTTTTTTTTTTTTTTTTCAAATAACAGACCAGCTTCTTTTTCTTGCAGTTACA
GATGTAATTTCCTTTTTGTTGTCAAACATAAGGTACCAAATATGATGCAATAAATTGTTTTGAAAAACAGTTGTGTGAATATTTCAACTA
ATCTGTGTTGGGCTTCTGTGAAATACACAGGTGGAAACAGAGGTGCAAGCCAGAGGCAATGTAATATGCTGTAAGGCTAGTGCAGATGGG
AGCTTTTTAGAAGGGGCTAAGTGCTGGTGTCAGGGAAATTCCATAATGAAGTAGAATGCTGCTCCTGCATTAAGATTTCATTGAGGGCAA
GGCTGGTGGCAGGTACTATGAATGTAATTCATAATTTAAAAGGAAAACTAAAAACTATTTTGATTTGGGAAAATGAGCCTTAATTTGTTA
AACCTATACACTGAGAACTAGCCTCAGGCTTAATATTCTCATTGCATTTGCAAGATCTGAGCAAATAAGATTAAGTAAAACAAATCAATT
GTATATATAATTGACCTTTTTGTGGAACATGTAGTTTATAGAAAGTATACTCTAAAGGGAATTTGCCGAAGACCTTTTACTGATTGAACA
GTTGTGCTACAATCAACTTTTCATAGTACATGACCTGCATTCCACATCTCAGTCTAACAGTTTAGTAGTGATGTAAAGAGAAGTACAAAC
CGAACTCCAGTGCTTTGTTATGTTTTATTAACTGGCCCTGTCTCAGGAACATCTTAACAGATGGCAAAAAAACAAAAACTTTTTTTCAAC
TCCTATGAGTGGCAACTGAAGTTCTTATTGTTGGGAAAGAACACTAGTCCTACCTCTGCCACTAATGAGGTGTTTGGAGGAGGTACCAGC
CATATAATAGGGGGTGTATGTGTGAATTTTGTTTAAACTCTACTGTATATTGAAATGAAATTCATTTATTTGTCTTGACAATGTTCAAAT
GATGTAGATTGTCTTAGAATGAATATTCATAAGTACTCAGAACTCTTAAGATGCAGATGCCACCCGTGAGGAGCTAAATTCCTAATGTGT
ATTGTATTCCAACCCAATTTTACTGGAACTATTGAATAAATCTTTTATTTTCTTTCAGGTTTACTTGATAGTGGATACATTGGTGTCAGA
GGACCTTGACTGGGTTCATTTTATGTCCAGACATCACCCCTGAAACACTGTACTGTACTATCTTGCTCTGAGTAGTATCAACTGGATCGT
CTCATATTTGCCTCATTCATCCTATTAATTTCAAATGATACTGTGGGGGAAAAACAGGGTTAGAAAAACAAGTGGAAAAAATGGAGTGAT
GTTGTAATCTAAACAAGTGCCTTATGTTTATTGCTAAGAACTGGTGTTACCAACCCTTTTGAGAAGAAAGGGTCTCTTGACCTGTATTAA
CATAGGAAAGTAAAGTTCTTTTTGTTTTTCTTTATATTCAGCTACTCTTGTCATTTCTCGTTGAAAAACTAAAATCTGACTAGGTTAGTT
TACTCAGCTTTAATTAGATAGTTGAGTCATATATTTTCAACATTTTTCTGTATCGTATTTATTATGCACAAAAATAAAGTGTGATCTCTA
ATAGCATGGCTAAAGGTAATGCCAATATTAGTGAATAGCTTTGCTGTGGGCTTTATCAGGCCCTTGTTTTTCACAGTGTTGTTTGTACTC
CATGGTGTATTGCTATTAGAGCTGTGAAATGAAAGCTGTGACTTTATGAAGAGATCAAAAAAAGTGTGGTCCAGATTCAGGTAGGTTGTG
GTAAGTTCAGGGAAGTCTTGTATGATTTCTAAGAATATTTCAGTTACCATGTATAGTATTTGGGAAAAGCTGTAATGTAAAATATTGGAC
TTTGTTGCAAGATAGGTATATACTTGTGTCAGATAATTAAAGCCTTAAATTTTGAAATGAATCTTTGAGATTTCAAAGAAAGACTGACCT
TTCAAATAGAAGGCGGTCTTTCTATTCTAGCTAATGCCCCACTTCTTTAAGTTATAAACAAAGTTTCATGATACCATTCTGCTATCATCT
AAACTTTGCTGAACTCTACTGCCAACCTACATTAAAAACAAAGTCCCACGAAATGGCTGTGTTACACCATAGTGAGCAGAAACTTAAATT

>4109_4109_2_ALPK1-NAA15_ALPK1_chr4_113362261_ENST00000177648_NAA15_chr4_140291365_ENST00000398947_length(amino acids)=1548AA_BP=1268
MKSNEPKLWVIDHPRPRDTQFIVIIMNNQKVVAVLLQECKQVLDQLLLEAPDVSEEDKSEDQRCRALLPSELRTLIQEAKEMKWPFVPEK
WQYKQAVGPEDKTNLKDVIGAGLQQLLASLRASILARDCAAAAAIVFLVDRFLYGLDVSGKLLQVAKGLHKLQPATPIAPQVVIRQARIS
VNSGKLLKAEYILSSLISNNGATGTWLYRNESDKVLVQSVCIQIRGQILQKLGMWYEAAELIWASIVGYLALPQPDKKGLSTSLGILADI
FVSMSKNDYEKFKNNPQINLSLLKEFDHHLLSAAEACKLAAAFSAYTPLFVLTAVNIRGTCLLSYSSSNDCPPELKNLHLCEAKEAFEIG
LLTKRDDEPVTGKQELHSFVKAAFGLTTVHRRLHGETGTVHAASQLCKEAMGKLYNFSTSSRSQDREALSQEVMSVIAQVKEHLQVQSFS
NVDDRSYVPESFECRLDKLILHGQGDFQKILDTYSQHHTSVCEVFESDCGNNKNEQKDAKTGVCITALKTEIKNIDTVSTTQEKPHCQRD
TGISSSLMGKNVQRELRRGGRRNWTHSDAFRVSLDQDVETETEPSDYSNGEGAVFNKSLSGSQTSSAWSNLSGFSSSASWEEVNYHVDDR
SARKEPGKEHLVDTQCSTALSEELENDREGRAMHSLHSQLHDLSLQEPNNDNLEPSQNQPQQQMPLTPFSPHNTPGIFLAPGAGLLEGAP
EGIQEVRNMGPRNTSAHSRPSYRSASWSSDSGRPKNMGTHPSVQKEEAFEIIVEFPETNCDVKDRQGKEQGEEISERGAGPTFKASPSWV
DPEGETAESTEDAPLDFHRVLHNSLGNISMLPCSSFTPNWPVQNPDSRKSGGPVAEQGIDPDASTVDEEGQLLDSMDVPCTNGHGSHRLC
ILRQPPGQRAETPNSSVSGNILFPVLSEDCTTTEEGNQPGNMLNCSQNSSSSSVWWLKSPAFSSGSSEGDSPWSYLNSSGSSWVSLPGKM
RKEILEARTLQPDDFEKLLAGVRHDWLFQRLENTGVFKPSQLHRAHSALLLKYSKKSELWTAQETIVYLGDYLTVKKKGRQRNAFWVHHL
HQEEILGRYVGKDYKEQKGLWHHFTDVERQMTAQHYVTEFNKRLYEQNIPTQIFYIPSTILLILEDKTIKGCISVEPYILGEFVKLSNNT
KVVKTEYKATEYGLAYGHFSYEFSNHRDVVVDLQGWVTGNGKGLIYLTDPQIHSVDQKVFTTNFGKRGIFYFFNNQHVECNEICHRLSLT
RPSMEKPSNMSDKELKKLRNKQRRAQKKAQIEEEKKNAEKEKQQRNQKKKKDDDDEEIGGPKEELIPEKLAKVETPLEEAIKFLTPLKNL
VKNKIETHLFAFEIYFRKEKFLLMLQSVKRAFAIDSSHPWLHECMIRLFNTVCESKDLSDTVRTVLKQEMNRLFGATNPKNFNETFLKRN
SDSLPHRLSAAKMVYYLDPSSQKRAIELATTLDESLTNRNLQTCMEVLEALYDGSLGDCKEAAEIYRANCHKLFPYALAFMPPGYEEDMK

--------------------------------------------------------------
>4109_4109_3_ALPK1-NAA15_ALPK1_chr4_113362261_ENST00000458497_NAA15_chr4_140291365_ENST00000296543_length(transcript)=8152nt_BP=4006nt
AATTCCTACTTCCTGAAACTGAAGCCGTTTATGAGAAACAGTGTGTTTCAGAGAGGCTGTACCAGAATTAACTCTGCTCAGAGTTAGATT
TGCTGGTCTTAAAGTACTTTTCCTCTTTAAGATAAAAGAAGTTCTTCTAAATCAGGAATGGATTGAAATCTAATGAACCGAAACTTTGGG
TACTTCGGCCTTCAAGGGGCTCCTTTATTGAGAATCAATGTCTTCTCCTAGGTAATTGATCACCCTAGACCCAGGGACACCCAATTCATC
GTAATCATCATGAATAATCAAAAAGTGGTAGCTGTGCTACTGCAAGAGTGCAAGCAAGTGCTGGATCAGCTCTTGTTGGAAGCGCCAGAT
GTGTCGGAAGAGGACAAGAGCGAGGACCAGCGCTGCAGAGCTTTACTCCCCAGCGAGTTAAGGACCCTGATCCAGGAGGCAAAGGAAATG
AAGTGGCCCTTCGTGCCTGAAAAGTGGCAGTACAAACAAGCCGTGGGCCCAGAGGACAAAACAAACCTGAAGGATGTGATTGGCGCCGGG
TTGCAGCAGTTACTGGCGTCCCTGAGGGCCTCCATCCTCGCTCGGGACTGTGCGGCTGCGGCGGCTATTGTGTTCTTGGTGGACCGGTTC
CTGTATGGGCTCGACGTCTCTGGAAAACTTCTGCAGGTCGCCAAAGGTCTCCACAAGTTGCAGCCAGCCACGCCAATTGCCCCGCAGGTG
GTTATTCGCCAAGCCCGAATCTCCGTGAACTCAGGAAAACTTTTAAAAGCAGAGTATATTCTGAGCAGTCTAATAAGCAACAATGGAGCA
ACGGGTACCTGGCTGTACAGAAATGAAAGTGACAAGGTCCTGGTGCAGTCGGTCTGTATACAGATCAGAGGGCAGATTCTGCAAAAGCTG
GGGATGTGGTACGAAGCAGCAGAGTTAATATGGGCCTCCATTGTAGGATATTTGGCACTTCCTCAGCCGGATAAAAAGGGCCTCTCCACG
TCGCTAGGTATACTGGCAGACATCTTTGTTTCCATGAGCAAGAACGATTATGAAAAGTTTAAAAACAATCCACAAATTAATTTGAGCCTG
CTGAAGGAGTTTGACCACCATTTGCTGTCCGCTGCAGAAGCCTGCAAGCTGGCAGCTGCCTTCAGTGCCTATACGCCGCTCTTCGTGCTC
ACAGCTGTGAATATCCGTGGCACGTGTTTATTGTCCTACAGTAGTTCAAATGACTGTCCTCCAGAATTGAAAAACTTACATCTGTGTGAA
GCCAAAGAGGCCTTTGAGATTGGCCTCCTCACCAAGAGAGATGATGAGCCTGTTACTGGAAAACAGGAGCTTCACAGCTTTGTCAAAGCT
GCTTTCGGTCTCACCACAGTGCACAGAAGGCTCCATGGGGAGACAGGGACGGTCCATGCAGCAAGTCAGCTCTGTAAGGAAGCAATGGGG
AAGCTGTACAATTTCAGCACTTCCTCCAGAAGTCAGGACAGAGAAGCTCTGTCTCAAGAAGTTATGTCTGTGATTGCCCAGGTGAAGGAA
CATTTACAAGTTCAAAGCTTCTCAAATGTAGATGACAGATCTTATGTTCCCGAGAGTTTCGAGTGCAGGTTGGATAAACTTATCTTGCAT
GGGCAAGGGGATTTCCAAAAAATCCTTGACACCTATTCACAGCACCATACTTCGGTGTGTGAAGTATTTGAAAGTGATTGTGGAAACAAC
AAAAATGAACAGAAAGATGCAAAAACAGGAGTCTGCATCACTGCTCTAAAAACAGAAATAAAAAACATAGATACTGTGAGTACTACTCAA
GAAAAGCCACATTGTCAAAGAGACACAGGAATATCTTCCTCCCTAATGGGTAAGAATGTTCAGAGGGAACTCAGAAGGGGAGGAAGGAGA
AACTGGACCCATTCTGATGCATTTCGAGTCTCCTTGGATCAAGATGTGGAGACTGAGACTGAGCCATCGGACTACAGCAATGGTGAGGGA
GCTGTTTTCAACAAGTCTCTGAGTGGCAGCCAGACTTCCAGTGCTTGGAGCAACTTATCAGGGTTTAGTTCCTCTGCAAGCTGGGAGGAA
GTGAATTATCACGTTGACGACAGGTCAGCCAGAAAAGAGCCTGGCAAAGAACATCTGGTGGACACTCAGTGTTCCACTGCCTTGTCTGAG
GAGCTAGAGAATGACAGGGAAGGCAGAGCTATGCATTCATTGCATTCACAGCTTCATGATCTCTCTCTTCAGGAACCCAACAATGACAAT
TTGGAGCCTTCTCAAAATCAGCCACAGCAACAGATGCCCTTGACACCCTTCTCGCCTCATAATACCCCAGGCATTTTCTTGGCCCCTGGT
GCAGGGCTTCTAGAAGGAGCTCCAGAAGGTATCCAGGAAGTCAGAAATATGGGACCCAGAAATACTTCTGCTCACTCCAGACCCTCATAT
CGTTCTGCTTCTTGGTCTTCTGATTCTGGTAGGCCCAAGAATATGGGCACACATCCTTCAGTCCAAAAAGAAGAAGCCTTTGAAATAATT
GTTGAGTTTCCAGAAACCAACTGCGATGTCAAAGACAGGCAGGGGAAAGAGCAGGGAGAAGAAATTAGTGAAAGAGGCGCAGGCCCTACA
TTTAAAGCTAGTCCCTCCTGGGTTGACCCAGAAGGAGAAACAGCAGAAAGCACTGAAGATGCACCCTTAGACTTTCACAGGGTCCTGCAC
AATTCTCTGGGAAACATTTCCATGCTGCCATGTAGCTCCTTCACCCCTAATTGGCCTGTTCAAAATCCTGACTCCAGAAAAAGTGGTGGC
CCAGTCGCAGAGCAGGGCATCGACCCTGATGCCTCCACAGTGGATGAGGAGGGGCAACTGCTCGACAGCATGGATGTTCCCTGCACAAAT
GGGCACGGCTCTCATAGACTGTGCATTCTGAGACAGCCGCCTGGTCAGAGGGCGGAGACCCCCAATTCCTCTGTAAGCGGTAACATCCTC
TTCCCTGTCCTCAGCGAGGACTGCACTACCACAGAGGAAGGAAATCAGCCTGGAAACATGCTAAACTGCAGCCAGAACTCCAGCTCATCC
TCAGTGTGGTGGCTGAAATCACCTGCATTTTCCAGTGGTTCTTCTGAGGGGGACAGCCCTTGGTCCTATCTGAATTCCAGTGGGAGTTCT
TGGGTTTCATTGCCGGGAAAGATGAGGAAAGAGATCCTTGAGGCTCGCACCTTGCAACCTGATGACTTTGAAAAGCTGTTGGCAGGAGTG
AGGCATGATTGGCTGTTTCAGAGACTAGAGAATACGGGGGTTTTTAAGCCCAGTCAACTCCACCGAGCACATAGTGCTCTTTTGTTAAAA
TATTCAAAAAAATCTGAACTGTGGACGGCCCAGGAAACTATTGTCTATTTGGGGGACTACTTGACTGTGAAGAAAAAAGGCAGACAAAGA
AATGCTTTTTGGGTTCATCATCTTCATCAAGAAGAAATTCTGGGGAGGTATGTTGGGAAAGACTATAAGGAGCAGAAGGGGCTCTGGCAC
CACTTCACTGATGTGGAGCGACAGATGACCGCACAGCACTATGTGACAGAATTTAACAAGAGACTCTATGAACAAAACATTCCCACCCAG
ATATTCTACATCCCATCCACAATACTACTGATTTTAGAGGACAAGACAATAAAGGGATGTATCAGTGTGGAGCCTTACATACTGGGAGAA
TTTGTAAAATTGTCAAATAACACGAAAGTGGTGAAAACAGAATACAAAGCCACAGAATATGGCTTGGCCTATGGCCATTTTTCTTATGAG
TTTTCTAATCATAGAGATGTTGTGGTCGATTTACAAGGTTGGGTAACCGGTAATGGAAAAGGACTCATCTACCTCACAGATCCCCAGATT
CACTCCGTTGATCAGAAAGTTTTCACTACCAATTTTGGAAAGAGAGGAATTTTTTACTTCTTTAATAACCAGCATGTGGAATGTAATGAA
ATCTGCCATCGTCTTTCTTTGACTAGACCTTCAATGGAGAAACCATCAAACATGTCTGACAAAGAGCTAAAGAAGCTACGTAATAAACAA
AGAAGAGCTCAAAAGAAAGCCCAGATAGAAGAAGAGAAAAAAAATGCAGAAAAAGAAAAGCAGCAGAGAAATCAGAAAAAGAAGAAGGAT
GATGATGATGAGGAGATAGGAGGTCCAAAAGAAGAACTTATTCCAGAGAAACTGGCCAAGGTTGAAACTCCATTGGAAGAAGCTATTAAA
TTTTTAACACCGTTGAAGAACTTGGTGAAGAACAAGATAGAGACTCATCTTTTTGCCTTTGAGATTTACTTTAGGAAAGAAAAGTTTCTT
TTGATGCTACAATCAGTAAAGAGGGCATTTGCTATTGATTCTAGTCATCCCTGGCTTCATGAGTGTATGATTCGTCTCTTTAATACTGCA
GTGTGTGAAAGTAAAGATTTATCTGATACAGTTAGAACAGTATTAAAACAAGAAATGAATCGTCTTTTTGGAGCAACGAATCCAAAGAAT
TTTAATGAAACTTTTCTGAAAAGGAATTCTGATTCATTGCCACACAGATTATCAGCTGCCAAAATGGTATATTACTTAGATCCTTCTAGT
CAGAAGCGAGCTATAGAGTTGGCAACAACACTTGATGAATCTCTCACTAACAGAAACCTCCAGACATGTATGGAGGTATTGGAAGCCTTG
TATGATGGTAGCCTAGGAGACTGTAAAGAAGCTGCTGAAATTTATAGAGCAAATTGTCATAAGCTTTTCCCTTATGCTTTGGCTTTCATG
CCTCCTGGATATGAAGAGGATATGAAGATCACAGTTAATGGAGATAGTTCTGCAGAAGCTGAAGAACTGGCCAATGAAATTTGAACATCA
CTAAACAAGCAAATGGAATGACTTTGGACCATATCTAGTATATAATATTTTTGTCACGCACCTGCTGCATTGCTCTAACTTACACAGAAT
GAGAGGAGTAAATGTTCTTGCCTTCAAATAGTGTTTTACGTTTTTTATCCTGCTGAAAAAGTATATATAAAATATCTAACATTACAGGAT
AGAGGTTCAGTTTCTTAAAAAATTAAAGCTGCTAAAATTGAGTGGTTAAAAAAGATACCTTATCCTATTCCTCCCCACCCACCCATGTTT
TTAAACTAATTTATATAAAATCTGGAGGCTGTTACAGCTAACAAAGCAGGTGTGTGGCAGAAATATTACTTTAAATTTGTCTTGTGAGAT
TTTACTATATCTCAGACAGCATAAATGCTGTTTTAGCACTGGATTCTTTCACTGAGCACAAAGAGTTGTTGGGGCTTTAGCATCTGACTG
ATTTTGTTACGGGGTTGATTCTGACCATAGGAAGTATGCAATGTGAATCACTATTTACAGAGAAACCTACAACAGATGCTTGATGTTGTA
GAAACTGGGACATATAGATACCAAGCAAAATTATAAGAAACCTATAAGGTGTTCAATACGCTTGTGTTTCCAAAATTCACTGTACATGAT
CAGTTTGGTGTTCTTGTACCACAGTTTTTAACTGAAGGAACCAGTTGTAACAGTCTCAATTTTAACTAAAACTTGAAGAACTAAAACAAC
AATGCAAACCTTTCAGCATTGTTTGGCCAAACTTGTTAAAACTGTAATGCAAGAACCAAATGCACTGTGATGTGGCACCAACTAATTAGC
AAGCATGAATTTTTCACCCAAGAGTGAAAAAAGGAAAATCTACCATGGCTTGAAGTTAAAGAGCAGAACTCCTGACTACCATTCTATGAC
TGATCAAAAGACTAATAGTTAAAAACCTCAGCAGGCCTTGTTCACGATATGCAGAAAAAAAAGTGCTGCAGTTTAGATACCTCTGGAATT
TTTCCACAGTGTCACAGGTTTGTAATACTTGAAGCCCTACATTTCTAAGAATATATTTCTTGCTCAGTTGTTTCAGGCAAGCCCAAGACT
TTGTAATTTTTAAAGGGCCCAAGATTTTTTTTTTTTTTTTTTTTTTCAAATAACAGACCAGCTTCTTTTTCTTGCAGTTACAGATGTAAT
TTCCTTTTTGTTGTCAAACATAAGGTACCAAATATGATGCAATAAATTGTTTTGAAAAACAGTTGTGTGAATATTTCAACTAATCTGTGT
TGGGCTTCTGTGAAATACACAGGTGGAAACAGAGGTGCAAGCCAGAGGCAATGTAATATGCTGTAAGGCTAGTGCAGATGGGAGCTTTTT
AGAAGGGGCTAAGTGCTGGTGTCAGGGAAATTCCATAATGAAGTAGAATGCTGCTCCTGCATTAAGATTTCATTGAGGGCAAGGCTGGTG
GCAGGTACTATGAATGTAATTCATAATTTAAAAGGAAAACTAAAAACTATTTTGATTTGGGAAAATGAGCCTTAATTTGTTAAACCTATA
CACTGAGAACTAGCCTCAGGCTTAATATTCTCATTGCATTTGCAAGATCTGAGCAAATAAGATTAAGTAAAACAAATCAATTGTATATAT
AATTGACCTTTTTGTGGAACATGTAGTTTATAGAAAGTATACTCTAAAGGGAATTTGCCGAAGACCTTTTACTGATTGAACAGTTGTGCT
ACAATCAACTTTTCATAGTACATGACCTGCATTCCACATCTCAGTCTAACAGTTTAGTAGTGATGTAAAGAGAAGTACAAACCGAACTCC
AGTGCTTTGTTATGTTTTATTAACTGGCCCTGTCTCAGGAACATCTTAACAGATGGCAAAAAAACAAAAACTTTTTTTCAACTCCTATGA
GTGGCAACTGAAGTTCTTATTGTTGGGAAAGAACACTAGTCCTACCTCTGCCACTAATGAGGTGTTTGGAGGAGGTACCAGCCATATAAT
AGGGGGTGTATGTGTGAATTTTGTTTAAACTCTACTGTATATTGAAATGAAATTCATTTATTTGTCTTGACAATGTTCAAATGATGTAGA
TTGTCTTAGAATGAATATTCATAAGTACTCAGAACTCTTAAGATGCAGATGCCACCCGTGAGGAGCTAAATTCCTAATGTGTATTGTATT
CCAACCCAATTTTACTGGAACTATTGAATAAATCTTTTATTTTCTTTCAGGTTTACTTGATAGTGGATACATTGGTGTCAGAGGACCTTG
ACTGGGTTCATTTTATGTCCAGACATCACCCCTGAAACACTGTACTGTACTATCTTGCTCTGAGTAGTATCAACTGGATCGTCTCATATT
TGCCTCATTCATCCTATTAATTTCAAATGATACTGTGGGGGAAAAACAGGGTTAGAAAAACAAGTGGAAAAAATGGAGTGATGTTGTAAT
CTAAACAAGTGCCTTATGTTTATTGCTAAGAACTGGTGTTACCAACCCTTTTGAGAAGAAAGGGTCTCTTGACCTGTATTAACATAGGAA
AGTAAAGTTCTTTTTGTTTTTCTTTATATTCAGCTACTCTTGTCATTTCTCGTTGAAAAACTAAAATCTGACTAGGTTAGTTTACTCAGC
TTTAATTAGATAGTTGAGTCATATATTTTCAACATTTTTCTGTATCGTATTTATTATGCACAAAAATAAAGTGTGATCTCTAATAGCATG
GCTAAAGGTAATGCCAATATTAGTGAATAGCTTTGCTGTGGGCTTTATCAGGCCCTTGTTTTTCACAGTGTTGTTTGTACTCCATGGTGT
ATTGCTATTAGAGCTGTGAAATGAAAGCTGTGACTTTATGAAGAGATCAAAAAAAGTGTGGTCCAGATTCAGGTAGGTTGTGGTAAGTTC
AGGGAAGTCTTGTATGATTTCTAAGAATATTTCAGTTACCATGTATAGTATTTGGGAAAAGCTGTAATGTAAAATATTGGACTTTGTTGC
AAGATAGGTATATACTTGTGTCAGATAATTAAAGCCTTAAATTTTGAAATGAATCTTTGAGATTTCAAAGAAAGACTGACCTTTCAAATA
GAAGGCGGTCTTTCTATTCTAGCTAATGCCCCACTTCTTTAAGTTATAAACAAAGTTTCATGATACCATTCTGCTATCATCTAAACTTTG
CTGAACTCTACTGCCAACCTACATTAAAAACAAAGTCCCACGAAATGGCTGTGTTACACCATAGTGAGCAGAAACTTAAATTTTGTTCTA

>4109_4109_3_ALPK1-NAA15_ALPK1_chr4_113362261_ENST00000458497_NAA15_chr4_140291365_ENST00000296543_length(amino acids)=1524AA_BP=1243
MNNQKVVAVLLQECKQVLDQLLLEAPDVSEEDKSEDQRCRALLPSELRTLIQEAKEMKWPFVPEKWQYKQAVGPEDKTNLKDVIGAGLQQ
LLASLRASILARDCAAAAAIVFLVDRFLYGLDVSGKLLQVAKGLHKLQPATPIAPQVVIRQARISVNSGKLLKAEYILSSLISNNGATGT
WLYRNESDKVLVQSVCIQIRGQILQKLGMWYEAAELIWASIVGYLALPQPDKKGLSTSLGILADIFVSMSKNDYEKFKNNPQINLSLLKE
FDHHLLSAAEACKLAAAFSAYTPLFVLTAVNIRGTCLLSYSSSNDCPPELKNLHLCEAKEAFEIGLLTKRDDEPVTGKQELHSFVKAAFG
LTTVHRRLHGETGTVHAASQLCKEAMGKLYNFSTSSRSQDREALSQEVMSVIAQVKEHLQVQSFSNVDDRSYVPESFECRLDKLILHGQG
DFQKILDTYSQHHTSVCEVFESDCGNNKNEQKDAKTGVCITALKTEIKNIDTVSTTQEKPHCQRDTGISSSLMGKNVQRELRRGGRRNWT
HSDAFRVSLDQDVETETEPSDYSNGEGAVFNKSLSGSQTSSAWSNLSGFSSSASWEEVNYHVDDRSARKEPGKEHLVDTQCSTALSEELE
NDREGRAMHSLHSQLHDLSLQEPNNDNLEPSQNQPQQQMPLTPFSPHNTPGIFLAPGAGLLEGAPEGIQEVRNMGPRNTSAHSRPSYRSA
SWSSDSGRPKNMGTHPSVQKEEAFEIIVEFPETNCDVKDRQGKEQGEEISERGAGPTFKASPSWVDPEGETAESTEDAPLDFHRVLHNSL
GNISMLPCSSFTPNWPVQNPDSRKSGGPVAEQGIDPDASTVDEEGQLLDSMDVPCTNGHGSHRLCILRQPPGQRAETPNSSVSGNILFPV
LSEDCTTTEEGNQPGNMLNCSQNSSSSSVWWLKSPAFSSGSSEGDSPWSYLNSSGSSWVSLPGKMRKEILEARTLQPDDFEKLLAGVRHD
WLFQRLENTGVFKPSQLHRAHSALLLKYSKKSELWTAQETIVYLGDYLTVKKKGRQRNAFWVHHLHQEEILGRYVGKDYKEQKGLWHHFT
DVERQMTAQHYVTEFNKRLYEQNIPTQIFYIPSTILLILEDKTIKGCISVEPYILGEFVKLSNNTKVVKTEYKATEYGLAYGHFSYEFSN
HRDVVVDLQGWVTGNGKGLIYLTDPQIHSVDQKVFTTNFGKRGIFYFFNNQHVECNEICHRLSLTRPSMEKPSNMSDKELKKLRNKQRRA
QKKAQIEEEKKNAEKEKQQRNQKKKKDDDDEEIGGPKEELIPEKLAKVETPLEEAIKFLTPLKNLVKNKIETHLFAFEIYFRKEKFLLML
QSVKRAFAIDSSHPWLHECMIRLFNTAVCESKDLSDTVRTVLKQEMNRLFGATNPKNFNETFLKRNSDSLPHRLSAAKMVYYLDPSSQKR

--------------------------------------------------------------
>4109_4109_4_ALPK1-NAA15_ALPK1_chr4_113362261_ENST00000458497_NAA15_chr4_140291365_ENST00000398947_length(transcript)=8135nt_BP=4006nt
AATTCCTACTTCCTGAAACTGAAGCCGTTTATGAGAAACAGTGTGTTTCAGAGAGGCTGTACCAGAATTAACTCTGCTCAGAGTTAGATT
TGCTGGTCTTAAAGTACTTTTCCTCTTTAAGATAAAAGAAGTTCTTCTAAATCAGGAATGGATTGAAATCTAATGAACCGAAACTTTGGG
TACTTCGGCCTTCAAGGGGCTCCTTTATTGAGAATCAATGTCTTCTCCTAGGTAATTGATCACCCTAGACCCAGGGACACCCAATTCATC
GTAATCATCATGAATAATCAAAAAGTGGTAGCTGTGCTACTGCAAGAGTGCAAGCAAGTGCTGGATCAGCTCTTGTTGGAAGCGCCAGAT
GTGTCGGAAGAGGACAAGAGCGAGGACCAGCGCTGCAGAGCTTTACTCCCCAGCGAGTTAAGGACCCTGATCCAGGAGGCAAAGGAAATG
AAGTGGCCCTTCGTGCCTGAAAAGTGGCAGTACAAACAAGCCGTGGGCCCAGAGGACAAAACAAACCTGAAGGATGTGATTGGCGCCGGG
TTGCAGCAGTTACTGGCGTCCCTGAGGGCCTCCATCCTCGCTCGGGACTGTGCGGCTGCGGCGGCTATTGTGTTCTTGGTGGACCGGTTC
CTGTATGGGCTCGACGTCTCTGGAAAACTTCTGCAGGTCGCCAAAGGTCTCCACAAGTTGCAGCCAGCCACGCCAATTGCCCCGCAGGTG
GTTATTCGCCAAGCCCGAATCTCCGTGAACTCAGGAAAACTTTTAAAAGCAGAGTATATTCTGAGCAGTCTAATAAGCAACAATGGAGCA
ACGGGTACCTGGCTGTACAGAAATGAAAGTGACAAGGTCCTGGTGCAGTCGGTCTGTATACAGATCAGAGGGCAGATTCTGCAAAAGCTG
GGGATGTGGTACGAAGCAGCAGAGTTAATATGGGCCTCCATTGTAGGATATTTGGCACTTCCTCAGCCGGATAAAAAGGGCCTCTCCACG
TCGCTAGGTATACTGGCAGACATCTTTGTTTCCATGAGCAAGAACGATTATGAAAAGTTTAAAAACAATCCACAAATTAATTTGAGCCTG
CTGAAGGAGTTTGACCACCATTTGCTGTCCGCTGCAGAAGCCTGCAAGCTGGCAGCTGCCTTCAGTGCCTATACGCCGCTCTTCGTGCTC
ACAGCTGTGAATATCCGTGGCACGTGTTTATTGTCCTACAGTAGTTCAAATGACTGTCCTCCAGAATTGAAAAACTTACATCTGTGTGAA
GCCAAAGAGGCCTTTGAGATTGGCCTCCTCACCAAGAGAGATGATGAGCCTGTTACTGGAAAACAGGAGCTTCACAGCTTTGTCAAAGCT
GCTTTCGGTCTCACCACAGTGCACAGAAGGCTCCATGGGGAGACAGGGACGGTCCATGCAGCAAGTCAGCTCTGTAAGGAAGCAATGGGG
AAGCTGTACAATTTCAGCACTTCCTCCAGAAGTCAGGACAGAGAAGCTCTGTCTCAAGAAGTTATGTCTGTGATTGCCCAGGTGAAGGAA
CATTTACAAGTTCAAAGCTTCTCAAATGTAGATGACAGATCTTATGTTCCCGAGAGTTTCGAGTGCAGGTTGGATAAACTTATCTTGCAT
GGGCAAGGGGATTTCCAAAAAATCCTTGACACCTATTCACAGCACCATACTTCGGTGTGTGAAGTATTTGAAAGTGATTGTGGAAACAAC
AAAAATGAACAGAAAGATGCAAAAACAGGAGTCTGCATCACTGCTCTAAAAACAGAAATAAAAAACATAGATACTGTGAGTACTACTCAA
GAAAAGCCACATTGTCAAAGAGACACAGGAATATCTTCCTCCCTAATGGGTAAGAATGTTCAGAGGGAACTCAGAAGGGGAGGAAGGAGA
AACTGGACCCATTCTGATGCATTTCGAGTCTCCTTGGATCAAGATGTGGAGACTGAGACTGAGCCATCGGACTACAGCAATGGTGAGGGA
GCTGTTTTCAACAAGTCTCTGAGTGGCAGCCAGACTTCCAGTGCTTGGAGCAACTTATCAGGGTTTAGTTCCTCTGCAAGCTGGGAGGAA
GTGAATTATCACGTTGACGACAGGTCAGCCAGAAAAGAGCCTGGCAAAGAACATCTGGTGGACACTCAGTGTTCCACTGCCTTGTCTGAG
GAGCTAGAGAATGACAGGGAAGGCAGAGCTATGCATTCATTGCATTCACAGCTTCATGATCTCTCTCTTCAGGAACCCAACAATGACAAT
TTGGAGCCTTCTCAAAATCAGCCACAGCAACAGATGCCCTTGACACCCTTCTCGCCTCATAATACCCCAGGCATTTTCTTGGCCCCTGGT
GCAGGGCTTCTAGAAGGAGCTCCAGAAGGTATCCAGGAAGTCAGAAATATGGGACCCAGAAATACTTCTGCTCACTCCAGACCCTCATAT
CGTTCTGCTTCTTGGTCTTCTGATTCTGGTAGGCCCAAGAATATGGGCACACATCCTTCAGTCCAAAAAGAAGAAGCCTTTGAAATAATT
GTTGAGTTTCCAGAAACCAACTGCGATGTCAAAGACAGGCAGGGGAAAGAGCAGGGAGAAGAAATTAGTGAAAGAGGCGCAGGCCCTACA
TTTAAAGCTAGTCCCTCCTGGGTTGACCCAGAAGGAGAAACAGCAGAAAGCACTGAAGATGCACCCTTAGACTTTCACAGGGTCCTGCAC
AATTCTCTGGGAAACATTTCCATGCTGCCATGTAGCTCCTTCACCCCTAATTGGCCTGTTCAAAATCCTGACTCCAGAAAAAGTGGTGGC
CCAGTCGCAGAGCAGGGCATCGACCCTGATGCCTCCACAGTGGATGAGGAGGGGCAACTGCTCGACAGCATGGATGTTCCCTGCACAAAT
GGGCACGGCTCTCATAGACTGTGCATTCTGAGACAGCCGCCTGGTCAGAGGGCGGAGACCCCCAATTCCTCTGTAAGCGGTAACATCCTC
TTCCCTGTCCTCAGCGAGGACTGCACTACCACAGAGGAAGGAAATCAGCCTGGAAACATGCTAAACTGCAGCCAGAACTCCAGCTCATCC
TCAGTGTGGTGGCTGAAATCACCTGCATTTTCCAGTGGTTCTTCTGAGGGGGACAGCCCTTGGTCCTATCTGAATTCCAGTGGGAGTTCT
TGGGTTTCATTGCCGGGAAAGATGAGGAAAGAGATCCTTGAGGCTCGCACCTTGCAACCTGATGACTTTGAAAAGCTGTTGGCAGGAGTG
AGGCATGATTGGCTGTTTCAGAGACTAGAGAATACGGGGGTTTTTAAGCCCAGTCAACTCCACCGAGCACATAGTGCTCTTTTGTTAAAA
TATTCAAAAAAATCTGAACTGTGGACGGCCCAGGAAACTATTGTCTATTTGGGGGACTACTTGACTGTGAAGAAAAAAGGCAGACAAAGA
AATGCTTTTTGGGTTCATCATCTTCATCAAGAAGAAATTCTGGGGAGGTATGTTGGGAAAGACTATAAGGAGCAGAAGGGGCTCTGGCAC
CACTTCACTGATGTGGAGCGACAGATGACCGCACAGCACTATGTGACAGAATTTAACAAGAGACTCTATGAACAAAACATTCCCACCCAG
ATATTCTACATCCCATCCACAATACTACTGATTTTAGAGGACAAGACAATAAAGGGATGTATCAGTGTGGAGCCTTACATACTGGGAGAA
TTTGTAAAATTGTCAAATAACACGAAAGTGGTGAAAACAGAATACAAAGCCACAGAATATGGCTTGGCCTATGGCCATTTTTCTTATGAG
TTTTCTAATCATAGAGATGTTGTGGTCGATTTACAAGGTTGGGTAACCGGTAATGGAAAAGGACTCATCTACCTCACAGATCCCCAGATT
CACTCCGTTGATCAGAAAGTTTTCACTACCAATTTTGGAAAGAGAGGAATTTTTTACTTCTTTAATAACCAGCATGTGGAATGTAATGAA
ATCTGCCATCGTCTTTCTTTGACTAGACCTTCAATGGAGAAACCATCAAACATGTCTGACAAAGAGCTAAAGAAGCTACGTAATAAACAA
AGAAGAGCTCAAAAGAAAGCCCAGATAGAAGAAGAGAAAAAAAATGCAGAAAAAGAAAAGCAGCAGAGAAATCAGAAAAAGAAGAAGGAT
GATGATGATGAGGAGATAGGAGGTCCAAAAGAAGAACTTATTCCAGAGAAACTGGCCAAGGTTGAAACTCCATTGGAAGAAGCTATTAAA
TTTTTAACACCGTTGAAGAACTTGGTGAAGAACAAGATAGAGACTCATCTTTTTGCCTTTGAGATTTACTTTAGGAAAGAAAAGTTTCTT
TTGATGCTACAATCAGTAAAGAGGGCATTTGCTATTGATTCTAGTCATCCCTGGCTTCATGAGTGTATGATTCGTCTCTTTAATACTGTG
TGTGAAAGTAAAGATTTATCTGATACAGTTAGAACAGTATTAAAACAAGAAATGAATCGTCTTTTTGGAGCAACGAATCCAAAGAATTTT
AATGAAACTTTTCTGAAAAGGAATTCTGATTCATTGCCACACAGATTATCAGCTGCCAAAATGGTATATTACTTAGATCCTTCTAGTCAG
AAGCGAGCTATAGAGTTGGCAACAACACTTGATGAATCTCTCACTAACAGAAACCTCCAGACATGTATGGAGGTATTGGAAGCCTTGTAT
GATGGTAGCCTAGGAGACTGTAAAGAAGCTGCTGAAATTTATAGAGCAAATTGTCATAAGCTTTTCCCTTATGCTTTGGCTTTCATGCCT
CCTGGATATGAAGAGGATATGAAGATCACAGTTAATGGAGATAGTTCTGCAGAAGCTGAAGAACTGGCCAATGAAATTTGAACATCACTA
AACAAGCAAATGGAATGACTTTGGACCATATCTAGTATATAATATTTTTGTCACGCACCTGCTGCATTGCTCTAACTTACACAGAATGAG
AGGAGTAAATGTTCTTGCCTTCAAATAGTGTTTTACGTTTTTTATCCTGCTGAAAAAGTATATATAAAATATCTAACATTACAGGATAGA
GGTTCAGTTTCTTAAAAAATTAAAGCTGCTAAAATTGAGTGGTTAAAAAAGATACCTTATCCTATTCCTCCCCACCCACCCATGTTTTTA
AACTAATTTATATAAAATCTGGAGGCTGTTACAGCTAACAAAGCAGGTGTGTGGCAGAAATATTACTTTAAATTTGTCTTGTGAGATTTT
ACTATATCTCAGACAGCATAAATGCTGTTTTAGCACTGGATTCTTTCACTGAGCACAAAGAGTTGTTGGGGCTTTAGCATCTGACTGATT
TTGTTACGGGGTTGATTCTGACCATAGGAAGTATGCAATGTGAATCACTATTTACAGAGAAACCTACAACAGATGCTTGATGTTGTAGAA
ACTGGGACATATAGATACCAAGCAAAATTATAAGAAACCTATAAGGTGTTCAATACGCTTGTGTTTCCAAAATTCACTGTACATGATCAG
TTTGGTGTTCTTGTACCACAGTTTTTAACTGAAGGAACCAGTTGTAACAGTCTCAATTTTAACTAAAACTTGAAGAACTAAAACAACAAT
GCAAACCTTTCAGCATTGTTTGGCCAAACTTGTTAAAACTGTAATGCAAGAACCAAATGCACTGTGATGTGGCACCAACTAATTAGCAAG
CATGAATTTTTCACCCAAGAGTGAAAAAAGGAAAATCTACCATGGCTTGAAGTTAAAGAGCAGAACTCCTGACTACCATTCTATGACTGA
TCAAAAGACTAATAGTTAAAAACCTCAGCAGGCCTTGTTCACGATATGCAGAAAAAAAAGTGCTGCAGTTTAGATACCTCTGGAATTTTT
CCACAGTGTCACAGGTTTGTAATACTTGAAGCCCTACATTTCTAAGAATATATTTCTTGCTCAGTTGTTTCAGGCAAGCCCAAGACTTTG
TAATTTTTAAAGGGCCCAAGATTTTTTTTTTTTTTTTTTTTTTCAAATAACAGACCAGCTTCTTTTTCTTGCAGTTACAGATGTAATTTC
CTTTTTGTTGTCAAACATAAGGTACCAAATATGATGCAATAAATTGTTTTGAAAAACAGTTGTGTGAATATTTCAACTAATCTGTGTTGG
GCTTCTGTGAAATACACAGGTGGAAACAGAGGTGCAAGCCAGAGGCAATGTAATATGCTGTAAGGCTAGTGCAGATGGGAGCTTTTTAGA
AGGGGCTAAGTGCTGGTGTCAGGGAAATTCCATAATGAAGTAGAATGCTGCTCCTGCATTAAGATTTCATTGAGGGCAAGGCTGGTGGCA
GGTACTATGAATGTAATTCATAATTTAAAAGGAAAACTAAAAACTATTTTGATTTGGGAAAATGAGCCTTAATTTGTTAAACCTATACAC
TGAGAACTAGCCTCAGGCTTAATATTCTCATTGCATTTGCAAGATCTGAGCAAATAAGATTAAGTAAAACAAATCAATTGTATATATAAT
TGACCTTTTTGTGGAACATGTAGTTTATAGAAAGTATACTCTAAAGGGAATTTGCCGAAGACCTTTTACTGATTGAACAGTTGTGCTACA
ATCAACTTTTCATAGTACATGACCTGCATTCCACATCTCAGTCTAACAGTTTAGTAGTGATGTAAAGAGAAGTACAAACCGAACTCCAGT
GCTTTGTTATGTTTTATTAACTGGCCCTGTCTCAGGAACATCTTAACAGATGGCAAAAAAACAAAAACTTTTTTTCAACTCCTATGAGTG
GCAACTGAAGTTCTTATTGTTGGGAAAGAACACTAGTCCTACCTCTGCCACTAATGAGGTGTTTGGAGGAGGTACCAGCCATATAATAGG
GGGTGTATGTGTGAATTTTGTTTAAACTCTACTGTATATTGAAATGAAATTCATTTATTTGTCTTGACAATGTTCAAATGATGTAGATTG
TCTTAGAATGAATATTCATAAGTACTCAGAACTCTTAAGATGCAGATGCCACCCGTGAGGAGCTAAATTCCTAATGTGTATTGTATTCCA
ACCCAATTTTACTGGAACTATTGAATAAATCTTTTATTTTCTTTCAGGTTTACTTGATAGTGGATACATTGGTGTCAGAGGACCTTGACT
GGGTTCATTTTATGTCCAGACATCACCCCTGAAACACTGTACTGTACTATCTTGCTCTGAGTAGTATCAACTGGATCGTCTCATATTTGC
CTCATTCATCCTATTAATTTCAAATGATACTGTGGGGGAAAAACAGGGTTAGAAAAACAAGTGGAAAAAATGGAGTGATGTTGTAATCTA
AACAAGTGCCTTATGTTTATTGCTAAGAACTGGTGTTACCAACCCTTTTGAGAAGAAAGGGTCTCTTGACCTGTATTAACATAGGAAAGT
AAAGTTCTTTTTGTTTTTCTTTATATTCAGCTACTCTTGTCATTTCTCGTTGAAAAACTAAAATCTGACTAGGTTAGTTTACTCAGCTTT
AATTAGATAGTTGAGTCATATATTTTCAACATTTTTCTGTATCGTATTTATTATGCACAAAAATAAAGTGTGATCTCTAATAGCATGGCT
AAAGGTAATGCCAATATTAGTGAATAGCTTTGCTGTGGGCTTTATCAGGCCCTTGTTTTTCACAGTGTTGTTTGTACTCCATGGTGTATT
GCTATTAGAGCTGTGAAATGAAAGCTGTGACTTTATGAAGAGATCAAAAAAAGTGTGGTCCAGATTCAGGTAGGTTGTGGTAAGTTCAGG
GAAGTCTTGTATGATTTCTAAGAATATTTCAGTTACCATGTATAGTATTTGGGAAAAGCTGTAATGTAAAATATTGGACTTTGTTGCAAG
ATAGGTATATACTTGTGTCAGATAATTAAAGCCTTAAATTTTGAAATGAATCTTTGAGATTTCAAAGAAAGACTGACCTTTCAAATAGAA
GGCGGTCTTTCTATTCTAGCTAATGCCCCACTTCTTTAAGTTATAAACAAAGTTTCATGATACCATTCTGCTATCATCTAAACTTTGCTG
AACTCTACTGCCAACCTACATTAAAAACAAAGTCCCACGAAATGGCTGTGTTACACCATAGTGAGCAGAAACTTAAATTTTGTTCTAATT

>4109_4109_4_ALPK1-NAA15_ALPK1_chr4_113362261_ENST00000458497_NAA15_chr4_140291365_ENST00000398947_length(amino acids)=1523AA_BP=1243
MNNQKVVAVLLQECKQVLDQLLLEAPDVSEEDKSEDQRCRALLPSELRTLIQEAKEMKWPFVPEKWQYKQAVGPEDKTNLKDVIGAGLQQ
LLASLRASILARDCAAAAAIVFLVDRFLYGLDVSGKLLQVAKGLHKLQPATPIAPQVVIRQARISVNSGKLLKAEYILSSLISNNGATGT
WLYRNESDKVLVQSVCIQIRGQILQKLGMWYEAAELIWASIVGYLALPQPDKKGLSTSLGILADIFVSMSKNDYEKFKNNPQINLSLLKE
FDHHLLSAAEACKLAAAFSAYTPLFVLTAVNIRGTCLLSYSSSNDCPPELKNLHLCEAKEAFEIGLLTKRDDEPVTGKQELHSFVKAAFG
LTTVHRRLHGETGTVHAASQLCKEAMGKLYNFSTSSRSQDREALSQEVMSVIAQVKEHLQVQSFSNVDDRSYVPESFECRLDKLILHGQG
DFQKILDTYSQHHTSVCEVFESDCGNNKNEQKDAKTGVCITALKTEIKNIDTVSTTQEKPHCQRDTGISSSLMGKNVQRELRRGGRRNWT
HSDAFRVSLDQDVETETEPSDYSNGEGAVFNKSLSGSQTSSAWSNLSGFSSSASWEEVNYHVDDRSARKEPGKEHLVDTQCSTALSEELE
NDREGRAMHSLHSQLHDLSLQEPNNDNLEPSQNQPQQQMPLTPFSPHNTPGIFLAPGAGLLEGAPEGIQEVRNMGPRNTSAHSRPSYRSA
SWSSDSGRPKNMGTHPSVQKEEAFEIIVEFPETNCDVKDRQGKEQGEEISERGAGPTFKASPSWVDPEGETAESTEDAPLDFHRVLHNSL
GNISMLPCSSFTPNWPVQNPDSRKSGGPVAEQGIDPDASTVDEEGQLLDSMDVPCTNGHGSHRLCILRQPPGQRAETPNSSVSGNILFPV
LSEDCTTTEEGNQPGNMLNCSQNSSSSSVWWLKSPAFSSGSSEGDSPWSYLNSSGSSWVSLPGKMRKEILEARTLQPDDFEKLLAGVRHD
WLFQRLENTGVFKPSQLHRAHSALLLKYSKKSELWTAQETIVYLGDYLTVKKKGRQRNAFWVHHLHQEEILGRYVGKDYKEQKGLWHHFT
DVERQMTAQHYVTEFNKRLYEQNIPTQIFYIPSTILLILEDKTIKGCISVEPYILGEFVKLSNNTKVVKTEYKATEYGLAYGHFSYEFSN
HRDVVVDLQGWVTGNGKGLIYLTDPQIHSVDQKVFTTNFGKRGIFYFFNNQHVECNEICHRLSLTRPSMEKPSNMSDKELKKLRNKQRRA
QKKAQIEEEKKNAEKEKQQRNQKKKKDDDDEEIGGPKEELIPEKLAKVETPLEEAIKFLTPLKNLVKNKIETHLFAFEIYFRKEKFLLML
QSVKRAFAIDSSHPWLHECMIRLFNTVCESKDLSDTVRTVLKQEMNRLFGATNPKNFNETFLKRNSDSLPHRLSAAKMVYYLDPSSQKRA

--------------------------------------------------------------
>4109_4109_5_ALPK1-NAA15_ALPK1_chr4_113362261_ENST00000504176_NAA15_chr4_140291365_ENST00000296543_length(transcript)=7908nt_BP=3762nt
ACAGTGTGTTTCAGAGAGGCTGTACCAGAATTAACTCTGCTCAGAGTTAGATTTGCTGGTCTTAAAGTACTTTTCCTCTTTAAGATAAAA
GAAGTTCTTCTAAATCAGGAATGGATTGAAATCTAATGAACCGAAACTTTGGGTAATTGATCACCCTAGACCCAGGGACACCCAATTCAT
CGTAATCATCATGAATAATCAAAAAGTGGTAGCTGTGCTACTGCAAGAGTGCAAGCAAGTGCTGGATCAGCTCTTGTTGGAAGCGCCAGA
TGTGTCGGAAGAGGACAAGAGCGAGGACCAGCGCTGCAGAGGCGTCCCTGAGGGCCTCCATCCTCGCTCGGGACTGTGCGGCTGCGGCGG
CTATTGTGTTCTTGGTGGACCGGTTCCTGTATGGGCTCGACGTCTCTGGAAAACTTCTGCAGGTCGCCAAAGGTCTCCACAAGTTGCAGC
CAGCCACGCCAATTGCCCCGCAGGTGGTTATTCGCCAAGCCCGAATCTCCGTGAACTCAGGAAAACTTTTAAAAGCAGAGTATATTCTGA
GCAGTCTAATAAGCAACAATGGAGCAACGGGTACCTGGCTGTACAGAAATGAAAGTGACAAGGTCCTGGTGCAGTCGGTCTGTATACAGA
TCAGAGGGCAGATTCTGCAAAAGCTGGGGATGTGGTACGAAGCAGCAGAGTTAATATGGGCCTCCATTGTAGGATATTTGGCACTTCCTC
AGCCGGATAAAAAGGGCCTCTCCACGTCGCTAGGTATACTGGCAGACATCTTTGTTTCCATGAGCAAGAACGATTATGAAAAGTTTAAAA
ACAATCCACAAATTAATTTGAGCCTGCTGAAGGAGTTTGACCACCATTTGCTGTCCGCTGCAGAAGCCTGCAAGCTGGCAGCTGCCTTCA
GTGCCTATACGCCGCTCTTCGTGCTCACAGCTGTGAATATCCGTGGCACGTGTTTATTGTCCTACAGTAGTTCAAATGACTGTCCTCCAG
AATTGAAAAACTTACATCTGTGTGAAGCCAAAGAGGCCTTTGAGATTGGCCTCCTCACCAAGAGAGATGATGAGCCTGTTACTGGAAAAC
AGGAGCTTCACAGCTTTGTCAAAGCTGCTTTCGGTCTCACCACAGTGCACAGAAGGCTCCATGGGGAGACAGGGACGGTCCATGCAGCAA
GTCAGCTCTGTAAGGAAGCAATGGGGAAGCTGTACAATTTCAGCACTTCCTCCAGAAGTCAGGACAGAGAAGCTCTGTCTCAAGAAGTTA
TGTCTGTGATTGCCCAGGTGAAGGAACATTTACAAGTTCAAAGCTTCTCAAATGTAGATGACAGATCTTATGTTCCCGAGAGTTTCGAGT
GCAGGTTGGATAAACTTATCTTGCATGGGCAAGGGGATTTCCAAAAAATCCTTGACACCTATTCACAGCACCATACTTCGGTGTGTGAAG
TATTTGAAAGTGATTGTGGAAACAACAAAAATGAACAGAAAGATGCAAAAACAGGAGTCTGCATCACTGCTCTAAAAACAGAAATAAAAA
ACATAGATACTGTGAGTACTACTCAAGAAAAGCCACATTGTCAAAGAGACACAGGAATATCTTCCTCCCTAATGGGTAAGAATGTTCAGA
GGGAACTCAGAAGGGGAGGAAGGAGAAACTGGACCCATTCTGATGCATTTCGAGTCTCCTTGGATCAAGATGTGGAGACTGAGACTGAGC
CATCGGACTACAGCAATGGTGAGGGAGCTGTTTTCAACAAGTCTCTGAGTGGCAGCCAGACTTCCAGTGCTTGGAGCAACTTATCAGGGT
TTAGTTCCTCTGCAAGCTGGGAGGAAGTGAATTATCACGTTGACGACAGGTCAGCCAGAAAAGAGCCTGGCAAAGAACATCTGGTGGACA
CTCAGTGTTCCACTGCCTTGTCTGAGGAGCTAGAGAATGACAGGGAAGGCAGAGCTATGCATTCATTGCATTCACAGCTTCATGATCTCT
CTCTTCAGGAACCCAACAATGACAATTTGGAGCCTTCTCAAAATCAGCCACAGCAACAGATGCCCTTGACACCCTTCTCGCCTCATAATA
CCCCAGGCATTTTCTTGGCCCCTGGTGCAGGGCTTCTAGAAGGAGCTCCAGAAGGTATCCAGGAAGTCAGAAATATGGGACCCAGAAATA
CTTCTGCTCACTCCAGACCCTCATATCGTTCTGCTTCTTGGTCTTCTGATTCTGGTAGGCCCAAGAATATGGGCACACATCCTTCAGTCC
AAAAAGAAGAAGCCTTTGAAATAATTGTTGAGTTTCCAGAAACCAACTGCGATGTCAAAGACAGGCAGGGGAAAGAGCAGGGAGAAGAAA
TTAGTGAAAGAGGCGCAGGCCCTACATTTAAAGCTAGTCCCTCCTGGGTTGACCCAGAAGGAGAAACAGCAGAAAGCACTGAAGATGCAC
CCTTAGACTTTCACAGGGTCCTGCACAATTCTCTGGGAAACATTTCCATGCTGCCATGTAGCTCCTTCACCCCTAATTGGCCTGTTCAAA
ATCCTGACTCCAGAAAAAGTGGTGGCCCAGTCGCAGAGCAGGGCATCGACCCTGATGCCTCCACAGTGGATGAGGAGGGGCAACTGCTCG
ACAGCATGGATGTTCCCTGCACAAATGGGCACGGCTCTCATAGACTGTGCATTCTGAGACAGCCGCCTGGTCAGAGGGCGGAGACCCCCA
ATTCCTCTGTAAGCGGTAACATCCTCTTCCCTGTCCTCAGCGAGGACTGCACTACCACAGAGGAAGGAAATCAGCCTGGAAACATGCTAA
ACTGCAGCCAGAACTCCAGCTCATCCTCAGTGTGGTGGCTGAAATCACCTGCATTTTCCAGTGGTTCTTCTGAGGGGGACAGCCCTTGGT
CCTATCTGAATTCCAGTGGGAGTTCTTGGGTTTCATTGCCGGGAAAGATGAGGAAAGAGATCCTTGAGGCTCGCACCTTGCAACCTGATG
ACTTTGAAAAGCTGTTGGCAGGAGTGAGGCATGATTGGCTGTTTCAGAGACTAGAGAATACGGGGGTTTTTAAGCCCAGTCAACTCCACC
GAGCACATAGTGCTCTTTTGTTAAAATATTCAAAAAAATCTGAACTGTGGACGGCCCAGGAAACTATTGTCTATTTGGGGGACTACTTGA
CTGTGAAGAAAAAAGGCAGACAAAGAAATGCTTTTTGGGTTCATCATCTTCATCAAGAAGAAATTCTGGGGAGGTATGTTGGGAAAGACT
ATAAGGAGCAGAAGGGGCTCTGGCACCACTTCACTGATGTGGAGCGACAGATGACCGCACAGCACTATGTGACAGAATTTAACAAGAGAC
TCTATGAACAAAACATTCCCACCCAGATATTCTACATCCCATCCACAATACTACTGATTTTAGAGGACAAGACAATAAAGGGATGTATCA
GTGTGGAGCCTTACATACTGGGAGAATTTGTAAAATTGTCAAATAACACGAAAGTGGTGAAAACAGAATACAAAGCCACAGAATATGGCT
TGGCCTATGGCCATTTTTCTTATGAGTTTTCTAATCATAGAGATGTTGTGGTCGATTTACAAGGTTGGGTAACCGGTAATGGAAAAGGAC
TCATCTACCTCACAGATCCCCAGATTCACTCCGTTGATCAGAAAGTTTTCACTACCAATTTTGGAAAGAGAGGAATTTTTTACTTCTTTA
ATAACCAGCATGTGGAATGTAATGAAATCTGCCATCGTCTTTCTTTGACTAGACCTTCAATGGAGAAACCATCAAACATGTCTGACAAAG
AGCTAAAGAAGCTACGTAATAAACAAAGAAGAGCTCAAAAGAAAGCCCAGATAGAAGAAGAGAAAAAAAATGCAGAAAAAGAAAAGCAGC
AGAGAAATCAGAAAAAGAAGAAGGATGATGATGATGAGGAGATAGGAGGTCCAAAAGAAGAACTTATTCCAGAGAAACTGGCCAAGGTTG
AAACTCCATTGGAAGAAGCTATTAAATTTTTAACACCGTTGAAGAACTTGGTGAAGAACAAGATAGAGACTCATCTTTTTGCCTTTGAGA
TTTACTTTAGGAAAGAAAAGTTTCTTTTGATGCTACAATCAGTAAAGAGGGCATTTGCTATTGATTCTAGTCATCCCTGGCTTCATGAGT
GTATGATTCGTCTCTTTAATACTGCAGTGTGTGAAAGTAAAGATTTATCTGATACAGTTAGAACAGTATTAAAACAAGAAATGAATCGTC
TTTTTGGAGCAACGAATCCAAAGAATTTTAATGAAACTTTTCTGAAAAGGAATTCTGATTCATTGCCACACAGATTATCAGCTGCCAAAA
TGGTATATTACTTAGATCCTTCTAGTCAGAAGCGAGCTATAGAGTTGGCAACAACACTTGATGAATCTCTCACTAACAGAAACCTCCAGA
CATGTATGGAGGTATTGGAAGCCTTGTATGATGGTAGCCTAGGAGACTGTAAAGAAGCTGCTGAAATTTATAGAGCAAATTGTCATAAGC
TTTTCCCTTATGCTTTGGCTTTCATGCCTCCTGGATATGAAGAGGATATGAAGATCACAGTTAATGGAGATAGTTCTGCAGAAGCTGAAG
AACTGGCCAATGAAATTTGAACATCACTAAACAAGCAAATGGAATGACTTTGGACCATATCTAGTATATAATATTTTTGTCACGCACCTG
CTGCATTGCTCTAACTTACACAGAATGAGAGGAGTAAATGTTCTTGCCTTCAAATAGTGTTTTACGTTTTTTATCCTGCTGAAAAAGTAT
ATATAAAATATCTAACATTACAGGATAGAGGTTCAGTTTCTTAAAAAATTAAAGCTGCTAAAATTGAGTGGTTAAAAAAGATACCTTATC
CTATTCCTCCCCACCCACCCATGTTTTTAAACTAATTTATATAAAATCTGGAGGCTGTTACAGCTAACAAAGCAGGTGTGTGGCAGAAAT
ATTACTTTAAATTTGTCTTGTGAGATTTTACTATATCTCAGACAGCATAAATGCTGTTTTAGCACTGGATTCTTTCACTGAGCACAAAGA
GTTGTTGGGGCTTTAGCATCTGACTGATTTTGTTACGGGGTTGATTCTGACCATAGGAAGTATGCAATGTGAATCACTATTTACAGAGAA
ACCTACAACAGATGCTTGATGTTGTAGAAACTGGGACATATAGATACCAAGCAAAATTATAAGAAACCTATAAGGTGTTCAATACGCTTG
TGTTTCCAAAATTCACTGTACATGATCAGTTTGGTGTTCTTGTACCACAGTTTTTAACTGAAGGAACCAGTTGTAACAGTCTCAATTTTA
ACTAAAACTTGAAGAACTAAAACAACAATGCAAACCTTTCAGCATTGTTTGGCCAAACTTGTTAAAACTGTAATGCAAGAACCAAATGCA
CTGTGATGTGGCACCAACTAATTAGCAAGCATGAATTTTTCACCCAAGAGTGAAAAAAGGAAAATCTACCATGGCTTGAAGTTAAAGAGC
AGAACTCCTGACTACCATTCTATGACTGATCAAAAGACTAATAGTTAAAAACCTCAGCAGGCCTTGTTCACGATATGCAGAAAAAAAAGT
GCTGCAGTTTAGATACCTCTGGAATTTTTCCACAGTGTCACAGGTTTGTAATACTTGAAGCCCTACATTTCTAAGAATATATTTCTTGCT
CAGTTGTTTCAGGCAAGCCCAAGACTTTGTAATTTTTAAAGGGCCCAAGATTTTTTTTTTTTTTTTTTTTTTCAAATAACAGACCAGCTT
CTTTTTCTTGCAGTTACAGATGTAATTTCCTTTTTGTTGTCAAACATAAGGTACCAAATATGATGCAATAAATTGTTTTGAAAAACAGTT
GTGTGAATATTTCAACTAATCTGTGTTGGGCTTCTGTGAAATACACAGGTGGAAACAGAGGTGCAAGCCAGAGGCAATGTAATATGCTGT
AAGGCTAGTGCAGATGGGAGCTTTTTAGAAGGGGCTAAGTGCTGGTGTCAGGGAAATTCCATAATGAAGTAGAATGCTGCTCCTGCATTA
AGATTTCATTGAGGGCAAGGCTGGTGGCAGGTACTATGAATGTAATTCATAATTTAAAAGGAAAACTAAAAACTATTTTGATTTGGGAAA
ATGAGCCTTAATTTGTTAAACCTATACACTGAGAACTAGCCTCAGGCTTAATATTCTCATTGCATTTGCAAGATCTGAGCAAATAAGATT
AAGTAAAACAAATCAATTGTATATATAATTGACCTTTTTGTGGAACATGTAGTTTATAGAAAGTATACTCTAAAGGGAATTTGCCGAAGA
CCTTTTACTGATTGAACAGTTGTGCTACAATCAACTTTTCATAGTACATGACCTGCATTCCACATCTCAGTCTAACAGTTTAGTAGTGAT
GTAAAGAGAAGTACAAACCGAACTCCAGTGCTTTGTTATGTTTTATTAACTGGCCCTGTCTCAGGAACATCTTAACAGATGGCAAAAAAA
CAAAAACTTTTTTTCAACTCCTATGAGTGGCAACTGAAGTTCTTATTGTTGGGAAAGAACACTAGTCCTACCTCTGCCACTAATGAGGTG
TTTGGAGGAGGTACCAGCCATATAATAGGGGGTGTATGTGTGAATTTTGTTTAAACTCTACTGTATATTGAAATGAAATTCATTTATTTG
TCTTGACAATGTTCAAATGATGTAGATTGTCTTAGAATGAATATTCATAAGTACTCAGAACTCTTAAGATGCAGATGCCACCCGTGAGGA
GCTAAATTCCTAATGTGTATTGTATTCCAACCCAATTTTACTGGAACTATTGAATAAATCTTTTATTTTCTTTCAGGTTTACTTGATAGT
GGATACATTGGTGTCAGAGGACCTTGACTGGGTTCATTTTATGTCCAGACATCACCCCTGAAACACTGTACTGTACTATCTTGCTCTGAG
TAGTATCAACTGGATCGTCTCATATTTGCCTCATTCATCCTATTAATTTCAAATGATACTGTGGGGGAAAAACAGGGTTAGAAAAACAAG
TGGAAAAAATGGAGTGATGTTGTAATCTAAACAAGTGCCTTATGTTTATTGCTAAGAACTGGTGTTACCAACCCTTTTGAGAAGAAAGGG
TCTCTTGACCTGTATTAACATAGGAAAGTAAAGTTCTTTTTGTTTTTCTTTATATTCAGCTACTCTTGTCATTTCTCGTTGAAAAACTAA
AATCTGACTAGGTTAGTTTACTCAGCTTTAATTAGATAGTTGAGTCATATATTTTCAACATTTTTCTGTATCGTATTTATTATGCACAAA
AATAAAGTGTGATCTCTAATAGCATGGCTAAAGGTAATGCCAATATTAGTGAATAGCTTTGCTGTGGGCTTTATCAGGCCCTTGTTTTTC
ACAGTGTTGTTTGTACTCCATGGTGTATTGCTATTAGAGCTGTGAAATGAAAGCTGTGACTTTATGAAGAGATCAAAAAAAGTGTGGTCC
AGATTCAGGTAGGTTGTGGTAAGTTCAGGGAAGTCTTGTATGATTTCTAAGAATATTTCAGTTACCATGTATAGTATTTGGGAAAAGCTG
TAATGTAAAATATTGGACTTTGTTGCAAGATAGGTATATACTTGTGTCAGATAATTAAAGCCTTAAATTTTGAAATGAATCTTTGAGATT
TCAAAGAAAGACTGACCTTTCAAATAGAAGGCGGTCTTTCTATTCTAGCTAATGCCCCACTTCTTTAAGTTATAAACAAAGTTTCATGAT
ACCATTCTGCTATCATCTAAACTTTGCTGAACTCTACTGCCAACCTACATTAAAAACAAAGTCCCACGAAATGGCTGTGTTACACCATAG

>4109_4109_5_ALPK1-NAA15_ALPK1_chr4_113362261_ENST00000504176_NAA15_chr4_140291365_ENST00000296543_length(amino acids)=1465AA_BP=1184
MCYCKSASKCWISSCWKRQMCRKRTRARTSAAEASLRASILARDCAAAAAIVFLVDRFLYGLDVSGKLLQVAKGLHKLQPATPIAPQVVI
RQARISVNSGKLLKAEYILSSLISNNGATGTWLYRNESDKVLVQSVCIQIRGQILQKLGMWYEAAELIWASIVGYLALPQPDKKGLSTSL
GILADIFVSMSKNDYEKFKNNPQINLSLLKEFDHHLLSAAEACKLAAAFSAYTPLFVLTAVNIRGTCLLSYSSSNDCPPELKNLHLCEAK
EAFEIGLLTKRDDEPVTGKQELHSFVKAAFGLTTVHRRLHGETGTVHAASQLCKEAMGKLYNFSTSSRSQDREALSQEVMSVIAQVKEHL
QVQSFSNVDDRSYVPESFECRLDKLILHGQGDFQKILDTYSQHHTSVCEVFESDCGNNKNEQKDAKTGVCITALKTEIKNIDTVSTTQEK
PHCQRDTGISSSLMGKNVQRELRRGGRRNWTHSDAFRVSLDQDVETETEPSDYSNGEGAVFNKSLSGSQTSSAWSNLSGFSSSASWEEVN
YHVDDRSARKEPGKEHLVDTQCSTALSEELENDREGRAMHSLHSQLHDLSLQEPNNDNLEPSQNQPQQQMPLTPFSPHNTPGIFLAPGAG
LLEGAPEGIQEVRNMGPRNTSAHSRPSYRSASWSSDSGRPKNMGTHPSVQKEEAFEIIVEFPETNCDVKDRQGKEQGEEISERGAGPTFK
ASPSWVDPEGETAESTEDAPLDFHRVLHNSLGNISMLPCSSFTPNWPVQNPDSRKSGGPVAEQGIDPDASTVDEEGQLLDSMDVPCTNGH
GSHRLCILRQPPGQRAETPNSSVSGNILFPVLSEDCTTTEEGNQPGNMLNCSQNSSSSSVWWLKSPAFSSGSSEGDSPWSYLNSSGSSWV
SLPGKMRKEILEARTLQPDDFEKLLAGVRHDWLFQRLENTGVFKPSQLHRAHSALLLKYSKKSELWTAQETIVYLGDYLTVKKKGRQRNA
FWVHHLHQEEILGRYVGKDYKEQKGLWHHFTDVERQMTAQHYVTEFNKRLYEQNIPTQIFYIPSTILLILEDKTIKGCISVEPYILGEFV
KLSNNTKVVKTEYKATEYGLAYGHFSYEFSNHRDVVVDLQGWVTGNGKGLIYLTDPQIHSVDQKVFTTNFGKRGIFYFFNNQHVECNEIC
HRLSLTRPSMEKPSNMSDKELKKLRNKQRRAQKKAQIEEEKKNAEKEKQQRNQKKKKDDDDEEIGGPKEELIPEKLAKVETPLEEAIKFL
TPLKNLVKNKIETHLFAFEIYFRKEKFLLMLQSVKRAFAIDSSHPWLHECMIRLFNTAVCESKDLSDTVRTVLKQEMNRLFGATNPKNFN
ETFLKRNSDSLPHRLSAAKMVYYLDPSSQKRAIELATTLDESLTNRNLQTCMEVLEALYDGSLGDCKEAAEIYRANCHKLFPYALAFMPP

--------------------------------------------------------------
>4109_4109_6_ALPK1-NAA15_ALPK1_chr4_113362261_ENST00000504176_NAA15_chr4_140291365_ENST00000398947_length(transcript)=7891nt_BP=3762nt
ACAGTGTGTTTCAGAGAGGCTGTACCAGAATTAACTCTGCTCAGAGTTAGATTTGCTGGTCTTAAAGTACTTTTCCTCTTTAAGATAAAA
GAAGTTCTTCTAAATCAGGAATGGATTGAAATCTAATGAACCGAAACTTTGGGTAATTGATCACCCTAGACCCAGGGACACCCAATTCAT
CGTAATCATCATGAATAATCAAAAAGTGGTAGCTGTGCTACTGCAAGAGTGCAAGCAAGTGCTGGATCAGCTCTTGTTGGAAGCGCCAGA
TGTGTCGGAAGAGGACAAGAGCGAGGACCAGCGCTGCAGAGGCGTCCCTGAGGGCCTCCATCCTCGCTCGGGACTGTGCGGCTGCGGCGG
CTATTGTGTTCTTGGTGGACCGGTTCCTGTATGGGCTCGACGTCTCTGGAAAACTTCTGCAGGTCGCCAAAGGTCTCCACAAGTTGCAGC
CAGCCACGCCAATTGCCCCGCAGGTGGTTATTCGCCAAGCCCGAATCTCCGTGAACTCAGGAAAACTTTTAAAAGCAGAGTATATTCTGA
GCAGTCTAATAAGCAACAATGGAGCAACGGGTACCTGGCTGTACAGAAATGAAAGTGACAAGGTCCTGGTGCAGTCGGTCTGTATACAGA
TCAGAGGGCAGATTCTGCAAAAGCTGGGGATGTGGTACGAAGCAGCAGAGTTAATATGGGCCTCCATTGTAGGATATTTGGCACTTCCTC
AGCCGGATAAAAAGGGCCTCTCCACGTCGCTAGGTATACTGGCAGACATCTTTGTTTCCATGAGCAAGAACGATTATGAAAAGTTTAAAA
ACAATCCACAAATTAATTTGAGCCTGCTGAAGGAGTTTGACCACCATTTGCTGTCCGCTGCAGAAGCCTGCAAGCTGGCAGCTGCCTTCA
GTGCCTATACGCCGCTCTTCGTGCTCACAGCTGTGAATATCCGTGGCACGTGTTTATTGTCCTACAGTAGTTCAAATGACTGTCCTCCAG
AATTGAAAAACTTACATCTGTGTGAAGCCAAAGAGGCCTTTGAGATTGGCCTCCTCACCAAGAGAGATGATGAGCCTGTTACTGGAAAAC
AGGAGCTTCACAGCTTTGTCAAAGCTGCTTTCGGTCTCACCACAGTGCACAGAAGGCTCCATGGGGAGACAGGGACGGTCCATGCAGCAA
GTCAGCTCTGTAAGGAAGCAATGGGGAAGCTGTACAATTTCAGCACTTCCTCCAGAAGTCAGGACAGAGAAGCTCTGTCTCAAGAAGTTA
TGTCTGTGATTGCCCAGGTGAAGGAACATTTACAAGTTCAAAGCTTCTCAAATGTAGATGACAGATCTTATGTTCCCGAGAGTTTCGAGT
GCAGGTTGGATAAACTTATCTTGCATGGGCAAGGGGATTTCCAAAAAATCCTTGACACCTATTCACAGCACCATACTTCGGTGTGTGAAG
TATTTGAAAGTGATTGTGGAAACAACAAAAATGAACAGAAAGATGCAAAAACAGGAGTCTGCATCACTGCTCTAAAAACAGAAATAAAAA
ACATAGATACTGTGAGTACTACTCAAGAAAAGCCACATTGTCAAAGAGACACAGGAATATCTTCCTCCCTAATGGGTAAGAATGTTCAGA
GGGAACTCAGAAGGGGAGGAAGGAGAAACTGGACCCATTCTGATGCATTTCGAGTCTCCTTGGATCAAGATGTGGAGACTGAGACTGAGC
CATCGGACTACAGCAATGGTGAGGGAGCTGTTTTCAACAAGTCTCTGAGTGGCAGCCAGACTTCCAGTGCTTGGAGCAACTTATCAGGGT
TTAGTTCCTCTGCAAGCTGGGAGGAAGTGAATTATCACGTTGACGACAGGTCAGCCAGAAAAGAGCCTGGCAAAGAACATCTGGTGGACA
CTCAGTGTTCCACTGCCTTGTCTGAGGAGCTAGAGAATGACAGGGAAGGCAGAGCTATGCATTCATTGCATTCACAGCTTCATGATCTCT
CTCTTCAGGAACCCAACAATGACAATTTGGAGCCTTCTCAAAATCAGCCACAGCAACAGATGCCCTTGACACCCTTCTCGCCTCATAATA
CCCCAGGCATTTTCTTGGCCCCTGGTGCAGGGCTTCTAGAAGGAGCTCCAGAAGGTATCCAGGAAGTCAGAAATATGGGACCCAGAAATA
CTTCTGCTCACTCCAGACCCTCATATCGTTCTGCTTCTTGGTCTTCTGATTCTGGTAGGCCCAAGAATATGGGCACACATCCTTCAGTCC
AAAAAGAAGAAGCCTTTGAAATAATTGTTGAGTTTCCAGAAACCAACTGCGATGTCAAAGACAGGCAGGGGAAAGAGCAGGGAGAAGAAA
TTAGTGAAAGAGGCGCAGGCCCTACATTTAAAGCTAGTCCCTCCTGGGTTGACCCAGAAGGAGAAACAGCAGAAAGCACTGAAGATGCAC
CCTTAGACTTTCACAGGGTCCTGCACAATTCTCTGGGAAACATTTCCATGCTGCCATGTAGCTCCTTCACCCCTAATTGGCCTGTTCAAA
ATCCTGACTCCAGAAAAAGTGGTGGCCCAGTCGCAGAGCAGGGCATCGACCCTGATGCCTCCACAGTGGATGAGGAGGGGCAACTGCTCG
ACAGCATGGATGTTCCCTGCACAAATGGGCACGGCTCTCATAGACTGTGCATTCTGAGACAGCCGCCTGGTCAGAGGGCGGAGACCCCCA
ATTCCTCTGTAAGCGGTAACATCCTCTTCCCTGTCCTCAGCGAGGACTGCACTACCACAGAGGAAGGAAATCAGCCTGGAAACATGCTAA
ACTGCAGCCAGAACTCCAGCTCATCCTCAGTGTGGTGGCTGAAATCACCTGCATTTTCCAGTGGTTCTTCTGAGGGGGACAGCCCTTGGT
CCTATCTGAATTCCAGTGGGAGTTCTTGGGTTTCATTGCCGGGAAAGATGAGGAAAGAGATCCTTGAGGCTCGCACCTTGCAACCTGATG
ACTTTGAAAAGCTGTTGGCAGGAGTGAGGCATGATTGGCTGTTTCAGAGACTAGAGAATACGGGGGTTTTTAAGCCCAGTCAACTCCACC
GAGCACATAGTGCTCTTTTGTTAAAATATTCAAAAAAATCTGAACTGTGGACGGCCCAGGAAACTATTGTCTATTTGGGGGACTACTTGA
CTGTGAAGAAAAAAGGCAGACAAAGAAATGCTTTTTGGGTTCATCATCTTCATCAAGAAGAAATTCTGGGGAGGTATGTTGGGAAAGACT
ATAAGGAGCAGAAGGGGCTCTGGCACCACTTCACTGATGTGGAGCGACAGATGACCGCACAGCACTATGTGACAGAATTTAACAAGAGAC
TCTATGAACAAAACATTCCCACCCAGATATTCTACATCCCATCCACAATACTACTGATTTTAGAGGACAAGACAATAAAGGGATGTATCA
GTGTGGAGCCTTACATACTGGGAGAATTTGTAAAATTGTCAAATAACACGAAAGTGGTGAAAACAGAATACAAAGCCACAGAATATGGCT
TGGCCTATGGCCATTTTTCTTATGAGTTTTCTAATCATAGAGATGTTGTGGTCGATTTACAAGGTTGGGTAACCGGTAATGGAAAAGGAC
TCATCTACCTCACAGATCCCCAGATTCACTCCGTTGATCAGAAAGTTTTCACTACCAATTTTGGAAAGAGAGGAATTTTTTACTTCTTTA
ATAACCAGCATGTGGAATGTAATGAAATCTGCCATCGTCTTTCTTTGACTAGACCTTCAATGGAGAAACCATCAAACATGTCTGACAAAG
AGCTAAAGAAGCTACGTAATAAACAAAGAAGAGCTCAAAAGAAAGCCCAGATAGAAGAAGAGAAAAAAAATGCAGAAAAAGAAAAGCAGC
AGAGAAATCAGAAAAAGAAGAAGGATGATGATGATGAGGAGATAGGAGGTCCAAAAGAAGAACTTATTCCAGAGAAACTGGCCAAGGTTG
AAACTCCATTGGAAGAAGCTATTAAATTTTTAACACCGTTGAAGAACTTGGTGAAGAACAAGATAGAGACTCATCTTTTTGCCTTTGAGA
TTTACTTTAGGAAAGAAAAGTTTCTTTTGATGCTACAATCAGTAAAGAGGGCATTTGCTATTGATTCTAGTCATCCCTGGCTTCATGAGT
GTATGATTCGTCTCTTTAATACTGTGTGTGAAAGTAAAGATTTATCTGATACAGTTAGAACAGTATTAAAACAAGAAATGAATCGTCTTT
TTGGAGCAACGAATCCAAAGAATTTTAATGAAACTTTTCTGAAAAGGAATTCTGATTCATTGCCACACAGATTATCAGCTGCCAAAATGG
TATATTACTTAGATCCTTCTAGTCAGAAGCGAGCTATAGAGTTGGCAACAACACTTGATGAATCTCTCACTAACAGAAACCTCCAGACAT
GTATGGAGGTATTGGAAGCCTTGTATGATGGTAGCCTAGGAGACTGTAAAGAAGCTGCTGAAATTTATAGAGCAAATTGTCATAAGCTTT
TCCCTTATGCTTTGGCTTTCATGCCTCCTGGATATGAAGAGGATATGAAGATCACAGTTAATGGAGATAGTTCTGCAGAAGCTGAAGAAC
TGGCCAATGAAATTTGAACATCACTAAACAAGCAAATGGAATGACTTTGGACCATATCTAGTATATAATATTTTTGTCACGCACCTGCTG
CATTGCTCTAACTTACACAGAATGAGAGGAGTAAATGTTCTTGCCTTCAAATAGTGTTTTACGTTTTTTATCCTGCTGAAAAAGTATATA
TAAAATATCTAACATTACAGGATAGAGGTTCAGTTTCTTAAAAAATTAAAGCTGCTAAAATTGAGTGGTTAAAAAAGATACCTTATCCTA
TTCCTCCCCACCCACCCATGTTTTTAAACTAATTTATATAAAATCTGGAGGCTGTTACAGCTAACAAAGCAGGTGTGTGGCAGAAATATT
ACTTTAAATTTGTCTTGTGAGATTTTACTATATCTCAGACAGCATAAATGCTGTTTTAGCACTGGATTCTTTCACTGAGCACAAAGAGTT
GTTGGGGCTTTAGCATCTGACTGATTTTGTTACGGGGTTGATTCTGACCATAGGAAGTATGCAATGTGAATCACTATTTACAGAGAAACC
TACAACAGATGCTTGATGTTGTAGAAACTGGGACATATAGATACCAAGCAAAATTATAAGAAACCTATAAGGTGTTCAATACGCTTGTGT
TTCCAAAATTCACTGTACATGATCAGTTTGGTGTTCTTGTACCACAGTTTTTAACTGAAGGAACCAGTTGTAACAGTCTCAATTTTAACT
AAAACTTGAAGAACTAAAACAACAATGCAAACCTTTCAGCATTGTTTGGCCAAACTTGTTAAAACTGTAATGCAAGAACCAAATGCACTG
TGATGTGGCACCAACTAATTAGCAAGCATGAATTTTTCACCCAAGAGTGAAAAAAGGAAAATCTACCATGGCTTGAAGTTAAAGAGCAGA
ACTCCTGACTACCATTCTATGACTGATCAAAAGACTAATAGTTAAAAACCTCAGCAGGCCTTGTTCACGATATGCAGAAAAAAAAGTGCT
GCAGTTTAGATACCTCTGGAATTTTTCCACAGTGTCACAGGTTTGTAATACTTGAAGCCCTACATTTCTAAGAATATATTTCTTGCTCAG
TTGTTTCAGGCAAGCCCAAGACTTTGTAATTTTTAAAGGGCCCAAGATTTTTTTTTTTTTTTTTTTTTTCAAATAACAGACCAGCTTCTT
TTTCTTGCAGTTACAGATGTAATTTCCTTTTTGTTGTCAAACATAAGGTACCAAATATGATGCAATAAATTGTTTTGAAAAACAGTTGTG
TGAATATTTCAACTAATCTGTGTTGGGCTTCTGTGAAATACACAGGTGGAAACAGAGGTGCAAGCCAGAGGCAATGTAATATGCTGTAAG
GCTAGTGCAGATGGGAGCTTTTTAGAAGGGGCTAAGTGCTGGTGTCAGGGAAATTCCATAATGAAGTAGAATGCTGCTCCTGCATTAAGA
TTTCATTGAGGGCAAGGCTGGTGGCAGGTACTATGAATGTAATTCATAATTTAAAAGGAAAACTAAAAACTATTTTGATTTGGGAAAATG
AGCCTTAATTTGTTAAACCTATACACTGAGAACTAGCCTCAGGCTTAATATTCTCATTGCATTTGCAAGATCTGAGCAAATAAGATTAAG
TAAAACAAATCAATTGTATATATAATTGACCTTTTTGTGGAACATGTAGTTTATAGAAAGTATACTCTAAAGGGAATTTGCCGAAGACCT
TTTACTGATTGAACAGTTGTGCTACAATCAACTTTTCATAGTACATGACCTGCATTCCACATCTCAGTCTAACAGTTTAGTAGTGATGTA
AAGAGAAGTACAAACCGAACTCCAGTGCTTTGTTATGTTTTATTAACTGGCCCTGTCTCAGGAACATCTTAACAGATGGCAAAAAAACAA
AAACTTTTTTTCAACTCCTATGAGTGGCAACTGAAGTTCTTATTGTTGGGAAAGAACACTAGTCCTACCTCTGCCACTAATGAGGTGTTT
GGAGGAGGTACCAGCCATATAATAGGGGGTGTATGTGTGAATTTTGTTTAAACTCTACTGTATATTGAAATGAAATTCATTTATTTGTCT
TGACAATGTTCAAATGATGTAGATTGTCTTAGAATGAATATTCATAAGTACTCAGAACTCTTAAGATGCAGATGCCACCCGTGAGGAGCT
AAATTCCTAATGTGTATTGTATTCCAACCCAATTTTACTGGAACTATTGAATAAATCTTTTATTTTCTTTCAGGTTTACTTGATAGTGGA
TACATTGGTGTCAGAGGACCTTGACTGGGTTCATTTTATGTCCAGACATCACCCCTGAAACACTGTACTGTACTATCTTGCTCTGAGTAG
TATCAACTGGATCGTCTCATATTTGCCTCATTCATCCTATTAATTTCAAATGATACTGTGGGGGAAAAACAGGGTTAGAAAAACAAGTGG
AAAAAATGGAGTGATGTTGTAATCTAAACAAGTGCCTTATGTTTATTGCTAAGAACTGGTGTTACCAACCCTTTTGAGAAGAAAGGGTCT
CTTGACCTGTATTAACATAGGAAAGTAAAGTTCTTTTTGTTTTTCTTTATATTCAGCTACTCTTGTCATTTCTCGTTGAAAAACTAAAAT
CTGACTAGGTTAGTTTACTCAGCTTTAATTAGATAGTTGAGTCATATATTTTCAACATTTTTCTGTATCGTATTTATTATGCACAAAAAT
AAAGTGTGATCTCTAATAGCATGGCTAAAGGTAATGCCAATATTAGTGAATAGCTTTGCTGTGGGCTTTATCAGGCCCTTGTTTTTCACA
GTGTTGTTTGTACTCCATGGTGTATTGCTATTAGAGCTGTGAAATGAAAGCTGTGACTTTATGAAGAGATCAAAAAAAGTGTGGTCCAGA
TTCAGGTAGGTTGTGGTAAGTTCAGGGAAGTCTTGTATGATTTCTAAGAATATTTCAGTTACCATGTATAGTATTTGGGAAAAGCTGTAA
TGTAAAATATTGGACTTTGTTGCAAGATAGGTATATACTTGTGTCAGATAATTAAAGCCTTAAATTTTGAAATGAATCTTTGAGATTTCA
AAGAAAGACTGACCTTTCAAATAGAAGGCGGTCTTTCTATTCTAGCTAATGCCCCACTTCTTTAAGTTATAAACAAAGTTTCATGATACC
ATTCTGCTATCATCTAAACTTTGCTGAACTCTACTGCCAACCTACATTAAAAACAAAGTCCCACGAAATGGCTGTGTTACACCATAGTGA

>4109_4109_6_ALPK1-NAA15_ALPK1_chr4_113362261_ENST00000504176_NAA15_chr4_140291365_ENST00000398947_length(amino acids)=1464AA_BP=1184
MCYCKSASKCWISSCWKRQMCRKRTRARTSAAEASLRASILARDCAAAAAIVFLVDRFLYGLDVSGKLLQVAKGLHKLQPATPIAPQVVI
RQARISVNSGKLLKAEYILSSLISNNGATGTWLYRNESDKVLVQSVCIQIRGQILQKLGMWYEAAELIWASIVGYLALPQPDKKGLSTSL
GILADIFVSMSKNDYEKFKNNPQINLSLLKEFDHHLLSAAEACKLAAAFSAYTPLFVLTAVNIRGTCLLSYSSSNDCPPELKNLHLCEAK
EAFEIGLLTKRDDEPVTGKQELHSFVKAAFGLTTVHRRLHGETGTVHAASQLCKEAMGKLYNFSTSSRSQDREALSQEVMSVIAQVKEHL
QVQSFSNVDDRSYVPESFECRLDKLILHGQGDFQKILDTYSQHHTSVCEVFESDCGNNKNEQKDAKTGVCITALKTEIKNIDTVSTTQEK
PHCQRDTGISSSLMGKNVQRELRRGGRRNWTHSDAFRVSLDQDVETETEPSDYSNGEGAVFNKSLSGSQTSSAWSNLSGFSSSASWEEVN
YHVDDRSARKEPGKEHLVDTQCSTALSEELENDREGRAMHSLHSQLHDLSLQEPNNDNLEPSQNQPQQQMPLTPFSPHNTPGIFLAPGAG
LLEGAPEGIQEVRNMGPRNTSAHSRPSYRSASWSSDSGRPKNMGTHPSVQKEEAFEIIVEFPETNCDVKDRQGKEQGEEISERGAGPTFK
ASPSWVDPEGETAESTEDAPLDFHRVLHNSLGNISMLPCSSFTPNWPVQNPDSRKSGGPVAEQGIDPDASTVDEEGQLLDSMDVPCTNGH
GSHRLCILRQPPGQRAETPNSSVSGNILFPVLSEDCTTTEEGNQPGNMLNCSQNSSSSSVWWLKSPAFSSGSSEGDSPWSYLNSSGSSWV
SLPGKMRKEILEARTLQPDDFEKLLAGVRHDWLFQRLENTGVFKPSQLHRAHSALLLKYSKKSELWTAQETIVYLGDYLTVKKKGRQRNA
FWVHHLHQEEILGRYVGKDYKEQKGLWHHFTDVERQMTAQHYVTEFNKRLYEQNIPTQIFYIPSTILLILEDKTIKGCISVEPYILGEFV
KLSNNTKVVKTEYKATEYGLAYGHFSYEFSNHRDVVVDLQGWVTGNGKGLIYLTDPQIHSVDQKVFTTNFGKRGIFYFFNNQHVECNEIC
HRLSLTRPSMEKPSNMSDKELKKLRNKQRRAQKKAQIEEEKKNAEKEKQQRNQKKKKDDDDEEIGGPKEELIPEKLAKVETPLEEAIKFL
TPLKNLVKNKIETHLFAFEIYFRKEKFLLMLQSVKRAFAIDSSHPWLHECMIRLFNTVCESKDLSDTVRTVLKQEMNRLFGATNPKNFNE
TFLKRNSDSLPHRLSAAKMVYYLDPSSQKRAIELATTLDESLTNRNLQTCMEVLEALYDGSLGDCKEAAEIYRANCHKLFPYALAFMPPG

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for ALPK1-NAA15


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with
TgeneNAA15chr4:113362261chr4:140291365ENST000002965431320500_866584.3333333333334867.0HYPK


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for ALPK1-NAA15


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for ALPK1-NAA15


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource
HgeneALPK1C0011849Diabetes Mellitus1CTD_human
HgeneALPK1C0011881Diabetic Nephropathy1CTD_human
HgeneALPK1C0017667Nodular glomerulosclerosis1CTD_human
HgeneALPK1C0018099Gout1CTD_human
HgeneALPK1C0027719Nephrosclerosis1CTD_human
HgeneALPK1C1262477Weight decreased1CTD_human
TgeneC0020796Profound Mental Retardation1CTD_human
TgeneC0025363Mental Retardation, Psychosocial1CTD_human
TgeneC0917816Mental deficiency1CTD_human
TgeneC3714756Intellectual Disability1CTD_human;GENOMICS_ENGLAND
TgeneC4540470MENTAL RETARDATION, AUTOSOMAL DOMINANT 501GENOMICS_ENGLAND;UNIPROT