Center for Computational Systems Medicine

Fusion gene:MGST1-PKP4 (FusionGDB2 ID:53441)

Fusion Gene Summary for MGST1-PKP4

Fusion gene summary
Fusion gene information
Fusion gene name: MGST1-PKP4
Fusion gene ID: 53441

Field | Hgene | Tgene
Gene symbol | MGST1 | PKP4
Gene ID | 4257 | 8502
Gene name | microsomal glutathione S-transferase 1 | plakophilin 4
Synonyms | GST12, MGST, MGST-I | p0071
Cytomap | 12p12.3 | 2q24.1
Type of gene | protein-coding | protein-coding
Description | microsomal glutathione S-transferase 1; glutathione S-transferase 12; microsomal GST-1; microsomal GST-I | plakophilin-4; catenin 4
Modification date | 20200320 | 20200313
UniProtAcc | P10620 | .
Ensembl transcripts involved in fusion gene | ENST00000010404, ENST00000396207, ENST00000396209, ENST00000396210, ENST00000535309, ENST00000540056, ENST00000359720 | ENST00000495123, ENST00000389757, ENST00000389759
Fusion gene scores: * DoF score | 13 X 8 X 6 = 624 | 15 X 19 X 6 = 1710
# samples | 13 | 22
** MAII score | log2(13/624*10) = -2.26303440583379 | log2(22/1710*10) = -2.9584208962486
Classification | possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs): DoF>8 and MAII<0 | possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs): DoF>8 and MAII<0

Context: PubMed: MGST1 [Title/Abstract] AND PKP4 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpoint: MGST1(16507204)-PKP4(159433783), # samples: 1
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)
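
The two score definitions above can be reproduced with a few lines of Python. This is a minimal sketch: the function names `dof_score`, `maii_score`, and `is_peginpcfg` are illustrative and not part of FusionGDB, and the inputs are the partner, breakpoint, cancer-type, and sample counts listed in the summary.

```python
import math


def dof_score(n_partners: int, n_breakpoints: int, n_cancer_types: int) -> int:
    """Degree of Frequency = # partners x # break points x # cancer types."""
    return n_partners * n_breakpoints * n_cancer_types


def maii_score(n_samples: int, dof: int) -> float:
    """Major Active Isofusion Index = log2(# samples / DoF score * 10)."""
    return math.log2(n_samples / dof * 10)


def is_peginpcfg(dof: int, maii: float) -> bool:
    """Criterion stated on the page: DoF > 8 and MAII < 0."""
    return dof > 8 and maii < 0


# Values for MGST1 (Hgene) and PKP4 (Tgene) taken from the summary.
for gene, counts, n_samples in [("MGST1", (13, 8, 6), 13), ("PKP4", (15, 19, 6), 22)]:
    dof = dof_score(*counts)
    maii = maii_score(n_samples, dof)
    print(gene, dof, maii, is_peginpcfg(dof, maii))
```

Both genes satisfy the DoF>8 and MAII<0 criterion with these inputs.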

Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
Partner | Gene | GO ID | GO term | PubMed ID
Hgene | MGST1 | GO:0055114 | oxidation-reduction process | 20727966
Hgene | MGST1 | GO:0071449 | cellular response to lipid hydroperoxide | 20727966
Tgene | PKP4 | GO:0043547 | positive regulation of GTPase activity | 17115030

Fusion gene breakpoints across MGST1 (5'-gene)
[Figure: all structure. The image links to the UCSC Genome Browser with a custom track.]

Fusion gene breakpoints across PKP4 (3'-gene)
[Figure: all structure. The image links to the UCSC Genome Browser with a custom track.]

Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinates were lifted over to hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
Source | Disease | Sample | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
ChimerDB4 | STAD | TCGA-BR-8078-01A | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | +



Fusion Gene ORF analysis for MGST1-PKP4

Open reading frame (ORF) analysis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORF | Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
5CDS-intron | ENST00000010404 | ENST00000495123 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | +
5CDS-intron | ENST00000396207 | ENST00000495123 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | +
5CDS-intron | ENST00000396209 | ENST00000495123 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | +
5CDS-intron | ENST00000396210 | ENST00000495123 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | +
5CDS-intron | ENST00000535309 | ENST00000495123 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | +
5CDS-intron | ENST00000540056 | ENST00000495123 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | +
In-frame | ENST00000010404 | ENST00000389757 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | +
In-frame | ENST00000010404 | ENST00000389759 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | +
In-frame | ENST00000396207 | ENST00000389757 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | +
In-frame | ENST00000396207 | ENST00000389759 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | +
In-frame | ENST00000396209 | ENST00000389757 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | +
In-frame | ENST00000396209 | ENST00000389759 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | +
In-frame | ENST00000396210 | ENST00000389757 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | +
In-frame | ENST00000396210 | ENST00000389759 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | +
In-frame | ENST00000535309 | ENST00000389757 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | +
In-frame | ENST00000535309 | ENST00000389759 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | +
In-frame | ENST00000540056 | ENST00000389757 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | +
In-frame | ENST00000540056 | ENST00000389759 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | +
intron-3CDS | ENST00000359720 | ENST00000389757 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | +
intron-3CDS | ENST00000359720 | ENST00000389759 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | +
intron-intron | ENST00000359720 | ENST00000495123 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | +

ORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
Henst | Hgene | Hchr | Hbp | Hstrand | Tenst | Tgene | Tchr | Tbp | Tstrand | Seq length (transcript) | BP loci (transcript) | Predicted start (transcript) | Predicted stop (transcript) | Seq length (amino acids)
ENST00000010404 | MGST1 | chr12 | 16507204 | + | ENST00000389757 | PKP4 | chr2 | 159433783 | + | 5671 | 151 | 46 | 3468 | 1140
ENST00000010404 | MGST1 | chr12 | 16507204 | + | ENST00000389759 | PKP4 | chr2 | 159433783 | + | 4350 | 151 | 46 | 3597 | 1183
ENST00000396210 | MGST1 | chr12 | 16507204 | + | ENST00000389757 | PKP4 | chr2 | 159433783 | + | 5690 | 170 | 65 | 3487 | 1140
ENST00000396210 | MGST1 | chr12 | 16507204 | + | ENST00000389759 | PKP4 | chr2 | 159433783 | + | 4369 | 170 | 65 | 3616 | 1183
ENST00000535309 | MGST1 | chr12 | 16507204 | + | ENST00000389757 | PKP4 | chr2 | 159433783 | + | 5645 | 125 | 20 | 3442 | 1140
ENST00000535309 | MGST1 | chr12 | 16507204 | + | ENST00000389759 | PKP4 | chr2 | 159433783 | + | 4324 | 125 | 20 | 3571 | 1183
ENST00000540056 | MGST1 | chr12 | 16507204 | + | ENST00000389757 | PKP4 | chr2 | 159433783 | + | 5749 | 229 | 88 | 3546 | 1152
ENST00000540056 | MGST1 | chr12 | 16507204 | + | ENST00000389759 | PKP4 | chr2 | 159433783 | + | 4428 | 229 | 88 | 3675 | 1195
ENST00000396209 | MGST1 | chr12 | 16507204 | + | ENST00000389757 | PKP4 | chr2 | 159433783 | + | 5749 | 229 | 88 | 3546 | 1152
ENST00000396209 | MGST1 | chr12 | 16507204 | + | ENST00000389759 | PKP4 | chr2 | 159433783 | + | 4428 | 229 | 88 | 3675 | 1195
ENST00000396207 | MGST1 | chr12 | 16507204 | + | ENST00000389757 | PKP4 | chr2 | 159433783 | + | 5679 | 159 | 54 | 3476 | 1140
ENST00000396207 | MGST1 | chr12 | 16507204 | + | ENST00000389759 | PKP4 | chr2 | 159433783 | + | 4358 | 159 | 54 | 3605 | 1183
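
As a consistency check on the ORFfinder table above, the listed amino-acid length can be recovered from the predicted start and stop positions. The sketch below assumes that the stop position marks the end of the stop codon, so that integer division of the span by three gives the number of translated codons. That coordinate convention is an inference from the listed values, not something the page states, and `aa_length` is a hypothetical helper name.

```python
# A few rows from the ORFfinder table:
# (Henst, Tenst, predicted start, predicted stop, listed amino-acid length)
ROWS = [
    ("ENST00000010404", "ENST00000389757", 46, 3468, 1140),
    ("ENST00000010404", "ENST00000389759", 46, 3597, 1183),
    ("ENST00000540056", "ENST00000389757", 88, 3546, 1152),
    ("ENST00000540056", "ENST00000389759", 88, 3675, 1195),
    ("ENST00000396207", "ENST00000389757", 54, 3476, 1140),
]


def aa_length(start: int, stop: int) -> int:
    """Number of translated codons between start and stop (assumed convention:
    the span includes the stop codon, which is not counted as a residue)."""
    return (stop - start) // 3


for henst, tenst, start, stop, listed in ROWS:
    print(henst, tenst, aa_length(start, stop), listed)
```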

DeepORF prediction of coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on a convolutional neural network, built by comparison with real Ribo-seq data. If the no-coding score is < 0.5 and the coding score is > 0.5, the in-frame fusion transcript is predicted as likely to be translated.
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | No-coding score | Coding score
ENST00000010404 | ENST00000389757 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | + | 0.000519511 | 0.99948055
ENST00000010404 | ENST00000389759 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | + | 0.001059577 | 0.9989404
ENST00000396210 | ENST00000389757 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | + | 0.000556738 | 0.99944323
ENST00000396210 | ENST00000389759 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | + | 0.001118667 | 0.9988813
ENST00000535309 | ENST00000389757 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | + | 0.000506595 | 0.99949336
ENST00000535309 | ENST00000389759 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | + | 0.00103928 | 0.99896073
ENST00000540056 | ENST00000389757 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | + | 0.000567236 | 0.99943274
ENST00000540056 | ENST00000389759 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | + | 0.001130453 | 0.9988695
ENST00000396209 | ENST00000389757 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | + | 0.000567236 | 0.99943274
ENST00000396209 | ENST00000389759 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | + | 0.001130453 | 0.9988695
ENST00000396207 | ENST00000389757 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | + | 0.000498012 | 0.99950194
ENST00000396207 | ENST00000389759 | MGST1 | chr12 | 16507204 | + | PKP4 | chr2 | 159433783 | + | 0.001004751 | 0.99899524
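
The decision rule stated for the DeepORF scores can be written as a one-line predicate. This is a sketch of the thresholding step only, not of the DeepORF classifier itself; `likely_translated` is an illustrative name, and the score pairs are copied from the table above.

```python
def likely_translated(no_coding: float, coding: float, threshold: float = 0.5) -> bool:
    """Rule from the page: no-coding score < 0.5 and coding score > 0.5."""
    return no_coding < threshold and coding > threshold


# (Henst, Tenst, no-coding score, coding score) for three of the listed transcripts.
SCORES = [
    ("ENST00000010404", "ENST00000389757", 0.000519511, 0.99948055),
    ("ENST00000010404", "ENST00000389759", 0.001059577, 0.9989404),
    ("ENST00000396207", "ENST00000389759", 0.001004751, 0.99899524),
]

for henst, tenst, no_coding, coding in SCORES:
    print(henst, tenst, likely_translated(no_coding, coding))
```

Every in-frame transcript pair in the table has a coding score above 0.99, so the rule classifies all of them as likely translated.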


Fusion Genomic Features for MGST1-PKP4


FusionAI prediction of potential fusion gene breakpoints based on the pre-mRNA sequence context (+/- 5 kb for each partner gene, 20 kb of sequence in total). FusionAI is a fusion gene breakpoint classifier based on a convolutional neural network, built by comparing the fusion-positive and fusion-negative sequence contexts of ~20K fusion genes. From this, we can estimate how likely each sequence within the 20 kb genomic region is to be used as a gene fusion breakpoint.
Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | 1-p | p (fusion gene breakpoint)

Distribution of 44 human genomic feature loci across the 20 kb fusion breakpoint regions. We integrated 44 types of human genomic feature loci in five broad categories: virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are on the help page.
[Figure: genomic feature]

Distribution of 44 human genomic feature loci across the 20 kb fusion breakpoint regions that overlap the regions with the top 1% feature importance scores. More details are on the help page.


Fusion Protein Features for MGST1-PKP4


Four levels of functional features of fusion genes
Go to the FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr12:16507204/chr2:159433783)
- FGviewer provides online visualization of the retention of protein functional features across the DNA, RNA, protein, and pathological levels.
- How to search
1. Enter your fusion gene symbol.
2. Press the Tab key until the breakpoint information is filled in.
3. Scroll down and press the 'Search' tab twice.
4. Scroll down to find the hyperlink to the search result.
5. Click the hyperlink.
6. View the FGviewer result for your fusion gene.

Main function of each fusion partner protein (from UniProt).
Hgene | Tgene
MGST1 (P10620) | .
FUNCTION: Conjugation of reduced glutathione to a wide number of exogenous and endogenous hydrophobic electrophiles. Has a wide substrate specificity. | FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.

Retention analysis result of each fusion partner protein across 39 UniProt protein features: six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and three secondary structure features. Because of limited viewing space, we show only the retention information for the 13 regional features. All retention annotation results can be downloaded from the download page.

* A negative BPloci value means that the breakpoint is located before the CDS.
- In-frame and retained protein features among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Tgene | PKP4 | chr12:16507204 | chr2:159433783 | ENST00000389757 | | 1 | 21 | 788_794 | 44 | 1150.0 | Compositional bias | Note=Poly-Lys
Tgene | PKP4 | chr12:16507204 | chr2:159433783 | ENST00000389759 | | 1 | 22 | 788_794 | 44 | 1193.0 | Compositional bias | Note=Poly-Lys
Tgene | PKP4 | chr12:16507204 | chr2:159433783 | ENST00000389757 | | 1 | 21 | 415_455 | 44 | 1150.0 | Repeat | Note=ARM 1
Tgene | PKP4 | chr12:16507204 | chr2:159433783 | ENST00000389757 | | 1 | 21 | 518_557 | 44 | 1150.0 | Repeat | Note=ARM 2
Tgene | PKP4 | chr12:16507204 | chr2:159433783 | ENST00000389757 | | 1 | 21 | 560_599 | 44 | 1150.0 | Repeat | Note=ARM 3
Tgene | PKP4 | chr12:16507204 | chr2:159433783 | ENST00000389757 | | 1 | 21 | 604_644 | 44 | 1150.0 | Repeat | Note=ARM 4
Tgene | PKP4 | chr12:16507204 | chr2:159433783 | ENST00000389757 | | 1 | 21 | 660_702 | 44 | 1150.0 | Repeat | Note=ARM 5
Tgene | PKP4 | chr12:16507204 | chr2:159433783 | ENST00000389757 | | 1 | 21 | 706_751 | 44 | 1150.0 | Repeat | Note=ARM 6
Tgene | PKP4 | chr12:16507204 | chr2:159433783 | ENST00000389757 | | 1 | 21 | 815_855 | 44 | 1150.0 | Repeat | Note=ARM 7
Tgene | PKP4 | chr12:16507204 | chr2:159433783 | ENST00000389757 | | 1 | 21 | 862_901 | 44 | 1150.0 | Repeat | Note=ARM 8
Tgene | PKP4 | chr12:16507204 | chr2:159433783 | ENST00000389757 | | 1 | 21 | 950_993 | 44 | 1150.0 | Repeat | Note=ARM 9
Tgene | PKP4 | chr12:16507204 | chr2:159433783 | ENST00000389759 | | 1 | 22 | 415_455 | 44 | 1193.0 | Repeat | Note=ARM 1
Tgene | PKP4 | chr12:16507204 | chr2:159433783 | ENST00000389759 | | 1 | 22 | 518_557 | 44 | 1193.0 | Repeat | Note=ARM 2
Tgene | PKP4 | chr12:16507204 | chr2:159433783 | ENST00000389759 | | 1 | 22 | 560_599 | 44 | 1193.0 | Repeat | Note=ARM 3
Tgene | PKP4 | chr12:16507204 | chr2:159433783 | ENST00000389759 | | 1 | 22 | 604_644 | 44 | 1193.0 | Repeat | Note=ARM 4
Tgene | PKP4 | chr12:16507204 | chr2:159433783 | ENST00000389759 | | 1 | 22 | 660_702 | 44 | 1193.0 | Repeat | Note=ARM 5
Tgene | PKP4 | chr12:16507204 | chr2:159433783 | ENST00000389759 | | 1 | 22 | 706_751 | 44 | 1193.0 | Repeat | Note=ARM 6
Tgene | PKP4 | chr12:16507204 | chr2:159433783 | ENST00000389759 | | 1 | 22 | 815_855 | 44 | 1193.0 | Repeat | Note=ARM 7
Tgene | PKP4 | chr12:16507204 | chr2:159433783 | ENST00000389759 | | 1 | 22 | 862_901 | 44 | 1193.0 | Repeat | Note=ARM 8
Tgene | PKP4 | chr12:16507204 | chr2:159433783 | ENST00000389759 | | 1 | 22 | 950_993 | 44 | 1193.0 | Repeat | Note=ARM 9

- In-frame and not-retained protein features among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000010404 | + | 1 | 4 | 124_128 | 0 | 156.0 | Topological domain | Cytoplasmic
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000010404 | + | 1 | 4 | 149_155 | 0 | 156.0 | Topological domain | Lumenal
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000010404 | + | 1 | 4 | 34_62 | 0 | 156.0 | Topological domain | Cytoplasmic
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000010404 | + | 1 | 4 | 3_9 | 0 | 156.0 | Topological domain | Lumenal
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000010404 | + | 1 | 4 | 97_99 | 0 | 156.0 | Topological domain | Lumenal
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396207 | + | 1 | 4 | 124_128 | 0 | 156.0 | Topological domain | Cytoplasmic
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396207 | + | 1 | 4 | 149_155 | 0 | 156.0 | Topological domain | Lumenal
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396207 | + | 1 | 4 | 34_62 | 0 | 156.0 | Topological domain | Cytoplasmic
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396207 | + | 1 | 4 | 3_9 | 0 | 156.0 | Topological domain | Lumenal
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396207 | + | 1 | 4 | 97_99 | 0 | 156.0 | Topological domain | Lumenal
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396209 | + | 1 | 4 | 124_128 | 0 | 156.0 | Topological domain | Cytoplasmic
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396209 | + | 1 | 4 | 149_155 | 0 | 156.0 | Topological domain | Lumenal
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396209 | + | 1 | 4 | 34_62 | 0 | 156.0 | Topological domain | Cytoplasmic
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396209 | + | 1 | 4 | 3_9 | 0 | 156.0 | Topological domain | Lumenal
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396209 | + | 1 | 4 | 97_99 | 0 | 156.0 | Topological domain | Lumenal
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396210 | + | 1 | 4 | 124_128 | 0 | 156.0 | Topological domain | Cytoplasmic
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396210 | + | 1 | 4 | 149_155 | 0 | 156.0 | Topological domain | Lumenal
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396210 | + | 1 | 4 | 34_62 | 0 | 156.0 | Topological domain | Cytoplasmic
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396210 | + | 1 | 4 | 3_9 | 0 | 156.0 | Topological domain | Lumenal
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396210 | + | 1 | 4 | 97_99 | 0 | 156.0 | Topological domain | Lumenal
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000535309 | + | 1 | 4 | 124_128 | 0 | 88.0 | Topological domain | Cytoplasmic
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000535309 | + | 1 | 4 | 149_155 | 0 | 88.0 | Topological domain | Lumenal
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000535309 | + | 1 | 4 | 34_62 | 0 | 88.0 | Topological domain | Cytoplasmic
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000535309 | + | 1 | 4 | 3_9 | 0 | 88.0 | Topological domain | Lumenal
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000535309 | + | 1 | 4 | 97_99 | 0 | 88.0 | Topological domain | Lumenal
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000010404 | + | 1 | 4 | 100_123 | 0 | 156.0 | Transmembrane | Helical
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000010404 | + | 1 | 4 | 10_33 | 0 | 156.0 | Transmembrane | Helical
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000010404 | + | 1 | 4 | 129_148 | 0 | 156.0 | Transmembrane | Helical
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000010404 | + | 1 | 4 | 63_96 | 0 | 156.0 | Transmembrane | Helical
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396207 | + | 1 | 4 | 100_123 | 0 | 156.0 | Transmembrane | Helical
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396207 | + | 1 | 4 | 10_33 | 0 | 156.0 | Transmembrane | Helical
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396207 | + | 1 | 4 | 129_148 | 0 | 156.0 | Transmembrane | Helical
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396207 | + | 1 | 4 | 63_96 | 0 | 156.0 | Transmembrane | Helical
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396209 | + | 1 | 4 | 100_123 | 0 | 156.0 | Transmembrane | Helical
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396209 | + | 1 | 4 | 10_33 | 0 | 156.0 | Transmembrane | Helical
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396209 | + | 1 | 4 | 129_148 | 0 | 156.0 | Transmembrane | Helical
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396209 | + | 1 | 4 | 63_96 | 0 | 156.0 | Transmembrane | Helical
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396210 | + | 1 | 4 | 100_123 | 0 | 156.0 | Transmembrane | Helical
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396210 | + | 1 | 4 | 10_33 | 0 | 156.0 | Transmembrane | Helical
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396210 | + | 1 | 4 | 129_148 | 0 | 156.0 | Transmembrane | Helical
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000396210 | + | 1 | 4 | 63_96 | 0 | 156.0 | Transmembrane | Helical
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000535309 | + | 1 | 4 | 100_123 | 0 | 88.0 | Transmembrane | Helical
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000535309 | + | 1 | 4 | 10_33 | 0 | 88.0 | Transmembrane | Helical
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000535309 | + | 1 | 4 | 129_148 | 0 | 88.0 | Transmembrane | Helical
Hgene | MGST1 | chr12:16507204 | chr2:159433783 | ENST00000535309 | + | 1 | 4 | 63_96 | 0 | 88.0 | Transmembrane | Helical
Tgene | PKP4 | chr12:16507204 | chr2:159433783 | ENST00000389757 | | 1 | 21 | 36_70 | 44 | 1150.0 | Coiled coil | Ontology_term=ECO:0000255
Tgene | PKP4 | chr12:16507204 | chr2:159433783 | ENST00000389759 | | 1 | 22 | 36_70 | 44 | 1193.0 | Coiled coil | Ontology_term=ECO:0000255
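
The retained and not-retained assignments in the two tables above are consistent with a simple positional rule, sketched below. The rule is an assumption inferred from the listed rows, not FusionGDB's documented algorithm: a 5'-partner (Hgene) feature is kept only if it ends at or before the breakpoint, and a 3'-partner (Tgene) feature is kept only if it starts after it. The helper names and the exact boundary handling are hypothetical.

```python
from typing import Tuple


def parse_loci(loci: str) -> Tuple[int, int]:
    """Split a 'start_end' protein feature locus such as '415_455'."""
    start, end = loci.split("_")
    return int(start), int(end)


def is_retained(partner: str, loci: str, bp_loci: int) -> bool:
    """Assumed rule: a feature must lie entirely on the side of the
    breakpoint that the fusion protein keeps."""
    start, end = parse_loci(loci)
    if partner == "Hgene":  # 5' partner contributes residues before the breakpoint
        return end <= bp_loci
    if partner == "Tgene":  # 3' partner contributes residues after the breakpoint
        return start > bp_loci
    raise ValueError(f"unknown partner: {partner}")


# Rows taken from the tables: (partner, feature loci, BPloci, feature note)
for partner, loci, bp, note in [
    ("Tgene", "415_455", 44, "ARM 1"),
    ("Tgene", "788_794", 44, "Poly-Lys"),
    ("Tgene", "36_70", 44, "Coiled coil"),
    ("Hgene", "3_9", 0, "Lumenal"),
]:
    print(partner, loci, note, is_retained(partner, loci, bp))
```

Under this rule the PKP4 coiled coil (36_70) is not retained because it spans the breakpoint at position 44, which matches its placement in the not-retained table.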



Fusion Gene Sequence for MGST1-PKP4


For in-frame fusion transcripts, we provide the fusion transcript sequences and the fusion amino acid sequences. To obtain each fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among all predicted ones.
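
The idea of choosing the longest predicted ORF can be illustrated with a naive scan of the three forward reading frames. This sketch is not NCBI ORFfinder and is not guaranteed to reproduce the coordinates reported on this page. It only shows the selection step, assuming ATG start codons, the standard stop codons, and forward-strand ORFs only; `longest_orf` is a hypothetical helper.

```python
from typing import Tuple

STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_orf(seq: str) -> Tuple[int, int]:
    """Return (start, end) of the longest ATG..stop ORF on the forward strand.

    start is 0-based, end is exclusive and includes the stop codon.
    Returns (-1, -1) when no complete ORF is found.
    """
    seq = seq.upper()
    best = (-1, -1)
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first ATG opens a candidate ORF in this frame
            elif start is not None and codon in STOP_CODONS:
                if i + 3 - start > best[1] - best[0]:
                    best = (start, i + 3)
                start = None  # look for the next ORF in the same frame
    return best


# Toy sequence with a short ORF in frame 2 and a longer one in frame 0.
toy = "CCATGAAATAGGATGCCCGGGTTTTGA"
start, end = longest_orf(toy)
print(start, end, toy[start:end])
```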
>53441_53441_1_MGST1-PKP4_MGST1_chr12_16507204_ENST00000010404_PKP4_chr2_159433783_ENST00000389757_length(transcript)=5671nt_BP=151nt
TCGTGACAAAGCAAATTGTCTGGTATCATGAATTTGGAACACCGTAATGGATGATGAAGTATTCATGGCTTTTGCATCCTATGCAACAAT
TATTCTTTCAAAAATGATGCTTATGAGTACTGCAACTGCATTCTATAGATTGACAAGAAAGGAGCTTCAGTTTCAGCGACTCACCCGAGA
ACTGGAAGTGGAAAGGCAGATTGTTGCCAGTCAGCTAGAAAGATGTAGGCTTGGAGCAGAATCACCAAGCATCGCCAGCACCAGCTCAAC
TGAGAAGTCATTTCCTTGGAGATCAACAGACGTGCCAAATACTGGTGTAAGCAAACCTAGAGTTTCTGACGCTGTCCAGCCCAACAACTA
TCTCATCAGGACAGAGCCAGAACAAGGAACCCTCTATTCACCAGAACAGACATCTCTCCATGAAAGTGAGGGATCATTGGGTAACTCAAG
AAGTTCAACACAAATGAATTCTTATTCCGACAGTGGATACCAGGAAGCAGGGAGTTTCCACAACAGCCAGAACGTGAGCAAGGCAGACAA
CAGACAGCAGCATTCATTCATAGGATCAACTAACAACCATGTGGTGAGGAATTCAAGAGCTGAAGGACAAACACTGGTTCAGCCATCAGT
AGCCAATCGGGCCATGAGAAGAGTTAGTTCAGTTCCATCTAGAGCACAGTCTCCTTCTTATGTTATCAGCACAGGCGTGTCTCCTTCAAG
GGGGTCTCTGAGAACTTCTCTGGGTAGTGGATTTGGCTCTCCGTCAGTGACCGACCCCCGACCTCTGAACCCCAGTGCATATTCCTCCAC
CACATTACCTGCTGCACGGGCAGCCTCTCCGTACTCACAGAGACCCGCCTCCCCAACAGCTATACGGCGGATTGGGTCAGTCACCTCCCG
GCAGACCTCCAATCCCAACGGACCAACCCCTCAATACCAAACCACCGCCAGAGTGGGGTCCCCACTGACCCTGACGGATGCACAGACTCG
AGTAGCTTCCCCATCCCAAGGCCAGGTGGGGTCGTCGTCCCCCAAACGCTCAGGGATGACCGCCGTACCACAGCATCTGGGACCTTCACT
GCAAAGGACTGTTCATGACATGGAGCAATTCGGACAGCAGCAGTATGACATTTATGAGAGGATGGTTCCACCCAGGCCAGACAGCCTGAC
AGGCTTACGGAGTTCCTATGCTAGTCAGCATAGTCAGCTTGGGCAAGACCTTCGTTCTGCCGTGTCTCCCGACTTGCACATTACTCCTAT
ATATGAGGGGAGGACCTATTACAGCCCAGTGTACCGCAGCCCAAACCATGGAACTGTGGAGCTCCAAGGATCGCAGACGGCGTTGTATCG
CACAGGTTCAGTAGGTATTGGAAATCTACAAAGGACATCCAGCCAACGAAGTACCCTTACATACCAAAGAAATAATTATGCTCTGAACAC
AACAGCTACCTACGCGGAGCCCTACAGGCCTATACAATACCGAGTGCAAGAGTGCAATTATAACAGGCTTCAGCATGCAGTGCCGGCTGA
TGATGGCACCACAAGATCCCCATCAATAGACAGCATTCAGAAGGACCCCAGGGAGTTTGCCTGGCGTGATCCTGAGTTGCCTGAGGTCAT
TCACATGCTTCAGCACCAGTTCCCATCTGTTCAGGCAAATGCAGCGGCCTACCTGCAGCACCTGTGCTTTGGTGACAACAAAGTGAAGAT
GGAGGTGTGTAGGTTAGGGGGAATCAAGCATCTGGTTGACCTTCTGGACCACAGAGTTTTGGAAGTTCAGAAGAATGCTTGTGGTGCCCT
TCGAAACCTCGTTTTTGGCAAGTCTACAGATGAAAATAAAATAGCAATGAAGAATGTTGGTGGGATACCTGCCTTGTTGCGACTGTTGAG
AAAATCTATTGATGCAGAAGTAAGGGAGCTTGTTACAGGAGTTCTTTGGAATTTATCCTCATGTGATGCTGTAAAAATGACAATCATTCG
AGATGCTCTCTCAACCTTAACAAACACTGTGATTGTTCCACATTCTGGATGGAATAACTCTTCTTTTGATGATGATCATAAAATTAAATT
TCAGACTTCACTAGTTCTGCGTAACACGACAGGTTGCCTAAGGAACCTCAGCTCCGCGGGGGAAGAAGCTCGGAAGCAAATGCGGTCCTG
CGAGGGGCTGGTAGACTCACTGTTGTATGTGATCCACACGTGTGTGAACACATCCGATTACGACAGCAAGACGGTGGAGAACTGCGTGTG
CACCCTGAGGAACCTGTCCTATCGGCTGGAGCTGGAGGTGCCCCAGGCCCGGTTACTGGGACTGAACGAATTGGATGACTTACTAGGAAA
AGAGTCTCCCAGCAAAGACTCTGAGCCAAGTTGCTGGGGGAAGAAGAAGAAAAAGAAAAAGAGGACTCCGCAAGAAGATCAATGGGATGG
AGTTGGTCCTATCCCAGGACTGTCGAAGTCCCCCAAAGGGGTTGAGATGCTGTGGCACCCATCGGTGGTAAAACCATATCTGACTCTTCT
AGCAGAAAGTTCCAACCCAGCCACCTTGGAAGGCTCTGCAGGGTCTCTCCAGAACCTCTCTGCTGGCAACTGGAAGTTTGCAGCATATAT
CCGGGCGGCCGTCCGAAAAGAAAAGGGGCTCCCCATCCTTGTGGAGCTTCTGAGAATGGATAACGATAGAGTTGTTTCTTCCGTGGCAAC
AGCCTTGAGGAATATGGCACTAGATGTTCGCAACAAGGAGCTCATAGGCAAATACGCCATGCGAGACCTGGTCAACCGGCTCCCCGGCGG
CAATGGCCCCAGTGTCTTGTCTGATGAGACCATGGCAGCCATCTGCTGTGCTCTGCACGAGGTCACCAGCAAAAACATGGAGAACGCAAA
AGCCCTGGCCGACTCAGGAGGCATAGAGAAGCTGGTGAACATAACCAAAGGCAGGGGCGACAGATCATCTCTGAAAGTGGTGAAGGCAGC
AGCCCAGGTCTTGAATACATTATGGCAATATCGGGACCTCCGGAGCATTTATAAAAAGGATGGGTGGAATCAGAACCATTTTATTACACC
TGTGTCGACATTGGAGCGAGACCGATTCAAATCACATCCTTCCTTGTCTACCACCAACCAACAGATGTCACCCATCATTCAGTCAGGCTC
CAGCAAACCTTCACCAATTTACATCAGTTCCTATTCCTCACCAGCAAGAGAACAAAATAGACGGCTACAGCATCAACAGCTGTATTATAG
TCAAGATGACTCCAACAGAAAGAACTTTGATGCATACAGATTGTATTTGCAGTCTCCTCATAGCTATGAAGATCCTTATTTTGATGACCG
AGTTCACTTTCCAGCTTCTACTGATTACTCAACACAGTATGGACTGAAATCGACCACAAATTATGTAGACTTTTATTCCACTAAACGACC
TTCTTATAGAGCAGAACAGTACCCAGGGTCCCCAGACTCATGGGTGTAGCATCAAGATGCCCAACAGAGGAACTCTTTCTTTCTAACCTT
GTTCAGATTGAGGTGAAAAGTCCATCTTGCTGATTTGATGATTGAAATGTGAAAGTGAAGTGGAAGGAATGAATGAAGTGTGTTTTTTTT
TTCTTTTTTGAGGAATTATCAGGGAAGTGAGGAAATGTTTGGGAGAGGACTTTCTAAGCTCTATTTAGGTGTTAGATCTAATTACTTATA
GATTCTGTAGTCTGGTGAAGGTGTGGGTGACGTGATGAGAGGTTTGAGAAATGGGTGAAATGAAATGGGGGATATGTAGGTCAAATCAAA
TTAAAGATGATTTTTTTAATGTGAATAAAGTTATGTTCTGATAGTTTGTACAGAAAAAATAAAATGGATGCCCATGTTTTATTGCTATTA
CTAAATGTCAAGATTGTATGCTATTATGTCTTGTAAATTTCTTTTGTTGGTGTAAATATGGAAATGCCACATTGGTTAAGTGCCATCATT
TGTAATGCAATGTGTCACTTGAAAAGAGATTTGAAGAAACTGACAACTTCAAAAACAAATGAGAAGCCCAAGGAACTGTGAGCAATTAAA
AGCAAACCGCGACACCCTTTGTCTCCACCACACATAGTGTACTTTGGAAGCACAACGTCCAGGCTGGTACCGCAGCGCCATGCCCATTCC
TCGCCTCATTCATAGGACACTTCACTGCCATTTTCTATTCACATAAAAGAAAAATAAATGTGGAAATTTCATCCTTGGAAATGTGTTCTC
TATTCTTTTTAAATTTTCTGTATGTATATGGAAAAGCAGATTTAATAAACACAAATCTAACTGCATGTTGGAAACAGCTTAAATATATCC
AACTATTTTGGACTTTTCCTAGTTTTATATATACTTTGTGAGATGGACCACACCAAAGAATTAAGCTATGAACTTTCCTTCTCTGACAAT
CTAGGTATTTTTATTTCTTGGAATCTATTTCAAATCTAGTACCCTTCACAGTTTTATCATGTTTTTCTATATGGCAGCCCGGCCATAACT
AAATTGATATAACTAAATTGATACCTTTTTTAGGCATCATTTTTAAGAACTCAGTAACTTCATAAGAACAGTGGTTGGGCATTTATTTGG
TTGGCACCAATATAATAAAACAAATTCTAAATTATGATCTTGTTGAGTAATTCCTGGATTAGGAATGGGGTAGGGGAAAGAAGCCCCACA
GTAGGGTAGGCAGAGGTCCCCATAGCACCACACAGCATTACTTGGGAGCTTTTCTCCATCTGTGGAGGCAATGAGGGACACCAGGCCCAA
AGTCTGAGGCCTTCTCAGGTCTTGCAGTTTCCACCCTTCAGTCCTAGCTCCTGGATCTTTGTTGGTGTAAATATGGAAATGCCACATTGG
TTAAGTGCCATCATGGAAAACTAGGAACTAGTTTACCTGTAGCTGTACTAGGAAAACTAGGAACTAACGCCCATGGCAAGTCAAACCCAG
AAGTGTTGTCTTAGCTCAATATACCCTGCAAGTGTAATACAAATCGCAGATTAAAACATTTGTACTTATATTTGTGAGTATATGGAAAGA
CACCCAAGAGAAAACTGCGTTTTCTTGCCGAATGGACAGCAGCCCAAGATGAGCATCCAAAGTGAAATGGAAATACCAGTGCGAGCATGA
AAGTTCCTAAGATCTGCAAGCCGGGAGCCCTTCTGGGGTGCCTTCTGAAGAACACTCGAGCCAGCTTGCGGAGTCGACTGCCCAGCTACT
GAGCCCTGGCTGCCTCATACCTGGCGGAGCTCAGCAGACAGCCCTTGTTAGATGGCAGAGGTCGCCAGATGTTCCTGAGGAACTGGCATC
AGGACTTCTGGGACGTAACCACTTCAGTTTCCAATCACTGACTTCCCTGAACCTGGTATCTTCAGATCTGGGTTGAAGCTGAAGGCTCAG
TTGGGGAATCAGGAAAACAAGGGAATCGGAGCCAGCCCAAGCCAACTGACAATAACTTTCTTTTAGTGACTGCATTAGTTTGGGGTAATG
CTGTAACAGGCAGCATTTCAAAAACGGATCCAAATTTGGGTTGAAAATAACTCACCTGGGTCTCCAATCAGACCTAACAGAGCCAGAAAG
AGGCCTGTACACGGGGACTGGGCAGAAGATCTAATGCCGTTTAAGACTCCTCCCTTGAGACCAGAGAAAAGAAACTGCAGACACAGCCTG

>53441_53441_1_MGST1-PKP4_MGST1_chr12_16507204_ENST00000010404_PKP4_chr2_159433783_ENST00000389757_length(amino acids)=1140AA_BP=27
MDDEVFMAFASYATIILSKMMLMSTATAFYRLTRKELQFQRLTRELEVERQIVASQLERCRLGAESPSIASTSSTEKSFPWRSTDVPNTG
VSKPRVSDAVQPNNYLIRTEPEQGTLYSPEQTSLHESEGSLGNSRSSTQMNSYSDSGYQEAGSFHNSQNVSKADNRQQHSFIGSTNNHVV
RNSRAEGQTLVQPSVANRAMRRVSSVPSRAQSPSYVISTGVSPSRGSLRTSLGSGFGSPSVTDPRPLNPSAYSSTTLPAARAASPYSQRP
ASPTAIRRIGSVTSRQTSNPNGPTPQYQTTARVGSPLTLTDAQTRVASPSQGQVGSSSPKRSGMTAVPQHLGPSLQRTVHDMEQFGQQQY
DIYERMVPPRPDSLTGLRSSYASQHSQLGQDLRSAVSPDLHITPIYEGRTYYSPVYRSPNHGTVELQGSQTALYRTGSVGIGNLQRTSSQ
RSTLTYQRNNYALNTTATYAEPYRPIQYRVQECNYNRLQHAVPADDGTTRSPSIDSIQKDPREFAWRDPELPEVIHMLQHQFPSVQANAA
AYLQHLCFGDNKVKMEVCRLGGIKHLVDLLDHRVLEVQKNACGALRNLVFGKSTDENKIAMKNVGGIPALLRLLRKSIDAEVRELVTGVL
WNLSSCDAVKMTIIRDALSTLTNTVIVPHSGWNNSSFDDDHKIKFQTSLVLRNTTGCLRNLSSAGEEARKQMRSCEGLVDSLLYVIHTCV
NTSDYDSKTVENCVCTLRNLSYRLELEVPQARLLGLNELDDLLGKESPSKDSEPSCWGKKKKKKKRTPQEDQWDGVGPIPGLSKSPKGVE
MLWHPSVVKPYLTLLAESSNPATLEGSAGSLQNLSAGNWKFAAYIRAAVRKEKGLPILVELLRMDNDRVVSSVATALRNMALDVRNKELI
GKYAMRDLVNRLPGGNGPSVLSDETMAAICCALHEVTSKNMENAKALADSGGIEKLVNITKGRGDRSSLKVVKAAAQVLNTLWQYRDLRS
IYKKDGWNQNHFITPVSTLERDRFKSHPSLSTTNQQMSPIIQSGSSKPSPIYISSYSSPAREQNRRLQHQQLYYSQDDSNRKNFDAYRLY

--------------------------------------------------------------
>53441_53441_2_MGST1-PKP4_MGST1_chr12_16507204_ENST00000010404_PKP4_chr2_159433783_ENST00000389759_length(transcript)=4350nt_BP=151nt
TCGTGACAAAGCAAATTGTCTGGTATCATGAATTTGGAACACCGTAATGGATGATGAAGTATTCATGGCTTTTGCATCCTATGCAACAAT
TATTCTTTCAAAAATGATGCTTATGAGTACTGCAACTGCATTCTATAGATTGACAAGAAAGGAGCTTCAGTTTCAGCGACTCACCCGAGA
ACTGGAAGTGGAAAGGCAGATTGTTGCCAGTCAGCTAGAAAGATGTAGGCTTGGAGCAGAATCACCAAGCATCGCCAGCACCAGCTCAAC
TGAGAAGTCATTTCCTTGGAGATCAACAGACGTGCCAAATACTGGTGTAAGCAAACCTAGAGTTTCTGACGCTGTCCAGCCCAACAACTA
TCTCATCAGGACAGAGCCAGAACAAGGAACCCTCTATTCACCAGAACAGACATCTCTCCATGAAAGTGAGGGATCATTGGGTAACTCAAG
AAGTTCAACACAAATGAATTCTTATTCCGACAGTGGATACCAGGAAGCAGGGAGTTTCCACAACAGCCAGAACGTGAGCAAGGCAGACAA
CAGACAGCAGCATTCATTCATAGGATCAACTAACAACCATGTGGTGAGGAATTCAAGAGCTGAAGGACAAACACTGGTTCAGCCATCAGT
AGCCAATCGGGCCATGAGAAGAGTTAGTTCAGTTCCATCTAGAGCACAGTCTCCTTCTTATGTTATCAGCACAGGCGTGTCTCCTTCAAG
GGGGTCTCTGAGAACTTCTCTGGGTAGTGGATTTGGCTCTCCGTCAGTGACCGACCCCCGACCTCTGAACCCCAGTGCATATTCCTCCAC
CACATTACCTGCTGCACGGGCAGCCTCTCCGTACTCACAGAGACCCGCCTCCCCAACAGCTATACGGCGGATTGGGTCAGTCACCTCCCG
GCAGACCTCCAATCCCAACGGACCAACCCCTCAATACCAAACCACCGCCAGAGTGGGGTCCCCACTGACCCTGACGGATGCACAGACTCG
AGTAGCTTCCCCATCCCAAGGCCAGGTGGGGTCGTCGTCCCCCAAACGCTCAGGGATGACCGCCGTACCACAGCATCTGGGACCTTCACT
GCAAAGGACTGTTCATGACATGGAGCAATTCGGACAGCAGCAGTATGACATTTATGAGAGGATGGTTCCACCCAGGCCAGACAGCCTGAC
AGGCTTACGGAGTTCCTATGCTAGTCAGCATAGTCAGCTTGGGCAAGACCTTCGTTCTGCCGTGTCTCCCGACTTGCACATTACTCCTAT
ATATGAGGGGAGGACCTATTACAGCCCAGTGTACCGCAGCCCAAACCATGGAACTGTGGAGCTCCAAGGATCGCAGACGGCGTTGTATCG
CACAGGTTCAGTAGGTATTGGAAATCTACAAAGGACATCCAGCCAACGAAGTACCCTTACATACCAAAGAAATAATTATGCTCTGAACAC
AACAGCTACCTACGCGGAGCCCTACAGGCCTATACAATACCGAGTGCAAGAGTGCAATTATAACAGGCTTCAGCATGCAGTGCCGGCTGA
TGATGGCACCACAAGATCCCCATCAATAGACAGCATTCAGAAGGACCCCAGGGAGTTTGCCTGGCGTGATCCTGAGTTGCCTGAGGTCAT
TCACATGCTTCAGCACCAGTTCCCATCTGTTCAGGCAAATGCAGCGGCCTACCTGCAGCACCTGTGCTTTGGTGACAACAAAGTGAAGAT
GGAGGTGTGTAGGTTAGGGGGAATCAAGCATCTGGTTGACCTTCTGGACCACAGAGTTTTGGAAGTTCAGAAGAATGCTTGTGGTGCCCT
TCGAAACCTCGTTTTTGGCAAGTCTACAGATGAAAATAAAATAGCAATGAAGAATGTTGGTGGGATACCTGCCTTGTTGCGACTGTTGAG
AAAATCTATTGATGCAGAAGTAAGGGAGCTTGTTACAGGAGTTCTTTGGAATTTATCCTCATGTGATGCTGTAAAAATGACAATCATTCG
AGATGCTCTCTCAACCTTAACAAACACTGTGATTGTTCCACATTCTGGATGGAATAACTCTTCTTTTGATGATGATCATAAAATTAAATT
TCAGACTTCACTAGTTCTGCGTAACACGACAGGTTGCCTAAGGAACCTCAGCTCCGCGGGGGAAGAAGCTCGGAAGCAAATGCGGTCCTG
CGAGGGGCTGGTAGACTCACTGTTGTATGTGATCCACACGTGTGTGAACACATCCGATTACGACAGCAAGACGGTGGAGAACTGCGTGTG
CACCCTGAGGAACCTGTCCTATCGGCTGGAGCTGGAGGTGCCCCAGGCCCGGTTACTGGGACTGAACGAATTGGATGACTTACTAGGAAA
AGAGTCTCCCAGCAAAGACTCTGAGCCAAGTTGCTGGGGGAAGAAGAAGAAAAAGAAAAAGAGGACTCCGCAAGAAGATCAATGGGATGG
AGTTGGTCCTATCCCAGGACTGTCGAAGTCCCCCAAAGGGGTTGAGATGCTGTGGCACCCATCGGTGGTAAAACCATATCTGACTCTTCT
AGCAGAAAGTTCCAACCCAGCCACCTTGGAAGGCTCTGCAGGGTCTCTCCAGAACCTCTCTGCTGGCAACTGGAAGTTTGCAGCATATAT
CCGGGCGGCCGTCCGAAAAGAAAAGGGGCTCCCCATCCTTGTGGAGCTTCTGAGAATGGATAACGATAGAGTTGTTTCTTCCGTGGCAAC
AGCCTTGAGGAATATGGCACTAGATGTTCGCAACAAGGAGCTCATAGGCAAATACGCCATGCGAGACCTGGTCAACCGGCTCCCCGGCGG
CAATGGCCCCAGTGTCTTGTCTGATGAGACCATGGCAGCCATCTGCTGTGCTCTGCACGAGGTCACCAGCAAAAACATGGAGAACGCAAA
AGCCCTGGCCGACTCAGGAGGCATAGAGAAGCTGGTGAACATAACCAAAGGCAGGGGCGACAGATCATCTCTGAAAGTGGTGAAGGCAGC
AGCCCAGGTCTTGAATACATTATGGCAATATCGGGACCTCCGGAGCATTTATAAAAAGGATGGGTGGAATCAGAACCATTTTATTACACC
TGTGTCGACATTGGAGCGAGACCGATTCAAATCACATCCTTCCTTGTCTACCACCAACCAACAGATGTCACCCATCATTCAGTCAGTCGG
CAGCACCTCTTCCTCACCAGCACTGTTAGGAATCAGAGACCCTCGCTCTGAATACGATAGGACCCAGCCACCTATGCAGTATTACAATAG
CCAAGGGGATGCCACACATAAAGGCCTGTACCCTGGCTCCAGCAAACCTTCACCAATTTACATCAGTTCCTATTCCTCACCAGCAAGAGA
ACAAAATAGACGGCTACAGCATCAACAGCTGTATTATAGTCAAGATGACTCCAACAGAAAGAACTTTGATGCATACAGATTGTATTTGCA
GTCTCCTCATAGCTATGAAGATCCTTATTTTGATGACCGAGTTCACTTTCCAGCTTCTACTGATTACTCAACACAGTATGGACTGAAATC
GACCACAAATTATGTAGACTTTTATTCCACTAAACGACCTTCTTATAGAGCAGAACAGTACCCAGGGTCCCCAGACTCATGGGTGTAGCA
TCAAGATGCCCAACAGAGGAACTCTTTCTTTCTAACCTTGTTCAGATTGAGGTGAAAAGTCCATCTTGCTGATTTGATGATTGAAATGTG
AAAGTGAAGTGGAAGGAATGAATGAAGTGTGTTTTTTTTTTCTTTTTTGAGGAATTATCAGGGAAGTGAGGAAATGTTTGGGAGAGGACT
TTCTAAGCTCTATTTAGGTGTTAGATCTAATTACTTATAGATTCTGTAGTCTGGTGAAGGTGTGGGTGACGTGATGAGAGGTTTGAGAAA
TGGGTGAAATGAAATGGGGGATATGTAGGTCAAATCAAATTAAAGATGATTTTTTTAATGTGAATAAAGTTATGTTCTGATAGTTTGTAC
AGAAAAAATAAAATGGATGCCCATGTTTTATTGCTATTACTAAATGTCAAGATTGTATGCTATTATGTCTTGTAAATTTCTTTTGTTGGT
GTAAATATGGAAATGCCACATTGGTTAAGTGCCATCATTTGTAATGCAATGTGTCACTTGAAAAGAGATTTGAAGAAACTGACAACTTCA
AAAACAAATGAGAAGCCCAAGGAACTGTGAGCAATTAAAAGCAAACCGCGACACCCTTTGTCTCCACCACACATAGTGTACTTTGGAAGC
ACAACGTCCAGGCTGGTACCGCAGCGCCATGCCCATTCCTCGCCTCATTCATAGGACACTTCACTGCCATTTTCTATTCACATAAAAGAA

>53441_53441_2_MGST1-PKP4_MGST1_chr12_16507204_ENST00000010404_PKP4_chr2_159433783_ENST00000389759_length(amino acids)=1183AA_BP=27
MDDEVFMAFASYATIILSKMMLMSTATAFYRLTRKELQFQRLTRELEVERQIVASQLERCRLGAESPSIASTSSTEKSFPWRSTDVPNTG
VSKPRVSDAVQPNNYLIRTEPEQGTLYSPEQTSLHESEGSLGNSRSSTQMNSYSDSGYQEAGSFHNSQNVSKADNRQQHSFIGSTNNHVV
RNSRAEGQTLVQPSVANRAMRRVSSVPSRAQSPSYVISTGVSPSRGSLRTSLGSGFGSPSVTDPRPLNPSAYSSTTLPAARAASPYSQRP
ASPTAIRRIGSVTSRQTSNPNGPTPQYQTTARVGSPLTLTDAQTRVASPSQGQVGSSSPKRSGMTAVPQHLGPSLQRTVHDMEQFGQQQY
DIYERMVPPRPDSLTGLRSSYASQHSQLGQDLRSAVSPDLHITPIYEGRTYYSPVYRSPNHGTVELQGSQTALYRTGSVGIGNLQRTSSQ
RSTLTYQRNNYALNTTATYAEPYRPIQYRVQECNYNRLQHAVPADDGTTRSPSIDSIQKDPREFAWRDPELPEVIHMLQHQFPSVQANAA
AYLQHLCFGDNKVKMEVCRLGGIKHLVDLLDHRVLEVQKNACGALRNLVFGKSTDENKIAMKNVGGIPALLRLLRKSIDAEVRELVTGVL
WNLSSCDAVKMTIIRDALSTLTNTVIVPHSGWNNSSFDDDHKIKFQTSLVLRNTTGCLRNLSSAGEEARKQMRSCEGLVDSLLYVIHTCV
NTSDYDSKTVENCVCTLRNLSYRLELEVPQARLLGLNELDDLLGKESPSKDSEPSCWGKKKKKKKRTPQEDQWDGVGPIPGLSKSPKGVE
MLWHPSVVKPYLTLLAESSNPATLEGSAGSLQNLSAGNWKFAAYIRAAVRKEKGLPILVELLRMDNDRVVSSVATALRNMALDVRNKELI
GKYAMRDLVNRLPGGNGPSVLSDETMAAICCALHEVTSKNMENAKALADSGGIEKLVNITKGRGDRSSLKVVKAAAQVLNTLWQYRDLRS
IYKKDGWNQNHFITPVSTLERDRFKSHPSLSTTNQQMSPIIQSVGSTSSSPALLGIRDPRSEYDRTQPPMQYYNSQGDATHKGLYPGSSK
PSPIYISSYSSPAREQNRRLQHQQLYYSQDDSNRKNFDAYRLYLQSPHSYEDPYFDDRVHFPASTDYSTQYGLKSTTNYVDFYSTKRPSY

--------------------------------------------------------------
>53441_53441_3_MGST1-PKP4_MGST1_chr12_16507204_ENST00000396207_PKP4_chr2_159433783_ENST00000389757_length(transcript)=5679nt_BP=159nt
GAATTCAAGTCCTAAAGCCTACAGTTTTGAATACTACTGAAATGACAAGTTGTAATGGATGATGAAGTATTCATGGCTTTTGCATCCTAT
GCAACAATTATTCTTTCAAAAATGATGCTTATGAGTACTGCAACTGCATTCTATAGATTGACAAGAAAGGAGCTTCAGTTTCAGCGACTC
ACCCGAGAACTGGAAGTGGAAAGGCAGATTGTTGCCAGTCAGCTAGAAAGATGTAGGCTTGGAGCAGAATCACCAAGCATCGCCAGCACC
AGCTCAACTGAGAAGTCATTTCCTTGGAGATCAACAGACGTGCCAAATACTGGTGTAAGCAAACCTAGAGTTTCTGACGCTGTCCAGCCC
AACAACTATCTCATCAGGACAGAGCCAGAACAAGGAACCCTCTATTCACCAGAACAGACATCTCTCCATGAAAGTGAGGGATCATTGGGT
AACTCAAGAAGTTCAACACAAATGAATTCTTATTCCGACAGTGGATACCAGGAAGCAGGGAGTTTCCACAACAGCCAGAACGTGAGCAAG
GCAGACAACAGACAGCAGCATTCATTCATAGGATCAACTAACAACCATGTGGTGAGGAATTCAAGAGCTGAAGGACAAACACTGGTTCAG
CCATCAGTAGCCAATCGGGCCATGAGAAGAGTTAGTTCAGTTCCATCTAGAGCACAGTCTCCTTCTTATGTTATCAGCACAGGCGTGTCT
CCTTCAAGGGGGTCTCTGAGAACTTCTCTGGGTAGTGGATTTGGCTCTCCGTCAGTGACCGACCCCCGACCTCTGAACCCCAGTGCATAT
TCCTCCACCACATTACCTGCTGCACGGGCAGCCTCTCCGTACTCACAGAGACCCGCCTCCCCAACAGCTATACGGCGGATTGGGTCAGTC
ACCTCCCGGCAGACCTCCAATCCCAACGGACCAACCCCTCAATACCAAACCACCGCCAGAGTGGGGTCCCCACTGACCCTGACGGATGCA
CAGACTCGAGTAGCTTCCCCATCCCAAGGCCAGGTGGGGTCGTCGTCCCCCAAACGCTCAGGGATGACCGCCGTACCACAGCATCTGGGA
CCTTCACTGCAAAGGACTGTTCATGACATGGAGCAATTCGGACAGCAGCAGTATGACATTTATGAGAGGATGGTTCCACCCAGGCCAGAC
AGCCTGACAGGCTTACGGAGTTCCTATGCTAGTCAGCATAGTCAGCTTGGGCAAGACCTTCGTTCTGCCGTGTCTCCCGACTTGCACATT
ACTCCTATATATGAGGGGAGGACCTATTACAGCCCAGTGTACCGCAGCCCAAACCATGGAACTGTGGAGCTCCAAGGATCGCAGACGGCG
TTGTATCGCACAGGTTCAGTAGGTATTGGAAATCTACAAAGGACATCCAGCCAACGAAGTACCCTTACATACCAAAGAAATAATTATGCT
CTGAACACAACAGCTACCTACGCGGAGCCCTACAGGCCTATACAATACCGAGTGCAAGAGTGCAATTATAACAGGCTTCAGCATGCAGTG
CCGGCTGATGATGGCACCACAAGATCCCCATCAATAGACAGCATTCAGAAGGACCCCAGGGAGTTTGCCTGGCGTGATCCTGAGTTGCCT
GAGGTCATTCACATGCTTCAGCACCAGTTCCCATCTGTTCAGGCAAATGCAGCGGCCTACCTGCAGCACCTGTGCTTTGGTGACAACAAA
GTGAAGATGGAGGTGTGTAGGTTAGGGGGAATCAAGCATCTGGTTGACCTTCTGGACCACAGAGTTTTGGAAGTTCAGAAGAATGCTTGT
GGTGCCCTTCGAAACCTCGTTTTTGGCAAGTCTACAGATGAAAATAAAATAGCAATGAAGAATGTTGGTGGGATACCTGCCTTGTTGCGA
CTGTTGAGAAAATCTATTGATGCAGAAGTAAGGGAGCTTGTTACAGGAGTTCTTTGGAATTTATCCTCATGTGATGCTGTAAAAATGACA
ATCATTCGAGATGCTCTCTCAACCTTAACAAACACTGTGATTGTTCCACATTCTGGATGGAATAACTCTTCTTTTGATGATGATCATAAA
ATTAAATTTCAGACTTCACTAGTTCTGCGTAACACGACAGGTTGCCTAAGGAACCTCAGCTCCGCGGGGGAAGAAGCTCGGAAGCAAATG
CGGTCCTGCGAGGGGCTGGTAGACTCACTGTTGTATGTGATCCACACGTGTGTGAACACATCCGATTACGACAGCAAGACGGTGGAGAAC
TGCGTGTGCACCCTGAGGAACCTGTCCTATCGGCTGGAGCTGGAGGTGCCCCAGGCCCGGTTACTGGGACTGAACGAATTGGATGACTTA
CTAGGAAAAGAGTCTCCCAGCAAAGACTCTGAGCCAAGTTGCTGGGGGAAGAAGAAGAAAAAGAAAAAGAGGACTCCGCAAGAAGATCAA
TGGGATGGAGTTGGTCCTATCCCAGGACTGTCGAAGTCCCCCAAAGGGGTTGAGATGCTGTGGCACCCATCGGTGGTAAAACCATATCTG
ACTCTTCTAGCAGAAAGTTCCAACCCAGCCACCTTGGAAGGCTCTGCAGGGTCTCTCCAGAACCTCTCTGCTGGCAACTGGAAGTTTGCA
GCATATATCCGGGCGGCCGTCCGAAAAGAAAAGGGGCTCCCCATCCTTGTGGAGCTTCTGAGAATGGATAACGATAGAGTTGTTTCTTCC
GTGGCAACAGCCTTGAGGAATATGGCACTAGATGTTCGCAACAAGGAGCTCATAGGCAAATACGCCATGCGAGACCTGGTCAACCGGCTC
CCCGGCGGCAATGGCCCCAGTGTCTTGTCTGATGAGACCATGGCAGCCATCTGCTGTGCTCTGCACGAGGTCACCAGCAAAAACATGGAG
AACGCAAAAGCCCTGGCCGACTCAGGAGGCATAGAGAAGCTGGTGAACATAACCAAAGGCAGGGGCGACAGATCATCTCTGAAAGTGGTG
AAGGCAGCAGCCCAGGTCTTGAATACATTATGGCAATATCGGGACCTCCGGAGCATTTATAAAAAGGATGGGTGGAATCAGAACCATTTT
ATTACACCTGTGTCGACATTGGAGCGAGACCGATTCAAATCACATCCTTCCTTGTCTACCACCAACCAACAGATGTCACCCATCATTCAG
TCAGGCTCCAGCAAACCTTCACCAATTTACATCAGTTCCTATTCCTCACCAGCAAGAGAACAAAATAGACGGCTACAGCATCAACAGCTG
TATTATAGTCAAGATGACTCCAACAGAAAGAACTTTGATGCATACAGATTGTATTTGCAGTCTCCTCATAGCTATGAAGATCCTTATTTT
GATGACCGAGTTCACTTTCCAGCTTCTACTGATTACTCAACACAGTATGGACTGAAATCGACCACAAATTATGTAGACTTTTATTCCACT
AAACGACCTTCTTATAGAGCAGAACAGTACCCAGGGTCCCCAGACTCATGGGTGTAGCATCAAGATGCCCAACAGAGGAACTCTTTCTTT
CTAACCTTGTTCAGATTGAGGTGAAAAGTCCATCTTGCTGATTTGATGATTGAAATGTGAAAGTGAAGTGGAAGGAATGAATGAAGTGTG
TTTTTTTTTTCTTTTTTGAGGAATTATCAGGGAAGTGAGGAAATGTTTGGGAGAGGACTTTCTAAGCTCTATTTAGGTGTTAGATCTAAT
TACTTATAGATTCTGTAGTCTGGTGAAGGTGTGGGTGACGTGATGAGAGGTTTGAGAAATGGGTGAAATGAAATGGGGGATATGTAGGTC
AAATCAAATTAAAGATGATTTTTTTAATGTGAATAAAGTTATGTTCTGATAGTTTGTACAGAAAAAATAAAATGGATGCCCATGTTTTAT
TGCTATTACTAAATGTCAAGATTGTATGCTATTATGTCTTGTAAATTTCTTTTGTTGGTGTAAATATGGAAATGCCACATTGGTTAAGTG
CCATCATTTGTAATGCAATGTGTCACTTGAAAAGAGATTTGAAGAAACTGACAACTTCAAAAACAAATGAGAAGCCCAAGGAACTGTGAG
CAATTAAAAGCAAACCGCGACACCCTTTGTCTCCACCACACATAGTGTACTTTGGAAGCACAACGTCCAGGCTGGTACCGCAGCGCCATG
CCCATTCCTCGCCTCATTCATAGGACACTTCACTGCCATTTTCTATTCACATAAAAGAAAAATAAATGTGGAAATTTCATCCTTGGAAAT
GTGTTCTCTATTCTTTTTAAATTTTCTGTATGTATATGGAAAAGCAGATTTAATAAACACAAATCTAACTGCATGTTGGAAACAGCTTAA
ATATATCCAACTATTTTGGACTTTTCCTAGTTTTATATATACTTTGTGAGATGGACCACACCAAAGAATTAAGCTATGAACTTTCCTTCT
CTGACAATCTAGGTATTTTTATTTCTTGGAATCTATTTCAAATCTAGTACCCTTCACAGTTTTATCATGTTTTTCTATATGGCAGCCCGG
CCATAACTAAATTGATATAACTAAATTGATACCTTTTTTAGGCATCATTTTTAAGAACTCAGTAACTTCATAAGAACAGTGGTTGGGCAT
TTATTTGGTTGGCACCAATATAATAAAACAAATTCTAAATTATGATCTTGTTGAGTAATTCCTGGATTAGGAATGGGGTAGGGGAAAGAA
GCCCCACAGTAGGGTAGGCAGAGGTCCCCATAGCACCACACAGCATTACTTGGGAGCTTTTCTCCATCTGTGGAGGCAATGAGGGACACC
AGGCCCAAAGTCTGAGGCCTTCTCAGGTCTTGCAGTTTCCACCCTTCAGTCCTAGCTCCTGGATCTTTGTTGGTGTAAATATGGAAATGC
CACATTGGTTAAGTGCCATCATGGAAAACTAGGAACTAGTTTACCTGTAGCTGTACTAGGAAAACTAGGAACTAACGCCCATGGCAAGTC
AAACCCAGAAGTGTTGTCTTAGCTCAATATACCCTGCAAGTGTAATACAAATCGCAGATTAAAACATTTGTACTTATATTTGTGAGTATA
TGGAAAGACACCCAAGAGAAAACTGCGTTTTCTTGCCGAATGGACAGCAGCCCAAGATGAGCATCCAAAGTGAAATGGAAATACCAGTGC
GAGCATGAAAGTTCCTAAGATCTGCAAGCCGGGAGCCCTTCTGGGGTGCCTTCTGAAGAACACTCGAGCCAGCTTGCGGAGTCGACTGCC
CAGCTACTGAGCCCTGGCTGCCTCATACCTGGCGGAGCTCAGCAGACAGCCCTTGTTAGATGGCAGAGGTCGCCAGATGTTCCTGAGGAA
CTGGCATCAGGACTTCTGGGACGTAACCACTTCAGTTTCCAATCACTGACTTCCCTGAACCTGGTATCTTCAGATCTGGGTTGAAGCTGA
AGGCTCAGTTGGGGAATCAGGAAAACAAGGGAATCGGAGCCAGCCCAAGCCAACTGACAATAACTTTCTTTTAGTGACTGCATTAGTTTG
GGGTAATGCTGTAACAGGCAGCATTTCAAAAACGGATCCAAATTTGGGTTGAAAATAACTCACCTGGGTCTCCAATCAGACCTAACAGAG
CCAGAAAGAGGCCTGTACACGGGGACTGGGCAGAAGATCTAATGCCGTTTAAGACTCCTCCCTTGAGACCAGAGAAAAGAAACTGCAGAC

>53441_53441_3_MGST1-PKP4_MGST1_chr12_16507204_ENST00000396207_PKP4_chr2_159433783_ENST00000389757_length(amino acids)=1140AA_BP=27
MDDEVFMAFASYATIILSKMMLMSTATAFYRLTRKELQFQRLTRELEVERQIVASQLERCRLGAESPSIASTSSTEKSFPWRSTDVPNTG
VSKPRVSDAVQPNNYLIRTEPEQGTLYSPEQTSLHESEGSLGNSRSSTQMNSYSDSGYQEAGSFHNSQNVSKADNRQQHSFIGSTNNHVV
RNSRAEGQTLVQPSVANRAMRRVSSVPSRAQSPSYVISTGVSPSRGSLRTSLGSGFGSPSVTDPRPLNPSAYSSTTLPAARAASPYSQRP
ASPTAIRRIGSVTSRQTSNPNGPTPQYQTTARVGSPLTLTDAQTRVASPSQGQVGSSSPKRSGMTAVPQHLGPSLQRTVHDMEQFGQQQY
DIYERMVPPRPDSLTGLRSSYASQHSQLGQDLRSAVSPDLHITPIYEGRTYYSPVYRSPNHGTVELQGSQTALYRTGSVGIGNLQRTSSQ
RSTLTYQRNNYALNTTATYAEPYRPIQYRVQECNYNRLQHAVPADDGTTRSPSIDSIQKDPREFAWRDPELPEVIHMLQHQFPSVQANAA
AYLQHLCFGDNKVKMEVCRLGGIKHLVDLLDHRVLEVQKNACGALRNLVFGKSTDENKIAMKNVGGIPALLRLLRKSIDAEVRELVTGVL
WNLSSCDAVKMTIIRDALSTLTNTVIVPHSGWNNSSFDDDHKIKFQTSLVLRNTTGCLRNLSSAGEEARKQMRSCEGLVDSLLYVIHTCV
NTSDYDSKTVENCVCTLRNLSYRLELEVPQARLLGLNELDDLLGKESPSKDSEPSCWGKKKKKKKRTPQEDQWDGVGPIPGLSKSPKGVE
MLWHPSVVKPYLTLLAESSNPATLEGSAGSLQNLSAGNWKFAAYIRAAVRKEKGLPILVELLRMDNDRVVSSVATALRNMALDVRNKELI
GKYAMRDLVNRLPGGNGPSVLSDETMAAICCALHEVTSKNMENAKALADSGGIEKLVNITKGRGDRSSLKVVKAAAQVLNTLWQYRDLRS
IYKKDGWNQNHFITPVSTLERDRFKSHPSLSTTNQQMSPIIQSGSSKPSPIYISSYSSPAREQNRRLQHQQLYYSQDDSNRKNFDAYRLY

--------------------------------------------------------------
>53441_53441_4_MGST1-PKP4_MGST1_chr12_16507204_ENST00000396207_PKP4_chr2_159433783_ENST00000389759_length(transcript)=4358nt_BP=159nt
GAATTCAAGTCCTAAAGCCTACAGTTTTGAATACTACTGAAATGACAAGTTGTAATGGATGATGAAGTATTCATGGCTTTTGCATCCTAT
GCAACAATTATTCTTTCAAAAATGATGCTTATGAGTACTGCAACTGCATTCTATAGATTGACAAGAAAGGAGCTTCAGTTTCAGCGACTC
ACCCGAGAACTGGAAGTGGAAAGGCAGATTGTTGCCAGTCAGCTAGAAAGATGTAGGCTTGGAGCAGAATCACCAAGCATCGCCAGCACC
AGCTCAACTGAGAAGTCATTTCCTTGGAGATCAACAGACGTGCCAAATACTGGTGTAAGCAAACCTAGAGTTTCTGACGCTGTCCAGCCC
AACAACTATCTCATCAGGACAGAGCCAGAACAAGGAACCCTCTATTCACCAGAACAGACATCTCTCCATGAAAGTGAGGGATCATTGGGT
AACTCAAGAAGTTCAACACAAATGAATTCTTATTCCGACAGTGGATACCAGGAAGCAGGGAGTTTCCACAACAGCCAGAACGTGAGCAAG
GCAGACAACAGACAGCAGCATTCATTCATAGGATCAACTAACAACCATGTGGTGAGGAATTCAAGAGCTGAAGGACAAACACTGGTTCAG
CCATCAGTAGCCAATCGGGCCATGAGAAGAGTTAGTTCAGTTCCATCTAGAGCACAGTCTCCTTCTTATGTTATCAGCACAGGCGTGTCT
CCTTCAAGGGGGTCTCTGAGAACTTCTCTGGGTAGTGGATTTGGCTCTCCGTCAGTGACCGACCCCCGACCTCTGAACCCCAGTGCATAT
TCCTCCACCACATTACCTGCTGCACGGGCAGCCTCTCCGTACTCACAGAGACCCGCCTCCCCAACAGCTATACGGCGGATTGGGTCAGTC
ACCTCCCGGCAGACCTCCAATCCCAACGGACCAACCCCTCAATACCAAACCACCGCCAGAGTGGGGTCCCCACTGACCCTGACGGATGCA
CAGACTCGAGTAGCTTCCCCATCCCAAGGCCAGGTGGGGTCGTCGTCCCCCAAACGCTCAGGGATGACCGCCGTACCACAGCATCTGGGA
CCTTCACTGCAAAGGACTGTTCATGACATGGAGCAATTCGGACAGCAGCAGTATGACATTTATGAGAGGATGGTTCCACCCAGGCCAGAC
AGCCTGACAGGCTTACGGAGTTCCTATGCTAGTCAGCATAGTCAGCTTGGGCAAGACCTTCGTTCTGCCGTGTCTCCCGACTTGCACATT
ACTCCTATATATGAGGGGAGGACCTATTACAGCCCAGTGTACCGCAGCCCAAACCATGGAACTGTGGAGCTCCAAGGATCGCAGACGGCG
TTGTATCGCACAGGTTCAGTAGGTATTGGAAATCTACAAAGGACATCCAGCCAACGAAGTACCCTTACATACCAAAGAAATAATTATGCT
CTGAACACAACAGCTACCTACGCGGAGCCCTACAGGCCTATACAATACCGAGTGCAAGAGTGCAATTATAACAGGCTTCAGCATGCAGTG
CCGGCTGATGATGGCACCACAAGATCCCCATCAATAGACAGCATTCAGAAGGACCCCAGGGAGTTTGCCTGGCGTGATCCTGAGTTGCCT
GAGGTCATTCACATGCTTCAGCACCAGTTCCCATCTGTTCAGGCAAATGCAGCGGCCTACCTGCAGCACCTGTGCTTTGGTGACAACAAA
GTGAAGATGGAGGTGTGTAGGTTAGGGGGAATCAAGCATCTGGTTGACCTTCTGGACCACAGAGTTTTGGAAGTTCAGAAGAATGCTTGT
GGTGCCCTTCGAAACCTCGTTTTTGGCAAGTCTACAGATGAAAATAAAATAGCAATGAAGAATGTTGGTGGGATACCTGCCTTGTTGCGA
CTGTTGAGAAAATCTATTGATGCAGAAGTAAGGGAGCTTGTTACAGGAGTTCTTTGGAATTTATCCTCATGTGATGCTGTAAAAATGACA
ATCATTCGAGATGCTCTCTCAACCTTAACAAACACTGTGATTGTTCCACATTCTGGATGGAATAACTCTTCTTTTGATGATGATCATAAA
ATTAAATTTCAGACTTCACTAGTTCTGCGTAACACGACAGGTTGCCTAAGGAACCTCAGCTCCGCGGGGGAAGAAGCTCGGAAGCAAATG
CGGTCCTGCGAGGGGCTGGTAGACTCACTGTTGTATGTGATCCACACGTGTGTGAACACATCCGATTACGACAGCAAGACGGTGGAGAAC
TGCGTGTGCACCCTGAGGAACCTGTCCTATCGGCTGGAGCTGGAGGTGCCCCAGGCCCGGTTACTGGGACTGAACGAATTGGATGACTTA
CTAGGAAAAGAGTCTCCCAGCAAAGACTCTGAGCCAAGTTGCTGGGGGAAGAAGAAGAAAAAGAAAAAGAGGACTCCGCAAGAAGATCAA
TGGGATGGAGTTGGTCCTATCCCAGGACTGTCGAAGTCCCCCAAAGGGGTTGAGATGCTGTGGCACCCATCGGTGGTAAAACCATATCTG
ACTCTTCTAGCAGAAAGTTCCAACCCAGCCACCTTGGAAGGCTCTGCAGGGTCTCTCCAGAACCTCTCTGCTGGCAACTGGAAGTTTGCA
GCATATATCCGGGCGGCCGTCCGAAAAGAAAAGGGGCTCCCCATCCTTGTGGAGCTTCTGAGAATGGATAACGATAGAGTTGTTTCTTCC
GTGGCAACAGCCTTGAGGAATATGGCACTAGATGTTCGCAACAAGGAGCTCATAGGCAAATACGCCATGCGAGACCTGGTCAACCGGCTC
CCCGGCGGCAATGGCCCCAGTGTCTTGTCTGATGAGACCATGGCAGCCATCTGCTGTGCTCTGCACGAGGTCACCAGCAAAAACATGGAG
AACGCAAAAGCCCTGGCCGACTCAGGAGGCATAGAGAAGCTGGTGAACATAACCAAAGGCAGGGGCGACAGATCATCTCTGAAAGTGGTG
AAGGCAGCAGCCCAGGTCTTGAATACATTATGGCAATATCGGGACCTCCGGAGCATTTATAAAAAGGATGGGTGGAATCAGAACCATTTT
ATTACACCTGTGTCGACATTGGAGCGAGACCGATTCAAATCACATCCTTCCTTGTCTACCACCAACCAACAGATGTCACCCATCATTCAG
TCAGTCGGCAGCACCTCTTCCTCACCAGCACTGTTAGGAATCAGAGACCCTCGCTCTGAATACGATAGGACCCAGCCACCTATGCAGTAT
TACAATAGCCAAGGGGATGCCACACATAAAGGCCTGTACCCTGGCTCCAGCAAACCTTCACCAATTTACATCAGTTCCTATTCCTCACCA
GCAAGAGAACAAAATAGACGGCTACAGCATCAACAGCTGTATTATAGTCAAGATGACTCCAACAGAAAGAACTTTGATGCATACAGATTG
TATTTGCAGTCTCCTCATAGCTATGAAGATCCTTATTTTGATGACCGAGTTCACTTTCCAGCTTCTACTGATTACTCAACACAGTATGGA
CTGAAATCGACCACAAATTATGTAGACTTTTATTCCACTAAACGACCTTCTTATAGAGCAGAACAGTACCCAGGGTCCCCAGACTCATGG
GTGTAGCATCAAGATGCCCAACAGAGGAACTCTTTCTTTCTAACCTTGTTCAGATTGAGGTGAAAAGTCCATCTTGCTGATTTGATGATT
GAAATGTGAAAGTGAAGTGGAAGGAATGAATGAAGTGTGTTTTTTTTTTCTTTTTTGAGGAATTATCAGGGAAGTGAGGAAATGTTTGGG
AGAGGACTTTCTAAGCTCTATTTAGGTGTTAGATCTAATTACTTATAGATTCTGTAGTCTGGTGAAGGTGTGGGTGACGTGATGAGAGGT
TTGAGAAATGGGTGAAATGAAATGGGGGATATGTAGGTCAAATCAAATTAAAGATGATTTTTTTAATGTGAATAAAGTTATGTTCTGATA
GTTTGTACAGAAAAAATAAAATGGATGCCCATGTTTTATTGCTATTACTAAATGTCAAGATTGTATGCTATTATGTCTTGTAAATTTCTT
TTGTTGGTGTAAATATGGAAATGCCACATTGGTTAAGTGCCATCATTTGTAATGCAATGTGTCACTTGAAAAGAGATTTGAAGAAACTGA
CAACTTCAAAAACAAATGAGAAGCCCAAGGAACTGTGAGCAATTAAAAGCAAACCGCGACACCCTTTGTCTCCACCACACATAGTGTACT
TTGGAAGCACAACGTCCAGGCTGGTACCGCAGCGCCATGCCCATTCCTCGCCTCATTCATAGGACACTTCACTGCCATTTTCTATTCACA

>53441_53441_4_MGST1-PKP4_MGST1_chr12_16507204_ENST00000396207_PKP4_chr2_159433783_ENST00000389759_length(amino acids)=1183AA_BP=27
MDDEVFMAFASYATIILSKMMLMSTATAFYRLTRKELQFQRLTRELEVERQIVASQLERCRLGAESPSIASTSSTEKSFPWRSTDVPNTG
VSKPRVSDAVQPNNYLIRTEPEQGTLYSPEQTSLHESEGSLGNSRSSTQMNSYSDSGYQEAGSFHNSQNVSKADNRQQHSFIGSTNNHVV
RNSRAEGQTLVQPSVANRAMRRVSSVPSRAQSPSYVISTGVSPSRGSLRTSLGSGFGSPSVTDPRPLNPSAYSSTTLPAARAASPYSQRP
ASPTAIRRIGSVTSRQTSNPNGPTPQYQTTARVGSPLTLTDAQTRVASPSQGQVGSSSPKRSGMTAVPQHLGPSLQRTVHDMEQFGQQQY
DIYERMVPPRPDSLTGLRSSYASQHSQLGQDLRSAVSPDLHITPIYEGRTYYSPVYRSPNHGTVELQGSQTALYRTGSVGIGNLQRTSSQ
RSTLTYQRNNYALNTTATYAEPYRPIQYRVQECNYNRLQHAVPADDGTTRSPSIDSIQKDPREFAWRDPELPEVIHMLQHQFPSVQANAA
AYLQHLCFGDNKVKMEVCRLGGIKHLVDLLDHRVLEVQKNACGALRNLVFGKSTDENKIAMKNVGGIPALLRLLRKSIDAEVRELVTGVL
WNLSSCDAVKMTIIRDALSTLTNTVIVPHSGWNNSSFDDDHKIKFQTSLVLRNTTGCLRNLSSAGEEARKQMRSCEGLVDSLLYVIHTCV
NTSDYDSKTVENCVCTLRNLSYRLELEVPQARLLGLNELDDLLGKESPSKDSEPSCWGKKKKKKKRTPQEDQWDGVGPIPGLSKSPKGVE
MLWHPSVVKPYLTLLAESSNPATLEGSAGSLQNLSAGNWKFAAYIRAAVRKEKGLPILVELLRMDNDRVVSSVATALRNMALDVRNKELI
GKYAMRDLVNRLPGGNGPSVLSDETMAAICCALHEVTSKNMENAKALADSGGIEKLVNITKGRGDRSSLKVVKAAAQVLNTLWQYRDLRS
IYKKDGWNQNHFITPVSTLERDRFKSHPSLSTTNQQMSPIIQSVGSTSSSPALLGIRDPRSEYDRTQPPMQYYNSQGDATHKGLYPGSSK
PSPIYISSYSSPAREQNRRLQHQQLYYSQDDSNRKNFDAYRLYLQSPHSYEDPYFDDRVHFPASTDYSTQYGLKSTTNYVDFYSTKRPSY

--------------------------------------------------------------
>53441_53441_5_MGST1-PKP4_MGST1_chr12_16507204_ENST00000396209_PKP4_chr2_159433783_ENST00000389757_length(transcript)=5749nt_BP=229nt
GGCAGATGGAAGACTTGGGGGGGTCTCTGCCAGCTGGAAGTGCTTGGCTCCACTTAGCAGCTAAACTTAGCTTTTCAATCGATCGCTTTT
GAAAGGGAATTGTATTTCTGTCCCCGTGCGGGTAATGGATGATGAAGTATTCATGGCTTTTGCATCCTATGCAACAATTATTCTTTCAAA
AATGATGCTTATGAGTACTGCAACTGCATTCTATAGATTGACAAGAAAGGAGCTTCAGTTTCAGCGACTCACCCGAGAACTGGAAGTGGA
AAGGCAGATTGTTGCCAGTCAGCTAGAAAGATGTAGGCTTGGAGCAGAATCACCAAGCATCGCCAGCACCAGCTCAACTGAGAAGTCATT
TCCTTGGAGATCAACAGACGTGCCAAATACTGGTGTAAGCAAACCTAGAGTTTCTGACGCTGTCCAGCCCAACAACTATCTCATCAGGAC
AGAGCCAGAACAAGGAACCCTCTATTCACCAGAACAGACATCTCTCCATGAAAGTGAGGGATCATTGGGTAACTCAAGAAGTTCAACACA
AATGAATTCTTATTCCGACAGTGGATACCAGGAAGCAGGGAGTTTCCACAACAGCCAGAACGTGAGCAAGGCAGACAACAGACAGCAGCA
TTCATTCATAGGATCAACTAACAACCATGTGGTGAGGAATTCAAGAGCTGAAGGACAAACACTGGTTCAGCCATCAGTAGCCAATCGGGC
CATGAGAAGAGTTAGTTCAGTTCCATCTAGAGCACAGTCTCCTTCTTATGTTATCAGCACAGGCGTGTCTCCTTCAAGGGGGTCTCTGAG
AACTTCTCTGGGTAGTGGATTTGGCTCTCCGTCAGTGACCGACCCCCGACCTCTGAACCCCAGTGCATATTCCTCCACCACATTACCTGC
TGCACGGGCAGCCTCTCCGTACTCACAGAGACCCGCCTCCCCAACAGCTATACGGCGGATTGGGTCAGTCACCTCCCGGCAGACCTCCAA
TCCCAACGGACCAACCCCTCAATACCAAACCACCGCCAGAGTGGGGTCCCCACTGACCCTGACGGATGCACAGACTCGAGTAGCTTCCCC
ATCCCAAGGCCAGGTGGGGTCGTCGTCCCCCAAACGCTCAGGGATGACCGCCGTACCACAGCATCTGGGACCTTCACTGCAAAGGACTGT
TCATGACATGGAGCAATTCGGACAGCAGCAGTATGACATTTATGAGAGGATGGTTCCACCCAGGCCAGACAGCCTGACAGGCTTACGGAG
TTCCTATGCTAGTCAGCATAGTCAGCTTGGGCAAGACCTTCGTTCTGCCGTGTCTCCCGACTTGCACATTACTCCTATATATGAGGGGAG
GACCTATTACAGCCCAGTGTACCGCAGCCCAAACCATGGAACTGTGGAGCTCCAAGGATCGCAGACGGCGTTGTATCGCACAGGTTCAGT
AGGTATTGGAAATCTACAAAGGACATCCAGCCAACGAAGTACCCTTACATACCAAAGAAATAATTATGCTCTGAACACAACAGCTACCTA
CGCGGAGCCCTACAGGCCTATACAATACCGAGTGCAAGAGTGCAATTATAACAGGCTTCAGCATGCAGTGCCGGCTGATGATGGCACCAC
AAGATCCCCATCAATAGACAGCATTCAGAAGGACCCCAGGGAGTTTGCCTGGCGTGATCCTGAGTTGCCTGAGGTCATTCACATGCTTCA
GCACCAGTTCCCATCTGTTCAGGCAAATGCAGCGGCCTACCTGCAGCACCTGTGCTTTGGTGACAACAAAGTGAAGATGGAGGTGTGTAG
GTTAGGGGGAATCAAGCATCTGGTTGACCTTCTGGACCACAGAGTTTTGGAAGTTCAGAAGAATGCTTGTGGTGCCCTTCGAAACCTCGT
TTTTGGCAAGTCTACAGATGAAAATAAAATAGCAATGAAGAATGTTGGTGGGATACCTGCCTTGTTGCGACTGTTGAGAAAATCTATTGA
TGCAGAAGTAAGGGAGCTTGTTACAGGAGTTCTTTGGAATTTATCCTCATGTGATGCTGTAAAAATGACAATCATTCGAGATGCTCTCTC
AACCTTAACAAACACTGTGATTGTTCCACATTCTGGATGGAATAACTCTTCTTTTGATGATGATCATAAAATTAAATTTCAGACTTCACT
AGTTCTGCGTAACACGACAGGTTGCCTAAGGAACCTCAGCTCCGCGGGGGAAGAAGCTCGGAAGCAAATGCGGTCCTGCGAGGGGCTGGT
AGACTCACTGTTGTATGTGATCCACACGTGTGTGAACACATCCGATTACGACAGCAAGACGGTGGAGAACTGCGTGTGCACCCTGAGGAA
CCTGTCCTATCGGCTGGAGCTGGAGGTGCCCCAGGCCCGGTTACTGGGACTGAACGAATTGGATGACTTACTAGGAAAAGAGTCTCCCAG
CAAAGACTCTGAGCCAAGTTGCTGGGGGAAGAAGAAGAAAAAGAAAAAGAGGACTCCGCAAGAAGATCAATGGGATGGAGTTGGTCCTAT
CCCAGGACTGTCGAAGTCCCCCAAAGGGGTTGAGATGCTGTGGCACCCATCGGTGGTAAAACCATATCTGACTCTTCTAGCAGAAAGTTC
CAACCCAGCCACCTTGGAAGGCTCTGCAGGGTCTCTCCAGAACCTCTCTGCTGGCAACTGGAAGTTTGCAGCATATATCCGGGCGGCCGT
CCGAAAAGAAAAGGGGCTCCCCATCCTTGTGGAGCTTCTGAGAATGGATAACGATAGAGTTGTTTCTTCCGTGGCAACAGCCTTGAGGAA
TATGGCACTAGATGTTCGCAACAAGGAGCTCATAGGCAAATACGCCATGCGAGACCTGGTCAACCGGCTCCCCGGCGGCAATGGCCCCAG
TGTCTTGTCTGATGAGACCATGGCAGCCATCTGCTGTGCTCTGCACGAGGTCACCAGCAAAAACATGGAGAACGCAAAAGCCCTGGCCGA
CTCAGGAGGCATAGAGAAGCTGGTGAACATAACCAAAGGCAGGGGCGACAGATCATCTCTGAAAGTGGTGAAGGCAGCAGCCCAGGTCTT
GAATACATTATGGCAATATCGGGACCTCCGGAGCATTTATAAAAAGGATGGGTGGAATCAGAACCATTTTATTACACCTGTGTCGACATT
GGAGCGAGACCGATTCAAATCACATCCTTCCTTGTCTACCACCAACCAACAGATGTCACCCATCATTCAGTCAGGCTCCAGCAAACCTTC
ACCAATTTACATCAGTTCCTATTCCTCACCAGCAAGAGAACAAAATAGACGGCTACAGCATCAACAGCTGTATTATAGTCAAGATGACTC
CAACAGAAAGAACTTTGATGCATACAGATTGTATTTGCAGTCTCCTCATAGCTATGAAGATCCTTATTTTGATGACCGAGTTCACTTTCC
AGCTTCTACTGATTACTCAACACAGTATGGACTGAAATCGACCACAAATTATGTAGACTTTTATTCCACTAAACGACCTTCTTATAGAGC
AGAACAGTACCCAGGGTCCCCAGACTCATGGGTGTAGCATCAAGATGCCCAACAGAGGAACTCTTTCTTTCTAACCTTGTTCAGATTGAG
GTGAAAAGTCCATCTTGCTGATTTGATGATTGAAATGTGAAAGTGAAGTGGAAGGAATGAATGAAGTGTGTTTTTTTTTTCTTTTTTGAG
GAATTATCAGGGAAGTGAGGAAATGTTTGGGAGAGGACTTTCTAAGCTCTATTTAGGTGTTAGATCTAATTACTTATAGATTCTGTAGTC
TGGTGAAGGTGTGGGTGACGTGATGAGAGGTTTGAGAAATGGGTGAAATGAAATGGGGGATATGTAGGTCAAATCAAATTAAAGATGATT
TTTTTAATGTGAATAAAGTTATGTTCTGATAGTTTGTACAGAAAAAATAAAATGGATGCCCATGTTTTATTGCTATTACTAAATGTCAAG
ATTGTATGCTATTATGTCTTGTAAATTTCTTTTGTTGGTGTAAATATGGAAATGCCACATTGGTTAAGTGCCATCATTTGTAATGCAATG
TGTCACTTGAAAAGAGATTTGAAGAAACTGACAACTTCAAAAACAAATGAGAAGCCCAAGGAACTGTGAGCAATTAAAAGCAAACCGCGA
CACCCTTTGTCTCCACCACACATAGTGTACTTTGGAAGCACAACGTCCAGGCTGGTACCGCAGCGCCATGCCCATTCCTCGCCTCATTCA
TAGGACACTTCACTGCCATTTTCTATTCACATAAAAGAAAAATAAATGTGGAAATTTCATCCTTGGAAATGTGTTCTCTATTCTTTTTAA
ATTTTCTGTATGTATATGGAAAAGCAGATTTAATAAACACAAATCTAACTGCATGTTGGAAACAGCTTAAATATATCCAACTATTTTGGA
CTTTTCCTAGTTTTATATATACTTTGTGAGATGGACCACACCAAAGAATTAAGCTATGAACTTTCCTTCTCTGACAATCTAGGTATTTTT
ATTTCTTGGAATCTATTTCAAATCTAGTACCCTTCACAGTTTTATCATGTTTTTCTATATGGCAGCCCGGCCATAACTAAATTGATATAA
CTAAATTGATACCTTTTTTAGGCATCATTTTTAAGAACTCAGTAACTTCATAAGAACAGTGGTTGGGCATTTATTTGGTTGGCACCAATA
TAATAAAACAAATTCTAAATTATGATCTTGTTGAGTAATTCCTGGATTAGGAATGGGGTAGGGGAAAGAAGCCCCACAGTAGGGTAGGCA
GAGGTCCCCATAGCACCACACAGCATTACTTGGGAGCTTTTCTCCATCTGTGGAGGCAATGAGGGACACCAGGCCCAAAGTCTGAGGCCT
TCTCAGGTCTTGCAGTTTCCACCCTTCAGTCCTAGCTCCTGGATCTTTGTTGGTGTAAATATGGAAATGCCACATTGGTTAAGTGCCATC
ATGGAAAACTAGGAACTAGTTTACCTGTAGCTGTACTAGGAAAACTAGGAACTAACGCCCATGGCAAGTCAAACCCAGAAGTGTTGTCTT
AGCTCAATATACCCTGCAAGTGTAATACAAATCGCAGATTAAAACATTTGTACTTATATTTGTGAGTATATGGAAAGACACCCAAGAGAA
AACTGCGTTTTCTTGCCGAATGGACAGCAGCCCAAGATGAGCATCCAAAGTGAAATGGAAATACCAGTGCGAGCATGAAAGTTCCTAAGA
TCTGCAAGCCGGGAGCCCTTCTGGGGTGCCTTCTGAAGAACACTCGAGCCAGCTTGCGGAGTCGACTGCCCAGCTACTGAGCCCTGGCTG
CCTCATACCTGGCGGAGCTCAGCAGACAGCCCTTGTTAGATGGCAGAGGTCGCCAGATGTTCCTGAGGAACTGGCATCAGGACTTCTGGG
ACGTAACCACTTCAGTTTCCAATCACTGACTTCCCTGAACCTGGTATCTTCAGATCTGGGTTGAAGCTGAAGGCTCAGTTGGGGAATCAG
GAAAACAAGGGAATCGGAGCCAGCCCAAGCCAACTGACAATAACTTTCTTTTAGTGACTGCATTAGTTTGGGGTAATGCTGTAACAGGCA
GCATTTCAAAAACGGATCCAAATTTGGGTTGAAAATAACTCACCTGGGTCTCCAATCAGACCTAACAGAGCCAGAAAGAGGCCTGTACAC

>53441_53441_5_MGST1-PKP4_MGST1_chr12_16507204_ENST00000396209_PKP4_chr2_159433783_ENST00000389757_length(amino acids)=1152AA_BP=39
MKGNCISVPVRVMDDEVFMAFASYATIILSKMMLMSTATAFYRLTRKELQFQRLTRELEVERQIVASQLERCRLGAESPSIASTSSTEKS
FPWRSTDVPNTGVSKPRVSDAVQPNNYLIRTEPEQGTLYSPEQTSLHESEGSLGNSRSSTQMNSYSDSGYQEAGSFHNSQNVSKADNRQQ
HSFIGSTNNHVVRNSRAEGQTLVQPSVANRAMRRVSSVPSRAQSPSYVISTGVSPSRGSLRTSLGSGFGSPSVTDPRPLNPSAYSSTTLP
AARAASPYSQRPASPTAIRRIGSVTSRQTSNPNGPTPQYQTTARVGSPLTLTDAQTRVASPSQGQVGSSSPKRSGMTAVPQHLGPSLQRT
VHDMEQFGQQQYDIYERMVPPRPDSLTGLRSSYASQHSQLGQDLRSAVSPDLHITPIYEGRTYYSPVYRSPNHGTVELQGSQTALYRTGS
VGIGNLQRTSSQRSTLTYQRNNYALNTTATYAEPYRPIQYRVQECNYNRLQHAVPADDGTTRSPSIDSIQKDPREFAWRDPELPEVIHML
QHQFPSVQANAAAYLQHLCFGDNKVKMEVCRLGGIKHLVDLLDHRVLEVQKNACGALRNLVFGKSTDENKIAMKNVGGIPALLRLLRKSI
DAEVRELVTGVLWNLSSCDAVKMTIIRDALSTLTNTVIVPHSGWNNSSFDDDHKIKFQTSLVLRNTTGCLRNLSSAGEEARKQMRSCEGL
VDSLLYVIHTCVNTSDYDSKTVENCVCTLRNLSYRLELEVPQARLLGLNELDDLLGKESPSKDSEPSCWGKKKKKKKRTPQEDQWDGVGP
IPGLSKSPKGVEMLWHPSVVKPYLTLLAESSNPATLEGSAGSLQNLSAGNWKFAAYIRAAVRKEKGLPILVELLRMDNDRVVSSVATALR
NMALDVRNKELIGKYAMRDLVNRLPGGNGPSVLSDETMAAICCALHEVTSKNMENAKALADSGGIEKLVNITKGRGDRSSLKVVKAAAQV
LNTLWQYRDLRSIYKKDGWNQNHFITPVSTLERDRFKSHPSLSTTNQQMSPIIQSGSSKPSPIYISSYSSPAREQNRRLQHQQLYYSQDD

--------------------------------------------------------------
>53441_53441_6_MGST1-PKP4_MGST1_chr12_16507204_ENST00000396209_PKP4_chr2_159433783_ENST00000389759_length(transcript)=4428nt_BP=229nt
GGCAGATGGAAGACTTGGGGGGGTCTCTGCCAGCTGGAAGTGCTTGGCTCCACTTAGCAGCTAAACTTAGCTTTTCAATCGATCGCTTTT
GAAAGGGAATTGTATTTCTGTCCCCGTGCGGGTAATGGATGATGAAGTATTCATGGCTTTTGCATCCTATGCAACAATTATTCTTTCAAA
AATGATGCTTATGAGTACTGCAACTGCATTCTATAGATTGACAAGAAAGGAGCTTCAGTTTCAGCGACTCACCCGAGAACTGGAAGTGGA
AAGGCAGATTGTTGCCAGTCAGCTAGAAAGATGTAGGCTTGGAGCAGAATCACCAAGCATCGCCAGCACCAGCTCAACTGAGAAGTCATT
TCCTTGGAGATCAACAGACGTGCCAAATACTGGTGTAAGCAAACCTAGAGTTTCTGACGCTGTCCAGCCCAACAACTATCTCATCAGGAC
AGAGCCAGAACAAGGAACCCTCTATTCACCAGAACAGACATCTCTCCATGAAAGTGAGGGATCATTGGGTAACTCAAGAAGTTCAACACA
AATGAATTCTTATTCCGACAGTGGATACCAGGAAGCAGGGAGTTTCCACAACAGCCAGAACGTGAGCAAGGCAGACAACAGACAGCAGCA
TTCATTCATAGGATCAACTAACAACCATGTGGTGAGGAATTCAAGAGCTGAAGGACAAACACTGGTTCAGCCATCAGTAGCCAATCGGGC
CATGAGAAGAGTTAGTTCAGTTCCATCTAGAGCACAGTCTCCTTCTTATGTTATCAGCACAGGCGTGTCTCCTTCAAGGGGGTCTCTGAG
AACTTCTCTGGGTAGTGGATTTGGCTCTCCGTCAGTGACCGACCCCCGACCTCTGAACCCCAGTGCATATTCCTCCACCACATTACCTGC
TGCACGGGCAGCCTCTCCGTACTCACAGAGACCCGCCTCCCCAACAGCTATACGGCGGATTGGGTCAGTCACCTCCCGGCAGACCTCCAA
TCCCAACGGACCAACCCCTCAATACCAAACCACCGCCAGAGTGGGGTCCCCACTGACCCTGACGGATGCACAGACTCGAGTAGCTTCCCC
ATCCCAAGGCCAGGTGGGGTCGTCGTCCCCCAAACGCTCAGGGATGACCGCCGTACCACAGCATCTGGGACCTTCACTGCAAAGGACTGT
TCATGACATGGAGCAATTCGGACAGCAGCAGTATGACATTTATGAGAGGATGGTTCCACCCAGGCCAGACAGCCTGACAGGCTTACGGAG
TTCCTATGCTAGTCAGCATAGTCAGCTTGGGCAAGACCTTCGTTCTGCCGTGTCTCCCGACTTGCACATTACTCCTATATATGAGGGGAG
GACCTATTACAGCCCAGTGTACCGCAGCCCAAACCATGGAACTGTGGAGCTCCAAGGATCGCAGACGGCGTTGTATCGCACAGGTTCAGT
AGGTATTGGAAATCTACAAAGGACATCCAGCCAACGAAGTACCCTTACATACCAAAGAAATAATTATGCTCTGAACACAACAGCTACCTA
CGCGGAGCCCTACAGGCCTATACAATACCGAGTGCAAGAGTGCAATTATAACAGGCTTCAGCATGCAGTGCCGGCTGATGATGGCACCAC
AAGATCCCCATCAATAGACAGCATTCAGAAGGACCCCAGGGAGTTTGCCTGGCGTGATCCTGAGTTGCCTGAGGTCATTCACATGCTTCA
GCACCAGTTCCCATCTGTTCAGGCAAATGCAGCGGCCTACCTGCAGCACCTGTGCTTTGGTGACAACAAAGTGAAGATGGAGGTGTGTAG
GTTAGGGGGAATCAAGCATCTGGTTGACCTTCTGGACCACAGAGTTTTGGAAGTTCAGAAGAATGCTTGTGGTGCCCTTCGAAACCTCGT
TTTTGGCAAGTCTACAGATGAAAATAAAATAGCAATGAAGAATGTTGGTGGGATACCTGCCTTGTTGCGACTGTTGAGAAAATCTATTGA
TGCAGAAGTAAGGGAGCTTGTTACAGGAGTTCTTTGGAATTTATCCTCATGTGATGCTGTAAAAATGACAATCATTCGAGATGCTCTCTC
AACCTTAACAAACACTGTGATTGTTCCACATTCTGGATGGAATAACTCTTCTTTTGATGATGATCATAAAATTAAATTTCAGACTTCACT
AGTTCTGCGTAACACGACAGGTTGCCTAAGGAACCTCAGCTCCGCGGGGGAAGAAGCTCGGAAGCAAATGCGGTCCTGCGAGGGGCTGGT
AGACTCACTGTTGTATGTGATCCACACGTGTGTGAACACATCCGATTACGACAGCAAGACGGTGGAGAACTGCGTGTGCACCCTGAGGAA
CCTGTCCTATCGGCTGGAGCTGGAGGTGCCCCAGGCCCGGTTACTGGGACTGAACGAATTGGATGACTTACTAGGAAAAGAGTCTCCCAG
CAAAGACTCTGAGCCAAGTTGCTGGGGGAAGAAGAAGAAAAAGAAAAAGAGGACTCCGCAAGAAGATCAATGGGATGGAGTTGGTCCTAT
CCCAGGACTGTCGAAGTCCCCCAAAGGGGTTGAGATGCTGTGGCACCCATCGGTGGTAAAACCATATCTGACTCTTCTAGCAGAAAGTTC
CAACCCAGCCACCTTGGAAGGCTCTGCAGGGTCTCTCCAGAACCTCTCTGCTGGCAACTGGAAGTTTGCAGCATATATCCGGGCGGCCGT
CCGAAAAGAAAAGGGGCTCCCCATCCTTGTGGAGCTTCTGAGAATGGATAACGATAGAGTTGTTTCTTCCGTGGCAACAGCCTTGAGGAA
TATGGCACTAGATGTTCGCAACAAGGAGCTCATAGGCAAATACGCCATGCGAGACCTGGTCAACCGGCTCCCCGGCGGCAATGGCCCCAG
TGTCTTGTCTGATGAGACCATGGCAGCCATCTGCTGTGCTCTGCACGAGGTCACCAGCAAAAACATGGAGAACGCAAAAGCCCTGGCCGA
CTCAGGAGGCATAGAGAAGCTGGTGAACATAACCAAAGGCAGGGGCGACAGATCATCTCTGAAAGTGGTGAAGGCAGCAGCCCAGGTCTT
GAATACATTATGGCAATATCGGGACCTCCGGAGCATTTATAAAAAGGATGGGTGGAATCAGAACCATTTTATTACACCTGTGTCGACATT
GGAGCGAGACCGATTCAAATCACATCCTTCCTTGTCTACCACCAACCAACAGATGTCACCCATCATTCAGTCAGTCGGCAGCACCTCTTC
CTCACCAGCACTGTTAGGAATCAGAGACCCTCGCTCTGAATACGATAGGACCCAGCCACCTATGCAGTATTACAATAGCCAAGGGGATGC
CACACATAAAGGCCTGTACCCTGGCTCCAGCAAACCTTCACCAATTTACATCAGTTCCTATTCCTCACCAGCAAGAGAACAAAATAGACG
GCTACAGCATCAACAGCTGTATTATAGTCAAGATGACTCCAACAGAAAGAACTTTGATGCATACAGATTGTATTTGCAGTCTCCTCATAG
CTATGAAGATCCTTATTTTGATGACCGAGTTCACTTTCCAGCTTCTACTGATTACTCAACACAGTATGGACTGAAATCGACCACAAATTA
TGTAGACTTTTATTCCACTAAACGACCTTCTTATAGAGCAGAACAGTACCCAGGGTCCCCAGACTCATGGGTGTAGCATCAAGATGCCCA
ACAGAGGAACTCTTTCTTTCTAACCTTGTTCAGATTGAGGTGAAAAGTCCATCTTGCTGATTTGATGATTGAAATGTGAAAGTGAAGTGG
AAGGAATGAATGAAGTGTGTTTTTTTTTTCTTTTTTGAGGAATTATCAGGGAAGTGAGGAAATGTTTGGGAGAGGACTTTCTAAGCTCTA
TTTAGGTGTTAGATCTAATTACTTATAGATTCTGTAGTCTGGTGAAGGTGTGGGTGACGTGATGAGAGGTTTGAGAAATGGGTGAAATGA
AATGGGGGATATGTAGGTCAAATCAAATTAAAGATGATTTTTTTAATGTGAATAAAGTTATGTTCTGATAGTTTGTACAGAAAAAATAAA
ATGGATGCCCATGTTTTATTGCTATTACTAAATGTCAAGATTGTATGCTATTATGTCTTGTAAATTTCTTTTGTTGGTGTAAATATGGAA
ATGCCACATTGGTTAAGTGCCATCATTTGTAATGCAATGTGTCACTTGAAAAGAGATTTGAAGAAACTGACAACTTCAAAAACAAATGAG
AAGCCCAAGGAACTGTGAGCAATTAAAAGCAAACCGCGACACCCTTTGTCTCCACCACACATAGTGTACTTTGGAAGCACAACGTCCAGG
CTGGTACCGCAGCGCCATGCCCATTCCTCGCCTCATTCATAGGACACTTCACTGCCATTTTCTATTCACATAAAAGAAAAATAAATGTGG

>53441_53441_6_MGST1-PKP4_MGST1_chr12_16507204_ENST00000396209_PKP4_chr2_159433783_ENST00000389759_length(amino acids)=1195AA_BP=39
MKGNCISVPVRVMDDEVFMAFASYATIILSKMMLMSTATAFYRLTRKELQFQRLTRELEVERQIVASQLERCRLGAESPSIASTSSTEKS
FPWRSTDVPNTGVSKPRVSDAVQPNNYLIRTEPEQGTLYSPEQTSLHESEGSLGNSRSSTQMNSYSDSGYQEAGSFHNSQNVSKADNRQQ
HSFIGSTNNHVVRNSRAEGQTLVQPSVANRAMRRVSSVPSRAQSPSYVISTGVSPSRGSLRTSLGSGFGSPSVTDPRPLNPSAYSSTTLP
AARAASPYSQRPASPTAIRRIGSVTSRQTSNPNGPTPQYQTTARVGSPLTLTDAQTRVASPSQGQVGSSSPKRSGMTAVPQHLGPSLQRT
VHDMEQFGQQQYDIYERMVPPRPDSLTGLRSSYASQHSQLGQDLRSAVSPDLHITPIYEGRTYYSPVYRSPNHGTVELQGSQTALYRTGS
VGIGNLQRTSSQRSTLTYQRNNYALNTTATYAEPYRPIQYRVQECNYNRLQHAVPADDGTTRSPSIDSIQKDPREFAWRDPELPEVIHML
QHQFPSVQANAAAYLQHLCFGDNKVKMEVCRLGGIKHLVDLLDHRVLEVQKNACGALRNLVFGKSTDENKIAMKNVGGIPALLRLLRKSI
DAEVRELVTGVLWNLSSCDAVKMTIIRDALSTLTNTVIVPHSGWNNSSFDDDHKIKFQTSLVLRNTTGCLRNLSSAGEEARKQMRSCEGL
VDSLLYVIHTCVNTSDYDSKTVENCVCTLRNLSYRLELEVPQARLLGLNELDDLLGKESPSKDSEPSCWGKKKKKKKRTPQEDQWDGVGP
IPGLSKSPKGVEMLWHPSVVKPYLTLLAESSNPATLEGSAGSLQNLSAGNWKFAAYIRAAVRKEKGLPILVELLRMDNDRVVSSVATALR
NMALDVRNKELIGKYAMRDLVNRLPGGNGPSVLSDETMAAICCALHEVTSKNMENAKALADSGGIEKLVNITKGRGDRSSLKVVKAAAQV
LNTLWQYRDLRSIYKKDGWNQNHFITPVSTLERDRFKSHPSLSTTNQQMSPIIQSVGSTSSSPALLGIRDPRSEYDRTQPPMQYYNSQGD
ATHKGLYPGSSKPSPIYISSYSSPAREQNRRLQHQQLYYSQDDSNRKNFDAYRLYLQSPHSYEDPYFDDRVHFPASTDYSTQYGLKSTTN

--------------------------------------------------------------
>53441_53441_7_MGST1-PKP4_MGST1_chr12_16507204_ENST00000396210_PKP4_chr2_159433783_ENST00000389757_length(transcript)=5690nt_BP=170nt
CCTGCATTGCGCGCGACCCGGCGGCGGGACAGGCTTGCTGCTTCCTCCTCCTCGGCCTCACCGTAATGGATGATGAAGTATTCATGGCTT
TTGCATCCTATGCAACAATTATTCTTTCAAAAATGATGCTTATGAGTACTGCAACTGCATTCTATAGATTGACAAGAAAGGAGCTTCAGT
TTCAGCGACTCACCCGAGAACTGGAAGTGGAAAGGCAGATTGTTGCCAGTCAGCTAGAAAGATGTAGGCTTGGAGCAGAATCACCAAGCA
TCGCCAGCACCAGCTCAACTGAGAAGTCATTTCCTTGGAGATCAACAGACGTGCCAAATACTGGTGTAAGCAAACCTAGAGTTTCTGACG
CTGTCCAGCCCAACAACTATCTCATCAGGACAGAGCCAGAACAAGGAACCCTCTATTCACCAGAACAGACATCTCTCCATGAAAGTGAGG
GATCATTGGGTAACTCAAGAAGTTCAACACAAATGAATTCTTATTCCGACAGTGGATACCAGGAAGCAGGGAGTTTCCACAACAGCCAGA
ACGTGAGCAAGGCAGACAACAGACAGCAGCATTCATTCATAGGATCAACTAACAACCATGTGGTGAGGAATTCAAGAGCTGAAGGACAAA
CACTGGTTCAGCCATCAGTAGCCAATCGGGCCATGAGAAGAGTTAGTTCAGTTCCATCTAGAGCACAGTCTCCTTCTTATGTTATCAGCA
CAGGCGTGTCTCCTTCAAGGGGGTCTCTGAGAACTTCTCTGGGTAGTGGATTTGGCTCTCCGTCAGTGACCGACCCCCGACCTCTGAACC
CCAGTGCATATTCCTCCACCACATTACCTGCTGCACGGGCAGCCTCTCCGTACTCACAGAGACCCGCCTCCCCAACAGCTATACGGCGGA
TTGGGTCAGTCACCTCCCGGCAGACCTCCAATCCCAACGGACCAACCCCTCAATACCAAACCACCGCCAGAGTGGGGTCCCCACTGACCC
TGACGGATGCACAGACTCGAGTAGCTTCCCCATCCCAAGGCCAGGTGGGGTCGTCGTCCCCCAAACGCTCAGGGATGACCGCCGTACCAC
AGCATCTGGGACCTTCACTGCAAAGGACTGTTCATGACATGGAGCAATTCGGACAGCAGCAGTATGACATTTATGAGAGGATGGTTCCAC
CCAGGCCAGACAGCCTGACAGGCTTACGGAGTTCCTATGCTAGTCAGCATAGTCAGCTTGGGCAAGACCTTCGTTCTGCCGTGTCTCCCG
ACTTGCACATTACTCCTATATATGAGGGGAGGACCTATTACAGCCCAGTGTACCGCAGCCCAAACCATGGAACTGTGGAGCTCCAAGGAT
CGCAGACGGCGTTGTATCGCACAGGTTCAGTAGGTATTGGAAATCTACAAAGGACATCCAGCCAACGAAGTACCCTTACATACCAAAGAA
ATAATTATGCTCTGAACACAACAGCTACCTACGCGGAGCCCTACAGGCCTATACAATACCGAGTGCAAGAGTGCAATTATAACAGGCTTC
AGCATGCAGTGCCGGCTGATGATGGCACCACAAGATCCCCATCAATAGACAGCATTCAGAAGGACCCCAGGGAGTTTGCCTGGCGTGATC
CTGAGTTGCCTGAGGTCATTCACATGCTTCAGCACCAGTTCCCATCTGTTCAGGCAAATGCAGCGGCCTACCTGCAGCACCTGTGCTTTG
GTGACAACAAAGTGAAGATGGAGGTGTGTAGGTTAGGGGGAATCAAGCATCTGGTTGACCTTCTGGACCACAGAGTTTTGGAAGTTCAGA
AGAATGCTTGTGGTGCCCTTCGAAACCTCGTTTTTGGCAAGTCTACAGATGAAAATAAAATAGCAATGAAGAATGTTGGTGGGATACCTG
CCTTGTTGCGACTGTTGAGAAAATCTATTGATGCAGAAGTAAGGGAGCTTGTTACAGGAGTTCTTTGGAATTTATCCTCATGTGATGCTG
TAAAAATGACAATCATTCGAGATGCTCTCTCAACCTTAACAAACACTGTGATTGTTCCACATTCTGGATGGAATAACTCTTCTTTTGATG
ATGATCATAAAATTAAATTTCAGACTTCACTAGTTCTGCGTAACACGACAGGTTGCCTAAGGAACCTCAGCTCCGCGGGGGAAGAAGCTC
GGAAGCAAATGCGGTCCTGCGAGGGGCTGGTAGACTCACTGTTGTATGTGATCCACACGTGTGTGAACACATCCGATTACGACAGCAAGA
CGGTGGAGAACTGCGTGTGCACCCTGAGGAACCTGTCCTATCGGCTGGAGCTGGAGGTGCCCCAGGCCCGGTTACTGGGACTGAACGAAT
TGGATGACTTACTAGGAAAAGAGTCTCCCAGCAAAGACTCTGAGCCAAGTTGCTGGGGGAAGAAGAAGAAAAAGAAAAAGAGGACTCCGC
AAGAAGATCAATGGGATGGAGTTGGTCCTATCCCAGGACTGTCGAAGTCCCCCAAAGGGGTTGAGATGCTGTGGCACCCATCGGTGGTAA
AACCATATCTGACTCTTCTAGCAGAAAGTTCCAACCCAGCCACCTTGGAAGGCTCTGCAGGGTCTCTCCAGAACCTCTCTGCTGGCAACT
GGAAGTTTGCAGCATATATCCGGGCGGCCGTCCGAAAAGAAAAGGGGCTCCCCATCCTTGTGGAGCTTCTGAGAATGGATAACGATAGAG
TTGTTTCTTCCGTGGCAACAGCCTTGAGGAATATGGCACTAGATGTTCGCAACAAGGAGCTCATAGGCAAATACGCCATGCGAGACCTGG
TCAACCGGCTCCCCGGCGGCAATGGCCCCAGTGTCTTGTCTGATGAGACCATGGCAGCCATCTGCTGTGCTCTGCACGAGGTCACCAGCA
AAAACATGGAGAACGCAAAAGCCCTGGCCGACTCAGGAGGCATAGAGAAGCTGGTGAACATAACCAAAGGCAGGGGCGACAGATCATCTC
TGAAAGTGGTGAAGGCAGCAGCCCAGGTCTTGAATACATTATGGCAATATCGGGACCTCCGGAGCATTTATAAAAAGGATGGGTGGAATC
AGAACCATTTTATTACACCTGTGTCGACATTGGAGCGAGACCGATTCAAATCACATCCTTCCTTGTCTACCACCAACCAACAGATGTCAC
CCATCATTCAGTCAGGCTCCAGCAAACCTTCACCAATTTACATCAGTTCCTATTCCTCACCAGCAAGAGAACAAAATAGACGGCTACAGC
ATCAACAGCTGTATTATAGTCAAGATGACTCCAACAGAAAGAACTTTGATGCATACAGATTGTATTTGCAGTCTCCTCATAGCTATGAAG
ATCCTTATTTTGATGACCGAGTTCACTTTCCAGCTTCTACTGATTACTCAACACAGTATGGACTGAAATCGACCACAAATTATGTAGACT
TTTATTCCACTAAACGACCTTCTTATAGAGCAGAACAGTACCCAGGGTCCCCAGACTCATGGGTGTAGCATCAAGATGCCCAACAGAGGA
ACTCTTTCTTTCTAACCTTGTTCAGATTGAGGTGAAAAGTCCATCTTGCTGATTTGATGATTGAAATGTGAAAGTGAAGTGGAAGGAATG
AATGAAGTGTGTTTTTTTTTTCTTTTTTGAGGAATTATCAGGGAAGTGAGGAAATGTTTGGGAGAGGACTTTCTAAGCTCTATTTAGGTG
TTAGATCTAATTACTTATAGATTCTGTAGTCTGGTGAAGGTGTGGGTGACGTGATGAGAGGTTTGAGAAATGGGTGAAATGAAATGGGGG
ATATGTAGGTCAAATCAAATTAAAGATGATTTTTTTAATGTGAATAAAGTTATGTTCTGATAGTTTGTACAGAAAAAATAAAATGGATGC
CCATGTTTTATTGCTATTACTAAATGTCAAGATTGTATGCTATTATGTCTTGTAAATTTCTTTTGTTGGTGTAAATATGGAAATGCCACA
TTGGTTAAGTGCCATCATTTGTAATGCAATGTGTCACTTGAAAAGAGATTTGAAGAAACTGACAACTTCAAAAACAAATGAGAAGCCCAA
GGAACTGTGAGCAATTAAAAGCAAACCGCGACACCCTTTGTCTCCACCACACATAGTGTACTTTGGAAGCACAACGTCCAGGCTGGTACC
GCAGCGCCATGCCCATTCCTCGCCTCATTCATAGGACACTTCACTGCCATTTTCTATTCACATAAAAGAAAAATAAATGTGGAAATTTCA
TCCTTGGAAATGTGTTCTCTATTCTTTTTAAATTTTCTGTATGTATATGGAAAAGCAGATTTAATAAACACAAATCTAACTGCATGTTGG
AAACAGCTTAAATATATCCAACTATTTTGGACTTTTCCTAGTTTTATATATACTTTGTGAGATGGACCACACCAAAGAATTAAGCTATGA
ACTTTCCTTCTCTGACAATCTAGGTATTTTTATTTCTTGGAATCTATTTCAAATCTAGTACCCTTCACAGTTTTATCATGTTTTTCTATA
TGGCAGCCCGGCCATAACTAAATTGATATAACTAAATTGATACCTTTTTTAGGCATCATTTTTAAGAACTCAGTAACTTCATAAGAACAG
TGGTTGGGCATTTATTTGGTTGGCACCAATATAATAAAACAAATTCTAAATTATGATCTTGTTGAGTAATTCCTGGATTAGGAATGGGGT
AGGGGAAAGAAGCCCCACAGTAGGGTAGGCAGAGGTCCCCATAGCACCACACAGCATTACTTGGGAGCTTTTCTCCATCTGTGGAGGCAA
TGAGGGACACCAGGCCCAAAGTCTGAGGCCTTCTCAGGTCTTGCAGTTTCCACCCTTCAGTCCTAGCTCCTGGATCTTTGTTGGTGTAAA
TATGGAAATGCCACATTGGTTAAGTGCCATCATGGAAAACTAGGAACTAGTTTACCTGTAGCTGTACTAGGAAAACTAGGAACTAACGCC
CATGGCAAGTCAAACCCAGAAGTGTTGTCTTAGCTCAATATACCCTGCAAGTGTAATACAAATCGCAGATTAAAACATTTGTACTTATAT
TTGTGAGTATATGGAAAGACACCCAAGAGAAAACTGCGTTTTCTTGCCGAATGGACAGCAGCCCAAGATGAGCATCCAAAGTGAAATGGA
AATACCAGTGCGAGCATGAAAGTTCCTAAGATCTGCAAGCCGGGAGCCCTTCTGGGGTGCCTTCTGAAGAACACTCGAGCCAGCTTGCGG
AGTCGACTGCCCAGCTACTGAGCCCTGGCTGCCTCATACCTGGCGGAGCTCAGCAGACAGCCCTTGTTAGATGGCAGAGGTCGCCAGATG
TTCCTGAGGAACTGGCATCAGGACTTCTGGGACGTAACCACTTCAGTTTCCAATCACTGACTTCCCTGAACCTGGTATCTTCAGATCTGG
GTTGAAGCTGAAGGCTCAGTTGGGGAATCAGGAAAACAAGGGAATCGGAGCCAGCCCAAGCCAACTGACAATAACTTTCTTTTAGTGACT
GCATTAGTTTGGGGTAATGCTGTAACAGGCAGCATTTCAAAAACGGATCCAAATTTGGGTTGAAAATAACTCACCTGGGTCTCCAATCAG
ACCTAACAGAGCCAGAAAGAGGCCTGTACACGGGGACTGGGCAGAAGATCTAATGCCGTTTAAGACTCCTCCCTTGAGACCAGAGAAAAG

>53441_53441_7_MGST1-PKP4_MGST1_chr12_16507204_ENST00000396210_PKP4_chr2_159433783_ENST00000389757_length(amino acids)=1140AA_BP=27
MDDEVFMAFASYATIILSKMMLMSTATAFYRLTRKELQFQRLTRELEVERQIVASQLERCRLGAESPSIASTSSTEKSFPWRSTDVPNTG
VSKPRVSDAVQPNNYLIRTEPEQGTLYSPEQTSLHESEGSLGNSRSSTQMNSYSDSGYQEAGSFHNSQNVSKADNRQQHSFIGSTNNHVV
RNSRAEGQTLVQPSVANRAMRRVSSVPSRAQSPSYVISTGVSPSRGSLRTSLGSGFGSPSVTDPRPLNPSAYSSTTLPAARAASPYSQRP
ASPTAIRRIGSVTSRQTSNPNGPTPQYQTTARVGSPLTLTDAQTRVASPSQGQVGSSSPKRSGMTAVPQHLGPSLQRTVHDMEQFGQQQY
DIYERMVPPRPDSLTGLRSSYASQHSQLGQDLRSAVSPDLHITPIYEGRTYYSPVYRSPNHGTVELQGSQTALYRTGSVGIGNLQRTSSQ
RSTLTYQRNNYALNTTATYAEPYRPIQYRVQECNYNRLQHAVPADDGTTRSPSIDSIQKDPREFAWRDPELPEVIHMLQHQFPSVQANAA
AYLQHLCFGDNKVKMEVCRLGGIKHLVDLLDHRVLEVQKNACGALRNLVFGKSTDENKIAMKNVGGIPALLRLLRKSIDAEVRELVTGVL
WNLSSCDAVKMTIIRDALSTLTNTVIVPHSGWNNSSFDDDHKIKFQTSLVLRNTTGCLRNLSSAGEEARKQMRSCEGLVDSLLYVIHTCV
NTSDYDSKTVENCVCTLRNLSYRLELEVPQARLLGLNELDDLLGKESPSKDSEPSCWGKKKKKKKRTPQEDQWDGVGPIPGLSKSPKGVE
MLWHPSVVKPYLTLLAESSNPATLEGSAGSLQNLSAGNWKFAAYIRAAVRKEKGLPILVELLRMDNDRVVSSVATALRNMALDVRNKELI
GKYAMRDLVNRLPGGNGPSVLSDETMAAICCALHEVTSKNMENAKALADSGGIEKLVNITKGRGDRSSLKVVKAAAQVLNTLWQYRDLRS
IYKKDGWNQNHFITPVSTLERDRFKSHPSLSTTNQQMSPIIQSGSSKPSPIYISSYSSPAREQNRRLQHQQLYYSQDDSNRKNFDAYRLY

--------------------------------------------------------------
>53441_53441_8_MGST1-PKP4_MGST1_chr12_16507204_ENST00000396210_PKP4_chr2_159433783_ENST00000389759_length(transcript)=4369nt_BP=170nt
CCTGCATTGCGCGCGACCCGGCGGCGGGACAGGCTTGCTGCTTCCTCCTCCTCGGCCTCACCGTAATGGATGATGAAGTATTCATGGCTT
TTGCATCCTATGCAACAATTATTCTTTCAAAAATGATGCTTATGAGTACTGCAACTGCATTCTATAGATTGACAAGAAAGGAGCTTCAGT
TTCAGCGACTCACCCGAGAACTGGAAGTGGAAAGGCAGATTGTTGCCAGTCAGCTAGAAAGATGTAGGCTTGGAGCAGAATCACCAAGCA
TCGCCAGCACCAGCTCAACTGAGAAGTCATTTCCTTGGAGATCAACAGACGTGCCAAATACTGGTGTAAGCAAACCTAGAGTTTCTGACG
CTGTCCAGCCCAACAACTATCTCATCAGGACAGAGCCAGAACAAGGAACCCTCTATTCACCAGAACAGACATCTCTCCATGAAAGTGAGG
GATCATTGGGTAACTCAAGAAGTTCAACACAAATGAATTCTTATTCCGACAGTGGATACCAGGAAGCAGGGAGTTTCCACAACAGCCAGA
ACGTGAGCAAGGCAGACAACAGACAGCAGCATTCATTCATAGGATCAACTAACAACCATGTGGTGAGGAATTCAAGAGCTGAAGGACAAA
CACTGGTTCAGCCATCAGTAGCCAATCGGGCCATGAGAAGAGTTAGTTCAGTTCCATCTAGAGCACAGTCTCCTTCTTATGTTATCAGCA
CAGGCGTGTCTCCTTCAAGGGGGTCTCTGAGAACTTCTCTGGGTAGTGGATTTGGCTCTCCGTCAGTGACCGACCCCCGACCTCTGAACC
CCAGTGCATATTCCTCCACCACATTACCTGCTGCACGGGCAGCCTCTCCGTACTCACAGAGACCCGCCTCCCCAACAGCTATACGGCGGA
TTGGGTCAGTCACCTCCCGGCAGACCTCCAATCCCAACGGACCAACCCCTCAATACCAAACCACCGCCAGAGTGGGGTCCCCACTGACCC
TGACGGATGCACAGACTCGAGTAGCTTCCCCATCCCAAGGCCAGGTGGGGTCGTCGTCCCCCAAACGCTCAGGGATGACCGCCGTACCAC
AGCATCTGGGACCTTCACTGCAAAGGACTGTTCATGACATGGAGCAATTCGGACAGCAGCAGTATGACATTTATGAGAGGATGGTTCCAC
CCAGGCCAGACAGCCTGACAGGCTTACGGAGTTCCTATGCTAGTCAGCATAGTCAGCTTGGGCAAGACCTTCGTTCTGCCGTGTCTCCCG
ACTTGCACATTACTCCTATATATGAGGGGAGGACCTATTACAGCCCAGTGTACCGCAGCCCAAACCATGGAACTGTGGAGCTCCAAGGAT
CGCAGACGGCGTTGTATCGCACAGGTTCAGTAGGTATTGGAAATCTACAAAGGACATCCAGCCAACGAAGTACCCTTACATACCAAAGAA
ATAATTATGCTCTGAACACAACAGCTACCTACGCGGAGCCCTACAGGCCTATACAATACCGAGTGCAAGAGTGCAATTATAACAGGCTTC
AGCATGCAGTGCCGGCTGATGATGGCACCACAAGATCCCCATCAATAGACAGCATTCAGAAGGACCCCAGGGAGTTTGCCTGGCGTGATC
CTGAGTTGCCTGAGGTCATTCACATGCTTCAGCACCAGTTCCCATCTGTTCAGGCAAATGCAGCGGCCTACCTGCAGCACCTGTGCTTTG
GTGACAACAAAGTGAAGATGGAGGTGTGTAGGTTAGGGGGAATCAAGCATCTGGTTGACCTTCTGGACCACAGAGTTTTGGAAGTTCAGA
AGAATGCTTGTGGTGCCCTTCGAAACCTCGTTTTTGGCAAGTCTACAGATGAAAATAAAATAGCAATGAAGAATGTTGGTGGGATACCTG
CCTTGTTGCGACTGTTGAGAAAATCTATTGATGCAGAAGTAAGGGAGCTTGTTACAGGAGTTCTTTGGAATTTATCCTCATGTGATGCTG
TAAAAATGACAATCATTCGAGATGCTCTCTCAACCTTAACAAACACTGTGATTGTTCCACATTCTGGATGGAATAACTCTTCTTTTGATG
ATGATCATAAAATTAAATTTCAGACTTCACTAGTTCTGCGTAACACGACAGGTTGCCTAAGGAACCTCAGCTCCGCGGGGGAAGAAGCTC
GGAAGCAAATGCGGTCCTGCGAGGGGCTGGTAGACTCACTGTTGTATGTGATCCACACGTGTGTGAACACATCCGATTACGACAGCAAGA
CGGTGGAGAACTGCGTGTGCACCCTGAGGAACCTGTCCTATCGGCTGGAGCTGGAGGTGCCCCAGGCCCGGTTACTGGGACTGAACGAAT
TGGATGACTTACTAGGAAAAGAGTCTCCCAGCAAAGACTCTGAGCCAAGTTGCTGGGGGAAGAAGAAGAAAAAGAAAAAGAGGACTCCGC
AAGAAGATCAATGGGATGGAGTTGGTCCTATCCCAGGACTGTCGAAGTCCCCCAAAGGGGTTGAGATGCTGTGGCACCCATCGGTGGTAA
AACCATATCTGACTCTTCTAGCAGAAAGTTCCAACCCAGCCACCTTGGAAGGCTCTGCAGGGTCTCTCCAGAACCTCTCTGCTGGCAACT
GGAAGTTTGCAGCATATATCCGGGCGGCCGTCCGAAAAGAAAAGGGGCTCCCCATCCTTGTGGAGCTTCTGAGAATGGATAACGATAGAG
TTGTTTCTTCCGTGGCAACAGCCTTGAGGAATATGGCACTAGATGTTCGCAACAAGGAGCTCATAGGCAAATACGCCATGCGAGACCTGG
TCAACCGGCTCCCCGGCGGCAATGGCCCCAGTGTCTTGTCTGATGAGACCATGGCAGCCATCTGCTGTGCTCTGCACGAGGTCACCAGCA
AAAACATGGAGAACGCAAAAGCCCTGGCCGACTCAGGAGGCATAGAGAAGCTGGTGAACATAACCAAAGGCAGGGGCGACAGATCATCTC
TGAAAGTGGTGAAGGCAGCAGCCCAGGTCTTGAATACATTATGGCAATATCGGGACCTCCGGAGCATTTATAAAAAGGATGGGTGGAATC
AGAACCATTTTATTACACCTGTGTCGACATTGGAGCGAGACCGATTCAAATCACATCCTTCCTTGTCTACCACCAACCAACAGATGTCAC
CCATCATTCAGTCAGTCGGCAGCACCTCTTCCTCACCAGCACTGTTAGGAATCAGAGACCCTCGCTCTGAATACGATAGGACCCAGCCAC
CTATGCAGTATTACAATAGCCAAGGGGATGCCACACATAAAGGCCTGTACCCTGGCTCCAGCAAACCTTCACCAATTTACATCAGTTCCT
ATTCCTCACCAGCAAGAGAACAAAATAGACGGCTACAGCATCAACAGCTGTATTATAGTCAAGATGACTCCAACAGAAAGAACTTTGATG
CATACAGATTGTATTTGCAGTCTCCTCATAGCTATGAAGATCCTTATTTTGATGACCGAGTTCACTTTCCAGCTTCTACTGATTACTCAA
CACAGTATGGACTGAAATCGACCACAAATTATGTAGACTTTTATTCCACTAAACGACCTTCTTATAGAGCAGAACAGTACCCAGGGTCCC
CAGACTCATGGGTGTAGCATCAAGATGCCCAACAGAGGAACTCTTTCTTTCTAACCTTGTTCAGATTGAGGTGAAAAGTCCATCTTGCTG
ATTTGATGATTGAAATGTGAAAGTGAAGTGGAAGGAATGAATGAAGTGTGTTTTTTTTTTCTTTTTTGAGGAATTATCAGGGAAGTGAGG
AAATGTTTGGGAGAGGACTTTCTAAGCTCTATTTAGGTGTTAGATCTAATTACTTATAGATTCTGTAGTCTGGTGAAGGTGTGGGTGACG
TGATGAGAGGTTTGAGAAATGGGTGAAATGAAATGGGGGATATGTAGGTCAAATCAAATTAAAGATGATTTTTTTAATGTGAATAAAGTT
ATGTTCTGATAGTTTGTACAGAAAAAATAAAATGGATGCCCATGTTTTATTGCTATTACTAAATGTCAAGATTGTATGCTATTATGTCTT
GTAAATTTCTTTTGTTGGTGTAAATATGGAAATGCCACATTGGTTAAGTGCCATCATTTGTAATGCAATGTGTCACTTGAAAAGAGATTT
GAAGAAACTGACAACTTCAAAAACAAATGAGAAGCCCAAGGAACTGTGAGCAATTAAAAGCAAACCGCGACACCCTTTGTCTCCACCACA
CATAGTGTACTTTGGAAGCACAACGTCCAGGCTGGTACCGCAGCGCCATGCCCATTCCTCGCCTCATTCATAGGACACTTCACTGCCATT

>53441_53441_8_MGST1-PKP4_MGST1_chr12_16507204_ENST00000396210_PKP4_chr2_159433783_ENST00000389759_length(amino acids)=1183AA_BP=27
MDDEVFMAFASYATIILSKMMLMSTATAFYRLTRKELQFQRLTRELEVERQIVASQLERCRLGAESPSIASTSSTEKSFPWRSTDVPNTG
VSKPRVSDAVQPNNYLIRTEPEQGTLYSPEQTSLHESEGSLGNSRSSTQMNSYSDSGYQEAGSFHNSQNVSKADNRQQHSFIGSTNNHVV
RNSRAEGQTLVQPSVANRAMRRVSSVPSRAQSPSYVISTGVSPSRGSLRTSLGSGFGSPSVTDPRPLNPSAYSSTTLPAARAASPYSQRP
ASPTAIRRIGSVTSRQTSNPNGPTPQYQTTARVGSPLTLTDAQTRVASPSQGQVGSSSPKRSGMTAVPQHLGPSLQRTVHDMEQFGQQQY
DIYERMVPPRPDSLTGLRSSYASQHSQLGQDLRSAVSPDLHITPIYEGRTYYSPVYRSPNHGTVELQGSQTALYRTGSVGIGNLQRTSSQ
RSTLTYQRNNYALNTTATYAEPYRPIQYRVQECNYNRLQHAVPADDGTTRSPSIDSIQKDPREFAWRDPELPEVIHMLQHQFPSVQANAA
AYLQHLCFGDNKVKMEVCRLGGIKHLVDLLDHRVLEVQKNACGALRNLVFGKSTDENKIAMKNVGGIPALLRLLRKSIDAEVRELVTGVL
WNLSSCDAVKMTIIRDALSTLTNTVIVPHSGWNNSSFDDDHKIKFQTSLVLRNTTGCLRNLSSAGEEARKQMRSCEGLVDSLLYVIHTCV
NTSDYDSKTVENCVCTLRNLSYRLELEVPQARLLGLNELDDLLGKESPSKDSEPSCWGKKKKKKKRTPQEDQWDGVGPIPGLSKSPKGVE
MLWHPSVVKPYLTLLAESSNPATLEGSAGSLQNLSAGNWKFAAYIRAAVRKEKGLPILVELLRMDNDRVVSSVATALRNMALDVRNKELI
GKYAMRDLVNRLPGGNGPSVLSDETMAAICCALHEVTSKNMENAKALADSGGIEKLVNITKGRGDRSSLKVVKAAAQVLNTLWQYRDLRS
IYKKDGWNQNHFITPVSTLERDRFKSHPSLSTTNQQMSPIIQSVGSTSSSPALLGIRDPRSEYDRTQPPMQYYNSQGDATHKGLYPGSSK
PSPIYISSYSSPAREQNRRLQHQQLYYSQDDSNRKNFDAYRLYLQSPHSYEDPYFDDRVHFPASTDYSTQYGLKSTTNYVDFYSTKRPSY

--------------------------------------------------------------
>53441_53441_9_MGST1-PKP4_MGST1_chr12_16507204_ENST00000535309_PKP4_chr2_159433783_ENST00000389757_length(transcript)=5645nt_BP=125nt
TCCTCCTCGGCCTCACCGTAATGGATGATGAAGTATTCATGGCTTTTGCATCCTATGCAACAATTATTCTTTCAAAAATGATGCTTATGA
GTACTGCAACTGCATTCTATAGATTGACAAGAAAGGAGCTTCAGTTTCAGCGACTCACCCGAGAACTGGAAGTGGAAAGGCAGATTGTTG
CCAGTCAGCTAGAAAGATGTAGGCTTGGAGCAGAATCACCAAGCATCGCCAGCACCAGCTCAACTGAGAAGTCATTTCCTTGGAGATCAA
CAGACGTGCCAAATACTGGTGTAAGCAAACCTAGAGTTTCTGACGCTGTCCAGCCCAACAACTATCTCATCAGGACAGAGCCAGAACAAG
GAACCCTCTATTCACCAGAACAGACATCTCTCCATGAAAGTGAGGGATCATTGGGTAACTCAAGAAGTTCAACACAAATGAATTCTTATT
CCGACAGTGGATACCAGGAAGCAGGGAGTTTCCACAACAGCCAGAACGTGAGCAAGGCAGACAACAGACAGCAGCATTCATTCATAGGAT
CAACTAACAACCATGTGGTGAGGAATTCAAGAGCTGAAGGACAAACACTGGTTCAGCCATCAGTAGCCAATCGGGCCATGAGAAGAGTTA
GTTCAGTTCCATCTAGAGCACAGTCTCCTTCTTATGTTATCAGCACAGGCGTGTCTCCTTCAAGGGGGTCTCTGAGAACTTCTCTGGGTA
GTGGATTTGGCTCTCCGTCAGTGACCGACCCCCGACCTCTGAACCCCAGTGCATATTCCTCCACCACATTACCTGCTGCACGGGCAGCCT
CTCCGTACTCACAGAGACCCGCCTCCCCAACAGCTATACGGCGGATTGGGTCAGTCACCTCCCGGCAGACCTCCAATCCCAACGGACCAA
CCCCTCAATACCAAACCACCGCCAGAGTGGGGTCCCCACTGACCCTGACGGATGCACAGACTCGAGTAGCTTCCCCATCCCAAGGCCAGG
TGGGGTCGTCGTCCCCCAAACGCTCAGGGATGACCGCCGTACCACAGCATCTGGGACCTTCACTGCAAAGGACTGTTCATGACATGGAGC
AATTCGGACAGCAGCAGTATGACATTTATGAGAGGATGGTTCCACCCAGGCCAGACAGCCTGACAGGCTTACGGAGTTCCTATGCTAGTC
AGCATAGTCAGCTTGGGCAAGACCTTCGTTCTGCCGTGTCTCCCGACTTGCACATTACTCCTATATATGAGGGGAGGACCTATTACAGCC
CAGTGTACCGCAGCCCAAACCATGGAACTGTGGAGCTCCAAGGATCGCAGACGGCGTTGTATCGCACAGGTTCAGTAGGTATTGGAAATC
TACAAAGGACATCCAGCCAACGAAGTACCCTTACATACCAAAGAAATAATTATGCTCTGAACACAACAGCTACCTACGCGGAGCCCTACA
GGCCTATACAATACCGAGTGCAAGAGTGCAATTATAACAGGCTTCAGCATGCAGTGCCGGCTGATGATGGCACCACAAGATCCCCATCAA
TAGACAGCATTCAGAAGGACCCCAGGGAGTTTGCCTGGCGTGATCCTGAGTTGCCTGAGGTCATTCACATGCTTCAGCACCAGTTCCCAT
CTGTTCAGGCAAATGCAGCGGCCTACCTGCAGCACCTGTGCTTTGGTGACAACAAAGTGAAGATGGAGGTGTGTAGGTTAGGGGGAATCA
AGCATCTGGTTGACCTTCTGGACCACAGAGTTTTGGAAGTTCAGAAGAATGCTTGTGGTGCCCTTCGAAACCTCGTTTTTGGCAAGTCTA
CAGATGAAAATAAAATAGCAATGAAGAATGTTGGTGGGATACCTGCCTTGTTGCGACTGTTGAGAAAATCTATTGATGCAGAAGTAAGGG
AGCTTGTTACAGGAGTTCTTTGGAATTTATCCTCATGTGATGCTGTAAAAATGACAATCATTCGAGATGCTCTCTCAACCTTAACAAACA
CTGTGATTGTTCCACATTCTGGATGGAATAACTCTTCTTTTGATGATGATCATAAAATTAAATTTCAGACTTCACTAGTTCTGCGTAACA
CGACAGGTTGCCTAAGGAACCTCAGCTCCGCGGGGGAAGAAGCTCGGAAGCAAATGCGGTCCTGCGAGGGGCTGGTAGACTCACTGTTGT
ATGTGATCCACACGTGTGTGAACACATCCGATTACGACAGCAAGACGGTGGAGAACTGCGTGTGCACCCTGAGGAACCTGTCCTATCGGC
TGGAGCTGGAGGTGCCCCAGGCCCGGTTACTGGGACTGAACGAATTGGATGACTTACTAGGAAAAGAGTCTCCCAGCAAAGACTCTGAGC
CAAGTTGCTGGGGGAAGAAGAAGAAAAAGAAAAAGAGGACTCCGCAAGAAGATCAATGGGATGGAGTTGGTCCTATCCCAGGACTGTCGA
AGTCCCCCAAAGGGGTTGAGATGCTGTGGCACCCATCGGTGGTAAAACCATATCTGACTCTTCTAGCAGAAAGTTCCAACCCAGCCACCT
TGGAAGGCTCTGCAGGGTCTCTCCAGAACCTCTCTGCTGGCAACTGGAAGTTTGCAGCATATATCCGGGCGGCCGTCCGAAAAGAAAAGG
GGCTCCCCATCCTTGTGGAGCTTCTGAGAATGGATAACGATAGAGTTGTTTCTTCCGTGGCAACAGCCTTGAGGAATATGGCACTAGATG
TTCGCAACAAGGAGCTCATAGGCAAATACGCCATGCGAGACCTGGTCAACCGGCTCCCCGGCGGCAATGGCCCCAGTGTCTTGTCTGATG
AGACCATGGCAGCCATCTGCTGTGCTCTGCACGAGGTCACCAGCAAAAACATGGAGAACGCAAAAGCCCTGGCCGACTCAGGAGGCATAG
AGAAGCTGGTGAACATAACCAAAGGCAGGGGCGACAGATCATCTCTGAAAGTGGTGAAGGCAGCAGCCCAGGTCTTGAATACATTATGGC
AATATCGGGACCTCCGGAGCATTTATAAAAAGGATGGGTGGAATCAGAACCATTTTATTACACCTGTGTCGACATTGGAGCGAGACCGAT
TCAAATCACATCCTTCCTTGTCTACCACCAACCAACAGATGTCACCCATCATTCAGTCAGGCTCCAGCAAACCTTCACCAATTTACATCA
GTTCCTATTCCTCACCAGCAAGAGAACAAAATAGACGGCTACAGCATCAACAGCTGTATTATAGTCAAGATGACTCCAACAGAAAGAACT
TTGATGCATACAGATTGTATTTGCAGTCTCCTCATAGCTATGAAGATCCTTATTTTGATGACCGAGTTCACTTTCCAGCTTCTACTGATT
ACTCAACACAGTATGGACTGAAATCGACCACAAATTATGTAGACTTTTATTCCACTAAACGACCTTCTTATAGAGCAGAACAGTACCCAG
GGTCCCCAGACTCATGGGTGTAGCATCAAGATGCCCAACAGAGGAACTCTTTCTTTCTAACCTTGTTCAGATTGAGGTGAAAAGTCCATC
TTGCTGATTTGATGATTGAAATGTGAAAGTGAAGTGGAAGGAATGAATGAAGTGTGTTTTTTTTTTCTTTTTTGAGGAATTATCAGGGAA
GTGAGGAAATGTTTGGGAGAGGACTTTCTAAGCTCTATTTAGGTGTTAGATCTAATTACTTATAGATTCTGTAGTCTGGTGAAGGTGTGG
GTGACGTGATGAGAGGTTTGAGAAATGGGTGAAATGAAATGGGGGATATGTAGGTCAAATCAAATTAAAGATGATTTTTTTAATGTGAAT
AAAGTTATGTTCTGATAGTTTGTACAGAAAAAATAAAATGGATGCCCATGTTTTATTGCTATTACTAAATGTCAAGATTGTATGCTATTA
TGTCTTGTAAATTTCTTTTGTTGGTGTAAATATGGAAATGCCACATTGGTTAAGTGCCATCATTTGTAATGCAATGTGTCACTTGAAAAG
AGATTTGAAGAAACTGACAACTTCAAAAACAAATGAGAAGCCCAAGGAACTGTGAGCAATTAAAAGCAAACCGCGACACCCTTTGTCTCC
ACCACACATAGTGTACTTTGGAAGCACAACGTCCAGGCTGGTACCGCAGCGCCATGCCCATTCCTCGCCTCATTCATAGGACACTTCACT
GCCATTTTCTATTCACATAAAAGAAAAATAAATGTGGAAATTTCATCCTTGGAAATGTGTTCTCTATTCTTTTTAAATTTTCTGTATGTA
TATGGAAAAGCAGATTTAATAAACACAAATCTAACTGCATGTTGGAAACAGCTTAAATATATCCAACTATTTTGGACTTTTCCTAGTTTT
ATATATACTTTGTGAGATGGACCACACCAAAGAATTAAGCTATGAACTTTCCTTCTCTGACAATCTAGGTATTTTTATTTCTTGGAATCT
ATTTCAAATCTAGTACCCTTCACAGTTTTATCATGTTTTTCTATATGGCAGCCCGGCCATAACTAAATTGATATAACTAAATTGATACCT
TTTTTAGGCATCATTTTTAAGAACTCAGTAACTTCATAAGAACAGTGGTTGGGCATTTATTTGGTTGGCACCAATATAATAAAACAAATT
CTAAATTATGATCTTGTTGAGTAATTCCTGGATTAGGAATGGGGTAGGGGAAAGAAGCCCCACAGTAGGGTAGGCAGAGGTCCCCATAGC
ACCACACAGCATTACTTGGGAGCTTTTCTCCATCTGTGGAGGCAATGAGGGACACCAGGCCCAAAGTCTGAGGCCTTCTCAGGTCTTGCA
GTTTCCACCCTTCAGTCCTAGCTCCTGGATCTTTGTTGGTGTAAATATGGAAATGCCACATTGGTTAAGTGCCATCATGGAAAACTAGGA
ACTAGTTTACCTGTAGCTGTACTAGGAAAACTAGGAACTAACGCCCATGGCAAGTCAAACCCAGAAGTGTTGTCTTAGCTCAATATACCC
TGCAAGTGTAATACAAATCGCAGATTAAAACATTTGTACTTATATTTGTGAGTATATGGAAAGACACCCAAGAGAAAACTGCGTTTTCTT
GCCGAATGGACAGCAGCCCAAGATGAGCATCCAAAGTGAAATGGAAATACCAGTGCGAGCATGAAAGTTCCTAAGATCTGCAAGCCGGGA
GCCCTTCTGGGGTGCCTTCTGAAGAACACTCGAGCCAGCTTGCGGAGTCGACTGCCCAGCTACTGAGCCCTGGCTGCCTCATACCTGGCG
GAGCTCAGCAGACAGCCCTTGTTAGATGGCAGAGGTCGCCAGATGTTCCTGAGGAACTGGCATCAGGACTTCTGGGACGTAACCACTTCA
GTTTCCAATCACTGACTTCCCTGAACCTGGTATCTTCAGATCTGGGTTGAAGCTGAAGGCTCAGTTGGGGAATCAGGAAAACAAGGGAAT
CGGAGCCAGCCCAAGCCAACTGACAATAACTTTCTTTTAGTGACTGCATTAGTTTGGGGTAATGCTGTAACAGGCAGCATTTCAAAAACG
GATCCAAATTTGGGTTGAAAATAACTCACCTGGGTCTCCAATCAGACCTAACAGAGCCAGAAAGAGGCCTGTACACGGGGACTGGGCAGA

>53441_53441_9_MGST1-PKP4_MGST1_chr12_16507204_ENST00000535309_PKP4_chr2_159433783_ENST00000389757_length(amino acids)=1140AA_BP=27
MDDEVFMAFASYATIILSKMMLMSTATAFYRLTRKELQFQRLTRELEVERQIVASQLERCRLGAESPSIASTSSTEKSFPWRSTDVPNTG
VSKPRVSDAVQPNNYLIRTEPEQGTLYSPEQTSLHESEGSLGNSRSSTQMNSYSDSGYQEAGSFHNSQNVSKADNRQQHSFIGSTNNHVV
RNSRAEGQTLVQPSVANRAMRRVSSVPSRAQSPSYVISTGVSPSRGSLRTSLGSGFGSPSVTDPRPLNPSAYSSTTLPAARAASPYSQRP
ASPTAIRRIGSVTSRQTSNPNGPTPQYQTTARVGSPLTLTDAQTRVASPSQGQVGSSSPKRSGMTAVPQHLGPSLQRTVHDMEQFGQQQY
DIYERMVPPRPDSLTGLRSSYASQHSQLGQDLRSAVSPDLHITPIYEGRTYYSPVYRSPNHGTVELQGSQTALYRTGSVGIGNLQRTSSQ
RSTLTYQRNNYALNTTATYAEPYRPIQYRVQECNYNRLQHAVPADDGTTRSPSIDSIQKDPREFAWRDPELPEVIHMLQHQFPSVQANAA
AYLQHLCFGDNKVKMEVCRLGGIKHLVDLLDHRVLEVQKNACGALRNLVFGKSTDENKIAMKNVGGIPALLRLLRKSIDAEVRELVTGVL
WNLSSCDAVKMTIIRDALSTLTNTVIVPHSGWNNSSFDDDHKIKFQTSLVLRNTTGCLRNLSSAGEEARKQMRSCEGLVDSLLYVIHTCV
NTSDYDSKTVENCVCTLRNLSYRLELEVPQARLLGLNELDDLLGKESPSKDSEPSCWGKKKKKKKRTPQEDQWDGVGPIPGLSKSPKGVE
MLWHPSVVKPYLTLLAESSNPATLEGSAGSLQNLSAGNWKFAAYIRAAVRKEKGLPILVELLRMDNDRVVSSVATALRNMALDVRNKELI
GKYAMRDLVNRLPGGNGPSVLSDETMAAICCALHEVTSKNMENAKALADSGGIEKLVNITKGRGDRSSLKVVKAAAQVLNTLWQYRDLRS
IYKKDGWNQNHFITPVSTLERDRFKSHPSLSTTNQQMSPIIQSGSSKPSPIYISSYSSPAREQNRRLQHQQLYYSQDDSNRKNFDAYRLY

--------------------------------------------------------------
>53441_53441_10_MGST1-PKP4_MGST1_chr12_16507204_ENST00000535309_PKP4_chr2_159433783_ENST00000389759_length(transcript)=4324nt_BP=125nt
TCCTCCTCGGCCTCACCGTAATGGATGATGAAGTATTCATGGCTTTTGCATCCTATGCAACAATTATTCTTTCAAAAATGATGCTTATGA
GTACTGCAACTGCATTCTATAGATTGACAAGAAAGGAGCTTCAGTTTCAGCGACTCACCCGAGAACTGGAAGTGGAAAGGCAGATTGTTG
CCAGTCAGCTAGAAAGATGTAGGCTTGGAGCAGAATCACCAAGCATCGCCAGCACCAGCTCAACTGAGAAGTCATTTCCTTGGAGATCAA
CAGACGTGCCAAATACTGGTGTAAGCAAACCTAGAGTTTCTGACGCTGTCCAGCCCAACAACTATCTCATCAGGACAGAGCCAGAACAAG
GAACCCTCTATTCACCAGAACAGACATCTCTCCATGAAAGTGAGGGATCATTGGGTAACTCAAGAAGTTCAACACAAATGAATTCTTATT
CCGACAGTGGATACCAGGAAGCAGGGAGTTTCCACAACAGCCAGAACGTGAGCAAGGCAGACAACAGACAGCAGCATTCATTCATAGGAT
CAACTAACAACCATGTGGTGAGGAATTCAAGAGCTGAAGGACAAACACTGGTTCAGCCATCAGTAGCCAATCGGGCCATGAGAAGAGTTA
GTTCAGTTCCATCTAGAGCACAGTCTCCTTCTTATGTTATCAGCACAGGCGTGTCTCCTTCAAGGGGGTCTCTGAGAACTTCTCTGGGTA
GTGGATTTGGCTCTCCGTCAGTGACCGACCCCCGACCTCTGAACCCCAGTGCATATTCCTCCACCACATTACCTGCTGCACGGGCAGCCT
CTCCGTACTCACAGAGACCCGCCTCCCCAACAGCTATACGGCGGATTGGGTCAGTCACCTCCCGGCAGACCTCCAATCCCAACGGACCAA
CCCCTCAATACCAAACCACCGCCAGAGTGGGGTCCCCACTGACCCTGACGGATGCACAGACTCGAGTAGCTTCCCCATCCCAAGGCCAGG
TGGGGTCGTCGTCCCCCAAACGCTCAGGGATGACCGCCGTACCACAGCATCTGGGACCTTCACTGCAAAGGACTGTTCATGACATGGAGC
AATTCGGACAGCAGCAGTATGACATTTATGAGAGGATGGTTCCACCCAGGCCAGACAGCCTGACAGGCTTACGGAGTTCCTATGCTAGTC
AGCATAGTCAGCTTGGGCAAGACCTTCGTTCTGCCGTGTCTCCCGACTTGCACATTACTCCTATATATGAGGGGAGGACCTATTACAGCC
CAGTGTACCGCAGCCCAAACCATGGAACTGTGGAGCTCCAAGGATCGCAGACGGCGTTGTATCGCACAGGTTCAGTAGGTATTGGAAATC
TACAAAGGACATCCAGCCAACGAAGTACCCTTACATACCAAAGAAATAATTATGCTCTGAACACAACAGCTACCTACGCGGAGCCCTACA
GGCCTATACAATACCGAGTGCAAGAGTGCAATTATAACAGGCTTCAGCATGCAGTGCCGGCTGATGATGGCACCACAAGATCCCCATCAA
TAGACAGCATTCAGAAGGACCCCAGGGAGTTTGCCTGGCGTGATCCTGAGTTGCCTGAGGTCATTCACATGCTTCAGCACCAGTTCCCAT
CTGTTCAGGCAAATGCAGCGGCCTACCTGCAGCACCTGTGCTTTGGTGACAACAAAGTGAAGATGGAGGTGTGTAGGTTAGGGGGAATCA
AGCATCTGGTTGACCTTCTGGACCACAGAGTTTTGGAAGTTCAGAAGAATGCTTGTGGTGCCCTTCGAAACCTCGTTTTTGGCAAGTCTA
CAGATGAAAATAAAATAGCAATGAAGAATGTTGGTGGGATACCTGCCTTGTTGCGACTGTTGAGAAAATCTATTGATGCAGAAGTAAGGG
AGCTTGTTACAGGAGTTCTTTGGAATTTATCCTCATGTGATGCTGTAAAAATGACAATCATTCGAGATGCTCTCTCAACCTTAACAAACA
CTGTGATTGTTCCACATTCTGGATGGAATAACTCTTCTTTTGATGATGATCATAAAATTAAATTTCAGACTTCACTAGTTCTGCGTAACA
CGACAGGTTGCCTAAGGAACCTCAGCTCCGCGGGGGAAGAAGCTCGGAAGCAAATGCGGTCCTGCGAGGGGCTGGTAGACTCACTGTTGT
ATGTGATCCACACGTGTGTGAACACATCCGATTACGACAGCAAGACGGTGGAGAACTGCGTGTGCACCCTGAGGAACCTGTCCTATCGGC
TGGAGCTGGAGGTGCCCCAGGCCCGGTTACTGGGACTGAACGAATTGGATGACTTACTAGGAAAAGAGTCTCCCAGCAAAGACTCTGAGC
CAAGTTGCTGGGGGAAGAAGAAGAAAAAGAAAAAGAGGACTCCGCAAGAAGATCAATGGGATGGAGTTGGTCCTATCCCAGGACTGTCGA
AGTCCCCCAAAGGGGTTGAGATGCTGTGGCACCCATCGGTGGTAAAACCATATCTGACTCTTCTAGCAGAAAGTTCCAACCCAGCCACCT
TGGAAGGCTCTGCAGGGTCTCTCCAGAACCTCTCTGCTGGCAACTGGAAGTTTGCAGCATATATCCGGGCGGCCGTCCGAAAAGAAAAGG
GGCTCCCCATCCTTGTGGAGCTTCTGAGAATGGATAACGATAGAGTTGTTTCTTCCGTGGCAACAGCCTTGAGGAATATGGCACTAGATG
TTCGCAACAAGGAGCTCATAGGCAAATACGCCATGCGAGACCTGGTCAACCGGCTCCCCGGCGGCAATGGCCCCAGTGTCTTGTCTGATG
AGACCATGGCAGCCATCTGCTGTGCTCTGCACGAGGTCACCAGCAAAAACATGGAGAACGCAAAAGCCCTGGCCGACTCAGGAGGCATAG
AGAAGCTGGTGAACATAACCAAAGGCAGGGGCGACAGATCATCTCTGAAAGTGGTGAAGGCAGCAGCCCAGGTCTTGAATACATTATGGC
AATATCGGGACCTCCGGAGCATTTATAAAAAGGATGGGTGGAATCAGAACCATTTTATTACACCTGTGTCGACATTGGAGCGAGACCGAT
TCAAATCACATCCTTCCTTGTCTACCACCAACCAACAGATGTCACCCATCATTCAGTCAGTCGGCAGCACCTCTTCCTCACCAGCACTGT
TAGGAATCAGAGACCCTCGCTCTGAATACGATAGGACCCAGCCACCTATGCAGTATTACAATAGCCAAGGGGATGCCACACATAAAGGCC
TGTACCCTGGCTCCAGCAAACCTTCACCAATTTACATCAGTTCCTATTCCTCACCAGCAAGAGAACAAAATAGACGGCTACAGCATCAAC
AGCTGTATTATAGTCAAGATGACTCCAACAGAAAGAACTTTGATGCATACAGATTGTATTTGCAGTCTCCTCATAGCTATGAAGATCCTT
ATTTTGATGACCGAGTTCACTTTCCAGCTTCTACTGATTACTCAACACAGTATGGACTGAAATCGACCACAAATTATGTAGACTTTTATT
CCACTAAACGACCTTCTTATAGAGCAGAACAGTACCCAGGGTCCCCAGACTCATGGGTGTAGCATCAAGATGCCCAACAGAGGAACTCTT
TCTTTCTAACCTTGTTCAGATTGAGGTGAAAAGTCCATCTTGCTGATTTGATGATTGAAATGTGAAAGTGAAGTGGAAGGAATGAATGAA
GTGTGTTTTTTTTTTCTTTTTTGAGGAATTATCAGGGAAGTGAGGAAATGTTTGGGAGAGGACTTTCTAAGCTCTATTTAGGTGTTAGAT
CTAATTACTTATAGATTCTGTAGTCTGGTGAAGGTGTGGGTGACGTGATGAGAGGTTTGAGAAATGGGTGAAATGAAATGGGGGATATGT
AGGTCAAATCAAATTAAAGATGATTTTTTTAATGTGAATAAAGTTATGTTCTGATAGTTTGTACAGAAAAAATAAAATGGATGCCCATGT
TTTATTGCTATTACTAAATGTCAAGATTGTATGCTATTATGTCTTGTAAATTTCTTTTGTTGGTGTAAATATGGAAATGCCACATTGGTT
AAGTGCCATCATTTGTAATGCAATGTGTCACTTGAAAAGAGATTTGAAGAAACTGACAACTTCAAAAACAAATGAGAAGCCCAAGGAACT
GTGAGCAATTAAAAGCAAACCGCGACACCCTTTGTCTCCACCACACATAGTGTACTTTGGAAGCACAACGTCCAGGCTGGTACCGCAGCG
CCATGCCCATTCCTCGCCTCATTCATAGGACACTTCACTGCCATTTTCTATTCACATAAAAGAAAAATAAATGTGGAAATTTCATCCTTG

>53441_53441_10_MGST1-PKP4_MGST1_chr12_16507204_ENST00000535309_PKP4_chr2_159433783_ENST00000389759_length(amino acids)=1183AA_BP=27
MDDEVFMAFASYATIILSKMMLMSTATAFYRLTRKELQFQRLTRELEVERQIVASQLERCRLGAESPSIASTSSTEKSFPWRSTDVPNTG
VSKPRVSDAVQPNNYLIRTEPEQGTLYSPEQTSLHESEGSLGNSRSSTQMNSYSDSGYQEAGSFHNSQNVSKADNRQQHSFIGSTNNHVV
RNSRAEGQTLVQPSVANRAMRRVSSVPSRAQSPSYVISTGVSPSRGSLRTSLGSGFGSPSVTDPRPLNPSAYSSTTLPAARAASPYSQRP
ASPTAIRRIGSVTSRQTSNPNGPTPQYQTTARVGSPLTLTDAQTRVASPSQGQVGSSSPKRSGMTAVPQHLGPSLQRTVHDMEQFGQQQY
DIYERMVPPRPDSLTGLRSSYASQHSQLGQDLRSAVSPDLHITPIYEGRTYYSPVYRSPNHGTVELQGSQTALYRTGSVGIGNLQRTSSQ
RSTLTYQRNNYALNTTATYAEPYRPIQYRVQECNYNRLQHAVPADDGTTRSPSIDSIQKDPREFAWRDPELPEVIHMLQHQFPSVQANAA
AYLQHLCFGDNKVKMEVCRLGGIKHLVDLLDHRVLEVQKNACGALRNLVFGKSTDENKIAMKNVGGIPALLRLLRKSIDAEVRELVTGVL
WNLSSCDAVKMTIIRDALSTLTNTVIVPHSGWNNSSFDDDHKIKFQTSLVLRNTTGCLRNLSSAGEEARKQMRSCEGLVDSLLYVIHTCV
NTSDYDSKTVENCVCTLRNLSYRLELEVPQARLLGLNELDDLLGKESPSKDSEPSCWGKKKKKKKRTPQEDQWDGVGPIPGLSKSPKGVE
MLWHPSVVKPYLTLLAESSNPATLEGSAGSLQNLSAGNWKFAAYIRAAVRKEKGLPILVELLRMDNDRVVSSVATALRNMALDVRNKELI
GKYAMRDLVNRLPGGNGPSVLSDETMAAICCALHEVTSKNMENAKALADSGGIEKLVNITKGRGDRSSLKVVKAAAQVLNTLWQYRDLRS
IYKKDGWNQNHFITPVSTLERDRFKSHPSLSTTNQQMSPIIQSVGSTSSSPALLGIRDPRSEYDRTQPPMQYYNSQGDATHKGLYPGSSK
PSPIYISSYSSPAREQNRRLQHQQLYYSQDDSNRKNFDAYRLYLQSPHSYEDPYFDDRVHFPASTDYSTQYGLKSTTNYVDFYSTKRPSY

--------------------------------------------------------------
>53441_53441_11_MGST1-PKP4_MGST1_chr12_16507204_ENST00000540056_PKP4_chr2_159433783_ENST00000389757_length(transcript)=5749nt_BP=229nt
GGCAGATGGAAGACTTGGGGGGGTCTCTGCCAGCTGGAAGTGCTTGGCTCCACTTAGCAGCTAAACTTAGCTTTTCAATCGATCGCTTTT
GAAAGGGAATTGTATTTCTGTCCCCGTGCGGGTAATGGATGATGAAGTATTCATGGCTTTTGCATCCTATGCAACAATTATTCTTTCAAA
AATGATGCTTATGAGTACTGCAACTGCATTCTATAGATTGACAAGAAAGGAGCTTCAGTTTCAGCGACTCACCCGAGAACTGGAAGTGGA
AAGGCAGATTGTTGCCAGTCAGCTAGAAAGATGTAGGCTTGGAGCAGAATCACCAAGCATCGCCAGCACCAGCTCAACTGAGAAGTCATT
TCCTTGGAGATCAACAGACGTGCCAAATACTGGTGTAAGCAAACCTAGAGTTTCTGACGCTGTCCAGCCCAACAACTATCTCATCAGGAC
AGAGCCAGAACAAGGAACCCTCTATTCACCAGAACAGACATCTCTCCATGAAAGTGAGGGATCATTGGGTAACTCAAGAAGTTCAACACA
AATGAATTCTTATTCCGACAGTGGATACCAGGAAGCAGGGAGTTTCCACAACAGCCAGAACGTGAGCAAGGCAGACAACAGACAGCAGCA
TTCATTCATAGGATCAACTAACAACCATGTGGTGAGGAATTCAAGAGCTGAAGGACAAACACTGGTTCAGCCATCAGTAGCCAATCGGGC
CATGAGAAGAGTTAGTTCAGTTCCATCTAGAGCACAGTCTCCTTCTTATGTTATCAGCACAGGCGTGTCTCCTTCAAGGGGGTCTCTGAG
AACTTCTCTGGGTAGTGGATTTGGCTCTCCGTCAGTGACCGACCCCCGACCTCTGAACCCCAGTGCATATTCCTCCACCACATTACCTGC
TGCACGGGCAGCCTCTCCGTACTCACAGAGACCCGCCTCCCCAACAGCTATACGGCGGATTGGGTCAGTCACCTCCCGGCAGACCTCCAA
TCCCAACGGACCAACCCCTCAATACCAAACCACCGCCAGAGTGGGGTCCCCACTGACCCTGACGGATGCACAGACTCGAGTAGCTTCCCC
ATCCCAAGGCCAGGTGGGGTCGTCGTCCCCCAAACGCTCAGGGATGACCGCCGTACCACAGCATCTGGGACCTTCACTGCAAAGGACTGT
TCATGACATGGAGCAATTCGGACAGCAGCAGTATGACATTTATGAGAGGATGGTTCCACCCAGGCCAGACAGCCTGACAGGCTTACGGAG
TTCCTATGCTAGTCAGCATAGTCAGCTTGGGCAAGACCTTCGTTCTGCCGTGTCTCCCGACTTGCACATTACTCCTATATATGAGGGGAG
GACCTATTACAGCCCAGTGTACCGCAGCCCAAACCATGGAACTGTGGAGCTCCAAGGATCGCAGACGGCGTTGTATCGCACAGGTTCAGT
AGGTATTGGAAATCTACAAAGGACATCCAGCCAACGAAGTACCCTTACATACCAAAGAAATAATTATGCTCTGAACACAACAGCTACCTA
CGCGGAGCCCTACAGGCCTATACAATACCGAGTGCAAGAGTGCAATTATAACAGGCTTCAGCATGCAGTGCCGGCTGATGATGGCACCAC
AAGATCCCCATCAATAGACAGCATTCAGAAGGACCCCAGGGAGTTTGCCTGGCGTGATCCTGAGTTGCCTGAGGTCATTCACATGCTTCA
GCACCAGTTCCCATCTGTTCAGGCAAATGCAGCGGCCTACCTGCAGCACCTGTGCTTTGGTGACAACAAAGTGAAGATGGAGGTGTGTAG
GTTAGGGGGAATCAAGCATCTGGTTGACCTTCTGGACCACAGAGTTTTGGAAGTTCAGAAGAATGCTTGTGGTGCCCTTCGAAACCTCGT
TTTTGGCAAGTCTACAGATGAAAATAAAATAGCAATGAAGAATGTTGGTGGGATACCTGCCTTGTTGCGACTGTTGAGAAAATCTATTGA
TGCAGAAGTAAGGGAGCTTGTTACAGGAGTTCTTTGGAATTTATCCTCATGTGATGCTGTAAAAATGACAATCATTCGAGATGCTCTCTC
AACCTTAACAAACACTGTGATTGTTCCACATTCTGGATGGAATAACTCTTCTTTTGATGATGATCATAAAATTAAATTTCAGACTTCACT
AGTTCTGCGTAACACGACAGGTTGCCTAAGGAACCTCAGCTCCGCGGGGGAAGAAGCTCGGAAGCAAATGCGGTCCTGCGAGGGGCTGGT
AGACTCACTGTTGTATGTGATCCACACGTGTGTGAACACATCCGATTACGACAGCAAGACGGTGGAGAACTGCGTGTGCACCCTGAGGAA
CCTGTCCTATCGGCTGGAGCTGGAGGTGCCCCAGGCCCGGTTACTGGGACTGAACGAATTGGATGACTTACTAGGAAAAGAGTCTCCCAG
CAAAGACTCTGAGCCAAGTTGCTGGGGGAAGAAGAAGAAAAAGAAAAAGAGGACTCCGCAAGAAGATCAATGGGATGGAGTTGGTCCTAT
CCCAGGACTGTCGAAGTCCCCCAAAGGGGTTGAGATGCTGTGGCACCCATCGGTGGTAAAACCATATCTGACTCTTCTAGCAGAAAGTTC
CAACCCAGCCACCTTGGAAGGCTCTGCAGGGTCTCTCCAGAACCTCTCTGCTGGCAACTGGAAGTTTGCAGCATATATCCGGGCGGCCGT
CCGAAAAGAAAAGGGGCTCCCCATCCTTGTGGAGCTTCTGAGAATGGATAACGATAGAGTTGTTTCTTCCGTGGCAACAGCCTTGAGGAA
TATGGCACTAGATGTTCGCAACAAGGAGCTCATAGGCAAATACGCCATGCGAGACCTGGTCAACCGGCTCCCCGGCGGCAATGGCCCCAG
TGTCTTGTCTGATGAGACCATGGCAGCCATCTGCTGTGCTCTGCACGAGGTCACCAGCAAAAACATGGAGAACGCAAAAGCCCTGGCCGA
CTCAGGAGGCATAGAGAAGCTGGTGAACATAACCAAAGGCAGGGGCGACAGATCATCTCTGAAAGTGGTGAAGGCAGCAGCCCAGGTCTT
GAATACATTATGGCAATATCGGGACCTCCGGAGCATTTATAAAAAGGATGGGTGGAATCAGAACCATTTTATTACACCTGTGTCGACATT
GGAGCGAGACCGATTCAAATCACATCCTTCCTTGTCTACCACCAACCAACAGATGTCACCCATCATTCAGTCAGGCTCCAGCAAACCTTC
ACCAATTTACATCAGTTCCTATTCCTCACCAGCAAGAGAACAAAATAGACGGCTACAGCATCAACAGCTGTATTATAGTCAAGATGACTC
CAACAGAAAGAACTTTGATGCATACAGATTGTATTTGCAGTCTCCTCATAGCTATGAAGATCCTTATTTTGATGACCGAGTTCACTTTCC
AGCTTCTACTGATTACTCAACACAGTATGGACTGAAATCGACCACAAATTATGTAGACTTTTATTCCACTAAACGACCTTCTTATAGAGC
AGAACAGTACCCAGGGTCCCCAGACTCATGGGTGTAGCATCAAGATGCCCAACAGAGGAACTCTTTCTTTCTAACCTTGTTCAGATTGAG
GTGAAAAGTCCATCTTGCTGATTTGATGATTGAAATGTGAAAGTGAAGTGGAAGGAATGAATGAAGTGTGTTTTTTTTTTCTTTTTTGAG
GAATTATCAGGGAAGTGAGGAAATGTTTGGGAGAGGACTTTCTAAGCTCTATTTAGGTGTTAGATCTAATTACTTATAGATTCTGTAGTC
TGGTGAAGGTGTGGGTGACGTGATGAGAGGTTTGAGAAATGGGTGAAATGAAATGGGGGATATGTAGGTCAAATCAAATTAAAGATGATT
TTTTTAATGTGAATAAAGTTATGTTCTGATAGTTTGTACAGAAAAAATAAAATGGATGCCCATGTTTTATTGCTATTACTAAATGTCAAG
ATTGTATGCTATTATGTCTTGTAAATTTCTTTTGTTGGTGTAAATATGGAAATGCCACATTGGTTAAGTGCCATCATTTGTAATGCAATG
TGTCACTTGAAAAGAGATTTGAAGAAACTGACAACTTCAAAAACAAATGAGAAGCCCAAGGAACTGTGAGCAATTAAAAGCAAACCGCGA
CACCCTTTGTCTCCACCACACATAGTGTACTTTGGAAGCACAACGTCCAGGCTGGTACCGCAGCGCCATGCCCATTCCTCGCCTCATTCA
TAGGACACTTCACTGCCATTTTCTATTCACATAAAAGAAAAATAAATGTGGAAATTTCATCCTTGGAAATGTGTTCTCTATTCTTTTTAA
ATTTTCTGTATGTATATGGAAAAGCAGATTTAATAAACACAAATCTAACTGCATGTTGGAAACAGCTTAAATATATCCAACTATTTTGGA
CTTTTCCTAGTTTTATATATACTTTGTGAGATGGACCACACCAAAGAATTAAGCTATGAACTTTCCTTCTCTGACAATCTAGGTATTTTT
ATTTCTTGGAATCTATTTCAAATCTAGTACCCTTCACAGTTTTATCATGTTTTTCTATATGGCAGCCCGGCCATAACTAAATTGATATAA
CTAAATTGATACCTTTTTTAGGCATCATTTTTAAGAACTCAGTAACTTCATAAGAACAGTGGTTGGGCATTTATTTGGTTGGCACCAATA
TAATAAAACAAATTCTAAATTATGATCTTGTTGAGTAATTCCTGGATTAGGAATGGGGTAGGGGAAAGAAGCCCCACAGTAGGGTAGGCA
GAGGTCCCCATAGCACCACACAGCATTACTTGGGAGCTTTTCTCCATCTGTGGAGGCAATGAGGGACACCAGGCCCAAAGTCTGAGGCCT
TCTCAGGTCTTGCAGTTTCCACCCTTCAGTCCTAGCTCCTGGATCTTTGTTGGTGTAAATATGGAAATGCCACATTGGTTAAGTGCCATC
ATGGAAAACTAGGAACTAGTTTACCTGTAGCTGTACTAGGAAAACTAGGAACTAACGCCCATGGCAAGTCAAACCCAGAAGTGTTGTCTT
AGCTCAATATACCCTGCAAGTGTAATACAAATCGCAGATTAAAACATTTGTACTTATATTTGTGAGTATATGGAAAGACACCCAAGAGAA
AACTGCGTTTTCTTGCCGAATGGACAGCAGCCCAAGATGAGCATCCAAAGTGAAATGGAAATACCAGTGCGAGCATGAAAGTTCCTAAGA
TCTGCAAGCCGGGAGCCCTTCTGGGGTGCCTTCTGAAGAACACTCGAGCCAGCTTGCGGAGTCGACTGCCCAGCTACTGAGCCCTGGCTG
CCTCATACCTGGCGGAGCTCAGCAGACAGCCCTTGTTAGATGGCAGAGGTCGCCAGATGTTCCTGAGGAACTGGCATCAGGACTTCTGGG
ACGTAACCACTTCAGTTTCCAATCACTGACTTCCCTGAACCTGGTATCTTCAGATCTGGGTTGAAGCTGAAGGCTCAGTTGGGGAATCAG
GAAAACAAGGGAATCGGAGCCAGCCCAAGCCAACTGACAATAACTTTCTTTTAGTGACTGCATTAGTTTGGGGTAATGCTGTAACAGGCA
GCATTTCAAAAACGGATCCAAATTTGGGTTGAAAATAACTCACCTGGGTCTCCAATCAGACCTAACAGAGCCAGAAAGAGGCCTGTACAC

>53441_53441_11_MGST1-PKP4_MGST1_chr12_16507204_ENST00000540056_PKP4_chr2_159433783_ENST00000389757_length(amino acids)=1152AA_BP=39
MKGNCISVPVRVMDDEVFMAFASYATIILSKMMLMSTATAFYRLTRKELQFQRLTRELEVERQIVASQLERCRLGAESPSIASTSSTEKS
FPWRSTDVPNTGVSKPRVSDAVQPNNYLIRTEPEQGTLYSPEQTSLHESEGSLGNSRSSTQMNSYSDSGYQEAGSFHNSQNVSKADNRQQ
HSFIGSTNNHVVRNSRAEGQTLVQPSVANRAMRRVSSVPSRAQSPSYVISTGVSPSRGSLRTSLGSGFGSPSVTDPRPLNPSAYSSTTLP
AARAASPYSQRPASPTAIRRIGSVTSRQTSNPNGPTPQYQTTARVGSPLTLTDAQTRVASPSQGQVGSSSPKRSGMTAVPQHLGPSLQRT
VHDMEQFGQQQYDIYERMVPPRPDSLTGLRSSYASQHSQLGQDLRSAVSPDLHITPIYEGRTYYSPVYRSPNHGTVELQGSQTALYRTGS
VGIGNLQRTSSQRSTLTYQRNNYALNTTATYAEPYRPIQYRVQECNYNRLQHAVPADDGTTRSPSIDSIQKDPREFAWRDPELPEVIHML
QHQFPSVQANAAAYLQHLCFGDNKVKMEVCRLGGIKHLVDLLDHRVLEVQKNACGALRNLVFGKSTDENKIAMKNVGGIPALLRLLRKSI
DAEVRELVTGVLWNLSSCDAVKMTIIRDALSTLTNTVIVPHSGWNNSSFDDDHKIKFQTSLVLRNTTGCLRNLSSAGEEARKQMRSCEGL
VDSLLYVIHTCVNTSDYDSKTVENCVCTLRNLSYRLELEVPQARLLGLNELDDLLGKESPSKDSEPSCWGKKKKKKKRTPQEDQWDGVGP
IPGLSKSPKGVEMLWHPSVVKPYLTLLAESSNPATLEGSAGSLQNLSAGNWKFAAYIRAAVRKEKGLPILVELLRMDNDRVVSSVATALR
NMALDVRNKELIGKYAMRDLVNRLPGGNGPSVLSDETMAAICCALHEVTSKNMENAKALADSGGIEKLVNITKGRGDRSSLKVVKAAAQV
LNTLWQYRDLRSIYKKDGWNQNHFITPVSTLERDRFKSHPSLSTTNQQMSPIIQSGSSKPSPIYISSYSSPAREQNRRLQHQQLYYSQDD

--------------------------------------------------------------
>53441_53441_12_MGST1-PKP4_MGST1_chr12_16507204_ENST00000540056_PKP4_chr2_159433783_ENST00000389759_length(transcript)=4428nt_BP=229nt
GGCAGATGGAAGACTTGGGGGGGTCTCTGCCAGCTGGAAGTGCTTGGCTCCACTTAGCAGCTAAACTTAGCTTTTCAATCGATCGCTTTT
GAAAGGGAATTGTATTTCTGTCCCCGTGCGGGTAATGGATGATGAAGTATTCATGGCTTTTGCATCCTATGCAACAATTATTCTTTCAAA
AATGATGCTTATGAGTACTGCAACTGCATTCTATAGATTGACAAGAAAGGAGCTTCAGTTTCAGCGACTCACCCGAGAACTGGAAGTGGA
AAGGCAGATTGTTGCCAGTCAGCTAGAAAGATGTAGGCTTGGAGCAGAATCACCAAGCATCGCCAGCACCAGCTCAACTGAGAAGTCATT
TCCTTGGAGATCAACAGACGTGCCAAATACTGGTGTAAGCAAACCTAGAGTTTCTGACGCTGTCCAGCCCAACAACTATCTCATCAGGAC
AGAGCCAGAACAAGGAACCCTCTATTCACCAGAACAGACATCTCTCCATGAAAGTGAGGGATCATTGGGTAACTCAAGAAGTTCAACACA
AATGAATTCTTATTCCGACAGTGGATACCAGGAAGCAGGGAGTTTCCACAACAGCCAGAACGTGAGCAAGGCAGACAACAGACAGCAGCA
TTCATTCATAGGATCAACTAACAACCATGTGGTGAGGAATTCAAGAGCTGAAGGACAAACACTGGTTCAGCCATCAGTAGCCAATCGGGC
CATGAGAAGAGTTAGTTCAGTTCCATCTAGAGCACAGTCTCCTTCTTATGTTATCAGCACAGGCGTGTCTCCTTCAAGGGGGTCTCTGAG
AACTTCTCTGGGTAGTGGATTTGGCTCTCCGTCAGTGACCGACCCCCGACCTCTGAACCCCAGTGCATATTCCTCCACCACATTACCTGC
TGCACGGGCAGCCTCTCCGTACTCACAGAGACCCGCCTCCCCAACAGCTATACGGCGGATTGGGTCAGTCACCTCCCGGCAGACCTCCAA
TCCCAACGGACCAACCCCTCAATACCAAACCACCGCCAGAGTGGGGTCCCCACTGACCCTGACGGATGCACAGACTCGAGTAGCTTCCCC
ATCCCAAGGCCAGGTGGGGTCGTCGTCCCCCAAACGCTCAGGGATGACCGCCGTACCACAGCATCTGGGACCTTCACTGCAAAGGACTGT
TCATGACATGGAGCAATTCGGACAGCAGCAGTATGACATTTATGAGAGGATGGTTCCACCCAGGCCAGACAGCCTGACAGGCTTACGGAG
TTCCTATGCTAGTCAGCATAGTCAGCTTGGGCAAGACCTTCGTTCTGCCGTGTCTCCCGACTTGCACATTACTCCTATATATGAGGGGAG
GACCTATTACAGCCCAGTGTACCGCAGCCCAAACCATGGAACTGTGGAGCTCCAAGGATCGCAGACGGCGTTGTATCGCACAGGTTCAGT
AGGTATTGGAAATCTACAAAGGACATCCAGCCAACGAAGTACCCTTACATACCAAAGAAATAATTATGCTCTGAACACAACAGCTACCTA
CGCGGAGCCCTACAGGCCTATACAATACCGAGTGCAAGAGTGCAATTATAACAGGCTTCAGCATGCAGTGCCGGCTGATGATGGCACCAC
AAGATCCCCATCAATAGACAGCATTCAGAAGGACCCCAGGGAGTTTGCCTGGCGTGATCCTGAGTTGCCTGAGGTCATTCACATGCTTCA
GCACCAGTTCCCATCTGTTCAGGCAAATGCAGCGGCCTACCTGCAGCACCTGTGCTTTGGTGACAACAAAGTGAAGATGGAGGTGTGTAG
GTTAGGGGGAATCAAGCATCTGGTTGACCTTCTGGACCACAGAGTTTTGGAAGTTCAGAAGAATGCTTGTGGTGCCCTTCGAAACCTCGT
TTTTGGCAAGTCTACAGATGAAAATAAAATAGCAATGAAGAATGTTGGTGGGATACCTGCCTTGTTGCGACTGTTGAGAAAATCTATTGA
TGCAGAAGTAAGGGAGCTTGTTACAGGAGTTCTTTGGAATTTATCCTCATGTGATGCTGTAAAAATGACAATCATTCGAGATGCTCTCTC
AACCTTAACAAACACTGTGATTGTTCCACATTCTGGATGGAATAACTCTTCTTTTGATGATGATCATAAAATTAAATTTCAGACTTCACT
AGTTCTGCGTAACACGACAGGTTGCCTAAGGAACCTCAGCTCCGCGGGGGAAGAAGCTCGGAAGCAAATGCGGTCCTGCGAGGGGCTGGT
AGACTCACTGTTGTATGTGATCCACACGTGTGTGAACACATCCGATTACGACAGCAAGACGGTGGAGAACTGCGTGTGCACCCTGAGGAA
CCTGTCCTATCGGCTGGAGCTGGAGGTGCCCCAGGCCCGGTTACTGGGACTGAACGAATTGGATGACTTACTAGGAAAAGAGTCTCCCAG
CAAAGACTCTGAGCCAAGTTGCTGGGGGAAGAAGAAGAAAAAGAAAAAGAGGACTCCGCAAGAAGATCAATGGGATGGAGTTGGTCCTAT
CCCAGGACTGTCGAAGTCCCCCAAAGGGGTTGAGATGCTGTGGCACCCATCGGTGGTAAAACCATATCTGACTCTTCTAGCAGAAAGTTC
CAACCCAGCCACCTTGGAAGGCTCTGCAGGGTCTCTCCAGAACCTCTCTGCTGGCAACTGGAAGTTTGCAGCATATATCCGGGCGGCCGT
CCGAAAAGAAAAGGGGCTCCCCATCCTTGTGGAGCTTCTGAGAATGGATAACGATAGAGTTGTTTCTTCCGTGGCAACAGCCTTGAGGAA
TATGGCACTAGATGTTCGCAACAAGGAGCTCATAGGCAAATACGCCATGCGAGACCTGGTCAACCGGCTCCCCGGCGGCAATGGCCCCAG
TGTCTTGTCTGATGAGACCATGGCAGCCATCTGCTGTGCTCTGCACGAGGTCACCAGCAAAAACATGGAGAACGCAAAAGCCCTGGCCGA
CTCAGGAGGCATAGAGAAGCTGGTGAACATAACCAAAGGCAGGGGCGACAGATCATCTCTGAAAGTGGTGAAGGCAGCAGCCCAGGTCTT
GAATACATTATGGCAATATCGGGACCTCCGGAGCATTTATAAAAAGGATGGGTGGAATCAGAACCATTTTATTACACCTGTGTCGACATT
GGAGCGAGACCGATTCAAATCACATCCTTCCTTGTCTACCACCAACCAACAGATGTCACCCATCATTCAGTCAGTCGGCAGCACCTCTTC
CTCACCAGCACTGTTAGGAATCAGAGACCCTCGCTCTGAATACGATAGGACCCAGCCACCTATGCAGTATTACAATAGCCAAGGGGATGC
CACACATAAAGGCCTGTACCCTGGCTCCAGCAAACCTTCACCAATTTACATCAGTTCCTATTCCTCACCAGCAAGAGAACAAAATAGACG
GCTACAGCATCAACAGCTGTATTATAGTCAAGATGACTCCAACAGAAAGAACTTTGATGCATACAGATTGTATTTGCAGTCTCCTCATAG
CTATGAAGATCCTTATTTTGATGACCGAGTTCACTTTCCAGCTTCTACTGATTACTCAACACAGTATGGACTGAAATCGACCACAAATTA
TGTAGACTTTTATTCCACTAAACGACCTTCTTATAGAGCAGAACAGTACCCAGGGTCCCCAGACTCATGGGTGTAGCATCAAGATGCCCA
ACAGAGGAACTCTTTCTTTCTAACCTTGTTCAGATTGAGGTGAAAAGTCCATCTTGCTGATTTGATGATTGAAATGTGAAAGTGAAGTGG
AAGGAATGAATGAAGTGTGTTTTTTTTTTCTTTTTTGAGGAATTATCAGGGAAGTGAGGAAATGTTTGGGAGAGGACTTTCTAAGCTCTA
TTTAGGTGTTAGATCTAATTACTTATAGATTCTGTAGTCTGGTGAAGGTGTGGGTGACGTGATGAGAGGTTTGAGAAATGGGTGAAATGA
AATGGGGGATATGTAGGTCAAATCAAATTAAAGATGATTTTTTTAATGTGAATAAAGTTATGTTCTGATAGTTTGTACAGAAAAAATAAA
ATGGATGCCCATGTTTTATTGCTATTACTAAATGTCAAGATTGTATGCTATTATGTCTTGTAAATTTCTTTTGTTGGTGTAAATATGGAA
ATGCCACATTGGTTAAGTGCCATCATTTGTAATGCAATGTGTCACTTGAAAAGAGATTTGAAGAAACTGACAACTTCAAAAACAAATGAG
AAGCCCAAGGAACTGTGAGCAATTAAAAGCAAACCGCGACACCCTTTGTCTCCACCACACATAGTGTACTTTGGAAGCACAACGTCCAGG
CTGGTACCGCAGCGCCATGCCCATTCCTCGCCTCATTCATAGGACACTTCACTGCCATTTTCTATTCACATAAAAGAAAAATAAATGTGG

>53441_53441_12_MGST1-PKP4_MGST1_chr12_16507204_ENST00000540056_PKP4_chr2_159433783_ENST00000389759_length(amino acids)=1195AA_BP=39
MKGNCISVPVRVMDDEVFMAFASYATIILSKMMLMSTATAFYRLTRKELQFQRLTRELEVERQIVASQLERCRLGAESPSIASTSSTEKS
FPWRSTDVPNTGVSKPRVSDAVQPNNYLIRTEPEQGTLYSPEQTSLHESEGSLGNSRSSTQMNSYSDSGYQEAGSFHNSQNVSKADNRQQ
HSFIGSTNNHVVRNSRAEGQTLVQPSVANRAMRRVSSVPSRAQSPSYVISTGVSPSRGSLRTSLGSGFGSPSVTDPRPLNPSAYSSTTLP
AARAASPYSQRPASPTAIRRIGSVTSRQTSNPNGPTPQYQTTARVGSPLTLTDAQTRVASPSQGQVGSSSPKRSGMTAVPQHLGPSLQRT
VHDMEQFGQQQYDIYERMVPPRPDSLTGLRSSYASQHSQLGQDLRSAVSPDLHITPIYEGRTYYSPVYRSPNHGTVELQGSQTALYRTGS
VGIGNLQRTSSQRSTLTYQRNNYALNTTATYAEPYRPIQYRVQECNYNRLQHAVPADDGTTRSPSIDSIQKDPREFAWRDPELPEVIHML
QHQFPSVQANAAAYLQHLCFGDNKVKMEVCRLGGIKHLVDLLDHRVLEVQKNACGALRNLVFGKSTDENKIAMKNVGGIPALLRLLRKSI
DAEVRELVTGVLWNLSSCDAVKMTIIRDALSTLTNTVIVPHSGWNNSSFDDDHKIKFQTSLVLRNTTGCLRNLSSAGEEARKQMRSCEGL
VDSLLYVIHTCVNTSDYDSKTVENCVCTLRNLSYRLELEVPQARLLGLNELDDLLGKESPSKDSEPSCWGKKKKKKKRTPQEDQWDGVGP
IPGLSKSPKGVEMLWHPSVVKPYLTLLAESSNPATLEGSAGSLQNLSAGNWKFAAYIRAAVRKEKGLPILVELLRMDNDRVVSSVATALR
NMALDVRNKELIGKYAMRDLVNRLPGGNGPSVLSDETMAAICCALHEVTSKNMENAKALADSGGIEKLVNITKGRGDRSSLKVVKAAAQV
LNTLWQYRDLRSIYKKDGWNQNHFITPVSTLERDRFKSHPSLSTTNQQMSPIIQSVGSTSSSPALLGIRDPRSEYDRTQPPMQYYNSQGD
ATHKGLYPGSSKPSPIYISSYSSPAREQNRRLQHQQLYYSQDDSNRKNFDAYRLYLQSPHSYEDPYFDDRVHFPASTDYSTQYGLKSTTN

--------------------------------------------------------------
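
Each sequence record above starts with a header line that packs the fusion ID, both partner genes, their chromosomes, breakpoint coordinates, Ensembl transcript IDs, the sequence length, and the breakpoint offset into one underscore-separated string. The following is a minimal sketch of how such a header could be split into named fields. The field names, the `HEADER_RE` pattern, and the `parse_header` function are illustrative assumptions derived from the headers shown on this page, not an official FusionGDB format specification or API.

```python
import re

# Hypothetical pattern inferred from the headers on this page.
# The second numeric field repeats the fusion ID here and is skipped.
HEADER_RE = re.compile(
    r"^>(?P<fusion_id>\d+)_\d+_(?P<index>\d+)_"
    r"(?P<fusion>[^_]+-[^_]+)_"
    r"(?P<hgene>[^_]+)_(?P<hchr>chr[^_]+)_(?P<hbp>\d+)_(?P<henst>ENST\d+)_"
    r"(?P<tgene>[^_]+)_(?P<tchr>chr[^_]+)_(?P<tbp>\d+)_(?P<tenst>ENST\d+)_"
    r"length\((?P<kind>transcript|amino acids)\)=(?P<length>\d+)(?:nt|AA)_"
    r"BP=(?P<bp>\d+)(?:nt)?$"
)

# Fields that hold integers (illustrative names, see above).
INT_FIELDS = ("fusion_id", "index", "hbp", "tbp", "length", "bp")


def parse_header(line: str) -> dict:
    """Split one header line into a dict of named fields.

    Raises ValueError when the line does not follow the assumed layout.
    """
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    for name in INT_FIELDS:
        fields[name] = int(fields[name])
    return fields


if __name__ == "__main__":
    example = (
        ">53441_53441_8_MGST1-PKP4_MGST1_chr12_16507204_ENST00000396210_"
        "PKP4_chr2_159433783_ENST00000389759_length(transcript)=4369nt_BP=170nt"
    )
    for key, value in parse_header(example).items():
        print(f"{key}: {value}")
```

The same pattern accepts both the transcript headers (`length(transcript)=...nt_BP=...nt`) and the protein headers (`length(amino acids)=...AA_BP=...`); the `kind` field records which of the two was seen.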


Fusion Gene PPI Analysis for MGST1-PKP4


Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in the ChiPPI page.


Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
Hgene | Hgene's interactors | Tgene | Tgene's interactors


- Retained PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Still interaction with


- Lost PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with


- Retained PPIs, but lost function due to frame-shift fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with



Related Drugs for MGST1-PKP4


Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
Partner | Gene | UniProtAcc | DrugBank ID | Drug name | Drug activity | Drug type | Drug status


Related Diseases for MGST1-PKP4


Diseases associated with fusion partners.
(DisGeNet 4.0)
Partner | Gene | Disease ID | Disease name | # pubmeds | Source