Fusion Gene Studies
in Kim Lab

Center for Computational Systems Medicine

Fusion gene: ALG10B-MPRIP (FusionGDB2 ID: HG144245TG23164)

Fusion Gene Summary for ALG10B-MPRIP

Fusion gene summary

Fusion gene information
  Fusion gene name: ALG10B-MPRIP
  Fusion gene ID: hg144245tg23164
Gene symbol
  Hgene: ALG10B
  Tgene: MPRIP
Gene ID
  Hgene: 144245
  Tgene: 23164
Gene name
  Hgene: ALG10 alpha-1,2-glucosyltransferase B
  Tgene: myosin phosphatase Rho interacting protein
Synonyms
  Hgene: ALG10|KCR1
  Tgene: M-RIP|MRIP|RHOIP3|RIP3|p116Rip
Cytomap
  Hgene: 12q12
  Tgene: 17p11.2
Type of gene
  Hgene: protein-coding
  Tgene: protein-coding
Description
  Hgene: putative Dol-P-Glc:Glc(2)Man(9)GlcNAc(2)-PP-Dol alpha-1,2-glucosyltransferase; ALG10B, alpha-1,2-glucosyltransferase; alpha-1,2-glucosyltransferase ALG10-A; alpha-2-glucosyltransferase ALG10-B; asparagine-linked glycosylation 10 homolog B (yeast, alpha-1,2-gl
  Tgene: myosin phosphatase Rho-interacting protein; Rho interacting protein 3
Modification date
  Hgene: 20200313
  Tgene: 20200313
UniProtAcc
  Hgene: .
  Tgene: .
Ensembl transcripts involved in fusion gene: ENST00000308742, ENST00000551464
Fusion gene scores
  * DoF score
    Hgene: 2 X 1 X 2=4
    Tgene: 9 X 7 X 6=378
  # samples
    Hgene: 3
    Tgene: 9
  ** MAII score
    Hgene: log2(3/4*10)=2.90689059560852
    Tgene: log2(9/378*10)=-2.0703893278914
Possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs): DoF>8 and MAII<0
Context

PubMed: ALG10B [Title/Abstract] AND MPRIP [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpoint: ALG10B(38712260)-MPRIP(16979024), # samples: 1
Anticipated loss of major functional domain due to fusion event:
ALG10B-MPRIP appears to have lost the major protein functional domain of the Hgene partner, which is a CGC gene, because the partially deleted in-frame ORF does not retain the major functional domain.
ALG10B-MPRIP appears to have lost the major protein functional domain of the Hgene partner, which is an essential gene, because the partially deleted in-frame ORF does not retain the major functional domain.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)
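
The two score definitions above can be checked against the numbers reported for this fusion. The following is a minimal sketch, assuming the MAII expression is evaluated left to right as (# samples / DoF score) * 10, which is what the reported values imply. The function names are illustrative and not part of FusionGDB2.

```python
import math


def dof_score(n_partners: int, n_breakpoints: int, n_cancer_types: int) -> int:
    """Degree of Frequency: # partners X # break points X # cancer types."""
    return n_partners * n_breakpoints * n_cancer_types


def maii_score(n_samples: int, dof: int) -> float:
    """Major Active Isofusion Index: log2(# samples / DoF score * 10)."""
    return math.log2(n_samples / dof * 10)


def meets_peginpcfg_criterion(dof: int, maii: float) -> bool:
    """Criterion quoted on this page: DoF > 8 and MAII < 0."""
    return dof > 8 and maii < 0


# Values reported on this page for the two partner genes.
hgene_dof = dof_score(2, 1, 2)          # ALG10B
tgene_dof = dof_score(9, 7, 6)          # MPRIP
hgene_maii = maii_score(3, hgene_dof)
tgene_maii = maii_score(9, tgene_dof)

print("ALG10B", hgene_dof, hgene_maii, meets_peginpcfg_criterion(hgene_dof, hgene_maii))
print("MPRIP", tgene_dof, tgene_maii, meets_peginpcfg_criterion(tgene_dof, tgene_maii))
```

Under these definitions only MPRIP (DoF 378, negative MAII) satisfies the DoF>8 and MAII<0 criterion; ALG10B (DoF 4) does not.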

Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
Partner | Gene | GO ID | GO term | PubMed ID

Fusion gene breakpoints across ALG10B (5'-gene)
[Figure: gene structure with fusion breakpoints]
Fusion gene breakpoints across MPRIP (3'-gene)
[Figure: gene structure with fusion breakpoints]

Fusion gene information
* All genome coordinates were lifted over to hg19.
Source | Disease | Sample | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
ChimerDB4 | STAD | TCGA-HU-A4GN-01A | ALG10B | chr12 | 38712260 | + | MPRIP | chr17 | 16979024 | +


Fusion Gene ORF analysis for ALG10B-MPRIP

Open reading frame (ORF) analysis of fusion genes based on Ensembl gene isoform structure.
ORF | Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
5CDS-3UTR | ENST00000308742 | ENST00000395807 | ALG10B | chr12 | 38712260 | + | MPRIP | chr17 | 16979024 | +
5CDS-3UTR | ENST00000551464 | ENST00000395807 | ALG10B | chr12 | 38712260 | + | MPRIP | chr17 | 16979024 | +
5CDS-intron | ENST00000308742 | ENST00000395806 | ALG10B | chr12 | 38712260 | + | MPRIP | chr17 | 16979024 | +
5CDS-intron | ENST00000551464 | ENST00000395806 | ALG10B | chr12 | 38712260 | + | MPRIP | chr17 | 16979024 | +
In-frame | ENST00000308742 | ENST00000341712 | ALG10B | chr12 | 38712260 | + | MPRIP | chr17 | 16979024 | +
In-frame | ENST00000308742 | ENST00000395804 | ALG10B | chr12 | 38712260 | + | MPRIP | chr17 | 16979024 | +
In-frame | ENST00000308742 | ENST00000395811 | ALG10B | chr12 | 38712260 | + | MPRIP | chr17 | 16979024 | +
In-frame | ENST00000308742 | ENST00000444976 | ALG10B | chr12 | 38712260 | + | MPRIP | chr17 | 16979024 | +
In-frame | ENST00000551464 | ENST00000341712 | ALG10B | chr12 | 38712260 | + | MPRIP | chr17 | 16979024 | +
In-frame | ENST00000551464 | ENST00000395804 | ALG10B | chr12 | 38712260 | + | MPRIP | chr17 | 16979024 | +
In-frame | ENST00000551464 | ENST00000395811 | ALG10B | chr12 | 38712260 | + | MPRIP | chr17 | 16979024 | +
In-frame | ENST00000551464 | ENST00000444976 | ALG10B | chr12 | 38712260 | + | MPRIP | chr17 | 16979024 | +

ORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
Henst | Hgene | Hchr | Hbp | Hstrand | Tenst | Tgene | Tchr | Tbp | Tstrand | Seq length (transcript) | BP loci (transcript) | Predicted start (transcript) | Predicted stop (transcript) | Seq length (amino acids)
ENST00000308742 | ALG10B | chr12 | 38712260 | + | ENST00000395811 | MPRIP | chr17 | 16979024 | + | 11433 | 685 | 316 | 3678 | 1120
ENST00000308742 | ALG10B | chr12 | 38712260 | + | ENST00000444976 | MPRIP | chr17 | 16979024 | + | 3726 | 685 | 316 | 3564 | 1082
ENST00000308742 | ALG10B | chr12 | 38712260 | + | ENST00000395804 | MPRIP | chr17 | 16979024 | + | 3640 | 685 | 316 | 3639 | 1108
ENST00000308742 | ALG10B | chr12 | 38712260 | + | ENST00000341712 | MPRIP | chr17 | 16979024 | + | 4408 | 685 | 316 | 3639 | 1107
ENST00000551464 | ALG10B | chr12 | 38712260 | + | ENST00000395811 | MPRIP | chr17 | 16979024 | + | 11249 | 501 | 132 | 3494 | 1120
ENST00000551464 | ALG10B | chr12 | 38712260 | + | ENST00000444976 | MPRIP | chr17 | 16979024 | + | 3542 | 501 | 132 | 3380 | 1082
ENST00000551464 | ALG10B | chr12 | 38712260 | + | ENST00000395804 | MPRIP | chr17 | 16979024 | + | 3456 | 501 | 132 | 3455 | 1107
ENST00000551464 | ALG10B | chr12 | 38712260 | + | ENST00000341712 | MPRIP | chr17 | 16979024 | + | 4224 | 501 | 132 | 3455 | 1107

DeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on a convolutional neural network, built by comparison with real Ribo-seq data. If the no-coding score is < 0.5 and the coding score is > 0.5, the in-frame fusion transcript is predicted as likely to be translated.
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | No-coding score | Coding score
ENST00000308742 | ENST00000395811 | ALG10B | chr12 | 38712260 | + | MPRIP | chr17 | 16979024 | + | 0.000628635 | 0.99937135
ENST00000308742 | ENST00000444976 | ALG10B | chr12 | 38712260 | + | MPRIP | chr17 | 16979024 | + | 0.007984843 | 0.9920151
ENST00000308742 | ENST00000395804 | ALG10B | chr12 | 38712260 | + | MPRIP | chr17 | 16979024 | + | 0.006885788 | 0.9931142
ENST00000308742 | ENST00000341712 | ALG10B | chr12 | 38712260 | + | MPRIP | chr17 | 16979024 | + | 0.003729626 | 0.9962704
ENST00000551464 | ENST00000395811 | ALG10B | chr12 | 38712260 | + | MPRIP | chr17 | 16979024 | + | 0.000538373 | 0.9994616
ENST00000551464 | ENST00000444976 | ALG10B | chr12 | 38712260 | + | MPRIP | chr17 | 16979024 | + | 0.008339689 | 0.9916603
ENST00000551464 | ENST00000395804 | ALG10B | chr12 | 38712260 | + | MPRIP | chr17 | 16979024 | + | 0.007127625 | 0.99287236
ENST00000551464 | ENST00000341712 | ALG10B | chr12 | 38712260 | + | MPRIP | chr17 | 16979024 | + | 0.00368469 | 0.9963153
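
The decision rule stated for DeepORF (no-coding score < 0.5 and coding score > 0.5) can be applied to the scores listed above. This is a minimal sketch of that thresholding step only; it does not run DeepORF itself. The function name and the dictionary layout are illustrative assumptions, and only three of the eight rows are copied in.

```python
def likely_translated(no_coding: float, coding: float, threshold: float = 0.5) -> bool:
    """Apply the rule quoted on this page: no-coding < 0.5 and coding > 0.5."""
    return no_coding < threshold and coding > threshold


# (Henst, Tenst) -> (no-coding score, coding score), copied from the table.
scores = {
    ("ENST00000308742", "ENST00000395811"): (0.000628635, 0.99937135),
    ("ENST00000308742", "ENST00000444976"): (0.007984843, 0.9920151),
    ("ENST00000551464", "ENST00000341712"): (0.00368469, 0.9963153),
}

calls = {pair: likely_translated(*pair_scores) for pair, pair_scores in scores.items()}

for (henst, tenst), call in calls.items():
    print(henst, tenst, "likely translated" if call else "not predicted as translated")
```

All of the listed in-frame transcript pairs have coding scores above 0.99, so each passes the rule.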

Fusion Genomic Features for ALG10B-MPRIP


FusionAI prediction of the potential fusion gene breakpoint based on the pre-mRNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on a convolutional neural network, built by comparing the fusion-positive and fusion-negative sequence contexts of ~20K fusion genes. It gives the relative potential of each position in the 20kb genomic sequence to be used as a gene fusion breakpoint.
Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | 1-p | p (fusion gene breakpoint)
ALG10B | chr12 | 38712260 | + | MPRIP | chr17 | 16979023 | + | 0.0033387 | 0.9966613
ALG10B | chr12 | 38712260 | + | MPRIP | chr17 | 16979023 | + | 0.0033387 | 0.9966613
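
To make the "+/- 5kb per partner, 20kb total" arithmetic concrete, the sketch below computes a flanking window around each breakpoint in the table. It assumes the windows are centred on the two breakpoints and that each spans 10,000 bases; the exact coordinate convention FusionAI uses is not stated here, so treat the boundaries as illustrative. `breakpoint_window` is a hypothetical helper.

```python
FLANK = 5_000  # +/- 5 kb on each side of a breakpoint


def breakpoint_window(chrom: str, bp: int, flank: int = FLANK) -> tuple:
    """Return (chrom, start, end) for a window of 2 * flank bases around bp.

    Assumption: the window is centred on the breakpoint and clipped at
    position 1; this is one plausible convention, not FusionAI's confirmed one.
    """
    start = max(1, bp - flank)
    end = bp + flank
    return chrom, start, end


# Breakpoints as listed in the FusionAI table on this page.
windows = [
    breakpoint_window("chr12", 38712260),  # ALG10B (Hgene)
    breakpoint_window("chr17", 16979023),  # MPRIP (Tgene)
]

total_length = sum(end - start for _, start, end in windows)
print(windows, total_length)
```

Two 10kb windows give the 20kb total sequence context mentioned in the text.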

Distribution of 44 human genomic feature loci across the 20kb fusion breakpoint regions. We integrated 44 types of human genomic feature loci in five broad categories: virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are on the help page.
[Figure: genomic feature]

Distribution of 44 human genomic feature loci across the 20kb fusion breakpoint regions that overlap the regions with the top 1% feature importance scores. More details are on the help page.
[Figure: genomic feature of top 1%]

Fusion Protein Features for ALG10B-MPRIP


Four levels of functional features of fusion genes
Go to the FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr12:38712260/chr17:16979024)
- FGviewer provides online visualization of the retention of protein functional features across the DNA, RNA, protein, and pathological levels.
- How to search
1. Enter your fusion gene symbol.
2. Press the Tab key until the breakpoint information is filled in.
3. Scroll down and press the 'Search' tab twice.
4. Scroll down to find the hyperlink to the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
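
The FGviewer link given above follows a chromosome:position pattern for each partner. As a minimal sketch, and assuming that other breakpoints use the same URL layout (only this one link is shown here, so that is an inference), the link could be assembled like this. `fgviewer_url` is a hypothetical helper, not an FGviewer API.

```python
FGVIEWER_BASE = "https://ccsmweb.uth.edu/FGviewer"


def fgviewer_url(hchr: str, hbp: int, tchr: str, tbp: int, base: str = FGVIEWER_BASE) -> str:
    """Build a link in the form <base>/<Hchr>:<Hbp>/<Tchr>:<Tbp>.

    The pattern is inferred from the single link on this page.
    """
    return f"{base}/{hchr}:{hbp}/{tchr}:{tbp}"


url = fgviewer_url("chr12", 38712260, "chr17", 16979024)
print(url)
```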

Main function of each fusion partner protein (from UniProt)
Hgene | Tgene
. | .
FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.

Retention analysis result of each fusion partner protein across 39 protein features of UniProt: six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and three secondary structure features. Because of limited viewing space, only the retention information for the 13 region features is shown here. All retention annotation results can be downloaded from the download page.

* A negative BPloci value means that the breakpoint is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 119_126 | 123 | 474.0 | Topological domain | Extracellular
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 1_6 | 123 | 474.0 | Topological domain | Cytoplasmic
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 28_64 | 123 | 474.0 | Topological domain | Extracellular
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 86_97 | 123 | 474.0 | Topological domain | Cytoplasmic
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 65_85 | 123 | 474.0 | Transmembrane | Helical
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 7_27 | 123 | 474.0 | Transmembrane | Helical
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 98_118 | 123 | 474.0 | Transmembrane | Helical
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000341712 |  | 0 | 24 | 673_977 | 41 | 1231.0 | Coiled coil | Ontology_term=ECO:0000255
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000395804 |  | 0 | 23 | 673_977 | 41 | 1026.0 | Coiled coil | Ontology_term=ECO:0000255
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000395811 |  | 0 | 23 | 673_977 | 41 | 1039.0 | Coiled coil | Ontology_term=ECO:0000255
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000444976 |  | 0 | 22 | 673_977 | 41 | 1001.0 | Coiled coil | Ontology_term=ECO:0000255
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000341712 |  | 0 | 24 | 179_252 | 41 | 1231.0 | Compositional bias | Note=Ser-rich
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000395804 |  | 0 | 23 | 179_252 | 41 | 1026.0 | Compositional bias | Note=Ser-rich
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000395811 |  | 0 | 23 | 179_252 | 41 | 1039.0 | Compositional bias | Note=Ser-rich
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000444976 |  | 0 | 22 | 179_252 | 41 | 1001.0 | Compositional bias | Note=Ser-rich
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000341712 |  | 0 | 24 | 387_483 | 41 | 1231.0 | Domain | PH 2
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000341712 |  | 0 | 24 | 43_150 | 41 | 1231.0 | Domain | PH 1
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000395804 |  | 0 | 23 | 387_483 | 41 | 1026.0 | Domain | PH 2
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000395804 |  | 0 | 23 | 43_150 | 41 | 1026.0 | Domain | PH 1
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000395811 |  | 0 | 23 | 387_483 | 41 | 1039.0 | Domain | PH 2
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000395811 |  | 0 | 23 | 43_150 | 41 | 1039.0 | Domain | PH 1
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000444976 |  | 0 | 22 | 387_483 | 41 | 1001.0 | Domain | PH 2
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000444976 |  | 0 | 22 | 43_150 | 41 | 1001.0 | Domain | PH 1

- In-frame and not-retained protein feature among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 148_150 | 123 | 474.0 | Topological domain | Cytoplasmic
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 172_175 | 123 | 474.0 | Topological domain | Extracellular
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 197_256 | 123 | 474.0 | Topological domain | Cytoplasmic
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 278_283 | 123 | 474.0 | Topological domain | Extracellular
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 305_317 | 123 | 474.0 | Topological domain | Cytoplasmic
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 339_365 | 123 | 474.0 | Topological domain | Extracellular
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 387_392 | 123 | 474.0 | Topological domain | Cytoplasmic
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 414_436 | 123 | 474.0 | Topological domain | Extracellular
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 458_473 | 123 | 474.0 | Topological domain | Cytoplasmic
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 127_147 | 123 | 474.0 | Transmembrane | Helical
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 151_171 | 123 | 474.0 | Transmembrane | Helical
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 176_196 | 123 | 474.0 | Transmembrane | Helical
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 257_277 | 123 | 474.0 | Transmembrane | Helical
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 284_304 | 123 | 474.0 | Transmembrane | Helical
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 318_338 | 123 | 474.0 | Transmembrane | Helical
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 366_386 | 123 | 474.0 | Transmembrane | Helical
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 393_413 | 123 | 474.0 | Transmembrane | Helical
Hgene | ALG10B | chr12:38712260 | chr17:16979024 | ENST00000308742 | + | 2 | 3 | 437_457 | 123 | 474.0 | Transmembrane | Helical
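
The split between retained and not-retained features in the two tables above can be reproduced with a simple positional comparison. The sketch below is one reading that is consistent with the rows shown: an Hgene (5') feature counts as retained when it starts at or before BPloci, and a Tgene (3') feature when it ends at or after BPloci, so a feature that spans the breakpoint, such as 119_126 with BPloci 123, counts as retained. Whether the database applies exactly this rule is an assumption, and `is_retained` is a hypothetical helper.

```python
def parse_loci(loci: str) -> tuple:
    """Split a 'start_end' protein feature locus such as '119_126' into ints."""
    start, end = loci.split("_")
    return int(start), int(end)


def is_retained(partner: str, loci: str, bp_loci: int) -> bool:
    """Decide retention of a protein feature relative to the breakpoint.

    Assumed rule (matches the rows on this page, not confirmed by the source):
    the 5' partner keeps residues up to the breakpoint, the 3' partner keeps
    residues from the breakpoint onward, and partial overlap counts as retained.
    """
    start, end = parse_loci(loci)
    if partner == "Hgene":
        return start <= bp_loci
    if partner == "Tgene":
        return end >= bp_loci
    raise ValueError(f"unknown partner: {partner!r}")


# A few rows from the tables: (partner, feature loci, BPloci).
rows = [
    ("Hgene", "119_126", 123),  # listed as retained
    ("Hgene", "98_118", 123),   # listed as retained
    ("Hgene", "127_147", 123),  # listed as not retained
    ("Tgene", "43_150", 41),    # listed as retained
]
results = [is_retained(*row) for row in rows]
print(results)
```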


Fusion Gene Sequence for ALG10B-MPRIP


For in-frame fusion transcripts, we provide the fusion transcript sequences and the fusion amino acid sequences. To obtain each fusion amino acid sequence, we ran ORFfinder and chose the longest of all predicted ORFs.
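
The "longest ORF" selection step can be illustrated with a deliberately simplified scanner. This is not ORFfinder: it only searches the three forward frames, requires an ATG start and an in-frame stop codon, and reports 1-based inclusive coordinates, all of which are assumptions made for the sketch. The toy sequence is made up for illustration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq: str) -> list:
    """Return (start, end) pairs, 1-based inclusive, for ATG...stop ORFs
    in the three forward reading frames."""
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i                        # first ATG opens an ORF
            elif start is not None and codon in STOP_CODONS:
                orfs.append((start + 1, i + 3))  # stop codon closes it
                start = None
    return orfs


def longest_orf(seq: str):
    """Pick the longest ORF by nucleotide length, or None if there is none."""
    return max(find_orfs(seq), key=lambda orf: orf[1] - orf[0], default=None)


# Toy sequence with a 9-nt ORF (ATG AAA TAG) and a 15-nt ORF (ATG CCC GGG TTT TGA).
toy = "CCATGAAATAGGGATGCCCGGGTTTTGACC"
print(find_orfs(toy), longest_orf(toy))
```

On the toy sequence the scanner finds both ORFs and selects the 15-nt one at positions 14 to 28.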
>3980_3980_1_ALG10B-MPRIP_ALG10B_chr12_38712260_ENST00000308742_MPRIP_chr17_16979024_ENST00000341712_length(transcript)=4408nt_BP=685nt
AAGCAGCAAGCCACGCCCCTCCCGCGCTCGCGAAATCCGAGACCCGCCCTTTCCGGAAGTTTTGACACTGTGCGCCCCGAGTAATGTGAT
GGAGAGGGTAATCATCCGGTCCGTTATCTAAACCCGTCACTCCAGGAAACAGCGACCCGCTGTTTTCCGGATCCGCGCTCTCCCAGCATC
CTTTGCCTTCCGGTATGTGGCCCCGTCTGGCTAGTCCTGTCTAGCGCGCCCATTTCGAGCCCAAGTTTCCAGCTCGGGTTTCCGGGCTCA
GAATTTTCCAGGAGTGGGTTCTTGGGCAGTGGCTGTGGGAGCAGGAATGGCGCAGCTAGAGGGTTACTGTTTCTCGGCCGCCTTGAGCTG
TACCTTTTTAGTGTCCTGCCTCCTCTTCTCCGCCTTCAGCCGGGCGCTGCGAGAGCCCTACATGGACGAGATCTTCCACCTGCCTCAGGC
GCAGCGCTACTGTGAGGGCCATTTCTCCCTTTCCCAGTGGGATCCCATGATTACTACATTACCTGGCTTGTACCTGGTGTCAGTTGGAGT
GGTCAAACCTGCCATTTGGATCTTTGCATGGTCTGAACATGTTGTCTGCTCCATTGGGATGCTCAGATTTGTTAATCTTCTCTTCAGTGT
TGGCAACTTCTATTTACTATATTTGCTTTTCCACAAGGTACAACCCAGAAACAAGGCAAAACCCATTTATGGCGGTTGGCTGCTCCTGGC
TCCAGATGGGACCGACTTTGACAACCCAGTGCACCGGTCTCGGAAATGGCAGCGACGGTTCTTCATCCTTTACGAGCACGGCCTCTTGCG
CTACGCCCTGGATGAGATGCCCACGACCCTTCCTCAGGGCACCATCAACATGAACCAGTGCACAGATGTGGTGGATGGGGAGGGCCGCAC
GGGCCAGAAGTTCTCCCTGTGTATTCTGACGCCTGAGAAGGAGCATTTCATCCGGGCGGAGACCAAGGAGATCGTCAGTGGGTGGCTGGA
GATGCTCATGGTCTATCCCCGGACCAACAAGCAGAATCAGAAGAAGAAACGGAAAGTGGAGCCCCCCACACCACAGGAGCCTGGGCCTGC
CAAGGTGGCTGTTACCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCATCCCCAGTGCTGAGAAAGTCCCCACCACCAAGTCCACACT
CTGGCAGGAAGAAATGAGGACCAAGGACCAGCCAGATGGCAGCAGCCTGAGTCCAGCTCAGAGTCCCAGCCAGAGCCAGCCTCCTGCTGC
CAGCTCCCTGCGGGAACCTGGGCTAGAGAGCAAAGAAGAGGAGAGCGCCATGAGTAGCGACCGCATGGACTGTGGCCGCAAAGTCCGGGT
GGAGAGCGGCTACTTCTCTCTGGAGAAGACCAAACAGGACTTGAAGGCTGAAGAACAGCAGCTGCCCCCGCCGCTCTCCCCTCCCAGCCC
CAGCACCCCCAACCACAGGAGGTCCCAGGTGATTGAAAAGTTTGAGGCCTTGGACATTGAGAAGGCAGAGCACATGGAGACCAATGCAGT
GGGGCCCTCACCATCCAGCGACACACGCCAGGGCCGCAGCGAGAAGAGGGCGTTCCCTAGGAAGCGGGACTTCACCAATGAAGCCCCCCC
AGCTCCTCTCCCAGACGCCTCGGCTTCCCCCCTGTCTCCACACCGAAGAGCCAAGTCACTGGACAGGAGGTCCACGGAGCCCTCCGTGAC
GCCCGACCTGCTGAATTTCAAGAAAGGCTGGCTGACTAAGCAGTATGAGGACGGCCAGTGGAAGAAACACTGGTTTGTCCTCGCCGATCA
AAGCCTGAGATACTACAGGGATTCAGTGGCTGAGGAGGCAGCCGACTTGGATGGAGAAATTGACTTGTCCGCATGTTACGATGTCACAGA
GTATCCAGTTCAGAGAAACTATGGCTTCCAGATACATACAAAGGAGGGCGAGTTTACCCTGTCGGCCATGACATCTGGGATTCGGCGGAA
CTGGATCCAGACCATCATGAAGCACGTGCACCCGACCACTGCCCCGGATGTGACCAGCTCGTTGCCAGAGGAAAAAAACAAGAGCAGCTG
CTCTTTTGAGACCTGCCCGAGGCCTACTGAGAAGCAAGAGGCAGAGCTGGGGGAGCCGGACCCTGAGCAGAAGAGGAGCCGCGCACGGGA
GCGGAGGCGAGAGGGCCGCTCCAAGACCTTTGACTGGGCTGAGTTCCGTCCCATCCAGCAGGCCCTGGCTCAGGAGCGGGTGGGCGGCGT
GGGGCCTGCTGACACCCACGAGCCCCTGCGCCCTGAGGCGGAGCCTGGGGAGCTGGAGCGGGAGCGTGCACGGAGGCGGGAGGAGCGCCG
CAAGCGCTTCGGGATGCTCGACGCCACAGACGGGCCAGGCACTGAGGATGCAGCCCTGCGCATGGAGGTGGACCGGAGCCCAGGGCTGCC
TATGAGCGACCTCAAAACGCATAACGTCCACGTGGAGATTGAGCAGCGGTGGCATCAGGTGGAGACCACACCTCTCCGGGAAGAGAAGCA
GGTGCCCATCGCCCCCGTCCACCTGTCTTCTGAAGATGGGGGTGACCGGCTCTCCACACACGAGCTGACCTCTCTGCTCGAGAAGGAGCT
GGAGCAGAGCCAGAAGGAGGCCTCAGACCTTCTGGAGCAGAACCGGCTCCTGCAGGACCAGCTGAGGGTGGCCCTGGGCCGGGAGCAGAG
CGCCCGTGAGGGCTACGTGCTGCAGGCCACGTGCGAGCGAGGGTTTGCAGCAATGGAAGAAACGCACCAGAAGAAGATTGAAGATCTCCA
GAGGCAGCACCAGCGGGAGCTAGAGAAACTTCGAGAAGAGAAAGACCGCCTCCTAGCCGAGGAGACAGCGGCCACCATCTCAGCCATCGA
AGCCATGAAGAACGCCCACCGGGAGGAAATGGAGCGGGAGCTGGAGAAGAGCCAGCGGTCCCAGATCAGCAGCGTCAACTCGGATGTTGA
GGCCCTGCGGCGCCAGTACCTGGAGGAGCTGCAGTCGGTGCAGCGGGAACTGGAGGTCCTCTCGGAGCAGTACTCGCAGAAGTGCCTGGA
GAATGCCCATCTGGCCCAGGCGCTGGAGGCCGAGCGGCAGGCCCTGCGGCAGTGCCAGCGTGAGAACCAGGAGCTCAATGCCCACAACCA
GGAGCTGAACAACCGCCTGGCTGCAGAGATCACACGGTTGCGGACGCTGCTGACTGGGGACGGCGGTGGGGAGGCCACTGGGTCACCCCT
TGCACAGGGCAAGGATGCCTATGAACTAGAGGTCTTATTGCGGGTAAAGGAATCGGAAATACAGTACCTGAAACAGGAGATTAGCTCCCT
CAAGGATGAGCTGCAGACGGCACTGCGGGACAAGAAGTACGCAAGTGACAAGTACAAAGACATCTACACAGAGCTCAGCATCGCGAAGGC
TAAGGCTGACTGTGACATCAGCAGGTTGAAGGAGCAGCTCAAGGCTGCAACGGAAGCACTGGGGGAGAAGTCCCCTGACAGTGCCACGGT
GTCCGGATATGATATAATGAAATCTAAAAGCAACCCTGACTTCTTGAAGAAAGACAGATCCTGTGTCACCCGGCAACTCAGAAACATCAG
GTCCAAGTCCGTAATTGAGCAGGTCTCGTGGGATACCTGAAATGCACCCGCTTCCCGGCCCATGCAGGAGAGTCTGAAGGAAGGCCTGAC
GGTGCAAGAACGGTTGAAGCTCTTTGAATCCAGGGACTTGAAGAAAGACTAGGTGTGTCCCATCCAAGTTGAGCACGCGCCTTCCCCAGC
TTGCAGCAGCACACCCCAAGCGCTGCTTTTCACCTGTACCTTTGTTTTATTATTATTATTATTATTGCTGTTGTTGTCATCGTTAACTGT
GGGCATGGAATGCGTGAGGCTGGCTTCTGGGTTGTCCACACCACTCTCTGCTGTGTTGACTTCCTGTTGTCTTCATCAAAGCTTTTTTCC
GTGGTATTCTAAAATTAGGCCAGCAGTGGGGGCTGGGAGGGCATCTGTGTTAGTCCTTTCCTGGCTGTGACCCGCCACACTCACTGTCAG
TATTAAGGCCCAGCAGCCTGTTGATAAGCTACCCTGTCTCACCATGTGCTGGTGTGGAAACGGGGCCCAGCCAGCACGCCTCAAGGTAGA
TGGAATCCCCACTGGTCAGAGAAAAAGCTATGCGGACACTCCAGCTTGGCCTGGGTCACAGCACTGACTCCTCACCCGCTAGTCTGGCTG
TTAAGAGGAGAAAGTGCACTGCCTTCCAGCCCAGGAGGAGGACAGCATTTTGTATTTGTTCCACTGATGCAGCTTAGAACCACACCCCTG

>3980_3980_1_ALG10B-MPRIP_ALG10B_chr12_38712260_ENST00000308742_MPRIP_chr17_16979024_ENST00000341712_length(amino acids)=1107AA_BP=123
MAQLEGYCFSAALSCTFLVSCLLFSAFSRALREPYMDEIFHLPQAQRYCEGHFSLSQWDPMITTLPGLYLVSVGVVKPAIWIFAWSEHVV
CSIGMLRFVNLLFSVGNFYLLYLLFHKVQPRNKAKPIYGGWLLLAPDGTDFDNPVHRSRKWQRRFFILYEHGLLRYALDEMPTTLPQGTI
NMNQCTDVVDGEGRTGQKFSLCILTPEKEHFIRAETKEIVSGWLEMLMVYPRTNKQNQKKKRKVEPPTPQEPGPAKVAVTSSSSSSSSSS
SIPSAEKVPTTKSTLWQEEMRTKDQPDGSSLSPAQSPSQSQPPAASSLREPGLESKEEESAMSSDRMDCGRKVRVESGYFSLEKTKQDLK
AEEQQLPPPLSPPSPSTPNHRRSQVIEKFEALDIEKAEHMETNAVGPSPSSDTRQGRSEKRAFPRKRDFTNEAPPAPLPDASASPLSPHR
RAKSLDRRSTEPSVTPDLLNFKKGWLTKQYEDGQWKKHWFVLADQSLRYYRDSVAEEAADLDGEIDLSACYDVTEYPVQRNYGFQIHTKE
GEFTLSAMTSGIRRNWIQTIMKHVHPTTAPDVTSSLPEEKNKSSCSFETCPRPTEKQEAELGEPDPEQKRSRARERRREGRSKTFDWAEF
RPIQQALAQERVGGVGPADTHEPLRPEAEPGELERERARRREERRKRFGMLDATDGPGTEDAALRMEVDRSPGLPMSDLKTHNVHVEIEQ
RWHQVETTPLREEKQVPIAPVHLSSEDGGDRLSTHELTSLLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQATCERGF
AAMEETHQKKIEDLQRQHQRELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSVNSDVEALRRQYLEELQSVQR
ELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLTGDGGGEATGSPLAQGKDAYELEVLLRV
KESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGEKSPDSATVSGYDIMKSKSNPDFL

--------------------------------------------------------------
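
Each sequence record on this page starts with a FASTA-style header that packs the fusion name, partner genes, breakpoints, transcript IDs, sequence length and breakpoint position into one underscore-separated line. The sketch below parses that header with a regular expression inferred from the headers shown here. The field names are illustrative, the meaning of the leading numeric prefix is not documented on this page, and gene symbols containing underscores would not match.

```python
import re

HEADER_RE = re.compile(
    r"^>(?P<record_id>\d+_\d+_\d+)_(?P<fusion>[^_]+)"
    r"_(?P<hgene>[^_]+)_(?P<hchr>chr[^_]+)_(?P<hbp>\d+)_(?P<henst>ENST\d+)"
    r"_(?P<tgene>[^_]+)_(?P<tchr>chr[^_]+)_(?P<tbp>\d+)_(?P<tenst>ENST\d+)"
    r"_length\((?P<kind>transcript|amino acids)\)=(?P<length>\d+)(?:nt|AA)"
    r"_BP=(?P<bp>\d+)(?:nt)?$"
)


def parse_header(line: str) -> dict:
    """Parse one sequence header line into a dict; numeric fields become ints."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised header: {line!r}")
    fields = match.groupdict()
    for key in ("hbp", "tbp", "length", "bp"):
        fields[key] = int(fields[key])
    return fields


nt_header = (">3980_3980_1_ALG10B-MPRIP_ALG10B_chr12_38712260_ENST00000308742"
             "_MPRIP_chr17_16979024_ENST00000341712_length(transcript)=4408nt_BP=685nt")
aa_header = (">3980_3980_1_ALG10B-MPRIP_ALG10B_chr12_38712260_ENST00000308742"
             "_MPRIP_chr17_16979024_ENST00000341712_length(amino acids)=1107AA_BP=123")

nt_fields = parse_header(nt_header)
aa_fields = parse_header(aa_header)
print(nt_fields["fusion"], nt_fields["length"], aa_fields["length"], aa_fields["bp"])
```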
>3980_3980_2_ALG10B-MPRIP_ALG10B_chr12_38712260_ENST00000308742_MPRIP_chr17_16979024_ENST00000395804_length(transcript)=3640nt_BP=685nt
AAGCAGCAAGCCACGCCCCTCCCGCGCTCGCGAAATCCGAGACCCGCCCTTTCCGGAAGTTTTGACACTGTGCGCCCCGAGTAATGTGAT
GGAGAGGGTAATCATCCGGTCCGTTATCTAAACCCGTCACTCCAGGAAACAGCGACCCGCTGTTTTCCGGATCCGCGCTCTCCCAGCATC
CTTTGCCTTCCGGTATGTGGCCCCGTCTGGCTAGTCCTGTCTAGCGCGCCCATTTCGAGCCCAAGTTTCCAGCTCGGGTTTCCGGGCTCA
GAATTTTCCAGGAGTGGGTTCTTGGGCAGTGGCTGTGGGAGCAGGAATGGCGCAGCTAGAGGGTTACTGTTTCTCGGCCGCCTTGAGCTG
TACCTTTTTAGTGTCCTGCCTCCTCTTCTCCGCCTTCAGCCGGGCGCTGCGAGAGCCCTACATGGACGAGATCTTCCACCTGCCTCAGGC
GCAGCGCTACTGTGAGGGCCATTTCTCCCTTTCCCAGTGGGATCCCATGATTACTACATTACCTGGCTTGTACCTGGTGTCAGTTGGAGT
GGTCAAACCTGCCATTTGGATCTTTGCATGGTCTGAACATGTTGTCTGCTCCATTGGGATGCTCAGATTTGTTAATCTTCTCTTCAGTGT
TGGCAACTTCTATTTACTATATTTGCTTTTCCACAAGGTACAACCCAGAAACAAGGCAAAACCCATTTATGGCGGTTGGCTGCTCCTGGC
TCCAGATGGGACCGACTTTGACAACCCAGTGCACCGGTCTCGGAAATGGCAGCGACGGTTCTTCATCCTTTACGAGCACGGCCTCTTGCG
CTACGCCCTGGATGAGATGCCCACGACCCTTCCTCAGGGCACCATCAACATGAACCAGTGCACAGATGTGGTGGATGGGGAGGGCCGCAC
GGGCCAGAAGTTCTCCCTGTGTATTCTGACGCCTGAGAAGGAGCATTTCATCCGGGCGGAGACCAAGGAGATCGTCAGTGGGTGGCTGGA
GATGCTCATGGTCTATCCCCGGACCAACAAGCAGAATCAGAAGAAGAAACGGAAAGTGGAGCCCCCCACACCACAGGAGCCTGGGCCTGC
CAAGGTGGCTGTTACCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCATCCCCAGTGCTGAGAAAGTCCCCACCACCAAGTCCACACT
CTGGCAGGAAGAAATGAGGACCAAGGACCAGCCAGATGGCAGCAGCCTGAGTCCAGCTCAGAGTCCCAGCCAGAGCCAGCCTCCTGCTGC
CAGCTCCCTGCGGGAACCTGGGCTAGAGAGCAAAGAAGAGGAGAGCGCCATGAGTAGCGACCGCATGGACTGTGGCCGCAAAGTCCGGGT
GGAGAGCGGCTACTTCTCTCTGGAGAAGACCAAACAGGACTTGAAGGCTGAAGAACAGCAGCTGCCCCCGCCGCTCTCCCCTCCCAGCCC
CAGCACCCCCAACCACAGGAGGTCCCAGGTGATTGAAAAGTTTGAGGCCTTGGACATTGAGAAGGCAGAGCACATGGAGACCAATGCAGT
GGGGCCCTCACCATCCAGCGACACACGCCAGGGCCGCAGCGAGAAGAGGGCGTTCCCTAGGAAGCGGGACTTCACCAATGAAGCCCCCCC
AGCTCCTCTCCCAGACGCCTCGGCTTCCCCCCTGTCTCCACACCGAAGAGCCAAGTCACTGGACAGGAGGTCCACGGAGCCCTCCGTGAC
GCCCGACCTGCTGAATTTCAAGAAAGGCTGGCTGACTAAGCAGTATGAGGACGGCCAGTGGAAGAAACACTGGTTTGTCCTCGCCGATCA
AAGCCTGAGATACTACAGGGATTCAGTGGCTGAGGAGGCAGCCGACTTGGATGGAGAAATTGACTTGTCCGCATGTTACGATGTCACAGA
GTATCCAGTTCAGAGAAACTATGGCTTCCAGATACATACAAAGGAGGGCGAGTTTACCCTGTCGGCCATGACATCTGGGATTCGGCGGAA
CTGGATCCAGACCATCATGAAGCACGTGCACCCGACCACTGCCCCGGATGTGACCAGCTCGTTGCCAGAGGAAAAAAACAAGAGCAGCTG
CTCTTTTGAGACCTGCCCGAGGCCTACTGAGAAGCAAGAGGCAGAGCTGGGGGAGCCGGACCCTGAGCAGAAGAGGAGCCGCGCACGGGA
GCGGAGGCGAGAGGGCCGCTCCAAGACCTTTGACTGGGCTGAGTTCCGTCCCATCCAGCAGGCCCTGGCTCAGGAGCGGGTGGGCGGCGT
GGGGCCTGCTGACACCCACGAGCCCCTGCGCCCTGAGGCGGAGCCTGGGGAGCTGGAGCGGGAGCGTGCACGGAGGCGGGAGGAGCGCCG
CAAGCGCTTCGGGATGCTCGACGCCACAGACGGGCCAGGCACTGAGGATGCAGCCCTGCGCATGGAGGTGGACCGGAGCCCAGGGCTGCC
TATGAGCGACCTCAAAACGCATAACGTCCACGTGGAGATTGAGCAGCGGTGGCATCAGGTGGAGACCACACCTCTCCGGGAAGAGAAGCA
GGTGCCCATCGCCCCCGTCCACCTGTCTTCTGAAGATGGGGGTGACCGGCTCTCCACACACGAGCTGACCTCTCTGCTCGAGAAGGAGCT
GGAGCAGAGCCAGAAGGAGGCCTCAGACCTTCTGGAGCAGAACCGGCTCCTGCAGGACCAGCTGAGGGTGGCCCTGGGCCGGGAGCAGAG
CGCCCGTGAGGGCTACGTGCTGCAGGCCACGTGCGAGCGAGGGTTTGCAGCAATGGAAGAAACGCACCAGAAGAAGATTGAAGATCTCCA
GAGGCAGCACCAGCGGGAGCTAGAGAAACTTCGAGAAGAGAAAGACCGCCTCCTAGCCGAGGAGACAGCGGCCACCATCTCAGCCATCGA
AGCCATGAAGAACGCCCACCGGGAGGAAATGGAGCGGGAGCTGGAGAAGAGCCAGCGGTCCCAGATCAGCAGCGTCAACTCGGATGTTGA
GGCCCTGCGGCGCCAGTACCTGGAGGAGCTGCAGTCGGTGCAGCGGGAACTGGAGGTCCTCTCGGAGCAGTACTCGCAGAAGTGCCTGGA
GAATGCCCATCTGGCCCAGGCGCTGGAGGCCGAGCGGCAGGCCCTGCGGCAGTGCCAGCGTGAGAACCAGGAGCTCAATGCCCACAACCA
GGAGCTGAACAACCGCCTGGCTGCAGAGATCACACGGTTGCGGACGCTGCTGACTGGGGACGGCGGTGGGGAGGCCACTGGGTCACCCCT
TGCACAGGGCAAGGATGCCTATGAACTAGAGGTCTTATTGCGGGTAAAGGAATCGGAAATACAGTACCTGAAACAGGAGATTAGCTCCCT
CAAGGATGAGCTGCAGACGGCACTGCGGGACAAGAAGTACGCAAGTGACAAGTACAAAGACATCTACACAGAGCTCAGCATCGCGAAGGC
TAAGGCTGACTGTGACATCAGCAGGTTGAAGGAGCAGCTCAAGGCTGCAACGGAAGCACTGGGGGAGAAGTCCCCTGACAGTGCCACGGT
GTCCGGATATGATATAATGAAATCTAAAAGCAACCCTGACTTCTTGAAGAAAGACAGATCCTGTGTCACCCGGCAACTCAGAAACATCAG

>3980_3980_2_ALG10B-MPRIP_ALG10B_chr12_38712260_ENST00000308742_MPRIP_chr17_16979024_ENST00000395804_length(amino acids)=1108AA_BP=123
MAQLEGYCFSAALSCTFLVSCLLFSAFSRALREPYMDEIFHLPQAQRYCEGHFSLSQWDPMITTLPGLYLVSVGVVKPAIWIFAWSEHVV
CSIGMLRFVNLLFSVGNFYLLYLLFHKVQPRNKAKPIYGGWLLLAPDGTDFDNPVHRSRKWQRRFFILYEHGLLRYALDEMPTTLPQGTI
NMNQCTDVVDGEGRTGQKFSLCILTPEKEHFIRAETKEIVSGWLEMLMVYPRTNKQNQKKKRKVEPPTPQEPGPAKVAVTSSSSSSSSSS
SIPSAEKVPTTKSTLWQEEMRTKDQPDGSSLSPAQSPSQSQPPAASSLREPGLESKEEESAMSSDRMDCGRKVRVESGYFSLEKTKQDLK
AEEQQLPPPLSPPSPSTPNHRRSQVIEKFEALDIEKAEHMETNAVGPSPSSDTRQGRSEKRAFPRKRDFTNEAPPAPLPDASASPLSPHR
RAKSLDRRSTEPSVTPDLLNFKKGWLTKQYEDGQWKKHWFVLADQSLRYYRDSVAEEAADLDGEIDLSACYDVTEYPVQRNYGFQIHTKE
GEFTLSAMTSGIRRNWIQTIMKHVHPTTAPDVTSSLPEEKNKSSCSFETCPRPTEKQEAELGEPDPEQKRSRARERRREGRSKTFDWAEF
RPIQQALAQERVGGVGPADTHEPLRPEAEPGELERERARRREERRKRFGMLDATDGPGTEDAALRMEVDRSPGLPMSDLKTHNVHVEIEQ
RWHQVETTPLREEKQVPIAPVHLSSEDGGDRLSTHELTSLLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQATCERGF
AAMEETHQKKIEDLQRQHQRELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSVNSDVEALRRQYLEELQSVQR
ELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLTGDGGGEATGSPLAQGKDAYELEVLLRV
KESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGEKSPDSATVSGYDIMKSKSNPDFL

--------------------------------------------------------------
>3980_3980_3_ALG10B-MPRIP_ALG10B_chr12_38712260_ENST00000308742_MPRIP_chr17_16979024_ENST00000395811_length(transcript)=11433nt_BP=685nt
AAGCAGCAAGCCACGCCCCTCCCGCGCTCGCGAAATCCGAGACCCGCCCTTTCCGGAAGTTTTGACACTGTGCGCCCCGAGTAATGTGAT
GGAGAGGGTAATCATCCGGTCCGTTATCTAAACCCGTCACTCCAGGAAACAGCGACCCGCTGTTTTCCGGATCCGCGCTCTCCCAGCATC
CTTTGCCTTCCGGTATGTGGCCCCGTCTGGCTAGTCCTGTCTAGCGCGCCCATTTCGAGCCCAAGTTTCCAGCTCGGGTTTCCGGGCTCA
GAATTTTCCAGGAGTGGGTTCTTGGGCAGTGGCTGTGGGAGCAGGAATGGCGCAGCTAGAGGGTTACTGTTTCTCGGCCGCCTTGAGCTG
TACCTTTTTAGTGTCCTGCCTCCTCTTCTCCGCCTTCAGCCGGGCGCTGCGAGAGCCCTACATGGACGAGATCTTCCACCTGCCTCAGGC
GCAGCGCTACTGTGAGGGCCATTTCTCCCTTTCCCAGTGGGATCCCATGATTACTACATTACCTGGCTTGTACCTGGTGTCAGTTGGAGT
GGTCAAACCTGCCATTTGGATCTTTGCATGGTCTGAACATGTTGTCTGCTCCATTGGGATGCTCAGATTTGTTAATCTTCTCTTCAGTGT
TGGCAACTTCTATTTACTATATTTGCTTTTCCACAAGGTACAACCCAGAAACAAGGCAAAACCCATTTATGGCGGTTGGCTGCTCCTGGC
TCCAGATGGGACCGACTTTGACAACCCAGTGCACCGGTCTCGGAAATGGCAGCGACGGTTCTTCATCCTTTACGAGCACGGCCTCTTGCG
CTACGCCCTGGATGAGATGCCCACGACCCTTCCTCAGGGCACCATCAACATGAACCAGTGCACAGATGTGGTGGATGGGGAGGGCCGCAC
GGGCCAGAAGTTCTCCCTGTGTATTCTGACGCCTGAGAAGGAGCATTTCATCCGGGCGGAGACCAAGGAGATCGTCAGTGGGTGGCTGGA
GATGCTCATGGTCTATCCCCGGACCAACAAGCAGAATCAGAAGAAGAAACGGAAAGTGGAGCCCCCCACACCACAGGAGCCTGGGCCTGC
CAAGGTGGCTGTTACCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCATCCCCAGTGCTGAGAAAGTCCCCACCACCAAGTCCACACT
CTGGCAGGAAGAAATGAGGACCAAGGACCAGCCAGATGGCAGCAGCCTGAGTCCAGCTCAGAGTCCCAGCCAGAGCCAGCCTCCTGCTGC
CAGCTCCCTGCGGGAACCTGGGCTAGAGAGCAAAGAAGAGGAGAGCGCCATGAGTAGCGACCGCATGGACTGTGGCCGCAAAGTCCGGGT
GGAGAGCGGCTACTTCTCTCTGGAGAAGACCAAACAGGACTTGAAGGCTGAAGAACAGCAGCTGCCCCCGCCGCTCTCCCCTCCCAGCCC
CAGCACCCCCAACCACAGGAGGTCCCAGGTGATTGAAAAGTTTGAGGCCTTGGACATTGAGAAGGCAGAGCACATGGAGACCAATGCAGT
GGGGCCCTCACCATCCAGCGACACACGCCAGGGCCGCAGCGAGAAGAGGGCGTTCCCTAGGAAGCGGGACTTCACCAATGAAGCCCCCCC
AGCTCCTCTCCCAGACGCCTCGGCTTCCCCCCTGTCTCCACACCGAAGAGCCAAGTCACTGGACAGGAGGTCCACGGAGCCCTCCGTGAC
GCCCGACCTGCTGAATTTCAAGAAAGGCTGGCTGACTAAGCAGTATGAGGACGGCCAGTGGAAGAAACACTGGTTTGTCCTCGCCGATCA
AAGCCTGAGATACTACAGGGATTCAGTGGCTGAGGAGGCAGCCGACTTGGATGGAGAAATTGACTTGTCCGCATGTTACGATGTCACAGA
GTATCCAGTTCAGAGAAACTATGGCTTCCAGATACATACAAAGGAGGGCGAGTTTACCCTGTCGGCCATGACATCTGGGATTCGGCGGAA
CTGGATCCAGACCATCATGAAGCACGTGCACCCGACCACTGCCCCGGATGTGACCAGCTCGTTGCCAGAGGAAAAAAACAAGAGCAGCTG
CTCTTTTGAGACCTGCCCGAGGCCTACTGAGAAGCAAGAGGCAGAGCTGGGGGAGCCGGACCCTGAGCAGAAGAGGAGCCGCGCACGGGA
GCGGAGGCGAGAGGGCCGCTCCAAGACCTTTGACTGGGCTGAGTTCCGTCCCATCCAGCAGGCCCTGGCTCAGGAGCGGGTGGGCGGCGT
GGGGCCTGCTGACACCCACGAGCCCCTGCGCCCTGAGGCGGAGCCTGGGGAGCTGGAGCGGGAGCGTGCACGGAGGCGGGAGGAGCGCCG
CAAGCGCTTCGGGATGCTCGACGCCACAGACGGGCCAGGCACTGAGGATGCAGCCCTGCGCATGGAGGTGGACCGGAGCCCAGGGCTGCC
TATGAGCGACCTCAAAACGCATAACGTCCACGTGGAGATTGAGCAGCGGTGGCATCAGGTGGAGACCACACCTCTCCGGGAAGAGAAGCA
GGTGCCCATCGCCCCCGTCCACCTGTCTTCTGAAGATGGGGGTGACCGGCTCTCCACACACGAGCTGACCTCTCTGCTCGAGAAGGAGCT
GGAGCAGAGCCAGAAGGAGGCCTCAGACCTTCTGGAGCAGAACCGGCTCCTGCAGGACCAGCTGAGGGTGGCCCTGGGCCGGGAGCAGAG
CGCCCGTGAGGGCTACGTGCTGCAGGCCACGTGCGAGCGAGGGTTTGCAGCAATGGAAGAAACGCACCAGAAGAAGATTGAAGATCTCCA
GAGGCAGCACCAGCGGGAGCTAGAGAAACTTCGAGAAGAGAAAGACCGCCTCCTAGCCGAGGAGACAGCGGCCACCATCTCAGCCATCGA
AGCCATGAAGAACGCCCACCGGGAGGAAATGGAGCGGGAGCTGGAGAAGAGCCAGCGGTCCCAGATCAGCAGCGTCAACTCGGATGTTGA
GGCCCTGCGGCGCCAGTACCTGGAGGAGCTGCAGTCGGTGCAGCGGGAACTGGAGGTCCTCTCGGAGCAGTACTCGCAGAAGTGCCTGGA
GAATGCCCATCTGGCCCAGGCGCTGGAGGCCGAGCGGCAGGCCCTGCGGCAGTGCCAGCGTGAGAACCAGGAGCTCAATGCCCACAACCA
GGAGCTGAACAACCGCCTGGCTGCAGAGATCACACGGTTGCGGACGCTGCTGACTGGGGACGGCGGTGGGGAGGCCACTGGGTCACCCCT
TGCACAGGGCAAGGATGCCTATGAACTAGAGGTCTTATTGCGGGTAAAGGAATCGGAAATACAGTACCTGAAACAGGAGATTAGCTCCCT
CAAGGATGAGCTGCAGACGGCACTGCGGGACAAGAAGTACGCAAGTGACAAGTACAAAGACATCTACACAGAGCTCAGCATCGCGAAGGC
TAAGGCTGACTGTGACATCAGCAGGTTGAAGGAGCAGCTCAAGGCTGCAACGGAAGCACTGGGGGAGAAGTCCCCTGACAGTGCCACGGT
GTCCGGATATGATATAATGAAATCTAAAAGCAACCCTGACTTCTTGAAGAAAGACAGATCCTGTGTCACCCGGCAACTCAGAAACATCAG
GTCCAAGAGTCTGAAGGAAGGCCTGACGGTGCAAGAACGGTTGAAGCTCTTTGAATCCAGGGACTTGAAGAAAGACTAGGTGTGTCCCAT
CCAAGTTGAGCACGCGCCTTCCCCAGCTTGCAGCAGCACACCCCAAGCGCTGCTTTTCACCTGTACCTTTGTTTTATTATTATTATTATT
ATTGCTGTTGTTGTCATCGTTAACTGTGGGCATGGAATGCGTGAGGCTGGCTTCTGGGTTGTCCACACCACTCTCTGCTGTGTTGACTTC
CTGTTGTCTTCATCAAAGCTTTTTTCCGTGGTATTCTAAAATTAGGCCAGCAGTGGGGGCTGGGAGGGCATCTGTGTTAGTCCTTTCCTG
GCTGTGACCCGCCACACTCACTGTCAGTATTAAGGCCCAGCAGCCTGTTGATAAGCTACCCTGTCTCACCATGTGCTGGTGTGGAAACGG
GGCCCAGCCAGCACGCCTCAAGGTAGATGGAATCCCCACTGGTCAGAGAAAAAGCTATGCGGACACTCCAGCTTGGCCTGGGTCACAGCA
CTGACTCCTCACCCGCTAGTCTGGCTGTTAAGAGGAGAAAGTGCACTGCCTTCCAGCCCAGGAGGAGGACAGCATTTTGTATTTGTTCCA
CTGATGCAGCTTAGAACCACACCCCTGAGAGTCGTGGCAAACCTTTCACAACCTGGAAAATGTTGAAAGCAACCATTCCTATTTTTGTTT
GTTTTTTATTAAATCTTGCACAAAATCCCCGGCCCCTCTCCTTCCTTCCTTCCTTCCTTCCTCCGCTCGTTCCTTTCTTGGTCTCCAGTA
ACCCTGGTCTTTTCATAACTGCTCGAGATTGTTGACCTGCAGCCCAGGTTTCAGACTCTGATTGCAAAAAACAAATGAATTCCCCCCAGG
AATCATTCAAAATGGGGGAAGGTTTGGGGGTTTGGGTTTTTTTTTTACCTTTTGGAAAAGAAACCGTCACATTGCTTTGGAAAAGGTTGA
GAGGAGACCCCTGTTAAGTCAAGAAGAAAGTACAGAGGATGTCAGAATCTGATGAGAACAGCACATTAGTGTTTATTGAGACTCCGATCT
TAACTCTCATTTAATTAATCTGAGCTCTGAAAACCTATCTTGCAGCATTTATCTTTAAAAGAGCCTGGTTAAAGTAAACCTATACTAACA
ATTTTGCTTTTTCTAACAGTTTGAGGAAGACCTTTTTAACCACCACAAAACATTCTATGGCAATTCTTGAAAATCTCTTAAATTGGAGTC
TATTATGGCCCCATGAAAACCATTAATCCCATTAAGATAGGGAGTATAAACCCCTGGCTGGTGGAACAGGTTCTGCTACTTTAGGAGCAA
GGTGGGGTGTGAGTAGATGGTTTTCATGCCAAGAACATGCTTTCACTTTGTATTCATGCTTGTGTTGGTGTGATGGTCTCTGTGGGTGGG
TGGATGCTTTGGGCGTTGAAATCTAGAAATCCTGTTGCTCAGTTTCTAGATGAAGTCATGAGCAAGGCCATCAGTGGAGCTCTGGCCCCG
CCCCCAATGTGCAGAAGGGCCGGGAGCAAGGCCTGGAGTTTTCATGTGTTTTCAGACCCAGGTTTAGGTGCTCTCTTCTCACTGAAATAA
CTAAGTGCTCTCCACTGGCATCGAGCCCTTTCCACAAGTTTTTAAGGCTCTTAACCCACACTTTCACTCCTCTGCTACTAGTCTTCAGTG
TTGTTAACAGCAAGAGAAAATTGGGTTTGTTTAAAAATCTACTTCTCTGAGGTGGCACAGTTGCGTAGCTGTAGTCCCAGCTACTCAGGA
GGCTGAGGTAGTAGGATTGCTTGAGCCCGGGAGGTCGAGGCTGCAATCAGTCATGATCGTGTCACTGCACTCCAGCCTGGGTGACAAAGC
AAGGCCCCATATCAGATATAGATATACTTATCAGACCCCCCCTGACCATTTAGATTGGCAGTGCTTTGAGAAATGCACTATGACCTTTCT
GTGTCAATGGGAATATACAGAAGGAACATTCGGGACCCCGCTGTCCCCCACAGCCTCATTGTTGTCTCCAGGACACTGCTGGGTCACACG
AATGCTCCAGGACAGACAGGGACCTGGAGTGCATCAGGATCTGACCAGATAGGAGTTTTTGCCTCGTGTCTGGGTGCTACGATTTTGTGC
CGTTCTCTGAGGTCCACCACCTGCCCTTCCTGGCATGGTTTCCTTCGTGACCATCCCTGCTGCCCCTGGGGGTGGACCCCACTGGCCCTT
CTGCAGACAGCTCCCTGCCTTCTGCCCTCCAGGGGGTTCTGGCCAGAGTCCATGCTTGGAGACAGGATCATCTGCCTTCAGCCCTCACAG
TGCTTTAAATTAAAGCAAGTTTGCCCATAGGACAAAAGAGCATTTGATTCCCTTTTTTCTGTCACATATCCCTTGAGGCTGGACTTCAGG
AATCCTGGAAAATTAATATGAGTGCAGCATGTGAGGGGTCAGAGACAGGCCAGCAGGGCGTCTGCATTCCTCCCTGCCACAGGTCTCTCC
CCAGAGGCTGGTTTAGTGTAGGGTATTGCCAGGAAACGGACTGAGGCTGCTTTGCTAAGAGCTCCTGAAAATGCCCTGGGCCTGTCCTGG
CGTTTCTGAAGAGCCCTCATACAGGGACAGCCACCATCTGGGTCAAGGAAGTCTGGGTTCCCTGCTGGTGGGCTCCATCCTGCGATGGAG
TGAACCAGGCGAGAAAGGATGACGATGTTCTTCATGTTGCACCTGGACATGCCCCAGGAACAGAGACTTGCCCAGGTGGCAACACTGGCA
CAGATGTTGACGGCTGCCCAACTGGTGCCACACTGAGCAGGGAGCCTTGTGCTGCACAGGGCTGGGCCCTCTCTCCAGTTTCCTTCCTGC
AGGCATCCAAATACCCTGGAAGGGATTTAACCCCTGAATTCCAGAGGGAAGAAAGAAGAACAGTGAAGAAGTAGAACTGGTTTCTGTATG
GGGAGAGGAAAGTCTTAGGGACAGCTGCAGGCGGGGTCTCAGGCTGCTCCTTGGCACCAGCTACACAGTAGTGAGCTTTCCCAGCTTTAC
CGATGAGGAAGAAGTTCAAATAGATAGACTTCAGCATTTTAATTATTTTCCTATAAATGTATTTATGTGTAGTATGCTAGCACCAGCCAG
TAAGCTGTGCCACACATATGAATGGGAAAGCGAGGCAGTTGTGCTCGTGTGAGTTTCTGCAGGCTTGTGGGTAATTACCTTGTGTGCACG
CCTGCACGTGCAGAATAGTCACTTTCTGCTGGTCAGTTTCTTTATCCACCCATGGTGCCCCAGCCCCAGGCAGGTGTGGAGACCAGCATT
TCAGAGGACGCGCTGTCCACAGCCTCCCGGGTCTGAGTGGATCATTGGGCAGGGGTGGAGACAGTGCGCTGCCCTCTGAGCTGGAAGCCT
GTGCTTCAGGGAGTCATAATGGGCCTGTGCTAAGTGGGTGATGCAGTGGACATCCCAGGGCGACTAGAGGTGGCAGTATCGCGAATTTGC
AGGTTTATTGAACAAGAGGTAACATCGGAGAGGATCTTGCCTTCGGATTCAGCAAGTATGAAGGCAGAAGAGCATGGAGAGCAAGGCCCC
ACAGCCTGCTTAGTGAGTTGGAAGGCCCAGCAAGAACCTGTTTCTGCAGCAGCCACCAGCTCCCATCACCCCTTGACCCTCCAGCTCATG
CTGGAGAAGAGGGAATTTTGGCTGTTTAAAGAACACAGTTGTGAATCTCAGAATGTGCCTGAAAGGAATACTGACAGATAAGGCCGGAAA
CAAAACTGATGGCTTGAAAAACATTTTTATGGAATGTATTTACTATCATTTTGTTTTACTATAGAGGTAGATGGGACTCTTAACTTTTGG
GTACATGGAAACATGCTGAAAACTGAACACAATCCTGATCATCACTCCTGCCTGGCTGTCTCCTGGGAGGCTGCCGGGTGCCACGGAGCT
GGGACACAGCAGAGCCCGCTAGGTGTTGCAGGGCCCTGGAGGCCAAGGCCACCCTGTGTGGGGTCCCTGTTGGCAGCCAGGTCCCTACAC
AAACAAGTAATCCTGTTTGGCCTCCTAGGTTTTGCATATGACCTGCAGCCTAATTTGGGGTGTAGGGGAAGCTCTGCTGGCCCCTGCTCC
TTTGTATGTTGGGTGACTTTAATGGCTGGCCACATACCCCTTTCTCCCAGCTACTCATTCACTGACTTGGGTAAGTTCTAAGACAGTTCG
CACTTAGAAAAGAATGTGACACATCAACATTAACTTTTCCTGAAAAGAAGAGTTTGCCTAACATGGTCCTAAAGAAGCTTGGAATTTATA
AGACTTTCCTTTATAAGATATAGTGGGGGTTTTTTTGGGTGGAGGGGGGTTGTTTTTTGTTTTTTGTTTTCAAGACAGAGTCTCGCTCTG
TTGTCCAGGCTGGAGTGTAGTGGCATGATCTCGGCTCACTGCAACCTCTGCCTCCCAGGTTCATGCCATTCTCCTGCCTCAGCCTCCCGA
GTAGCTGGGACTACAGGTGTCTGCCGCCACGCCTGGCTAATTTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGTCAGGATG
GTCTCGATTTCCTGACCTCGTGATCCGCCTGTCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCACGCCTGGCCTATAAGA
TACGGTAAAAAAAAAAAACTGTGACCCCTTTGTCACTAAGGGAGAAAGAAATTAAGTATTGTCAAAGTTCTATAAAGAATGGAAATGTAT
GATATTATACTTCAAAGGAATTTGATGTTGAAATTTTAAAGAAAATTTGTCATGTTGATGAGAAGCTTCACTTTCCTGGGAACTTCATTC
GTTTTAGGGCATGAGATAAAAGTCCTGGCTAGGGGAGCCATAGGTCTGTTGTACAAGGAATTTGCTTTCTAAACAAGTTGTAACTTGCCC
TAAGGTCCCTGTTGGAGCACTAAGAGGTGACACAGGCCAGAGACAACGTTTCGTTTCCCCTTCCCTGCAAGCTGGGATCAGCCCTGTGTT
TTCTCCTTTCAGCTGAAGTGAGCGAAGGTTCTCAGTGCTGGCAAAAGAGCCCACTTTCTAAAAGGACTTGGGAAGAAAGCTGCTGGGAAC
TTGCTTATTAAAAAGTTCCTTAGAATTAAGGTATCTACCCACTGTTTTCGCACCTTTCACCTTCCTGGGCTTTCCTGCCCTCCAGCATTC
TTCTCTAGAGAGGTTCCTAGCCCGCTCAGCGCGAGCGTCTCCAGTAGGTAATAGCAGCTGAACGTGGGTTTTCCACGGACTTCAGGCTTG
GAGGTGCCATATACAAGCACACTTCTTCCTTCCCCTGGCTTCTCCATGCCACCACCCACTTTAAAGATGTAAACTCAGTAGATTTTTCAT
CCAGTGAACGGTCATCTTCACATCGAAAGGTGAAGGCCACCACTGTTCTCAATGCCAAGCAACAGAACGTTCTGAGATGGCCGTTCTTCC
TTGCACAGCAGCTACGGCAGGTTGTTCTGCAGCCACCCCTTAGAGGGGGCTCTTCGTTTTACCTTTGTACAGTTCTTGTGTTTACACATT
TGGGCCAAACAGCTTTCAGCAAGGGCATGTGTCCACAGCTGATGGGCAGTTAAGAACCAGCCTGAGCTGAAGGCTAGTAATACCGTGCTG
TAGGCTCTTTAAAAGGAAAGCCTGGCATAAACCCAGCATGGAAAGGAACATTATCAGTTATCTCAAATTTTGTCTGCCAGGGACAAGACC
CTGTTCATTCTTTTGCCCTTTTCAGAACTGTGAGCTTCAAGTATTCTTGCTTCTCTGTAAAGGGAAGACATCTCCCTTCTCTGAAATCCT
TCAACAAAAGAAAAGGCTCTTGGCAGGGTAGGGGAGTCAGTAGCTCAACACTAGATCATCCCTAGAGATGGGGCAAGTTTCTGTCTGAAC
ACGTCTTGGGTCCGAGTCCTTAGGTGTTCGGATGCAGTACTTTGTGAATACTTAAGCTACTGCATGCTTGGTGTAGCTTGCAATTTCTCT
GTATTTAAAAGCAGCTGTGTTTATTTTCTTCAAAATAACCTGTATATTATTTAGAGCAAGCAATGTAAATATTACTGAGAAGTTACTGCA
GGGATTTTTGTGACAGAGTTTGTATGGGTTTTTAAAAAAATCTTAGACACCCCTTTTTAAGATGGGGAGAACAGGGTTGACTGCACCGTT
GAAGCCCGCCCAGCATTATAAGGAAATGTTTTTAATGACTGCTGCATCTTTGTAAAACGTTTGGTCATCTAACAGATGGTTTTAAAGTGT
ACAATATCCAAAATAACGATAGCCCTGTATCCATACATTGTTTCATTGAAAGAATTCTCTATTGCCTCTTCTTGGTAGAGCCAGAGTCCT
TAAGGAAAATCAGGAAAATTAAGAAAATGATGGTGCCATCTTGACCAGACTTCTGCACAGTAATTTAACGCTATCCTAGGGAGACTTGGT
TGAAGGCACAGTTCTGGGATCAGGGTCTAAATGTGCAGTTTCTGAGAACCTTCAAGACCACTCACTGGGCAGGGCTCTGTGGAGCACTGG
AGCTGTTTGGATTCCCCAGCCCTTTGGTCATATCCTGGAATTCCGTGGAGGCTGCAGAACTTAGATGCAGCTGTTTTTTACAGCACCTAT
TTTTGTCAGATTGGTAAGGAAACACTGAGTCACAGAATACTTAAGAATTGGAGACTCCAGTAATGTAGGATGGCCTGAGAGGACGTCCAA
GTCCCAAGGGGTGGACACGGCATGTTCCTCGGGCACAGCCTCAGTGGGGGCCTTCCCCAGGCGCAGCTCGGCCACCTGAGGAAAGGGTGT
TTCGGAGGCGCAGCCACACACACAGCGCTGGCAGCCTCACGGTCACGCCCATCACTCCCTGCCCCCCACTGCCCTTGAGAAGTTAGTGGT
GTCACATCCTTAGTTTTATAGACAGCTAGGAATAGATTGTGAAGAACACTCAGTTCACTACTGTGTTACATTTATATCACAAGCTTCAAT
TAAAATGGATTTTAAAGGATTTTAGGATTTACCTTTAGTATTAACAACGTATCTACTGACATACTGTTAGGATTCAAAACCAGTTAAGTA
TAAGAATTACTTCATGTGGTTTTCCTAGGGTACAATTTATAAAAGGTAGAAAGCATCCAAGTGGCTCCTCAACAATTACAATTCTTAATG
ATTTTTCTCACAGCTGTGCCCTTCTGTCAGGGTCAGTGTCAAAATTCGTTATCAAAGGCAAAACCTACTGTGCCAAGCTGGGGCGCTATA
TGTGAACGGAGTGGAAATGCTTCAGTCACCTCTGCCGCAGCTTGTGATTCCAGCAGTTCTCACAAACGTTCTGTCACATGATGAAAAGAA
GCAGCTTGTATAATTCCAACTGGTGTTTCATTTCTGTTCTAATGCTAAGTGGTAACGCTTAACAAACAGACTAAAAGCTGTGTGCAGAAG
AAAGGGCTGAATGAGTACCGCCTCCCTAGGTTCCAGCACAGCGCTCGGGTCTAAGAAGTAGAGCCCCGGGGTAGGGTGGGCCATCCACTG
TCAGGCCAGTGTCTCAAGAAAGCCTGACCAGCTGAGCTGCTGCTTTTTTTTTGGGGGGGGGGGGGGGAGGGGCGTCTTGAGGCTTTTTTT
TTTTTTACAAAGTTAGTTTGTGATCAACGATTCACTACAATTGAAGTGTTACTTTGTCAGAATATTTATTCCTTTGTGTGACATGCTAGA
TTCCCTGGATGTAGCTGATCATTTTTATTTTGTAAATATTACCTAACTTTACATAAACTATATCATAATAAACTATTTTTGCATCACCCT

>3980_3980_3_ALG10B-MPRIP_ALG10B_chr12_38712260_ENST00000308742_MPRIP_chr17_16979024_ENST00000395811_length(amino acids)=1120AA_BP=123
MAQLEGYCFSAALSCTFLVSCLLFSAFSRALREPYMDEIFHLPQAQRYCEGHFSLSQWDPMITTLPGLYLVSVGVVKPAIWIFAWSEHVV
CSIGMLRFVNLLFSVGNFYLLYLLFHKVQPRNKAKPIYGGWLLLAPDGTDFDNPVHRSRKWQRRFFILYEHGLLRYALDEMPTTLPQGTI
NMNQCTDVVDGEGRTGQKFSLCILTPEKEHFIRAETKEIVSGWLEMLMVYPRTNKQNQKKKRKVEPPTPQEPGPAKVAVTSSSSSSSSSS
SIPSAEKVPTTKSTLWQEEMRTKDQPDGSSLSPAQSPSQSQPPAASSLREPGLESKEEESAMSSDRMDCGRKVRVESGYFSLEKTKQDLK
AEEQQLPPPLSPPSPSTPNHRRSQVIEKFEALDIEKAEHMETNAVGPSPSSDTRQGRSEKRAFPRKRDFTNEAPPAPLPDASASPLSPHR
RAKSLDRRSTEPSVTPDLLNFKKGWLTKQYEDGQWKKHWFVLADQSLRYYRDSVAEEAADLDGEIDLSACYDVTEYPVQRNYGFQIHTKE
GEFTLSAMTSGIRRNWIQTIMKHVHPTTAPDVTSSLPEEKNKSSCSFETCPRPTEKQEAELGEPDPEQKRSRARERRREGRSKTFDWAEF
RPIQQALAQERVGGVGPADTHEPLRPEAEPGELERERARRREERRKRFGMLDATDGPGTEDAALRMEVDRSPGLPMSDLKTHNVHVEIEQ
RWHQVETTPLREEKQVPIAPVHLSSEDGGDRLSTHELTSLLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQATCERGF
AAMEETHQKKIEDLQRQHQRELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSVNSDVEALRRQYLEELQSVQR
ELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLTGDGGGEATGSPLAQGKDAYELEVLLRV
KESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGEKSPDSATVSGYDIMKSKSNPDFL

--------------------------------------------------------------
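The record headers in this listing pack several fields into one underscore-delimited line. The following is a minimal sketch of how such a header might be split into named fields. The function name `parse_fusion_header`, the field names, and the reading of `BP` as the breakpoint position within the sequence are assumptions made for illustration, not part of FusionGDB2. The pattern also assumes that gene symbols contain no underscores.

```python
import re
from typing import Optional

# Hypothetical pattern for headers of the form shown in this listing, e.g.
# >3980_3980_4_ALG10B-MPRIP_ALG10B_chr12_38712260_ENST00000308742_MPRIP_chr17_16979024_ENST00000444976_length(transcript)=3726nt_BP=685nt
HEADER_RE = re.compile(
    r"^>(?P<record_id>\d+_\d+_\d+)_"          # leading numeric identifiers (meaning not documented here)
    r"(?P<fusion>[^_]+)_"                     # fusion name, e.g. ALG10B-MPRIP
    r"(?P<hgene>[^_]+)_(?P<hchr>chr[^_]+)_(?P<hpos>\d+)_(?P<henst>ENST\d+)_"
    r"(?P<tgene>[^_]+)_(?P<tchr>chr[^_]+)_(?P<tpos>\d+)_(?P<tenst>ENST\d+)_"
    r"length\((?P<kind>transcript|amino acids)\)=(?P<length>\d+)(?:nt|AA)_"
    r"BP=(?P<bp>\d+)(?:nt)?$"
)

def parse_fusion_header(line: str) -> Optional[dict]:
    """Return the header fields as a dict, or None if the line does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        return None
    fields = match.groupdict()
    # Convert the numeric fields from strings to integers.
    for key in ("hpos", "tpos", "length", "bp"):
        fields[key] = int(fields[key])
    return fields

if __name__ == "__main__":
    example = (
        ">3980_3980_4_ALG10B-MPRIP_ALG10B_chr12_38712260_ENST00000308742_"
        "MPRIP_chr17_16979024_ENST00000444976_length(transcript)=3726nt_BP=685nt"
    )
    print(parse_fusion_header(example))
```

Sequence lines and separator lines do not start with `>`, so the function returns `None` for them. This lets a caller use it to detect record boundaries while reading the listing line by line.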
>3980_3980_4_ALG10B-MPRIP_ALG10B_chr12_38712260_ENST00000308742_MPRIP_chr17_16979024_ENST00000444976_length(transcript)=3726nt_BP=685nt
AAGCAGCAAGCCACGCCCCTCCCGCGCTCGCGAAATCCGAGACCCGCCCTTTCCGGAAGTTTTGACACTGTGCGCCCCGAGTAATGTGAT
GGAGAGGGTAATCATCCGGTCCGTTATCTAAACCCGTCACTCCAGGAAACAGCGACCCGCTGTTTTCCGGATCCGCGCTCTCCCAGCATC
CTTTGCCTTCCGGTATGTGGCCCCGTCTGGCTAGTCCTGTCTAGCGCGCCCATTTCGAGCCCAAGTTTCCAGCTCGGGTTTCCGGGCTCA
GAATTTTCCAGGAGTGGGTTCTTGGGCAGTGGCTGTGGGAGCAGGAATGGCGCAGCTAGAGGGTTACTGTTTCTCGGCCGCCTTGAGCTG
TACCTTTTTAGTGTCCTGCCTCCTCTTCTCCGCCTTCAGCCGGGCGCTGCGAGAGCCCTACATGGACGAGATCTTCCACCTGCCTCAGGC
GCAGCGCTACTGTGAGGGCCATTTCTCCCTTTCCCAGTGGGATCCCATGATTACTACATTACCTGGCTTGTACCTGGTGTCAGTTGGAGT
GGTCAAACCTGCCATTTGGATCTTTGCATGGTCTGAACATGTTGTCTGCTCCATTGGGATGCTCAGATTTGTTAATCTTCTCTTCAGTGT
TGGCAACTTCTATTTACTATATTTGCTTTTCCACAAGGTACAACCCAGAAACAAGGCAAAACCCATTTATGGCGGTTGGCTGCTCCTGGC
TCCAGATGGGACCGACTTTGACAACCCAGTGCACCGGTCTCGGAAATGGCAGCGACGGTTCTTCATCCTTTACGAGCACGGCCTCTTGCG
CTACGCCCTGGATGAGATGCCCACGACCCTTCCTCAGGGCACCATCAACATGAACCAGTGCACAGATGTGGTGGATGGGGAGGGCCGCAC
GGGCCAGAAGTTCTCCCTGTGTATTCTGACGCCTGAGAAGGAGCATTTCATCCGGGCGGAGACCAAGGAGATCGTCAGTGGGTGGCTGGA
GATGCTCATGGTCTATCCCCGGACCAACAAGCAGAATCAGAAGAAGAAACGGAAAGTGGAGCCCCCCACACCACAGGAGCCTGGGCCTGC
CAAGGTGGCTGTTACCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCATCCCCAGTGCTGAGAAAGTCCCCACCACCAAGTCCACACT
CTGGCAGGAAGAAATGAGGACCAAGGACCAGCCAGATGGCAGCAGCCTGAGTCCAGCTCAGAGTCCCAGCCAGAGCCAGCCTCCTGCTGC
CAGCTCCCTGCGGGAACCTGGGCTAGAGAGCAAAGAAGAGGAGAGCGCCATGAGTAGCGACCGCATGGACTGTGGCCGCAAAGTCCGGGT
GGAGAGCGGCTACTTCTCTCTGGAGAAGACCAAACAGGACTTGAAGGCTGAAGAACAGCAGCTGCCCCCGCCGCTCTCCCCTCCCAGCCC
CAGCACCCCCAACCACAGGAGGTCCCAGGTGATTGAAAAGTTTGAGGCCTTGGACATTGAGAAGGCAGAGCACATGGAGACCAATGCAGT
GGGGCCCTCACCATCCAGCGACACACGCCAGGGCCGCAGCGAGAAGAGGGCGTTCCCTAGGAAGCGGCCCGACCTGCTGAATTTCAAGAA
AGGCTGGCTGACTAAGCAGTATGAGGACGGCCAGTGGAAGAAACACTGGTTTGTCCTCGCCGATCAAAGCCTGAGATACTACAGGGATTC
AGTGGCTGAGGAGGCAGCCGACTTGGATGGAGAAATTGACTTGTCCGCATGTTACGATGTCACAGAGTATCCAGTTCAGAGAAACTATGG
CTTCCAGATACATACAAAGGAGGGCGAGTTTACCCTGTCGGCCATGACATCTGGGATTCGGCGGAACTGGATCCAGACCATCATGAAGCA
CGTGCACCCGACCACTGCCCCGGATGTGACCAGCTCGTTGCCAGAGGAAAAAAACAAGAGCAGCTGCTCTTTTGAGACCTGCCCGAGGCC
TACTGAGAAGCAAGAGGCAGAGCTGGGGGAGCCGGACCCTGAGCAGAAGAGGAGCCGCGCACGGGAGCGGAGGCGAGAGGGCCGCTCCAA
GACCTTTGACTGGGCTGAGTTCCGTCCCATCCAGCAGGCCCTGGCTCAGGAGCGGGTGGGCGGCGTGGGGCCTGCTGACACCCACGAGCC
CCTGCGCCCTGAGGCGGAGCCTGGGGAGCTGGAGCGGGAGCGTGCACGGAGGCGGGAGGAGCGCCGCAAGCGCTTCGGGATGCTCGACGC
CACAGACGGGCCAGGCACTGAGGATGCAGCCCTGCGCATGGAGGTGGACCGGAGCCCAGGGCTGCCTATGAGCGACCTCAAAACGCATAA
CGTCCACGTGGAGATTGAGCAGCGGTGGCATCAGGTGGAGACCACACCTCTCCGGGAAGAGAAGCAGGTGCCCATCGCCCCCGTCCACCT
GTCTTCTGAAGATGGGGGTGACCGGCTCTCCACACACGAGCTGACCTCTCTGCTCGAGAAGGAGCTGGAGCAGAGCCAGAAGGAGGCCTC
AGACCTTCTGGAGCAGAACCGGCTCCTGCAGGACCAGCTGAGGGTGGCCCTGGGCCGGGAGCAGAGCGCCCGTGAGGGCTACGTGCTGCA
GGCCACGTGCGAGCGAGGGTTTGCAGCAATGGAAGAAACGCACCAGAAGAAGATTGAAGATCTCCAGAGGCAGCACCAGCGGGAGCTAGA
GAAACTTCGAGAAGAGAAAGACCGCCTCCTAGCCGAGGAGACAGCGGCCACCATCTCAGCCATCGAAGCCATGAAGAACGCCCACCGGGA
GGAAATGGAGCGGGAGCTGGAGAAGAGCCAGCGGTCCCAGATCAGCAGCGTCAACTCGGATGTTGAGGCCCTGCGGCGCCAGTACCTGGA
GGAGCTGCAGTCGGTGCAGCGGGAACTGGAGGTCCTCTCGGAGCAGTACTCGCAGAAGTGCCTGGAGAATGCCCATCTGGCCCAGGCGCT
GGAGGCCGAGCGGCAGGCCCTGCGGCAGTGCCAGCGTGAGAACCAGGAGCTCAATGCCCACAACCAGGAGCTGAACAACCGCCTGGCTGC
AGAGATCACACGGTTGCGGACGCTGCTGACTGGGGACGGCGGTGGGGAGGCCACTGGGTCACCCCTTGCACAGGGCAAGGATGCCTATGA
ACTAGAGGTCTTATTGCGGGTAAAGGAATCGGAAATACAGTACCTGAAACAGGAGATTAGCTCCCTCAAGGATGAGCTGCAGACGGCACT
GCGGGACAAGAAGTACGCAAGTGACAAGTACAAAGACATCTACACAGAGCTCAGCATCGCGAAGGCTAAGGCTGACTGTGACATCAGCAG
GTTGAAGGAGCAGCTCAAGGCTGCAACGGAAGCACTGGGGGAGAAGTCCCCTGACAGTGCCACGGTGTCCGGATATGATATAATGAAATC
TAAAAGCAACCCTGACTTCTTGAAGAAAGACAGATCCTGTGTCACCCGGCAACTCAGAAACATCAGGTCCAAGAGTCTGAAGGAAGGCCT
GACGGTGCAAGAACGGTTGAAGCTCTTTGAATCCAGGGACTTGAAGAAAGACTAGGTGTGTCCCATCCAAGTTGAGCACGCGCCTTCCCC
AGCTTGCAGCAGCACACCCCAAGCGCTGCTTTTCACCTGTACCTTTGTTTTATTATTATTATTATTATTGCTGTTGTTGTCATCGTTAAC

>3980_3980_4_ALG10B-MPRIP_ALG10B_chr12_38712260_ENST00000308742_MPRIP_chr17_16979024_ENST00000444976_length(amino acids)=1082AA_BP=123
MAQLEGYCFSAALSCTFLVSCLLFSAFSRALREPYMDEIFHLPQAQRYCEGHFSLSQWDPMITTLPGLYLVSVGVVKPAIWIFAWSEHVV
CSIGMLRFVNLLFSVGNFYLLYLLFHKVQPRNKAKPIYGGWLLLAPDGTDFDNPVHRSRKWQRRFFILYEHGLLRYALDEMPTTLPQGTI
NMNQCTDVVDGEGRTGQKFSLCILTPEKEHFIRAETKEIVSGWLEMLMVYPRTNKQNQKKKRKVEPPTPQEPGPAKVAVTSSSSSSSSSS
SIPSAEKVPTTKSTLWQEEMRTKDQPDGSSLSPAQSPSQSQPPAASSLREPGLESKEEESAMSSDRMDCGRKVRVESGYFSLEKTKQDLK
AEEQQLPPPLSPPSPSTPNHRRSQVIEKFEALDIEKAEHMETNAVGPSPSSDTRQGRSEKRAFPRKRPDLLNFKKGWLTKQYEDGQWKKH
WFVLADQSLRYYRDSVAEEAADLDGEIDLSACYDVTEYPVQRNYGFQIHTKEGEFTLSAMTSGIRRNWIQTIMKHVHPTTAPDVTSSLPE
EKNKSSCSFETCPRPTEKQEAELGEPDPEQKRSRARERRREGRSKTFDWAEFRPIQQALAQERVGGVGPADTHEPLRPEAEPGELERERA
RRREERRKRFGMLDATDGPGTEDAALRMEVDRSPGLPMSDLKTHNVHVEIEQRWHQVETTPLREEKQVPIAPVHLSSEDGGDRLSTHELT
SLLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQATCERGFAAMEETHQKKIEDLQRQHQRELEKLREEKDRLLAEETA
ATISAIEAMKNAHREEMERELEKSQRSQISSVNSDVEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQ
ELNAHNQELNNRLAAEITRLRTLLTGDGGGEATGSPLAQGKDAYELEVLLRVKESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIYT
ELSIAKAKADCDISRLKEQLKAATEALGEKSPDSATVSGYDIMKSKSNPDFLKKDRSCVTRQLRNIRSKSLKEGLTVQERLKLFESRDLK

--------------------------------------------------------------
>3980_3980_5_ALG10B-MPRIP_ALG10B_chr12_38712260_ENST00000551464_MPRIP_chr17_16979024_ENST00000341712_length(transcript)=4224nt_BP=501nt
GCCTTCCGGTATGTGGCCCCGTCTGGCTAGTCCTGTCTAGCGCGCCCATTTCGAGCCCAAGTTTCCAGCTCGGGTTTCCGGGCTCAGAAT
TTTCCAGGAGTGGGTTCTTGGGCAGTGGCTGTGGGAGCAGGAATGGCGCAGCTAGAGGGTTACTGTTTCTCGGCCGCCTTGAGCTGTACC
TTTTTAGTGTCCTGCCTCCTCTTCTCCGCCTTCAGCCGGGCGCTGCGAGAGCCCTACATGGACGAGATCTTCCACCTGCCTCAGGCGCAG
CGCTACTGTGAGGGCCATTTCTCCCTTTCCCAGTGGGATCCCATGATTACTACATTACCTGGCTTGTACCTGGTGTCAGTTGGAGTGGTC
AAACCTGCCATTTGGATCTTTGCATGGTCTGAACATGTTGTCTGCTCCATTGGGATGCTCAGATTTGTTAATCTTCTCTTCAGTGTTGGC
AACTTCTATTTACTATATTTGCTTTTCCACAAGGTACAACCCAGAAACAAGGCAAAACCCATTTATGGCGGTTGGCTGCTCCTGGCTCCA
GATGGGACCGACTTTGACAACCCAGTGCACCGGTCTCGGAAATGGCAGCGACGGTTCTTCATCCTTTACGAGCACGGCCTCTTGCGCTAC
GCCCTGGATGAGATGCCCACGACCCTTCCTCAGGGCACCATCAACATGAACCAGTGCACAGATGTGGTGGATGGGGAGGGCCGCACGGGC
CAGAAGTTCTCCCTGTGTATTCTGACGCCTGAGAAGGAGCATTTCATCCGGGCGGAGACCAAGGAGATCGTCAGTGGGTGGCTGGAGATG
CTCATGGTCTATCCCCGGACCAACAAGCAGAATCAGAAGAAGAAACGGAAAGTGGAGCCCCCCACACCACAGGAGCCTGGGCCTGCCAAG
GTGGCTGTTACCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCATCCCCAGTGCTGAGAAAGTCCCCACCACCAAGTCCACACTCTGG
CAGGAAGAAATGAGGACCAAGGACCAGCCAGATGGCAGCAGCCTGAGTCCAGCTCAGAGTCCCAGCCAGAGCCAGCCTCCTGCTGCCAGC
TCCCTGCGGGAACCTGGGCTAGAGAGCAAAGAAGAGGAGAGCGCCATGAGTAGCGACCGCATGGACTGTGGCCGCAAAGTCCGGGTGGAG
AGCGGCTACTTCTCTCTGGAGAAGACCAAACAGGACTTGAAGGCTGAAGAACAGCAGCTGCCCCCGCCGCTCTCCCCTCCCAGCCCCAGC
ACCCCCAACCACAGGAGGTCCCAGGTGATTGAAAAGTTTGAGGCCTTGGACATTGAGAAGGCAGAGCACATGGAGACCAATGCAGTGGGG
CCCTCACCATCCAGCGACACACGCCAGGGCCGCAGCGAGAAGAGGGCGTTCCCTAGGAAGCGGGACTTCACCAATGAAGCCCCCCCAGCT
CCTCTCCCAGACGCCTCGGCTTCCCCCCTGTCTCCACACCGAAGAGCCAAGTCACTGGACAGGAGGTCCACGGAGCCCTCCGTGACGCCC
GACCTGCTGAATTTCAAGAAAGGCTGGCTGACTAAGCAGTATGAGGACGGCCAGTGGAAGAAACACTGGTTTGTCCTCGCCGATCAAAGC
CTGAGATACTACAGGGATTCAGTGGCTGAGGAGGCAGCCGACTTGGATGGAGAAATTGACTTGTCCGCATGTTACGATGTCACAGAGTAT
CCAGTTCAGAGAAACTATGGCTTCCAGATACATACAAAGGAGGGCGAGTTTACCCTGTCGGCCATGACATCTGGGATTCGGCGGAACTGG
ATCCAGACCATCATGAAGCACGTGCACCCGACCACTGCCCCGGATGTGACCAGCTCGTTGCCAGAGGAAAAAAACAAGAGCAGCTGCTCT
TTTGAGACCTGCCCGAGGCCTACTGAGAAGCAAGAGGCAGAGCTGGGGGAGCCGGACCCTGAGCAGAAGAGGAGCCGCGCACGGGAGCGG
AGGCGAGAGGGCCGCTCCAAGACCTTTGACTGGGCTGAGTTCCGTCCCATCCAGCAGGCCCTGGCTCAGGAGCGGGTGGGCGGCGTGGGG
CCTGCTGACACCCACGAGCCCCTGCGCCCTGAGGCGGAGCCTGGGGAGCTGGAGCGGGAGCGTGCACGGAGGCGGGAGGAGCGCCGCAAG
CGCTTCGGGATGCTCGACGCCACAGACGGGCCAGGCACTGAGGATGCAGCCCTGCGCATGGAGGTGGACCGGAGCCCAGGGCTGCCTATG
AGCGACCTCAAAACGCATAACGTCCACGTGGAGATTGAGCAGCGGTGGCATCAGGTGGAGACCACACCTCTCCGGGAAGAGAAGCAGGTG
CCCATCGCCCCCGTCCACCTGTCTTCTGAAGATGGGGGTGACCGGCTCTCCACACACGAGCTGACCTCTCTGCTCGAGAAGGAGCTGGAG
CAGAGCCAGAAGGAGGCCTCAGACCTTCTGGAGCAGAACCGGCTCCTGCAGGACCAGCTGAGGGTGGCCCTGGGCCGGGAGCAGAGCGCC
CGTGAGGGCTACGTGCTGCAGGCCACGTGCGAGCGAGGGTTTGCAGCAATGGAAGAAACGCACCAGAAGAAGATTGAAGATCTCCAGAGG
CAGCACCAGCGGGAGCTAGAGAAACTTCGAGAAGAGAAAGACCGCCTCCTAGCCGAGGAGACAGCGGCCACCATCTCAGCCATCGAAGCC
ATGAAGAACGCCCACCGGGAGGAAATGGAGCGGGAGCTGGAGAAGAGCCAGCGGTCCCAGATCAGCAGCGTCAACTCGGATGTTGAGGCC
CTGCGGCGCCAGTACCTGGAGGAGCTGCAGTCGGTGCAGCGGGAACTGGAGGTCCTCTCGGAGCAGTACTCGCAGAAGTGCCTGGAGAAT
GCCCATCTGGCCCAGGCGCTGGAGGCCGAGCGGCAGGCCCTGCGGCAGTGCCAGCGTGAGAACCAGGAGCTCAATGCCCACAACCAGGAG
CTGAACAACCGCCTGGCTGCAGAGATCACACGGTTGCGGACGCTGCTGACTGGGGACGGCGGTGGGGAGGCCACTGGGTCACCCCTTGCA
CAGGGCAAGGATGCCTATGAACTAGAGGTCTTATTGCGGGTAAAGGAATCGGAAATACAGTACCTGAAACAGGAGATTAGCTCCCTCAAG
GATGAGCTGCAGACGGCACTGCGGGACAAGAAGTACGCAAGTGACAAGTACAAAGACATCTACACAGAGCTCAGCATCGCGAAGGCTAAG
GCTGACTGTGACATCAGCAGGTTGAAGGAGCAGCTCAAGGCTGCAACGGAAGCACTGGGGGAGAAGTCCCCTGACAGTGCCACGGTGTCC
GGATATGATATAATGAAATCTAAAAGCAACCCTGACTTCTTGAAGAAAGACAGATCCTGTGTCACCCGGCAACTCAGAAACATCAGGTCC
AAGTCCGTAATTGAGCAGGTCTCGTGGGATACCTGAAATGCACCCGCTTCCCGGCCCATGCAGGAGAGTCTGAAGGAAGGCCTGACGGTG
CAAGAACGGTTGAAGCTCTTTGAATCCAGGGACTTGAAGAAAGACTAGGTGTGTCCCATCCAAGTTGAGCACGCGCCTTCCCCAGCTTGC
AGCAGCACACCCCAAGCGCTGCTTTTCACCTGTACCTTTGTTTTATTATTATTATTATTATTGCTGTTGTTGTCATCGTTAACTGTGGGC
ATGGAATGCGTGAGGCTGGCTTCTGGGTTGTCCACACCACTCTCTGCTGTGTTGACTTCCTGTTGTCTTCATCAAAGCTTTTTTCCGTGG
TATTCTAAAATTAGGCCAGCAGTGGGGGCTGGGAGGGCATCTGTGTTAGTCCTTTCCTGGCTGTGACCCGCCACACTCACTGTCAGTATT
AAGGCCCAGCAGCCTGTTGATAAGCTACCCTGTCTCACCATGTGCTGGTGTGGAAACGGGGCCCAGCCAGCACGCCTCAAGGTAGATGGA
ATCCCCACTGGTCAGAGAAAAAGCTATGCGGACACTCCAGCTTGGCCTGGGTCACAGCACTGACTCCTCACCCGCTAGTCTGGCTGTTAA
GAGGAGAAAGTGCACTGCCTTCCAGCCCAGGAGGAGGACAGCATTTTGTATTTGTTCCACTGATGCAGCTTAGAACCACACCCCTGAGAG

>3980_3980_5_ALG10B-MPRIP_ALG10B_chr12_38712260_ENST00000551464_MPRIP_chr17_16979024_ENST00000341712_length(amino acids)=1107AA_BP=123
MAQLEGYCFSAALSCTFLVSCLLFSAFSRALREPYMDEIFHLPQAQRYCEGHFSLSQWDPMITTLPGLYLVSVGVVKPAIWIFAWSEHVV
CSIGMLRFVNLLFSVGNFYLLYLLFHKVQPRNKAKPIYGGWLLLAPDGTDFDNPVHRSRKWQRRFFILYEHGLLRYALDEMPTTLPQGTI
NMNQCTDVVDGEGRTGQKFSLCILTPEKEHFIRAETKEIVSGWLEMLMVYPRTNKQNQKKKRKVEPPTPQEPGPAKVAVTSSSSSSSSSS
SIPSAEKVPTTKSTLWQEEMRTKDQPDGSSLSPAQSPSQSQPPAASSLREPGLESKEEESAMSSDRMDCGRKVRVESGYFSLEKTKQDLK
AEEQQLPPPLSPPSPSTPNHRRSQVIEKFEALDIEKAEHMETNAVGPSPSSDTRQGRSEKRAFPRKRDFTNEAPPAPLPDASASPLSPHR
RAKSLDRRSTEPSVTPDLLNFKKGWLTKQYEDGQWKKHWFVLADQSLRYYRDSVAEEAADLDGEIDLSACYDVTEYPVQRNYGFQIHTKE
GEFTLSAMTSGIRRNWIQTIMKHVHPTTAPDVTSSLPEEKNKSSCSFETCPRPTEKQEAELGEPDPEQKRSRARERRREGRSKTFDWAEF
RPIQQALAQERVGGVGPADTHEPLRPEAEPGELERERARRREERRKRFGMLDATDGPGTEDAALRMEVDRSPGLPMSDLKTHNVHVEIEQ
RWHQVETTPLREEKQVPIAPVHLSSEDGGDRLSTHELTSLLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQATCERGF
AAMEETHQKKIEDLQRQHQRELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSVNSDVEALRRQYLEELQSVQR
ELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLTGDGGGEATGSPLAQGKDAYELEVLLRV
KESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGEKSPDSATVSGYDIMKSKSNPDFL

--------------------------------------------------------------
>3980_3980_6_ALG10B-MPRIP_ALG10B_chr12_38712260_ENST00000551464_MPRIP_chr17_16979024_ENST00000395804_length(transcript)=3456nt_BP=501nt
GCCTTCCGGTATGTGGCCCCGTCTGGCTAGTCCTGTCTAGCGCGCCCATTTCGAGCCCAAGTTTCCAGCTCGGGTTTCCGGGCTCAGAAT
TTTCCAGGAGTGGGTTCTTGGGCAGTGGCTGTGGGAGCAGGAATGGCGCAGCTAGAGGGTTACTGTTTCTCGGCCGCCTTGAGCTGTACC
TTTTTAGTGTCCTGCCTCCTCTTCTCCGCCTTCAGCCGGGCGCTGCGAGAGCCCTACATGGACGAGATCTTCCACCTGCCTCAGGCGCAG
CGCTACTGTGAGGGCCATTTCTCCCTTTCCCAGTGGGATCCCATGATTACTACATTACCTGGCTTGTACCTGGTGTCAGTTGGAGTGGTC
AAACCTGCCATTTGGATCTTTGCATGGTCTGAACATGTTGTCTGCTCCATTGGGATGCTCAGATTTGTTAATCTTCTCTTCAGTGTTGGC
AACTTCTATTTACTATATTTGCTTTTCCACAAGGTACAACCCAGAAACAAGGCAAAACCCATTTATGGCGGTTGGCTGCTCCTGGCTCCA
GATGGGACCGACTTTGACAACCCAGTGCACCGGTCTCGGAAATGGCAGCGACGGTTCTTCATCCTTTACGAGCACGGCCTCTTGCGCTAC
GCCCTGGATGAGATGCCCACGACCCTTCCTCAGGGCACCATCAACATGAACCAGTGCACAGATGTGGTGGATGGGGAGGGCCGCACGGGC
CAGAAGTTCTCCCTGTGTATTCTGACGCCTGAGAAGGAGCATTTCATCCGGGCGGAGACCAAGGAGATCGTCAGTGGGTGGCTGGAGATG
CTCATGGTCTATCCCCGGACCAACAAGCAGAATCAGAAGAAGAAACGGAAAGTGGAGCCCCCCACACCACAGGAGCCTGGGCCTGCCAAG
GTGGCTGTTACCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCATCCCCAGTGCTGAGAAAGTCCCCACCACCAAGTCCACACTCTGG
CAGGAAGAAATGAGGACCAAGGACCAGCCAGATGGCAGCAGCCTGAGTCCAGCTCAGAGTCCCAGCCAGAGCCAGCCTCCTGCTGCCAGC
TCCCTGCGGGAACCTGGGCTAGAGAGCAAAGAAGAGGAGAGCGCCATGAGTAGCGACCGCATGGACTGTGGCCGCAAAGTCCGGGTGGAG
AGCGGCTACTTCTCTCTGGAGAAGACCAAACAGGACTTGAAGGCTGAAGAACAGCAGCTGCCCCCGCCGCTCTCCCCTCCCAGCCCCAGC
ACCCCCAACCACAGGAGGTCCCAGGTGATTGAAAAGTTTGAGGCCTTGGACATTGAGAAGGCAGAGCACATGGAGACCAATGCAGTGGGG
CCCTCACCATCCAGCGACACACGCCAGGGCCGCAGCGAGAAGAGGGCGTTCCCTAGGAAGCGGGACTTCACCAATGAAGCCCCCCCAGCT
CCTCTCCCAGACGCCTCGGCTTCCCCCCTGTCTCCACACCGAAGAGCCAAGTCACTGGACAGGAGGTCCACGGAGCCCTCCGTGACGCCC
GACCTGCTGAATTTCAAGAAAGGCTGGCTGACTAAGCAGTATGAGGACGGCCAGTGGAAGAAACACTGGTTTGTCCTCGCCGATCAAAGC
CTGAGATACTACAGGGATTCAGTGGCTGAGGAGGCAGCCGACTTGGATGGAGAAATTGACTTGTCCGCATGTTACGATGTCACAGAGTAT
CCAGTTCAGAGAAACTATGGCTTCCAGATACATACAAAGGAGGGCGAGTTTACCCTGTCGGCCATGACATCTGGGATTCGGCGGAACTGG
ATCCAGACCATCATGAAGCACGTGCACCCGACCACTGCCCCGGATGTGACCAGCTCGTTGCCAGAGGAAAAAAACAAGAGCAGCTGCTCT
TTTGAGACCTGCCCGAGGCCTACTGAGAAGCAAGAGGCAGAGCTGGGGGAGCCGGACCCTGAGCAGAAGAGGAGCCGCGCACGGGAGCGG
AGGCGAGAGGGCCGCTCCAAGACCTTTGACTGGGCTGAGTTCCGTCCCATCCAGCAGGCCCTGGCTCAGGAGCGGGTGGGCGGCGTGGGG
CCTGCTGACACCCACGAGCCCCTGCGCCCTGAGGCGGAGCCTGGGGAGCTGGAGCGGGAGCGTGCACGGAGGCGGGAGGAGCGCCGCAAG
CGCTTCGGGATGCTCGACGCCACAGACGGGCCAGGCACTGAGGATGCAGCCCTGCGCATGGAGGTGGACCGGAGCCCAGGGCTGCCTATG
AGCGACCTCAAAACGCATAACGTCCACGTGGAGATTGAGCAGCGGTGGCATCAGGTGGAGACCACACCTCTCCGGGAAGAGAAGCAGGTG
CCCATCGCCCCCGTCCACCTGTCTTCTGAAGATGGGGGTGACCGGCTCTCCACACACGAGCTGACCTCTCTGCTCGAGAAGGAGCTGGAG
CAGAGCCAGAAGGAGGCCTCAGACCTTCTGGAGCAGAACCGGCTCCTGCAGGACCAGCTGAGGGTGGCCCTGGGCCGGGAGCAGAGCGCC
CGTGAGGGCTACGTGCTGCAGGCCACGTGCGAGCGAGGGTTTGCAGCAATGGAAGAAACGCACCAGAAGAAGATTGAAGATCTCCAGAGG
CAGCACCAGCGGGAGCTAGAGAAACTTCGAGAAGAGAAAGACCGCCTCCTAGCCGAGGAGACAGCGGCCACCATCTCAGCCATCGAAGCC
ATGAAGAACGCCCACCGGGAGGAAATGGAGCGGGAGCTGGAGAAGAGCCAGCGGTCCCAGATCAGCAGCGTCAACTCGGATGTTGAGGCC
CTGCGGCGCCAGTACCTGGAGGAGCTGCAGTCGGTGCAGCGGGAACTGGAGGTCCTCTCGGAGCAGTACTCGCAGAAGTGCCTGGAGAAT
GCCCATCTGGCCCAGGCGCTGGAGGCCGAGCGGCAGGCCCTGCGGCAGTGCCAGCGTGAGAACCAGGAGCTCAATGCCCACAACCAGGAG
CTGAACAACCGCCTGGCTGCAGAGATCACACGGTTGCGGACGCTGCTGACTGGGGACGGCGGTGGGGAGGCCACTGGGTCACCCCTTGCA
CAGGGCAAGGATGCCTATGAACTAGAGGTCTTATTGCGGGTAAAGGAATCGGAAATACAGTACCTGAAACAGGAGATTAGCTCCCTCAAG
GATGAGCTGCAGACGGCACTGCGGGACAAGAAGTACGCAAGTGACAAGTACAAAGACATCTACACAGAGCTCAGCATCGCGAAGGCTAAG
GCTGACTGTGACATCAGCAGGTTGAAGGAGCAGCTCAAGGCTGCAACGGAAGCACTGGGGGAGAAGTCCCCTGACAGTGCCACGGTGTCC
GGATATGATATAATGAAATCTAAAAGCAACCCTGACTTCTTGAAGAAAGACAGATCCTGTGTCACCCGGCAACTCAGAAACATCAGGTCC

>3980_3980_6_ALG10B-MPRIP_ALG10B_chr12_38712260_ENST00000551464_MPRIP_chr17_16979024_ENST00000395804_length(amino acids)=1107AA_BP=123
MAQLEGYCFSAALSCTFLVSCLLFSAFSRALREPYMDEIFHLPQAQRYCEGHFSLSQWDPMITTLPGLYLVSVGVVKPAIWIFAWSEHVV
CSIGMLRFVNLLFSVGNFYLLYLLFHKVQPRNKAKPIYGGWLLLAPDGTDFDNPVHRSRKWQRRFFILYEHGLLRYALDEMPTTLPQGTI
NMNQCTDVVDGEGRTGQKFSLCILTPEKEHFIRAETKEIVSGWLEMLMVYPRTNKQNQKKKRKVEPPTPQEPGPAKVAVTSSSSSSSSSS
SIPSAEKVPTTKSTLWQEEMRTKDQPDGSSLSPAQSPSQSQPPAASSLREPGLESKEEESAMSSDRMDCGRKVRVESGYFSLEKTKQDLK
AEEQQLPPPLSPPSPSTPNHRRSQVIEKFEALDIEKAEHMETNAVGPSPSSDTRQGRSEKRAFPRKRDFTNEAPPAPLPDASASPLSPHR
RAKSLDRRSTEPSVTPDLLNFKKGWLTKQYEDGQWKKHWFVLADQSLRYYRDSVAEEAADLDGEIDLSACYDVTEYPVQRNYGFQIHTKE
GEFTLSAMTSGIRRNWIQTIMKHVHPTTAPDVTSSLPEEKNKSSCSFETCPRPTEKQEAELGEPDPEQKRSRARERRREGRSKTFDWAEF
RPIQQALAQERVGGVGPADTHEPLRPEAEPGELERERARRREERRKRFGMLDATDGPGTEDAALRMEVDRSPGLPMSDLKTHNVHVEIEQ
RWHQVETTPLREEKQVPIAPVHLSSEDGGDRLSTHELTSLLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQATCERGF
AAMEETHQKKIEDLQRQHQRELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSVNSDVEALRRQYLEELQSVQR
ELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLTGDGGGEATGSPLAQGKDAYELEVLLRV
KESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGEKSPDSATVSGYDIMKSKSNPDFL

--------------------------------------------------------------
>3980_3980_7_ALG10B-MPRIP_ALG10B_chr12_38712260_ENST00000551464_MPRIP_chr17_16979024_ENST00000395811_length(transcript)=11249nt_BP=501nt
GCCTTCCGGTATGTGGCCCCGTCTGGCTAGTCCTGTCTAGCGCGCCCATTTCGAGCCCAAGTTTCCAGCTCGGGTTTCCGGGCTCAGAAT
TTTCCAGGAGTGGGTTCTTGGGCAGTGGCTGTGGGAGCAGGAATGGCGCAGCTAGAGGGTTACTGTTTCTCGGCCGCCTTGAGCTGTACC
TTTTTAGTGTCCTGCCTCCTCTTCTCCGCCTTCAGCCGGGCGCTGCGAGAGCCCTACATGGACGAGATCTTCCACCTGCCTCAGGCGCAG
CGCTACTGTGAGGGCCATTTCTCCCTTTCCCAGTGGGATCCCATGATTACTACATTACCTGGCTTGTACCTGGTGTCAGTTGGAGTGGTC
AAACCTGCCATTTGGATCTTTGCATGGTCTGAACATGTTGTCTGCTCCATTGGGATGCTCAGATTTGTTAATCTTCTCTTCAGTGTTGGC
AACTTCTATTTACTATATTTGCTTTTCCACAAGGTACAACCCAGAAACAAGGCAAAACCCATTTATGGCGGTTGGCTGCTCCTGGCTCCA
GATGGGACCGACTTTGACAACCCAGTGCACCGGTCTCGGAAATGGCAGCGACGGTTCTTCATCCTTTACGAGCACGGCCTCTTGCGCTAC
GCCCTGGATGAGATGCCCACGACCCTTCCTCAGGGCACCATCAACATGAACCAGTGCACAGATGTGGTGGATGGGGAGGGCCGCACGGGC
CAGAAGTTCTCCCTGTGTATTCTGACGCCTGAGAAGGAGCATTTCATCCGGGCGGAGACCAAGGAGATCGTCAGTGGGTGGCTGGAGATG
CTCATGGTCTATCCCCGGACCAACAAGCAGAATCAGAAGAAGAAACGGAAAGTGGAGCCCCCCACACCACAGGAGCCTGGGCCTGCCAAG
GTGGCTGTTACCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCATCCCCAGTGCTGAGAAAGTCCCCACCACCAAGTCCACACTCTGG
CAGGAAGAAATGAGGACCAAGGACCAGCCAGATGGCAGCAGCCTGAGTCCAGCTCAGAGTCCCAGCCAGAGCCAGCCTCCTGCTGCCAGC
TCCCTGCGGGAACCTGGGCTAGAGAGCAAAGAAGAGGAGAGCGCCATGAGTAGCGACCGCATGGACTGTGGCCGCAAAGTCCGGGTGGAG
AGCGGCTACTTCTCTCTGGAGAAGACCAAACAGGACTTGAAGGCTGAAGAACAGCAGCTGCCCCCGCCGCTCTCCCCTCCCAGCCCCAGC
ACCCCCAACCACAGGAGGTCCCAGGTGATTGAAAAGTTTGAGGCCTTGGACATTGAGAAGGCAGAGCACATGGAGACCAATGCAGTGGGG
CCCTCACCATCCAGCGACACACGCCAGGGCCGCAGCGAGAAGAGGGCGTTCCCTAGGAAGCGGGACTTCACCAATGAAGCCCCCCCAGCT
CCTCTCCCAGACGCCTCGGCTTCCCCCCTGTCTCCACACCGAAGAGCCAAGTCACTGGACAGGAGGTCCACGGAGCCCTCCGTGACGCCC
GACCTGCTGAATTTCAAGAAAGGCTGGCTGACTAAGCAGTATGAGGACGGCCAGTGGAAGAAACACTGGTTTGTCCTCGCCGATCAAAGC
CTGAGATACTACAGGGATTCAGTGGCTGAGGAGGCAGCCGACTTGGATGGAGAAATTGACTTGTCCGCATGTTACGATGTCACAGAGTAT
CCAGTTCAGAGAAACTATGGCTTCCAGATACATACAAAGGAGGGCGAGTTTACCCTGTCGGCCATGACATCTGGGATTCGGCGGAACTGG
ATCCAGACCATCATGAAGCACGTGCACCCGACCACTGCCCCGGATGTGACCAGCTCGTTGCCAGAGGAAAAAAACAAGAGCAGCTGCTCT
TTTGAGACCTGCCCGAGGCCTACTGAGAAGCAAGAGGCAGAGCTGGGGGAGCCGGACCCTGAGCAGAAGAGGAGCCGCGCACGGGAGCGG
AGGCGAGAGGGCCGCTCCAAGACCTTTGACTGGGCTGAGTTCCGTCCCATCCAGCAGGCCCTGGCTCAGGAGCGGGTGGGCGGCGTGGGG
CCTGCTGACACCCACGAGCCCCTGCGCCCTGAGGCGGAGCCTGGGGAGCTGGAGCGGGAGCGTGCACGGAGGCGGGAGGAGCGCCGCAAG
CGCTTCGGGATGCTCGACGCCACAGACGGGCCAGGCACTGAGGATGCAGCCCTGCGCATGGAGGTGGACCGGAGCCCAGGGCTGCCTATG
AGCGACCTCAAAACGCATAACGTCCACGTGGAGATTGAGCAGCGGTGGCATCAGGTGGAGACCACACCTCTCCGGGAAGAGAAGCAGGTG
CCCATCGCCCCCGTCCACCTGTCTTCTGAAGATGGGGGTGACCGGCTCTCCACACACGAGCTGACCTCTCTGCTCGAGAAGGAGCTGGAG
CAGAGCCAGAAGGAGGCCTCAGACCTTCTGGAGCAGAACCGGCTCCTGCAGGACCAGCTGAGGGTGGCCCTGGGCCGGGAGCAGAGCGCC
CGTGAGGGCTACGTGCTGCAGGCCACGTGCGAGCGAGGGTTTGCAGCAATGGAAGAAACGCACCAGAAGAAGATTGAAGATCTCCAGAGG
CAGCACCAGCGGGAGCTAGAGAAACTTCGAGAAGAGAAAGACCGCCTCCTAGCCGAGGAGACAGCGGCCACCATCTCAGCCATCGAAGCC
ATGAAGAACGCCCACCGGGAGGAAATGGAGCGGGAGCTGGAGAAGAGCCAGCGGTCCCAGATCAGCAGCGTCAACTCGGATGTTGAGGCC
CTGCGGCGCCAGTACCTGGAGGAGCTGCAGTCGGTGCAGCGGGAACTGGAGGTCCTCTCGGAGCAGTACTCGCAGAAGTGCCTGGAGAAT
GCCCATCTGGCCCAGGCGCTGGAGGCCGAGCGGCAGGCCCTGCGGCAGTGCCAGCGTGAGAACCAGGAGCTCAATGCCCACAACCAGGAG
CTGAACAACCGCCTGGCTGCAGAGATCACACGGTTGCGGACGCTGCTGACTGGGGACGGCGGTGGGGAGGCCACTGGGTCACCCCTTGCA
CAGGGCAAGGATGCCTATGAACTAGAGGTCTTATTGCGGGTAAAGGAATCGGAAATACAGTACCTGAAACAGGAGATTAGCTCCCTCAAG
GATGAGCTGCAGACGGCACTGCGGGACAAGAAGTACGCAAGTGACAAGTACAAAGACATCTACACAGAGCTCAGCATCGCGAAGGCTAAG
GCTGACTGTGACATCAGCAGGTTGAAGGAGCAGCTCAAGGCTGCAACGGAAGCACTGGGGGAGAAGTCCCCTGACAGTGCCACGGTGTCC
GGATATGATATAATGAAATCTAAAAGCAACCCTGACTTCTTGAAGAAAGACAGATCCTGTGTCACCCGGCAACTCAGAAACATCAGGTCC
AAGAGTCTGAAGGAAGGCCTGACGGTGCAAGAACGGTTGAAGCTCTTTGAATCCAGGGACTTGAAGAAAGACTAGGTGTGTCCCATCCAA
GTTGAGCACGCGCCTTCCCCAGCTTGCAGCAGCACACCCCAAGCGCTGCTTTTCACCTGTACCTTTGTTTTATTATTATTATTATTATTG
CTGTTGTTGTCATCGTTAACTGTGGGCATGGAATGCGTGAGGCTGGCTTCTGGGTTGTCCACACCACTCTCTGCTGTGTTGACTTCCTGT
TGTCTTCATCAAAGCTTTTTTCCGTGGTATTCTAAAATTAGGCCAGCAGTGGGGGCTGGGAGGGCATCTGTGTTAGTCCTTTCCTGGCTG
TGACCCGCCACACTCACTGTCAGTATTAAGGCCCAGCAGCCTGTTGATAAGCTACCCTGTCTCACCATGTGCTGGTGTGGAAACGGGGCC
CAGCCAGCACGCCTCAAGGTAGATGGAATCCCCACTGGTCAGAGAAAAAGCTATGCGGACACTCCAGCTTGGCCTGGGTCACAGCACTGA
CTCCTCACCCGCTAGTCTGGCTGTTAAGAGGAGAAAGTGCACTGCCTTCCAGCCCAGGAGGAGGACAGCATTTTGTATTTGTTCCACTGA
TGCAGCTTAGAACCACACCCCTGAGAGTCGTGGCAAACCTTTCACAACCTGGAAAATGTTGAAAGCAACCATTCCTATTTTTGTTTGTTT
TTTATTAAATCTTGCACAAAATCCCCGGCCCCTCTCCTTCCTTCCTTCCTTCCTTCCTCCGCTCGTTCCTTTCTTGGTCTCCAGTAACCC
TGGTCTTTTCATAACTGCTCGAGATTGTTGACCTGCAGCCCAGGTTTCAGACTCTGATTGCAAAAAACAAATGAATTCCCCCCAGGAATC
ATTCAAAATGGGGGAAGGTTTGGGGGTTTGGGTTTTTTTTTTACCTTTTGGAAAAGAAACCGTCACATTGCTTTGGAAAAGGTTGAGAGG
AGACCCCTGTTAAGTCAAGAAGAAAGTACAGAGGATGTCAGAATCTGATGAGAACAGCACATTAGTGTTTATTGAGACTCCGATCTTAAC
TCTCATTTAATTAATCTGAGCTCTGAAAACCTATCTTGCAGCATTTATCTTTAAAAGAGCCTGGTTAAAGTAAACCTATACTAACAATTT
TGCTTTTTCTAACAGTTTGAGGAAGACCTTTTTAACCACCACAAAACATTCTATGGCAATTCTTGAAAATCTCTTAAATTGGAGTCTATT
ATGGCCCCATGAAAACCATTAATCCCATTAAGATAGGGAGTATAAACCCCTGGCTGGTGGAACAGGTTCTGCTACTTTAGGAGCAAGGTG
GGGTGTGAGTAGATGGTTTTCATGCCAAGAACATGCTTTCACTTTGTATTCATGCTTGTGTTGGTGTGATGGTCTCTGTGGGTGGGTGGA
TGCTTTGGGCGTTGAAATCTAGAAATCCTGTTGCTCAGTTTCTAGATGAAGTCATGAGCAAGGCCATCAGTGGAGCTCTGGCCCCGCCCC
CAATGTGCAGAAGGGCCGGGAGCAAGGCCTGGAGTTTTCATGTGTTTTCAGACCCAGGTTTAGGTGCTCTCTTCTCACTGAAATAACTAA
GTGCTCTCCACTGGCATCGAGCCCTTTCCACAAGTTTTTAAGGCTCTTAACCCACACTTTCACTCCTCTGCTACTAGTCTTCAGTGTTGT
TAACAGCAAGAGAAAATTGGGTTTGTTTAAAAATCTACTTCTCTGAGGTGGCACAGTTGCGTAGCTGTAGTCCCAGCTACTCAGGAGGCT
GAGGTAGTAGGATTGCTTGAGCCCGGGAGGTCGAGGCTGCAATCAGTCATGATCGTGTCACTGCACTCCAGCCTGGGTGACAAAGCAAGG
CCCCATATCAGATATAGATATACTTATCAGACCCCCCCTGACCATTTAGATTGGCAGTGCTTTGAGAAATGCACTATGACCTTTCTGTGT
CAATGGGAATATACAGAAGGAACATTCGGGACCCCGCTGTCCCCCACAGCCTCATTGTTGTCTCCAGGACACTGCTGGGTCACACGAATG
CTCCAGGACAGACAGGGACCTGGAGTGCATCAGGATCTGACCAGATAGGAGTTTTTGCCTCGTGTCTGGGTGCTACGATTTTGTGCCGTT
CTCTGAGGTCCACCACCTGCCCTTCCTGGCATGGTTTCCTTCGTGACCATCCCTGCTGCCCCTGGGGGTGGACCCCACTGGCCCTTCTGC
AGACAGCTCCCTGCCTTCTGCCCTCCAGGGGGTTCTGGCCAGAGTCCATGCTTGGAGACAGGATCATCTGCCTTCAGCCCTCACAGTGCT
TTAAATTAAAGCAAGTTTGCCCATAGGACAAAAGAGCATTTGATTCCCTTTTTTCTGTCACATATCCCTTGAGGCTGGACTTCAGGAATC
CTGGAAAATTAATATGAGTGCAGCATGTGAGGGGTCAGAGACAGGCCAGCAGGGCGTCTGCATTCCTCCCTGCCACAGGTCTCTCCCCAG
AGGCTGGTTTAGTGTAGGGTATTGCCAGGAAACGGACTGAGGCTGCTTTGCTAAGAGCTCCTGAAAATGCCCTGGGCCTGTCCTGGCGTT
TCTGAAGAGCCCTCATACAGGGACAGCCACCATCTGGGTCAAGGAAGTCTGGGTTCCCTGCTGGTGGGCTCCATCCTGCGATGGAGTGAA
CCAGGCGAGAAAGGATGACGATGTTCTTCATGTTGCACCTGGACATGCCCCAGGAACAGAGACTTGCCCAGGTGGCAACACTGGCACAGA
TGTTGACGGCTGCCCAACTGGTGCCACACTGAGCAGGGAGCCTTGTGCTGCACAGGGCTGGGCCCTCTCTCCAGTTTCCTTCCTGCAGGC
ATCCAAATACCCTGGAAGGGATTTAACCCCTGAATTCCAGAGGGAAGAAAGAAGAACAGTGAAGAAGTAGAACTGGTTTCTGTATGGGGA
GAGGAAAGTCTTAGGGACAGCTGCAGGCGGGGTCTCAGGCTGCTCCTTGGCACCAGCTACACAGTAGTGAGCTTTCCCAGCTTTACCGAT
GAGGAAGAAGTTCAAATAGATAGACTTCAGCATTTTAATTATTTTCCTATAAATGTATTTATGTGTAGTATGCTAGCACCAGCCAGTAAG
CTGTGCCACACATATGAATGGGAAAGCGAGGCAGTTGTGCTCGTGTGAGTTTCTGCAGGCTTGTGGGTAATTACCTTGTGTGCACGCCTG
CACGTGCAGAATAGTCACTTTCTGCTGGTCAGTTTCTTTATCCACCCATGGTGCCCCAGCCCCAGGCAGGTGTGGAGACCAGCATTTCAG
AGGACGCGCTGTCCACAGCCTCCCGGGTCTGAGTGGATCATTGGGCAGGGGTGGAGACAGTGCGCTGCCCTCTGAGCTGGAAGCCTGTGC
TTCAGGGAGTCATAATGGGCCTGTGCTAAGTGGGTGATGCAGTGGACATCCCAGGGCGACTAGAGGTGGCAGTATCGCGAATTTGCAGGT
TTATTGAACAAGAGGTAACATCGGAGAGGATCTTGCCTTCGGATTCAGCAAGTATGAAGGCAGAAGAGCATGGAGAGCAAGGCCCCACAG
CCTGCTTAGTGAGTTGGAAGGCCCAGCAAGAACCTGTTTCTGCAGCAGCCACCAGCTCCCATCACCCCTTGACCCTCCAGCTCATGCTGG
AGAAGAGGGAATTTTGGCTGTTTAAAGAACACAGTTGTGAATCTCAGAATGTGCCTGAAAGGAATACTGACAGATAAGGCCGGAAACAAA
ACTGATGGCTTGAAAAACATTTTTATGGAATGTATTTACTATCATTTTGTTTTACTATAGAGGTAGATGGGACTCTTAACTTTTGGGTAC
ATGGAAACATGCTGAAAACTGAACACAATCCTGATCATCACTCCTGCCTGGCTGTCTCCTGGGAGGCTGCCGGGTGCCACGGAGCTGGGA
CACAGCAGAGCCCGCTAGGTGTTGCAGGGCCCTGGAGGCCAAGGCCACCCTGTGTGGGGTCCCTGTTGGCAGCCAGGTCCCTACACAAAC
AAGTAATCCTGTTTGGCCTCCTAGGTTTTGCATATGACCTGCAGCCTAATTTGGGGTGTAGGGGAAGCTCTGCTGGCCCCTGCTCCTTTG
TATGTTGGGTGACTTTAATGGCTGGCCACATACCCCTTTCTCCCAGCTACTCATTCACTGACTTGGGTAAGTTCTAAGACAGTTCGCACT
TAGAAAAGAATGTGACACATCAACATTAACTTTTCCTGAAAAGAAGAGTTTGCCTAACATGGTCCTAAAGAAGCTTGGAATTTATAAGAC
TTTCCTTTATAAGATATAGTGGGGGTTTTTTTGGGTGGAGGGGGGTTGTTTTTTGTTTTTTGTTTTCAAGACAGAGTCTCGCTCTGTTGT
CCAGGCTGGAGTGTAGTGGCATGATCTCGGCTCACTGCAACCTCTGCCTCCCAGGTTCATGCCATTCTCCTGCCTCAGCCTCCCGAGTAG
CTGGGACTACAGGTGTCTGCCGCCACGCCTGGCTAATTTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGTCAGGATGGTCT
CGATTTCCTGACCTCGTGATCCGCCTGTCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCACGCCTGGCCTATAAGATACG
GTAAAAAAAAAAAACTGTGACCCCTTTGTCACTAAGGGAGAAAGAAATTAAGTATTGTCAAAGTTCTATAAAGAATGGAAATGTATGATA
TTATACTTCAAAGGAATTTGATGTTGAAATTTTAAAGAAAATTTGTCATGTTGATGAGAAGCTTCACTTTCCTGGGAACTTCATTCGTTT
TAGGGCATGAGATAAAAGTCCTGGCTAGGGGAGCCATAGGTCTGTTGTACAAGGAATTTGCTTTCTAAACAAGTTGTAACTTGCCCTAAG
GTCCCTGTTGGAGCACTAAGAGGTGACACAGGCCAGAGACAACGTTTCGTTTCCCCTTCCCTGCAAGCTGGGATCAGCCCTGTGTTTTCT
CCTTTCAGCTGAAGTGAGCGAAGGTTCTCAGTGCTGGCAAAAGAGCCCACTTTCTAAAAGGACTTGGGAAGAAAGCTGCTGGGAACTTGC
TTATTAAAAAGTTCCTTAGAATTAAGGTATCTACCCACTGTTTTCGCACCTTTCACCTTCCTGGGCTTTCCTGCCCTCCAGCATTCTTCT
CTAGAGAGGTTCCTAGCCCGCTCAGCGCGAGCGTCTCCAGTAGGTAATAGCAGCTGAACGTGGGTTTTCCACGGACTTCAGGCTTGGAGG
TGCCATATACAAGCACACTTCTTCCTTCCCCTGGCTTCTCCATGCCACCACCCACTTTAAAGATGTAAACTCAGTAGATTTTTCATCCAG
TGAACGGTCATCTTCACATCGAAAGGTGAAGGCCACCACTGTTCTCAATGCCAAGCAACAGAACGTTCTGAGATGGCCGTTCTTCCTTGC
ACAGCAGCTACGGCAGGTTGTTCTGCAGCCACCCCTTAGAGGGGGCTCTTCGTTTTACCTTTGTACAGTTCTTGTGTTTACACATTTGGG
CCAAACAGCTTTCAGCAAGGGCATGTGTCCACAGCTGATGGGCAGTTAAGAACCAGCCTGAGCTGAAGGCTAGTAATACCGTGCTGTAGG
CTCTTTAAAAGGAAAGCCTGGCATAAACCCAGCATGGAAAGGAACATTATCAGTTATCTCAAATTTTGTCTGCCAGGGACAAGACCCTGT
TCATTCTTTTGCCCTTTTCAGAACTGTGAGCTTCAAGTATTCTTGCTTCTCTGTAAAGGGAAGACATCTCCCTTCTCTGAAATCCTTCAA
CAAAAGAAAAGGCTCTTGGCAGGGTAGGGGAGTCAGTAGCTCAACACTAGATCATCCCTAGAGATGGGGCAAGTTTCTGTCTGAACACGT
CTTGGGTCCGAGTCCTTAGGTGTTCGGATGCAGTACTTTGTGAATACTTAAGCTACTGCATGCTTGGTGTAGCTTGCAATTTCTCTGTAT
TTAAAAGCAGCTGTGTTTATTTTCTTCAAAATAACCTGTATATTATTTAGAGCAAGCAATGTAAATATTACTGAGAAGTTACTGCAGGGA
TTTTTGTGACAGAGTTTGTATGGGTTTTTAAAAAAATCTTAGACACCCCTTTTTAAGATGGGGAGAACAGGGTTGACTGCACCGTTGAAG
CCCGCCCAGCATTATAAGGAAATGTTTTTAATGACTGCTGCATCTTTGTAAAACGTTTGGTCATCTAACAGATGGTTTTAAAGTGTACAA
TATCCAAAATAACGATAGCCCTGTATCCATACATTGTTTCATTGAAAGAATTCTCTATTGCCTCTTCTTGGTAGAGCCAGAGTCCTTAAG
GAAAATCAGGAAAATTAAGAAAATGATGGTGCCATCTTGACCAGACTTCTGCACAGTAATTTAACGCTATCCTAGGGAGACTTGGTTGAA
GGCACAGTTCTGGGATCAGGGTCTAAATGTGCAGTTTCTGAGAACCTTCAAGACCACTCACTGGGCAGGGCTCTGTGGAGCACTGGAGCT
GTTTGGATTCCCCAGCCCTTTGGTCATATCCTGGAATTCCGTGGAGGCTGCAGAACTTAGATGCAGCTGTTTTTTACAGCACCTATTTTT
GTCAGATTGGTAAGGAAACACTGAGTCACAGAATACTTAAGAATTGGAGACTCCAGTAATGTAGGATGGCCTGAGAGGACGTCCAAGTCC
CAAGGGGTGGACACGGCATGTTCCTCGGGCACAGCCTCAGTGGGGGCCTTCCCCAGGCGCAGCTCGGCCACCTGAGGAAAGGGTGTTTCG
GAGGCGCAGCCACACACACAGCGCTGGCAGCCTCACGGTCACGCCCATCACTCCCTGCCCCCCACTGCCCTTGAGAAGTTAGTGGTGTCA
CATCCTTAGTTTTATAGACAGCTAGGAATAGATTGTGAAGAACACTCAGTTCACTACTGTGTTACATTTATATCACAAGCTTCAATTAAA
ATGGATTTTAAAGGATTTTAGGATTTACCTTTAGTATTAACAACGTATCTACTGACATACTGTTAGGATTCAAAACCAGTTAAGTATAAG
AATTACTTCATGTGGTTTTCCTAGGGTACAATTTATAAAAGGTAGAAAGCATCCAAGTGGCTCCTCAACAATTACAATTCTTAATGATTT
TTCTCACAGCTGTGCCCTTCTGTCAGGGTCAGTGTCAAAATTCGTTATCAAAGGCAAAACCTACTGTGCCAAGCTGGGGCGCTATATGTG
AACGGAGTGGAAATGCTTCAGTCACCTCTGCCGCAGCTTGTGATTCCAGCAGTTCTCACAAACGTTCTGTCACATGATGAAAAGAAGCAG
CTTGTATAATTCCAACTGGTGTTTCATTTCTGTTCTAATGCTAAGTGGTAACGCTTAACAAACAGACTAAAAGCTGTGTGCAGAAGAAAG
GGCTGAATGAGTACCGCCTCCCTAGGTTCCAGCACAGCGCTCGGGTCTAAGAAGTAGAGCCCCGGGGTAGGGTGGGCCATCCACTGTCAG
GCCAGTGTCTCAAGAAAGCCTGACCAGCTGAGCTGCTGCTTTTTTTTTGGGGGGGGGGGGGGGAGGGGCGTCTTGAGGCTTTTTTTTTTT
TTACAAAGTTAGTTTGTGATCAACGATTCACTACAATTGAAGTGTTACTTTGTCAGAATATTTATTCCTTTGTGTGACATGCTAGATTCC

>3980_3980_7_ALG10B-MPRIP_ALG10B_chr12_38712260_ENST00000551464_MPRIP_chr17_16979024_ENST00000395811_length(amino acids)=1120AA_BP=123
MAQLEGYCFSAALSCTFLVSCLLFSAFSRALREPYMDEIFHLPQAQRYCEGHFSLSQWDPMITTLPGLYLVSVGVVKPAIWIFAWSEHVV
CSIGMLRFVNLLFSVGNFYLLYLLFHKVQPRNKAKPIYGGWLLLAPDGTDFDNPVHRSRKWQRRFFILYEHGLLRYALDEMPTTLPQGTI
NMNQCTDVVDGEGRTGQKFSLCILTPEKEHFIRAETKEIVSGWLEMLMVYPRTNKQNQKKKRKVEPPTPQEPGPAKVAVTSSSSSSSSSS
SIPSAEKVPTTKSTLWQEEMRTKDQPDGSSLSPAQSPSQSQPPAASSLREPGLESKEEESAMSSDRMDCGRKVRVESGYFSLEKTKQDLK
AEEQQLPPPLSPPSPSTPNHRRSQVIEKFEALDIEKAEHMETNAVGPSPSSDTRQGRSEKRAFPRKRDFTNEAPPAPLPDASASPLSPHR
RAKSLDRRSTEPSVTPDLLNFKKGWLTKQYEDGQWKKHWFVLADQSLRYYRDSVAEEAADLDGEIDLSACYDVTEYPVQRNYGFQIHTKE
GEFTLSAMTSGIRRNWIQTIMKHVHPTTAPDVTSSLPEEKNKSSCSFETCPRPTEKQEAELGEPDPEQKRSRARERRREGRSKTFDWAEF
RPIQQALAQERVGGVGPADTHEPLRPEAEPGELERERARRREERRKRFGMLDATDGPGTEDAALRMEVDRSPGLPMSDLKTHNVHVEIEQ
RWHQVETTPLREEKQVPIAPVHLSSEDGGDRLSTHELTSLLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQATCERGF
AAMEETHQKKIEDLQRQHQRELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSVNSDVEALRRQYLEELQSVQR
ELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLTGDGGGEATGSPLAQGKDAYELEVLLRV
KESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGEKSPDSATVSGYDIMKSKSNPDFL

--------------------------------------------------------------
>3980_3980_8_ALG10B-MPRIP_ALG10B_chr12_38712260_ENST00000551464_MPRIP_chr17_16979024_ENST00000444976_length(transcript)=3542nt_BP=501nt
GCCTTCCGGTATGTGGCCCCGTCTGGCTAGTCCTGTCTAGCGCGCCCATTTCGAGCCCAAGTTTCCAGCTCGGGTTTCCGGGCTCAGAAT
TTTCCAGGAGTGGGTTCTTGGGCAGTGGCTGTGGGAGCAGGAATGGCGCAGCTAGAGGGTTACTGTTTCTCGGCCGCCTTGAGCTGTACC
TTTTTAGTGTCCTGCCTCCTCTTCTCCGCCTTCAGCCGGGCGCTGCGAGAGCCCTACATGGACGAGATCTTCCACCTGCCTCAGGCGCAG
CGCTACTGTGAGGGCCATTTCTCCCTTTCCCAGTGGGATCCCATGATTACTACATTACCTGGCTTGTACCTGGTGTCAGTTGGAGTGGTC
AAACCTGCCATTTGGATCTTTGCATGGTCTGAACATGTTGTCTGCTCCATTGGGATGCTCAGATTTGTTAATCTTCTCTTCAGTGTTGGC
AACTTCTATTTACTATATTTGCTTTTCCACAAGGTACAACCCAGAAACAAGGCAAAACCCATTTATGGCGGTTGGCTGCTCCTGGCTCCA
GATGGGACCGACTTTGACAACCCAGTGCACCGGTCTCGGAAATGGCAGCGACGGTTCTTCATCCTTTACGAGCACGGCCTCTTGCGCTAC
GCCCTGGATGAGATGCCCACGACCCTTCCTCAGGGCACCATCAACATGAACCAGTGCACAGATGTGGTGGATGGGGAGGGCCGCACGGGC
CAGAAGTTCTCCCTGTGTATTCTGACGCCTGAGAAGGAGCATTTCATCCGGGCGGAGACCAAGGAGATCGTCAGTGGGTGGCTGGAGATG
CTCATGGTCTATCCCCGGACCAACAAGCAGAATCAGAAGAAGAAACGGAAAGTGGAGCCCCCCACACCACAGGAGCCTGGGCCTGCCAAG
GTGGCTGTTACCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCATCCCCAGTGCTGAGAAAGTCCCCACCACCAAGTCCACACTCTGG
CAGGAAGAAATGAGGACCAAGGACCAGCCAGATGGCAGCAGCCTGAGTCCAGCTCAGAGTCCCAGCCAGAGCCAGCCTCCTGCTGCCAGC
TCCCTGCGGGAACCTGGGCTAGAGAGCAAAGAAGAGGAGAGCGCCATGAGTAGCGACCGCATGGACTGTGGCCGCAAAGTCCGGGTGGAG
AGCGGCTACTTCTCTCTGGAGAAGACCAAACAGGACTTGAAGGCTGAAGAACAGCAGCTGCCCCCGCCGCTCTCCCCTCCCAGCCCCAGC
ACCCCCAACCACAGGAGGTCCCAGGTGATTGAAAAGTTTGAGGCCTTGGACATTGAGAAGGCAGAGCACATGGAGACCAATGCAGTGGGG
CCCTCACCATCCAGCGACACACGCCAGGGCCGCAGCGAGAAGAGGGCGTTCCCTAGGAAGCGGCCCGACCTGCTGAATTTCAAGAAAGGC
TGGCTGACTAAGCAGTATGAGGACGGCCAGTGGAAGAAACACTGGTTTGTCCTCGCCGATCAAAGCCTGAGATACTACAGGGATTCAGTG
GCTGAGGAGGCAGCCGACTTGGATGGAGAAATTGACTTGTCCGCATGTTACGATGTCACAGAGTATCCAGTTCAGAGAAACTATGGCTTC
CAGATACATACAAAGGAGGGCGAGTTTACCCTGTCGGCCATGACATCTGGGATTCGGCGGAACTGGATCCAGACCATCATGAAGCACGTG
CACCCGACCACTGCCCCGGATGTGACCAGCTCGTTGCCAGAGGAAAAAAACAAGAGCAGCTGCTCTTTTGAGACCTGCCCGAGGCCTACT
GAGAAGCAAGAGGCAGAGCTGGGGGAGCCGGACCCTGAGCAGAAGAGGAGCCGCGCACGGGAGCGGAGGCGAGAGGGCCGCTCCAAGACC
TTTGACTGGGCTGAGTTCCGTCCCATCCAGCAGGCCCTGGCTCAGGAGCGGGTGGGCGGCGTGGGGCCTGCTGACACCCACGAGCCCCTG
CGCCCTGAGGCGGAGCCTGGGGAGCTGGAGCGGGAGCGTGCACGGAGGCGGGAGGAGCGCCGCAAGCGCTTCGGGATGCTCGACGCCACA
GACGGGCCAGGCACTGAGGATGCAGCCCTGCGCATGGAGGTGGACCGGAGCCCAGGGCTGCCTATGAGCGACCTCAAAACGCATAACGTC
CACGTGGAGATTGAGCAGCGGTGGCATCAGGTGGAGACCACACCTCTCCGGGAAGAGAAGCAGGTGCCCATCGCCCCCGTCCACCTGTCT
TCTGAAGATGGGGGTGACCGGCTCTCCACACACGAGCTGACCTCTCTGCTCGAGAAGGAGCTGGAGCAGAGCCAGAAGGAGGCCTCAGAC
CTTCTGGAGCAGAACCGGCTCCTGCAGGACCAGCTGAGGGTGGCCCTGGGCCGGGAGCAGAGCGCCCGTGAGGGCTACGTGCTGCAGGCC
ACGTGCGAGCGAGGGTTTGCAGCAATGGAAGAAACGCACCAGAAGAAGATTGAAGATCTCCAGAGGCAGCACCAGCGGGAGCTAGAGAAA
CTTCGAGAAGAGAAAGACCGCCTCCTAGCCGAGGAGACAGCGGCCACCATCTCAGCCATCGAAGCCATGAAGAACGCCCACCGGGAGGAA
ATGGAGCGGGAGCTGGAGAAGAGCCAGCGGTCCCAGATCAGCAGCGTCAACTCGGATGTTGAGGCCCTGCGGCGCCAGTACCTGGAGGAG
CTGCAGTCGGTGCAGCGGGAACTGGAGGTCCTCTCGGAGCAGTACTCGCAGAAGTGCCTGGAGAATGCCCATCTGGCCCAGGCGCTGGAG
GCCGAGCGGCAGGCCCTGCGGCAGTGCCAGCGTGAGAACCAGGAGCTCAATGCCCACAACCAGGAGCTGAACAACCGCCTGGCTGCAGAG
ATCACACGGTTGCGGACGCTGCTGACTGGGGACGGCGGTGGGGAGGCCACTGGGTCACCCCTTGCACAGGGCAAGGATGCCTATGAACTA
GAGGTCTTATTGCGGGTAAAGGAATCGGAAATACAGTACCTGAAACAGGAGATTAGCTCCCTCAAGGATGAGCTGCAGACGGCACTGCGG
GACAAGAAGTACGCAAGTGACAAGTACAAAGACATCTACACAGAGCTCAGCATCGCGAAGGCTAAGGCTGACTGTGACATCAGCAGGTTG
AAGGAGCAGCTCAAGGCTGCAACGGAAGCACTGGGGGAGAAGTCCCCTGACAGTGCCACGGTGTCCGGATATGATATAATGAAATCTAAA
AGCAACCCTGACTTCTTGAAGAAAGACAGATCCTGTGTCACCCGGCAACTCAGAAACATCAGGTCCAAGAGTCTGAAGGAAGGCCTGACG
GTGCAAGAACGGTTGAAGCTCTTTGAATCCAGGGACTTGAAGAAAGACTAGGTGTGTCCCATCCAAGTTGAGCACGCGCCTTCCCCAGCT
TGCAGCAGCACACCCCAAGCGCTGCTTTTCACCTGTACCTTTGTTTTATTATTATTATTATTATTGCTGTTGTTGTCATCGTTAACTGTG

>3980_3980_8_ALG10B-MPRIP_ALG10B_chr12_38712260_ENST00000551464_MPRIP_chr17_16979024_ENST00000444976_length(amino acids)=1082AA_BP=123
MAQLEGYCFSAALSCTFLVSCLLFSAFSRALREPYMDEIFHLPQAQRYCEGHFSLSQWDPMITTLPGLYLVSVGVVKPAIWIFAWSEHVV
CSIGMLRFVNLLFSVGNFYLLYLLFHKVQPRNKAKPIYGGWLLLAPDGTDFDNPVHRSRKWQRRFFILYEHGLLRYALDEMPTTLPQGTI
NMNQCTDVVDGEGRTGQKFSLCILTPEKEHFIRAETKEIVSGWLEMLMVYPRTNKQNQKKKRKVEPPTPQEPGPAKVAVTSSSSSSSSSS
SIPSAEKVPTTKSTLWQEEMRTKDQPDGSSLSPAQSPSQSQPPAASSLREPGLESKEEESAMSSDRMDCGRKVRVESGYFSLEKTKQDLK
AEEQQLPPPLSPPSPSTPNHRRSQVIEKFEALDIEKAEHMETNAVGPSPSSDTRQGRSEKRAFPRKRPDLLNFKKGWLTKQYEDGQWKKH
WFVLADQSLRYYRDSVAEEAADLDGEIDLSACYDVTEYPVQRNYGFQIHTKEGEFTLSAMTSGIRRNWIQTIMKHVHPTTAPDVTSSLPE
EKNKSSCSFETCPRPTEKQEAELGEPDPEQKRSRARERRREGRSKTFDWAEFRPIQQALAQERVGGVGPADTHEPLRPEAEPGELERERA
RRREERRKRFGMLDATDGPGTEDAALRMEVDRSPGLPMSDLKTHNVHVEIEQRWHQVETTPLREEKQVPIAPVHLSSEDGGDRLSTHELT
SLLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQATCERGFAAMEETHQKKIEDLQRQHQRELEKLREEKDRLLAEETA
ATISAIEAMKNAHREEMERELEKSQRSQISSVNSDVEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQ
ELNAHNQELNNRLAAEITRLRTLLTGDGGGEATGSPLAQGKDAYELEVLLRVKESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIYT
ELSIAKAKADCDISRLKEQLKAATEALGEKSPDSATVSGYDIMKSKSNPDFLKKDRSCVTRQLRNIRSKSLKEGLTVQERLKLFESRDLK

--------------------------------------------------------------
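Each sequence record above begins with a header line that packs the fusion name, both breakpoints, both Ensembl transcript IDs, the sequence length, and the breakpoint offset into one underscore-delimited string. The following is a minimal sketch of how such a header could be split into named fields. The field layout, the regular expression, and the names `HEADER_RE` and `parse_header` are assumptions inferred from the headers shown on this page, not a documented FusionGDB2 format.

```python
import re

# Field layout inferred from the headers on this page (an assumption,
# not an official FusionGDB2 specification).
HEADER_RE = re.compile(
    r"^>(?P<record_id>\d+_\d+_\d+)_"
    r"(?P<fusion>[^_]+-[^_]+)_"
    r"(?P<hgene>[^_]+)_(?P<h_chr>chr[^_]+)_(?P<h_bp>\d+)_(?P<h_enst>ENST\d+)_"
    r"(?P<tgene>[^_]+)_(?P<t_chr>chr[^_]+)_(?P<t_bp>\d+)_(?P<t_enst>ENST\d+)_"
    r"length\((?P<kind>transcript|amino acids)\)=(?P<length>\d+)(?P<unit>nt|AA)_"
    r"BP=(?P<bp>\d+)(?:nt)?$"
)


def parse_header(line: str) -> dict:
    """Split one header line into a dict; numeric fields become ints."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised header: {line!r}")
    fields = match.groupdict()
    for key in ("h_bp", "t_bp", "length", "bp"):
        fields[key] = int(fields[key])
    return fields


# Header copied from the transcript record on this page.
example = (
    ">3980_3980_8_ALG10B-MPRIP_ALG10B_chr12_38712260_ENST00000551464_"
    "MPRIP_chr17_16979024_ENST00000444976_length(transcript)=3542nt_BP=501nt"
)
record = parse_header(example)
print(record["fusion"], record["h_enst"], record["t_enst"],
      record["length"], record["bp"])
```

The same pattern accepts the amino-acid headers, where the length carries an `AA` suffix and the `BP` value has no unit. Gene symbols that themselves contain an underscore would need a stricter pattern.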


Fusion Gene PPI Analysis for ALG10B-MPRIP


See the ChiPPI (Chimeric Protein-Protein Interactions) page for the chimeric PPIs of this fusion.


Protein-protein interactors of each wild-type fusion partner protein (BIOGRID-3.4.160)
Hgene | Hgene's interactors | Tgene | Tgene's interactors


- Retained PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci* | BPloci | TotalLen | Still interaction with
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000341712 | | 0 | 24 | 824_879 | 41.0 | 1231.0 | PPP1R12A
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000395804 | | 0 | 23 | 824_879 | 41.0 | 1026.0 | PPP1R12A
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000395811 | | 0 | 23 | 824_879 | 41.0 | 1039.0 | PPP1R12A
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000444976 | | 0 | 22 | 824_879 | 41.0 | 1001.0 | PPP1R12A
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000341712 | | 0 | 24 | 546_824 | 41.0 | 1231.0 | RHOA
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000395804 | | 0 | 23 | 546_824 | 41.0 | 1026.0 | RHOA
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000395811 | | 0 | 23 | 546_824 | 41.0 | 1039.0 | RHOA
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000444976 | | 0 | 22 | 546_824 | 41.0 | 1001.0 | RHOA


- Lost PPIs in in-frame fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci* | BPloci | TotalLen | Interaction lost with
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000341712 | | 0 | 24 | 2_383 | 41.0 | 1231.0 | F-actin
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000395804 | | 0 | 23 | 2_383 | 41.0 | 1026.0 | F-actin
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000395811 | | 0 | 23 | 2_383 | 41.0 | 1039.0 | F-actin
Tgene | MPRIP | chr12:38712260 | chr17:16979024 | ENST00000444976 | | 0 | 22 | 2_383 | 41.0 | 1001.0 | F-actin
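
The retained and lost entries listed for MPRIP are consistent with a simple positional rule: the 3' partner (Tgene) contributes the part of its protein downstream of the breakpoint, so an interaction region survives only if it lies entirely beyond the BPloci position. The sketch below applies that rule to the three MPRIP regions and the BPloci value of 41 taken from the tables. The rule itself, the mirrored handling of Hgene, and the names `Feature` and `is_retained` are assumptions made for illustration; this is not FusionGDB2's documented procedure.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    partner: str     # "Hgene" (5' partner) or "Tgene" (3' partner)
    start: int       # first residue of the region in the wild-type protein
    end: int         # last residue of the region in the wild-type protein
    interactor: str  # protein that binds this region


def is_retained(feature: Feature, bp_locus: int) -> bool:
    """Assumed rule: Hgene keeps residues up to the breakpoint and Tgene
    keeps residues from the breakpoint onward; a region counts as retained
    only if it lies entirely within the kept part."""
    if feature.partner == "Hgene":
        return feature.end <= bp_locus
    if feature.partner == "Tgene":
        return feature.start >= bp_locus
    raise ValueError(f"unknown partner: {feature.partner!r}")


# Regions and breakpoint position copied from the MPRIP rows on this page.
BP_LOCUS = 41
features = [
    Feature("Tgene", 824, 879, "PPP1R12A"),
    Feature("Tgene", 546, 824, "RHOA"),
    Feature("Tgene", 2, 383, "F-actin"),
]

retained = [f.interactor for f in features if is_retained(f, BP_LOCUS)]
lost = [f.interactor for f in features if not is_retained(f, BP_LOCUS)]
print("retained:", retained)
print("lost:", lost)
```

Under these assumptions the PPP1R12A and RHOA regions are classified as retained and the F-actin region, which spans the breakpoint, as lost, matching the two tables.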


- Retained PPIs, but lost function due to frame-shift fusion.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci* | BPloci | TotalLen | Interaction lost with



Related Drugs for ALG10B-MPRIP


Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
Partner | Gene | UniProtAcc | DrugBank ID | Drug name | Drug activity | Drug type | Drug status


Related Diseases for ALG10B-MPRIP


Diseases associated with fusion partners.
(DisGeNet 4.0)
Partner | Gene | Disease ID | Disease name | # pubmeds | Source
Tgene | | C0043094 | Weight Gain | 1 | CTD_human
Tgene | | C3495676 | Anorectal Malformations | 1 | GENOMICS_ENGLAND