Fusion Gene Studies
in Kim Lab

FusionBase FusionGDB FusionGDB2 FusionPDB FusionNeoAntigen FusionAI FusionNW FGviewer Publication Contact
FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:AP3D1-REXO1 (FusionGDB2 ID:HG8943TG57455)

Fusion Gene Summary for AP3D1-REXO1

check button Fusion gene summary
Fusion gene informationFusion gene name: AP3D1-REXO1
Fusion gene ID: hg8943tg57455
HgeneTgene
Gene symbol

AP3D1

REXO1

Gene ID

8943

57455

Gene nameadaptor related protein complex 3 subunit delta 1RNA exonuclease 1 homolog
SynonymsADTD|HPS10|hBLVRELOABP1|EloA-BP1|REX1|TCEB3BP1
Cytomap('AP3D1')('REXO1')

19p13.3

19p13.3

Type of geneprotein-codingprotein-coding
DescriptionAP-3 complex subunit delta-1AP-3 complex delta subunit, partial CDSadapter-related protein complex 3 subunit delta-1adaptor related protein complex 3 delta 1 subunitdelta adaptinsubunit of putative vesicle coat adaptor complex AP-3RNA exonuclease 1 homologREX1, RNA exonuclease 1 homologelongin-A-binding protein 1transcription elongation factor B polypeptide 3-binding protein 1
Modification date2020031320200313
UniProtAcc..
Ensembl transtripts involved in fusion geneENST00000345016, ENST00000350812, 
ENST00000355272, ENST00000356926, 
ENST00000590683, 
Fusion gene scores* DoF score16 X 16 X 10=25603 X 4 X 3=36
# samples 194
** MAII scorelog2(19/2560*10)=-3.75207248655641
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(4/36*10)=0.15200309344505
effective Gene in Pan-Cancer Fusion Genes (eGinPCFGs).
DoF>8 and MAII>0
Context

PubMed: AP3D1 [Title/Abstract] AND REXO1 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointAP3D1(2108685)-REXO1(1816812), # samples:1
AP3D1(2108686)-REXO1(1816812), # samples:1
Anticipated loss of major functional domain due to fusion event.AP3D1-REXO1 seems lost the major protein functional domain in Hgene partner, which is a CGC by not retaining the major functional domain in the partially deleted in-frame ORF.
AP3D1-REXO1 seems lost the major protein functional domain in Hgene partner, which is a CGC by not retaining the major functional domain in the partially deleted in-frame ORF.
AP3D1-REXO1 seems lost the major protein functional domain in Hgene partner, which is a essential gene by not retaining the major functional domain in the partially deleted in-frame ORF.
AP3D1-REXO1 seems lost the major protein functional domain in Hgene partner, which is a essential gene by not retaining the major functional domain in the partially deleted in-frame ORF.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID

check buttonFusion gene breakpoints across AP3D1 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure
check buttonFusion gene breakpoints across REXO1 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4LUADTCGA-49-4507-01AAP3D1chr19

2108686

-REXO1chr19

1816812

-
ChimerDB4LUADTCGA-49-4507AP3D1chr19

2108685

-REXO1chr19

1816812

-


Top

Fusion Gene ORF analysis for AP3D1-REXO1

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
5CDS-intronENST00000345016ENST00000587524AP3D1chr19

2108686

-REXO1chr19

1816812

-
5CDS-intronENST00000345016ENST00000587524AP3D1chr19

2108685

-REXO1chr19

1816812

-
5CDS-intronENST00000350812ENST00000587524AP3D1chr19

2108686

-REXO1chr19

1816812

-
5CDS-intronENST00000350812ENST00000587524AP3D1chr19

2108685

-REXO1chr19

1816812

-
5CDS-intronENST00000355272ENST00000587524AP3D1chr19

2108686

-REXO1chr19

1816812

-
5CDS-intronENST00000355272ENST00000587524AP3D1chr19

2108685

-REXO1chr19

1816812

-
5CDS-intronENST00000356926ENST00000587524AP3D1chr19

2108686

-REXO1chr19

1816812

-
5CDS-intronENST00000356926ENST00000587524AP3D1chr19

2108685

-REXO1chr19

1816812

-
In-frameENST00000345016ENST00000170168AP3D1chr19

2108686

-REXO1chr19

1816812

-
In-frameENST00000345016ENST00000170168AP3D1chr19

2108685

-REXO1chr19

1816812

-
In-frameENST00000350812ENST00000170168AP3D1chr19

2108686

-REXO1chr19

1816812

-
In-frameENST00000350812ENST00000170168AP3D1chr19

2108685

-REXO1chr19

1816812

-
In-frameENST00000355272ENST00000170168AP3D1chr19

2108686

-REXO1chr19

1816812

-
In-frameENST00000355272ENST00000170168AP3D1chr19

2108685

-REXO1chr19

1816812

-
In-frameENST00000356926ENST00000170168AP3D1chr19

2108686

-REXO1chr19

1816812

-
In-frameENST00000356926ENST00000170168AP3D1chr19

2108685

-REXO1chr19

1816812

-
intron-3CDSENST00000590683ENST00000170168AP3D1chr19

2108686

-REXO1chr19

1816812

-
intron-3CDSENST00000590683ENST00000170168AP3D1chr19

2108685

-REXO1chr19

1816812

-
intron-intronENST00000590683ENST00000587524AP3D1chr19

2108686

-REXO1chr19

1816812

-
intron-intronENST00000590683ENST00000587524AP3D1chr19

2108685

-REXO1chr19

1816812

-

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000355272AP3D1chr192108685-ENST00000170168REXO1chr191816812-504137599042231377
ENST00000356926AP3D1chr192108685-ENST00000170168REXO1chr191816812-4748346610639301274
ENST00000350812AP3D1chr192108685-ENST00000170168REXO1chr191816812-434730658935291146
ENST00000345016AP3D1chr192108685-ENST00000170168REXO1chr191816812-4880359811540621315
ENST00000355272AP3D1chr192108686-ENST00000170168REXO1chr191816812-504137599042231377
ENST00000356926AP3D1chr192108686-ENST00000170168REXO1chr191816812-4748346610639301274
ENST00000350812AP3D1chr192108686-ENST00000170168REXO1chr191816812-434730658935291146
ENST00000345016AP3D1chr192108686-ENST00000170168REXO1chr191816812-4880359811540621315

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000355272ENST00000170168AP3D1chr192108685-REXO1chr191816812-0.0045981520.99540186
ENST00000356926ENST00000170168AP3D1chr192108685-REXO1chr191816812-0.0094485910.99055135
ENST00000350812ENST00000170168AP3D1chr192108685-REXO1chr191816812-0.0067970870.9932029
ENST00000345016ENST00000170168AP3D1chr192108685-REXO1chr191816812-0.007187160.9928128
ENST00000355272ENST00000170168AP3D1chr192108686-REXO1chr191816812-0.0045981520.99540186
ENST00000356926ENST00000170168AP3D1chr192108686-REXO1chr191816812-0.0094485910.99055135
ENST00000350812ENST00000170168AP3D1chr192108686-REXO1chr191816812-0.0067970870.9932029
ENST00000345016ENST00000170168AP3D1chr192108686-REXO1chr191816812-0.007187160.9928128

Top

Fusion Genomic Features for AP3D1-REXO1


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

Top

Fusion Protein Features for AP3D1-REXO1


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr19:2108685/chr19:1816812)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
..
FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneAP3D1chr19:2108685chr19:1816812ENST00000345016-2930659_67911221154.0Coiled coilOntology_term=ECO:0000255
HgeneAP3D1chr19:2108685chr19:1816812ENST00000345016-2930725_75611221154.0Coiled coilOntology_term=ECO:0000255
HgeneAP3D1chr19:2108685chr19:1816812ENST00000345016-2930845_86911221154.0Coiled coilOntology_term=ECO:0000255
HgeneAP3D1chr19:2108685chr19:1816812ENST00000355272-3132659_67911841216.0Coiled coilOntology_term=ECO:0000255
HgeneAP3D1chr19:2108685chr19:1816812ENST00000355272-3132725_75611841216.0Coiled coilOntology_term=ECO:0000255
HgeneAP3D1chr19:2108685chr19:1816812ENST00000355272-3132845_86911841216.0Coiled coilOntology_term=ECO:0000255
HgeneAP3D1chr19:2108686chr19:1816812ENST00000345016-2930659_67911221154.0Coiled coilOntology_term=ECO:0000255
HgeneAP3D1chr19:2108686chr19:1816812ENST00000345016-2930725_75611221154.0Coiled coilOntology_term=ECO:0000255
HgeneAP3D1chr19:2108686chr19:1816812ENST00000345016-2930845_86911221154.0Coiled coilOntology_term=ECO:0000255
HgeneAP3D1chr19:2108686chr19:1816812ENST00000355272-3132659_67911841216.0Coiled coilOntology_term=ECO:0000255
HgeneAP3D1chr19:2108686chr19:1816812ENST00000355272-3132725_75611841216.0Coiled coilOntology_term=ECO:0000255
HgeneAP3D1chr19:2108686chr19:1816812ENST00000355272-3132845_86911841216.0Coiled coilOntology_term=ECO:0000255
HgeneAP3D1chr19:2108685chr19:1816812ENST00000345016-2930828_89311221154.0Compositional biasNote=Lys-rich
HgeneAP3D1chr19:2108685chr19:1816812ENST00000355272-3132828_89311841216.0Compositional biasNote=Lys-rich
HgeneAP3D1chr19:2108686chr19:1816812ENST00000345016-2930828_89311221154.0Compositional biasNote=Lys-rich
HgeneAP3D1chr19:2108686chr19:1816812ENST00000355272-3132828_89311841216.0Compositional biasNote=Lys-rich
HgeneAP3D1chr19:2108685chr19:1816812ENST00000345016-2930142_17911221154.0RepeatNote=HEAT 3
HgeneAP3D1chr19:2108685chr19:1816812ENST00000345016-2930180_21611221154.0RepeatNote=HEAT 4
HgeneAP3D1chr19:2108685chr19:1816812ENST00000345016-2930254_29211221154.0RepeatNote=HEAT 5
HgeneAP3D1chr19:2108685chr19:1816812ENST00000345016-2930299_33611221154.0RepeatNote=HEAT 6
HgeneAP3D1chr19:2108685chr19:1816812ENST00000345016-2930338_37311221154.0RepeatNote=HEAT 7
HgeneAP3D1chr19:2108685chr19:1816812ENST00000345016-293034_7111221154.0RepeatNote=HEAT 1
HgeneAP3D1chr19:2108685chr19:1816812ENST00000345016-2930375_40911221154.0RepeatNote=HEAT 8
HgeneAP3D1chr19:2108685chr19:1816812ENST00000345016-2930431_46811221154.0RepeatNote=HEAT 9
HgeneAP3D1chr19:2108685chr19:1816812ENST00000345016-2930497_53511221154.0RepeatNote=HEAT 10
HgeneAP3D1chr19:2108685chr19:1816812ENST00000345016-2930548_58511221154.0RepeatNote=HEAT 11
HgeneAP3D1chr19:2108685chr19:1816812ENST00000345016-293077_11411221154.0RepeatNote=HEAT 2
HgeneAP3D1chr19:2108685chr19:1816812ENST00000355272-3132142_17911841216.0RepeatNote=HEAT 3
HgeneAP3D1chr19:2108685chr19:1816812ENST00000355272-3132180_21611841216.0RepeatNote=HEAT 4
HgeneAP3D1chr19:2108685chr19:1816812ENST00000355272-3132254_29211841216.0RepeatNote=HEAT 5
HgeneAP3D1chr19:2108685chr19:1816812ENST00000355272-3132299_33611841216.0RepeatNote=HEAT 6
HgeneAP3D1chr19:2108685chr19:1816812ENST00000355272-3132338_37311841216.0RepeatNote=HEAT 7
HgeneAP3D1chr19:2108685chr19:1816812ENST00000355272-313234_7111841216.0RepeatNote=HEAT 1
HgeneAP3D1chr19:2108685chr19:1816812ENST00000355272-3132375_40911841216.0RepeatNote=HEAT 8
HgeneAP3D1chr19:2108685chr19:1816812ENST00000355272-3132431_46811841216.0RepeatNote=HEAT 9
HgeneAP3D1chr19:2108685chr19:1816812ENST00000355272-3132497_53511841216.0RepeatNote=HEAT 10
HgeneAP3D1chr19:2108685chr19:1816812ENST00000355272-3132548_58511841216.0RepeatNote=HEAT 11
HgeneAP3D1chr19:2108685chr19:1816812ENST00000355272-313277_11411841216.0RepeatNote=HEAT 2
HgeneAP3D1chr19:2108686chr19:1816812ENST00000345016-2930142_17911221154.0RepeatNote=HEAT 3
HgeneAP3D1chr19:2108686chr19:1816812ENST00000345016-2930180_21611221154.0RepeatNote=HEAT 4
HgeneAP3D1chr19:2108686chr19:1816812ENST00000345016-2930254_29211221154.0RepeatNote=HEAT 5
HgeneAP3D1chr19:2108686chr19:1816812ENST00000345016-2930299_33611221154.0RepeatNote=HEAT 6
HgeneAP3D1chr19:2108686chr19:1816812ENST00000345016-2930338_37311221154.0RepeatNote=HEAT 7
HgeneAP3D1chr19:2108686chr19:1816812ENST00000345016-293034_7111221154.0RepeatNote=HEAT 1
HgeneAP3D1chr19:2108686chr19:1816812ENST00000345016-2930375_40911221154.0RepeatNote=HEAT 8
HgeneAP3D1chr19:2108686chr19:1816812ENST00000345016-2930431_46811221154.0RepeatNote=HEAT 9
HgeneAP3D1chr19:2108686chr19:1816812ENST00000345016-2930497_53511221154.0RepeatNote=HEAT 10
HgeneAP3D1chr19:2108686chr19:1816812ENST00000345016-2930548_58511221154.0RepeatNote=HEAT 11
HgeneAP3D1chr19:2108686chr19:1816812ENST00000345016-293077_11411221154.0RepeatNote=HEAT 2
HgeneAP3D1chr19:2108686chr19:1816812ENST00000355272-3132142_17911841216.0RepeatNote=HEAT 3
HgeneAP3D1chr19:2108686chr19:1816812ENST00000355272-3132180_21611841216.0RepeatNote=HEAT 4
HgeneAP3D1chr19:2108686chr19:1816812ENST00000355272-3132254_29211841216.0RepeatNote=HEAT 5
HgeneAP3D1chr19:2108686chr19:1816812ENST00000355272-3132299_33611841216.0RepeatNote=HEAT 6
HgeneAP3D1chr19:2108686chr19:1816812ENST00000355272-3132338_37311841216.0RepeatNote=HEAT 7
HgeneAP3D1chr19:2108686chr19:1816812ENST00000355272-313234_7111841216.0RepeatNote=HEAT 1
HgeneAP3D1chr19:2108686chr19:1816812ENST00000355272-3132375_40911841216.0RepeatNote=HEAT 8
HgeneAP3D1chr19:2108686chr19:1816812ENST00000355272-3132431_46811841216.0RepeatNote=HEAT 9
HgeneAP3D1chr19:2108686chr19:1816812ENST00000355272-3132497_53511841216.0RepeatNote=HEAT 10
HgeneAP3D1chr19:2108686chr19:1816812ENST00000355272-3132548_58511841216.0RepeatNote=HEAT 11
HgeneAP3D1chr19:2108686chr19:1816812ENST00000355272-313277_11411841216.0RepeatNote=HEAT 2

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
TgeneREXO1chr19:2108685chr19:1816812ENST00000170168111686_11510671222.0Coiled coilOntology_term=ECO:0000255
TgeneREXO1chr19:2108686chr19:1816812ENST00000170168111686_11510671222.0Coiled coilOntology_term=ECO:0000255
TgeneREXO1chr19:2108685chr19:1816812ENST000001701681116188_19710671222.0Compositional biasNote=Poly-Gly
TgeneREXO1chr19:2108685chr19:1816812ENST000001701681116375_38010671222.0Compositional biasNote=Poly-Lys
TgeneREXO1chr19:2108685chr19:1816812ENST000001701681116537_59210671222.0Compositional biasNote=Ser-rich
TgeneREXO1chr19:2108686chr19:1816812ENST000001701681116188_19710671222.0Compositional biasNote=Poly-Gly
TgeneREXO1chr19:2108686chr19:1816812ENST000001701681116375_38010671222.0Compositional biasNote=Poly-Lys
TgeneREXO1chr19:2108686chr19:1816812ENST000001701681116537_59210671222.0Compositional biasNote=Ser-rich
TgeneREXO1chr19:2108685chr19:1816812ENST0000017016811161060_120910671222.0DomainNote=Exonuclease
TgeneREXO1chr19:2108686chr19:1816812ENST0000017016811161060_120910671222.0DomainNote=Exonuclease


Top

Fusion Gene Sequence for AP3D1-REXO1


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>5283_5283_1_AP3D1-REXO1_AP3D1_chr19_2108685_ENST00000345016_REXO1_chr19_1816812_ENST00000170168_length(transcript)=4880nt_BP=3598nt
GCCGCCATCTTGTCCGCCATTTGCAAGGGGCCCGGAGCGGGATCGCGGCACCTGCCGAGCGGGTCGCCGCCTCTGCCGCGGTCCTTGGAC
CCCGCCGCCGCCCTGGCCTGGGAGCTTGCCCCGCCGCAGCGGCCGGCAGCGCGGCGCTCCGCGGGCGGCAGGCACGGGCCCCGGGCCCCC
TCACGGCGCCCAGCCGCGGGCCTCCCGAGGCAAAAGCCCGTGGGCCGCCGCGATGGCCCTCAAGATGGTGAAGGGCAGCATCGACCGCAT
GTTCGACAAGAATCTGCAGGACTTGGTCCGCGGCATCCGTAACCACAAGGAGGACGAGGCAAAATACATATCTCAGTGCATTGATGAGAT
CAAGCAGGAGCTGAAGCAGGACAACATAGCGGTGAAGGCGAACGCGGTCTGCAAGCTGACGTATTTACAGATGTTGGGATACGACATCAG
CTGGGCCGCCTTCAACATCATAGAAGTGATGAGTGCCTCCAAGTTCACCTTCAAGCGAATTGGCTACCTCGCTGCTTCCCAGAGCTTTCA
CGAAGGCACCGACGTCATCATGCTGACCACCAATCAGATCCGTAAGGACTTGAGCAGCCCCAGCCAGTACGACACAGGTGTTGCACTGAC
GGGTCTGTCCTGCTTCGTCACCCCAGACCTTGCCAGAGACCTGGCAAATGACATCATGACACTGATGTCACACACCAAGCCCTACATCAG
GAAGAAGGCTGTGCTGATCATGTACAAGGTGTTCCTGAAGTACCCCGAGTCGCTGCGCCCTGCCTTTCCCCGGCTGAAGGAGAAGCTGGA
GGACCCCGACCCCGGGGTTCAGTCGGCTGCCGTCAATGTCATCTGCGAGCTGGCCAGACGCAACCCTAAGAACTACCTGTCCCTGGCCCC
GCTCTTTTTCAAGCTGATGACGTCCTCCACCAACAACTGGGTCCTCATCAAGATCATCAAGCTGTTCGGTGCTCTTACTCCTTTGGAACC
GCGGCTGGGCAAGAAGCTGATCGAGCCCCTCACCAATCTCATCCACAGCACGTCTGCCATGTCTCTCCTCTATGAATGTGTGAACACCGT
GATTGCAGTGCTCATCTCGCTGTCCTCCGGCATGCCCAACCACAGCGCCAGCATCCAGCTTTGTGTTCAGAAATTAAGGATATTGATCGA
GGACTCCGATCAGAACTTGAAGTACCTGGGGCTGCTGGCAATGTCCAAGATCCTGAAGACCCACCCCAAGTCCGTGCAGTCCCACAAGGA
CCTCATCCTGCAGTGCCTGGACGACAAGGACGAGTCCATCCGGCTGCGGGCCCTGGACCTGCTCTATGGGATGGTGTCCAAGAAGAACCT
GATGGAGATCGTGAAGAAGCTGATGACCCACGTAGACAAGGCAGAGGGTACCACCTACCGTGACGAGCTGCTCACCAAGATCATTGACAT
CTGCAGCCAGTCCAACTACCAGTACATCACCAACTTCGAGTGGTACATCAGCATCCTGGTGGAGCTGACCCGGCTGGAGGGCACACGGCA
CGGCCACCTCATCGCCGCCCAAATGCTGGACGTGGCCATCCGCGTGAAGGCCATCCGCAAGTTCGCCGTGTCCCAGATGTCTGCGCTGCT
TGACAGTGCACACCTGCTGGCCAGCAGCACCCAGCGGAACGGGATCTGTGAGGTGCTGTACGCTGCCGCCTGGATCTGCGGGGAGTTCTC
AGAGCATCTGCAGGAACCACACCACACTTTGGAGGCCATGCTGCGGCCCAGAGTCACCACGCTGCCAGGCCACATCCAGGCCGTGTATGT
GCAGAACGTGGTCAAGCTCTACGCCTCCATCCTGCAGCAGAAGGAGCAGGCCGGGGAGGCAGAGGGCGCTCAGGCCGTCACCCAGCTCAT
GGTGGACCGGCTGCCCCAGTTTGTGCAGAGCGCAGACCTGGAGGTGCAGGAGCGGGCGTCCTGCATCCTGCAGCTGGTCAAGCACATCCA
GAAGCTTCAGGCCAAGGACGTGCCTGTGGCAGAGGAGGTCAGCGCTCTCTTTGCTGGGGAGCTGAACCCAGTGGCCCCCAAGGCCCAGAA
GAAGGTTCCAGTCCCCGAAGGCCTGGACCTGGACGCCTGGATCAATGAGCCACTCTCGGACAGCGAGTCAGAGGACGAGAGGCCCAGGGC
CGTCTTCCACGAGGAGGAGCAGCGGCGTCCCAAGCACCGGCCGTCGGAGGCGGACGAGGAAGAGCTGGCTCGGCGCCGAGAGGCCCGGAA
GCAGGAGCAGGCCAACAACCCCTTCTACATCAAGAGCTCGCCATCGCCACAGAAGCGGTACCAGGACACCCCGGGCGTGGAGCACATTCC
CGTGGTGCAGATTGACCTCTCCGTCCCCTTGAAGGTTCCAGGGCTGCCTATGTCAGATCAGTATGTGAAGCTGGAGGAGGAGCGGCGGCA
CCGGCAGAAGCTGGAGAAGGACAAGAGGAGGAAAAAGAGGAAGGAGAAGGAGAAGAAGGGCAAGCGCCGCCACAGCTCGCTGCCCACGGA
GAGCGACGAGGACATCGCCCCTGCCCAGCAGGTGGACATCGTCACAGAGGAGATGCCTGAGAATGCTCTGCCCAGCGACGAGGATGACAA
AGACCCCAACGACCCCTACAGGGCTCTGGATATTGACCTGGATAAGCCCTTAGCCGACAGCGAGAAACTGCCTATTCAGAAACACAGAAA
CACCGAGACCTCAAAATCCCCTGAGAAGGACGTTCCCATGGTAGAAAAGAAGAGCAAGAAACCCAAGAAGAAAGAGAAAAAACACAAAGA
GAAAGAGAGAGACAAGGAGAAGAAGAAGGAGAAGGAGAAGAAGAAATCTCCCAAGCCTAAGAAGAAGAAGCACAGGAAGGAGAAGGAGGA
GCGGACCAAAGGCAAGAAGAAGTCCAAGAAGCAGCCTCCAGGCAGCGAGGAGGCAGCGGGGGAGCCGGTGCAGAATGGCGCGCCAGAGGA
GGAGCAGCTCCCGCCTGAGTCCAGCTACTCCCTCCTCGCTGAAAATTCCTATGTTAAAATGACCTGTGACATCCGGGGCAGTCTGCAGGA
GGACAGCCAGGTCACTGTGGCCATCGTGCTGGAGAACAGGAGCAGCAGCATCCTCAAGGGCATGGAGCTCAGCGTGCTGGACTCACTCAA
TGCCAGGATGGCCCGGCCGCAGGGCTCCTCCGTCCACGATGGCGTCCCCGTGCCTTTCCAGCTGCCCCCAGGCGTCTCCAACGAAGCCCA
GTATGTGTTCACCATCCAGAGCATCGTCATGGCGCAGAAGCTCAAGGGGACCCTGTCCTTCATTGCCAAGAATGACGAGGGTGCGACCCA
CGAGAAGCTGGACTTCAGGCTGCACTTCAGCTGCAGCTCCTACTTGATCACCACTCCCTGCTACAGTGACGCCTTTGCTAAGTTGCTGGA
GTCTGGGGACTTGAGCATGAGCTCAATCAAAGTCGATGGCATTCGGATGTCCTTCCAGAATCTTCTGGCGAAGATCTGTTTTCACCACCA
TTTTTCCGTTGTGGAGCGAGTGGACTCCTGCGCCTCCATGTACAGCCGCTCCATCCAGGGCCACCATGTCTGCCTCCTGGTGAAAAAGTC
CTACACCACATATGGCCTGGAGCTGACGCGCGTCACGGTGGTCGACACGGACGTGCACGTGGTTTATGACACCTTCGTGAAGCCTGACAA
CGAGATCGTGGACTACAACACCAGGTTTTCGGGGGTGACGGAGGCTGACCTTGCCGACACAAGTGTCACGCTGCGTGACGTCCAGGCCGT
TCTGCTGAGCATGTTCAGCGCTGACACCATCCTCATCGGACACAGCCTGGAGAGCGACCTCCTGGCCCTGAAGGTCATCCACAGCACCGT
GGTGGACACGTCTGTGCTCTTCCCCCACCGCCTGGGCCTCCCCTACAAGCGGTCCCTGCGGAACCTCATGGCCGACTACCTCAGACAGAT
CATCCAGGACAATGTGGATGGGCACAGCTCCAGCGAGGACGCCGGCGCCTGCATGCACCTGGTGATCTGGAAGGTTCGAGAAGACGCCAA
GACCAAGCGATGACGCCTGCCCGCCTCCCACCCGCCTCTCCTGCCGTCCCGCTGGTCCTTAGCCCCATGCCTCTTCCAAAACAGTGCAAT
AAATCTCCGGTAACCCGTCCACCTGGCCGAGGCAGCCCAGAGCAGCGGGATGAGCTGGCGGCCAGAGAACGCCCAGCCCAGCCCACCCCC
GCTCACGTCCTGCCGCACCCCTCCGCCGGTCCCCACCCTCTGCCCCCCAGCCCTCTGGCGTCTCTAGAAACTGCTGCTGATGGAGACCAG
GACAGAGCCCGCCCCCACCCGGCCCGCAGCCCCTCCTGCCCCTCCTGCCAGGCCCTCCCTCCCGGGTGGTGGCCGGCACCTCTGCTCCTC
ACGTGGGTCGCGGGGCGGGCTTGTCCCAGGTTTGTTTGAGACAGTCTTTTTTTATTTTGTATTGTTATTTTTATTATTTTTAATTTAAAC
CTGATGACTTGCACAGCATCTTTCCCACCGGGAAGAGCGGGCCTGCTGCCTTCCCTCCTGCGGGGTGGGGGTGGGACAGACCCCAGTGGG
GGCCAGGGCCACACTCAGCGCAGTGTGGGCTGGACCCGCCTCTGTGCTCCAGGGGCTGGCCGGTCTTCGGGGGCGCCGGCTTCCACCACT
CGCCGCCCCACCCCTGCACTCCGTGACCTCTGAGAACCCAGCTGGCCCCTCGTTAGCCCCAGGGTCTGCCGTCTAGAGGGAGCCCACCGG
CCTCCGGACACTGTCTTCCCGCTAGAGCCCCGTCCTCCTCTGCGGGGCTCTCCGGACCCCCTTCCCCCTCACCTCCAGGCAGGGACAGAA

>5283_5283_1_AP3D1-REXO1_AP3D1_chr19_2108685_ENST00000345016_REXO1_chr19_1816812_ENST00000170168_length(amino acids)=1315AA_BP=1
MPRRSGRQRGAPRAAGTGPGPPHGAQPRASRGKSPWAAAMALKMVKGSIDRMFDKNLQDLVRGIRNHKEDEAKYISQCIDEIKQELKQDN
IAVKANAVCKLTYLQMLGYDISWAAFNIIEVMSASKFTFKRIGYLAASQSFHEGTDVIMLTTNQIRKDLSSPSQYDTGVALTGLSCFVTP
DLARDLANDIMTLMSHTKPYIRKKAVLIMYKVFLKYPESLRPAFPRLKEKLEDPDPGVQSAAVNVICELARRNPKNYLSLAPLFFKLMTS
STNNWVLIKIIKLFGALTPLEPRLGKKLIEPLTNLIHSTSAMSLLYECVNTVIAVLISLSSGMPNHSASIQLCVQKLRILIEDSDQNLKY
LGLLAMSKILKTHPKSVQSHKDLILQCLDDKDESIRLRALDLLYGMVSKKNLMEIVKKLMTHVDKAEGTTYRDELLTKIIDICSQSNYQY
ITNFEWYISILVELTRLEGTRHGHLIAAQMLDVAIRVKAIRKFAVSQMSALLDSAHLLASSTQRNGICEVLYAAAWICGEFSEHLQEPHH
TLEAMLRPRVTTLPGHIQAVYVQNVVKLYASILQQKEQAGEAEGAQAVTQLMVDRLPQFVQSADLEVQERASCILQLVKHIQKLQAKDVP
VAEEVSALFAGELNPVAPKAQKKVPVPEGLDLDAWINEPLSDSESEDERPRAVFHEEEQRRPKHRPSEADEEELARRREARKQEQANNPF
YIKSSPSPQKRYQDTPGVEHIPVVQIDLSVPLKVPGLPMSDQYVKLEEERRHRQKLEKDKRRKKRKEKEKKGKRRHSSLPTESDEDIAPA
QQVDIVTEEMPENALPSDEDDKDPNDPYRALDIDLDKPLADSEKLPIQKHRNTETSKSPEKDVPMVEKKSKKPKKKEKKHKEKERDKEKK
KEKEKKKSPKPKKKKHRKEKEERTKGKKKSKKQPPGSEEAAGEPVQNGAPEEEQLPPESSYSLLAENSYVKMTCDIRGSLQEDSQVTVAI
VLENRSSSILKGMELSVLDSLNARMARPQGSSVHDGVPVPFQLPPGVSNEAQYVFTIQSIVMAQKLKGTLSFIAKNDEGATHEKLDFRLH
FSCSSYLITTPCYSDAFAKLLESGDLSMSSIKVDGIRMSFQNLLAKICFHHHFSVVERVDSCASMYSRSIQGHHVCLLVKKSYTTYGLEL
TRVTVVDTDVHVVYDTFVKPDNEIVDYNTRFSGVTEADLADTSVTLRDVQAVLLSMFSADTILIGHSLESDLLALKVIHSTVVDTSVLFP

--------------------------------------------------------------
>5283_5283_2_AP3D1-REXO1_AP3D1_chr19_2108685_ENST00000350812_REXO1_chr19_1816812_ENST00000170168_length(transcript)=4347nt_BP=3065nt
GGGGCCCGGAGCGGGATCGCGGCACCTGCCGAGCGGGTCGCCGCCTCTGCCGCGGTCCTTGGACCCCGCCGCCGCCCTGGCCTGGGAGCT
TGCCCCGCCGCAGCGGCCGGCAGCGCGGCGCTCCGCGGGCGGCAGGCACGGGCCCCGGGCCCCCTCACGGCGCCCAGCCGCGGGCCTCCC
GAGGCAAAAGCCCGTGGGCCGCCGCGATGGCCCTCAAGATGGTGAAGGGCAGCATCGACCGCATGTTCGACAAGAATCTGCAGGACTTGG
TCCGCGGCATCCGTAACCACAAGGAGGACGAGGCAAAATACATATCTCAGTGCATTGATGAGATCAAGCAGGAGCTGAAGCAGGACAACA
TAGCGGTGAAGGCGAACGCGGTCTGCAAGCTGACGTATTTACAGATGTTGGGATACGACATCAGCTGGGCCGCCTTCAACATCATAGAAG
TGATGAGTGCCTCCAAGTTCACCTTCAAGCGAATTGGCTACCTCGCTGCTTCCCAGAGCTTTCACGAAGGCACCGACGTCATCATGCTGA
CCACCAATCAGATCCTGCTCATCTCGCTGTCCTCCGGCATGCCCAACCACAGCGCCAGCATCCAGCTTTGTGTTCAGAAATTAAGGATAT
TGATCGAGGACTCCGATCAGAACTTGAAGTACCTGGGGCTGCTGGCAATGTCCAAGATCCTGAAGACCCACCCCAAGTCCGTGCAGTCCC
ACAAGGACCTCATCCTGCAGTGCCTGGACGACAAGGACGAGTCCATCCGGCTGCGGGCCCTGGACCTGCTCTATGGGATGGTGTCCAAGA
AGAACCTGATGGAGATCGTGAAGAAGCTGATGACCCACGTAGACAAGGCAGAGGGTACCACCTACCGTGACGAGCTGCTCACCAAGATCA
TTGACATCTGCAGCCAGTCCAACTACCAGTACATCACCAACTTCGAGTGGTACATCAGCATCCTGGTGGAGCTGACCCGGCTGGAGGGCA
CACGGCACGGCCACCTCATCGCCGCCCAAATGCTGGACGTGGCCATCCGCGTGAAGGCCATCCGCAAGTTCGCCGTGTCCCAGATGTCTG
CGCTGCTTGACAGTGCACACCTGCTGGCCAGCAGCACCCAGCGGAACGGGATCTGTGAGGTGCTGTACGCTGCCGCCTGGATCTGCGGGG
AGTTCTCAGAGCATCTGCAGGAACCACACCACACTTTGGAGGCCATGCTGCGGCCCAGAGTCACCACGCTGCCAGGCCACATCCAGGCCG
TGTATGTGCAGAACGTGGTCAAGCTCTACGCCTCCATCCTGCAGCAGAAGGAGCAGGCCGGGGAGGCAGAGGGCGCTCAGGCCGTCACCC
AGCTCATGGTGGACCGGCTGCCCCAGTTTGTGCAGAGCGCAGACCTGGAGGTGCAGGAGCGGGCGTCCTGCATCCTGCAGCTGGTCAAGC
ACATCCAGAAGCTTCAGGCCAAGGACGTGCCTGTGGCAGAGGAGGTCAGCGCTCTCTTTGCTGGGGAGCTGAACCCAGTGGCCCCCAAGG
CCCAGAAGAAGGTTCCAGTCCCCGAAGGCCTGGACCTGGACGCCTGGATCAATGAGCCACTCTCGGACAGCGAGTCAGAGGACGAGAGGC
CCAGGGCCGTCTTCCACGAGGAGGAGCAGCGGCGTCCCAAGCACCGGCCGTCGGAGGCGGACGAGGAAGAGCTGGCTCGGCGCCGAGAGG
CCCGGAAGCAGGAGCAGGCCAACAACCCCTTCTACATCAAGAGCTCGCCATCGCCACAGAAGCGGTACCAGGACACCCCGGGCGTGGAGC
ACATTCCCGTGGTGCAGATTGACCTCTCCGTCCCCTTGAAGGTTCCAGGGCTGCCTATGTCAGATCAGTATGTGAAGCTGGAGGAGGAGC
GGCGGCACCGGCAGAAGCTGGAGAAGGACAAGAGGAGGAAAAAGAGGAAGGAGAAGGAGAAGAAGGGCAAGCGCCGCCACAGCTCGCTGC
CCACGGAGAGCGACGAGGACATCGCCCCTGCCCAGCAGGTGGACATCGTCACAGAGGAGATGCCTGAGAATGCTCTGCCCAGCGACGAGG
ATGACAAAGACCCCAACGACCCCTACAGGGCTCTGGATATTGACCTGGATAAGCCCTTAGCCGACAGCGAGAAACTGCCTATTCAGAAAC
ACAGAAACACCGAGACCTCAAAATCCCCTGAGAAGGACGTTCCCATGGTAGAAAAGAAGAGCAAGAAACCCAAGAAGAAAGAGAAAAAAC
ACAAAGAGAAAGAGAGAGACAAGGAGAAGAAGAAGGAGAAGGAGAAGAAGAAATCTCCCAAGCCTAAGAAGAAGAAGCACAGGAAGGAGA
AGGAGGAGCGGACCAAAGGCAAGAAGAAGTCCAAGAAGCAGCCTCCAGGCAGCGAGGAGGCAGCGGGGGAGCCGGTGCAGAATGGCGCGC
CAGAGGAGGAGCAGCTCCCGCCTGAGTCCAGCTACTCCCTCCTCGCTGAAAATTCCTATGTTAAAATGACCTGTGACATCCGGGGCAGTC
TGCAGGAGGACAGCCAGGTCACTGTGGCCATCGTGCTGGAGAACAGGAGCAGCAGCATCCTCAAGGGCATGGAGCTCAGCGTGCTGGACT
CACTCAATGCCAGGATGGCCCGGCCGCAGGGCTCCTCCGTCCACGATGGCGTCCCCGTGCCTTTCCAGCTGCCCCCAGGCGTCTCCAACG
AAGCCCAGTATGTGTTCACCATCCAGAGCATCGTCATGGCGCAGAAGCTCAAGGGGACCCTGTCCTTCATTGCCAAGAATGACGAGGGTG
CGACCCACGAGAAGCTGGACTTCAGGCTGCACTTCAGCTGCAGCTCCTACTTGATCACCACTCCCTGCTACAGTGACGCCTTTGCTAAGT
TGCTGGAGTCTGGGGACTTGAGCATGAGCTCAATCAAAGTCGATGGCATTCGGATGTCCTTCCAGAATCTTCTGGCGAAGATCTGTTTTC
ACCACCATTTTTCCGTTGTGGAGCGAGTGGACTCCTGCGCCTCCATGTACAGCCGCTCCATCCAGGGCCACCATGTCTGCCTCCTGGTGA
AAAAGTCCTACACCACATATGGCCTGGAGCTGACGCGCGTCACGGTGGTCGACACGGACGTGCACGTGGTTTATGACACCTTCGTGAAGC
CTGACAACGAGATCGTGGACTACAACACCAGGTTTTCGGGGGTGACGGAGGCTGACCTTGCCGACACAAGTGTCACGCTGCGTGACGTCC
AGGCCGTTCTGCTGAGCATGTTCAGCGCTGACACCATCCTCATCGGACACAGCCTGGAGAGCGACCTCCTGGCCCTGAAGGTCATCCACA
GCACCGTGGTGGACACGTCTGTGCTCTTCCCCCACCGCCTGGGCCTCCCCTACAAGCGGTCCCTGCGGAACCTCATGGCCGACTACCTCA
GACAGATCATCCAGGACAATGTGGATGGGCACAGCTCCAGCGAGGACGCCGGCGCCTGCATGCACCTGGTGATCTGGAAGGTTCGAGAAG
ACGCCAAGACCAAGCGATGACGCCTGCCCGCCTCCCACCCGCCTCTCCTGCCGTCCCGCTGGTCCTTAGCCCCATGCCTCTTCCAAAACA
GTGCAATAAATCTCCGGTAACCCGTCCACCTGGCCGAGGCAGCCCAGAGCAGCGGGATGAGCTGGCGGCCAGAGAACGCCCAGCCCAGCC
CACCCCCGCTCACGTCCTGCCGCACCCCTCCGCCGGTCCCCACCCTCTGCCCCCCAGCCCTCTGGCGTCTCTAGAAACTGCTGCTGATGG
AGACCAGGACAGAGCCCGCCCCCACCCGGCCCGCAGCCCCTCCTGCCCCTCCTGCCAGGCCCTCCCTCCCGGGTGGTGGCCGGCACCTCT
GCTCCTCACGTGGGTCGCGGGGCGGGCTTGTCCCAGGTTTGTTTGAGACAGTCTTTTTTTATTTTGTATTGTTATTTTTATTATTTTTAA
TTTAAACCTGATGACTTGCACAGCATCTTTCCCACCGGGAAGAGCGGGCCTGCTGCCTTCCCTCCTGCGGGGTGGGGGTGGGACAGACCC
CAGTGGGGGCCAGGGCCACACTCAGCGCAGTGTGGGCTGGACCCGCCTCTGTGCTCCAGGGGCTGGCCGGTCTTCGGGGGCGCCGGCTTC
CACCACTCGCCGCCCCACCCCTGCACTCCGTGACCTCTGAGAACCCAGCTGGCCCCTCGTTAGCCCCAGGGTCTGCCGTCTAGAGGGAGC
CCACCGGCCTCCGGACACTGTCTTCCCGCTAGAGCCCCGTCCTCCTCTGCGGGGCTCTCCGGACCCCCTTCCCCCTCACCTCCAGGCAGG

>5283_5283_2_AP3D1-REXO1_AP3D1_chr19_2108685_ENST00000350812_REXO1_chr19_1816812_ENST00000170168_length(amino acids)=1146AA_BP=1
MPRRSGRQRGAPRAAGTGPGPPHGAQPRASRGKSPWAAAMALKMVKGSIDRMFDKNLQDLVRGIRNHKEDEAKYISQCIDEIKQELKQDN
IAVKANAVCKLTYLQMLGYDISWAAFNIIEVMSASKFTFKRIGYLAASQSFHEGTDVIMLTTNQILLISLSSGMPNHSASIQLCVQKLRI
LIEDSDQNLKYLGLLAMSKILKTHPKSVQSHKDLILQCLDDKDESIRLRALDLLYGMVSKKNLMEIVKKLMTHVDKAEGTTYRDELLTKI
IDICSQSNYQYITNFEWYISILVELTRLEGTRHGHLIAAQMLDVAIRVKAIRKFAVSQMSALLDSAHLLASSTQRNGICEVLYAAAWICG
EFSEHLQEPHHTLEAMLRPRVTTLPGHIQAVYVQNVVKLYASILQQKEQAGEAEGAQAVTQLMVDRLPQFVQSADLEVQERASCILQLVK
HIQKLQAKDVPVAEEVSALFAGELNPVAPKAQKKVPVPEGLDLDAWINEPLSDSESEDERPRAVFHEEEQRRPKHRPSEADEEELARRRE
ARKQEQANNPFYIKSSPSPQKRYQDTPGVEHIPVVQIDLSVPLKVPGLPMSDQYVKLEEERRHRQKLEKDKRRKKRKEKEKKGKRRHSSL
PTESDEDIAPAQQVDIVTEEMPENALPSDEDDKDPNDPYRALDIDLDKPLADSEKLPIQKHRNTETSKSPEKDVPMVEKKSKKPKKKEKK
HKEKERDKEKKKEKEKKKSPKPKKKKHRKEKEERTKGKKKSKKQPPGSEEAAGEPVQNGAPEEEQLPPESSYSLLAENSYVKMTCDIRGS
LQEDSQVTVAIVLENRSSSILKGMELSVLDSLNARMARPQGSSVHDGVPVPFQLPPGVSNEAQYVFTIQSIVMAQKLKGTLSFIAKNDEG
ATHEKLDFRLHFSCSSYLITTPCYSDAFAKLLESGDLSMSSIKVDGIRMSFQNLLAKICFHHHFSVVERVDSCASMYSRSIQGHHVCLLV
KKSYTTYGLELTRVTVVDTDVHVVYDTFVKPDNEIVDYNTRFSGVTEADLADTSVTLRDVQAVLLSMFSADTILIGHSLESDLLALKVIH

--------------------------------------------------------------
>5283_5283_3_AP3D1-REXO1_AP3D1_chr19_2108685_ENST00000355272_REXO1_chr19_1816812_ENST00000170168_length(transcript)=5041nt_BP=3759nt
AGGGGCCCGGAGCGGGATCGCGGCACCTGCCGAGCGGGTCGCCGCCTCTGCCGCGGTCCTTGGACCCCGCCGCCGCCCTGGCCTGGGAGC
TTGCCCCGCCGCAGCGGCCGGCAGCGCGGCGCTCCGCGGGCGGCAGGCACGGGCCCCGGGCCCCCTCACGGCGCCCAGCCGCGGGCCTCC
CGAGGCAAAAGCCCGTGGGCCGCCGCGATGGCCCTCAAGATGGTGAAGGGCAGCATCGACCGCATGTTCGACAAGAATCTGCAGGACTTG
GTCCGCGGCATCCGTAACCACAAGGAGGACGAGGCAAAATACATATCTCAGTGCATTGATGAGATCAAGCAGGAGCTGAAGCAGGACAAC
ATAGCGGTGAAGGCGAACGCGGTCTGCAAGCTGACGTATTTACAGATGTTGGGATACGACATCAGCTGGGCCGCCTTCAACATCATAGAA
GTGATGAGTGCCTCCAAGTTCACCTTCAAGCGAATTGGCTACCTCGCTGCTTCCCAGAGCTTTCACGAAGGCACCGACGTCATCATGCTG
ACCACCAATCAGATCCGTAAGGACTTGAGCAGCCCCAGCCAGTACGACACAGGTGTTGCACTGACGGGTCTGTCCTGCTTCGTCACCCCA
GACCTTGCCAGAGACCTGGCAAATGACATCATGACACTGATGTCACACACCAAGCCCTACATCAGGAAGAAGGCTGTGCTGATCATGTAC
AAGGTGTTCCTGAAGTACCCCGAGTCGCTGCGCCCTGCCTTTCCCCGGCTGAAGGAGAAGCTGGAGGACCCCGACCCCGGGGTTCAGTCG
GCTGCCGTCAATGTCATCTGCGAGCTGGCCAGACGCAACCCTAAGAACTACCTGTCCCTGGCCCCGCTCTTTTTCAAGCTGATGACGTCC
TCCACCAACAACTGGGTCCTCATCAAGATCATCAAGCTGTTCGGTGCTCTTACTCCTTTGGAACCGCGGCTGGGCAAGAAGCTGATCGAG
CCCCTCACCAATCTCATCCACAGCACGTCTGCCATGTCTCTCCTCTATGAATGTGTGAACACCGTGATTGCAGTGCTCATCTCGCTGTCC
TCCGGCATGCCCAACCACAGCGCCAGCATCCAGCTTTGTGTTCAGAAATTAAGGATATTGATCGAGGACTCCGATCAGAACTTGAAGTAC
CTGGGGCTGCTGGCAATGTCCAAGATCCTGAAGACCCACCCCAAGTCCGTGCAGTCCCACAAGGACCTCATCCTGCAGTGCCTGGACGAC
AAGGACGAGTCCATCCGGCTGCGGGCCCTGGACCTGCTCTATGGGATGGTGTCCAAGAAGAACCTGATGGAGATCGTGAAGAAGCTGATG
ACCCACGTAGACAAGGCAGAGGGTACCACCTACCGTGACGAGCTGCTCACCAAGATCATTGACATCTGCAGCCAGTCCAACTACCAGTAC
ATCACCAACTTCGAGTGGTACATCAGCATCCTGGTGGAGCTGACCCGGCTGGAGGGCACACGGCACGGCCACCTCATCGCCGCCCAAATG
CTGGACGTGGCCATCCGCGTGAAGGCCATCCGCAAGTTCGCCGTGTCCCAGATGTCTGCGCTGCTTGACAGTGCACACCTGCTGGCCAGC
AGCACCCAGCGGAACGGGATCTGTGAGGTGCTGTACGCTGCCGCCTGGATCTGCGGGGAGTTCTCAGAGCATCTGCAGGAACCACACCAC
ACTTTGGAGGCCATGCTGCGGCCCAGAGTCACCACGCTGCCAGGCCACATCCAGGCCGTGTATGTGCAGAACGTGGTCAAGCTCTACGCC
TCCATCCTGCAGCAGAAGGAGCAGGCCGGGGAGGCAGAGGGCGCTCAGGCCGTCACCCAGCTCATGGTGGACCGGCTGCCCCAGTTTGTG
CAGAGCGCAGACCTGGAGGTGCAGGAGCGGGCGTCCTGCATCCTGCAGCTGGTCAAGCACATCCAGAAGCTTCAGGCCAAGGACGTGCCT
GTGGCAGAGGAGGTCAGCGCTCTCTTTGCTGGGGAGCTGAACCCAGTGGCCCCCAAGGCCCAGAAGAAGGTTCCAGTCCCCGAAGGCCTG
GACCTGGACGCCTGGATCAATGAGCCACTCTCGGACAGCGAGTCAGAGGACGAGAGGCCCAGGGCCGTCTTCCACGAGGAGGAGCAGCGG
CGTCCCAAGCACCGGCCGTCGGAGGCGGACGAGGAAGAGCTGGCTCGGCGCCGAGAGGCCCGGAAGCAGGAGCAGGCCAACAACCCCTTC
TACATCAAGAGCTCGCCATCGCCACAGAAGCGGTACCAGGACACCCCGGGCGTGGAGCACATTCCCGTGGTGCAGATTGACCTCTCCGTC
CCCTTGAAGGTTCCAGGGCTGCCTATGTCAGATCAGTATGTGAAGCTGGAGGAGGAGCGGCGGCACCGGCAGAAGCTGGAGAAGGACAAG
AGGAGGAAAAAGAGGAAGGAGAAGGAGAAGAAGGGCAAGCGCCGCCACAGCTCGCTGCCCACGGAGAGCGACGAGGACATCGCCCCTGCC
CAGCAGGTGGACATCGTCACAGAGGAGATGCCTGAGAATGCTCTGCCCAGCGACGAGGATGACAAAGACCCCAACGACCCCTACAGGGCT
CTGGATATTGACCTGGATAAGCCCTTAGCCGACAGCGAGAAACTGCCTATTCAGAAACACAGAAACACCGAGACCTCAAAATCCCCTGAG
AAGGACGTTCCCATGGTAGAAAAGAAGAGCAAGAAACCCAAGAAGAAAGAGAAAAAACACAAAGAGAAAGAGAGAGACAAGGAGAAGAAG
AAGGAGAAGGAGAAGAAGGCTGAGGACCTGGACTTCTGGCTGTCTACCACCCCACCGCCTGCCCCCGCCCCCGCCCCCGCCCCCGTTCCA
TCCACGGGGGAGCTCAGTGTGAACACTGTCACTACCCCGAAGGACGAGTGTGAGGACGCCAAGACGGAGGCGCAGGGCGAGGAGGACGAT
GCCGAGGGGCAAGACCAGGACAAGAAATCTCCCAAGCCTAAGAAGAAGAAGCACAGGAAGGAGAAGGAGGAGCGGACCAAAGGCAAGAAG
AAGTCCAAGAAGCAGCCTCCAGGCAGCGAGGAGGCAGCGGGGGAGCCGGTGCAGAATGGCGCGCCAGAGGAGGAGCAGCTCCCGCCTGAG
TCCAGCTACTCCCTCCTCGCTGAAAATTCCTATGTTAAAATGACCTGTGACATCCGGGGCAGTCTGCAGGAGGACAGCCAGGTCACTGTG
GCCATCGTGCTGGAGAACAGGAGCAGCAGCATCCTCAAGGGCATGGAGCTCAGCGTGCTGGACTCACTCAATGCCAGGATGGCCCGGCCG
CAGGGCTCCTCCGTCCACGATGGCGTCCCCGTGCCTTTCCAGCTGCCCCCAGGCGTCTCCAACGAAGCCCAGTATGTGTTCACCATCCAG
AGCATCGTCATGGCGCAGAAGCTCAAGGGGACCCTGTCCTTCATTGCCAAGAATGACGAGGGTGCGACCCACGAGAAGCTGGACTTCAGG
CTGCACTTCAGCTGCAGCTCCTACTTGATCACCACTCCCTGCTACAGTGACGCCTTTGCTAAGTTGCTGGAGTCTGGGGACTTGAGCATG
AGCTCAATCAAAGTCGATGGCATTCGGATGTCCTTCCAGAATCTTCTGGCGAAGATCTGTTTTCACCACCATTTTTCCGTTGTGGAGCGA
GTGGACTCCTGCGCCTCCATGTACAGCCGCTCCATCCAGGGCCACCATGTCTGCCTCCTGGTGAAAAAGTCCTACACCACATATGGCCTG
GAGCTGACGCGCGTCACGGTGGTCGACACGGACGTGCACGTGGTTTATGACACCTTCGTGAAGCCTGACAACGAGATCGTGGACTACAAC
ACCAGGTTTTCGGGGGTGACGGAGGCTGACCTTGCCGACACAAGTGTCACGCTGCGTGACGTCCAGGCCGTTCTGCTGAGCATGTTCAGC
GCTGACACCATCCTCATCGGACACAGCCTGGAGAGCGACCTCCTGGCCCTGAAGGTCATCCACAGCACCGTGGTGGACACGTCTGTGCTC
TTCCCCCACCGCCTGGGCCTCCCCTACAAGCGGTCCCTGCGGAACCTCATGGCCGACTACCTCAGACAGATCATCCAGGACAATGTGGAT
GGGCACAGCTCCAGCGAGGACGCCGGCGCCTGCATGCACCTGGTGATCTGGAAGGTTCGAGAAGACGCCAAGACCAAGCGATGACGCCTG
CCCGCCTCCCACCCGCCTCTCCTGCCGTCCCGCTGGTCCTTAGCCCCATGCCTCTTCCAAAACAGTGCAATAAATCTCCGGTAACCCGTC
CACCTGGCCGAGGCAGCCCAGAGCAGCGGGATGAGCTGGCGGCCAGAGAACGCCCAGCCCAGCCCACCCCCGCTCACGTCCTGCCGCACC
CCTCCGCCGGTCCCCACCCTCTGCCCCCCAGCCCTCTGGCGTCTCTAGAAACTGCTGCTGATGGAGACCAGGACAGAGCCCGCCCCCACC
CGGCCCGCAGCCCCTCCTGCCCCTCCTGCCAGGCCCTCCCTCCCGGGTGGTGGCCGGCACCTCTGCTCCTCACGTGGGTCGCGGGGCGGG
CTTGTCCCAGGTTTGTTTGAGACAGTCTTTTTTTATTTTGTATTGTTATTTTTATTATTTTTAATTTAAACCTGATGACTTGCACAGCAT
CTTTCCCACCGGGAAGAGCGGGCCTGCTGCCTTCCCTCCTGCGGGGTGGGGGTGGGACAGACCCCAGTGGGGGCCAGGGCCACACTCAGC
GCAGTGTGGGCTGGACCCGCCTCTGTGCTCCAGGGGCTGGCCGGTCTTCGGGGGCGCCGGCTTCCACCACTCGCCGCCCCACCCCTGCAC
TCCGTGACCTCTGAGAACCCAGCTGGCCCCTCGTTAGCCCCAGGGTCTGCCGTCTAGAGGGAGCCCACCGGCCTCCGGACACTGTCTTCC
CGCTAGAGCCCCGTCCTCCTCTGCGGGGCTCTCCGGACCCCCTTCCCCCTCACCTCCAGGCAGGGACAGAATAAATGTTTGTATGGATTT

>5283_5283_3_AP3D1-REXO1_AP3D1_chr19_2108685_ENST00000355272_REXO1_chr19_1816812_ENST00000170168_length(amino acids)=1377AA_BP=1
MPRRSGRQRGAPRAAGTGPGPPHGAQPRASRGKSPWAAAMALKMVKGSIDRMFDKNLQDLVRGIRNHKEDEAKYISQCIDEIKQELKQDN
IAVKANAVCKLTYLQMLGYDISWAAFNIIEVMSASKFTFKRIGYLAASQSFHEGTDVIMLTTNQIRKDLSSPSQYDTGVALTGLSCFVTP
DLARDLANDIMTLMSHTKPYIRKKAVLIMYKVFLKYPESLRPAFPRLKEKLEDPDPGVQSAAVNVICELARRNPKNYLSLAPLFFKLMTS
STNNWVLIKIIKLFGALTPLEPRLGKKLIEPLTNLIHSTSAMSLLYECVNTVIAVLISLSSGMPNHSASIQLCVQKLRILIEDSDQNLKY
LGLLAMSKILKTHPKSVQSHKDLILQCLDDKDESIRLRALDLLYGMVSKKNLMEIVKKLMTHVDKAEGTTYRDELLTKIIDICSQSNYQY
ITNFEWYISILVELTRLEGTRHGHLIAAQMLDVAIRVKAIRKFAVSQMSALLDSAHLLASSTQRNGICEVLYAAAWICGEFSEHLQEPHH
TLEAMLRPRVTTLPGHIQAVYVQNVVKLYASILQQKEQAGEAEGAQAVTQLMVDRLPQFVQSADLEVQERASCILQLVKHIQKLQAKDVP
VAEEVSALFAGELNPVAPKAQKKVPVPEGLDLDAWINEPLSDSESEDERPRAVFHEEEQRRPKHRPSEADEEELARRREARKQEQANNPF
YIKSSPSPQKRYQDTPGVEHIPVVQIDLSVPLKVPGLPMSDQYVKLEEERRHRQKLEKDKRRKKRKEKEKKGKRRHSSLPTESDEDIAPA
QQVDIVTEEMPENALPSDEDDKDPNDPYRALDIDLDKPLADSEKLPIQKHRNTETSKSPEKDVPMVEKKSKKPKKKEKKHKEKERDKEKK
KEKEKKAEDLDFWLSTTPPPAPAPAPAPVPSTGELSVNTVTTPKDECEDAKTEAQGEEDDAEGQDQDKKSPKPKKKKHRKEKEERTKGKK
KSKKQPPGSEEAAGEPVQNGAPEEEQLPPESSYSLLAENSYVKMTCDIRGSLQEDSQVTVAIVLENRSSSILKGMELSVLDSLNARMARP
QGSSVHDGVPVPFQLPPGVSNEAQYVFTIQSIVMAQKLKGTLSFIAKNDEGATHEKLDFRLHFSCSSYLITTPCYSDAFAKLLESGDLSM
SSIKVDGIRMSFQNLLAKICFHHHFSVVERVDSCASMYSRSIQGHHVCLLVKKSYTTYGLELTRVTVVDTDVHVVYDTFVKPDNEIVDYN
TRFSGVTEADLADTSVTLRDVQAVLLSMFSADTILIGHSLESDLLALKVIHSTVVDTSVLFPHRLGLPYKRSLRNLMADYLRQIIQDNVD

--------------------------------------------------------------
>5283_5283_4_AP3D1-REXO1_AP3D1_chr19_2108685_ENST00000356926_REXO1_chr19_1816812_ENST00000170168_length(transcript)=4748nt_BP=3466nt
TTGTCCGCCATTTGCAAGGGGCCCGGAGCGGGATCGCGGCACCTGCCGAGCGGGTCGCCGCCTCTGCCGCGGTCCTTGGACCCCGCCGCC
GCCCTGGCCTGGGAGCTTGCCCCGCCGCAGCGGCCGGCAGCGCGGCGCTCCGCGGGCGGCAGGCACGGGCCCCGGGCCCCCTCACGGCGC
CCAGCCGCGGGCCTCCCGAGGCAAAAGCCCGTGGGCCGCCGCGATGGCCCTCAAGATGGTGAAGGGCAGCATCGACCGCATGTTCGACAA
GAATCTGCAGGACTTGGTCCGCGGCATCCGTAACCACAAGGAGGACGAGGCAAAATACATATCTCAGTGCATTGATGAGATCAAGCAGGA
GCTGAAGCAGGACAACATAGCGGTGAAGGCGAACGCGGTCTGCAAGCTGACGTATTTACAGATGTTGGGATACGACATCAGCTGGGCCGC
CTTCAACATCATAGAAGTGATGAGTGCCTCCAAGTTCACCTTCAAGCGAATTGGCTACCTCGCTGCTTCCCAGAGCTTTCACGAAGGCAC
CGACGTCATCATGCTGACCACCAATCAGATCCGTAAGGACTTGAGCAGCCCCAGCCAGTACGACACAGGTGTTGCACTGACGGGTCTGTC
CTGCTTCGTCACCCCAGACCTTGCCAGAGACCTGGCAAATGACATCATGACACTGATGTCACACACCAAGCCCTACATCAGGAAGAAGGC
TAAGCTGATCGAGCCCCTCACCAATCTCATCCACAGCACGTCTGCCATGTCTCTCCTCTATGAATGTGTGAACACCGTGATTGCAGTGCT
CATCTCGCTGTCCTCCGGCATGCCCAACCACAGCGCCAGCATCCAGCTTTGTGTTCAGAAATTAAGGATATTGATCGAGGACTCCGATCA
GAACTTGAAGTACCTGGGGCTGCTGGCAATGTCCAAGATCCTGAAGACCCACCCCAAGTCCGTGCAGTCCCACAAGGACCTCATCCTGCA
GTGCCTGGACGACAAGGACGAGTCCATCCGGCTGCGGGCCCTGGACCTGCTCTATGGGATGGTGTCCAAGAAGAACCTGATGGAGATCGT
GAAGAAGCTGATGACCCACGTAGACAAGGCAGAGGGTACCACCTACCGTGACGAGCTGCTCACCAAGATCATTGACATCTGCAGCCAGTC
CAACTACCAGTACATCACCAACTTCGAGTGGTACATCAGCATCCTGGTGGAGCTGACCCGGCTGGAGGGCACACGGCACGGCCACCTCAT
CGCCGCCCAAATGCTGGACGTGGCCATCCGCGTGAAGGCCATCCGCAAGTTCGCCGTGTCCCAGATGTCTGCGCTGCTTGACAGTGCACA
CCTGCTGGCCAGCAGCACCCAGCGGAACGGGATCTGTGAGGTGCTGTACGCTGCCGCCTGGATCTGCGGGGAGTTCTCAGAGCATCTGCA
GGAACCACACCACACTTTGGAGGCCATGCTGCGGCCCAGAGTCACCACGCTGCCAGGCCACATCCAGGCCGTGTATGTGCAGAACGTGGT
CAAGCTCTACGCCTCCATCCTGCAGCAGAAGGAGCAGGCCGGGGAGGCAGAGGGCGCTCAGGCCGTCACCCAGCTCATGGTGGACCGGCT
GCCCCAGTTTGTGCAGAGCGCAGACCTGGAGGTGCAGGAGCGGGCGTCCTGCATCCTGCAGCTGGTCAAGCACATCCAGAAGCTTCAGGC
CAAGGACGTGCCTGTGGCAGAGGAGGTCAGCGCTCTCTTTGCTGGGGAGCTGAACCCAGTGGCCCCCAAGGCCCAGAAGAAGGTTCCAGT
CCCCGAAGGCCTGGACCTGGACGCCTGGATCAATGAGCCACTCTCGGACAGCGAGTCAGAGGACGAGAGGCCCAGGGCCGTCTTCCACGA
GGAGGAGCAGCGGCGTCCCAAGCACCGGCCGTCGGAGGCGGACGAGGAAGAGCTGGCTCGGCGCCGAGAGGCCCGGAAGCAGGAGCAGGC
CAACAACCCCTTCTACATCAAGAGCTCGCCATCGCCACAGAAGCGGTACCAGGACACCCCGGGCGTGGAGCACATTCCCGTGGTGCAGAT
TGACCTCTCCGTCCCCTTGAAGGTTCCAGGGCTGCCTATGTCAGATCAGTATGTGAAGCTGGAGGAGGAGCGGCGGCACCGGCAGAAGCT
GGAGAAGGACAAGAGGAGGAAAAAGAGGAAGGAGAAGGAGAAGAAGGGCAAGCGCCGCCACAGCTCGCTGCCCACGGAGAGCGACGAGGA
CATCGCCCCTGCCCAGCAGGTGGACATCGTCACAGAGGAGATGCCTGAGAATGCTCTGCCCAGCGACGAGGATGACAAAGACCCCAACGA
CCCCTACAGGGCTCTGGATATTGACCTGGATAAGCCCTTAGCCGACAGCGAGAAACTGCCTATTCAGAAACACAGAAACACCGAGACCTC
AAAATCCCCTGAGAAGGACGTTCCCATGGTAGAAAAGAAGAGCAAGAAACCCAAGAAGAAAGAGAAAAAACACAAAGAGAAAGAGAGAGA
CAAGGAGAAGAAGAAGGAGAAGGAGAAGAAGGCTGAGGACCTGGACTTCTGGCTGTCTACCACCCCACCGCCTGCCCCCGCCCCCGCCCC
CGCCCCCGTTCCATCCACGGACGAGTGTGAGGACGCCAAGACGGAGGCGCAGGGCGAGGAGGACGATGCCGAGGGGCAAGACCAGGACAA
GAAATCTCCCAAGCCTAAGAAGAAGAAGCACAGGAAGGAGAAGGAGGAGCGGACCAAAGGCAAGAAGAAGTCCAAGAAGCAGCCTCCAGG
CAGCGAGGAGGCAGCGGGGGAGCCGGTGCAGAATGGCGCGCCAGAGGAGGAGCAGCTCCCGCCTGAGTCCAGCTACTCCCTCCTCGCTGA
AAATTCCTATGTTAAAATGACCTGTGACATCCGGGGCAGTCTGCAGGAGGACAGCCAGGTCACTGTGGCCATCGTGCTGGAGAACAGGAG
CAGCAGCATCCTCAAGGGCATGGAGCTCAGCGTGCTGGACTCACTCAATGCCAGGATGGCCCGGCCGCAGGGCTCCTCCGTCCACGATGG
CGTCCCCGTGCCTTTCCAGCTGCCCCCAGGCGTCTCCAACGAAGCCCAGTATGTGTTCACCATCCAGAGCATCGTCATGGCGCAGAAGCT
CAAGGGGACCCTGTCCTTCATTGCCAAGAATGACGAGGGTGCGACCCACGAGAAGCTGGACTTCAGGCTGCACTTCAGCTGCAGCTCCTA
CTTGATCACCACTCCCTGCTACAGTGACGCCTTTGCTAAGTTGCTGGAGTCTGGGGACTTGAGCATGAGCTCAATCAAAGTCGATGGCAT
TCGGATGTCCTTCCAGAATCTTCTGGCGAAGATCTGTTTTCACCACCATTTTTCCGTTGTGGAGCGAGTGGACTCCTGCGCCTCCATGTA
CAGCCGCTCCATCCAGGGCCACCATGTCTGCCTCCTGGTGAAAAAGTCCTACACCACATATGGCCTGGAGCTGACGCGCGTCACGGTGGT
CGACACGGACGTGCACGTGGTTTATGACACCTTCGTGAAGCCTGACAACGAGATCGTGGACTACAACACCAGGTTTTCGGGGGTGACGGA
GGCTGACCTTGCCGACACAAGTGTCACGCTGCGTGACGTCCAGGCCGTTCTGCTGAGCATGTTCAGCGCTGACACCATCCTCATCGGACA
CAGCCTGGAGAGCGACCTCCTGGCCCTGAAGGTCATCCACAGCACCGTGGTGGACACGTCTGTGCTCTTCCCCCACCGCCTGGGCCTCCC
CTACAAGCGGTCCCTGCGGAACCTCATGGCCGACTACCTCAGACAGATCATCCAGGACAATGTGGATGGGCACAGCTCCAGCGAGGACGC
CGGCGCCTGCATGCACCTGGTGATCTGGAAGGTTCGAGAAGACGCCAAGACCAAGCGATGACGCCTGCCCGCCTCCCACCCGCCTCTCCT
GCCGTCCCGCTGGTCCTTAGCCCCATGCCTCTTCCAAAACAGTGCAATAAATCTCCGGTAACCCGTCCACCTGGCCGAGGCAGCCCAGAG
CAGCGGGATGAGCTGGCGGCCAGAGAACGCCCAGCCCAGCCCACCCCCGCTCACGTCCTGCCGCACCCCTCCGCCGGTCCCCACCCTCTG
CCCCCCAGCCCTCTGGCGTCTCTAGAAACTGCTGCTGATGGAGACCAGGACAGAGCCCGCCCCCACCCGGCCCGCAGCCCCTCCTGCCCC
TCCTGCCAGGCCCTCCCTCCCGGGTGGTGGCCGGCACCTCTGCTCCTCACGTGGGTCGCGGGGCGGGCTTGTCCCAGGTTTGTTTGAGAC
AGTCTTTTTTTATTTTGTATTGTTATTTTTATTATTTTTAATTTAAACCTGATGACTTGCACAGCATCTTTCCCACCGGGAAGAGCGGGC
CTGCTGCCTTCCCTCCTGCGGGGTGGGGGTGGGACAGACCCCAGTGGGGGCCAGGGCCACACTCAGCGCAGTGTGGGCTGGACCCGCCTC
TGTGCTCCAGGGGCTGGCCGGTCTTCGGGGGCGCCGGCTTCCACCACTCGCCGCCCCACCCCTGCACTCCGTGACCTCTGAGAACCCAGC
TGGCCCCTCGTTAGCCCCAGGGTCTGCCGTCTAGAGGGAGCCCACCGGCCTCCGGACACTGTCTTCCCGCTAGAGCCCCGTCCTCCTCTG

>5283_5283_4_AP3D1-REXO1_AP3D1_chr19_2108685_ENST00000356926_REXO1_chr19_1816812_ENST00000170168_length(amino acids)=1274AA_BP=1
MPRRSGRQRGAPRAAGTGPGPPHGAQPRASRGKSPWAAAMALKMVKGSIDRMFDKNLQDLVRGIRNHKEDEAKYISQCIDEIKQELKQDN
IAVKANAVCKLTYLQMLGYDISWAAFNIIEVMSASKFTFKRIGYLAASQSFHEGTDVIMLTTNQIRKDLSSPSQYDTGVALTGLSCFVTP
DLARDLANDIMTLMSHTKPYIRKKAKLIEPLTNLIHSTSAMSLLYECVNTVIAVLISLSSGMPNHSASIQLCVQKLRILIEDSDQNLKYL
GLLAMSKILKTHPKSVQSHKDLILQCLDDKDESIRLRALDLLYGMVSKKNLMEIVKKLMTHVDKAEGTTYRDELLTKIIDICSQSNYQYI
TNFEWYISILVELTRLEGTRHGHLIAAQMLDVAIRVKAIRKFAVSQMSALLDSAHLLASSTQRNGICEVLYAAAWICGEFSEHLQEPHHT
LEAMLRPRVTTLPGHIQAVYVQNVVKLYASILQQKEQAGEAEGAQAVTQLMVDRLPQFVQSADLEVQERASCILQLVKHIQKLQAKDVPV
AEEVSALFAGELNPVAPKAQKKVPVPEGLDLDAWINEPLSDSESEDERPRAVFHEEEQRRPKHRPSEADEEELARRREARKQEQANNPFY
IKSSPSPQKRYQDTPGVEHIPVVQIDLSVPLKVPGLPMSDQYVKLEEERRHRQKLEKDKRRKKRKEKEKKGKRRHSSLPTESDEDIAPAQ
QVDIVTEEMPENALPSDEDDKDPNDPYRALDIDLDKPLADSEKLPIQKHRNTETSKSPEKDVPMVEKKSKKPKKKEKKHKEKERDKEKKK
EKEKKAEDLDFWLSTTPPPAPAPAPAPVPSTDECEDAKTEAQGEEDDAEGQDQDKKSPKPKKKKHRKEKEERTKGKKKSKKQPPGSEEAA
GEPVQNGAPEEEQLPPESSYSLLAENSYVKMTCDIRGSLQEDSQVTVAIVLENRSSSILKGMELSVLDSLNARMARPQGSSVHDGVPVPF
QLPPGVSNEAQYVFTIQSIVMAQKLKGTLSFIAKNDEGATHEKLDFRLHFSCSSYLITTPCYSDAFAKLLESGDLSMSSIKVDGIRMSFQ
NLLAKICFHHHFSVVERVDSCASMYSRSIQGHHVCLLVKKSYTTYGLELTRVTVVDTDVHVVYDTFVKPDNEIVDYNTRFSGVTEADLAD
TSVTLRDVQAVLLSMFSADTILIGHSLESDLLALKVIHSTVVDTSVLFPHRLGLPYKRSLRNLMADYLRQIIQDNVDGHSSSEDAGACMH

--------------------------------------------------------------
>5283_5283_5_AP3D1-REXO1_AP3D1_chr19_2108686_ENST00000345016_REXO1_chr19_1816812_ENST00000170168_length(transcript)=4880nt_BP=3598nt
GCCGCCATCTTGTCCGCCATTTGCAAGGGGCCCGGAGCGGGATCGCGGCACCTGCCGAGCGGGTCGCCGCCTCTGCCGCGGTCCTTGGAC
CCCGCCGCCGCCCTGGCCTGGGAGCTTGCCCCGCCGCAGCGGCCGGCAGCGCGGCGCTCCGCGGGCGGCAGGCACGGGCCCCGGGCCCCC
TCACGGCGCCCAGCCGCGGGCCTCCCGAGGCAAAAGCCCGTGGGCCGCCGCGATGGCCCTCAAGATGGTGAAGGGCAGCATCGACCGCAT
GTTCGACAAGAATCTGCAGGACTTGGTCCGCGGCATCCGTAACCACAAGGAGGACGAGGCAAAATACATATCTCAGTGCATTGATGAGAT
CAAGCAGGAGCTGAAGCAGGACAACATAGCGGTGAAGGCGAACGCGGTCTGCAAGCTGACGTATTTACAGATGTTGGGATACGACATCAG
CTGGGCCGCCTTCAACATCATAGAAGTGATGAGTGCCTCCAAGTTCACCTTCAAGCGAATTGGCTACCTCGCTGCTTCCCAGAGCTTTCA
CGAAGGCACCGACGTCATCATGCTGACCACCAATCAGATCCGTAAGGACTTGAGCAGCCCCAGCCAGTACGACACAGGTGTTGCACTGAC
GGGTCTGTCCTGCTTCGTCACCCCAGACCTTGCCAGAGACCTGGCAAATGACATCATGACACTGATGTCACACACCAAGCCCTACATCAG
GAAGAAGGCTGTGCTGATCATGTACAAGGTGTTCCTGAAGTACCCCGAGTCGCTGCGCCCTGCCTTTCCCCGGCTGAAGGAGAAGCTGGA
GGACCCCGACCCCGGGGTTCAGTCGGCTGCCGTCAATGTCATCTGCGAGCTGGCCAGACGCAACCCTAAGAACTACCTGTCCCTGGCCCC
GCTCTTTTTCAAGCTGATGACGTCCTCCACCAACAACTGGGTCCTCATCAAGATCATCAAGCTGTTCGGTGCTCTTACTCCTTTGGAACC
GCGGCTGGGCAAGAAGCTGATCGAGCCCCTCACCAATCTCATCCACAGCACGTCTGCCATGTCTCTCCTCTATGAATGTGTGAACACCGT
GATTGCAGTGCTCATCTCGCTGTCCTCCGGCATGCCCAACCACAGCGCCAGCATCCAGCTTTGTGTTCAGAAATTAAGGATATTGATCGA
GGACTCCGATCAGAACTTGAAGTACCTGGGGCTGCTGGCAATGTCCAAGATCCTGAAGACCCACCCCAAGTCCGTGCAGTCCCACAAGGA
CCTCATCCTGCAGTGCCTGGACGACAAGGACGAGTCCATCCGGCTGCGGGCCCTGGACCTGCTCTATGGGATGGTGTCCAAGAAGAACCT
GATGGAGATCGTGAAGAAGCTGATGACCCACGTAGACAAGGCAGAGGGTACCACCTACCGTGACGAGCTGCTCACCAAGATCATTGACAT
CTGCAGCCAGTCCAACTACCAGTACATCACCAACTTCGAGTGGTACATCAGCATCCTGGTGGAGCTGACCCGGCTGGAGGGCACACGGCA
CGGCCACCTCATCGCCGCCCAAATGCTGGACGTGGCCATCCGCGTGAAGGCCATCCGCAAGTTCGCCGTGTCCCAGATGTCTGCGCTGCT
TGACAGTGCACACCTGCTGGCCAGCAGCACCCAGCGGAACGGGATCTGTGAGGTGCTGTACGCTGCCGCCTGGATCTGCGGGGAGTTCTC
AGAGCATCTGCAGGAACCACACCACACTTTGGAGGCCATGCTGCGGCCCAGAGTCACCACGCTGCCAGGCCACATCCAGGCCGTGTATGT
GCAGAACGTGGTCAAGCTCTACGCCTCCATCCTGCAGCAGAAGGAGCAGGCCGGGGAGGCAGAGGGCGCTCAGGCCGTCACCCAGCTCAT
GGTGGACCGGCTGCCCCAGTTTGTGCAGAGCGCAGACCTGGAGGTGCAGGAGCGGGCGTCCTGCATCCTGCAGCTGGTCAAGCACATCCA
GAAGCTTCAGGCCAAGGACGTGCCTGTGGCAGAGGAGGTCAGCGCTCTCTTTGCTGGGGAGCTGAACCCAGTGGCCCCCAAGGCCCAGAA
GAAGGTTCCAGTCCCCGAAGGCCTGGACCTGGACGCCTGGATCAATGAGCCACTCTCGGACAGCGAGTCAGAGGACGAGAGGCCCAGGGC
CGTCTTCCACGAGGAGGAGCAGCGGCGTCCCAAGCACCGGCCGTCGGAGGCGGACGAGGAAGAGCTGGCTCGGCGCCGAGAGGCCCGGAA
GCAGGAGCAGGCCAACAACCCCTTCTACATCAAGAGCTCGCCATCGCCACAGAAGCGGTACCAGGACACCCCGGGCGTGGAGCACATTCC
CGTGGTGCAGATTGACCTCTCCGTCCCCTTGAAGGTTCCAGGGCTGCCTATGTCAGATCAGTATGTGAAGCTGGAGGAGGAGCGGCGGCA
CCGGCAGAAGCTGGAGAAGGACAAGAGGAGGAAAAAGAGGAAGGAGAAGGAGAAGAAGGGCAAGCGCCGCCACAGCTCGCTGCCCACGGA
GAGCGACGAGGACATCGCCCCTGCCCAGCAGGTGGACATCGTCACAGAGGAGATGCCTGAGAATGCTCTGCCCAGCGACGAGGATGACAA
AGACCCCAACGACCCCTACAGGGCTCTGGATATTGACCTGGATAAGCCCTTAGCCGACAGCGAGAAACTGCCTATTCAGAAACACAGAAA
CACCGAGACCTCAAAATCCCCTGAGAAGGACGTTCCCATGGTAGAAAAGAAGAGCAAGAAACCCAAGAAGAAAGAGAAAAAACACAAAGA
GAAAGAGAGAGACAAGGAGAAGAAGAAGGAGAAGGAGAAGAAGAAATCTCCCAAGCCTAAGAAGAAGAAGCACAGGAAGGAGAAGGAGGA
GCGGACCAAAGGCAAGAAGAAGTCCAAGAAGCAGCCTCCAGGCAGCGAGGAGGCAGCGGGGGAGCCGGTGCAGAATGGCGCGCCAGAGGA
GGAGCAGCTCCCGCCTGAGTCCAGCTACTCCCTCCTCGCTGAAAATTCCTATGTTAAAATGACCTGTGACATCCGGGGCAGTCTGCAGGA
GGACAGCCAGGTCACTGTGGCCATCGTGCTGGAGAACAGGAGCAGCAGCATCCTCAAGGGCATGGAGCTCAGCGTGCTGGACTCACTCAA
TGCCAGGATGGCCCGGCCGCAGGGCTCCTCCGTCCACGATGGCGTCCCCGTGCCTTTCCAGCTGCCCCCAGGCGTCTCCAACGAAGCCCA
GTATGTGTTCACCATCCAGAGCATCGTCATGGCGCAGAAGCTCAAGGGGACCCTGTCCTTCATTGCCAAGAATGACGAGGGTGCGACCCA
CGAGAAGCTGGACTTCAGGCTGCACTTCAGCTGCAGCTCCTACTTGATCACCACTCCCTGCTACAGTGACGCCTTTGCTAAGTTGCTGGA
GTCTGGGGACTTGAGCATGAGCTCAATCAAAGTCGATGGCATTCGGATGTCCTTCCAGAATCTTCTGGCGAAGATCTGTTTTCACCACCA
TTTTTCCGTTGTGGAGCGAGTGGACTCCTGCGCCTCCATGTACAGCCGCTCCATCCAGGGCCACCATGTCTGCCTCCTGGTGAAAAAGTC
CTACACCACATATGGCCTGGAGCTGACGCGCGTCACGGTGGTCGACACGGACGTGCACGTGGTTTATGACACCTTCGTGAAGCCTGACAA
CGAGATCGTGGACTACAACACCAGGTTTTCGGGGGTGACGGAGGCTGACCTTGCCGACACAAGTGTCACGCTGCGTGACGTCCAGGCCGT
TCTGCTGAGCATGTTCAGCGCTGACACCATCCTCATCGGACACAGCCTGGAGAGCGACCTCCTGGCCCTGAAGGTCATCCACAGCACCGT
GGTGGACACGTCTGTGCTCTTCCCCCACCGCCTGGGCCTCCCCTACAAGCGGTCCCTGCGGAACCTCATGGCCGACTACCTCAGACAGAT
CATCCAGGACAATGTGGATGGGCACAGCTCCAGCGAGGACGCCGGCGCCTGCATGCACCTGGTGATCTGGAAGGTTCGAGAAGACGCCAA
GACCAAGCGATGACGCCTGCCCGCCTCCCACCCGCCTCTCCTGCCGTCCCGCTGGTCCTTAGCCCCATGCCTCTTCCAAAACAGTGCAAT
AAATCTCCGGTAACCCGTCCACCTGGCCGAGGCAGCCCAGAGCAGCGGGATGAGCTGGCGGCCAGAGAACGCCCAGCCCAGCCCACCCCC
GCTCACGTCCTGCCGCACCCCTCCGCCGGTCCCCACCCTCTGCCCCCCAGCCCTCTGGCGTCTCTAGAAACTGCTGCTGATGGAGACCAG
GACAGAGCCCGCCCCCACCCGGCCCGCAGCCCCTCCTGCCCCTCCTGCCAGGCCCTCCCTCCCGGGTGGTGGCCGGCACCTCTGCTCCTC
ACGTGGGTCGCGGGGCGGGCTTGTCCCAGGTTTGTTTGAGACAGTCTTTTTTTATTTTGTATTGTTATTTTTATTATTTTTAATTTAAAC
CTGATGACTTGCACAGCATCTTTCCCACCGGGAAGAGCGGGCCTGCTGCCTTCCCTCCTGCGGGGTGGGGGTGGGACAGACCCCAGTGGG
GGCCAGGGCCACACTCAGCGCAGTGTGGGCTGGACCCGCCTCTGTGCTCCAGGGGCTGGCCGGTCTTCGGGGGCGCCGGCTTCCACCACT
CGCCGCCCCACCCCTGCACTCCGTGACCTCTGAGAACCCAGCTGGCCCCTCGTTAGCCCCAGGGTCTGCCGTCTAGAGGGAGCCCACCGG
CCTCCGGACACTGTCTTCCCGCTAGAGCCCCGTCCTCCTCTGCGGGGCTCTCCGGACCCCCTTCCCCCTCACCTCCAGGCAGGGACAGAA

>5283_5283_5_AP3D1-REXO1_AP3D1_chr19_2108686_ENST00000345016_REXO1_chr19_1816812_ENST00000170168_length(amino acids)=1315AA_BP=1
MPRRSGRQRGAPRAAGTGPGPPHGAQPRASRGKSPWAAAMALKMVKGSIDRMFDKNLQDLVRGIRNHKEDEAKYISQCIDEIKQELKQDN
IAVKANAVCKLTYLQMLGYDISWAAFNIIEVMSASKFTFKRIGYLAASQSFHEGTDVIMLTTNQIRKDLSSPSQYDTGVALTGLSCFVTP
DLARDLANDIMTLMSHTKPYIRKKAVLIMYKVFLKYPESLRPAFPRLKEKLEDPDPGVQSAAVNVICELARRNPKNYLSLAPLFFKLMTS
STNNWVLIKIIKLFGALTPLEPRLGKKLIEPLTNLIHSTSAMSLLYECVNTVIAVLISLSSGMPNHSASIQLCVQKLRILIEDSDQNLKY
LGLLAMSKILKTHPKSVQSHKDLILQCLDDKDESIRLRALDLLYGMVSKKNLMEIVKKLMTHVDKAEGTTYRDELLTKIIDICSQSNYQY
ITNFEWYISILVELTRLEGTRHGHLIAAQMLDVAIRVKAIRKFAVSQMSALLDSAHLLASSTQRNGICEVLYAAAWICGEFSEHLQEPHH
TLEAMLRPRVTTLPGHIQAVYVQNVVKLYASILQQKEQAGEAEGAQAVTQLMVDRLPQFVQSADLEVQERASCILQLVKHIQKLQAKDVP
VAEEVSALFAGELNPVAPKAQKKVPVPEGLDLDAWINEPLSDSESEDERPRAVFHEEEQRRPKHRPSEADEEELARRREARKQEQANNPF
YIKSSPSPQKRYQDTPGVEHIPVVQIDLSVPLKVPGLPMSDQYVKLEEERRHRQKLEKDKRRKKRKEKEKKGKRRHSSLPTESDEDIAPA
QQVDIVTEEMPENALPSDEDDKDPNDPYRALDIDLDKPLADSEKLPIQKHRNTETSKSPEKDVPMVEKKSKKPKKKEKKHKEKERDKEKK
KEKEKKKSPKPKKKKHRKEKEERTKGKKKSKKQPPGSEEAAGEPVQNGAPEEEQLPPESSYSLLAENSYVKMTCDIRGSLQEDSQVTVAI
VLENRSSSILKGMELSVLDSLNARMARPQGSSVHDGVPVPFQLPPGVSNEAQYVFTIQSIVMAQKLKGTLSFIAKNDEGATHEKLDFRLH
FSCSSYLITTPCYSDAFAKLLESGDLSMSSIKVDGIRMSFQNLLAKICFHHHFSVVERVDSCASMYSRSIQGHHVCLLVKKSYTTYGLEL
TRVTVVDTDVHVVYDTFVKPDNEIVDYNTRFSGVTEADLADTSVTLRDVQAVLLSMFSADTILIGHSLESDLLALKVIHSTVVDTSVLFP

--------------------------------------------------------------
>5283_5283_6_AP3D1-REXO1_AP3D1_chr19_2108686_ENST00000350812_REXO1_chr19_1816812_ENST00000170168_length(transcript)=4347nt_BP=3065nt
GGGGCCCGGAGCGGGATCGCGGCACCTGCCGAGCGGGTCGCCGCCTCTGCCGCGGTCCTTGGACCCCGCCGCCGCCCTGGCCTGGGAGCT
TGCCCCGCCGCAGCGGCCGGCAGCGCGGCGCTCCGCGGGCGGCAGGCACGGGCCCCGGGCCCCCTCACGGCGCCCAGCCGCGGGCCTCCC
GAGGCAAAAGCCCGTGGGCCGCCGCGATGGCCCTCAAGATGGTGAAGGGCAGCATCGACCGCATGTTCGACAAGAATCTGCAGGACTTGG
TCCGCGGCATCCGTAACCACAAGGAGGACGAGGCAAAATACATATCTCAGTGCATTGATGAGATCAAGCAGGAGCTGAAGCAGGACAACA
TAGCGGTGAAGGCGAACGCGGTCTGCAAGCTGACGTATTTACAGATGTTGGGATACGACATCAGCTGGGCCGCCTTCAACATCATAGAAG
TGATGAGTGCCTCCAAGTTCACCTTCAAGCGAATTGGCTACCTCGCTGCTTCCCAGAGCTTTCACGAAGGCACCGACGTCATCATGCTGA
CCACCAATCAGATCCTGCTCATCTCGCTGTCCTCCGGCATGCCCAACCACAGCGCCAGCATCCAGCTTTGTGTTCAGAAATTAAGGATAT
TGATCGAGGACTCCGATCAGAACTTGAAGTACCTGGGGCTGCTGGCAATGTCCAAGATCCTGAAGACCCACCCCAAGTCCGTGCAGTCCC
ACAAGGACCTCATCCTGCAGTGCCTGGACGACAAGGACGAGTCCATCCGGCTGCGGGCCCTGGACCTGCTCTATGGGATGGTGTCCAAGA
AGAACCTGATGGAGATCGTGAAGAAGCTGATGACCCACGTAGACAAGGCAGAGGGTACCACCTACCGTGACGAGCTGCTCACCAAGATCA
TTGACATCTGCAGCCAGTCCAACTACCAGTACATCACCAACTTCGAGTGGTACATCAGCATCCTGGTGGAGCTGACCCGGCTGGAGGGCA
CACGGCACGGCCACCTCATCGCCGCCCAAATGCTGGACGTGGCCATCCGCGTGAAGGCCATCCGCAAGTTCGCCGTGTCCCAGATGTCTG
CGCTGCTTGACAGTGCACACCTGCTGGCCAGCAGCACCCAGCGGAACGGGATCTGTGAGGTGCTGTACGCTGCCGCCTGGATCTGCGGGG
AGTTCTCAGAGCATCTGCAGGAACCACACCACACTTTGGAGGCCATGCTGCGGCCCAGAGTCACCACGCTGCCAGGCCACATCCAGGCCG
TGTATGTGCAGAACGTGGTCAAGCTCTACGCCTCCATCCTGCAGCAGAAGGAGCAGGCCGGGGAGGCAGAGGGCGCTCAGGCCGTCACCC
AGCTCATGGTGGACCGGCTGCCCCAGTTTGTGCAGAGCGCAGACCTGGAGGTGCAGGAGCGGGCGTCCTGCATCCTGCAGCTGGTCAAGC
ACATCCAGAAGCTTCAGGCCAAGGACGTGCCTGTGGCAGAGGAGGTCAGCGCTCTCTTTGCTGGGGAGCTGAACCCAGTGGCCCCCAAGG
CCCAGAAGAAGGTTCCAGTCCCCGAAGGCCTGGACCTGGACGCCTGGATCAATGAGCCACTCTCGGACAGCGAGTCAGAGGACGAGAGGC
CCAGGGCCGTCTTCCACGAGGAGGAGCAGCGGCGTCCCAAGCACCGGCCGTCGGAGGCGGACGAGGAAGAGCTGGCTCGGCGCCGAGAGG
CCCGGAAGCAGGAGCAGGCCAACAACCCCTTCTACATCAAGAGCTCGCCATCGCCACAGAAGCGGTACCAGGACACCCCGGGCGTGGAGC
ACATTCCCGTGGTGCAGATTGACCTCTCCGTCCCCTTGAAGGTTCCAGGGCTGCCTATGTCAGATCAGTATGTGAAGCTGGAGGAGGAGC
GGCGGCACCGGCAGAAGCTGGAGAAGGACAAGAGGAGGAAAAAGAGGAAGGAGAAGGAGAAGAAGGGCAAGCGCCGCCACAGCTCGCTGC
CCACGGAGAGCGACGAGGACATCGCCCCTGCCCAGCAGGTGGACATCGTCACAGAGGAGATGCCTGAGAATGCTCTGCCCAGCGACGAGG
ATGACAAAGACCCCAACGACCCCTACAGGGCTCTGGATATTGACCTGGATAAGCCCTTAGCCGACAGCGAGAAACTGCCTATTCAGAAAC
ACAGAAACACCGAGACCTCAAAATCCCCTGAGAAGGACGTTCCCATGGTAGAAAAGAAGAGCAAGAAACCCAAGAAGAAAGAGAAAAAAC
ACAAAGAGAAAGAGAGAGACAAGGAGAAGAAGAAGGAGAAGGAGAAGAAGAAATCTCCCAAGCCTAAGAAGAAGAAGCACAGGAAGGAGA
AGGAGGAGCGGACCAAAGGCAAGAAGAAGTCCAAGAAGCAGCCTCCAGGCAGCGAGGAGGCAGCGGGGGAGCCGGTGCAGAATGGCGCGC
CAGAGGAGGAGCAGCTCCCGCCTGAGTCCAGCTACTCCCTCCTCGCTGAAAATTCCTATGTTAAAATGACCTGTGACATCCGGGGCAGTC
TGCAGGAGGACAGCCAGGTCACTGTGGCCATCGTGCTGGAGAACAGGAGCAGCAGCATCCTCAAGGGCATGGAGCTCAGCGTGCTGGACT
CACTCAATGCCAGGATGGCCCGGCCGCAGGGCTCCTCCGTCCACGATGGCGTCCCCGTGCCTTTCCAGCTGCCCCCAGGCGTCTCCAACG
AAGCCCAGTATGTGTTCACCATCCAGAGCATCGTCATGGCGCAGAAGCTCAAGGGGACCCTGTCCTTCATTGCCAAGAATGACGAGGGTG
CGACCCACGAGAAGCTGGACTTCAGGCTGCACTTCAGCTGCAGCTCCTACTTGATCACCACTCCCTGCTACAGTGACGCCTTTGCTAAGT
TGCTGGAGTCTGGGGACTTGAGCATGAGCTCAATCAAAGTCGATGGCATTCGGATGTCCTTCCAGAATCTTCTGGCGAAGATCTGTTTTC
ACCACCATTTTTCCGTTGTGGAGCGAGTGGACTCCTGCGCCTCCATGTACAGCCGCTCCATCCAGGGCCACCATGTCTGCCTCCTGGTGA
AAAAGTCCTACACCACATATGGCCTGGAGCTGACGCGCGTCACGGTGGTCGACACGGACGTGCACGTGGTTTATGACACCTTCGTGAAGC
CTGACAACGAGATCGTGGACTACAACACCAGGTTTTCGGGGGTGACGGAGGCTGACCTTGCCGACACAAGTGTCACGCTGCGTGACGTCC
AGGCCGTTCTGCTGAGCATGTTCAGCGCTGACACCATCCTCATCGGACACAGCCTGGAGAGCGACCTCCTGGCCCTGAAGGTCATCCACA
GCACCGTGGTGGACACGTCTGTGCTCTTCCCCCACCGCCTGGGCCTCCCCTACAAGCGGTCCCTGCGGAACCTCATGGCCGACTACCTCA
GACAGATCATCCAGGACAATGTGGATGGGCACAGCTCCAGCGAGGACGCCGGCGCCTGCATGCACCTGGTGATCTGGAAGGTTCGAGAAG
ACGCCAAGACCAAGCGATGACGCCTGCCCGCCTCCCACCCGCCTCTCCTGCCGTCCCGCTGGTCCTTAGCCCCATGCCTCTTCCAAAACA
GTGCAATAAATCTCCGGTAACCCGTCCACCTGGCCGAGGCAGCCCAGAGCAGCGGGATGAGCTGGCGGCCAGAGAACGCCCAGCCCAGCC
CACCCCCGCTCACGTCCTGCCGCACCCCTCCGCCGGTCCCCACCCTCTGCCCCCCAGCCCTCTGGCGTCTCTAGAAACTGCTGCTGATGG
AGACCAGGACAGAGCCCGCCCCCACCCGGCCCGCAGCCCCTCCTGCCCCTCCTGCCAGGCCCTCCCTCCCGGGTGGTGGCCGGCACCTCT
GCTCCTCACGTGGGTCGCGGGGCGGGCTTGTCCCAGGTTTGTTTGAGACAGTCTTTTTTTATTTTGTATTGTTATTTTTATTATTTTTAA
TTTAAACCTGATGACTTGCACAGCATCTTTCCCACCGGGAAGAGCGGGCCTGCTGCCTTCCCTCCTGCGGGGTGGGGGTGGGACAGACCC
CAGTGGGGGCCAGGGCCACACTCAGCGCAGTGTGGGCTGGACCCGCCTCTGTGCTCCAGGGGCTGGCCGGTCTTCGGGGGCGCCGGCTTC
CACCACTCGCCGCCCCACCCCTGCACTCCGTGACCTCTGAGAACCCAGCTGGCCCCTCGTTAGCCCCAGGGTCTGCCGTCTAGAGGGAGC
CCACCGGCCTCCGGACACTGTCTTCCCGCTAGAGCCCCGTCCTCCTCTGCGGGGCTCTCCGGACCCCCTTCCCCCTCACCTCCAGGCAGG

>5283_5283_6_AP3D1-REXO1_AP3D1_chr19_2108686_ENST00000350812_REXO1_chr19_1816812_ENST00000170168_length(amino acids)=1146AA_BP=1
MPRRSGRQRGAPRAAGTGPGPPHGAQPRASRGKSPWAAAMALKMVKGSIDRMFDKNLQDLVRGIRNHKEDEAKYISQCIDEIKQELKQDN
IAVKANAVCKLTYLQMLGYDISWAAFNIIEVMSASKFTFKRIGYLAASQSFHEGTDVIMLTTNQILLISLSSGMPNHSASIQLCVQKLRI
LIEDSDQNLKYLGLLAMSKILKTHPKSVQSHKDLILQCLDDKDESIRLRALDLLYGMVSKKNLMEIVKKLMTHVDKAEGTTYRDELLTKI
IDICSQSNYQYITNFEWYISILVELTRLEGTRHGHLIAAQMLDVAIRVKAIRKFAVSQMSALLDSAHLLASSTQRNGICEVLYAAAWICG
EFSEHLQEPHHTLEAMLRPRVTTLPGHIQAVYVQNVVKLYASILQQKEQAGEAEGAQAVTQLMVDRLPQFVQSADLEVQERASCILQLVK
HIQKLQAKDVPVAEEVSALFAGELNPVAPKAQKKVPVPEGLDLDAWINEPLSDSESEDERPRAVFHEEEQRRPKHRPSEADEEELARRRE
ARKQEQANNPFYIKSSPSPQKRYQDTPGVEHIPVVQIDLSVPLKVPGLPMSDQYVKLEEERRHRQKLEKDKRRKKRKEKEKKGKRRHSSL
PTESDEDIAPAQQVDIVTEEMPENALPSDEDDKDPNDPYRALDIDLDKPLADSEKLPIQKHRNTETSKSPEKDVPMVEKKSKKPKKKEKK
HKEKERDKEKKKEKEKKKSPKPKKKKHRKEKEERTKGKKKSKKQPPGSEEAAGEPVQNGAPEEEQLPPESSYSLLAENSYVKMTCDIRGS
LQEDSQVTVAIVLENRSSSILKGMELSVLDSLNARMARPQGSSVHDGVPVPFQLPPGVSNEAQYVFTIQSIVMAQKLKGTLSFIAKNDEG
ATHEKLDFRLHFSCSSYLITTPCYSDAFAKLLESGDLSMSSIKVDGIRMSFQNLLAKICFHHHFSVVERVDSCASMYSRSIQGHHVCLLV
KKSYTTYGLELTRVTVVDTDVHVVYDTFVKPDNEIVDYNTRFSGVTEADLADTSVTLRDVQAVLLSMFSADTILIGHSLESDLLALKVIH

--------------------------------------------------------------
>5283_5283_7_AP3D1-REXO1_AP3D1_chr19_2108686_ENST00000355272_REXO1_chr19_1816812_ENST00000170168_length(transcript)=5041nt_BP=3759nt
AGGGGCCCGGAGCGGGATCGCGGCACCTGCCGAGCGGGTCGCCGCCTCTGCCGCGGTCCTTGGACCCCGCCGCCGCCCTGGCCTGGGAGC
TTGCCCCGCCGCAGCGGCCGGCAGCGCGGCGCTCCGCGGGCGGCAGGCACGGGCCCCGGGCCCCCTCACGGCGCCCAGCCGCGGGCCTCC
CGAGGCAAAAGCCCGTGGGCCGCCGCGATGGCCCTCAAGATGGTGAAGGGCAGCATCGACCGCATGTTCGACAAGAATCTGCAGGACTTG
GTCCGCGGCATCCGTAACCACAAGGAGGACGAGGCAAAATACATATCTCAGTGCATTGATGAGATCAAGCAGGAGCTGAAGCAGGACAAC
ATAGCGGTGAAGGCGAACGCGGTCTGCAAGCTGACGTATTTACAGATGTTGGGATACGACATCAGCTGGGCCGCCTTCAACATCATAGAA
GTGATGAGTGCCTCCAAGTTCACCTTCAAGCGAATTGGCTACCTCGCTGCTTCCCAGAGCTTTCACGAAGGCACCGACGTCATCATGCTG
ACCACCAATCAGATCCGTAAGGACTTGAGCAGCCCCAGCCAGTACGACACAGGTGTTGCACTGACGGGTCTGTCCTGCTTCGTCACCCCA
GACCTTGCCAGAGACCTGGCAAATGACATCATGACACTGATGTCACACACCAAGCCCTACATCAGGAAGAAGGCTGTGCTGATCATGTAC
AAGGTGTTCCTGAAGTACCCCGAGTCGCTGCGCCCTGCCTTTCCCCGGCTGAAGGAGAAGCTGGAGGACCCCGACCCCGGGGTTCAGTCG
GCTGCCGTCAATGTCATCTGCGAGCTGGCCAGACGCAACCCTAAGAACTACCTGTCCCTGGCCCCGCTCTTTTTCAAGCTGATGACGTCC
TCCACCAACAACTGGGTCCTCATCAAGATCATCAAGCTGTTCGGTGCTCTTACTCCTTTGGAACCGCGGCTGGGCAAGAAGCTGATCGAG
CCCCTCACCAATCTCATCCACAGCACGTCTGCCATGTCTCTCCTCTATGAATGTGTGAACACCGTGATTGCAGTGCTCATCTCGCTGTCC
TCCGGCATGCCCAACCACAGCGCCAGCATCCAGCTTTGTGTTCAGAAATTAAGGATATTGATCGAGGACTCCGATCAGAACTTGAAGTAC
CTGGGGCTGCTGGCAATGTCCAAGATCCTGAAGACCCACCCCAAGTCCGTGCAGTCCCACAAGGACCTCATCCTGCAGTGCCTGGACGAC
AAGGACGAGTCCATCCGGCTGCGGGCCCTGGACCTGCTCTATGGGATGGTGTCCAAGAAGAACCTGATGGAGATCGTGAAGAAGCTGATG
ACCCACGTAGACAAGGCAGAGGGTACCACCTACCGTGACGAGCTGCTCACCAAGATCATTGACATCTGCAGCCAGTCCAACTACCAGTAC
ATCACCAACTTCGAGTGGTACATCAGCATCCTGGTGGAGCTGACCCGGCTGGAGGGCACACGGCACGGCCACCTCATCGCCGCCCAAATG
CTGGACGTGGCCATCCGCGTGAAGGCCATCCGCAAGTTCGCCGTGTCCCAGATGTCTGCGCTGCTTGACAGTGCACACCTGCTGGCCAGC
AGCACCCAGCGGAACGGGATCTGTGAGGTGCTGTACGCTGCCGCCTGGATCTGCGGGGAGTTCTCAGAGCATCTGCAGGAACCACACCAC
ACTTTGGAGGCCATGCTGCGGCCCAGAGTCACCACGCTGCCAGGCCACATCCAGGCCGTGTATGTGCAGAACGTGGTCAAGCTCTACGCC
TCCATCCTGCAGCAGAAGGAGCAGGCCGGGGAGGCAGAGGGCGCTCAGGCCGTCACCCAGCTCATGGTGGACCGGCTGCCCCAGTTTGTG
CAGAGCGCAGACCTGGAGGTGCAGGAGCGGGCGTCCTGCATCCTGCAGCTGGTCAAGCACATCCAGAAGCTTCAGGCCAAGGACGTGCCT
GTGGCAGAGGAGGTCAGCGCTCTCTTTGCTGGGGAGCTGAACCCAGTGGCCCCCAAGGCCCAGAAGAAGGTTCCAGTCCCCGAAGGCCTG
GACCTGGACGCCTGGATCAATGAGCCACTCTCGGACAGCGAGTCAGAGGACGAGAGGCCCAGGGCCGTCTTCCACGAGGAGGAGCAGCGG
CGTCCCAAGCACCGGCCGTCGGAGGCGGACGAGGAAGAGCTGGCTCGGCGCCGAGAGGCCCGGAAGCAGGAGCAGGCCAACAACCCCTTC
TACATCAAGAGCTCGCCATCGCCACAGAAGCGGTACCAGGACACCCCGGGCGTGGAGCACATTCCCGTGGTGCAGATTGACCTCTCCGTC
CCCTTGAAGGTTCCAGGGCTGCCTATGTCAGATCAGTATGTGAAGCTGGAGGAGGAGCGGCGGCACCGGCAGAAGCTGGAGAAGGACAAG
AGGAGGAAAAAGAGGAAGGAGAAGGAGAAGAAGGGCAAGCGCCGCCACAGCTCGCTGCCCACGGAGAGCGACGAGGACATCGCCCCTGCC
CAGCAGGTGGACATCGTCACAGAGGAGATGCCTGAGAATGCTCTGCCCAGCGACGAGGATGACAAAGACCCCAACGACCCCTACAGGGCT
CTGGATATTGACCTGGATAAGCCCTTAGCCGACAGCGAGAAACTGCCTATTCAGAAACACAGAAACACCGAGACCTCAAAATCCCCTGAG
AAGGACGTTCCCATGGTAGAAAAGAAGAGCAAGAAACCCAAGAAGAAAGAGAAAAAACACAAAGAGAAAGAGAGAGACAAGGAGAAGAAG
AAGGAGAAGGAGAAGAAGGCTGAGGACCTGGACTTCTGGCTGTCTACCACCCCACCGCCTGCCCCCGCCCCCGCCCCCGCCCCCGTTCCA
TCCACGGGGGAGCTCAGTGTGAACACTGTCACTACCCCGAAGGACGAGTGTGAGGACGCCAAGACGGAGGCGCAGGGCGAGGAGGACGAT
GCCGAGGGGCAAGACCAGGACAAGAAATCTCCCAAGCCTAAGAAGAAGAAGCACAGGAAGGAGAAGGAGGAGCGGACCAAAGGCAAGAAG
AAGTCCAAGAAGCAGCCTCCAGGCAGCGAGGAGGCAGCGGGGGAGCCGGTGCAGAATGGCGCGCCAGAGGAGGAGCAGCTCCCGCCTGAG
TCCAGCTACTCCCTCCTCGCTGAAAATTCCTATGTTAAAATGACCTGTGACATCCGGGGCAGTCTGCAGGAGGACAGCCAGGTCACTGTG
GCCATCGTGCTGGAGAACAGGAGCAGCAGCATCCTCAAGGGCATGGAGCTCAGCGTGCTGGACTCACTCAATGCCAGGATGGCCCGGCCG
CAGGGCTCCTCCGTCCACGATGGCGTCCCCGTGCCTTTCCAGCTGCCCCCAGGCGTCTCCAACGAAGCCCAGTATGTGTTCACCATCCAG
AGCATCGTCATGGCGCAGAAGCTCAAGGGGACCCTGTCCTTCATTGCCAAGAATGACGAGGGTGCGACCCACGAGAAGCTGGACTTCAGG
CTGCACTTCAGCTGCAGCTCCTACTTGATCACCACTCCCTGCTACAGTGACGCCTTTGCTAAGTTGCTGGAGTCTGGGGACTTGAGCATG
AGCTCAATCAAAGTCGATGGCATTCGGATGTCCTTCCAGAATCTTCTGGCGAAGATCTGTTTTCACCACCATTTTTCCGTTGTGGAGCGA
GTGGACTCCTGCGCCTCCATGTACAGCCGCTCCATCCAGGGCCACCATGTCTGCCTCCTGGTGAAAAAGTCCTACACCACATATGGCCTG
GAGCTGACGCGCGTCACGGTGGTCGACACGGACGTGCACGTGGTTTATGACACCTTCGTGAAGCCTGACAACGAGATCGTGGACTACAAC
ACCAGGTTTTCGGGGGTGACGGAGGCTGACCTTGCCGACACAAGTGTCACGCTGCGTGACGTCCAGGCCGTTCTGCTGAGCATGTTCAGC
GCTGACACCATCCTCATCGGACACAGCCTGGAGAGCGACCTCCTGGCCCTGAAGGTCATCCACAGCACCGTGGTGGACACGTCTGTGCTC
TTCCCCCACCGCCTGGGCCTCCCCTACAAGCGGTCCCTGCGGAACCTCATGGCCGACTACCTCAGACAGATCATCCAGGACAATGTGGAT
GGGCACAGCTCCAGCGAGGACGCCGGCGCCTGCATGCACCTGGTGATCTGGAAGGTTCGAGAAGACGCCAAGACCAAGCGATGACGCCTG
CCCGCCTCCCACCCGCCTCTCCTGCCGTCCCGCTGGTCCTTAGCCCCATGCCTCTTCCAAAACAGTGCAATAAATCTCCGGTAACCCGTC
CACCTGGCCGAGGCAGCCCAGAGCAGCGGGATGAGCTGGCGGCCAGAGAACGCCCAGCCCAGCCCACCCCCGCTCACGTCCTGCCGCACC
CCTCCGCCGGTCCCCACCCTCTGCCCCCCAGCCCTCTGGCGTCTCTAGAAACTGCTGCTGATGGAGACCAGGACAGAGCCCGCCCCCACC
CGGCCCGCAGCCCCTCCTGCCCCTCCTGCCAGGCCCTCCCTCCCGGGTGGTGGCCGGCACCTCTGCTCCTCACGTGGGTCGCGGGGCGGG
CTTGTCCCAGGTTTGTTTGAGACAGTCTTTTTTTATTTTGTATTGTTATTTTTATTATTTTTAATTTAAACCTGATGACTTGCACAGCAT
CTTTCCCACCGGGAAGAGCGGGCCTGCTGCCTTCCCTCCTGCGGGGTGGGGGTGGGACAGACCCCAGTGGGGGCCAGGGCCACACTCAGC
GCAGTGTGGGCTGGACCCGCCTCTGTGCTCCAGGGGCTGGCCGGTCTTCGGGGGCGCCGGCTTCCACCACTCGCCGCCCCACCCCTGCAC
TCCGTGACCTCTGAGAACCCAGCTGGCCCCTCGTTAGCCCCAGGGTCTGCCGTCTAGAGGGAGCCCACCGGCCTCCGGACACTGTCTTCC
CGCTAGAGCCCCGTCCTCCTCTGCGGGGCTCTCCGGACCCCCTTCCCCCTCACCTCCAGGCAGGGACAGAATAAATGTTTGTATGGATTT

>5283_5283_7_AP3D1-REXO1_AP3D1_chr19_2108686_ENST00000355272_REXO1_chr19_1816812_ENST00000170168_length(amino acids)=1377AA_BP=1
MPRRSGRQRGAPRAAGTGPGPPHGAQPRASRGKSPWAAAMALKMVKGSIDRMFDKNLQDLVRGIRNHKEDEAKYISQCIDEIKQELKQDN
IAVKANAVCKLTYLQMLGYDISWAAFNIIEVMSASKFTFKRIGYLAASQSFHEGTDVIMLTTNQIRKDLSSPSQYDTGVALTGLSCFVTP
DLARDLANDIMTLMSHTKPYIRKKAVLIMYKVFLKYPESLRPAFPRLKEKLEDPDPGVQSAAVNVICELARRNPKNYLSLAPLFFKLMTS
STNNWVLIKIIKLFGALTPLEPRLGKKLIEPLTNLIHSTSAMSLLYECVNTVIAVLISLSSGMPNHSASIQLCVQKLRILIEDSDQNLKY
LGLLAMSKILKTHPKSVQSHKDLILQCLDDKDESIRLRALDLLYGMVSKKNLMEIVKKLMTHVDKAEGTTYRDELLTKIIDICSQSNYQY
ITNFEWYISILVELTRLEGTRHGHLIAAQMLDVAIRVKAIRKFAVSQMSALLDSAHLLASSTQRNGICEVLYAAAWICGEFSEHLQEPHH
TLEAMLRPRVTTLPGHIQAVYVQNVVKLYASILQQKEQAGEAEGAQAVTQLMVDRLPQFVQSADLEVQERASCILQLVKHIQKLQAKDVP
VAEEVSALFAGELNPVAPKAQKKVPVPEGLDLDAWINEPLSDSESEDERPRAVFHEEEQRRPKHRPSEADEEELARRREARKQEQANNPF
YIKSSPSPQKRYQDTPGVEHIPVVQIDLSVPLKVPGLPMSDQYVKLEEERRHRQKLEKDKRRKKRKEKEKKGKRRHSSLPTESDEDIAPA
QQVDIVTEEMPENALPSDEDDKDPNDPYRALDIDLDKPLADSEKLPIQKHRNTETSKSPEKDVPMVEKKSKKPKKKEKKHKEKERDKEKK
KEKEKKAEDLDFWLSTTPPPAPAPAPAPVPSTGELSVNTVTTPKDECEDAKTEAQGEEDDAEGQDQDKKSPKPKKKKHRKEKEERTKGKK
KSKKQPPGSEEAAGEPVQNGAPEEEQLPPESSYSLLAENSYVKMTCDIRGSLQEDSQVTVAIVLENRSSSILKGMELSVLDSLNARMARP
QGSSVHDGVPVPFQLPPGVSNEAQYVFTIQSIVMAQKLKGTLSFIAKNDEGATHEKLDFRLHFSCSSYLITTPCYSDAFAKLLESGDLSM
SSIKVDGIRMSFQNLLAKICFHHHFSVVERVDSCASMYSRSIQGHHVCLLVKKSYTTYGLELTRVTVVDTDVHVVYDTFVKPDNEIVDYN
TRFSGVTEADLADTSVTLRDVQAVLLSMFSADTILIGHSLESDLLALKVIHSTVVDTSVLFPHRLGLPYKRSLRNLMADYLRQIIQDNVD

--------------------------------------------------------------
>5283_5283_8_AP3D1-REXO1_AP3D1_chr19_2108686_ENST00000356926_REXO1_chr19_1816812_ENST00000170168_length(transcript)=4748nt_BP=3466nt
TTGTCCGCCATTTGCAAGGGGCCCGGAGCGGGATCGCGGCACCTGCCGAGCGGGTCGCCGCCTCTGCCGCGGTCCTTGGACCCCGCCGCC
GCCCTGGCCTGGGAGCTTGCCCCGCCGCAGCGGCCGGCAGCGCGGCGCTCCGCGGGCGGCAGGCACGGGCCCCGGGCCCCCTCACGGCGC
CCAGCCGCGGGCCTCCCGAGGCAAAAGCCCGTGGGCCGCCGCGATGGCCCTCAAGATGGTGAAGGGCAGCATCGACCGCATGTTCGACAA
GAATCTGCAGGACTTGGTCCGCGGCATCCGTAACCACAAGGAGGACGAGGCAAAATACATATCTCAGTGCATTGATGAGATCAAGCAGGA
GCTGAAGCAGGACAACATAGCGGTGAAGGCGAACGCGGTCTGCAAGCTGACGTATTTACAGATGTTGGGATACGACATCAGCTGGGCCGC
CTTCAACATCATAGAAGTGATGAGTGCCTCCAAGTTCACCTTCAAGCGAATTGGCTACCTCGCTGCTTCCCAGAGCTTTCACGAAGGCAC
CGACGTCATCATGCTGACCACCAATCAGATCCGTAAGGACTTGAGCAGCCCCAGCCAGTACGACACAGGTGTTGCACTGACGGGTCTGTC
CTGCTTCGTCACCCCAGACCTTGCCAGAGACCTGGCAAATGACATCATGACACTGATGTCACACACCAAGCCCTACATCAGGAAGAAGGC
TAAGCTGATCGAGCCCCTCACCAATCTCATCCACAGCACGTCTGCCATGTCTCTCCTCTATGAATGTGTGAACACCGTGATTGCAGTGCT
CATCTCGCTGTCCTCCGGCATGCCCAACCACAGCGCCAGCATCCAGCTTTGTGTTCAGAAATTAAGGATATTGATCGAGGACTCCGATCA
GAACTTGAAGTACCTGGGGCTGCTGGCAATGTCCAAGATCCTGAAGACCCACCCCAAGTCCGTGCAGTCCCACAAGGACCTCATCCTGCA
GTGCCTGGACGACAAGGACGAGTCCATCCGGCTGCGGGCCCTGGACCTGCTCTATGGGATGGTGTCCAAGAAGAACCTGATGGAGATCGT
GAAGAAGCTGATGACCCACGTAGACAAGGCAGAGGGTACCACCTACCGTGACGAGCTGCTCACCAAGATCATTGACATCTGCAGCCAGTC
CAACTACCAGTACATCACCAACTTCGAGTGGTACATCAGCATCCTGGTGGAGCTGACCCGGCTGGAGGGCACACGGCACGGCCACCTCAT
CGCCGCCCAAATGCTGGACGTGGCCATCCGCGTGAAGGCCATCCGCAAGTTCGCCGTGTCCCAGATGTCTGCGCTGCTTGACAGTGCACA
CCTGCTGGCCAGCAGCACCCAGCGGAACGGGATCTGTGAGGTGCTGTACGCTGCCGCCTGGATCTGCGGGGAGTTCTCAGAGCATCTGCA
GGAACCACACCACACTTTGGAGGCCATGCTGCGGCCCAGAGTCACCACGCTGCCAGGCCACATCCAGGCCGTGTATGTGCAGAACGTGGT
CAAGCTCTACGCCTCCATCCTGCAGCAGAAGGAGCAGGCCGGGGAGGCAGAGGGCGCTCAGGCCGTCACCCAGCTCATGGTGGACCGGCT
GCCCCAGTTTGTGCAGAGCGCAGACCTGGAGGTGCAGGAGCGGGCGTCCTGCATCCTGCAGCTGGTCAAGCACATCCAGAAGCTTCAGGC
CAAGGACGTGCCTGTGGCAGAGGAGGTCAGCGCTCTCTTTGCTGGGGAGCTGAACCCAGTGGCCCCCAAGGCCCAGAAGAAGGTTCCAGT
CCCCGAAGGCCTGGACCTGGACGCCTGGATCAATGAGCCACTCTCGGACAGCGAGTCAGAGGACGAGAGGCCCAGGGCCGTCTTCCACGA
GGAGGAGCAGCGGCGTCCCAAGCACCGGCCGTCGGAGGCGGACGAGGAAGAGCTGGCTCGGCGCCGAGAGGCCCGGAAGCAGGAGCAGGC
CAACAACCCCTTCTACATCAAGAGCTCGCCATCGCCACAGAAGCGGTACCAGGACACCCCGGGCGTGGAGCACATTCCCGTGGTGCAGAT
TGACCTCTCCGTCCCCTTGAAGGTTCCAGGGCTGCCTATGTCAGATCAGTATGTGAAGCTGGAGGAGGAGCGGCGGCACCGGCAGAAGCT
GGAGAAGGACAAGAGGAGGAAAAAGAGGAAGGAGAAGGAGAAGAAGGGCAAGCGCCGCCACAGCTCGCTGCCCACGGAGAGCGACGAGGA
CATCGCCCCTGCCCAGCAGGTGGACATCGTCACAGAGGAGATGCCTGAGAATGCTCTGCCCAGCGACGAGGATGACAAAGACCCCAACGA
CCCCTACAGGGCTCTGGATATTGACCTGGATAAGCCCTTAGCCGACAGCGAGAAACTGCCTATTCAGAAACACAGAAACACCGAGACCTC
AAAATCCCCTGAGAAGGACGTTCCCATGGTAGAAAAGAAGAGCAAGAAACCCAAGAAGAAAGAGAAAAAACACAAAGAGAAAGAGAGAGA
CAAGGAGAAGAAGAAGGAGAAGGAGAAGAAGGCTGAGGACCTGGACTTCTGGCTGTCTACCACCCCACCGCCTGCCCCCGCCCCCGCCCC
CGCCCCCGTTCCATCCACGGACGAGTGTGAGGACGCCAAGACGGAGGCGCAGGGCGAGGAGGACGATGCCGAGGGGCAAGACCAGGACAA
GAAATCTCCCAAGCCTAAGAAGAAGAAGCACAGGAAGGAGAAGGAGGAGCGGACCAAAGGCAAGAAGAAGTCCAAGAAGCAGCCTCCAGG
CAGCGAGGAGGCAGCGGGGGAGCCGGTGCAGAATGGCGCGCCAGAGGAGGAGCAGCTCCCGCCTGAGTCCAGCTACTCCCTCCTCGCTGA
AAATTCCTATGTTAAAATGACCTGTGACATCCGGGGCAGTCTGCAGGAGGACAGCCAGGTCACTGTGGCCATCGTGCTGGAGAACAGGAG
CAGCAGCATCCTCAAGGGCATGGAGCTCAGCGTGCTGGACTCACTCAATGCCAGGATGGCCCGGCCGCAGGGCTCCTCCGTCCACGATGG
CGTCCCCGTGCCTTTCCAGCTGCCCCCAGGCGTCTCCAACGAAGCCCAGTATGTGTTCACCATCCAGAGCATCGTCATGGCGCAGAAGCT
CAAGGGGACCCTGTCCTTCATTGCCAAGAATGACGAGGGTGCGACCCACGAGAAGCTGGACTTCAGGCTGCACTTCAGCTGCAGCTCCTA
CTTGATCACCACTCCCTGCTACAGTGACGCCTTTGCTAAGTTGCTGGAGTCTGGGGACTTGAGCATGAGCTCAATCAAAGTCGATGGCAT
TCGGATGTCCTTCCAGAATCTTCTGGCGAAGATCTGTTTTCACCACCATTTTTCCGTTGTGGAGCGAGTGGACTCCTGCGCCTCCATGTA
CAGCCGCTCCATCCAGGGCCACCATGTCTGCCTCCTGGTGAAAAAGTCCTACACCACATATGGCCTGGAGCTGACGCGCGTCACGGTGGT
CGACACGGACGTGCACGTGGTTTATGACACCTTCGTGAAGCCTGACAACGAGATCGTGGACTACAACACCAGGTTTTCGGGGGTGACGGA
GGCTGACCTTGCCGACACAAGTGTCACGCTGCGTGACGTCCAGGCCGTTCTGCTGAGCATGTTCAGCGCTGACACCATCCTCATCGGACA
CAGCCTGGAGAGCGACCTCCTGGCCCTGAAGGTCATCCACAGCACCGTGGTGGACACGTCTGTGCTCTTCCCCCACCGCCTGGGCCTCCC
CTACAAGCGGTCCCTGCGGAACCTCATGGCCGACTACCTCAGACAGATCATCCAGGACAATGTGGATGGGCACAGCTCCAGCGAGGACGC
CGGCGCCTGCATGCACCTGGTGATCTGGAAGGTTCGAGAAGACGCCAAGACCAAGCGATGACGCCTGCCCGCCTCCCACCCGCCTCTCCT
GCCGTCCCGCTGGTCCTTAGCCCCATGCCTCTTCCAAAACAGTGCAATAAATCTCCGGTAACCCGTCCACCTGGCCGAGGCAGCCCAGAG
CAGCGGGATGAGCTGGCGGCCAGAGAACGCCCAGCCCAGCCCACCCCCGCTCACGTCCTGCCGCACCCCTCCGCCGGTCCCCACCCTCTG
CCCCCCAGCCCTCTGGCGTCTCTAGAAACTGCTGCTGATGGAGACCAGGACAGAGCCCGCCCCCACCCGGCCCGCAGCCCCTCCTGCCCC
TCCTGCCAGGCCCTCCCTCCCGGGTGGTGGCCGGCACCTCTGCTCCTCACGTGGGTCGCGGGGCGGGCTTGTCCCAGGTTTGTTTGAGAC
AGTCTTTTTTTATTTTGTATTGTTATTTTTATTATTTTTAATTTAAACCTGATGACTTGCACAGCATCTTTCCCACCGGGAAGAGCGGGC
CTGCTGCCTTCCCTCCTGCGGGGTGGGGGTGGGACAGACCCCAGTGGGGGCCAGGGCCACACTCAGCGCAGTGTGGGCTGGACCCGCCTC
TGTGCTCCAGGGGCTGGCCGGTCTTCGGGGGCGCCGGCTTCCACCACTCGCCGCCCCACCCCTGCACTCCGTGACCTCTGAGAACCCAGC
TGGCCCCTCGTTAGCCCCAGGGTCTGCCGTCTAGAGGGAGCCCACCGGCCTCCGGACACTGTCTTCCCGCTAGAGCCCCGTCCTCCTCTG

>5283_5283_8_AP3D1-REXO1_AP3D1_chr19_2108686_ENST00000356926_REXO1_chr19_1816812_ENST00000170168_length(amino acids)=1274AA_BP=1
MPRRSGRQRGAPRAAGTGPGPPHGAQPRASRGKSPWAAAMALKMVKGSIDRMFDKNLQDLVRGIRNHKEDEAKYISQCIDEIKQELKQDN
IAVKANAVCKLTYLQMLGYDISWAAFNIIEVMSASKFTFKRIGYLAASQSFHEGTDVIMLTTNQIRKDLSSPSQYDTGVALTGLSCFVTP
DLARDLANDIMTLMSHTKPYIRKKAKLIEPLTNLIHSTSAMSLLYECVNTVIAVLISLSSGMPNHSASIQLCVQKLRILIEDSDQNLKYL
GLLAMSKILKTHPKSVQSHKDLILQCLDDKDESIRLRALDLLYGMVSKKNLMEIVKKLMTHVDKAEGTTYRDELLTKIIDICSQSNYQYI
TNFEWYISILVELTRLEGTRHGHLIAAQMLDVAIRVKAIRKFAVSQMSALLDSAHLLASSTQRNGICEVLYAAAWICGEFSEHLQEPHHT
LEAMLRPRVTTLPGHIQAVYVQNVVKLYASILQQKEQAGEAEGAQAVTQLMVDRLPQFVQSADLEVQERASCILQLVKHIQKLQAKDVPV
AEEVSALFAGELNPVAPKAQKKVPVPEGLDLDAWINEPLSDSESEDERPRAVFHEEEQRRPKHRPSEADEEELARRREARKQEQANNPFY
IKSSPSPQKRYQDTPGVEHIPVVQIDLSVPLKVPGLPMSDQYVKLEEERRHRQKLEKDKRRKKRKEKEKKGKRRHSSLPTESDEDIAPAQ
QVDIVTEEMPENALPSDEDDKDPNDPYRALDIDLDKPLADSEKLPIQKHRNTETSKSPEKDVPMVEKKSKKPKKKEKKHKEKERDKEKKK
EKEKKAEDLDFWLSTTPPPAPAPAPAPVPSTDECEDAKTEAQGEEDDAEGQDQDKKSPKPKKKKHRKEKEERTKGKKKSKKQPPGSEEAA
GEPVQNGAPEEEQLPPESSYSLLAENSYVKMTCDIRGSLQEDSQVTVAIVLENRSSSILKGMELSVLDSLNARMARPQGSSVHDGVPVPF
QLPPGVSNEAQYVFTIQSIVMAQKLKGTLSFIAKNDEGATHEKLDFRLHFSCSSYLITTPCYSDAFAKLLESGDLSMSSIKVDGIRMSFQ
NLLAKICFHHHFSVVERVDSCASMYSRSIQGHHVCLLVKKSYTTYGLELTRVTVVDTDVHVVYDTFVKPDNEIVDYNTRFSGVTEADLAD
TSVTLRDVQAVLLSMFSADTILIGHSLESDLLALKVIHSTVVDTSVLFPHRLGLPYKRSLRNLMADYLRQIIQDNVDGHSSSEDAGACMH

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for AP3D1-REXO1


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with
TgeneREXO1chr19:2108685chr19:1816812ENST000001701681116498_5771067.01222.0ELOA
TgeneREXO1chr19:2108686chr19:1816812ENST000001701681116498_5771067.01222.0ELOA


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for AP3D1-REXO1


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for AP3D1-REXO1


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource
HgeneAP3D1C0001916Albinism1GENOMICS_ENGLAND
HgeneAP3D1C0005818Blood Platelet Disorders1GENOMICS_ENGLAND
HgeneAP3D1C0027947Neutropenia1GENOMICS_ENGLAND
HgeneAP3D1C0036572Seizures1GENOMICS_ENGLAND
HgeneAP3D1C0078917Albinism, Ocular1ORPHANET
HgeneAP3D1C0079504Hermanski-Pudlak Syndrome1CTD_human;GENOMICS_ENGLAND
HgeneAP3D1C4310746HERMANSKY-PUDLAK SYNDROME 101GENOMICS_ENGLAND
TgeneC0032460Polycystic Ovary Syndrome1CTD_human
TgeneC1136382Sclerocystic Ovaries1CTD_human