Fusion gene:DNMT1-EMR1 (FusionGDB2 ID:23694)

Fusion Gene Summary for DNMT1-EMR1

Fusion gene summary
Fusion gene name: DNMT1-EMR1
Fusion gene ID: 23694

Hgene (5'-gene)
Gene symbol: DNMT1
Gene ID: 1786
Gene name: DNA methyltransferase 1
Synonyms: ADCADN|AIM|CXXC9|DNMT|HSN1E|MCMT|m.HsaI
Cytomap: 19p13.2
Type of gene: protein-coding
Description: DNA (cytosine-5)-methyltransferase 1; CXXC-type zinc finger protein 9; DNA (cytosine-5-)-methyltransferase 1; DNA MTase HsaI; DNA methyltransferase HsaI
Modification date: 20200313
UniProtAcc: P26358
Ensembl transcripts involved in fusion gene: ENST00000340748, ENST00000359526, ENST00000540357, ENST00000589538
* DoF score: 14 X 12 X 11 = 1848
# samples: 17
** MAII score: log2(17/1848*10) = -3.44235810527836
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs): DoF>8 and MAII<0

Tgene (3'-gene)
Gene symbol: EMR1
Gene ID: 2015
Gene name: adhesion G protein-coupled receptor E1
Synonyms: EMR1|TM7LN3
Cytomap: 19p13.3-p13.2
Type of gene: protein-coding
Description: adhesion G protein-coupled receptor E1; EGF-like module receptor 1; EGF-like module-containing mucin-like hormone receptor-like 1; EMR1 hormone receptor; egf-like module containing, mucin-like, hormone receptor-like 1; egf-like module containing, mucin-like, h
Modification date: 20200313
UniProtAcc: .
Ensembl transcripts involved in fusion gene: ENST00000601198, ENST00000250572, ENST00000312053, ENST00000381404, ENST00000381407, ENST00000450315
* DoF score: 2 X 2 X 2 = 8
# samples: 2
** MAII score: log2(2/8*10) = 1.32192809488736
Context

PubMed: DNMT1 [Title/Abstract] AND EMR1 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpoint: DNMT1(10266529)-EMR1(6890492), # samples: 1
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)
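
The two score definitions above can be checked against the values reported for this fusion with a few lines of Python. This is a minimal sketch: the function names are illustrative and not part of FusionGDB, and the classification rule is the "DoF>8 and MAII<0" condition quoted on this page.

```python
import math


def dof_score(n_partners: int, n_breakpoints: int, n_cancer_types: int) -> int:
    """Degree of Frequency: # partners X # break points X # cancer types."""
    return n_partners * n_breakpoints * n_cancer_types


def maii_score(n_samples: int, dof: int) -> float:
    """Major Active Isofusion Index: log2(# samples / DoF score * 10)."""
    return math.log2(n_samples / dof * 10)


def is_peginpcfg(dof: int, maii: float) -> bool:
    """Rule quoted on the page for peGinPCFGs: DoF > 8 and MAII < 0."""
    return dof > 8 and maii < 0


# Values reported on this page for DNMT1 (Hgene) and EMR1 (Tgene).
hgene_dof = dof_score(14, 12, 11)
hgene_maii = maii_score(17, hgene_dof)
tgene_dof = dof_score(2, 2, 2)
tgene_maii = maii_score(2, tgene_dof)

print(hgene_dof, hgene_maii, is_peginpcfg(hgene_dof, hgene_maii))
print(tgene_dof, tgene_maii, is_peginpcfg(tgene_dof, tgene_maii))
```

With these inputs, DNMT1 satisfies the rule (DoF of 1848 and a negative MAII), while EMR1 does not (DoF of exactly 8 and a positive MAII).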

Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
Partner | Gene | GO ID | GO term | PubMed ID
Hgene | DNMT1 | GO:0010216 | maintenance of DNA methylation | 18754681, 21745816

Fusion gene breakpoints across DNMT1 (5'-gene)
[Figure not reproduced]

Fusion gene breakpoints across EMR1 (3'-gene)
[Figure not reproduced]

Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinates were lifted over to hg19.
Source | Disease | Sample | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
ChimerDB4 | BLCA | TCGA-FJ-A3ZE-01A | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +


Fusion Gene ORF analysis for DNMT1-EMR1

Open reading frame (ORF) analysis of fusion genes based on Ensembl gene isoform structure.
ORF | Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand
5CDS-intron | ENST00000340748 | ENST00000601198 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +
5CDS-intron | ENST00000359526 | ENST00000601198 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +
5CDS-intron | ENST00000540357 | ENST00000601198 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +
In-frame | ENST00000340748 | ENST00000250572 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +
In-frame | ENST00000340748 | ENST00000312053 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +
In-frame | ENST00000340748 | ENST00000381404 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +
In-frame | ENST00000340748 | ENST00000381407 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +
In-frame | ENST00000340748 | ENST00000450315 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +
In-frame | ENST00000359526 | ENST00000250572 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +
In-frame | ENST00000359526 | ENST00000312053 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +
In-frame | ENST00000359526 | ENST00000381404 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +
In-frame | ENST00000359526 | ENST00000381407 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +
In-frame | ENST00000359526 | ENST00000450315 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +
In-frame | ENST00000540357 | ENST00000250572 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +
In-frame | ENST00000540357 | ENST00000312053 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +
In-frame | ENST00000540357 | ENST00000381404 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +
In-frame | ENST00000540357 | ENST00000381407 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +
In-frame | ENST00000540357 | ENST00000450315 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +
intron-3CDS | ENST00000589538 | ENST00000250572 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +
intron-3CDS | ENST00000589538 | ENST00000312053 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +
intron-3CDS | ENST00000589538 | ENST00000381404 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +
intron-3CDS | ENST00000589538 | ENST00000381407 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +
intron-3CDS | ENST00000589538 | ENST00000450315 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +
intron-intron | ENST00000589538 | ENST00000601198 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | +

ORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
Henst | Hgene | Hchr | Hbp | Hstrand | Tenst | Tgene | Tchr | Tbp | Tstrand | Seq length (transcript) | BP loci (transcript) | Predicted start (transcript) | Predicted stop (transcript) | Seq length (amino acids)
ENST00000340748 | DNMT1 | chr19 | 10266529 | - | ENST00000250572 | EMR1 | chr19 | 6890492 | + | 4545 | 1680 | 1691 | 4114 | 807
ENST00000340748 | DNMT1 | chr19 | 10266529 | - | ENST00000381407 | EMR1 | chr19 | 6890492 | + | 4298 | 1680 | 1691 | 3886 | 731
ENST00000340748 | DNMT1 | chr19 | 10266529 | - | ENST00000312053 | EMR1 | chr19 | 6890492 | + | 4740 | 1680 | 1691 | 4309 | 872
ENST00000340748 | DNMT1 | chr19 | 10266529 | - | ENST00000450315 | EMR1 | chr19 | 6890492 | + | 4184 | 1680 | 1691 | 3778 | 695
ENST00000340748 | DNMT1 | chr19 | 10266529 | - | ENST00000381404 | EMR1 | chr19 | 6890492 | + | 4675 | 1680 | 1691 | 4252 | 853
ENST00000540357 | DNMT1 | chr19 | 10266529 | - | ENST00000250572 | EMR1 | chr19 | 6890492 | + | 4489 | 1624 | 1635 | 4058 | 807
ENST00000540357 | DNMT1 | chr19 | 10266529 | - | ENST00000381407 | EMR1 | chr19 | 6890492 | + | 4242 | 1624 | 1635 | 3830 | 731
ENST00000540357 | DNMT1 | chr19 | 10266529 | - | ENST00000312053 | EMR1 | chr19 | 6890492 | + | 4684 | 1624 | 1635 | 4253 | 872
ENST00000540357 | DNMT1 | chr19 | 10266529 | - | ENST00000450315 | EMR1 | chr19 | 6890492 | + | 4128 | 1624 | 1635 | 3722 | 695
ENST00000540357 | DNMT1 | chr19 | 10266529 | - | ENST00000381404 | EMR1 | chr19 | 6890492 | + | 4619 | 1624 | 1635 | 4196 | 853
ENST00000359526 | DNMT1 | chr19 | 10266529 | - | ENST00000250572 | EMR1 | chr19 | 6890492 | + | 4537 | 1672 | 1683 | 4106 | 807
ENST00000359526 | DNMT1 | chr19 | 10266529 | - | ENST00000381407 | EMR1 | chr19 | 6890492 | + | 4290 | 1672 | 1683 | 3878 | 731
ENST00000359526 | DNMT1 | chr19 | 10266529 | - | ENST00000312053 | EMR1 | chr19 | 6890492 | + | 4732 | 1672 | 1683 | 4301 | 872
ENST00000359526 | DNMT1 | chr19 | 10266529 | - | ENST00000450315 | EMR1 | chr19 | 6890492 | + | 4176 | 1672 | 1683 | 3770 | 695
ENST00000359526 | DNMT1 | chr19 | 10266529 | - | ENST00000381404 | EMR1 | chr19 | 6890492 | + | 4667 | 1672 | 1683 | 4244 | 853
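
The predicted start, predicted stop, and amino-acid length values in the ORFfinder table are numerically consistent with one another if the start-to-stop span is counted inclusively and taken to include the stop codon. That reading is an assumption inferred from the numbers on this page, not a documented ORFfinder coordinate convention, and the helper name below is illustrative. A short sketch of the check:

```python
def aa_length_from_span(predicted_start: int, predicted_stop: int) -> int:
    """Amino-acid length implied by a start/stop pair.

    Assumption (inferred from the table, not documented): the span is
    inclusive at both ends and its last codon is the stop codon, which
    does not contribute an amino acid.
    """
    span_nt = predicted_stop - predicted_start + 1
    if span_nt % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    return span_nt // 3 - 1


# (predicted start, predicted stop, reported amino-acid length) from the table
rows = [
    (1691, 4114, 807),
    (1691, 3886, 731),
    (1635, 4253, 872),
    (1683, 3770, 695),
    (1683, 4244, 853),
]

for start, stop, reported in rows:
    print(start, stop, aa_length_from_span(start, stop), reported)
```

A row whose values do not agree under this reading would raise an error or print a mismatch, which makes the check a quick way to spot a column that was split incorrectly.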

DeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding-potential classifier based on a convolutional neural network and built by comparison with real Ribo-seq data. If the no-coding score is < 0.5 and the coding score is > 0.5, the in-frame fusion transcript is predicted as likely to be translated.
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | No-coding score | Coding score
ENST00000340748 | ENST00000250572 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | + | 0.003003585 | 0.9969964
ENST00000340748 | ENST00000381407 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | + | 0.004792332 | 0.9952076
ENST00000340748 | ENST00000312053 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | + | 0.004240107 | 0.9957599
ENST00000340748 | ENST00000450315 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | + | 0.004115599 | 0.99588436
ENST00000340748 | ENST00000381404 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | + | 0.002290994 | 0.99770904
ENST00000540357 | ENST00000250572 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | + | 0.003086011 | 0.996914
ENST00000540357 | ENST00000381407 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | + | 0.005012234 | 0.99498785
ENST00000540357 | ENST00000312053 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | + | 0.004104055 | 0.995896
ENST00000540357 | ENST00000450315 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | + | 0.004260357 | 0.9957397
ENST00000540357 | ENST00000381404 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | + | 0.002264792 | 0.99773514
ENST00000359526 | ENST00000250572 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | + | 0.003144138 | 0.99685585
ENST00000359526 | ENST00000381407 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | + | 0.00513934 | 0.9948606
ENST00000359526 | ENST00000312053 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | + | 0.004379583 | 0.9956204
ENST00000359526 | ENST00000450315 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | + | 0.004376493 | 0.9956235
ENST00000359526 | ENST00000381404 | DNMT1 | chr19 | 10266529 | - | EMR1 | chr19 | 6890492 | + | 0.002411605 | 0.9975884
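
The DeepORF decision rule stated above the table (no-coding score below 0.5 and coding score above 0.5) can be applied to the reported scores as follows. This is a small sketch; the function name and the `threshold` parameter are illustrative and do not come from DeepORF itself.

```python
def likely_translated(no_coding_score: float, coding_score: float,
                      threshold: float = 0.5) -> bool:
    """Apply the rule quoted on the page: no-coding < 0.5 and coding > 0.5."""
    return no_coding_score < threshold and coding_score > threshold


# (Henst, Tenst, no-coding score, coding score) for three of the listed pairs
scores = [
    ("ENST00000340748", "ENST00000250572", 0.003003585, 0.9969964),
    ("ENST00000540357", "ENST00000381407", 0.005012234, 0.99498785),
    ("ENST00000359526", "ENST00000381404", 0.002411605, 0.9975884),
]

for henst, tenst, no_coding, coding in scores:
    print(henst, tenst, likely_translated(no_coding, coding))
```

Every transcript pair in the table has a no-coding score far below 0.5 and a coding score far above it, so all of them pass this rule.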


Fusion Genomic Features for DNMT1-EMR1


FusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of each partner gene, 20kb of sequence in total). FusionAI is a fusion gene breakpoint classifier based on a convolutional neural network that compares the fusion-positive and fusion-negative sequence contexts of ~20K fusion genes. The result indicates how likely each sequence within the 20kb genomic region is to be used as a gene fusion breakpoint.
Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | 1-p | p (fusion gene breakpoint)
DNMT1 | chr19 | 10266528 | - | EMR1 | chr19 | 6890491 | + | 8.83E-05 | 0.99991167

Distribution of 44 human genomic feature loci across the 20kb fusion breakpoint regions. We integrated loci information for a total of 44 types of human genomic features in five broad categories: virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are on the help page.
[Figure: genomic feature]

Distribution of 44 human genomic feature loci across the 20kb fusion breakpoint regions that overlap the regions with the top 1% feature importance scores. More details are on the help page.
[Figure: genomic feature of top 1%]


Fusion Protein Features for DNMT1-EMR1


Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr19:10266529/chr19:6890492)
- FGviewer provides online visualization of the retention of protein functional features across the DNA, RNA, protein, and pathological levels.
- How to search
1. Enter your fusion gene symbol.
2. Press the Tab key until the breakpoint information is filled in.
3. Scroll down and press the 'Search' tab twice.
4. Scroll down to find the hyperlink to the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.

Main function of each fusion partner protein (from UniProt)

Hgene: DNMT1 (UniProt accession: P26358)
FUNCTION: Methylates CpG residues. Preferentially methylates hemimethylated DNA. Associates with DNA replication sites in S phase maintaining the methylation pattern in the newly synthesized strand, that is essential for epigenetic inheritance. Associates with chromatin during G2 and M phases to maintain DNA methylation independently of replication. It is responsible for maintaining methylation patterns established in development. DNA methylation is coordinated with methylation of histones. Mediates transcriptional repression by direct binding to HDAC2. In association with DNMT3B and via the recruitment of CTCFL/BORIS, involved in activation of BAG1 gene expression by modulating dimethylation of promoter histone H3 at H3K4 and H3K9. Probably forms a corepressor complex required for activated KRAS-mediated promoter hypermethylation and transcriptional silencing of tumor suppressor genes (TSGs) or other tumor-related genes in colorectal cancer (CRC) cells (PubMed:24623306). Also required to maintain a transcriptionally repressive state of genes in undifferentiated embryonic stem cells (ESCs) (PubMed:24623306). Associates at promoter regions of tumor suppressor genes (TSGs) leading to their gene silencing (PubMed:24623306). Promotes tumor growth (PubMed:24623306). {ECO:0000269|PubMed:16357870, ECO:0000269|PubMed:18413740, ECO:0000269|PubMed:18754681, ECO:0000269|PubMed:24623306}.

Tgene (UniProt accession: .)
FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.

Retention analysis result of each fusion partner protein across 39 protein features of UniProt: six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and three secondary structure features. Because of limited viewing space, only the retention information for the 13 regional features is shown here. All retention annotation results can be downloaded from the download page.

* A negative BPloci value means that the breakpoint is located before the CDS.
- In-frame and retained protein features among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000340748 | - | 18 | 40 | 16_109 | 481 | 1617.0 | Domain | DMAP1-binding
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000359526 | - | 19 | 41 | 16_109 | 497 | 1633.0 | Domain | DMAP1-binding
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000340748 | - | 18 | 40 | 177_205 | 481 | 1617.0 | Motif | Nuclear localization signal
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000359526 | - | 19 | 41 | 177_205 | 497 | 1633.0 | Motif | Nuclear localization signal
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000250572 | | 0 | 20 | 317_599 | 10 | 822.0 | Compositional bias | Note=Ser/Thr-rich
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000312053 | | 0 | 21 | 317_599 | 10 | 887.0 | Compositional bias | Note=Ser/Thr-rich
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381404 | | 0 | 20 | 317_599 | 10 | 868.0 | Compositional bias | Note=Ser/Thr-rich
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381407 | | 0 | 18 | 317_599 | 10 | 746.0 | Compositional bias | Note=Ser/Thr-rich
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000450315 | | 0 | 18 | 317_599 | 10 | 710.0 | Compositional bias | Note=Ser/Thr-rich
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000250572 | | 0 | 20 | 132_171 | 10 | 822.0 | Domain | EGF-like 3; calcium-binding
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000250572 | | 0 | 20 | 172_220 | 10 | 822.0 | Domain | EGF-like 4; calcium-binding
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000250572 | | 0 | 20 | 221_267 | 10 | 822.0 | Domain | EGF-like 5; calcium-binding
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000250572 | | 0 | 20 | 268_316 | 10 | 822.0 | Domain | EGF-like 6; calcium-binding
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000250572 | | 0 | 20 | 31_79 | 10 | 822.0 | Domain | EGF-like 1
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000250572 | | 0 | 20 | 547_596 | 10 | 822.0 | Domain | GPS
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000312053 | | 0 | 21 | 132_171 | 10 | 887.0 | Domain | EGF-like 3; calcium-binding
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000312053 | | 0 | 21 | 172_220 | 10 | 887.0 | Domain | EGF-like 4; calcium-binding
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000312053 | | 0 | 21 | 221_267 | 10 | 887.0 | Domain | EGF-like 5; calcium-binding
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000312053 | | 0 | 21 | 268_316 | 10 | 887.0 | Domain | EGF-like 6; calcium-binding
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000312053 | | 0 | 21 | 31_79 | 10 | 887.0 | Domain | EGF-like 1
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000312053 | | 0 | 21 | 547_596 | 10 | 887.0 | Domain | GPS
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381404 | | 0 | 20 | 132_171 | 10 | 868.0 | Domain | EGF-like 3; calcium-binding
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381404 | | 0 | 20 | 172_220 | 10 | 868.0 | Domain | EGF-like 4; calcium-binding
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381404 | | 0 | 20 | 221_267 | 10 | 868.0 | Domain | EGF-like 5; calcium-binding
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381404 | | 0 | 20 | 268_316 | 10 | 868.0 | Domain | EGF-like 6; calcium-binding
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381404 | | 0 | 20 | 31_79 | 10 | 868.0 | Domain | EGF-like 1
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381404 | | 0 | 20 | 547_596 | 10 | 868.0 | Domain | GPS
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381407 | | 0 | 18 | 132_171 | 10 | 746.0 | Domain | EGF-like 3; calcium-binding
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381407 | | 0 | 18 | 172_220 | 10 | 746.0 | Domain | EGF-like 4; calcium-binding
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381407 | | 0 | 18 | 221_267 | 10 | 746.0 | Domain | EGF-like 5; calcium-binding
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381407 | | 0 | 18 | 268_316 | 10 | 746.0 | Domain | EGF-like 6; calcium-binding
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381407 | | 0 | 18 | 31_79 | 10 | 746.0 | Domain | EGF-like 1
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381407 | | 0 | 18 | 547_596 | 10 | 746.0 | Domain | GPS
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000450315 | | 0 | 18 | 132_171 | 10 | 710.0 | Domain | EGF-like 3; calcium-binding
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000450315 | | 0 | 18 | 172_220 | 10 | 710.0 | Domain | EGF-like 4; calcium-binding
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000450315 | | 0 | 18 | 221_267 | 10 | 710.0 | Domain | EGF-like 5; calcium-binding
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000450315 | | 0 | 18 | 268_316 | 10 | 710.0 | Domain | EGF-like 6; calcium-binding
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000450315 | | 0 | 18 | 31_79 | 10 | 710.0 | Domain | EGF-like 1
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000450315 | | 0 | 18 | 547_596 | 10 | 710.0 | Domain | GPS
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000250572 | | 0 | 20 | 21_599 | 10 | 822.0 | Topological domain | Extracellular
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000250572 | | 0 | 20 | 628_634 | 10 | 822.0 | Topological domain | Cytoplasmic
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000250572 | | 0 | 20 | 657_666 | 10 | 822.0 | Topological domain | Extracellular
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000250572 | | 0 | 20 | 691_709 | 10 | 822.0 | Topological domain | Cytoplasmic
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000250572 | | 0 | 20 | 732_747 | 10 | 822.0 | Topological domain | Extracellular
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000250572 | | 0 | 20 | 777_794 | 10 | 822.0 | Topological domain | Cytoplasmic
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000250572 | | 0 | 20 | 815_829 | 10 | 822.0 | Topological domain | Extracellular
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000250572 | | 0 | 20 | 853_886 | 10 | 822.0 | Topological domain | Cytoplasmic
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000312053 | | 0 | 21 | 21_599 | 10 | 887.0 | Topological domain | Extracellular
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000312053 | | 0 | 21 | 628_634 | 10 | 887.0 | Topological domain | Cytoplasmic
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000312053 | | 0 | 21 | 657_666 | 10 | 887.0 | Topological domain | Extracellular
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000312053 | | 0 | 21 | 691_709 | 10 | 887.0 | Topological domain | Cytoplasmic
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000312053 | | 0 | 21 | 732_747 | 10 | 887.0 | Topological domain | Extracellular
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000312053 | | 0 | 21 | 777_794 | 10 | 887.0 | Topological domain | Cytoplasmic
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000312053 | | 0 | 21 | 815_829 | 10 | 887.0 | Topological domain | Extracellular
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000312053 | | 0 | 21 | 853_886 | 10 | 887.0 | Topological domain | Cytoplasmic
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381404 | | 0 | 20 | 21_599 | 10 | 868.0 | Topological domain | Extracellular
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381404 | | 0 | 20 | 628_634 | 10 | 868.0 | Topological domain | Cytoplasmic
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381404 | | 0 | 20 | 657_666 | 10 | 868.0 | Topological domain | Extracellular
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381404 | | 0 | 20 | 691_709 | 10 | 868.0 | Topological domain | Cytoplasmic
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381404 | | 0 | 20 | 732_747 | 10 | 868.0 | Topological domain | Extracellular
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381404 | | 0 | 20 | 777_794 | 10 | 868.0 | Topological domain | Cytoplasmic
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381404 | | 0 | 20 | 815_829 | 10 | 868.0 | Topological domain | Extracellular
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381404 | | 0 | 20 | 853_886 | 10 | 868.0 | Topological domain | Cytoplasmic
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381407 | | 0 | 18 | 21_599 | 10 | 746.0 | Topological domain | Extracellular
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381407 | | 0 | 18 | 628_634 | 10 | 746.0 | Topological domain | Cytoplasmic
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381407 | | 0 | 18 | 657_666 | 10 | 746.0 | Topological domain | Extracellular
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381407 | | 0 | 18 | 691_709 | 10 | 746.0 | Topological domain | Cytoplasmic
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381407 | | 0 | 18 | 732_747 | 10 | 746.0 | Topological domain | Extracellular
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381407 | | 0 | 18 | 777_794 | 10 | 746.0 | Topological domain | Cytoplasmic
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381407 | | 0 | 18 | 815_829 | 10 | 746.0 | Topological domain | Extracellular
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381407 | | 0 | 18 | 853_886 | 10 | 746.0 | Topological domain | Cytoplasmic
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000450315 | | 0 | 18 | 21_599 | 10 | 710.0 | Topological domain | Extracellular
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000450315 | | 0 | 18 | 628_634 | 10 | 710.0 | Topological domain | Cytoplasmic
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000450315 | | 0 | 18 | 657_666 | 10 | 710.0 | Topological domain | Extracellular
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000450315 | | 0 | 18 | 691_709 | 10 | 710.0 | Topological domain | Cytoplasmic
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000450315 | | 0 | 18 | 732_747 | 10 | 710.0 | Topological domain | Extracellular
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000450315 | | 0 | 18 | 777_794 | 10 | 710.0 | Topological domain | Cytoplasmic
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000450315 | | 0 | 18 | 815_829 | 10 | 710.0 | Topological domain | Extracellular
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000450315 | | 0 | 18 | 853_886 | 10 | 710.0 | Topological domain | Cytoplasmic
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000250572 | | 0 | 20 | 600_627 | 10 | 822.0 | Transmembrane | Helical; Name=1
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000250572 | | 0 | 20 | 635_656 | 10 | 822.0 | Transmembrane | Helical; Name=2
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000250572 | | 0 | 20 | 667_690 | 10 | 822.0 | Transmembrane | Helical; Name=3
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000250572 | | 0 | 20 | 710_731 | 10 | 822.0 | Transmembrane | Helical; Name=4
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000250572 | | 0 | 20 | 748_776 | 10 | 822.0 | Transmembrane | Helical; Name=5
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000250572 | | 0 | 20 | 795_814 | 10 | 822.0 | Transmembrane | Helical; Name=6
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000250572 | | 0 | 20 | 830_852 | 10 | 822.0 | Transmembrane | Helical; Name=7
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000312053 | | 0 | 21 | 600_627 | 10 | 887.0 | Transmembrane | Helical; Name=1
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000312053 | | 0 | 21 | 635_656 | 10 | 887.0 | Transmembrane | Helical; Name=2
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000312053 | | 0 | 21 | 667_690 | 10 | 887.0 | Transmembrane | Helical; Name=3
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000312053 | | 0 | 21 | 710_731 | 10 | 887.0 | Transmembrane | Helical; Name=4
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000312053 | | 0 | 21 | 748_776 | 10 | 887.0 | Transmembrane | Helical; Name=5
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000312053 | | 0 | 21 | 795_814 | 10 | 887.0 | Transmembrane | Helical; Name=6
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000312053 | | 0 | 21 | 830_852 | 10 | 887.0 | Transmembrane | Helical; Name=7
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381404 | | 0 | 20 | 600_627 | 10 | 868.0 | Transmembrane | Helical; Name=1
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381404 | | 0 | 20 | 635_656 | 10 | 868.0 | Transmembrane | Helical; Name=2
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381404 | | 0 | 20 | 667_690 | 10 | 868.0 | Transmembrane | Helical; Name=3
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381404 | | 0 | 20 | 710_731 | 10 | 868.0 | Transmembrane | Helical; Name=4
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381404 | | 0 | 20 | 748_776 | 10 | 868.0 | Transmembrane | Helical; Name=5
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381404 | | 0 | 20 | 795_814 | 10 | 868.0 | Transmembrane | Helical; Name=6
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381404 | | 0 | 20 | 830_852 | 10 | 868.0 | Transmembrane | Helical; Name=7
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381407 | | 0 | 18 | 600_627 | 10 | 746.0 | Transmembrane | Helical; Name=1
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381407 | | 0 | 18 | 635_656 | 10 | 746.0 | Transmembrane | Helical; Name=2
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381407 | | 0 | 18 | 667_690 | 10 | 746.0 | Transmembrane | Helical; Name=3
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381407 | | 0 | 18 | 710_731 | 10 | 746.0 | Transmembrane | Helical; Name=4
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381407 | | 0 | 18 | 748_776 | 10 | 746.0 | Transmembrane | Helical; Name=5
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381407 | | 0 | 18 | 795_814 | 10 | 746.0 | Transmembrane | Helical; Name=6
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000381407 | | 0 | 18 | 830_852 | 10 | 746.0 | Transmembrane | Helical; Name=7
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000450315 | | 0 | 18 | 600_627 | 10 | 710.0 | Transmembrane | Helical; Name=1
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000450315 | | 0 | 18 | 635_656 | 10 | 710.0 | Transmembrane | Helical; Name=2
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000450315 | | 0 | 18 | 667_690 | 10 | 710.0 | Transmembrane | Helical; Name=3
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000450315 | | 0 | 18 | 710_731 | 10 | 710.0 | Transmembrane | Helical; Name=4
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000450315 | | 0 | 18 | 748_776 | 10 | 710.0 | Transmembrane | Helical; Name=5
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000450315 | | 0 | 18 | 795_814 | 10 | 710.0 | Transmembrane | Helical; Name=6
Tgene | EMR1 | chr19:10266529 | chr19:6890492 | ENST00000450315 | | 0 | 18 | 830_852 | 10 | 710.0 | Transmembrane | Helical; Name=7

- In-frame and not-retained protein features among the 13 regional features.
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000340748 | - | 18 | 40 | 1139_1599 | 481 | 1617.0 | Domain | SAM-dependent MTase C5-type
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000340748 | - | 18 | 40 | 755_880 | 481 | 1617.0 | Domain | BAH 1
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000340748 | - | 18 | 40 | 972_1100 | 481 | 1617.0 | Domain | BAH 2
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000359526 | - | 19 | 41 | 1139_1599 | 497 | 1633.0 | Domain | SAM-dependent MTase C5-type
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000359526 | - | 19 | 41 | 755_880 | 497 | 1633.0 | Domain | BAH 1
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000359526 | - | 19 | 41 | 972_1100 | 497 | 1633.0 | Domain | BAH 2
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000340748 | - | 18 | 40 | 1109_1120 | 481 | 1617.0 | Region | Note=6 X 2 AA tandem repeats of K-G
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000340748 | - | 18 | 40 | 1139_1616 | 481 | 1617.0 | Region | Note=Catalytic
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000340748 | - | 18 | 40 | 1150_1151 | 481 | 1617.0 | Region | S-adenosyl-L-methionine binding
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000340748 | - | 18 | 40 | 1168_1169 | 481 | 1617.0 | Region | S-adenosyl-L-methionine binding
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000340748 | - | 18 | 40 | 1190_1191 | 481 | 1617.0 | Region | S-adenosyl-L-methionine binding
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000340748 | - | 18 | 40 | 310_502 | 481 | 1617.0 | Region | Note=Homodimerization
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000340748 | - | 18 | 40 | 331_550 | 481 | 1617.0 | Region | DNA replication foci-targeting sequence
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000340748 | - | 18 | 40 | 651_697 | 481 | 1617.0 | Region | Note=Required for activity
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000340748 | - | 18 | 40 | 693_754 | 481 | 1617.0 | Region | Note=Autoinhibitory linker
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000359526 | - | 19 | 41 | 1109_1120 | 497 | 1633.0 | Region | Note=6 X 2 AA tandem repeats of K-G
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000359526 | - | 19 | 41 | 1139_1616 | 497 | 1633.0 | Region | Note=Catalytic
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000359526 | - | 19 | 41 | 1150_1151 | 497 | 1633.0 | Region | S-adenosyl-L-methionine binding
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000359526 | - | 19 | 41 | 1168_1169 | 497 | 1633.0 | Region | S-adenosyl-L-methionine binding
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000359526 | - | 19 | 41 | 1190_1191 | 497 | 1633.0 | Region | S-adenosyl-L-methionine binding
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000359526 | - | 19 | 41 | 310_502 | 497 | 1633.0 | Region | Note=Homodimerization
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000359526 | - | 19 | 41 | 331_550 | 497 | 1633.0 | Region | DNA replication foci-targeting sequence
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000359526 | - | 19 | 41 | 651_697 | 497 | 1633.0 | Region | Note=Required for activity
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000359526 | - | 19 | 41 | 693_754 | 497 | 1633.0 | Region | Note=Autoinhibitory linker
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000340748 | - | 18 | 40 | 1109_1110 | 481 | 1617.0 | Repeat | Note=1
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000340748 | - | 18 | 40 | 1111_1112 | 481 | 1617.0 | Repeat | Note=2
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000340748 | - | 18 | 40 | 1113_1114 | 481 | 1617.0 | Repeat | Note=3
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000340748 | - | 18 | 40 | 1115_1116 | 481 | 1617.0 | Repeat | Note=4
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000340748 | - | 18 | 40 | 1117_1118 | 481 | 1617.0 | Repeat | Note=5
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000340748 | - | 18 | 40 | 1119_1120 | 481 | 1617.0 | Repeat | Note=6; approximate
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000359526 | - | 19 | 41 | 1109_1110 | 497 | 1633.0 | Repeat | Note=1
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000359526 | - | 19 | 41 | 1111_1112 | 497 | 1633.0 | Repeat | Note=2
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000359526 | - | 19 | 41 | 1113_1114 | 497 | 1633.0 | Repeat | Note=3
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000359526 | - | 19 | 41 | 1115_1116 | 497 | 1633.0 | Repeat | Note=4
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000359526 | - | 19 | 41 | 1117_1118 | 497 | 1633.0 | Repeat | Note=5
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000359526 | - | 19 | 41 | 1119_1120 | 497 | 1633.0 | Repeat | Note=6; approximate
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000340748 | - | 18 | 40 | 646_692 | 481 | 1617.0 | Zinc finger | CXXC-type
Hgene | DNMT1 | chr19:10266529 | chr19:6890492 | ENST00000359526 | - | 19 | 41 | 646_692 | 497 | 1633.0 | Zinc finger | CXXC-type
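
The split between retained and not-retained features in the two tables above is consistent with a simple positional rule: a feature of the 5' partner (Hgene) is kept when it ends at or before BPloci, and a feature of the 3' partner (Tgene) is kept when it starts at or after BPloci. The sketch below encodes that rule. It is an assumption inferred from the rows on this page, including the handling of the exact boundary position, and the function name is illustrative rather than FusionGDB's own.

```python
def is_feature_retained(partner: str, feature_start: int, feature_end: int,
                        bp_loci: int) -> bool:
    """Guess whether a protein feature survives the fusion.

    Assumed rule (inferred from the tables, not documented):
      - Hgene (5' partner) keeps residues up to BPloci, so the whole
        feature must end at or before BPloci.
      - Tgene (3' partner) keeps residues from BPloci onward, so the whole
        feature must start at or after BPloci.
    """
    if partner == "Hgene":
        return feature_end <= bp_loci
    if partner == "Tgene":
        return feature_start >= bp_loci
    raise ValueError(f"unknown partner: {partner!r}")


# Rows taken from the tables: (partner, loci start, loci end, BPloci, note)
examples = [
    ("Hgene", 16, 109, 481, "DMAP1-binding"),
    ("Hgene", 177, 205, 481, "Nuclear localization signal"),
    ("Hgene", 310, 502, 481, "Homodimerization"),
    ("Hgene", 1139, 1599, 481, "SAM-dependent MTase C5-type"),
    ("Tgene", 31, 79, 10, "EGF-like 1"),
    ("Tgene", 830, 852, 10, "Helical; Name=7"),
]

for partner, start, end, bp, note in examples:
    print(partner, note, is_feature_retained(partner, start, end, bp))
```

Under this rule the homodimerization region (310_502) is lost because it spans the DNMT1 breakpoint at residue 481, which matches its placement in the not-retained table.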



Fusion Gene Sequence for DNMT1-EMR1


For in-frame fusion transcripts, we provide the fusion transcript sequences and the fusion amino acid sequences. To obtain each fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among all the predicted ones.
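
The idea of choosing the longest ORF among all predicted ones can be illustrated with a simplified scanner. This sketch is not ORFfinder: it assumes forward-strand reading frames only, ATG as the only start codon, and the three standard stop codons, and its function names are illustrative.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq: str):
    """Yield (start, end) spans of ATG...stop ORFs in the three forward frames.

    Spans are 0-based and half-open, and each span includes its stop codon.
    """
    seq = seq.upper()
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # open an ORF at the first in-frame ATG
            elif start is not None and codon in STOP_CODONS:
                yield (start, i + 3)  # close it at the first in-frame stop
                start = None


def longest_orf(seq: str):
    """Return the longest ORF sequence, or None if no complete ORF exists."""
    seq = seq.upper()
    spans = list(find_orfs(seq))
    if not spans:
        return None
    start, end = max(spans, key=lambda span: span[1] - span[0])
    return seq[start:end]


toy = "CCATGAAATAGGGATGCCCGGGTTTTAACC"
print(longest_orf(toy))
```

The toy sequence holds two complete ORFs in different frames, and the function returns the longer one. Applied to a fusion transcript, the result of such a scan depends on the start-codon and strand settings, so it will not necessarily reproduce the ORFfinder output.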
>23694_23694_1_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000340748_EMR1_chr19_6890492_ENST00000250572_length(transcript)=4545nt_BP=1680nt
TCCGCGTGGGGGGGGTGTGTGCCCGCCTTGCGCATGCGTGTTCCCTGGGCATGGCCGGCTCCGTTCCATCCTTCTGCACAGGGTATCGCC
TCTCTCCGTTTGGTACATCCCCTCCTCCCCCACGCCCGGACTGGGGTGGTAGACGCCGCCTCCGCTCATCGCCCCTCCCCATCGGTTTCC
GCGCGAAAAGCCGGGGCGCCTGCGCTGCCGCCGCCGCGTCTGCTGAAGCCTCCGAGATGCCGGCGCGTACCGCCCCAGCCCGGGTGCCCA
CACTGGCCGTCCCGGCCATCTCGCTGCCCGACGATGTCCGCAGGCGGCTCAAAGATTTGGAAAGAGACAGCTTAACAGAAAAGGAATGTG
TGAAGGAGAAATTGAATCTCTTGCACGAATTTCTGCAAACAGAAATAAAGAATCAGTTATGTGACTTGGAAACCAAATTACGTAAAGAAG
AATTATCCGAGGAGGGCTACCTGGCTAAAGTCAAATCCCTTTTAAATAAAGATTTGTCCTTGGAGAACGGTGCTCATGCTTACAACCGGG
AAGTGAATGGACGTCTAGAAAACGGGAACCAAGCAAGAAGTGAAGCCCGTAGAGTGGGAATGGCAGATGCCAACAGCCCCCCCAAACCCC
TTTCCAAACCTCGCACGCCCAGGAGGAGCAAGTCCGATGGAGAGGCTAAGCCTGAACCTTCACCTAGCCCCAGGATTACAAGGAAAAGCA
CCAGGCAAACCACCATCACATCTCATTTTGCAAAGGGCCCTGCCAAACGGAAACCTCAGGAAGAGTCTGAAAGAGCCAAATCGGATGAGT
CCATCAAGGAAGAAGACAAAGACCAGGATGAGAAGAGACGTAGAGTTACATCCAGAGAACGAGTTGCTAGACCGCTTCCTGCAGAAGAAC
CTGAAAGAGCAAAATCAGGAACGCGCACTGAAAAGGAAGAAGAAAGAGATGAAAAAGAAGAAAAGAGACTCCGAAGTCAAACCAAAGAAC
CAACACCCAAACAGAAACTGAAGGAGGAGCCGGACAGAGAAGCCAGGGCAGGCGTGCAGGCTGACGAGGACGAAGATGGAGACGAGAAAG
ATGAGAAGAAGCACAGAAGTCAACCCAAAGATCTAGCTGCCAAACGGAGGCCCGAAGAAAAAGAACCTGAAAAAGTAAATCCACAGATTT
CTGATGAAAAAGACGAGGATGAAAAGGAGGAGAAGAGACGCAAAACGACCCCCAAAGAACCAACGGAGAAAAAAATGGCTCGCGCCAAAA
CAGTCATGAACTCCAAGACCCACCCTCCCAAGTGCATTCAGTGCGGGCAGTACCTGGACGACCCTGACCTCAAATATGGGCAGCACCCAC
CAGACGCGGTGGATGAGCCACAGATGCTGACAAATGAGAAGCTGTCCATCTTTGATGCCAACGAGTCTGGCTTTGAGAGTTATGAGGCGC
TTCCCCAGCACAAACTGACCTGCTTCAGTGTGTACTGTAAGCACGGTCACCTGTGTCCCATCGACACCGGCCTCATCGAGAAGAATATCG
AACTCTTCTTTTCTGGTTCAGCAAAACCAATCTATGATGATGACCCATCTCTTGAAGGTGGTGTTAATGGCAAAAATCTTGGCCCCATAA
ATGAATGGTGGATCACTGGCTTTGATGGAGGTGAAAAGGCCCTCATCGGCTTCAGCACCTGATGTTGTGTTATGCACAGCTGGGAAGGGC
ACATAAGACCCACACGGAAACCAAACACAAAGGGTAATAACTGTAGAGACAGTACCTTGTGCCCAGCTTATGCCACCTGCACCAATACAG
TGGACAGTTACTATTGCGCTTGCAAACAAGGCTTCCTGTCCAGCAATGGGCAAAATCACTTCAAGGATCCAGGAGTGCGATGCAAAGATA
TTGATGAATGTTCTCAAAGCCCCCAGCCCTGTGGTCCTAACTCATCCTGCAAAAACCTGTCAGGGAGGTACAAGTGCAGCTGTTTAGATG
GTTTCTCTTCTCCCACTGGAAATGACTGGGTCCCAGGAAAGCCGGGCAATTTCTCCTGTACTGATATCAATGAGTGCCTCACCAGCAGCG
TCTGCCCTGAGCATTCTGACTGTGTCAACTCCATGGGAAGCTACAGTTGCAGCTGTCAAGTTGGATTCATCTCTAGAAACTCCACCTGTG
AAGACGTGGATGAATGTGCAGATCCAAGAGCTTGCCCAGAGCATGCAACTTGTAATAACACTGTTGGAAACTACTCTTGTTTCTGCAACC
CAGGATTTGAATCCAGCAGTGGCCACTTGAGTTTCCAGGGTCTCAAAGCATCGTGTGAAGATATTGATGAATGCACTGAAATGTGCCCCA
TCAATTCAACATGCACCAACACTCCTGGGAGCTACTTTTGCACCTGCCACCCTGGCTTTGCACCAAGCAATGGACAGTTGAATTTCACAG
ACCAAGGAGTGGAATGTAGAGATATTGATGAGTGCCGCCAAGATCCATCAACCTGTGGTCCTAATTCTATCTGCACCAATGCCCTGGGCT
CCTACAGCTGTGGCTGCATTGCAGGCTTTCATCCCAATCCAGAAGGCTCCCAGAAAGATGGCAACTTCAGCTGCCAAAGGGTTCTCTTCA
AATGTAAGGAAGATGTGATACCCGATAATAAGCAGATCCAGCAATGCCAAGAGGGAACCGCAGTGAAACCTGCATATGTCTCCTTTTGTG
CACAAATAAATAACATCTTCAGCGTTCTGGACAAAGTGTGTGAAAATAAAACGACCGTAGTTTCTCTGAAGAATACAACTGAGAGCTTTG
TCCCTGTGCTTAAACAAATATCCACGTGGACTAAATTCACCAAGGAAGAGACGTCCTCCCTGGCCACAGTCTTCCTGGAGAGTGTGGAAA
GCATGACACTGGCATCTTTTTGGAAACCCTCAGCAAATATCACTCCGGCTGTTCGGACGGAATACTTAGACATTGAGAGCAAAGTTATCA
ACAAAGAATGCAGTGAAGAGAATGTGACGTTGGACTTGGTAGCCAAGGGGGATAAGATGAAGATCGGGTGTTCCACAATTGAGGAATCTG
AATCCACAGAGACCACTGGTGTGGCTTTTGTCTCCTTTGTGGGCATGGAATCGGTTTTAAATGAGCGCTTCTTCAAAGACCACCAGGCTC
CCTTGACCACCTCTGAGATCAAGCTGAAGATGAATTCTCGAGTCGTTGGGGGCATAATGACTGGAGAGAAGAAAGACGGCTTCTCAGATC
CAATCATCTACACTCTGGAGAACATTCAGCCAAAGCAGAAGTTTGAGAGGCCCATCTGTGTTTCCTGGAGCACTGATGTGAAGGGTGGAA
GATGGACATCCTTTGGCTGTGTGATCCTGGAAGCTTCTGAGACATATACCATCTGCAGCTGTAATCAGATGGCAAATCTTGCCGTTATCA
TGGCGTCTGGGGAGCTCACGATGGGCTGCGCCATCATCGCGGGCTTCCTGCACTACCTTTTCCTTGCCTGCTTCTTCTGGATGCTGGTGG
AGGCTGTGATACTGTTCTTGATGGTCAGAAACCTGAAGGTGGTGAATTACTTCAGCTCTCGCAACATCAAGATGCTGCACATCTGTGCCT
TTGGTTATGGGCTGCCGATGCTGGTGGTGGTGATCTCTGCCAGTGTGCAGCCACAGGGCTATGGAATGCATAATCGCTGCTGGCTGAATA
CAGAGACAGGGTTCATCTGGAGTTTCTTGGGGCCAGTTTGCACAGTTATAGTGATCAACTCCCTTCTCCTGACCTGGACCTTGTGGATCC
TGAGGCAGAGGCTTTCCAGTGTTAATGCCGAAGTCTCAACGCTAAAAGACACCAGGTTACTGACCTTCAAGGCCTTTGCCCAGCTCTTCA
TCCTGGGCTGCTCCTGGGTGCTGGGCATTTTTCAGATTGGACCTGTGGCAGGTGTCATGGCTTACCTGTTCACCATCATCAACAGCCTGC
AGGGGGCCTTCATCTTCCTCATCCACTGTCTGCTCAACGGCCAGGTACGAGAAGAATACAAGAGGTGGATCACTGGGAAGACGAAGCCCA
GCTCCCAGTCCCAGACCTCAAGGATCTTGCTGTCCTCCATGCCATCCGCTTCCAAGACGGGTTAAAGTCCTTTCTTGCTTTCAAATATGC
TATGGAGCCACAGTTGAGGACAGTAGTTTCCTGCAGGAGCCTACCCTGAAATCTCTTCTCAGCTTAACATGGAAATGAGGATCCCACCAG
CCCCAGAACCCTCTGGGGAAGAATGTTGGGGGCGGTCTTCCTGTGGTTGTATGCACTGATGAGAAATCAGGCGTTTCTGCTCCAAACGAC
CATTTTATCTTCGTGCTCTGCAACTTCTTCAATTCCAGAGTTTCTGAGAACAGACCCAAATTCAATGGCATGACCAAGAACACCTGGCTA
CCATTTTGTTTTCTCCTGCCCTTGTTGGTGCATGGTTCTAAGCATGCCCCTCCAGAGCCTATCATACGCCTGATACAGAGAACCTCTCAA

>23694_23694_1_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000340748_EMR1_chr19_6890492_ENST00000250572_length(amino acids)=807AA_BP=
MHSWEGHIRPTRKPNTKGNNCRDSTLCPAYATCTNTVDSYYCACKQGFLSSNGQNHFKDPGVRCKDIDECSQSPQPCGPNSSCKNLSGRY
KCSCLDGFSSPTGNDWVPGKPGNFSCTDINECLTSSVCPEHSDCVNSMGSYSCSCQVGFISRNSTCEDVDECADPRACPEHATCNNTVGN
YSCFCNPGFESSSGHLSFQGLKASCEDIDECTEMCPINSTCTNTPGSYFCTCHPGFAPSNGQLNFTDQGVECRDIDECRQDPSTCGPNSI
CTNALGSYSCGCIAGFHPNPEGSQKDGNFSCQRVLFKCKEDVIPDNKQIQQCQEGTAVKPAYVSFCAQINNIFSVLDKVCENKTTVVSLK
NTTESFVPVLKQISTWTKFTKEETSSLATVFLESVESMTLASFWKPSANITPAVRTEYLDIESKVINKECSEENVTLDLVAKGDKMKIGC
STIEESESTETTGVAFVSFVGMESVLNERFFKDHQAPLTTSEIKLKMNSRVVGGIMTGEKKDGFSDPIIYTLENIQPKQKFERPICVSWS
TDVKGGRWTSFGCVILEASETYTICSCNQMANLAVIMASGELTMGCAIIAGFLHYLFLACFFWMLVEAVILFLMVRNLKVVNYFSSRNIK
MLHICAFGYGLPMLVVVISASVQPQGYGMHNRCWLNTETGFIWSFLGPVCTVIVINSLLLTWTLWILRQRLSSVNAEVSTLKDTRLLTFK

--------------------------------------------------------------
>23694_23694_2_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000340748_EMR1_chr19_6890492_ENST00000312053_length(transcript)=4740nt_BP=1680nt
TCCGCGTGGGGGGGGTGTGTGCCCGCCTTGCGCATGCGTGTTCCCTGGGCATGGCCGGCTCCGTTCCATCCTTCTGCACAGGGTATCGCC
TCTCTCCGTTTGGTACATCCCCTCCTCCCCCACGCCCGGACTGGGGTGGTAGACGCCGCCTCCGCTCATCGCCCCTCCCCATCGGTTTCC
GCGCGAAAAGCCGGGGCGCCTGCGCTGCCGCCGCCGCGTCTGCTGAAGCCTCCGAGATGCCGGCGCGTACCGCCCCAGCCCGGGTGCCCA
CACTGGCCGTCCCGGCCATCTCGCTGCCCGACGATGTCCGCAGGCGGCTCAAAGATTTGGAAAGAGACAGCTTAACAGAAAAGGAATGTG
TGAAGGAGAAATTGAATCTCTTGCACGAATTTCTGCAAACAGAAATAAAGAATCAGTTATGTGACTTGGAAACCAAATTACGTAAAGAAG
AATTATCCGAGGAGGGCTACCTGGCTAAAGTCAAATCCCTTTTAAATAAAGATTTGTCCTTGGAGAACGGTGCTCATGCTTACAACCGGG
AAGTGAATGGACGTCTAGAAAACGGGAACCAAGCAAGAAGTGAAGCCCGTAGAGTGGGAATGGCAGATGCCAACAGCCCCCCCAAACCCC
TTTCCAAACCTCGCACGCCCAGGAGGAGCAAGTCCGATGGAGAGGCTAAGCCTGAACCTTCACCTAGCCCCAGGATTACAAGGAAAAGCA
CCAGGCAAACCACCATCACATCTCATTTTGCAAAGGGCCCTGCCAAACGGAAACCTCAGGAAGAGTCTGAAAGAGCCAAATCGGATGAGT
CCATCAAGGAAGAAGACAAAGACCAGGATGAGAAGAGACGTAGAGTTACATCCAGAGAACGAGTTGCTAGACCGCTTCCTGCAGAAGAAC
CTGAAAGAGCAAAATCAGGAACGCGCACTGAAAAGGAAGAAGAAAGAGATGAAAAAGAAGAAAAGAGACTCCGAAGTCAAACCAAAGAAC
CAACACCCAAACAGAAACTGAAGGAGGAGCCGGACAGAGAAGCCAGGGCAGGCGTGCAGGCTGACGAGGACGAAGATGGAGACGAGAAAG
ATGAGAAGAAGCACAGAAGTCAACCCAAAGATCTAGCTGCCAAACGGAGGCCCGAAGAAAAAGAACCTGAAAAAGTAAATCCACAGATTT
CTGATGAAAAAGACGAGGATGAAAAGGAGGAGAAGAGACGCAAAACGACCCCCAAAGAACCAACGGAGAAAAAAATGGCTCGCGCCAAAA
CAGTCATGAACTCCAAGACCCACCCTCCCAAGTGCATTCAGTGCGGGCAGTACCTGGACGACCCTGACCTCAAATATGGGCAGCACCCAC
CAGACGCGGTGGATGAGCCACAGATGCTGACAAATGAGAAGCTGTCCATCTTTGATGCCAACGAGTCTGGCTTTGAGAGTTATGAGGCGC
TTCCCCAGCACAAACTGACCTGCTTCAGTGTGTACTGTAAGCACGGTCACCTGTGTCCCATCGACACCGGCCTCATCGAGAAGAATATCG
AACTCTTCTTTTCTGGTTCAGCAAAACCAATCTATGATGATGACCCATCTCTTGAAGGTGGTGTTAATGGCAAAAATCTTGGCCCCATAA
ATGAATGGTGGATCACTGGCTTTGATGGAGGTGAAAAGGCCCTCATCGGCTTCAGCACCTGATGTTGTGTTATGCACAGCTGGGAAGGGC
ACATAAGACCCACACGGAAACCAAACACAAAGGGTAATAACTGTAGAGACAGTACCTTGTGCCCAGCTTATGCCACCTGCACCAATACAG
TGGACAGTTACTATTGCGCTTGCAAACAAGGCTTCCTGTCCAGCAATGGGCAAAATCACTTCAAGGATCCAGGAGTGCGATGCAAAGATA
TTGATGAATGTTCTCAAAGCCCCCAGCCCTGTGGTCCTAACTCATCCTGCAAAAACCTGTCAGGGAGGTACAAGTGCAGCTGTTTAGATG
GTTTCTCTTCTCCCACTGGAAATGACTGGGTCCCAGGAAAGCCGGGCAATTTCTCCTGTACTGATATCAATGAGTGCCTCACCAGCAGCG
TCTGCCCTGAGCATTCTGACTGTGTCAACTCCATGGGAAGCTACAGTTGCAGCTGTCAAGTTGGATTCATCTCTAGAAACTCCACCTGTG
AAGACGTGGATGAATGTGCAGATCCAAGAGCTTGCCCAGAGCATGCAACTTGTAATAACACTGTTGGAAACTACTCTTGTTTCTGCAACC
CAGGATTTGAATCCAGCAGTGGCCACTTGAGTTTCCAGGGTCTCAAAGCATCGTGTGAAGATATTGATGAATGCACTGAAATGTGCCCCA
TCAATTCAACATGCACCAACACTCCTGGGAGCTACTTTTGCACCTGCCACCCTGGCTTTGCACCAAGCAATGGACAGTTGAATTTCACAG
ACCAAGGAGTGGAATGTAGAGATATTGATGAGTGCCGCCAAGATCCATCAACCTGTGGTCCTAATTCTATCTGCACCAATGCCCTGGGCT
CCTACAGCTGTGGCTGCATTGCAGGCTTTCATCCCAATCCAGAAGGCTCCCAGAAAGATGGCAACTTCAGCTGCCAAAGGGTTCTCTTCA
AATGTAAGGAAGATGTGATACCCGATAATAAGCAGATCCAGCAATGCCAAGAGGGAACCGCAGTGAAACCTGCATATGTCTCCTTTTGTG
CACAAATAAATAACATCTTCAGCGTTCTGGACAAAGTGTGTGAAAATAAAACGACCGTAGTTTCTCTGAAGAATACAACTGAGAGCTTTG
TCCCTGTGCTTAAACAAATATCCACGTGGACTAAATTCACCAAGGAAGAGACGTCCTCCCTGGCCACAGTCTTCCTGGAGAGTGTGGAAA
GCATGACACTGGCATCTTTTTGGAAACCCTCAGCAAATATCACTCCGGCTGTTCGGACGGAATACTTAGACATTGAGAGCAAAGTTATCA
ACAAAGAATGCAGTGAAGAGAATGTGACGTTGGACTTGGTAGCCAAGGGGGATAAGATGAAGATCGGGTGTTCCACAATTGAGGAATCTG
AATCCACAGAGACCACTGGTGTGGCTTTTGTCTCCTTTGTGGGCATGGAATCGGTTTTAAATGAGCGCTTCTTCAAAGACCACCAGGCTC
CCTTGACCACCTCTGAGATCAAGCTGAAGATGAATTCTCGAGTCGTTGGGGGCATAATGACTGGAGAGAAGAAAGACGGCTTCTCAGATC
CAATCATCTACACTCTGGAGAACATTCAGCCAAAGCAGAAGTTTGAGAGGCCCATCTGTGTTTCCTGGAGCACTGATGTGAAGGGTGGAA
GATGGACATCCTTTGGCTGTGTGATCCTGGAAGCTTCTGAGACATATACCATCTGCAGCTGTAATCAGATGGCAAATCTTGCCGTTATCA
TGGCGTCTGGGGAGCTCACGATGGACTTTTCCTTGTACATCATTAGCCATGTAGGCATTATCATCTCCTTGGTGTGCCTCGTCTTGGCCA
TCGCCACCTTTCTGCTGTGTCGCTCCATCCGAAATCACAACACCTACCTCCACCTGCACCTCTGCGTGTGTCTCCTCTTGGCGAAGACTC
TCTTCCTCGCCGGTATACACAAGACTGACAACAAGATGGGCTGCGCCATCATCGCGGGCTTCCTGCACTACCTTTTCCTTGCCTGCTTCT
TCTGGATGCTGGTGGAGGCTGTGATACTGTTCTTGATGGTCAGAAACCTGAAGGTGGTGAATTACTTCAGCTCTCGCAACATCAAGATGC
TGCACATCTGTGCCTTTGGTTATGGGCTGCCGATGCTGGTGGTGGTGATCTCTGCCAGTGTGCAGCCACAGGGCTATGGAATGCATAATC
GCTGCTGGCTGAATACAGAGACAGGGTTCATCTGGAGTTTCTTGGGGCCAGTTTGCACAGTTATAGTGATCAACTCCCTTCTCCTGACCT
GGACCTTGTGGATCCTGAGGCAGAGGCTTTCCAGTGTTAATGCCGAAGTCTCAACGCTAAAAGACACCAGGTTACTGACCTTCAAGGCCT
TTGCCCAGCTCTTCATCCTGGGCTGCTCCTGGGTGCTGGGCATTTTTCAGATTGGACCTGTGGCAGGTGTCATGGCTTACCTGTTCACCA
TCATCAACAGCCTGCAGGGGGCCTTCATCTTCCTCATCCACTGTCTGCTCAACGGCCAGGTACGAGAAGAATACAAGAGGTGGATCACTG
GGAAGACGAAGCCCAGCTCCCAGTCCCAGACCTCAAGGATCTTGCTGTCCTCCATGCCATCCGCTTCCAAGACGGGTTAAAGTCCTTTCT
TGCTTTCAAATATGCTATGGAGCCACAGTTGAGGACAGTAGTTTCCTGCAGGAGCCTACCCTGAAATCTCTTCTCAGCTTAACATGGAAA
TGAGGATCCCACCAGCCCCAGAACCCTCTGGGGAAGAATGTTGGGGGCGGTCTTCCTGTGGTTGTATGCACTGATGAGAAATCAGGCGTT
TCTGCTCCAAACGACCATTTTATCTTCGTGCTCTGCAACTTCTTCAATTCCAGAGTTTCTGAGAACAGACCCAAATTCAATGGCATGACC
AAGAACACCTGGCTACCATTTTGTTTTCTCCTGCCCTTGTTGGTGCATGGTTCTAAGCATGCCCCTCCAGAGCCTATCATACGCCTGATA

>23694_23694_2_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000340748_EMR1_chr19_6890492_ENST00000312053_length(amino acids)=872AA_BP=
MHSWEGHIRPTRKPNTKGNNCRDSTLCPAYATCTNTVDSYYCACKQGFLSSNGQNHFKDPGVRCKDIDECSQSPQPCGPNSSCKNLSGRY
KCSCLDGFSSPTGNDWVPGKPGNFSCTDINECLTSSVCPEHSDCVNSMGSYSCSCQVGFISRNSTCEDVDECADPRACPEHATCNNTVGN
YSCFCNPGFESSSGHLSFQGLKASCEDIDECTEMCPINSTCTNTPGSYFCTCHPGFAPSNGQLNFTDQGVECRDIDECRQDPSTCGPNSI
CTNALGSYSCGCIAGFHPNPEGSQKDGNFSCQRVLFKCKEDVIPDNKQIQQCQEGTAVKPAYVSFCAQINNIFSVLDKVCENKTTVVSLK
NTTESFVPVLKQISTWTKFTKEETSSLATVFLESVESMTLASFWKPSANITPAVRTEYLDIESKVINKECSEENVTLDLVAKGDKMKIGC
STIEESESTETTGVAFVSFVGMESVLNERFFKDHQAPLTTSEIKLKMNSRVVGGIMTGEKKDGFSDPIIYTLENIQPKQKFERPICVSWS
TDVKGGRWTSFGCVILEASETYTICSCNQMANLAVIMASGELTMDFSLYIISHVGIIISLVCLVLAIATFLLCRSIRNHNTYLHLHLCVC
LLLAKTLFLAGIHKTDNKMGCAIIAGFLHYLFLACFFWMLVEAVILFLMVRNLKVVNYFSSRNIKMLHICAFGYGLPMLVVVISASVQPQ
GYGMHNRCWLNTETGFIWSFLGPVCTVIVINSLLLTWTLWILRQRLSSVNAEVSTLKDTRLLTFKAFAQLFILGCSWVLGIFQIGPVAGV

--------------------------------------------------------------
>23694_23694_3_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000340748_EMR1_chr19_6890492_ENST00000381404_length(transcript)=4675nt_BP=1680nt
TCCGCGTGGGGGGGGTGTGTGCCCGCCTTGCGCATGCGTGTTCCCTGGGCATGGCCGGCTCCGTTCCATCCTTCTGCACAGGGTATCGCC
TCTCTCCGTTTGGTACATCCCCTCCTCCCCCACGCCCGGACTGGGGTGGTAGACGCCGCCTCCGCTCATCGCCCCTCCCCATCGGTTTCC
GCGCGAAAAGCCGGGGCGCCTGCGCTGCCGCCGCCGCGTCTGCTGAAGCCTCCGAGATGCCGGCGCGTACCGCCCCAGCCCGGGTGCCCA
CACTGGCCGTCCCGGCCATCTCGCTGCCCGACGATGTCCGCAGGCGGCTCAAAGATTTGGAAAGAGACAGCTTAACAGAAAAGGAATGTG
TGAAGGAGAAATTGAATCTCTTGCACGAATTTCTGCAAACAGAAATAAAGAATCAGTTATGTGACTTGGAAACCAAATTACGTAAAGAAG
AATTATCCGAGGAGGGCTACCTGGCTAAAGTCAAATCCCTTTTAAATAAAGATTTGTCCTTGGAGAACGGTGCTCATGCTTACAACCGGG
AAGTGAATGGACGTCTAGAAAACGGGAACCAAGCAAGAAGTGAAGCCCGTAGAGTGGGAATGGCAGATGCCAACAGCCCCCCCAAACCCC
TTTCCAAACCTCGCACGCCCAGGAGGAGCAAGTCCGATGGAGAGGCTAAGCCTGAACCTTCACCTAGCCCCAGGATTACAAGGAAAAGCA
CCAGGCAAACCACCATCACATCTCATTTTGCAAAGGGCCCTGCCAAACGGAAACCTCAGGAAGAGTCTGAAAGAGCCAAATCGGATGAGT
CCATCAAGGAAGAAGACAAAGACCAGGATGAGAAGAGACGTAGAGTTACATCCAGAGAACGAGTTGCTAGACCGCTTCCTGCAGAAGAAC
CTGAAAGAGCAAAATCAGGAACGCGCACTGAAAAGGAAGAAGAAAGAGATGAAAAAGAAGAAAAGAGACTCCGAAGTCAAACCAAAGAAC
CAACACCCAAACAGAAACTGAAGGAGGAGCCGGACAGAGAAGCCAGGGCAGGCGTGCAGGCTGACGAGGACGAAGATGGAGACGAGAAAG
ATGAGAAGAAGCACAGAAGTCAACCCAAAGATCTAGCTGCCAAACGGAGGCCCGAAGAAAAAGAACCTGAAAAAGTAAATCCACAGATTT
CTGATGAAAAAGACGAGGATGAAAAGGAGGAGAAGAGACGCAAAACGACCCCCAAAGAACCAACGGAGAAAAAAATGGCTCGCGCCAAAA
CAGTCATGAACTCCAAGACCCACCCTCCCAAGTGCATTCAGTGCGGGCAGTACCTGGACGACCCTGACCTCAAATATGGGCAGCACCCAC
CAGACGCGGTGGATGAGCCACAGATGCTGACAAATGAGAAGCTGTCCATCTTTGATGCCAACGAGTCTGGCTTTGAGAGTTATGAGGCGC
TTCCCCAGCACAAACTGACCTGCTTCAGTGTGTACTGTAAGCACGGTCACCTGTGTCCCATCGACACCGGCCTCATCGAGAAGAATATCG
AACTCTTCTTTTCTGGTTCAGCAAAACCAATCTATGATGATGACCCATCTCTTGAAGGTGGTGTTAATGGCAAAAATCTTGGCCCCATAA
ATGAATGGTGGATCACTGGCTTTGATGGAGGTGAAAAGGCCCTCATCGGCTTCAGCACCTGATGTTGTGTTATGCACAGCTGGGAAGGGC
ACATAAGACCCACACGGAAACCAAACACAAAGGGTAATAACTGTAGAGACAGTACCTTGTGCCCAGCTTATGCCACCTGCACCAATACAG
TGGACAGTTACTATTGCGCTTGCAAACAAGGCTTCCTGTCCAGCAATGGGCAAAATCACTTCAAGGATCCAGGAGTGCGATGCAAAGATA
TCAATGAGTGCCTCACCAGCAGCGTCTGCCCTGAGCATTCTGACTGTGTCAACTCCATGGGAAGCTACAGTTGCAGCTGTCAAGTTGGAT
TCATCTCTAGAAACTCCACCTGTGAAGACGTGGATGAATGTGCAGATCCAAGAGCTTGCCCAGAGCATGCAACTTGTAATAACACTGTTG
GAAACTACTCTTGTTTCTGCAACCCAGGATTTGAATCCAGCAGTGGCCACTTGAGTTTCCAGGGTCTCAAAGCATCGTGTGAAGATATTG
ATGAATGCACTGAAATGTGCCCCATCAATTCAACATGCACCAACACTCCTGGGAGCTACTTTTGCACCTGCCACCCTGGCTTTGCACCAA
GCAATGGACAGTTGAATTTCACAGACCAAGGAGTGGAATGTAGAGATATTGATGAGTGCCGCCAAGATCCATCAACCTGTGGTCCTAATT
CTATCTGCACCAATGCCCTGGGCTCCTACAGCTGTGGCTGCATTGCAGGCTTTCATCCCAATCCAGAAGGCTCCCAGAAAGATGGCAACT
TCAGCTGCCAAAGGGTTCTCTTCAAATGTAAGGAAGATGTGATACCCGATAATAAGCAGATCCAGCAATGCCAAGAGGGAACCGCAGTGA
AACCTGCATATGTCTCCTTTTGTGCACAAATAAATAACATCTTCAGCGTTCTGGACAAAGTGTGTGAAAATAAAACGACCGTAGTTTCTC
TGAAGAATACAACTGAGAGCTTTGTCCCTGTGCTTAAACAAATATCCACGTGGACTAAATTCACCAAGGAAGAGACGTCCTCCCTGGCCA
CAGTCTTCCTGGAGAGTGTGGAAAGCATGACACTGGCATCTTTTTGGAAACCCTCAGCAAATATCACTCCGGCTGTTCGGACGGAATACT
TAGACATTGAGAGCAAAGTTATCAACAAAGAATGCAGTGAAGAGAATGTGACGTTGGACTTGGTAGCCAAGGGGGATAAGATGAAGATCG
GGTGTTCCACAATTGAGGAATCTGAATCCACAGAGACCACTGGTGTGGCTTTTGTCTCCTTTGTGGGCATGGAATCGGTTTTAAATGAGC
GCTTCTTCAAAGACCACCAGGCTCCCTTGACCACCTCTGAGATCAAGCTGAAGATGAATTCTCGAGTCGTTGGGGGCATAATGACTGGAG
AGAAGAAAGACGGCTTCTCAGATCCAATCATCTACACTCTGGAGAACATTCAGCCAAAGCAGAAGTTTGAGAGGCCCATCTGTGTTTCCT
GGAGCACTGATGTGAAGGGTGGAAGATGGACATCCTTTGGCTGTGTGATCCTGGAAGCTTCTGAGACATATACCATCTGCAGCTGTAATC
AGATGGCAAATCTTGCCGTTATCATGGCGTCTGGGGAGCTCACGATGGACTTTTCCTTGTACATCATTAGCCATGTAGGCATTATCATCT
CCTTGGTGTGCCTCGTCTTGGCCATCGCCACCTTTCTGCTGTGTCGCTCCATCCGAAATCACAACACCTACCTCCACCTGCACCTCTGCG
TGTGTCTCCTCTTGGCGAAGACTCTCTTCCTCGCCGGTATACACAAGACTGACAACAAGATGGGCTGCGCCATCATCGCGGGCTTCCTGC
ACTACCTTTTCCTTGCCTGCTTCTTCTGGATGCTGGTGGAGGCTGTGATACTGTTCTTGATGGTCAGAAACCTGAAGGTGGTGAATTACT
TCAGCTCTCGCAACATCAAGATGCTGCACATCTGTGCCTTTGGTTATGGGCTGCCGATGCTGGTGGTGGTGATCTCTGCCAGTGTGCAGC
CACAGGGCTATGGAATGCATAATCGCTGCTGGCTGAATACAGAGACAGGGTTCATCTGGAGTTTCTTGGGGCCAGTTTGCACAGTTATAG
TGGTAAGCAAATACTACAACAGCCTGGCGAAGTGTGTTCTGAAGGAGGAGCAAGGAGACCTGCGAGATCTGGAATTTCCAGGGACGTGTG
CAGCTGAGAGGATCAACTCCCTTCTCCTGACCTGGACCTTGTGGATCCTGAGGCAGAGGCTTTCCAGTGTTAATGCCGAAGTCTCAACGC
TAAAAGACACCAGGTTACTGACCTTCAAGGCCTTTGCCCAGCTCTTCATCCTGGGCTGCTCCTGGGTGCTGGGCATTTTTCAGATTGGAC
CTGTGGCAGGTGTCATGGCTTACCTGTTCACCATCATCAACAGCCTGCAGGGGGCCTTCATCTTCCTCATCCACTGTCTGCTCAACGGCC
AGGTACGAGAAGAATACAAGAGGTGGATCACTGGGAAGACGAAGCCCAGCTCCCAGTCCCAGACCTCAAGGATCTTGCTGTCCTCCATGC
CATCCGCTTCCAAGACGGGTTAAAGTCCTTTCTTGCTTTCAAATATGCTATGGAGCCACAGTTGAGGACAGTAGTTTCCTGCAGGAGCCT
ACCCTGAAATCTCTTCTCAGCTTAACATGGAAATGAGGATCCCACCAGCCCCAGAACCCTCTGGGGAAGAATGTTGGGGGCGGTCTTCCT
GTGGTTGTATGCACTGATGAGAAATCAGGCGTTTCTGCTCCAAACGACCATTTTATCTTCGTGCTCTGCAACTTCTTCAATTCCAGAGTT
TCTGAGAACAGACCCAAATTCAATGGCATGACCAAGAACACCTGGCTACCATTTTGTTTTCTCCTGCCCTTGTTGGTGCATGGTTCTAAG

>23694_23694_3_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000340748_EMR1_chr19_6890492_ENST00000381404_length(amino acids)=853AA_BP=59
MHSWEGHIRPTRKPNTKGNNCRDSTLCPAYATCTNTVDSYYCACKQGFLSSNGQNHFKDPGVRCKDINECLTSSVCPEHSDCVNSMGSYS
CSCQVGFISRNSTCEDVDECADPRACPEHATCNNTVGNYSCFCNPGFESSSGHLSFQGLKASCEDIDECTEMCPINSTCTNTPGSYFCTC
HPGFAPSNGQLNFTDQGVECRDIDECRQDPSTCGPNSICTNALGSYSCGCIAGFHPNPEGSQKDGNFSCQRVLFKCKEDVIPDNKQIQQC
QEGTAVKPAYVSFCAQINNIFSVLDKVCENKTTVVSLKNTTESFVPVLKQISTWTKFTKEETSSLATVFLESVESMTLASFWKPSANITP
AVRTEYLDIESKVINKECSEENVTLDLVAKGDKMKIGCSTIEESESTETTGVAFVSFVGMESVLNERFFKDHQAPLTTSEIKLKMNSRVV
GGIMTGEKKDGFSDPIIYTLENIQPKQKFERPICVSWSTDVKGGRWTSFGCVILEASETYTICSCNQMANLAVIMASGELTMDFSLYIIS
HVGIIISLVCLVLAIATFLLCRSIRNHNTYLHLHLCVCLLLAKTLFLAGIHKTDNKMGCAIIAGFLHYLFLACFFWMLVEAVILFLMVRN
LKVVNYFSSRNIKMLHICAFGYGLPMLVVVISASVQPQGYGMHNRCWLNTETGFIWSFLGPVCTVIVVSKYYNSLAKCVLKEEQGDLRDL
EFPGTCAAERINSLLLTWTLWILRQRLSSVNAEVSTLKDTRLLTFKAFAQLFILGCSWVLGIFQIGPVAGVMAYLFTIINSLQGAFIFLI

--------------------------------------------------------------
>23694_23694_4_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000340748_EMR1_chr19_6890492_ENST00000381407_length(transcript)=4298nt_BP=1680nt
TCCGCGTGGGGGGGGTGTGTGCCCGCCTTGCGCATGCGTGTTCCCTGGGCATGGCCGGCTCCGTTCCATCCTTCTGCACAGGGTATCGCC
TCTCTCCGTTTGGTACATCCCCTCCTCCCCCACGCCCGGACTGGGGTGGTAGACGCCGCCTCCGCTCATCGCCCCTCCCCATCGGTTTCC
GCGCGAAAAGCCGGGGCGCCTGCGCTGCCGCCGCCGCGTCTGCTGAAGCCTCCGAGATGCCGGCGCGTACCGCCCCAGCCCGGGTGCCCA
CACTGGCCGTCCCGGCCATCTCGCTGCCCGACGATGTCCGCAGGCGGCTCAAAGATTTGGAAAGAGACAGCTTAACAGAAAAGGAATGTG
TGAAGGAGAAATTGAATCTCTTGCACGAATTTCTGCAAACAGAAATAAAGAATCAGTTATGTGACTTGGAAACCAAATTACGTAAAGAAG
AATTATCCGAGGAGGGCTACCTGGCTAAAGTCAAATCCCTTTTAAATAAAGATTTGTCCTTGGAGAACGGTGCTCATGCTTACAACCGGG
AAGTGAATGGACGTCTAGAAAACGGGAACCAAGCAAGAAGTGAAGCCCGTAGAGTGGGAATGGCAGATGCCAACAGCCCCCCCAAACCCC
TTTCCAAACCTCGCACGCCCAGGAGGAGCAAGTCCGATGGAGAGGCTAAGCCTGAACCTTCACCTAGCCCCAGGATTACAAGGAAAAGCA
CCAGGCAAACCACCATCACATCTCATTTTGCAAAGGGCCCTGCCAAACGGAAACCTCAGGAAGAGTCTGAAAGAGCCAAATCGGATGAGT
CCATCAAGGAAGAAGACAAAGACCAGGATGAGAAGAGACGTAGAGTTACATCCAGAGAACGAGTTGCTAGACCGCTTCCTGCAGAAGAAC
CTGAAAGAGCAAAATCAGGAACGCGCACTGAAAAGGAAGAAGAAAGAGATGAAAAAGAAGAAAAGAGACTCCGAAGTCAAACCAAAGAAC
CAACACCCAAACAGAAACTGAAGGAGGAGCCGGACAGAGAAGCCAGGGCAGGCGTGCAGGCTGACGAGGACGAAGATGGAGACGAGAAAG
ATGAGAAGAAGCACAGAAGTCAACCCAAAGATCTAGCTGCCAAACGGAGGCCCGAAGAAAAAGAACCTGAAAAAGTAAATCCACAGATTT
CTGATGAAAAAGACGAGGATGAAAAGGAGGAGAAGAGACGCAAAACGACCCCCAAAGAACCAACGGAGAAAAAAATGGCTCGCGCCAAAA
CAGTCATGAACTCCAAGACCCACCCTCCCAAGTGCATTCAGTGCGGGCAGTACCTGGACGACCCTGACCTCAAATATGGGCAGCACCCAC
CAGACGCGGTGGATGAGCCACAGATGCTGACAAATGAGAAGCTGTCCATCTTTGATGCCAACGAGTCTGGCTTTGAGAGTTATGAGGCGC
TTCCCCAGCACAAACTGACCTGCTTCAGTGTGTACTGTAAGCACGGTCACCTGTGTCCCATCGACACCGGCCTCATCGAGAAGAATATCG
AACTCTTCTTTTCTGGTTCAGCAAAACCAATCTATGATGATGACCCATCTCTTGAAGGTGGTGTTAATGGCAAAAATCTTGGCCCCATAA
ATGAATGGTGGATCACTGGCTTTGATGGAGGTGAAAAGGCCCTCATCGGCTTCAGCACCTGATGTTGTGTTATGCACAGCTGGGAAGGGC
ACATAAGACCCACACGGAAACCAAACACAAAGGGTAATAACTGTAGAGACAGTACCTTGTGCCCAGCTTATGCCACCTGCACCAATACAG
TGGACAGTTACTATTGCGCTTGCAAACAAGGCTTCCTGTCCAGCAATGGGCAAAATCACTTCAAGGATCCAGGAGTGCGATGCAAAGATA
TTGATGAATGCACTGAAATGTGCCCCATCAATTCAACATGCACCAACACTCCTGGGAGCTACTTTTGCACCTGCCACCCTGGCTTTGCAC
CAAGCAATGGACAGTTGAATTTCACAGACCAAGGAGTGGAATGTAGAGATATTGATGAGTGCCGCCAAGATCCATCAACCTGTGGTCCTA
ATTCTATCTGCACCAATGCCCTGGGCTCCTACAGCTGTGGCTGCATTGCAGGCTTTCATCCCAATCCAGAAGGCTCCCAGAAAGATGGCA
ACTTCAGCTGCCAAAGGGTTCTCTTCAAATGTAAGGAAGATGTGATACCCGATAATAAGCAGATCCAGCAATGCCAAGAGGGAACCGCAG
TGAAACCTGCATATGTCTCCTTTTGTGCACAAATAAATAACATCTTCAGCGTTCTGGACAAAGTGTGTGAAAATAAAACGACCGTAGTTT
CTCTGAAGAATACAACTGAGAGCTTTGTCCCTGTGCTTAAACAAATATCCACGTGGACTAAATTCACCAAGGAAGAGACGTCCTCCCTGG
CCACAGTCTTCCTGGAGAGTGTGGAAAGCATGACACTGGCATCTTTTTGGAAACCCTCAGCAAATATCACTCCGGCTGTTCGGACGGAAT
ACTTAGACATTGAGAGCAAAGTTATCAACAAAGAATGCAGTGAAGAGAATGTGACGTTGGACTTGGTAGCCAAGGGGGATAAGATGAAGA
TCGGGTGTTCCACAATTGAGGAATCTGAATCCACAGAGACCACTGGTGTGGCTTTTGTCTCCTTTGTGGGCATGGAATCGGTTTTAAATG
AGCGCTTCTTCAAAGACCACCAGGCTCCCTTGACCACCTCTGAGATCAAGCTGAAGATGAATTCTCGAGTCGTTGGGGGCATAATGACTG
GAGAGAAGAAAGACGGCTTCTCAGATCCAATCATCTACACTCTGGAGAACATTCAGCCAAAGCAGAAGTTTGAGAGGCCCATCTGTGTTT
CCTGGAGCACTGATGTGAAGGGTGGAAGATGGACATCCTTTGGCTGTGTGATCCTGGAAGCTTCTGAGACATATACCATCTGCAGCTGTA
ATCAGATGGCAAATCTTGCCGTTATCATGGCGTCTGGGGAGCTCACGATGGACTTTTCCTTGTACATCATTAGCCATGTAGGCATTATCA
TCTCCTTGGTGTGCCTCGTCTTGGCCATCGCCACCTTTCTGCTGTGTCGCTCCATCCGAAATCACAACACCTACCTCCACCTGCACCTCT
GCGTGTGTCTCCTCTTGGCGAAGACTCTCTTCCTCGCCGGTATACACAAGACTGACAACAAGATGGGCTGCGCCATCATCGCGGGCTTCC
TGCACTACCTTTTCCTTGCCTGCTTCTTCTGGATGCTGGTGGAGGCTGTGATACTGTTCTTGATGGTCAGAAACCTGAAGGTGGTGAATT
ACTTCAGCTCTCGCAACATCAAGATGCTGCACATCTGTGCCTTTGGTTATGGGCTGCCGATGCTGGTGGTGGTGATCTCTGCCAGTGTGC
AGCCACAGGGCTATGGAATGCATAATCGCTGCTGGCTGAATACAGAGACAGGGTTCATCTGGAGTTTCTTGGGGCCAGTTTGCACAGTTA
TAGTGATCAACTCCCTTCTCCTGACCTGGACCTTGTGGATCCTGAGGCAGAGGCTTTCCAGTGTTAATGCCGAAGTCTCAACGCTAAAAG
ACACCAGGTTACTGACCTTCAAGGCCTTTGCCCAGCTCTTCATCCTGGGCTGCTCCTGGGTGCTGGGCATTTTTCAGATTGGACCTGTGG
CAGGTGTCATGGCTTACCTGTTCACCATCATCAACAGCCTGCAGGGGGCCTTCATCTTCCTCATCCACTGTCTGCTCAACGGCCAGGTAC
GAGAAGAATACAAGAGGTGGATCACTGGGAAGACGAAGCCCAGCTCCCAGTCCCAGACCTCAAGGATCTTGCTGTCCTCCATGCCATCCG
CTTCCAAGACGGGTTAAAGTCCTTTCTTGCTTTCAAATATGCTATGGAGCCACAGTTGAGGACAGTAGTTTCCTGCAGGAGCCTACCCTG
AAATCTCTTCTCAGCTTAACATGGAAATGAGGATCCCACCAGCCCCAGAACCCTCTGGGGAAGAATGTTGGGGGCGGTCTTCCTGTGGTT
GTATGCACTGATGAGAAATCAGGCGTTTCTGCTCCAAACGACCATTTTATCTTCGTGCTCTGCAACTTCTTCAATTCCAGAGTTTCTGAG
AACAGACCCAAATTCAATGGCATGACCAAGAACACCTGGCTACCATTTTGTTTTCTCCTGCCCTTGTTGGTGCATGGTTCTAAGCATGCC

>23694_23694_4_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000340748_EMR1_chr19_6890492_ENST00000381407_length(amino acids)=731AA_BP=
MHSWEGHIRPTRKPNTKGNNCRDSTLCPAYATCTNTVDSYYCACKQGFLSSNGQNHFKDPGVRCKDIDECTEMCPINSTCTNTPGSYFCT
CHPGFAPSNGQLNFTDQGVECRDIDECRQDPSTCGPNSICTNALGSYSCGCIAGFHPNPEGSQKDGNFSCQRVLFKCKEDVIPDNKQIQQ
CQEGTAVKPAYVSFCAQINNIFSVLDKVCENKTTVVSLKNTTESFVPVLKQISTWTKFTKEETSSLATVFLESVESMTLASFWKPSANIT
PAVRTEYLDIESKVINKECSEENVTLDLVAKGDKMKIGCSTIEESESTETTGVAFVSFVGMESVLNERFFKDHQAPLTTSEIKLKMNSRV
VGGIMTGEKKDGFSDPIIYTLENIQPKQKFERPICVSWSTDVKGGRWTSFGCVILEASETYTICSCNQMANLAVIMASGELTMDFSLYII
SHVGIIISLVCLVLAIATFLLCRSIRNHNTYLHLHLCVCLLLAKTLFLAGIHKTDNKMGCAIIAGFLHYLFLACFFWMLVEAVILFLMVR
NLKVVNYFSSRNIKMLHICAFGYGLPMLVVVISASVQPQGYGMHNRCWLNTETGFIWSFLGPVCTVIVINSLLLTWTLWILRQRLSSVNA
EVSTLKDTRLLTFKAFAQLFILGCSWVLGIFQIGPVAGVMAYLFTIINSLQGAFIFLIHCLLNGQVREEYKRWITGKTKPSSQSQTSRIL

--------------------------------------------------------------
>23694_23694_5_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000340748_EMR1_chr19_6890492_ENST00000450315_length(transcript)=4184nt_BP=1680nt
TCCGCGTGGGGGGGGTGTGTGCCCGCCTTGCGCATGCGTGTTCCCTGGGCATGGCCGGCTCCGTTCCATCCTTCTGCACAGGGTATCGCC
TCTCTCCGTTTGGTACATCCCCTCCTCCCCCACGCCCGGACTGGGGTGGTAGACGCCGCCTCCGCTCATCGCCCCTCCCCATCGGTTTCC
GCGCGAAAAGCCGGGGCGCCTGCGCTGCCGCCGCCGCGTCTGCTGAAGCCTCCGAGATGCCGGCGCGTACCGCCCCAGCCCGGGTGCCCA
CACTGGCCGTCCCGGCCATCTCGCTGCCCGACGATGTCCGCAGGCGGCTCAAAGATTTGGAAAGAGACAGCTTAACAGAAAAGGAATGTG
TGAAGGAGAAATTGAATCTCTTGCACGAATTTCTGCAAACAGAAATAAAGAATCAGTTATGTGACTTGGAAACCAAATTACGTAAAGAAG
AATTATCCGAGGAGGGCTACCTGGCTAAAGTCAAATCCCTTTTAAATAAAGATTTGTCCTTGGAGAACGGTGCTCATGCTTACAACCGGG
AAGTGAATGGACGTCTAGAAAACGGGAACCAAGCAAGAAGTGAAGCCCGTAGAGTGGGAATGGCAGATGCCAACAGCCCCCCCAAACCCC
TTTCCAAACCTCGCACGCCCAGGAGGAGCAAGTCCGATGGAGAGGCTAAGCCTGAACCTTCACCTAGCCCCAGGATTACAAGGAAAAGCA
CCAGGCAAACCACCATCACATCTCATTTTGCAAAGGGCCCTGCCAAACGGAAACCTCAGGAAGAGTCTGAAAGAGCCAAATCGGATGAGT
CCATCAAGGAAGAAGACAAAGACCAGGATGAGAAGAGACGTAGAGTTACATCCAGAGAACGAGTTGCTAGACCGCTTCCTGCAGAAGAAC
CTGAAAGAGCAAAATCAGGAACGCGCACTGAAAAGGAAGAAGAAAGAGATGAAAAAGAAGAAAAGAGACTCCGAAGTCAAACCAAAGAAC
CAACACCCAAACAGAAACTGAAGGAGGAGCCGGACAGAGAAGCCAGGGCAGGCGTGCAGGCTGACGAGGACGAAGATGGAGACGAGAAAG
ATGAGAAGAAGCACAGAAGTCAACCCAAAGATCTAGCTGCCAAACGGAGGCCCGAAGAAAAAGAACCTGAAAAAGTAAATCCACAGATTT
CTGATGAAAAAGACGAGGATGAAAAGGAGGAGAAGAGACGCAAAACGACCCCCAAAGAACCAACGGAGAAAAAAATGGCTCGCGCCAAAA
CAGTCATGAACTCCAAGACCCACCCTCCCAAGTGCATTCAGTGCGGGCAGTACCTGGACGACCCTGACCTCAAATATGGGCAGCACCCAC
CAGACGCGGTGGATGAGCCACAGATGCTGACAAATGAGAAGCTGTCCATCTTTGATGCCAACGAGTCTGGCTTTGAGAGTTATGAGGCGC
TTCCCCAGCACAAACTGACCTGCTTCAGTGTGTACTGTAAGCACGGTCACCTGTGTCCCATCGACACCGGCCTCATCGAGAAGAATATCG
AACTCTTCTTTTCTGGTTCAGCAAAACCAATCTATGATGATGACCCATCTCTTGAAGGTGGTGTTAATGGCAAAAATCTTGGCCCCATAA
ATGAATGGTGGATCACTGGCTTTGATGGAGGTGAAAAGGCCCTCATCGGCTTCAGCACCTGATGTTGTGTTATGCACAGCTGGGAAGGGC
ACATAAGACCCACACGGAAACCAAACACAAAGGGTAATAACTGTAGAGACAGTACCTTGTGCCCAGCTTATGCCACCTGCACCAATACAG
TGGACAGTTACTATTGCGCTTGCAAACAAGGCTTCCTGTCCAGCAATGGGCAAAATCACTTCAAGGATCCAGGAGTGCGATGCAAAGATA
TTGATGAATGTTCTCAAAGCCCCCAGCCCTGTGGTCCTAACTCATCCTGCAAAAACCTGTCAGGGAGGTACAAGTGCAGCTGTTTAGATG
GTTTCTCTTCTCCCACTGGAAATGACTGGGTCCCAGGAAAGCCGGGCAATTTCTCCTGTACTGATATCAATGAGTGCCTCACCAGCAGGG
TTCTCTTCAAATGTAAGGAAGATGTGATACCCGATAATAAGCAGATCCAGCAATGCCAAGAGGGAACCGCAGTGAAACCTGCATATGTCT
CCTTTTGTGCACAAATAAATAACATCTTCAGCGTTCTGGACAAAGTGTGTGAAAATAAAACGACCGTAGTTTCTCTGAAGAATACAACTG
AGAGCTTTGTCCCTGTGCTTAAACAAATATCCACGTGGACTAAATTCACCAAGGAAGAGACGTCCTCCCTGGCCACAGTCTTCCTGGAGA
GTGTGGAAAGCATGACACTGGCATCTTTTTGGAAACCCTCAGCAAATATCACTCCGGCTGTTCGGACGGAATACTTAGACATTGAGAGCA
AAGTTATCAACAAAGAATGCAGTGAAGAGAATGTGACGTTGGACTTGGTAGCCAAGGGGGATAAGATGAAGATCGGGTGTTCCACAATTG
AGGAATCTGAATCCACAGAGACCACTGGTGTGGCTTTTGTCTCCTTTGTGGGCATGGAATCGGTTTTAAATGAGCGCTTCTTCAAAGACC
ACCAGGCTCCCTTGACCACCTCTGAGATCAAGCTGAAGATGAATTCTCGAGTCGTTGGGGGCATAATGACTGGAGAGAAGAAAGACGGCT
TCTCAGATCCAATCATCTACACTCTGGAGAACATTCAGCCAAAGCAGAAGTTTGAGAGGCCCATCTGTGTTTCCTGGAGCACTGATGTGA
AGGGTGGAAGATGGACATCCTTTGGCTGTGTGATCCTGGAAGCTTCTGAGACATATACCATCTGCAGCTGTAATCAGATGGCAAATCTTG
CCGTTATCATGGCGTCTGGGGAGCTCACGATGGACTTTTCCTTGTACATCATTAGCCATGTAGGCATTATCATCTCCTTGGTGTGCCTCG
TCTTGGCCATCGCCACCTTTCTGCTGTGTCGCTCCATCCGAAATCACAACACCTACCTCCACCTGCACCTCTGCGTGTGTCTCCTCTTGG
CGAAGACTCTCTTCCTCGCCGGTATACACAAGACTGACAACAAGATGGGCTGCGCCATCATCGCGGGCTTCCTGCACTACCTTTTCCTTG
CCTGCTTCTTCTGGATGCTGGTGGAGGCTGTGATACTGTTCTTGATGGTCAGAAACCTGAAGGTGGTGAATTACTTCAGCTCTCGCAACA
TCAAGATGCTGCACATCTGTGCCTTTGGTTATGGGCTGCCGATGCTGGTGGTGGTGATCTCTGCCAGTGTGCAGCCACAGGGCTATGGAA
TGCATAATCGCTGCTGGCTGAATACAGAGACAGGGTTCATCTGGAGTTTCTTGGGGCCAGTTTGCACAGTTATAGTGATCAACTCCCTTC
TCCTGACCTGGACCTTGTGGATCCTGAGGCAGAGGCTTTCCAGTGTTAATGCCGAAGTCTCAACGCTAAAAGACACCAGGTTACTGACCT
TCAAGGCCTTTGCCCAGCTCTTCATCCTGGGCTGCTCCTGGGTGCTGGGCATTTTTCAGATTGGACCTGTGGCAGGTGTCATGGCTTACC
TGTTCACCATCATCAACAGCCTGCAGGGGGCCTTCATCTTCCTCATCCACTGTCTGCTCAACGGCCAGGTACGAGAAGAATACAAGAGGT
GGATCACTGGGAAGACGAAGCCCAGCTCCCAGTCCCAGACCTCAAGGATCTTGCTGTCCTCCATGCCATCCGCTTCCAAGACGGGTTAAA
GTCCTTTCTTGCTTTCAAATATGCTATGGAGCCACAGTTGAGGACAGTAGTTTCCTGCAGGAGCCTACCCTGAAATCTCTTCTCAGCTTA
ACATGGAAATGAGGATCCCACCAGCCCCAGAACCCTCTGGGGAAGAATGTTGGGGGCGGTCTTCCTGTGGTTGTATGCACTGATGAGAAA
TCAGGCGTTTCTGCTCCAAACGACCATTTTATCTTCGTGCTCTGCAACTTCTTCAATTCCAGAGTTTCTGAGAACAGACCCAAATTCAAT
GGCATGACCAAGAACACCTGGCTACCATTTTGTTTTCTCCTGCCCTTGTTGGTGCATGGTTCTAAGCATGCCCCTCCAGAGCCTATCATA

>23694_23694_5_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000340748_EMR1_chr19_6890492_ENST00000450315_length(amino acids)=695AA_BP=
MHSWEGHIRPTRKPNTKGNNCRDSTLCPAYATCTNTVDSYYCACKQGFLSSNGQNHFKDPGVRCKDIDECSQSPQPCGPNSSCKNLSGRY
KCSCLDGFSSPTGNDWVPGKPGNFSCTDINECLTSRVLFKCKEDVIPDNKQIQQCQEGTAVKPAYVSFCAQINNIFSVLDKVCENKTTVV
SLKNTTESFVPVLKQISTWTKFTKEETSSLATVFLESVESMTLASFWKPSANITPAVRTEYLDIESKVINKECSEENVTLDLVAKGDKMK
IGCSTIEESESTETTGVAFVSFVGMESVLNERFFKDHQAPLTTSEIKLKMNSRVVGGIMTGEKKDGFSDPIIYTLENIQPKQKFERPICV
SWSTDVKGGRWTSFGCVILEASETYTICSCNQMANLAVIMASGELTMDFSLYIISHVGIIISLVCLVLAIATFLLCRSIRNHNTYLHLHL
CVCLLLAKTLFLAGIHKTDNKMGCAIIAGFLHYLFLACFFWMLVEAVILFLMVRNLKVVNYFSSRNIKMLHICAFGYGLPMLVVVISASV
QPQGYGMHNRCWLNTETGFIWSFLGPVCTVIVINSLLLTWTLWILRQRLSSVNAEVSTLKDTRLLTFKAFAQLFILGCSWVLGIFQIGPV

--------------------------------------------------------------
>23694_23694_6_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000359526_EMR1_chr19_6890492_ENST00000250572_length(transcript)=4537nt_BP=1672nt
GGCTCCGTTCCATCCTTCTGCACAGGGTATCGCCTCTCTCCGTTTGGTACATCCCCTCCTCCCCCACGCCCGGACTGGGGTGGTAGACGC
CGCCTCCGCTCATCGCCCCTCCCCATCGGTTTCCGCGCGAAAAGCCGGGGCGCCTGCGCTGCCGCCGCCGCGTCTGCTGAAGCCTCCGAG
ATGCCGGCGCGTACCGCCCCAGCCCGGGTGCCCACACTGGCCGTCCCGGCCATCTCGCTGCCCGACGATGTCCGCAGGCGGCTCAAAGAT
TTGGAAAGAGACAGCTTAACAGAAAAGGAATGTGTGAAGGAGAAATTGAATCTCTTGCACGAATTTCTGCAAACAGAAATAAAGAATCAG
TTATGTGACTTGGAAACCAAATTACGTAAAGAAGAATTATCCGAGGAGGGCTACCTGGCTAAAGTCAAATCCCTTTTAAATAAAGATTTG
TCCTTGGAGAACGGTGCTCATGCTTACAACCGGGAAGTGAATGGACGTCTAGAAAACGGGAACCAAGCAAGAAGTGAAGCCCGTAGAGTG
GGAATGGCAGATGCCAACAGCCCCCCCAAACCCCTTTCCAAACCTCGCACGCCCAGGAGGAGCAAGTCCGATGGAGAGGCTAAGCGTTCA
AGAGACCCTCCTGCCTCAGCCTCCCAAGTAACTGGGATTAGAGCTGAACCTTCACCTAGCCCCAGGATTACAAGGAAAAGCACCAGGCAA
ACCACCATCACATCTCATTTTGCAAAGGGCCCTGCCAAACGGAAACCTCAGGAAGAGTCTGAAAGAGCCAAATCGGATGAGTCCATCAAG
GAAGAAGACAAAGACCAGGATGAGAAGAGACGTAGAGTTACATCCAGAGAACGAGTTGCTAGACCGCTTCCTGCAGAAGAACCTGAAAGA
GCAAAATCAGGAACGCGCACTGAAAAGGAAGAAGAAAGAGATGAAAAAGAAGAAAAGAGACTCCGAAGTCAAACCAAAGAACCAACACCC
AAACAGAAACTGAAGGAGGAGCCGGACAGAGAAGCCAGGGCAGGCGTGCAGGCTGACGAGGACGAAGATGGAGACGAGAAAGATGAGAAG
AAGCACAGAAGTCAACCCAAAGATCTAGCTGCCAAACGGAGGCCCGAAGAAAAAGAACCTGAAAAAGTAAATCCACAGATTTCTGATGAA
AAAGACGAGGATGAAAAGGAGGAGAAGAGACGCAAAACGACCCCCAAAGAACCAACGGAGAAAAAAATGGCTCGCGCCAAAACAGTCATG
AACTCCAAGACCCACCCTCCCAAGTGCATTCAGTGCGGGCAGTACCTGGACGACCCTGACCTCAAATATGGGCAGCACCCACCAGACGCG
GTGGATGAGCCACAGATGCTGACAAATGAGAAGCTGTCCATCTTTGATGCCAACGAGTCTGGCTTTGAGAGTTATGAGGCGCTTCCCCAG
CACAAACTGACCTGCTTCAGTGTGTACTGTAAGCACGGTCACCTGTGTCCCATCGACACCGGCCTCATCGAGAAGAATATCGAACTCTTC
TTTTCTGGTTCAGCAAAACCAATCTATGATGATGACCCATCTCTTGAAGGTGGTGTTAATGGCAAAAATCTTGGCCCCATAAATGAATGG
TGGATCACTGGCTTTGATGGAGGTGAAAAGGCCCTCATCGGCTTCAGCACCTGATGTTGTGTTATGCACAGCTGGGAAGGGCACATAAGA
CCCACACGGAAACCAAACACAAAGGGTAATAACTGTAGAGACAGTACCTTGTGCCCAGCTTATGCCACCTGCACCAATACAGTGGACAGT
TACTATTGCGCTTGCAAACAAGGCTTCCTGTCCAGCAATGGGCAAAATCACTTCAAGGATCCAGGAGTGCGATGCAAAGATATTGATGAA
TGTTCTCAAAGCCCCCAGCCCTGTGGTCCTAACTCATCCTGCAAAAACCTGTCAGGGAGGTACAAGTGCAGCTGTTTAGATGGTTTCTCT
TCTCCCACTGGAAATGACTGGGTCCCAGGAAAGCCGGGCAATTTCTCCTGTACTGATATCAATGAGTGCCTCACCAGCAGCGTCTGCCCT
GAGCATTCTGACTGTGTCAACTCCATGGGAAGCTACAGTTGCAGCTGTCAAGTTGGATTCATCTCTAGAAACTCCACCTGTGAAGACGTG
GATGAATGTGCAGATCCAAGAGCTTGCCCAGAGCATGCAACTTGTAATAACACTGTTGGAAACTACTCTTGTTTCTGCAACCCAGGATTT
GAATCCAGCAGTGGCCACTTGAGTTTCCAGGGTCTCAAAGCATCGTGTGAAGATATTGATGAATGCACTGAAATGTGCCCCATCAATTCA
ACATGCACCAACACTCCTGGGAGCTACTTTTGCACCTGCCACCCTGGCTTTGCACCAAGCAATGGACAGTTGAATTTCACAGACCAAGGA
GTGGAATGTAGAGATATTGATGAGTGCCGCCAAGATCCATCAACCTGTGGTCCTAATTCTATCTGCACCAATGCCCTGGGCTCCTACAGC
TGTGGCTGCATTGCAGGCTTTCATCCCAATCCAGAAGGCTCCCAGAAAGATGGCAACTTCAGCTGCCAAAGGGTTCTCTTCAAATGTAAG
GAAGATGTGATACCCGATAATAAGCAGATCCAGCAATGCCAAGAGGGAACCGCAGTGAAACCTGCATATGTCTCCTTTTGTGCACAAATA
AATAACATCTTCAGCGTTCTGGACAAAGTGTGTGAAAATAAAACGACCGTAGTTTCTCTGAAGAATACAACTGAGAGCTTTGTCCCTGTG
CTTAAACAAATATCCACGTGGACTAAATTCACCAAGGAAGAGACGTCCTCCCTGGCCACAGTCTTCCTGGAGAGTGTGGAAAGCATGACA
CTGGCATCTTTTTGGAAACCCTCAGCAAATATCACTCCGGCTGTTCGGACGGAATACTTAGACATTGAGAGCAAAGTTATCAACAAAGAA
TGCAGTGAAGAGAATGTGACGTTGGACTTGGTAGCCAAGGGGGATAAGATGAAGATCGGGTGTTCCACAATTGAGGAATCTGAATCCACA
GAGACCACTGGTGTGGCTTTTGTCTCCTTTGTGGGCATGGAATCGGTTTTAAATGAGCGCTTCTTCAAAGACCACCAGGCTCCCTTGACC
ACCTCTGAGATCAAGCTGAAGATGAATTCTCGAGTCGTTGGGGGCATAATGACTGGAGAGAAGAAAGACGGCTTCTCAGATCCAATCATC
TACACTCTGGAGAACATTCAGCCAAAGCAGAAGTTTGAGAGGCCCATCTGTGTTTCCTGGAGCACTGATGTGAAGGGTGGAAGATGGACA
TCCTTTGGCTGTGTGATCCTGGAAGCTTCTGAGACATATACCATCTGCAGCTGTAATCAGATGGCAAATCTTGCCGTTATCATGGCGTCT
GGGGAGCTCACGATGGGCTGCGCCATCATCGCGGGCTTCCTGCACTACCTTTTCCTTGCCTGCTTCTTCTGGATGCTGGTGGAGGCTGTG
ATACTGTTCTTGATGGTCAGAAACCTGAAGGTGGTGAATTACTTCAGCTCTCGCAACATCAAGATGCTGCACATCTGTGCCTTTGGTTAT
GGGCTGCCGATGCTGGTGGTGGTGATCTCTGCCAGTGTGCAGCCACAGGGCTATGGAATGCATAATCGCTGCTGGCTGAATACAGAGACA
GGGTTCATCTGGAGTTTCTTGGGGCCAGTTTGCACAGTTATAGTGATCAACTCCCTTCTCCTGACCTGGACCTTGTGGATCCTGAGGCAG
AGGCTTTCCAGTGTTAATGCCGAAGTCTCAACGCTAAAAGACACCAGGTTACTGACCTTCAAGGCCTTTGCCCAGCTCTTCATCCTGGGC
TGCTCCTGGGTGCTGGGCATTTTTCAGATTGGACCTGTGGCAGGTGTCATGGCTTACCTGTTCACCATCATCAACAGCCTGCAGGGGGCC
TTCATCTTCCTCATCCACTGTCTGCTCAACGGCCAGGTACGAGAAGAATACAAGAGGTGGATCACTGGGAAGACGAAGCCCAGCTCCCAG
TCCCAGACCTCAAGGATCTTGCTGTCCTCCATGCCATCCGCTTCCAAGACGGGTTAAAGTCCTTTCTTGCTTTCAAATATGCTATGGAGC
CACAGTTGAGGACAGTAGTTTCCTGCAGGAGCCTACCCTGAAATCTCTTCTCAGCTTAACATGGAAATGAGGATCCCACCAGCCCCAGAA
CCCTCTGGGGAAGAATGTTGGGGGCGGTCTTCCTGTGGTTGTATGCACTGATGAGAAATCAGGCGTTTCTGCTCCAAACGACCATTTTAT
CTTCGTGCTCTGCAACTTCTTCAATTCCAGAGTTTCTGAGAACAGACCCAAATTCAATGGCATGACCAAGAACACCTGGCTACCATTTTG
TTTTCTCCTGCCCTTGTTGGTGCATGGTTCTAAGCATGCCCCTCCAGAGCCTATCATACGCCTGATACAGAGAACCTCTCAATAAATGAT

>23694_23694_6_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000359526_EMR1_chr19_6890492_ENST00000250572_length(amino acids)=807AA_BP=
MHSWEGHIRPTRKPNTKGNNCRDSTLCPAYATCTNTVDSYYCACKQGFLSSNGQNHFKDPGVRCKDIDECSQSPQPCGPNSSCKNLSGRY
KCSCLDGFSSPTGNDWVPGKPGNFSCTDINECLTSSVCPEHSDCVNSMGSYSCSCQVGFISRNSTCEDVDECADPRACPEHATCNNTVGN
YSCFCNPGFESSSGHLSFQGLKASCEDIDECTEMCPINSTCTNTPGSYFCTCHPGFAPSNGQLNFTDQGVECRDIDECRQDPSTCGPNSI
CTNALGSYSCGCIAGFHPNPEGSQKDGNFSCQRVLFKCKEDVIPDNKQIQQCQEGTAVKPAYVSFCAQINNIFSVLDKVCENKTTVVSLK
NTTESFVPVLKQISTWTKFTKEETSSLATVFLESVESMTLASFWKPSANITPAVRTEYLDIESKVINKECSEENVTLDLVAKGDKMKIGC
STIEESESTETTGVAFVSFVGMESVLNERFFKDHQAPLTTSEIKLKMNSRVVGGIMTGEKKDGFSDPIIYTLENIQPKQKFERPICVSWS
TDVKGGRWTSFGCVILEASETYTICSCNQMANLAVIMASGELTMGCAIIAGFLHYLFLACFFWMLVEAVILFLMVRNLKVVNYFSSRNIK
MLHICAFGYGLPMLVVVISASVQPQGYGMHNRCWLNTETGFIWSFLGPVCTVIVINSLLLTWTLWILRQRLSSVNAEVSTLKDTRLLTFK

--------------------------------------------------------------
>23694_23694_7_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000359526_EMR1_chr19_6890492_ENST00000312053_length(transcript)=4732nt_BP=1672nt
GGCTCCGTTCCATCCTTCTGCACAGGGTATCGCCTCTCTCCGTTTGGTACATCCCCTCCTCCCCCACGCCCGGACTGGGGTGGTAGACGC
CGCCTCCGCTCATCGCCCCTCCCCATCGGTTTCCGCGCGAAAAGCCGGGGCGCCTGCGCTGCCGCCGCCGCGTCTGCTGAAGCCTCCGAG
ATGCCGGCGCGTACCGCCCCAGCCCGGGTGCCCACACTGGCCGTCCCGGCCATCTCGCTGCCCGACGATGTCCGCAGGCGGCTCAAAGAT
TTGGAAAGAGACAGCTTAACAGAAAAGGAATGTGTGAAGGAGAAATTGAATCTCTTGCACGAATTTCTGCAAACAGAAATAAAGAATCAG
TTATGTGACTTGGAAACCAAATTACGTAAAGAAGAATTATCCGAGGAGGGCTACCTGGCTAAAGTCAAATCCCTTTTAAATAAAGATTTG
TCCTTGGAGAACGGTGCTCATGCTTACAACCGGGAAGTGAATGGACGTCTAGAAAACGGGAACCAAGCAAGAAGTGAAGCCCGTAGAGTG
GGAATGGCAGATGCCAACAGCCCCCCCAAACCCCTTTCCAAACCTCGCACGCCCAGGAGGAGCAAGTCCGATGGAGAGGCTAAGCGTTCA
AGAGACCCTCCTGCCTCAGCCTCCCAAGTAACTGGGATTAGAGCTGAACCTTCACCTAGCCCCAGGATTACAAGGAAAAGCACCAGGCAA
ACCACCATCACATCTCATTTTGCAAAGGGCCCTGCCAAACGGAAACCTCAGGAAGAGTCTGAAAGAGCCAAATCGGATGAGTCCATCAAG
GAAGAAGACAAAGACCAGGATGAGAAGAGACGTAGAGTTACATCCAGAGAACGAGTTGCTAGACCGCTTCCTGCAGAAGAACCTGAAAGA
GCAAAATCAGGAACGCGCACTGAAAAGGAAGAAGAAAGAGATGAAAAAGAAGAAAAGAGACTCCGAAGTCAAACCAAAGAACCAACACCC
AAACAGAAACTGAAGGAGGAGCCGGACAGAGAAGCCAGGGCAGGCGTGCAGGCTGACGAGGACGAAGATGGAGACGAGAAAGATGAGAAG
AAGCACAGAAGTCAACCCAAAGATCTAGCTGCCAAACGGAGGCCCGAAGAAAAAGAACCTGAAAAAGTAAATCCACAGATTTCTGATGAA
AAAGACGAGGATGAAAAGGAGGAGAAGAGACGCAAAACGACCCCCAAAGAACCAACGGAGAAAAAAATGGCTCGCGCCAAAACAGTCATG
AACTCCAAGACCCACCCTCCCAAGTGCATTCAGTGCGGGCAGTACCTGGACGACCCTGACCTCAAATATGGGCAGCACCCACCAGACGCG
GTGGATGAGCCACAGATGCTGACAAATGAGAAGCTGTCCATCTTTGATGCCAACGAGTCTGGCTTTGAGAGTTATGAGGCGCTTCCCCAG
CACAAACTGACCTGCTTCAGTGTGTACTGTAAGCACGGTCACCTGTGTCCCATCGACACCGGCCTCATCGAGAAGAATATCGAACTCTTC
TTTTCTGGTTCAGCAAAACCAATCTATGATGATGACCCATCTCTTGAAGGTGGTGTTAATGGCAAAAATCTTGGCCCCATAAATGAATGG
TGGATCACTGGCTTTGATGGAGGTGAAAAGGCCCTCATCGGCTTCAGCACCTGATGTTGTGTTATGCACAGCTGGGAAGGGCACATAAGA
CCCACACGGAAACCAAACACAAAGGGTAATAACTGTAGAGACAGTACCTTGTGCCCAGCTTATGCCACCTGCACCAATACAGTGGACAGT
TACTATTGCGCTTGCAAACAAGGCTTCCTGTCCAGCAATGGGCAAAATCACTTCAAGGATCCAGGAGTGCGATGCAAAGATATTGATGAA
TGTTCTCAAAGCCCCCAGCCCTGTGGTCCTAACTCATCCTGCAAAAACCTGTCAGGGAGGTACAAGTGCAGCTGTTTAGATGGTTTCTCT
TCTCCCACTGGAAATGACTGGGTCCCAGGAAAGCCGGGCAATTTCTCCTGTACTGATATCAATGAGTGCCTCACCAGCAGCGTCTGCCCT
GAGCATTCTGACTGTGTCAACTCCATGGGAAGCTACAGTTGCAGCTGTCAAGTTGGATTCATCTCTAGAAACTCCACCTGTGAAGACGTG
GATGAATGTGCAGATCCAAGAGCTTGCCCAGAGCATGCAACTTGTAATAACACTGTTGGAAACTACTCTTGTTTCTGCAACCCAGGATTT
GAATCCAGCAGTGGCCACTTGAGTTTCCAGGGTCTCAAAGCATCGTGTGAAGATATTGATGAATGCACTGAAATGTGCCCCATCAATTCA
ACATGCACCAACACTCCTGGGAGCTACTTTTGCACCTGCCACCCTGGCTTTGCACCAAGCAATGGACAGTTGAATTTCACAGACCAAGGA
GTGGAATGTAGAGATATTGATGAGTGCCGCCAAGATCCATCAACCTGTGGTCCTAATTCTATCTGCACCAATGCCCTGGGCTCCTACAGC
TGTGGCTGCATTGCAGGCTTTCATCCCAATCCAGAAGGCTCCCAGAAAGATGGCAACTTCAGCTGCCAAAGGGTTCTCTTCAAATGTAAG
GAAGATGTGATACCCGATAATAAGCAGATCCAGCAATGCCAAGAGGGAACCGCAGTGAAACCTGCATATGTCTCCTTTTGTGCACAAATA
AATAACATCTTCAGCGTTCTGGACAAAGTGTGTGAAAATAAAACGACCGTAGTTTCTCTGAAGAATACAACTGAGAGCTTTGTCCCTGTG
CTTAAACAAATATCCACGTGGACTAAATTCACCAAGGAAGAGACGTCCTCCCTGGCCACAGTCTTCCTGGAGAGTGTGGAAAGCATGACA
CTGGCATCTTTTTGGAAACCCTCAGCAAATATCACTCCGGCTGTTCGGACGGAATACTTAGACATTGAGAGCAAAGTTATCAACAAAGAA
TGCAGTGAAGAGAATGTGACGTTGGACTTGGTAGCCAAGGGGGATAAGATGAAGATCGGGTGTTCCACAATTGAGGAATCTGAATCCACA
GAGACCACTGGTGTGGCTTTTGTCTCCTTTGTGGGCATGGAATCGGTTTTAAATGAGCGCTTCTTCAAAGACCACCAGGCTCCCTTGACC
ACCTCTGAGATCAAGCTGAAGATGAATTCTCGAGTCGTTGGGGGCATAATGACTGGAGAGAAGAAAGACGGCTTCTCAGATCCAATCATC
TACACTCTGGAGAACATTCAGCCAAAGCAGAAGTTTGAGAGGCCCATCTGTGTTTCCTGGAGCACTGATGTGAAGGGTGGAAGATGGACA
TCCTTTGGCTGTGTGATCCTGGAAGCTTCTGAGACATATACCATCTGCAGCTGTAATCAGATGGCAAATCTTGCCGTTATCATGGCGTCT
GGGGAGCTCACGATGGACTTTTCCTTGTACATCATTAGCCATGTAGGCATTATCATCTCCTTGGTGTGCCTCGTCTTGGCCATCGCCACC
TTTCTGCTGTGTCGCTCCATCCGAAATCACAACACCTACCTCCACCTGCACCTCTGCGTGTGTCTCCTCTTGGCGAAGACTCTCTTCCTC
GCCGGTATACACAAGACTGACAACAAGATGGGCTGCGCCATCATCGCGGGCTTCCTGCACTACCTTTTCCTTGCCTGCTTCTTCTGGATG
CTGGTGGAGGCTGTGATACTGTTCTTGATGGTCAGAAACCTGAAGGTGGTGAATTACTTCAGCTCTCGCAACATCAAGATGCTGCACATC
TGTGCCTTTGGTTATGGGCTGCCGATGCTGGTGGTGGTGATCTCTGCCAGTGTGCAGCCACAGGGCTATGGAATGCATAATCGCTGCTGG
CTGAATACAGAGACAGGGTTCATCTGGAGTTTCTTGGGGCCAGTTTGCACAGTTATAGTGATCAACTCCCTTCTCCTGACCTGGACCTTG
TGGATCCTGAGGCAGAGGCTTTCCAGTGTTAATGCCGAAGTCTCAACGCTAAAAGACACCAGGTTACTGACCTTCAAGGCCTTTGCCCAG
CTCTTCATCCTGGGCTGCTCCTGGGTGCTGGGCATTTTTCAGATTGGACCTGTGGCAGGTGTCATGGCTTACCTGTTCACCATCATCAAC
AGCCTGCAGGGGGCCTTCATCTTCCTCATCCACTGTCTGCTCAACGGCCAGGTACGAGAAGAATACAAGAGGTGGATCACTGGGAAGACG
AAGCCCAGCTCCCAGTCCCAGACCTCAAGGATCTTGCTGTCCTCCATGCCATCCGCTTCCAAGACGGGTTAAAGTCCTTTCTTGCTTTCA
AATATGCTATGGAGCCACAGTTGAGGACAGTAGTTTCCTGCAGGAGCCTACCCTGAAATCTCTTCTCAGCTTAACATGGAAATGAGGATC
CCACCAGCCCCAGAACCCTCTGGGGAAGAATGTTGGGGGCGGTCTTCCTGTGGTTGTATGCACTGATGAGAAATCAGGCGTTTCTGCTCC
AAACGACCATTTTATCTTCGTGCTCTGCAACTTCTTCAATTCCAGAGTTTCTGAGAACAGACCCAAATTCAATGGCATGACCAAGAACAC
CTGGCTACCATTTTGTTTTCTCCTGCCCTTGTTGGTGCATGGTTCTAAGCATGCCCCTCCAGAGCCTATCATACGCCTGATACAGAGAAC

>23694_23694_7_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000359526_EMR1_chr19_6890492_ENST00000312053_length(amino acids)=872AA_BP=
MHSWEGHIRPTRKPNTKGNNCRDSTLCPAYATCTNTVDSYYCACKQGFLSSNGQNHFKDPGVRCKDIDECSQSPQPCGPNSSCKNLSGRY
KCSCLDGFSSPTGNDWVPGKPGNFSCTDINECLTSSVCPEHSDCVNSMGSYSCSCQVGFISRNSTCEDVDECADPRACPEHATCNNTVGN
YSCFCNPGFESSSGHLSFQGLKASCEDIDECTEMCPINSTCTNTPGSYFCTCHPGFAPSNGQLNFTDQGVECRDIDECRQDPSTCGPNSI
CTNALGSYSCGCIAGFHPNPEGSQKDGNFSCQRVLFKCKEDVIPDNKQIQQCQEGTAVKPAYVSFCAQINNIFSVLDKVCENKTTVVSLK
NTTESFVPVLKQISTWTKFTKEETSSLATVFLESVESMTLASFWKPSANITPAVRTEYLDIESKVINKECSEENVTLDLVAKGDKMKIGC
STIEESESTETTGVAFVSFVGMESVLNERFFKDHQAPLTTSEIKLKMNSRVVGGIMTGEKKDGFSDPIIYTLENIQPKQKFERPICVSWS
TDVKGGRWTSFGCVILEASETYTICSCNQMANLAVIMASGELTMDFSLYIISHVGIIISLVCLVLAIATFLLCRSIRNHNTYLHLHLCVC
LLLAKTLFLAGIHKTDNKMGCAIIAGFLHYLFLACFFWMLVEAVILFLMVRNLKVVNYFSSRNIKMLHICAFGYGLPMLVVVISASVQPQ
GYGMHNRCWLNTETGFIWSFLGPVCTVIVINSLLLTWTLWILRQRLSSVNAEVSTLKDTRLLTFKAFAQLFILGCSWVLGIFQIGPVAGV

--------------------------------------------------------------
>23694_23694_8_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000359526_EMR1_chr19_6890492_ENST00000381404_length(transcript)=4667nt_BP=1672nt
GGCTCCGTTCCATCCTTCTGCACAGGGTATCGCCTCTCTCCGTTTGGTACATCCCCTCCTCCCCCACGCCCGGACTGGGGTGGTAGACGC
CGCCTCCGCTCATCGCCCCTCCCCATCGGTTTCCGCGCGAAAAGCCGGGGCGCCTGCGCTGCCGCCGCCGCGTCTGCTGAAGCCTCCGAG
ATGCCGGCGCGTACCGCCCCAGCCCGGGTGCCCACACTGGCCGTCCCGGCCATCTCGCTGCCCGACGATGTCCGCAGGCGGCTCAAAGAT
TTGGAAAGAGACAGCTTAACAGAAAAGGAATGTGTGAAGGAGAAATTGAATCTCTTGCACGAATTTCTGCAAACAGAAATAAAGAATCAG
TTATGTGACTTGGAAACCAAATTACGTAAAGAAGAATTATCCGAGGAGGGCTACCTGGCTAAAGTCAAATCCCTTTTAAATAAAGATTTG
TCCTTGGAGAACGGTGCTCATGCTTACAACCGGGAAGTGAATGGACGTCTAGAAAACGGGAACCAAGCAAGAAGTGAAGCCCGTAGAGTG
GGAATGGCAGATGCCAACAGCCCCCCCAAACCCCTTTCCAAACCTCGCACGCCCAGGAGGAGCAAGTCCGATGGAGAGGCTAAGCGTTCA
AGAGACCCTCCTGCCTCAGCCTCCCAAGTAACTGGGATTAGAGCTGAACCTTCACCTAGCCCCAGGATTACAAGGAAAAGCACCAGGCAA
ACCACCATCACATCTCATTTTGCAAAGGGCCCTGCCAAACGGAAACCTCAGGAAGAGTCTGAAAGAGCCAAATCGGATGAGTCCATCAAG
GAAGAAGACAAAGACCAGGATGAGAAGAGACGTAGAGTTACATCCAGAGAACGAGTTGCTAGACCGCTTCCTGCAGAAGAACCTGAAAGA
GCAAAATCAGGAACGCGCACTGAAAAGGAAGAAGAAAGAGATGAAAAAGAAGAAAAGAGACTCCGAAGTCAAACCAAAGAACCAACACCC
AAACAGAAACTGAAGGAGGAGCCGGACAGAGAAGCCAGGGCAGGCGTGCAGGCTGACGAGGACGAAGATGGAGACGAGAAAGATGAGAAG
AAGCACAGAAGTCAACCCAAAGATCTAGCTGCCAAACGGAGGCCCGAAGAAAAAGAACCTGAAAAAGTAAATCCACAGATTTCTGATGAA
AAAGACGAGGATGAAAAGGAGGAGAAGAGACGCAAAACGACCCCCAAAGAACCAACGGAGAAAAAAATGGCTCGCGCCAAAACAGTCATG
AACTCCAAGACCCACCCTCCCAAGTGCATTCAGTGCGGGCAGTACCTGGACGACCCTGACCTCAAATATGGGCAGCACCCACCAGACGCG
GTGGATGAGCCACAGATGCTGACAAATGAGAAGCTGTCCATCTTTGATGCCAACGAGTCTGGCTTTGAGAGTTATGAGGCGCTTCCCCAG
CACAAACTGACCTGCTTCAGTGTGTACTGTAAGCACGGTCACCTGTGTCCCATCGACACCGGCCTCATCGAGAAGAATATCGAACTCTTC
TTTTCTGGTTCAGCAAAACCAATCTATGATGATGACCCATCTCTTGAAGGTGGTGTTAATGGCAAAAATCTTGGCCCCATAAATGAATGG
TGGATCACTGGCTTTGATGGAGGTGAAAAGGCCCTCATCGGCTTCAGCACCTGATGTTGTGTTATGCACAGCTGGGAAGGGCACATAAGA
CCCACACGGAAACCAAACACAAAGGGTAATAACTGTAGAGACAGTACCTTGTGCCCAGCTTATGCCACCTGCACCAATACAGTGGACAGT
TACTATTGCGCTTGCAAACAAGGCTTCCTGTCCAGCAATGGGCAAAATCACTTCAAGGATCCAGGAGTGCGATGCAAAGATATCAATGAG
TGCCTCACCAGCAGCGTCTGCCCTGAGCATTCTGACTGTGTCAACTCCATGGGAAGCTACAGTTGCAGCTGTCAAGTTGGATTCATCTCT
AGAAACTCCACCTGTGAAGACGTGGATGAATGTGCAGATCCAAGAGCTTGCCCAGAGCATGCAACTTGTAATAACACTGTTGGAAACTAC
TCTTGTTTCTGCAACCCAGGATTTGAATCCAGCAGTGGCCACTTGAGTTTCCAGGGTCTCAAAGCATCGTGTGAAGATATTGATGAATGC
ACTGAAATGTGCCCCATCAATTCAACATGCACCAACACTCCTGGGAGCTACTTTTGCACCTGCCACCCTGGCTTTGCACCAAGCAATGGA
CAGTTGAATTTCACAGACCAAGGAGTGGAATGTAGAGATATTGATGAGTGCCGCCAAGATCCATCAACCTGTGGTCCTAATTCTATCTGC
ACCAATGCCCTGGGCTCCTACAGCTGTGGCTGCATTGCAGGCTTTCATCCCAATCCAGAAGGCTCCCAGAAAGATGGCAACTTCAGCTGC
CAAAGGGTTCTCTTCAAATGTAAGGAAGATGTGATACCCGATAATAAGCAGATCCAGCAATGCCAAGAGGGAACCGCAGTGAAACCTGCA
TATGTCTCCTTTTGTGCACAAATAAATAACATCTTCAGCGTTCTGGACAAAGTGTGTGAAAATAAAACGACCGTAGTTTCTCTGAAGAAT
ACAACTGAGAGCTTTGTCCCTGTGCTTAAACAAATATCCACGTGGACTAAATTCACCAAGGAAGAGACGTCCTCCCTGGCCACAGTCTTC
CTGGAGAGTGTGGAAAGCATGACACTGGCATCTTTTTGGAAACCCTCAGCAAATATCACTCCGGCTGTTCGGACGGAATACTTAGACATT
GAGAGCAAAGTTATCAACAAAGAATGCAGTGAAGAGAATGTGACGTTGGACTTGGTAGCCAAGGGGGATAAGATGAAGATCGGGTGTTCC
ACAATTGAGGAATCTGAATCCACAGAGACCACTGGTGTGGCTTTTGTCTCCTTTGTGGGCATGGAATCGGTTTTAAATGAGCGCTTCTTC
AAAGACCACCAGGCTCCCTTGACCACCTCTGAGATCAAGCTGAAGATGAATTCTCGAGTCGTTGGGGGCATAATGACTGGAGAGAAGAAA
GACGGCTTCTCAGATCCAATCATCTACACTCTGGAGAACATTCAGCCAAAGCAGAAGTTTGAGAGGCCCATCTGTGTTTCCTGGAGCACT
GATGTGAAGGGTGGAAGATGGACATCCTTTGGCTGTGTGATCCTGGAAGCTTCTGAGACATATACCATCTGCAGCTGTAATCAGATGGCA
AATCTTGCCGTTATCATGGCGTCTGGGGAGCTCACGATGGACTTTTCCTTGTACATCATTAGCCATGTAGGCATTATCATCTCCTTGGTG
TGCCTCGTCTTGGCCATCGCCACCTTTCTGCTGTGTCGCTCCATCCGAAATCACAACACCTACCTCCACCTGCACCTCTGCGTGTGTCTC
CTCTTGGCGAAGACTCTCTTCCTCGCCGGTATACACAAGACTGACAACAAGATGGGCTGCGCCATCATCGCGGGCTTCCTGCACTACCTT
TTCCTTGCCTGCTTCTTCTGGATGCTGGTGGAGGCTGTGATACTGTTCTTGATGGTCAGAAACCTGAAGGTGGTGAATTACTTCAGCTCT
CGCAACATCAAGATGCTGCACATCTGTGCCTTTGGTTATGGGCTGCCGATGCTGGTGGTGGTGATCTCTGCCAGTGTGCAGCCACAGGGC
TATGGAATGCATAATCGCTGCTGGCTGAATACAGAGACAGGGTTCATCTGGAGTTTCTTGGGGCCAGTTTGCACAGTTATAGTGGTAAGC
AAATACTACAACAGCCTGGCGAAGTGTGTTCTGAAGGAGGAGCAAGGAGACCTGCGAGATCTGGAATTTCCAGGGACGTGTGCAGCTGAG
AGGATCAACTCCCTTCTCCTGACCTGGACCTTGTGGATCCTGAGGCAGAGGCTTTCCAGTGTTAATGCCGAAGTCTCAACGCTAAAAGAC
ACCAGGTTACTGACCTTCAAGGCCTTTGCCCAGCTCTTCATCCTGGGCTGCTCCTGGGTGCTGGGCATTTTTCAGATTGGACCTGTGGCA
GGTGTCATGGCTTACCTGTTCACCATCATCAACAGCCTGCAGGGGGCCTTCATCTTCCTCATCCACTGTCTGCTCAACGGCCAGGTACGA
GAAGAATACAAGAGGTGGATCACTGGGAAGACGAAGCCCAGCTCCCAGTCCCAGACCTCAAGGATCTTGCTGTCCTCCATGCCATCCGCT
TCCAAGACGGGTTAAAGTCCTTTCTTGCTTTCAAATATGCTATGGAGCCACAGTTGAGGACAGTAGTTTCCTGCAGGAGCCTACCCTGAA
ATCTCTTCTCAGCTTAACATGGAAATGAGGATCCCACCAGCCCCAGAACCCTCTGGGGAAGAATGTTGGGGGCGGTCTTCCTGTGGTTGT
ATGCACTGATGAGAAATCAGGCGTTTCTGCTCCAAACGACCATTTTATCTTCGTGCTCTGCAACTTCTTCAATTCCAGAGTTTCTGAGAA
CAGACCCAAATTCAATGGCATGACCAAGAACACCTGGCTACCATTTTGTTTTCTCCTGCCCTTGTTGGTGCATGGTTCTAAGCATGCCCC

>23694_23694_8_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000359526_EMR1_chr19_6890492_ENST00000381404_length(amino acids)=853AA_BP=59
MHSWEGHIRPTRKPNTKGNNCRDSTLCPAYATCTNTVDSYYCACKQGFLSSNGQNHFKDPGVRCKDINECLTSSVCPEHSDCVNSMGSYS
CSCQVGFISRNSTCEDVDECADPRACPEHATCNNTVGNYSCFCNPGFESSSGHLSFQGLKASCEDIDECTEMCPINSTCTNTPGSYFCTC
HPGFAPSNGQLNFTDQGVECRDIDECRQDPSTCGPNSICTNALGSYSCGCIAGFHPNPEGSQKDGNFSCQRVLFKCKEDVIPDNKQIQQC
QEGTAVKPAYVSFCAQINNIFSVLDKVCENKTTVVSLKNTTESFVPVLKQISTWTKFTKEETSSLATVFLESVESMTLASFWKPSANITP
AVRTEYLDIESKVINKECSEENVTLDLVAKGDKMKIGCSTIEESESTETTGVAFVSFVGMESVLNERFFKDHQAPLTTSEIKLKMNSRVV
GGIMTGEKKDGFSDPIIYTLENIQPKQKFERPICVSWSTDVKGGRWTSFGCVILEASETYTICSCNQMANLAVIMASGELTMDFSLYIIS
HVGIIISLVCLVLAIATFLLCRSIRNHNTYLHLHLCVCLLLAKTLFLAGIHKTDNKMGCAIIAGFLHYLFLACFFWMLVEAVILFLMVRN
LKVVNYFSSRNIKMLHICAFGYGLPMLVVVISASVQPQGYGMHNRCWLNTETGFIWSFLGPVCTVIVVSKYYNSLAKCVLKEEQGDLRDL
EFPGTCAAERINSLLLTWTLWILRQRLSSVNAEVSTLKDTRLLTFKAFAQLFILGCSWVLGIFQIGPVAGVMAYLFTIINSLQGAFIFLI

--------------------------------------------------------------
>23694_23694_9_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000359526_EMR1_chr19_6890492_ENST00000381407_length(transcript)=4290nt_BP=1672nt
GGCTCCGTTCCATCCTTCTGCACAGGGTATCGCCTCTCTCCGTTTGGTACATCCCCTCCTCCCCCACGCCCGGACTGGGGTGGTAGACGC
CGCCTCCGCTCATCGCCCCTCCCCATCGGTTTCCGCGCGAAAAGCCGGGGCGCCTGCGCTGCCGCCGCCGCGTCTGCTGAAGCCTCCGAG
ATGCCGGCGCGTACCGCCCCAGCCCGGGTGCCCACACTGGCCGTCCCGGCCATCTCGCTGCCCGACGATGTCCGCAGGCGGCTCAAAGAT
TTGGAAAGAGACAGCTTAACAGAAAAGGAATGTGTGAAGGAGAAATTGAATCTCTTGCACGAATTTCTGCAAACAGAAATAAAGAATCAG
TTATGTGACTTGGAAACCAAATTACGTAAAGAAGAATTATCCGAGGAGGGCTACCTGGCTAAAGTCAAATCCCTTTTAAATAAAGATTTG
TCCTTGGAGAACGGTGCTCATGCTTACAACCGGGAAGTGAATGGACGTCTAGAAAACGGGAACCAAGCAAGAAGTGAAGCCCGTAGAGTG
GGAATGGCAGATGCCAACAGCCCCCCCAAACCCCTTTCCAAACCTCGCACGCCCAGGAGGAGCAAGTCCGATGGAGAGGCTAAGCGTTCA
AGAGACCCTCCTGCCTCAGCCTCCCAAGTAACTGGGATTAGAGCTGAACCTTCACCTAGCCCCAGGATTACAAGGAAAAGCACCAGGCAA
ACCACCATCACATCTCATTTTGCAAAGGGCCCTGCCAAACGGAAACCTCAGGAAGAGTCTGAAAGAGCCAAATCGGATGAGTCCATCAAG
GAAGAAGACAAAGACCAGGATGAGAAGAGACGTAGAGTTACATCCAGAGAACGAGTTGCTAGACCGCTTCCTGCAGAAGAACCTGAAAGA
GCAAAATCAGGAACGCGCACTGAAAAGGAAGAAGAAAGAGATGAAAAAGAAGAAAAGAGACTCCGAAGTCAAACCAAAGAACCAACACCC
AAACAGAAACTGAAGGAGGAGCCGGACAGAGAAGCCAGGGCAGGCGTGCAGGCTGACGAGGACGAAGATGGAGACGAGAAAGATGAGAAG
AAGCACAGAAGTCAACCCAAAGATCTAGCTGCCAAACGGAGGCCCGAAGAAAAAGAACCTGAAAAAGTAAATCCACAGATTTCTGATGAA
AAAGACGAGGATGAAAAGGAGGAGAAGAGACGCAAAACGACCCCCAAAGAACCAACGGAGAAAAAAATGGCTCGCGCCAAAACAGTCATG
AACTCCAAGACCCACCCTCCCAAGTGCATTCAGTGCGGGCAGTACCTGGACGACCCTGACCTCAAATATGGGCAGCACCCACCAGACGCG
GTGGATGAGCCACAGATGCTGACAAATGAGAAGCTGTCCATCTTTGATGCCAACGAGTCTGGCTTTGAGAGTTATGAGGCGCTTCCCCAG
CACAAACTGACCTGCTTCAGTGTGTACTGTAAGCACGGTCACCTGTGTCCCATCGACACCGGCCTCATCGAGAAGAATATCGAACTCTTC
TTTTCTGGTTCAGCAAAACCAATCTATGATGATGACCCATCTCTTGAAGGTGGTGTTAATGGCAAAAATCTTGGCCCCATAAATGAATGG
TGGATCACTGGCTTTGATGGAGGTGAAAAGGCCCTCATCGGCTTCAGCACCTGATGTTGTGTTATGCACAGCTGGGAAGGGCACATAAGA
CCCACACGGAAACCAAACACAAAGGGTAATAACTGTAGAGACAGTACCTTGTGCCCAGCTTATGCCACCTGCACCAATACAGTGGACAGT
TACTATTGCGCTTGCAAACAAGGCTTCCTGTCCAGCAATGGGCAAAATCACTTCAAGGATCCAGGAGTGCGATGCAAAGATATTGATGAA
TGCACTGAAATGTGCCCCATCAATTCAACATGCACCAACACTCCTGGGAGCTACTTTTGCACCTGCCACCCTGGCTTTGCACCAAGCAAT
GGACAGTTGAATTTCACAGACCAAGGAGTGGAATGTAGAGATATTGATGAGTGCCGCCAAGATCCATCAACCTGTGGTCCTAATTCTATC
TGCACCAATGCCCTGGGCTCCTACAGCTGTGGCTGCATTGCAGGCTTTCATCCCAATCCAGAAGGCTCCCAGAAAGATGGCAACTTCAGC
TGCCAAAGGGTTCTCTTCAAATGTAAGGAAGATGTGATACCCGATAATAAGCAGATCCAGCAATGCCAAGAGGGAACCGCAGTGAAACCT
GCATATGTCTCCTTTTGTGCACAAATAAATAACATCTTCAGCGTTCTGGACAAAGTGTGTGAAAATAAAACGACCGTAGTTTCTCTGAAG
AATACAACTGAGAGCTTTGTCCCTGTGCTTAAACAAATATCCACGTGGACTAAATTCACCAAGGAAGAGACGTCCTCCCTGGCCACAGTC
TTCCTGGAGAGTGTGGAAAGCATGACACTGGCATCTTTTTGGAAACCCTCAGCAAATATCACTCCGGCTGTTCGGACGGAATACTTAGAC
ATTGAGAGCAAAGTTATCAACAAAGAATGCAGTGAAGAGAATGTGACGTTGGACTTGGTAGCCAAGGGGGATAAGATGAAGATCGGGTGT
TCCACAATTGAGGAATCTGAATCCACAGAGACCACTGGTGTGGCTTTTGTCTCCTTTGTGGGCATGGAATCGGTTTTAAATGAGCGCTTC
TTCAAAGACCACCAGGCTCCCTTGACCACCTCTGAGATCAAGCTGAAGATGAATTCTCGAGTCGTTGGGGGCATAATGACTGGAGAGAAG
AAAGACGGCTTCTCAGATCCAATCATCTACACTCTGGAGAACATTCAGCCAAAGCAGAAGTTTGAGAGGCCCATCTGTGTTTCCTGGAGC
ACTGATGTGAAGGGTGGAAGATGGACATCCTTTGGCTGTGTGATCCTGGAAGCTTCTGAGACATATACCATCTGCAGCTGTAATCAGATG
GCAAATCTTGCCGTTATCATGGCGTCTGGGGAGCTCACGATGGACTTTTCCTTGTACATCATTAGCCATGTAGGCATTATCATCTCCTTG
GTGTGCCTCGTCTTGGCCATCGCCACCTTTCTGCTGTGTCGCTCCATCCGAAATCACAACACCTACCTCCACCTGCACCTCTGCGTGTGT
CTCCTCTTGGCGAAGACTCTCTTCCTCGCCGGTATACACAAGACTGACAACAAGATGGGCTGCGCCATCATCGCGGGCTTCCTGCACTAC
CTTTTCCTTGCCTGCTTCTTCTGGATGCTGGTGGAGGCTGTGATACTGTTCTTGATGGTCAGAAACCTGAAGGTGGTGAATTACTTCAGC
TCTCGCAACATCAAGATGCTGCACATCTGTGCCTTTGGTTATGGGCTGCCGATGCTGGTGGTGGTGATCTCTGCCAGTGTGCAGCCACAG
GGCTATGGAATGCATAATCGCTGCTGGCTGAATACAGAGACAGGGTTCATCTGGAGTTTCTTGGGGCCAGTTTGCACAGTTATAGTGATC
AACTCCCTTCTCCTGACCTGGACCTTGTGGATCCTGAGGCAGAGGCTTTCCAGTGTTAATGCCGAAGTCTCAACGCTAAAAGACACCAGG
TTACTGACCTTCAAGGCCTTTGCCCAGCTCTTCATCCTGGGCTGCTCCTGGGTGCTGGGCATTTTTCAGATTGGACCTGTGGCAGGTGTC
ATGGCTTACCTGTTCACCATCATCAACAGCCTGCAGGGGGCCTTCATCTTCCTCATCCACTGTCTGCTCAACGGCCAGGTACGAGAAGAA
TACAAGAGGTGGATCACTGGGAAGACGAAGCCCAGCTCCCAGTCCCAGACCTCAAGGATCTTGCTGTCCTCCATGCCATCCGCTTCCAAG
ACGGGTTAAAGTCCTTTCTTGCTTTCAAATATGCTATGGAGCCACAGTTGAGGACAGTAGTTTCCTGCAGGAGCCTACCCTGAAATCTCT
TCTCAGCTTAACATGGAAATGAGGATCCCACCAGCCCCAGAACCCTCTGGGGAAGAATGTTGGGGGCGGTCTTCCTGTGGTTGTATGCAC
TGATGAGAAATCAGGCGTTTCTGCTCCAAACGACCATTTTATCTTCGTGCTCTGCAACTTCTTCAATTCCAGAGTTTCTGAGAACAGACC
CAAATTCAATGGCATGACCAAGAACACCTGGCTACCATTTTGTTTTCTCCTGCCCTTGTTGGTGCATGGTTCTAAGCATGCCCCTCCAGA

>23694_23694_9_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000359526_EMR1_chr19_6890492_ENST00000381407_length(amino acids)=731AA_BP=
MHSWEGHIRPTRKPNTKGNNCRDSTLCPAYATCTNTVDSYYCACKQGFLSSNGQNHFKDPGVRCKDIDECTEMCPINSTCTNTPGSYFCT
CHPGFAPSNGQLNFTDQGVECRDIDECRQDPSTCGPNSICTNALGSYSCGCIAGFHPNPEGSQKDGNFSCQRVLFKCKEDVIPDNKQIQQ
CQEGTAVKPAYVSFCAQINNIFSVLDKVCENKTTVVSLKNTTESFVPVLKQISTWTKFTKEETSSLATVFLESVESMTLASFWKPSANIT
PAVRTEYLDIESKVINKECSEENVTLDLVAKGDKMKIGCSTIEESESTETTGVAFVSFVGMESVLNERFFKDHQAPLTTSEIKLKMNSRV
VGGIMTGEKKDGFSDPIIYTLENIQPKQKFERPICVSWSTDVKGGRWTSFGCVILEASETYTICSCNQMANLAVIMASGELTMDFSLYII
SHVGIIISLVCLVLAIATFLLCRSIRNHNTYLHLHLCVCLLLAKTLFLAGIHKTDNKMGCAIIAGFLHYLFLACFFWMLVEAVILFLMVR
NLKVVNYFSSRNIKMLHICAFGYGLPMLVVVISASVQPQGYGMHNRCWLNTETGFIWSFLGPVCTVIVINSLLLTWTLWILRQRLSSVNA
EVSTLKDTRLLTFKAFAQLFILGCSWVLGIFQIGPVAGVMAYLFTIINSLQGAFIFLIHCLLNGQVREEYKRWITGKTKPSSQSQTSRIL

--------------------------------------------------------------
>23694_23694_10_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000359526_EMR1_chr19_6890492_ENST00000450315_length(transcript)=4176nt_BP=1672nt
GGCTCCGTTCCATCCTTCTGCACAGGGTATCGCCTCTCTCCGTTTGGTACATCCCCTCCTCCCCCACGCCCGGACTGGGGTGGTAGACGC
CGCCTCCGCTCATCGCCCCTCCCCATCGGTTTCCGCGCGAAAAGCCGGGGCGCCTGCGCTGCCGCCGCCGCGTCTGCTGAAGCCTCCGAG
ATGCCGGCGCGTACCGCCCCAGCCCGGGTGCCCACACTGGCCGTCCCGGCCATCTCGCTGCCCGACGATGTCCGCAGGCGGCTCAAAGAT
TTGGAAAGAGACAGCTTAACAGAAAAGGAATGTGTGAAGGAGAAATTGAATCTCTTGCACGAATTTCTGCAAACAGAAATAAAGAATCAG
TTATGTGACTTGGAAACCAAATTACGTAAAGAAGAATTATCCGAGGAGGGCTACCTGGCTAAAGTCAAATCCCTTTTAAATAAAGATTTG
TCCTTGGAGAACGGTGCTCATGCTTACAACCGGGAAGTGAATGGACGTCTAGAAAACGGGAACCAAGCAAGAAGTGAAGCCCGTAGAGTG
GGAATGGCAGATGCCAACAGCCCCCCCAAACCCCTTTCCAAACCTCGCACGCCCAGGAGGAGCAAGTCCGATGGAGAGGCTAAGCGTTCA
AGAGACCCTCCTGCCTCAGCCTCCCAAGTAACTGGGATTAGAGCTGAACCTTCACCTAGCCCCAGGATTACAAGGAAAAGCACCAGGCAA
ACCACCATCACATCTCATTTTGCAAAGGGCCCTGCCAAACGGAAACCTCAGGAAGAGTCTGAAAGAGCCAAATCGGATGAGTCCATCAAG
GAAGAAGACAAAGACCAGGATGAGAAGAGACGTAGAGTTACATCCAGAGAACGAGTTGCTAGACCGCTTCCTGCAGAAGAACCTGAAAGA
GCAAAATCAGGAACGCGCACTGAAAAGGAAGAAGAAAGAGATGAAAAAGAAGAAAAGAGACTCCGAAGTCAAACCAAAGAACCAACACCC
AAACAGAAACTGAAGGAGGAGCCGGACAGAGAAGCCAGGGCAGGCGTGCAGGCTGACGAGGACGAAGATGGAGACGAGAAAGATGAGAAG
AAGCACAGAAGTCAACCCAAAGATCTAGCTGCCAAACGGAGGCCCGAAGAAAAAGAACCTGAAAAAGTAAATCCACAGATTTCTGATGAA
AAAGACGAGGATGAAAAGGAGGAGAAGAGACGCAAAACGACCCCCAAAGAACCAACGGAGAAAAAAATGGCTCGCGCCAAAACAGTCATG
AACTCCAAGACCCACCCTCCCAAGTGCATTCAGTGCGGGCAGTACCTGGACGACCCTGACCTCAAATATGGGCAGCACCCACCAGACGCG
GTGGATGAGCCACAGATGCTGACAAATGAGAAGCTGTCCATCTTTGATGCCAACGAGTCTGGCTTTGAGAGTTATGAGGCGCTTCCCCAG
CACAAACTGACCTGCTTCAGTGTGTACTGTAAGCACGGTCACCTGTGTCCCATCGACACCGGCCTCATCGAGAAGAATATCGAACTCTTC
TTTTCTGGTTCAGCAAAACCAATCTATGATGATGACCCATCTCTTGAAGGTGGTGTTAATGGCAAAAATCTTGGCCCCATAAATGAATGG
TGGATCACTGGCTTTGATGGAGGTGAAAAGGCCCTCATCGGCTTCAGCACCTGATGTTGTGTTATGCACAGCTGGGAAGGGCACATAAGA
CCCACACGGAAACCAAACACAAAGGGTAATAACTGTAGAGACAGTACCTTGTGCCCAGCTTATGCCACCTGCACCAATACAGTGGACAGT
TACTATTGCGCTTGCAAACAAGGCTTCCTGTCCAGCAATGGGCAAAATCACTTCAAGGATCCAGGAGTGCGATGCAAAGATATTGATGAA
TGTTCTCAAAGCCCCCAGCCCTGTGGTCCTAACTCATCCTGCAAAAACCTGTCAGGGAGGTACAAGTGCAGCTGTTTAGATGGTTTCTCT
TCTCCCACTGGAAATGACTGGGTCCCAGGAAAGCCGGGCAATTTCTCCTGTACTGATATCAATGAGTGCCTCACCAGCAGGGTTCTCTTC
AAATGTAAGGAAGATGTGATACCCGATAATAAGCAGATCCAGCAATGCCAAGAGGGAACCGCAGTGAAACCTGCATATGTCTCCTTTTGT
GCACAAATAAATAACATCTTCAGCGTTCTGGACAAAGTGTGTGAAAATAAAACGACCGTAGTTTCTCTGAAGAATACAACTGAGAGCTTT
GTCCCTGTGCTTAAACAAATATCCACGTGGACTAAATTCACCAAGGAAGAGACGTCCTCCCTGGCCACAGTCTTCCTGGAGAGTGTGGAA
AGCATGACACTGGCATCTTTTTGGAAACCCTCAGCAAATATCACTCCGGCTGTTCGGACGGAATACTTAGACATTGAGAGCAAAGTTATC
AACAAAGAATGCAGTGAAGAGAATGTGACGTTGGACTTGGTAGCCAAGGGGGATAAGATGAAGATCGGGTGTTCCACAATTGAGGAATCT
GAATCCACAGAGACCACTGGTGTGGCTTTTGTCTCCTTTGTGGGCATGGAATCGGTTTTAAATGAGCGCTTCTTCAAAGACCACCAGGCT
CCCTTGACCACCTCTGAGATCAAGCTGAAGATGAATTCTCGAGTCGTTGGGGGCATAATGACTGGAGAGAAGAAAGACGGCTTCTCAGAT
CCAATCATCTACACTCTGGAGAACATTCAGCCAAAGCAGAAGTTTGAGAGGCCCATCTGTGTTTCCTGGAGCACTGATGTGAAGGGTGGA
AGATGGACATCCTTTGGCTGTGTGATCCTGGAAGCTTCTGAGACATATACCATCTGCAGCTGTAATCAGATGGCAAATCTTGCCGTTATC
ATGGCGTCTGGGGAGCTCACGATGGACTTTTCCTTGTACATCATTAGCCATGTAGGCATTATCATCTCCTTGGTGTGCCTCGTCTTGGCC
ATCGCCACCTTTCTGCTGTGTCGCTCCATCCGAAATCACAACACCTACCTCCACCTGCACCTCTGCGTGTGTCTCCTCTTGGCGAAGACT
CTCTTCCTCGCCGGTATACACAAGACTGACAACAAGATGGGCTGCGCCATCATCGCGGGCTTCCTGCACTACCTTTTCCTTGCCTGCTTC
TTCTGGATGCTGGTGGAGGCTGTGATACTGTTCTTGATGGTCAGAAACCTGAAGGTGGTGAATTACTTCAGCTCTCGCAACATCAAGATG
CTGCACATCTGTGCCTTTGGTTATGGGCTGCCGATGCTGGTGGTGGTGATCTCTGCCAGTGTGCAGCCACAGGGCTATGGAATGCATAAT
CGCTGCTGGCTGAATACAGAGACAGGGTTCATCTGGAGTTTCTTGGGGCCAGTTTGCACAGTTATAGTGATCAACTCCCTTCTCCTGACC
TGGACCTTGTGGATCCTGAGGCAGAGGCTTTCCAGTGTTAATGCCGAAGTCTCAACGCTAAAAGACACCAGGTTACTGACCTTCAAGGCC
TTTGCCCAGCTCTTCATCCTGGGCTGCTCCTGGGTGCTGGGCATTTTTCAGATTGGACCTGTGGCAGGTGTCATGGCTTACCTGTTCACC
ATCATCAACAGCCTGCAGGGGGCCTTCATCTTCCTCATCCACTGTCTGCTCAACGGCCAGGTACGAGAAGAATACAAGAGGTGGATCACT
GGGAAGACGAAGCCCAGCTCCCAGTCCCAGACCTCAAGGATCTTGCTGTCCTCCATGCCATCCGCTTCCAAGACGGGTTAAAGTCCTTTC
TTGCTTTCAAATATGCTATGGAGCCACAGTTGAGGACAGTAGTTTCCTGCAGGAGCCTACCCTGAAATCTCTTCTCAGCTTAACATGGAA
ATGAGGATCCCACCAGCCCCAGAACCCTCTGGGGAAGAATGTTGGGGGCGGTCTTCCTGTGGTTGTATGCACTGATGAGAAATCAGGCGT
TTCTGCTCCAAACGACCATTTTATCTTCGTGCTCTGCAACTTCTTCAATTCCAGAGTTTCTGAGAACAGACCCAAATTCAATGGCATGAC
CAAGAACACCTGGCTACCATTTTGTTTTCTCCTGCCCTTGTTGGTGCATGGTTCTAAGCATGCCCCTCCAGAGCCTATCATACGCCTGAT

>23694_23694_10_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000359526_EMR1_chr19_6890492_ENST00000450315_length(amino acids)=695AA_BP=
MHSWEGHIRPTRKPNTKGNNCRDSTLCPAYATCTNTVDSYYCACKQGFLSSNGQNHFKDPGVRCKDIDECSQSPQPCGPNSSCKNLSGRY
KCSCLDGFSSPTGNDWVPGKPGNFSCTDINECLTSRVLFKCKEDVIPDNKQIQQCQEGTAVKPAYVSFCAQINNIFSVLDKVCENKTTVV
SLKNTTESFVPVLKQISTWTKFTKEETSSLATVFLESVESMTLASFWKPSANITPAVRTEYLDIESKVINKECSEENVTLDLVAKGDKMK
IGCSTIEESESTETTGVAFVSFVGMESVLNERFFKDHQAPLTTSEIKLKMNSRVVGGIMTGEKKDGFSDPIIYTLENIQPKQKFERPICV
SWSTDVKGGRWTSFGCVILEASETYTICSCNQMANLAVIMASGELTMDFSLYIISHVGIIISLVCLVLAIATFLLCRSIRNHNTYLHLHL
CVCLLLAKTLFLAGIHKTDNKMGCAIIAGFLHYLFLACFFWMLVEAVILFLMVRNLKVVNYFSSRNIKMLHICAFGYGLPMLVVVISASV
QPQGYGMHNRCWLNTETGFIWSFLGPVCTVIVINSLLLTWTLWILRQRLSSVNAEVSTLKDTRLLTFKAFAQLFILGCSWVLGIFQIGPV

--------------------------------------------------------------
>23694_23694_11_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000540357_EMR1_chr19_6890492_ENST00000250572_length(transcript)=4489nt_BP=1624nt
GGCTCCGTTCCATCCTTCTGCACAGGGTATCGCCTCTCTCCGTTTGGTACATCCCCTCCTCCCCCACGCCCGGACTGGGGTGGTAGACGC
CGCCTCCGCTCATCGCCCCTCCCCATCGGTTTCCGCGCGAAAAGCCGGGGCGCCTGCGCTGCCGCCGCCGCGTCTGCTGAAGCCTCCGAG
ATGCCGGCGCGTACCGCCCCAGCCCGGGTGCCCACACTGGCCGTCCCGGCCATCTCGCTGCCCGACGATGTCCGCAGGCGGCTCAAAGAT
TTGGAAAGAGACAGCTTAACAGAAAAGGAATGTGTGAAGGAGAAATTGAATCTCTTGCACGAATTTCTGCAAACAGAAATAAAGAATCAG
TTATGTGACTTGGAAACCAAATTACGTAAAGAAGAATTATCCGAGGAGGGCTACCTGGCTAAAGTCAAATCCCTTTTAAATAAAGATTTG
TCCTTGGAGAACGGTGCTCATGCTTACAACCGGGAAGTGAATGGACGTCTAGAAAACGGGAACCAAGCAAGAAGTGAAGCCCGTAGAGTG
GGAATGGCAGATGCCAACAGCCCCCCCAAACCCCTTTCCAAACCTCGCACGCCCAGGAGGAGCAAGTCCGATGGAGAGGCTAAGCCTGAA
CCTTCACCTAGCCCCAGGATTACAAGGAAAAGCACCAGGCAAACCACCATCACATCTCATTTTGCAAAGGGCCCTGCCAAACGGAAACCT
CAGGAAGAGTCTGAAAGAGCCAAATCGGATGAGTCCATCAAGGAAGAAGACAAAGACCAGGATGAGAAGAGACGTAGAGTTACATCCAGA
GAACGAGTTGCTAGACCGCTTCCTGCAGAAGAACCTGAAAGAGCAAAATCAGGAACGCGCACTGAAAAGGAAGAAGAAAGAGATGAAAAA
GAAGAAAAGAGACTCCGAAGTCAAACCAAAGAACCAACACCCAAACAGAAACTGAAGGAGGAGCCGGACAGAGAAGCCAGGGCAGGCGTG
CAGGCTGACGAGGACGAAGATGGAGACGAGAAAGATGAGAAGAAGCACAGAAGTCAACCCAAAGATCTAGCTGCCAAACGGAGGCCCGAA
GAAAAAGAACCTGAAAAAGTAAATCCACAGATTTCTGATGAAAAAGACGAGGATGAAAAGGAGGAGAAGAGACGCAAAACGACCCCCAAA
GAACCAACGGAGAAAAAAATGGCTCGCGCCAAAACAGTCATGAACTCCAAGACCCACCCTCCCAAGTGCATTCAGTGCGGGCAGTACCTG
GACGACCCTGACCTCAAATATGGGCAGCACCCACCAGACGCGGTGGATGAGCCACAGATGCTGACAAATGAGAAGCTGTCCATCTTTGAT
GCCAACGAGTCTGGCTTTGAGAGTTATGAGGCGCTTCCCCAGCACAAACTGACCTGCTTCAGTGTGTACTGTAAGCACGGTCACCTGTGT
CCCATCGACACCGGCCTCATCGAGAAGAATATCGAACTCTTCTTTTCTGGTTCAGCAAAACCAATCTATGATGATGACCCATCTCTTGAA
GGTGGTGTTAATGGCAAAAATCTTGGCCCCATAAATGAATGGTGGATCACTGGCTTTGATGGAGGTGAAAAGGCCCTCATCGGCTTCAGC
ACCTGATGTTGTGTTATGCACAGCTGGGAAGGGCACATAAGACCCACACGGAAACCAAACACAAAGGGTAATAACTGTAGAGACAGTACC
TTGTGCCCAGCTTATGCCACCTGCACCAATACAGTGGACAGTTACTATTGCGCTTGCAAACAAGGCTTCCTGTCCAGCAATGGGCAAAAT
CACTTCAAGGATCCAGGAGTGCGATGCAAAGATATTGATGAATGTTCTCAAAGCCCCCAGCCCTGTGGTCCTAACTCATCCTGCAAAAAC
CTGTCAGGGAGGTACAAGTGCAGCTGTTTAGATGGTTTCTCTTCTCCCACTGGAAATGACTGGGTCCCAGGAAAGCCGGGCAATTTCTCC
TGTACTGATATCAATGAGTGCCTCACCAGCAGCGTCTGCCCTGAGCATTCTGACTGTGTCAACTCCATGGGAAGCTACAGTTGCAGCTGT
CAAGTTGGATTCATCTCTAGAAACTCCACCTGTGAAGACGTGGATGAATGTGCAGATCCAAGAGCTTGCCCAGAGCATGCAACTTGTAAT
AACACTGTTGGAAACTACTCTTGTTTCTGCAACCCAGGATTTGAATCCAGCAGTGGCCACTTGAGTTTCCAGGGTCTCAAAGCATCGTGT
GAAGATATTGATGAATGCACTGAAATGTGCCCCATCAATTCAACATGCACCAACACTCCTGGGAGCTACTTTTGCACCTGCCACCCTGGC
TTTGCACCAAGCAATGGACAGTTGAATTTCACAGACCAAGGAGTGGAATGTAGAGATATTGATGAGTGCCGCCAAGATCCATCAACCTGT
GGTCCTAATTCTATCTGCACCAATGCCCTGGGCTCCTACAGCTGTGGCTGCATTGCAGGCTTTCATCCCAATCCAGAAGGCTCCCAGAAA
GATGGCAACTTCAGCTGCCAAAGGGTTCTCTTCAAATGTAAGGAAGATGTGATACCCGATAATAAGCAGATCCAGCAATGCCAAGAGGGA
ACCGCAGTGAAACCTGCATATGTCTCCTTTTGTGCACAAATAAATAACATCTTCAGCGTTCTGGACAAAGTGTGTGAAAATAAAACGACC
GTAGTTTCTCTGAAGAATACAACTGAGAGCTTTGTCCCTGTGCTTAAACAAATATCCACGTGGACTAAATTCACCAAGGAAGAGACGTCC
TCCCTGGCCACAGTCTTCCTGGAGAGTGTGGAAAGCATGACACTGGCATCTTTTTGGAAACCCTCAGCAAATATCACTCCGGCTGTTCGG
ACGGAATACTTAGACATTGAGAGCAAAGTTATCAACAAAGAATGCAGTGAAGAGAATGTGACGTTGGACTTGGTAGCCAAGGGGGATAAG
ATGAAGATCGGGTGTTCCACAATTGAGGAATCTGAATCCACAGAGACCACTGGTGTGGCTTTTGTCTCCTTTGTGGGCATGGAATCGGTT
TTAAATGAGCGCTTCTTCAAAGACCACCAGGCTCCCTTGACCACCTCTGAGATCAAGCTGAAGATGAATTCTCGAGTCGTTGGGGGCATA
ATGACTGGAGAGAAGAAAGACGGCTTCTCAGATCCAATCATCTACACTCTGGAGAACATTCAGCCAAAGCAGAAGTTTGAGAGGCCCATC
TGTGTTTCCTGGAGCACTGATGTGAAGGGTGGAAGATGGACATCCTTTGGCTGTGTGATCCTGGAAGCTTCTGAGACATATACCATCTGC
AGCTGTAATCAGATGGCAAATCTTGCCGTTATCATGGCGTCTGGGGAGCTCACGATGGGCTGCGCCATCATCGCGGGCTTCCTGCACTAC
CTTTTCCTTGCCTGCTTCTTCTGGATGCTGGTGGAGGCTGTGATACTGTTCTTGATGGTCAGAAACCTGAAGGTGGTGAATTACTTCAGC
TCTCGCAACATCAAGATGCTGCACATCTGTGCCTTTGGTTATGGGCTGCCGATGCTGGTGGTGGTGATCTCTGCCAGTGTGCAGCCACAG
GGCTATGGAATGCATAATCGCTGCTGGCTGAATACAGAGACAGGGTTCATCTGGAGTTTCTTGGGGCCAGTTTGCACAGTTATAGTGATC
AACTCCCTTCTCCTGACCTGGACCTTGTGGATCCTGAGGCAGAGGCTTTCCAGTGTTAATGCCGAAGTCTCAACGCTAAAAGACACCAGG
TTACTGACCTTCAAGGCCTTTGCCCAGCTCTTCATCCTGGGCTGCTCCTGGGTGCTGGGCATTTTTCAGATTGGACCTGTGGCAGGTGTC
ATGGCTTACCTGTTCACCATCATCAACAGCCTGCAGGGGGCCTTCATCTTCCTCATCCACTGTCTGCTCAACGGCCAGGTACGAGAAGAA
TACAAGAGGTGGATCACTGGGAAGACGAAGCCCAGCTCCCAGTCCCAGACCTCAAGGATCTTGCTGTCCTCCATGCCATCCGCTTCCAAG
ACGGGTTAAAGTCCTTTCTTGCTTTCAAATATGCTATGGAGCCACAGTTGAGGACAGTAGTTTCCTGCAGGAGCCTACCCTGAAATCTCT
TCTCAGCTTAACATGGAAATGAGGATCCCACCAGCCCCAGAACCCTCTGGGGAAGAATGTTGGGGGCGGTCTTCCTGTGGTTGTATGCAC
TGATGAGAAATCAGGCGTTTCTGCTCCAAACGACCATTTTATCTTCGTGCTCTGCAACTTCTTCAATTCCAGAGTTTCTGAGAACAGACC
CAAATTCAATGGCATGACCAAGAACACCTGGCTACCATTTTGTTTTCTCCTGCCCTTGTTGGTGCATGGTTCTAAGCATGCCCCTCCAGA

>23694_23694_11_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000540357_EMR1_chr19_6890492_ENST00000250572_length(amino acids)=807AA_BP=
MHSWEGHIRPTRKPNTKGNNCRDSTLCPAYATCTNTVDSYYCACKQGFLSSNGQNHFKDPGVRCKDIDECSQSPQPCGPNSSCKNLSGRY
KCSCLDGFSSPTGNDWVPGKPGNFSCTDINECLTSSVCPEHSDCVNSMGSYSCSCQVGFISRNSTCEDVDECADPRACPEHATCNNTVGN
YSCFCNPGFESSSGHLSFQGLKASCEDIDECTEMCPINSTCTNTPGSYFCTCHPGFAPSNGQLNFTDQGVECRDIDECRQDPSTCGPNSI
CTNALGSYSCGCIAGFHPNPEGSQKDGNFSCQRVLFKCKEDVIPDNKQIQQCQEGTAVKPAYVSFCAQINNIFSVLDKVCENKTTVVSLK
NTTESFVPVLKQISTWTKFTKEETSSLATVFLESVESMTLASFWKPSANITPAVRTEYLDIESKVINKECSEENVTLDLVAKGDKMKIGC
STIEESESTETTGVAFVSFVGMESVLNERFFKDHQAPLTTSEIKLKMNSRVVGGIMTGEKKDGFSDPIIYTLENIQPKQKFERPICVSWS
TDVKGGRWTSFGCVILEASETYTICSCNQMANLAVIMASGELTMGCAIIAGFLHYLFLACFFWMLVEAVILFLMVRNLKVVNYFSSRNIK
MLHICAFGYGLPMLVVVISASVQPQGYGMHNRCWLNTETGFIWSFLGPVCTVIVINSLLLTWTLWILRQRLSSVNAEVSTLKDTRLLTFK

--------------------------------------------------------------
>23694_23694_12_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000540357_EMR1_chr19_6890492_ENST00000312053_length(transcript)=4684nt_BP=1624nt
GGCTCCGTTCCATCCTTCTGCACAGGGTATCGCCTCTCTCCGTTTGGTACATCCCCTCCTCCCCCACGCCCGGACTGGGGTGGTAGACGC
CGCCTCCGCTCATCGCCCCTCCCCATCGGTTTCCGCGCGAAAAGCCGGGGCGCCTGCGCTGCCGCCGCCGCGTCTGCTGAAGCCTCCGAG
ATGCCGGCGCGTACCGCCCCAGCCCGGGTGCCCACACTGGCCGTCCCGGCCATCTCGCTGCCCGACGATGTCCGCAGGCGGCTCAAAGAT
TTGGAAAGAGACAGCTTAACAGAAAAGGAATGTGTGAAGGAGAAATTGAATCTCTTGCACGAATTTCTGCAAACAGAAATAAAGAATCAG
TTATGTGACTTGGAAACCAAATTACGTAAAGAAGAATTATCCGAGGAGGGCTACCTGGCTAAAGTCAAATCCCTTTTAAATAAAGATTTG
TCCTTGGAGAACGGTGCTCATGCTTACAACCGGGAAGTGAATGGACGTCTAGAAAACGGGAACCAAGCAAGAAGTGAAGCCCGTAGAGTG
GGAATGGCAGATGCCAACAGCCCCCCCAAACCCCTTTCCAAACCTCGCACGCCCAGGAGGAGCAAGTCCGATGGAGAGGCTAAGCCTGAA
CCTTCACCTAGCCCCAGGATTACAAGGAAAAGCACCAGGCAAACCACCATCACATCTCATTTTGCAAAGGGCCCTGCCAAACGGAAACCT
CAGGAAGAGTCTGAAAGAGCCAAATCGGATGAGTCCATCAAGGAAGAAGACAAAGACCAGGATGAGAAGAGACGTAGAGTTACATCCAGA
GAACGAGTTGCTAGACCGCTTCCTGCAGAAGAACCTGAAAGAGCAAAATCAGGAACGCGCACTGAAAAGGAAGAAGAAAGAGATGAAAAA
GAAGAAAAGAGACTCCGAAGTCAAACCAAAGAACCAACACCCAAACAGAAACTGAAGGAGGAGCCGGACAGAGAAGCCAGGGCAGGCGTG
CAGGCTGACGAGGACGAAGATGGAGACGAGAAAGATGAGAAGAAGCACAGAAGTCAACCCAAAGATCTAGCTGCCAAACGGAGGCCCGAA
GAAAAAGAACCTGAAAAAGTAAATCCACAGATTTCTGATGAAAAAGACGAGGATGAAAAGGAGGAGAAGAGACGCAAAACGACCCCCAAA
GAACCAACGGAGAAAAAAATGGCTCGCGCCAAAACAGTCATGAACTCCAAGACCCACCCTCCCAAGTGCATTCAGTGCGGGCAGTACCTG
GACGACCCTGACCTCAAATATGGGCAGCACCCACCAGACGCGGTGGATGAGCCACAGATGCTGACAAATGAGAAGCTGTCCATCTTTGAT
GCCAACGAGTCTGGCTTTGAGAGTTATGAGGCGCTTCCCCAGCACAAACTGACCTGCTTCAGTGTGTACTGTAAGCACGGTCACCTGTGT
CCCATCGACACCGGCCTCATCGAGAAGAATATCGAACTCTTCTTTTCTGGTTCAGCAAAACCAATCTATGATGATGACCCATCTCTTGAA
GGTGGTGTTAATGGCAAAAATCTTGGCCCCATAAATGAATGGTGGATCACTGGCTTTGATGGAGGTGAAAAGGCCCTCATCGGCTTCAGC
ACCTGATGTTGTGTTATGCACAGCTGGGAAGGGCACATAAGACCCACACGGAAACCAAACACAAAGGGTAATAACTGTAGAGACAGTACC
TTGTGCCCAGCTTATGCCACCTGCACCAATACAGTGGACAGTTACTATTGCGCTTGCAAACAAGGCTTCCTGTCCAGCAATGGGCAAAAT
CACTTCAAGGATCCAGGAGTGCGATGCAAAGATATTGATGAATGTTCTCAAAGCCCCCAGCCCTGTGGTCCTAACTCATCCTGCAAAAAC
CTGTCAGGGAGGTACAAGTGCAGCTGTTTAGATGGTTTCTCTTCTCCCACTGGAAATGACTGGGTCCCAGGAAAGCCGGGCAATTTCTCC
TGTACTGATATCAATGAGTGCCTCACCAGCAGCGTCTGCCCTGAGCATTCTGACTGTGTCAACTCCATGGGAAGCTACAGTTGCAGCTGT
CAAGTTGGATTCATCTCTAGAAACTCCACCTGTGAAGACGTGGATGAATGTGCAGATCCAAGAGCTTGCCCAGAGCATGCAACTTGTAAT
AACACTGTTGGAAACTACTCTTGTTTCTGCAACCCAGGATTTGAATCCAGCAGTGGCCACTTGAGTTTCCAGGGTCTCAAAGCATCGTGT
GAAGATATTGATGAATGCACTGAAATGTGCCCCATCAATTCAACATGCACCAACACTCCTGGGAGCTACTTTTGCACCTGCCACCCTGGC
TTTGCACCAAGCAATGGACAGTTGAATTTCACAGACCAAGGAGTGGAATGTAGAGATATTGATGAGTGCCGCCAAGATCCATCAACCTGT
GGTCCTAATTCTATCTGCACCAATGCCCTGGGCTCCTACAGCTGTGGCTGCATTGCAGGCTTTCATCCCAATCCAGAAGGCTCCCAGAAA
GATGGCAACTTCAGCTGCCAAAGGGTTCTCTTCAAATGTAAGGAAGATGTGATACCCGATAATAAGCAGATCCAGCAATGCCAAGAGGGA
ACCGCAGTGAAACCTGCATATGTCTCCTTTTGTGCACAAATAAATAACATCTTCAGCGTTCTGGACAAAGTGTGTGAAAATAAAACGACC
GTAGTTTCTCTGAAGAATACAACTGAGAGCTTTGTCCCTGTGCTTAAACAAATATCCACGTGGACTAAATTCACCAAGGAAGAGACGTCC
TCCCTGGCCACAGTCTTCCTGGAGAGTGTGGAAAGCATGACACTGGCATCTTTTTGGAAACCCTCAGCAAATATCACTCCGGCTGTTCGG
ACGGAATACTTAGACATTGAGAGCAAAGTTATCAACAAAGAATGCAGTGAAGAGAATGTGACGTTGGACTTGGTAGCCAAGGGGGATAAG
ATGAAGATCGGGTGTTCCACAATTGAGGAATCTGAATCCACAGAGACCACTGGTGTGGCTTTTGTCTCCTTTGTGGGCATGGAATCGGTT
TTAAATGAGCGCTTCTTCAAAGACCACCAGGCTCCCTTGACCACCTCTGAGATCAAGCTGAAGATGAATTCTCGAGTCGTTGGGGGCATA
ATGACTGGAGAGAAGAAAGACGGCTTCTCAGATCCAATCATCTACACTCTGGAGAACATTCAGCCAAAGCAGAAGTTTGAGAGGCCCATC
TGTGTTTCCTGGAGCACTGATGTGAAGGGTGGAAGATGGACATCCTTTGGCTGTGTGATCCTGGAAGCTTCTGAGACATATACCATCTGC
AGCTGTAATCAGATGGCAAATCTTGCCGTTATCATGGCGTCTGGGGAGCTCACGATGGACTTTTCCTTGTACATCATTAGCCATGTAGGC
ATTATCATCTCCTTGGTGTGCCTCGTCTTGGCCATCGCCACCTTTCTGCTGTGTCGCTCCATCCGAAATCACAACACCTACCTCCACCTG
CACCTCTGCGTGTGTCTCCTCTTGGCGAAGACTCTCTTCCTCGCCGGTATACACAAGACTGACAACAAGATGGGCTGCGCCATCATCGCG
GGCTTCCTGCACTACCTTTTCCTTGCCTGCTTCTTCTGGATGCTGGTGGAGGCTGTGATACTGTTCTTGATGGTCAGAAACCTGAAGGTG
GTGAATTACTTCAGCTCTCGCAACATCAAGATGCTGCACATCTGTGCCTTTGGTTATGGGCTGCCGATGCTGGTGGTGGTGATCTCTGCC
AGTGTGCAGCCACAGGGCTATGGAATGCATAATCGCTGCTGGCTGAATACAGAGACAGGGTTCATCTGGAGTTTCTTGGGGCCAGTTTGC
ACAGTTATAGTGATCAACTCCCTTCTCCTGACCTGGACCTTGTGGATCCTGAGGCAGAGGCTTTCCAGTGTTAATGCCGAAGTCTCAACG
CTAAAAGACACCAGGTTACTGACCTTCAAGGCCTTTGCCCAGCTCTTCATCCTGGGCTGCTCCTGGGTGCTGGGCATTTTTCAGATTGGA
CCTGTGGCAGGTGTCATGGCTTACCTGTTCACCATCATCAACAGCCTGCAGGGGGCCTTCATCTTCCTCATCCACTGTCTGCTCAACGGC
CAGGTACGAGAAGAATACAAGAGGTGGATCACTGGGAAGACGAAGCCCAGCTCCCAGTCCCAGACCTCAAGGATCTTGCTGTCCTCCATG
CCATCCGCTTCCAAGACGGGTTAAAGTCCTTTCTTGCTTTCAAATATGCTATGGAGCCACAGTTGAGGACAGTAGTTTCCTGCAGGAGCC
TACCCTGAAATCTCTTCTCAGCTTAACATGGAAATGAGGATCCCACCAGCCCCAGAACCCTCTGGGGAAGAATGTTGGGGGCGGTCTTCC
TGTGGTTGTATGCACTGATGAGAAATCAGGCGTTTCTGCTCCAAACGACCATTTTATCTTCGTGCTCTGCAACTTCTTCAATTCCAGAGT
TTCTGAGAACAGACCCAAATTCAATGGCATGACCAAGAACACCTGGCTACCATTTTGTTTTCTCCTGCCCTTGTTGGTGCATGGTTCTAA
GCATGCCCCTCCAGAGCCTATCATACGCCTGATACAGAGAACCTCTCAATAAATGATTTGTCGCCTGTCTGACTGATTTACCCTAGGATA

>23694_23694_12_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000540357_EMR1_chr19_6890492_ENST00000312053_length(amino acids)=872AA_BP=
MHSWEGHIRPTRKPNTKGNNCRDSTLCPAYATCTNTVDSYYCACKQGFLSSNGQNHFKDPGVRCKDIDECSQSPQPCGPNSSCKNLSGRY
KCSCLDGFSSPTGNDWVPGKPGNFSCTDINECLTSSVCPEHSDCVNSMGSYSCSCQVGFISRNSTCEDVDECADPRACPEHATCNNTVGN
YSCFCNPGFESSSGHLSFQGLKASCEDIDECTEMCPINSTCTNTPGSYFCTCHPGFAPSNGQLNFTDQGVECRDIDECRQDPSTCGPNSI
CTNALGSYSCGCIAGFHPNPEGSQKDGNFSCQRVLFKCKEDVIPDNKQIQQCQEGTAVKPAYVSFCAQINNIFSVLDKVCENKTTVVSLK
NTTESFVPVLKQISTWTKFTKEETSSLATVFLESVESMTLASFWKPSANITPAVRTEYLDIESKVINKECSEENVTLDLVAKGDKMKIGC
STIEESESTETTGVAFVSFVGMESVLNERFFKDHQAPLTTSEIKLKMNSRVVGGIMTGEKKDGFSDPIIYTLENIQPKQKFERPICVSWS
TDVKGGRWTSFGCVILEASETYTICSCNQMANLAVIMASGELTMDFSLYIISHVGIIISLVCLVLAIATFLLCRSIRNHNTYLHLHLCVC
LLLAKTLFLAGIHKTDNKMGCAIIAGFLHYLFLACFFWMLVEAVILFLMVRNLKVVNYFSSRNIKMLHICAFGYGLPMLVVVISASVQPQ
GYGMHNRCWLNTETGFIWSFLGPVCTVIVINSLLLTWTLWILRQRLSSVNAEVSTLKDTRLLTFKAFAQLFILGCSWVLGIFQIGPVAGV

--------------------------------------------------------------
>23694_23694_13_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000540357_EMR1_chr19_6890492_ENST00000381404_length(transcript)=4619nt_BP=1624nt
GGCTCCGTTCCATCCTTCTGCACAGGGTATCGCCTCTCTCCGTTTGGTACATCCCCTCCTCCCCCACGCCCGGACTGGGGTGGTAGACGC
CGCCTCCGCTCATCGCCCCTCCCCATCGGTTTCCGCGCGAAAAGCCGGGGCGCCTGCGCTGCCGCCGCCGCGTCTGCTGAAGCCTCCGAG
ATGCCGGCGCGTACCGCCCCAGCCCGGGTGCCCACACTGGCCGTCCCGGCCATCTCGCTGCCCGACGATGTCCGCAGGCGGCTCAAAGAT
TTGGAAAGAGACAGCTTAACAGAAAAGGAATGTGTGAAGGAGAAATTGAATCTCTTGCACGAATTTCTGCAAACAGAAATAAAGAATCAG
TTATGTGACTTGGAAACCAAATTACGTAAAGAAGAATTATCCGAGGAGGGCTACCTGGCTAAAGTCAAATCCCTTTTAAATAAAGATTTG
TCCTTGGAGAACGGTGCTCATGCTTACAACCGGGAAGTGAATGGACGTCTAGAAAACGGGAACCAAGCAAGAAGTGAAGCCCGTAGAGTG
GGAATGGCAGATGCCAACAGCCCCCCCAAACCCCTTTCCAAACCTCGCACGCCCAGGAGGAGCAAGTCCGATGGAGAGGCTAAGCCTGAA
CCTTCACCTAGCCCCAGGATTACAAGGAAAAGCACCAGGCAAACCACCATCACATCTCATTTTGCAAAGGGCCCTGCCAAACGGAAACCT
CAGGAAGAGTCTGAAAGAGCCAAATCGGATGAGTCCATCAAGGAAGAAGACAAAGACCAGGATGAGAAGAGACGTAGAGTTACATCCAGA
GAACGAGTTGCTAGACCGCTTCCTGCAGAAGAACCTGAAAGAGCAAAATCAGGAACGCGCACTGAAAAGGAAGAAGAAAGAGATGAAAAA
GAAGAAAAGAGACTCCGAAGTCAAACCAAAGAACCAACACCCAAACAGAAACTGAAGGAGGAGCCGGACAGAGAAGCCAGGGCAGGCGTG
CAGGCTGACGAGGACGAAGATGGAGACGAGAAAGATGAGAAGAAGCACAGAAGTCAACCCAAAGATCTAGCTGCCAAACGGAGGCCCGAA
GAAAAAGAACCTGAAAAAGTAAATCCACAGATTTCTGATGAAAAAGACGAGGATGAAAAGGAGGAGAAGAGACGCAAAACGACCCCCAAA
GAACCAACGGAGAAAAAAATGGCTCGCGCCAAAACAGTCATGAACTCCAAGACCCACCCTCCCAAGTGCATTCAGTGCGGGCAGTACCTG
GACGACCCTGACCTCAAATATGGGCAGCACCCACCAGACGCGGTGGATGAGCCACAGATGCTGACAAATGAGAAGCTGTCCATCTTTGAT
GCCAACGAGTCTGGCTTTGAGAGTTATGAGGCGCTTCCCCAGCACAAACTGACCTGCTTCAGTGTGTACTGTAAGCACGGTCACCTGTGT
CCCATCGACACCGGCCTCATCGAGAAGAATATCGAACTCTTCTTTTCTGGTTCAGCAAAACCAATCTATGATGATGACCCATCTCTTGAA
GGTGGTGTTAATGGCAAAAATCTTGGCCCCATAAATGAATGGTGGATCACTGGCTTTGATGGAGGTGAAAAGGCCCTCATCGGCTTCAGC
ACCTGATGTTGTGTTATGCACAGCTGGGAAGGGCACATAAGACCCACACGGAAACCAAACACAAAGGGTAATAACTGTAGAGACAGTACC
TTGTGCCCAGCTTATGCCACCTGCACCAATACAGTGGACAGTTACTATTGCGCTTGCAAACAAGGCTTCCTGTCCAGCAATGGGCAAAAT
CACTTCAAGGATCCAGGAGTGCGATGCAAAGATATCAATGAGTGCCTCACCAGCAGCGTCTGCCCTGAGCATTCTGACTGTGTCAACTCC
ATGGGAAGCTACAGTTGCAGCTGTCAAGTTGGATTCATCTCTAGAAACTCCACCTGTGAAGACGTGGATGAATGTGCAGATCCAAGAGCT
TGCCCAGAGCATGCAACTTGTAATAACACTGTTGGAAACTACTCTTGTTTCTGCAACCCAGGATTTGAATCCAGCAGTGGCCACTTGAGT
TTCCAGGGTCTCAAAGCATCGTGTGAAGATATTGATGAATGCACTGAAATGTGCCCCATCAATTCAACATGCACCAACACTCCTGGGAGC
TACTTTTGCACCTGCCACCCTGGCTTTGCACCAAGCAATGGACAGTTGAATTTCACAGACCAAGGAGTGGAATGTAGAGATATTGATGAG
TGCCGCCAAGATCCATCAACCTGTGGTCCTAATTCTATCTGCACCAATGCCCTGGGCTCCTACAGCTGTGGCTGCATTGCAGGCTTTCAT
CCCAATCCAGAAGGCTCCCAGAAAGATGGCAACTTCAGCTGCCAAAGGGTTCTCTTCAAATGTAAGGAAGATGTGATACCCGATAATAAG
CAGATCCAGCAATGCCAAGAGGGAACCGCAGTGAAACCTGCATATGTCTCCTTTTGTGCACAAATAAATAACATCTTCAGCGTTCTGGAC
AAAGTGTGTGAAAATAAAACGACCGTAGTTTCTCTGAAGAATACAACTGAGAGCTTTGTCCCTGTGCTTAAACAAATATCCACGTGGACT
AAATTCACCAAGGAAGAGACGTCCTCCCTGGCCACAGTCTTCCTGGAGAGTGTGGAAAGCATGACACTGGCATCTTTTTGGAAACCCTCA
GCAAATATCACTCCGGCTGTTCGGACGGAATACTTAGACATTGAGAGCAAAGTTATCAACAAAGAATGCAGTGAAGAGAATGTGACGTTG
GACTTGGTAGCCAAGGGGGATAAGATGAAGATCGGGTGTTCCACAATTGAGGAATCTGAATCCACAGAGACCACTGGTGTGGCTTTTGTC
TCCTTTGTGGGCATGGAATCGGTTTTAAATGAGCGCTTCTTCAAAGACCACCAGGCTCCCTTGACCACCTCTGAGATCAAGCTGAAGATG
AATTCTCGAGTCGTTGGGGGCATAATGACTGGAGAGAAGAAAGACGGCTTCTCAGATCCAATCATCTACACTCTGGAGAACATTCAGCCA
AAGCAGAAGTTTGAGAGGCCCATCTGTGTTTCCTGGAGCACTGATGTGAAGGGTGGAAGATGGACATCCTTTGGCTGTGTGATCCTGGAA
GCTTCTGAGACATATACCATCTGCAGCTGTAATCAGATGGCAAATCTTGCCGTTATCATGGCGTCTGGGGAGCTCACGATGGACTTTTCC
TTGTACATCATTAGCCATGTAGGCATTATCATCTCCTTGGTGTGCCTCGTCTTGGCCATCGCCACCTTTCTGCTGTGTCGCTCCATCCGA
AATCACAACACCTACCTCCACCTGCACCTCTGCGTGTGTCTCCTCTTGGCGAAGACTCTCTTCCTCGCCGGTATACACAAGACTGACAAC
AAGATGGGCTGCGCCATCATCGCGGGCTTCCTGCACTACCTTTTCCTTGCCTGCTTCTTCTGGATGCTGGTGGAGGCTGTGATACTGTTC
TTGATGGTCAGAAACCTGAAGGTGGTGAATTACTTCAGCTCTCGCAACATCAAGATGCTGCACATCTGTGCCTTTGGTTATGGGCTGCCG
ATGCTGGTGGTGGTGATCTCTGCCAGTGTGCAGCCACAGGGCTATGGAATGCATAATCGCTGCTGGCTGAATACAGAGACAGGGTTCATC
TGGAGTTTCTTGGGGCCAGTTTGCACAGTTATAGTGGTAAGCAAATACTACAACAGCCTGGCGAAGTGTGTTCTGAAGGAGGAGCAAGGA
GACCTGCGAGATCTGGAATTTCCAGGGACGTGTGCAGCTGAGAGGATCAACTCCCTTCTCCTGACCTGGACCTTGTGGATCCTGAGGCAG
AGGCTTTCCAGTGTTAATGCCGAAGTCTCAACGCTAAAAGACACCAGGTTACTGACCTTCAAGGCCTTTGCCCAGCTCTTCATCCTGGGC
TGCTCCTGGGTGCTGGGCATTTTTCAGATTGGACCTGTGGCAGGTGTCATGGCTTACCTGTTCACCATCATCAACAGCCTGCAGGGGGCC
TTCATCTTCCTCATCCACTGTCTGCTCAACGGCCAGGTACGAGAAGAATACAAGAGGTGGATCACTGGGAAGACGAAGCCCAGCTCCCAG
TCCCAGACCTCAAGGATCTTGCTGTCCTCCATGCCATCCGCTTCCAAGACGGGTTAAAGTCCTTTCTTGCTTTCAAATATGCTATGGAGC
CACAGTTGAGGACAGTAGTTTCCTGCAGGAGCCTACCCTGAAATCTCTTCTCAGCTTAACATGGAAATGAGGATCCCACCAGCCCCAGAA
CCCTCTGGGGAAGAATGTTGGGGGCGGTCTTCCTGTGGTTGTATGCACTGATGAGAAATCAGGCGTTTCTGCTCCAAACGACCATTTTAT
CTTCGTGCTCTGCAACTTCTTCAATTCCAGAGTTTCTGAGAACAGACCCAAATTCAATGGCATGACCAAGAACACCTGGCTACCATTTTG
TTTTCTCCTGCCCTTGTTGGTGCATGGTTCTAAGCATGCCCCTCCAGAGCCTATCATACGCCTGATACAGAGAACCTCTCAATAAATGAT

>23694_23694_13_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000540357_EMR1_chr19_6890492_ENST00000381404_length(amino acids)=853AA_BP=59
MHSWEGHIRPTRKPNTKGNNCRDSTLCPAYATCTNTVDSYYCACKQGFLSSNGQNHFKDPGVRCKDINECLTSSVCPEHSDCVNSMGSYS
CSCQVGFISRNSTCEDVDECADPRACPEHATCNNTVGNYSCFCNPGFESSSGHLSFQGLKASCEDIDECTEMCPINSTCTNTPGSYFCTC
HPGFAPSNGQLNFTDQGVECRDIDECRQDPSTCGPNSICTNALGSYSCGCIAGFHPNPEGSQKDGNFSCQRVLFKCKEDVIPDNKQIQQC
QEGTAVKPAYVSFCAQINNIFSVLDKVCENKTTVVSLKNTTESFVPVLKQISTWTKFTKEETSSLATVFLESVESMTLASFWKPSANITP
AVRTEYLDIESKVINKECSEENVTLDLVAKGDKMKIGCSTIEESESTETTGVAFVSFVGMESVLNERFFKDHQAPLTTSEIKLKMNSRVV
GGIMTGEKKDGFSDPIIYTLENIQPKQKFERPICVSWSTDVKGGRWTSFGCVILEASETYTICSCNQMANLAVIMASGELTMDFSLYIIS
HVGIIISLVCLVLAIATFLLCRSIRNHNTYLHLHLCVCLLLAKTLFLAGIHKTDNKMGCAIIAGFLHYLFLACFFWMLVEAVILFLMVRN
LKVVNYFSSRNIKMLHICAFGYGLPMLVVVISASVQPQGYGMHNRCWLNTETGFIWSFLGPVCTVIVVSKYYNSLAKCVLKEEQGDLRDL
EFPGTCAAERINSLLLTWTLWILRQRLSSVNAEVSTLKDTRLLTFKAFAQLFILGCSWVLGIFQIGPVAGVMAYLFTIINSLQGAFIFLI

--------------------------------------------------------------
>23694_23694_14_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000540357_EMR1_chr19_6890492_ENST00000381407_length(transcript)=4242nt_BP=1624nt
GGCTCCGTTCCATCCTTCTGCACAGGGTATCGCCTCTCTCCGTTTGGTACATCCCCTCCTCCCCCACGCCCGGACTGGGGTGGTAGACGC
CGCCTCCGCTCATCGCCCCTCCCCATCGGTTTCCGCGCGAAAAGCCGGGGCGCCTGCGCTGCCGCCGCCGCGTCTGCTGAAGCCTCCGAG
ATGCCGGCGCGTACCGCCCCAGCCCGGGTGCCCACACTGGCCGTCCCGGCCATCTCGCTGCCCGACGATGTCCGCAGGCGGCTCAAAGAT
TTGGAAAGAGACAGCTTAACAGAAAAGGAATGTGTGAAGGAGAAATTGAATCTCTTGCACGAATTTCTGCAAACAGAAATAAAGAATCAG
TTATGTGACTTGGAAACCAAATTACGTAAAGAAGAATTATCCGAGGAGGGCTACCTGGCTAAAGTCAAATCCCTTTTAAATAAAGATTTG
TCCTTGGAGAACGGTGCTCATGCTTACAACCGGGAAGTGAATGGACGTCTAGAAAACGGGAACCAAGCAAGAAGTGAAGCCCGTAGAGTG
GGAATGGCAGATGCCAACAGCCCCCCCAAACCCCTTTCCAAACCTCGCACGCCCAGGAGGAGCAAGTCCGATGGAGAGGCTAAGCCTGAA
CCTTCACCTAGCCCCAGGATTACAAGGAAAAGCACCAGGCAAACCACCATCACATCTCATTTTGCAAAGGGCCCTGCCAAACGGAAACCT
CAGGAAGAGTCTGAAAGAGCCAAATCGGATGAGTCCATCAAGGAAGAAGACAAAGACCAGGATGAGAAGAGACGTAGAGTTACATCCAGA
GAACGAGTTGCTAGACCGCTTCCTGCAGAAGAACCTGAAAGAGCAAAATCAGGAACGCGCACTGAAAAGGAAGAAGAAAGAGATGAAAAA
GAAGAAAAGAGACTCCGAAGTCAAACCAAAGAACCAACACCCAAACAGAAACTGAAGGAGGAGCCGGACAGAGAAGCCAGGGCAGGCGTG
CAGGCTGACGAGGACGAAGATGGAGACGAGAAAGATGAGAAGAAGCACAGAAGTCAACCCAAAGATCTAGCTGCCAAACGGAGGCCCGAA
GAAAAAGAACCTGAAAAAGTAAATCCACAGATTTCTGATGAAAAAGACGAGGATGAAAAGGAGGAGAAGAGACGCAAAACGACCCCCAAA
GAACCAACGGAGAAAAAAATGGCTCGCGCCAAAACAGTCATGAACTCCAAGACCCACCCTCCCAAGTGCATTCAGTGCGGGCAGTACCTG
GACGACCCTGACCTCAAATATGGGCAGCACCCACCAGACGCGGTGGATGAGCCACAGATGCTGACAAATGAGAAGCTGTCCATCTTTGAT
GCCAACGAGTCTGGCTTTGAGAGTTATGAGGCGCTTCCCCAGCACAAACTGACCTGCTTCAGTGTGTACTGTAAGCACGGTCACCTGTGT
CCCATCGACACCGGCCTCATCGAGAAGAATATCGAACTCTTCTTTTCTGGTTCAGCAAAACCAATCTATGATGATGACCCATCTCTTGAA
GGTGGTGTTAATGGCAAAAATCTTGGCCCCATAAATGAATGGTGGATCACTGGCTTTGATGGAGGTGAAAAGGCCCTCATCGGCTTCAGC
ACCTGATGTTGTGTTATGCACAGCTGGGAAGGGCACATAAGACCCACACGGAAACCAAACACAAAGGGTAATAACTGTAGAGACAGTACC
TTGTGCCCAGCTTATGCCACCTGCACCAATACAGTGGACAGTTACTATTGCGCTTGCAAACAAGGCTTCCTGTCCAGCAATGGGCAAAAT
CACTTCAAGGATCCAGGAGTGCGATGCAAAGATATTGATGAATGCACTGAAATGTGCCCCATCAATTCAACATGCACCAACACTCCTGGG
AGCTACTTTTGCACCTGCCACCCTGGCTTTGCACCAAGCAATGGACAGTTGAATTTCACAGACCAAGGAGTGGAATGTAGAGATATTGAT
GAGTGCCGCCAAGATCCATCAACCTGTGGTCCTAATTCTATCTGCACCAATGCCCTGGGCTCCTACAGCTGTGGCTGCATTGCAGGCTTT
CATCCCAATCCAGAAGGCTCCCAGAAAGATGGCAACTTCAGCTGCCAAAGGGTTCTCTTCAAATGTAAGGAAGATGTGATACCCGATAAT
AAGCAGATCCAGCAATGCCAAGAGGGAACCGCAGTGAAACCTGCATATGTCTCCTTTTGTGCACAAATAAATAACATCTTCAGCGTTCTG
GACAAAGTGTGTGAAAATAAAACGACCGTAGTTTCTCTGAAGAATACAACTGAGAGCTTTGTCCCTGTGCTTAAACAAATATCCACGTGG
ACTAAATTCACCAAGGAAGAGACGTCCTCCCTGGCCACAGTCTTCCTGGAGAGTGTGGAAAGCATGACACTGGCATCTTTTTGGAAACCC
TCAGCAAATATCACTCCGGCTGTTCGGACGGAATACTTAGACATTGAGAGCAAAGTTATCAACAAAGAATGCAGTGAAGAGAATGTGACG
TTGGACTTGGTAGCCAAGGGGGATAAGATGAAGATCGGGTGTTCCACAATTGAGGAATCTGAATCCACAGAGACCACTGGTGTGGCTTTT
GTCTCCTTTGTGGGCATGGAATCGGTTTTAAATGAGCGCTTCTTCAAAGACCACCAGGCTCCCTTGACCACCTCTGAGATCAAGCTGAAG
ATGAATTCTCGAGTCGTTGGGGGCATAATGACTGGAGAGAAGAAAGACGGCTTCTCAGATCCAATCATCTACACTCTGGAGAACATTCAG
CCAAAGCAGAAGTTTGAGAGGCCCATCTGTGTTTCCTGGAGCACTGATGTGAAGGGTGGAAGATGGACATCCTTTGGCTGTGTGATCCTG
GAAGCTTCTGAGACATATACCATCTGCAGCTGTAATCAGATGGCAAATCTTGCCGTTATCATGGCGTCTGGGGAGCTCACGATGGACTTT
TCCTTGTACATCATTAGCCATGTAGGCATTATCATCTCCTTGGTGTGCCTCGTCTTGGCCATCGCCACCTTTCTGCTGTGTCGCTCCATC
CGAAATCACAACACCTACCTCCACCTGCACCTCTGCGTGTGTCTCCTCTTGGCGAAGACTCTCTTCCTCGCCGGTATACACAAGACTGAC
AACAAGATGGGCTGCGCCATCATCGCGGGCTTCCTGCACTACCTTTTCCTTGCCTGCTTCTTCTGGATGCTGGTGGAGGCTGTGATACTG
TTCTTGATGGTCAGAAACCTGAAGGTGGTGAATTACTTCAGCTCTCGCAACATCAAGATGCTGCACATCTGTGCCTTTGGTTATGGGCTG
CCGATGCTGGTGGTGGTGATCTCTGCCAGTGTGCAGCCACAGGGCTATGGAATGCATAATCGCTGCTGGCTGAATACAGAGACAGGGTTC
ATCTGGAGTTTCTTGGGGCCAGTTTGCACAGTTATAGTGATCAACTCCCTTCTCCTGACCTGGACCTTGTGGATCCTGAGGCAGAGGCTT
TCCAGTGTTAATGCCGAAGTCTCAACGCTAAAAGACACCAGGTTACTGACCTTCAAGGCCTTTGCCCAGCTCTTCATCCTGGGCTGCTCC
TGGGTGCTGGGCATTTTTCAGATTGGACCTGTGGCAGGTGTCATGGCTTACCTGTTCACCATCATCAACAGCCTGCAGGGGGCCTTCATC
TTCCTCATCCACTGTCTGCTCAACGGCCAGGTACGAGAAGAATACAAGAGGTGGATCACTGGGAAGACGAAGCCCAGCTCCCAGTCCCAG
ACCTCAAGGATCTTGCTGTCCTCCATGCCATCCGCTTCCAAGACGGGTTAAAGTCCTTTCTTGCTTTCAAATATGCTATGGAGCCACAGT
TGAGGACAGTAGTTTCCTGCAGGAGCCTACCCTGAAATCTCTTCTCAGCTTAACATGGAAATGAGGATCCCACCAGCCCCAGAACCCTCT
GGGGAAGAATGTTGGGGGCGGTCTTCCTGTGGTTGTATGCACTGATGAGAAATCAGGCGTTTCTGCTCCAAACGACCATTTTATCTTCGT
GCTCTGCAACTTCTTCAATTCCAGAGTTTCTGAGAACAGACCCAAATTCAATGGCATGACCAAGAACACCTGGCTACCATTTTGTTTTCT
CCTGCCCTTGTTGGTGCATGGTTCTAAGCATGCCCCTCCAGAGCCTATCATACGCCTGATACAGAGAACCTCTCAATAAATGATTTGTCG

>23694_23694_14_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000540357_EMR1_chr19_6890492_ENST00000381407_length(amino acids)=731AA_BP=
MHSWEGHIRPTRKPNTKGNNCRDSTLCPAYATCTNTVDSYYCACKQGFLSSNGQNHFKDPGVRCKDIDECTEMCPINSTCTNTPGSYFCT
CHPGFAPSNGQLNFTDQGVECRDIDECRQDPSTCGPNSICTNALGSYSCGCIAGFHPNPEGSQKDGNFSCQRVLFKCKEDVIPDNKQIQQ
CQEGTAVKPAYVSFCAQINNIFSVLDKVCENKTTVVSLKNTTESFVPVLKQISTWTKFTKEETSSLATVFLESVESMTLASFWKPSANIT
PAVRTEYLDIESKVINKECSEENVTLDLVAKGDKMKIGCSTIEESESTETTGVAFVSFVGMESVLNERFFKDHQAPLTTSEIKLKMNSRV
VGGIMTGEKKDGFSDPIIYTLENIQPKQKFERPICVSWSTDVKGGRWTSFGCVILEASETYTICSCNQMANLAVIMASGELTMDFSLYII
SHVGIIISLVCLVLAIATFLLCRSIRNHNTYLHLHLCVCLLLAKTLFLAGIHKTDNKMGCAIIAGFLHYLFLACFFWMLVEAVILFLMVR
NLKVVNYFSSRNIKMLHICAFGYGLPMLVVVISASVQPQGYGMHNRCWLNTETGFIWSFLGPVCTVIVINSLLLTWTLWILRQRLSSVNA
EVSTLKDTRLLTFKAFAQLFILGCSWVLGIFQIGPVAGVMAYLFTIINSLQGAFIFLIHCLLNGQVREEYKRWITGKTKPSSQSQTSRIL

--------------------------------------------------------------
>23694_23694_15_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000540357_EMR1_chr19_6890492_ENST00000450315_length(transcript)=4128nt_BP=1624nt
GGCTCCGTTCCATCCTTCTGCACAGGGTATCGCCTCTCTCCGTTTGGTACATCCCCTCCTCCCCCACGCCCGGACTGGGGTGGTAGACGC
CGCCTCCGCTCATCGCCCCTCCCCATCGGTTTCCGCGCGAAAAGCCGGGGCGCCTGCGCTGCCGCCGCCGCGTCTGCTGAAGCCTCCGAG
ATGCCGGCGCGTACCGCCCCAGCCCGGGTGCCCACACTGGCCGTCCCGGCCATCTCGCTGCCCGACGATGTCCGCAGGCGGCTCAAAGAT
TTGGAAAGAGACAGCTTAACAGAAAAGGAATGTGTGAAGGAGAAATTGAATCTCTTGCACGAATTTCTGCAAACAGAAATAAAGAATCAG
TTATGTGACTTGGAAACCAAATTACGTAAAGAAGAATTATCCGAGGAGGGCTACCTGGCTAAAGTCAAATCCCTTTTAAATAAAGATTTG
TCCTTGGAGAACGGTGCTCATGCTTACAACCGGGAAGTGAATGGACGTCTAGAAAACGGGAACCAAGCAAGAAGTGAAGCCCGTAGAGTG
GGAATGGCAGATGCCAACAGCCCCCCCAAACCCCTTTCCAAACCTCGCACGCCCAGGAGGAGCAAGTCCGATGGAGAGGCTAAGCCTGAA
CCTTCACCTAGCCCCAGGATTACAAGGAAAAGCACCAGGCAAACCACCATCACATCTCATTTTGCAAAGGGCCCTGCCAAACGGAAACCT
CAGGAAGAGTCTGAAAGAGCCAAATCGGATGAGTCCATCAAGGAAGAAGACAAAGACCAGGATGAGAAGAGACGTAGAGTTACATCCAGA
GAACGAGTTGCTAGACCGCTTCCTGCAGAAGAACCTGAAAGAGCAAAATCAGGAACGCGCACTGAAAAGGAAGAAGAAAGAGATGAAAAA
GAAGAAAAGAGACTCCGAAGTCAAACCAAAGAACCAACACCCAAACAGAAACTGAAGGAGGAGCCGGACAGAGAAGCCAGGGCAGGCGTG
CAGGCTGACGAGGACGAAGATGGAGACGAGAAAGATGAGAAGAAGCACAGAAGTCAACCCAAAGATCTAGCTGCCAAACGGAGGCCCGAA
GAAAAAGAACCTGAAAAAGTAAATCCACAGATTTCTGATGAAAAAGACGAGGATGAAAAGGAGGAGAAGAGACGCAAAACGACCCCCAAA
GAACCAACGGAGAAAAAAATGGCTCGCGCCAAAACAGTCATGAACTCCAAGACCCACCCTCCCAAGTGCATTCAGTGCGGGCAGTACCTG
GACGACCCTGACCTCAAATATGGGCAGCACCCACCAGACGCGGTGGATGAGCCACAGATGCTGACAAATGAGAAGCTGTCCATCTTTGAT
GCCAACGAGTCTGGCTTTGAGAGTTATGAGGCGCTTCCCCAGCACAAACTGACCTGCTTCAGTGTGTACTGTAAGCACGGTCACCTGTGT
CCCATCGACACCGGCCTCATCGAGAAGAATATCGAACTCTTCTTTTCTGGTTCAGCAAAACCAATCTATGATGATGACCCATCTCTTGAA
GGTGGTGTTAATGGCAAAAATCTTGGCCCCATAAATGAATGGTGGATCACTGGCTTTGATGGAGGTGAAAAGGCCCTCATCGGCTTCAGC
ACCTGATGTTGTGTTATGCACAGCTGGGAAGGGCACATAAGACCCACACGGAAACCAAACACAAAGGGTAATAACTGTAGAGACAGTACC
TTGTGCCCAGCTTATGCCACCTGCACCAATACAGTGGACAGTTACTATTGCGCTTGCAAACAAGGCTTCCTGTCCAGCAATGGGCAAAAT
CACTTCAAGGATCCAGGAGTGCGATGCAAAGATATTGATGAATGTTCTCAAAGCCCCCAGCCCTGTGGTCCTAACTCATCCTGCAAAAAC
CTGTCAGGGAGGTACAAGTGCAGCTGTTTAGATGGTTTCTCTTCTCCCACTGGAAATGACTGGGTCCCAGGAAAGCCGGGCAATTTCTCC
TGTACTGATATCAATGAGTGCCTCACCAGCAGGGTTCTCTTCAAATGTAAGGAAGATGTGATACCCGATAATAAGCAGATCCAGCAATGC
CAAGAGGGAACCGCAGTGAAACCTGCATATGTCTCCTTTTGTGCACAAATAAATAACATCTTCAGCGTTCTGGACAAAGTGTGTGAAAAT
AAAACGACCGTAGTTTCTCTGAAGAATACAACTGAGAGCTTTGTCCCTGTGCTTAAACAAATATCCACGTGGACTAAATTCACCAAGGAA
GAGACGTCCTCCCTGGCCACAGTCTTCCTGGAGAGTGTGGAAAGCATGACACTGGCATCTTTTTGGAAACCCTCAGCAAATATCACTCCG
GCTGTTCGGACGGAATACTTAGACATTGAGAGCAAAGTTATCAACAAAGAATGCAGTGAAGAGAATGTGACGTTGGACTTGGTAGCCAAG
GGGGATAAGATGAAGATCGGGTGTTCCACAATTGAGGAATCTGAATCCACAGAGACCACTGGTGTGGCTTTTGTCTCCTTTGTGGGCATG
GAATCGGTTTTAAATGAGCGCTTCTTCAAAGACCACCAGGCTCCCTTGACCACCTCTGAGATCAAGCTGAAGATGAATTCTCGAGTCGTT
GGGGGCATAATGACTGGAGAGAAGAAAGACGGCTTCTCAGATCCAATCATCTACACTCTGGAGAACATTCAGCCAAAGCAGAAGTTTGAG
AGGCCCATCTGTGTTTCCTGGAGCACTGATGTGAAGGGTGGAAGATGGACATCCTTTGGCTGTGTGATCCTGGAAGCTTCTGAGACATAT
ACCATCTGCAGCTGTAATCAGATGGCAAATCTTGCCGTTATCATGGCGTCTGGGGAGCTCACGATGGACTTTTCCTTGTACATCATTAGC
CATGTAGGCATTATCATCTCCTTGGTGTGCCTCGTCTTGGCCATCGCCACCTTTCTGCTGTGTCGCTCCATCCGAAATCACAACACCTAC
CTCCACCTGCACCTCTGCGTGTGTCTCCTCTTGGCGAAGACTCTCTTCCTCGCCGGTATACACAAGACTGACAACAAGATGGGCTGCGCC
ATCATCGCGGGCTTCCTGCACTACCTTTTCCTTGCCTGCTTCTTCTGGATGCTGGTGGAGGCTGTGATACTGTTCTTGATGGTCAGAAAC
CTGAAGGTGGTGAATTACTTCAGCTCTCGCAACATCAAGATGCTGCACATCTGTGCCTTTGGTTATGGGCTGCCGATGCTGGTGGTGGTG
ATCTCTGCCAGTGTGCAGCCACAGGGCTATGGAATGCATAATCGCTGCTGGCTGAATACAGAGACAGGGTTCATCTGGAGTTTCTTGGGG
CCAGTTTGCACAGTTATAGTGATCAACTCCCTTCTCCTGACCTGGACCTTGTGGATCCTGAGGCAGAGGCTTTCCAGTGTTAATGCCGAA
GTCTCAACGCTAAAAGACACCAGGTTACTGACCTTCAAGGCCTTTGCCCAGCTCTTCATCCTGGGCTGCTCCTGGGTGCTGGGCATTTTT
CAGATTGGACCTGTGGCAGGTGTCATGGCTTACCTGTTCACCATCATCAACAGCCTGCAGGGGGCCTTCATCTTCCTCATCCACTGTCTG
CTCAACGGCCAGGTACGAGAAGAATACAAGAGGTGGATCACTGGGAAGACGAAGCCCAGCTCCCAGTCCCAGACCTCAAGGATCTTGCTG
TCCTCCATGCCATCCGCTTCCAAGACGGGTTAAAGTCCTTTCTTGCTTTCAAATATGCTATGGAGCCACAGTTGAGGACAGTAGTTTCCT
GCAGGAGCCTACCCTGAAATCTCTTCTCAGCTTAACATGGAAATGAGGATCCCACCAGCCCCAGAACCCTCTGGGGAAGAATGTTGGGGG
CGGTCTTCCTGTGGTTGTATGCACTGATGAGAAATCAGGCGTTTCTGCTCCAAACGACCATTTTATCTTCGTGCTCTGCAACTTCTTCAA
TTCCAGAGTTTCTGAGAACAGACCCAAATTCAATGGCATGACCAAGAACACCTGGCTACCATTTTGTTTTCTCCTGCCCTTGTTGGTGCA

>23694_23694_15_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000540357_EMR1_chr19_6890492_ENST00000450315_length(amino acids)=695AA_BP=
MHSWEGHIRPTRKPNTKGNNCRDSTLCPAYATCTNTVDSYYCACKQGFLSSNGQNHFKDPGVRCKDIDECSQSPQPCGPNSSCKNLSGRY
KCSCLDGFSSPTGNDWVPGKPGNFSCTDINECLTSRVLFKCKEDVIPDNKQIQQCQEGTAVKPAYVSFCAQINNIFSVLDKVCENKTTVV
SLKNTTESFVPVLKQISTWTKFTKEETSSLATVFLESVESMTLASFWKPSANITPAVRTEYLDIESKVINKECSEENVTLDLVAKGDKMK
IGCSTIEESESTETTGVAFVSFVGMESVLNERFFKDHQAPLTTSEIKLKMNSRVVGGIMTGEKKDGFSDPIIYTLENIQPKQKFERPICV
SWSTDVKGGRWTSFGCVILEASETYTICSCNQMANLAVIMASGELTMDFSLYIISHVGIIISLVCLVLAIATFLLCRSIRNHNTYLHLHL
CVCLLLAKTLFLAGIHKTDNKMGCAIIAGFLHYLFLACFFWMLVEAVILFLMVRNLKVVNYFSSRNIKMLHICAFGYGLPMLVVVISASV
QPQGYGMHNRCWLNTETGFIWSFLGPVCTVIVINSLLLTWTLWILRQRLSSVNAEVSTLKDTRLLTFKAFAQLFILGCSWVLGIFQIGPV

--------------------------------------------------------------
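
The record headers in the sequence listing above appear to follow one underscore-delimited layout: fusion ID, record index, fusion name, then gene, chromosome, breakpoint, and Ensembl transcript for each partner, followed by a length and a `BP` field. The sketch below is one way to pull those fields apart. The regular expression and the name `parse_header` are illustrative assumptions inferred from the headers shown here, not a format documented by the database. It assumes gene symbols contain no underscores, and it keeps an empty `BP=` as `None` rather than guessing a value.

```python
import re
from typing import Optional

# Assumed layout, inferred from the headers on this page only.
HEADER_RE = re.compile(
    r"^>(?P<fusion_id>\d+)_\d+_(?P<index>\d+)_(?P<fusion>[^_]+)"
    r"_(?P<hgene>[^_]+)_(?P<hchr>chr[^_]+)_(?P<hbp>\d+)_(?P<henst>ENST\d+)"
    r"_(?P<tgene>[^_]+)_(?P<tchr>chr[^_]+)_(?P<tbp>\d+)_(?P<tenst>ENST\d+)"
    r"_length\((?P<kind>transcript|amino acids)\)=(?P<length>\d+)(?:nt|AA)"
    r"_BP=(?P<bp>\d*)(?:nt)?$"
)


def parse_header(line: str) -> Optional[dict]:
    """Split one sequence header into named fields; return None if it does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        return None
    record = match.groupdict()
    # Convert the numeric fields; genomic breakpoints stay as plain integers.
    for key in ("index", "hbp", "tbp", "length"):
        record[key] = int(record[key])
    # Some amino-acid headers end in a bare "BP="; keep that gap as None.
    record["bp"] = int(record["bp"]) if record["bp"] else None
    return record


transcript = parse_header(
    ">23694_23694_13_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000540357"
    "_EMR1_chr19_6890492_ENST00000381404_length(transcript)=4619nt_BP=1624nt"
)
protein = parse_header(
    ">23694_23694_12_DNMT1-EMR1_DNMT1_chr19_10266529_ENST00000540357"
    "_EMR1_chr19_6890492_ENST00000312053_length(amino acids)=872AA_BP="
)
```

Here `transcript["length"]` and `transcript["bp"]` carry the values from the header text, while `protein["bp"]` is `None` because that header leaves the field empty.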


Fusion Gene PPI Analysis for DNMT1-EMR1


Go to ChiPPI (Chimeric Protein-Protein Interactions) to see the chimeric PPI interaction on the ChiPPI page.


Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
Hgene  Hgene's interactors  Tgene  Tgene's interactors


- Retained PPIs in in-frame fusion.
Partner  Gene   Hbp             Tbp            ENST             Strand  BPexon  TotalExon  Protein feature loci  *BPloci            TotalLen  Still interaction with
Hgene    DNMT1  chr19:10266529  chr19:6890492  ENST00000340748  -       18      40         1_120                 481.3333333333333  1617.0    DMAP1
Hgene    DNMT1  chr19:10266529  chr19:6890492  ENST00000359526  -       19      41         1_120                 497.3333333333333  1633.0    DMAP1
Hgene    DNMT1  chr19:10266529  chr19:6890492  ENST00000340748  -       18      40         1_148                 481.3333333333333  1617.0    DNMT3A
Hgene    DNMT1  chr19:10266529  chr19:6890492  ENST00000359526  -       19      41         1_148                 497.3333333333333  1633.0    DNMT3A
Hgene    DNMT1  chr19:10266529  chr19:6890492  ENST00000340748  -       18      40         149_217               481.3333333333333  1617.0    DNMT3B
Hgene    DNMT1  chr19:10266529  chr19:6890492  ENST00000359526  -       19      41         149_217               497.3333333333333  1633.0    DNMT3B
Hgene    DNMT1  chr19:10266529  chr19:6890492  ENST00000340748  -       18      40         163_174               481.3333333333333  1617.0    PCNA
Hgene    DNMT1  chr19:10266529  chr19:6890492  ENST00000359526  -       19      41         163_174               497.3333333333333  1633.0    PCNA


- Lost PPIs in in-frame fusion.
Partner  Gene   Hbp             Tbp            ENST             Strand  BPexon  TotalExon  Protein feature loci  *BPloci            TotalLen  Interaction lost with
Hgene    DNMT1  chr19:10266529  chr19:6890492  ENST00000340748  -       18      40         1121_1616             481.3333333333333  1617.0    the PRC2/EED-EZH2 complex
Hgene    DNMT1  chr19:10266529  chr19:6890492  ENST00000340748  -       18      40         308_606               481.3333333333333  1617.0    the PRC2/EED-EZH2 complex
Hgene    DNMT1  chr19:10266529  chr19:6890492  ENST00000359526  -       19      41         1121_1616             497.3333333333333  1633.0    the PRC2/EED-EZH2 complex
Hgene    DNMT1  chr19:10266529  chr19:6890492  ENST00000359526  -       19      41         308_606               497.3333333333333  1633.0    the PRC2/EED-EZH2 complex
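
The split between the retained and lost tables above is consistent with a simple rule for the 5' partner (Hgene): an interaction region is kept when it ends before the breakpoint locus (BPloci) and lost when it extends past it. The page does not state this rule, so the sketch below is an assumption that happens to reproduce the rows listed for ENST00000340748. The names `Feature` and `is_retained_in_hgene` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Feature:
    partner: str  # interactor named in the table
    start: int    # first residue of the protein feature
    end: int      # last residue of the protein feature


def is_retained_in_hgene(feature: Feature, bp_locus: float) -> bool:
    """Assumed rule: a 5'-partner feature survives only if it ends before the breakpoint."""
    return feature.end <= bp_locus


# Protein feature loci and BPloci copied from the ENST00000340748 rows above.
bp_locus = 481.3333333333333
features = [
    Feature("DMAP1", 1, 120),
    Feature("DNMT3A", 1, 148),
    Feature("DNMT3B", 149, 217),
    Feature("PCNA", 163, 174),
    Feature("the PRC2/EED-EZH2 complex", 308, 606),
    Feature("the PRC2/EED-EZH2 complex", 1121, 1616),
]

retained = [f.partner for f in features if is_retained_in_hgene(f, bp_locus)]
lost = [f.partner for f in features if not is_retained_in_hgene(f, bp_locus)]
```

Under this rule the 308_606 region is classified as lost because it spans the breakpoint, which matches its placement in the lost table. For a 3' partner (Tgene) the comparison would presumably be mirrored, but this page lists no Tgene rows to check that against.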


- Retained PPIs, but lost function due to frame-shift fusion.
Partner  Gene  Hbp  Tbp  ENST  Strand  BPexon  TotalExon  Protein feature loci  *BPloci  TotalLen  Interaction lost with



Related Drugs for DNMT1-EMR1


Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
Partner  Gene  UniProtAcc  DrugBank ID  Drug name  Drug activity  Drug type  Drug status


Related Diseases for DNMT1-EMR1


Diseases associated with fusion partners.
(DisGeNet 4.0)
Partner  Gene  Disease ID  Disease name  # pubmeds  Source