FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:DNMT1-OLFM2 (FusionGDB2 ID:23699)

Fusion Gene Summary for DNMT1-OLFM2

check button Fusion gene summary
Fusion gene informationFusion gene name: DNMT1-OLFM2
Fusion gene ID: 23699
HgeneTgene
Gene symbol

DNMT1

OLFM2

Gene ID

1786

93145

Gene nameDNA methyltransferase 1olfactomedin 2
SynonymsADCADN|AIM|CXXC9|DNMT|HSN1E|MCMT|m.HsaINOE2|NOELIN2|NOELIN2_V1|OlfC
Cytomap

19p13.2

19p13.2

Type of geneprotein-codingprotein-coding
DescriptionDNA (cytosine-5)-methyltransferase 1CXXC-type zinc finger protein 9DNA (cytosine-5-)-methyltransferase 1DNA MTase HsaIDNA methyltransferase HsaInoelin-2neuronal olfactomedin related ER localized protein 2
Modification date2020031320200313
UniProtAcc

P26358

.
Ensembl transtripts involved in fusion geneENST00000340748, ENST00000359526, 
ENST00000540357, ENST00000589538, 
ENST00000590841, ENST00000264833, 
Fusion gene scores* DoF score14 X 12 X 11=18489 X 5 X 6=270
# samples 1710
** MAII scorelog2(17/1848*10)=-3.44235810527836
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(10/270*10)=-1.43295940727611
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: DNMT1 [Title/Abstract] AND OLFM2 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointDNMT1(10341880)-OLFM2(9971470), # samples:2
DNMT1(10246412)-OLFM2(9968537), # samples:2
Anticipated loss of major functional domain due to fusion event.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
HgeneDNMT1

GO:0010216

maintenance of DNA methylation

18754681|21745816

TgeneOLFM2

GO:0009306

protein secretion

21228389


check buttonFusion gene breakpoints across DNMT1 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check buttonFusion gene breakpoints across OLFM2 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4BRCATCGA-B6-A0WY-01ADNMT1chr19

10341880

-OLFM2chr19

9971470

-
ChimerDB4ESCATCGA-L5-A43I-01ADNMT1chr19

10246412

-OLFM2chr19

9968537

-
ChimerDB4ESCATCGA-L5-A43IDNMT1chr19

10246412

-OLFM2chr19

9968537

-


Top

Fusion Gene ORF analysis for DNMT1-OLFM2

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
5CDS-5UTRENST00000340748ENST00000590841DNMT1chr19

10246412

-OLFM2chr19

9968537

-
5CDS-5UTRENST00000359526ENST00000590841DNMT1chr19

10246412

-OLFM2chr19

9968537

-
5CDS-5UTRENST00000540357ENST00000590841DNMT1chr19

10246412

-OLFM2chr19

9968537

-
In-frameENST00000340748ENST00000264833DNMT1chr19

10246412

-OLFM2chr19

9968537

-
In-frameENST00000359526ENST00000264833DNMT1chr19

10246412

-OLFM2chr19

9968537

-
In-frameENST00000540357ENST00000264833DNMT1chr19

10246412

-OLFM2chr19

9968537

-
intron-3CDSENST00000340748ENST00000264833DNMT1chr19

10341880

-OLFM2chr19

9971470

-
intron-3CDSENST00000359526ENST00000264833DNMT1chr19

10341880

-OLFM2chr19

9971470

-
intron-3CDSENST00000540357ENST00000264833DNMT1chr19

10341880

-OLFM2chr19

9971470

-
intron-3CDSENST00000589538ENST00000264833DNMT1chr19

10341880

-OLFM2chr19

9971470

-
intron-3CDSENST00000589538ENST00000264833DNMT1chr19

10246412

-OLFM2chr19

9968537

-
intron-5UTRENST00000589538ENST00000590841DNMT1chr19

10246412

-OLFM2chr19

9968537

-
intron-intronENST00000340748ENST00000590841DNMT1chr19

10341880

-OLFM2chr19

9971470

-
intron-intronENST00000359526ENST00000590841DNMT1chr19

10341880

-OLFM2chr19

9971470

-
intron-intronENST00000540357ENST00000590841DNMT1chr19

10341880

-OLFM2chr19

9971470

-
intron-intronENST00000589538ENST00000590841DNMT1chr19

10341880

-OLFM2chr19

9971470

-

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000340748DNMT1chr1910246412-ENST00000264833OLFM2chr199968537-658149614461122022
ENST00000540357DNMT1chr1910246412-ENST00000264833OLFM2chr199968537-6534491418060651961
ENST00000359526DNMT1chr1910246412-ENST00000264833OLFM2chr199968537-6573495318061041974

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000340748ENST00000264833DNMT1chr1910246412-OLFM2chr199968537-0.001721980.998278
ENST00000540357ENST00000264833DNMT1chr1910246412-OLFM2chr199968537-0.0010637520.99893624
ENST00000359526ENST00000264833DNMT1chr1910246412-OLFM2chr199968537-0.0019453720.9980546

Top

Fusion Genomic Features for DNMT1-OLFM2


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.

Top

Fusion Protein Features for DNMT1-OLFM2


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr19:10341880/chr19:9971470)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
DNMT1

P26358

.
FUNCTION: Methylates CpG residues. Preferentially methylates hemimethylated DNA. Associates with DNA replication sites in S phase maintaining the methylation pattern in the newly synthesized strand, that is essential for epigenetic inheritance. Associates with chromatin during G2 and M phases to maintain DNA methylation independently of replication. It is responsible for maintaining methylation patterns established in development. DNA methylation is coordinated with methylation of histones. Mediates transcriptional repression by direct binding to HDAC2. In association with DNMT3B and via the recruitment of CTCFL/BORIS, involved in activation of BAG1 gene expression by modulating dimethylation of promoter histone H3 at H3K4 and H3K9. Probably forms a corepressor complex required for activated KRAS-mediated promoter hypermethylation and transcriptional silencing of tumor suppressor genes (TSGs) or other tumor-related genes in colorectal cancer (CRC) cells (PubMed:24623306). Also required to maintain a transcriptionally repressive state of genes in undifferentiated embryonic stem cells (ESCs) (PubMed:24623306). Associates at promoter regions of tumor suppressor genes (TSGs) leading to their gene silencing (PubMed:24623306). Promotes tumor growth (PubMed:24623306). {ECO:0000269|PubMed:16357870, ECO:0000269|PubMed:18413740, ECO:0000269|PubMed:18754681, ECO:0000269|PubMed:24623306}.FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-384016_10915751617.0DomainDMAP1-binding
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-3840755_88015751617.0DomainBAH 1
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-3840972_110015751617.0DomainBAH 2
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-394116_10915911633.0DomainDMAP1-binding
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-3941755_88015911633.0DomainBAH 1
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-3941972_110015911633.0DomainBAH 2
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-3840177_20515751617.0MotifNuclear localization signal
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-3941177_20515911633.0MotifNuclear localization signal
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-38401109_112015751617.0RegionNote=6 X 2 AA tandem repeats of K-G
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-38401150_115115751617.0RegionS-adenosyl-L-methionine binding
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-38401168_116915751617.0RegionS-adenosyl-L-methionine binding
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-38401190_119115751617.0RegionS-adenosyl-L-methionine binding
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-3840310_50215751617.0RegionNote=Homodimerization
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-3840331_55015751617.0RegionDNA replication foci-targeting sequence
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-3840651_69715751617.0RegionNote=Required for activity
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-3840693_75415751617.0RegionNote=Autoinhibitory linker
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-39411109_112015911633.0RegionNote=6 X 2 AA tandem repeats of K-G
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-39411150_115115911633.0RegionS-adenosyl-L-methionine binding
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-39411168_116915911633.0RegionS-adenosyl-L-methionine binding
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-39411190_119115911633.0RegionS-adenosyl-L-methionine binding
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-3941310_50215911633.0RegionNote=Homodimerization
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-3941331_55015911633.0RegionDNA replication foci-targeting sequence
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-3941651_69715911633.0RegionNote=Required for activity
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-3941693_75415911633.0RegionNote=Autoinhibitory linker
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-38401109_111015751617.0RepeatNote=1
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-38401111_111215751617.0RepeatNote=2
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-38401113_111415751617.0RepeatNote=3
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-38401115_111615751617.0RepeatNote=4
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-38401117_111815751617.0RepeatNote=5
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-38401119_112015751617.0RepeatNote=6%3B approximate
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-39411109_111015911633.0RepeatNote=1
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-39411111_111215911633.0RepeatNote=2
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-39411113_111415911633.0RepeatNote=3
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-39411115_111615911633.0RepeatNote=4
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-39411117_111815911633.0RepeatNote=5
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-39411119_112015911633.0RepeatNote=6%3B approximate
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-3840646_69215751617.0Zinc fingerCXXC-type
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-3941646_69215911633.0Zinc fingerCXXC-type
TgeneOLFM2chr19:10246412chr19:9968537ENST0000026483316136_19371455.0Coiled coilOntology_term=ECO:0000255
TgeneOLFM2chr19:10246412chr19:9968537ENST0000026483316194_44671455.0DomainOlfactomedin-like

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-38401139_159915751617.0DomainSAM-dependent MTase C5-type
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-39411139_159915911633.0DomainSAM-dependent MTase C5-type
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-38401139_161615751617.0RegionNote=Catalytic
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-39411139_161615911633.0RegionNote=Catalytic
TgeneOLFM2chr19:10246412chr19:9968537ENST000002648331658_8571455.0Coiled coilOntology_term=ECO:0000255


Top

Fusion Gene Sequence for DNMT1-OLFM2


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>23699_23699_1_DNMT1-OLFM2_DNMT1_chr19_10246412_ENST00000340748_OLFM2_chr19_9968537_ENST00000264833_length(transcript)=6581nt_BP=4961nt
TCCGCGTGGGGGGGGTGTGTGCCCGCCTTGCGCATGCGTGTTCCCTGGGCATGGCCGGCTCCGTTCCATCCTTCTGCACAGGGTATCGCC
TCTCTCCGTTTGGTACATCCCCTCCTCCCCCACGCCCGGACTGGGGTGGTAGACGCCGCCTCCGCTCATCGCCCCTCCCCATCGGTTTCC
GCGCGAAAAGCCGGGGCGCCTGCGCTGCCGCCGCCGCGTCTGCTGAAGCCTCCGAGATGCCGGCGCGTACCGCCCCAGCCCGGGTGCCCA
CACTGGCCGTCCCGGCCATCTCGCTGCCCGACGATGTCCGCAGGCGGCTCAAAGATTTGGAAAGAGACAGCTTAACAGAAAAGGAATGTG
TGAAGGAGAAATTGAATCTCTTGCACGAATTTCTGCAAACAGAAATAAAGAATCAGTTATGTGACTTGGAAACCAAATTACGTAAAGAAG
AATTATCCGAGGAGGGCTACCTGGCTAAAGTCAAATCCCTTTTAAATAAAGATTTGTCCTTGGAGAACGGTGCTCATGCTTACAACCGGG
AAGTGAATGGACGTCTAGAAAACGGGAACCAAGCAAGAAGTGAAGCCCGTAGAGTGGGAATGGCAGATGCCAACAGCCCCCCCAAACCCC
TTTCCAAACCTCGCACGCCCAGGAGGAGCAAGTCCGATGGAGAGGCTAAGCCTGAACCTTCACCTAGCCCCAGGATTACAAGGAAAAGCA
CCAGGCAAACCACCATCACATCTCATTTTGCAAAGGGCCCTGCCAAACGGAAACCTCAGGAAGAGTCTGAAAGAGCCAAATCGGATGAGT
CCATCAAGGAAGAAGACAAAGACCAGGATGAGAAGAGACGTAGAGTTACATCCAGAGAACGAGTTGCTAGACCGCTTCCTGCAGAAGAAC
CTGAAAGAGCAAAATCAGGAACGCGCACTGAAAAGGAAGAAGAAAGAGATGAAAAAGAAGAAAAGAGACTCCGAAGTCAAACCAAAGAAC
CAACACCCAAACAGAAACTGAAGGAGGAGCCGGACAGAGAAGCCAGGGCAGGCGTGCAGGCTGACGAGGACGAAGATGGAGACGAGAAAG
ATGAGAAGAAGCACAGAAGTCAACCCAAAGATCTAGCTGCCAAACGGAGGCCCGAAGAAAAAGAACCTGAAAAAGTAAATCCACAGATTT
CTGATGAAAAAGACGAGGATGAAAAGGAGGAGAAGAGACGCAAAACGACCCCCAAAGAACCAACGGAGAAAAAAATGGCTCGCGCCAAAA
CAGTCATGAACTCCAAGACCCACCCTCCCAAGTGCATTCAGTGCGGGCAGTACCTGGACGACCCTGACCTCAAATATGGGCAGCACCCAC
CAGACGCGGTGGATGAGCCACAGATGCTGACAAATGAGAAGCTGTCCATCTTTGATGCCAACGAGTCTGGCTTTGAGAGTTATGAGGCGC
TTCCCCAGCACAAACTGACCTGCTTCAGTGTGTACTGTAAGCACGGTCACCTGTGTCCCATCGACACCGGCCTCATCGAGAAGAATATCG
AACTCTTCTTTTCTGGTTCAGCAAAACCAATCTATGATGATGACCCATCTCTTGAAGGTGGTGTTAATGGCAAAAATCTTGGCCCCATAA
ATGAATGGTGGATCACTGGCTTTGATGGAGGTGAAAAGGCCCTCATCGGCTTCAGCACCTCATTTGCCGAATACATTCTGATGGATCCCA
GTCCCGAGTATGCGCCCATATTTGGGCTGATGCAGGAGAAGATCTACATCAGCAAGATTGTGGTGGAGTTCCTGCAGAGCAATTCCGACT
CGACCTATGAGGACCTGATCAACAAGATCGAGACCACGGTTCCTCCTTCTGGCCTCAACTTGAACCGCTTCACAGAGGACTCCCTCCTGC
GACACGCGCAGTTTGTGGTGGAGCAGGTGGAGAGTTATGACGAGGCCGGGGACAGTGATGAGCAGCCCATCTTCCTGACACCCTGCATGC
GGGACCTGATCAAGCTGGCTGGGGTCACGCTGGGACAGAGGCGAGCCCAGGCGAGGCGGCAGACCATCAGGCATTCTACCAGGGAGAAGG
ACAGGGGACCCACGAAAGCCACCACCACCAAGCTGGTCTACCAGATCTTCGATACTTTCTTCGCAGAGCAAATTGAAAAGGATGACAGAG
AAGACAAGGAGAACGCCTTTAAGCGCCGGCGATGTGGCGTCTGTGAGGTGTGTCAGCAGCCTGAGTGTGGGAAATGTAAAGCCTGCAAGG
ACATGGTTAAATTTGGTGGCAGTGGACGGAGCAAGCAGGCTTGCCAAGAGCGGAGGTGTCCCAATATGGCCATGAAGGAGGCAGATGACG
ATGAGGAAGTCGATGATAACATCCCAGAGATGCCGTCACCCAAAAAAATGCACCAGGGGAAGAAGAAGAAACAGAACAAGAATCGCATCT
CTTGGGTCGGAGAAGCCGTCAAGACTGATGGGAAGAAGAGTTACTATAAGAAGGTGTGCATTGATGCGGAAACCCTGGAAGTGGGGGACT
GTGTCTCTGTTATTCCAGATGATTCCTCAAAACCGCTGTATCTAGCAAGGGTCACGGCGCTGTGGGAGGACAGCAGCAACGGGCAGATGT
TTCACGCCCACTGGTTCTGCGCTGGGACAGACACAGTCCTCGGGGCCACGTCGGACCCTCTGGAGCTGTTCTTGGTGGATGAATGTGAGG
ACATGCAGCTTTCATATATCCACAGCAAAGTGAAAGTCATCTACAAAGCCCCCTCCGAAAACTGGGCCATGGAGGGAGGCATGGATCCCG
AGTCCCTGCTGGAGGGGGACGACGGGAAGACCTACTTCTACCAGCTGTGGTATGATCAAGACTACGCGAGATTCGAGTCCCCTCCAAAAA
CCCAGCCAACAGAGGACAACAAGTTCAAATTCTGTGTGAGCTGTGCCCGTCTGGCTGAGATGAGGCAAAAAGAAATCCCCAGGGTCCTGG
AGCAGCTCGAGGACCTGGATAGCCGGGTCCTCTACTACTCAGCCACCAAGAACGGCATCCTGTACCGAGTTGGTGATGGTGTGTACCTGC
CCCCTGAGGCCTTCACGTTCAACATCAAGCTGTCCAGTCCCGTGAAACGCCCACGGAAGGAGCCCGTGGATGAGGACCTGTACCCAGAGC
ACTACCGGAAATACTCCGACTACATCAAAGGCAGCAACCTGGATGCCCCTGAGCCCTACCGAATTGGCCGGATCAAAGAGATCTTCTGTC
CCAAGAAGAGCAACGGCAGGCCCAATGAGACTGACATCAAAATCCGGGTCAACAAGTTCTACAGGCCTGAGAACACCCACAAGTCCACTC
CAGCGAGCTACCACGCAGACATCAACCTGCTCTACTGGAGCGACGAGGAGGCCGTGGTGGACTTCAAGGCTGTGCAGGGCCGCTGCACCG
TGGAGTATGGGGAGGACCTGCCCGAGTGCGTCCAGGTGTACTCCATGGGCGGCCCCAACCGCTTCTACTTCCTCGAGGCCTATAATGCAA
AGAGCAAAAGCTTTGAAGATCCTCCCAACCATGCCCGTAGCCCTGGAAACAAAGGGAAGGGCAAGGGAAAAGGGAAGGGCAAGCCCAAGT
CCCAAGCCTGTGAGCCGAGCGAGCCAGAGATAGAGATCAAGCTGCCCAAGCTGCGGACCCTGGATGTGTTTTCTGGCTGCGGGGGGTTGT
CGGAGGGATTCCACCAAGCAGGCATCTCTGACACGCTGTGGGCCATCGAGATGTGGGACCCTGCGGCCCAGGCGTTCCGGCTGAACAACC
CCGGCTCCACAGTGTTCACAGAGGACTGCAACATCCTGCTGAAGCTGGTCATGGCTGGGGAGACCACCAACTCCCGCGGCCAGCGGCTGC
CCCAGAAGGGAGACGTGGAGATGCTGTGCGGCGGGCCGCCCTGCCAGGGCTTCAGCGGCATGAACCGCTTCAATTCGCGCACCTACTCCA
AGTTCAAAAACTCTCTGGTGGTTTCCTTCCTCAGCTACTGCGACTACTACCGGCCCCGGTTCTTCCTCCTGGAGAATGTCAGGAACTTTG
TCTCCTTCAAGCGCTCCATGGTCCTGAAGCTCACCCTCCGCTGCCTGGTCCGCATGGGCTATCAGTGCACCTTCGGCGTGCTGCAGGCCG
GTCAGTACGGCGTGGCCCAGACTAGGAGGCGGGCCATCATCCTGGCCGCGGCCCCTGGAGAGAAGCTCCCTCTGTTCCCGGAGCCACTGC
ACGTGTTTGCTCCCCGGGCCTGCCAGCTGAGCGTGGTGGTGGATGACAAGAAGTTTGTGAGCAACATAACCAGGTTGAGCTCGGGTCCTT
TCCGGACCATCACGGTGCGAGACACGATGTCCGACCTGCCGGAGGTGCGGAATGGAGCCTCGGCACTGGAGATCTCCTACAACGGGGAGC
CTCAGTCCTGGTTCCAGAGGCAGCTCCGGGGCGCACAGTACCAGCCCATCCTCAGGGACCACATCTGTAAGGACATGAGTGCATTGGTGG
CTGCCCGCATGCGGCACATCCCCTTGGCCCCAGGGTCAGACTGGCGCGATCTGCCCAACATCGAGGTGCGGCTCTCAGACGGCACCATGG
CCAGGAAGCTGCGGTATACCCACCATGACAGGAAGAACGGCCGCAGCAGCTCTGGGGCCCTCCGTGGGGTCTGCTCCTGCGTGGAAGCCG
GCAAAGCCTGCGACCCCGCAGCCAGGCAGTTCAACACCCTCATCCCCTGGTGCCTGCCCCACACCGGGAACCGGCACAACCACTGGGCTG
GCCTCTATGGAAGGCTCGAGTGGGACGGCTTCTTCAGCACAACCGTCACCAACCCCGAGCCCATGGGCAAGCAGGGCCGCGTGCTCCACC
CAGAGCAGCACCGTGTGGTGAGCGTGCGGGAGTGTGCCCGCTCCCAGGGCTTCCCTGACACCTACCGGCTCTTCGGCAACATCCTGGACA
AGCACCGGCAGGTCCAGAACGTCTCCCAGTCCATGGAGGTCCTTGAGTTGCGGACGTATCGCGACCTCCAGTATGTACGCGGCATGGAGA
CCCTCATGCGGAGCCTGGATGCGCGGCTCCGGGCAGCTGATGGGTCCCTCTCGGCCAAGAGCTTCCAGGAGCTGAAGGACAGGATGACGG
AACTGTTGCCCCTGAGCTCGGTCCTGGAGCAGTACAAGGCAGACACGCGGACCATTGTACGCTTGCGGGAGGAGGTGAGGAATCTCTCCG
GCAGTCTGGCGGCCATTCAGGAGGAGATGGGTGCCTACGGGTATGAGGACCTGCAGCAACGGGTGATGGCCCTGGAGGCCCGGCTCCACG
CCTGCGCCCAGAAGCTGGGCTGTGGGAAGCTGACCGGGGTCAGTAACCCCATCACCGTTCGGGCCATGGGGTCCCGCTTCGGCTCCTGGA
TGACTGACACGATGGCCCCCAGTGCGGATAGCCGGGTCTGGTACATGGATGGCTATTACAAAGGCCGCCGGGTCCTGGAGTTCCGTACCC
TGGGAGACTTCATCAAAGGCCAGAACTTTATCCAGCACCTGCTGCCCCAGCCGTGGGCGGGCACGGGCCACGTGGTGTACAACGGCTCCC
TGTTCTATAACAAGTACCAGAGCAACGTGGTGGTCAAATACCACTTCCGCTCGCGCTCTGTGCTGGTGCAGAGGAGCCTCCCGGGCGCCG
GTTACAACAACACCTTCCCCTACTCCTGGGGCGGCTTCTCCGACATGGACTTCATGGTGGACGAGAGCGGGCTCTGGGCTGTGTACACCA
CCAACCAGAACGCGGGCAACATCGTGGTCAGCCGGCTGGACCCGCACACCCTCGAGGTCATGCGGTCCTGGGACACCGGCTACCCCAAGC
GCAGCGCTGGCGAGGCCTTCATGATCTGCGGTGTGCTCTACGTGACCAACTCCCACCTGGCTGGGGCCAAGGTCTACTTCGCCTATTTTA
CCAACACGTCCAGTTACGAGTACACGGACGTGCCCTTCCACAACCAGTATTCCCACATCTCGATGCTGGATTACAACCCCCGGGAGCGCG
CCCTCTATACCTGGAACAACGGCCACCAGGTGCTCTACAATGTCACCCTGTTTCACGTCATCAGCACCTCTGGGGACCCCTGAGCCAATG
CTGTGGCTCGGGCTGCTGCCTGGGGGGCCTCTGGGGGCTGGGGGCCCTTTTCATTCTGCCTGTGTCCCTCAAGGGTGATCTCTCTGTCTC
TGTCACGCCCTTTCTCCCCGCCTTTTTGCTGGGCTTTTGTTCTCTGCCTATGTATTTCTGTCTATTTTTTCAATTTCCCCTCTTCTCCTT
TATTGATCTCTGCTTTTAATACACCACTTCTTTCTTTCTGCCTTTTTATGGATGTCTTTTTCTTTTTATGGCTCTGGTTCTCCAGTTCTT
TCCGTCTCTGCCTCTCTCTGTCTCTCTCTCTCTGTCCTTCCACCCCTCCCTCCTTGCTTCCCACCCATTCCTCATCCCTCACTCCCACCC
CCACCCCCACCCCCAGGAGTTGAGTGCATGGATCTGTTTCTTTTTTTATTTACACTTTTTCTTTCCGGTTTGCCGGAATAAACAGGACCT

>23699_23699_1_DNMT1-OLFM2_DNMT1_chr19_10246412_ENST00000340748_OLFM2_chr19_9968537_ENST00000264833_length(amino acids)=2022AA_BP=1
MGMAGSVPSFCTGYRLSPFGTSPPPPRPDWGGRRRLRSSPLPIGFRAKSRGACAAAAASAEASEMPARTAPARVPTLAVPAISLPDDVRR
RLKDLERDSLTEKECVKEKLNLLHEFLQTEIKNQLCDLETKLRKEELSEEGYLAKVKSLLNKDLSLENGAHAYNREVNGRLENGNQARSE
ARRVGMADANSPPKPLSKPRTPRRSKSDGEAKPEPSPSPRITRKSTRQTTITSHFAKGPAKRKPQEESERAKSDESIKEEDKDQDEKRRR
VTSRERVARPLPAEEPERAKSGTRTEKEEERDEKEEKRLRSQTKEPTPKQKLKEEPDREARAGVQADEDEDGDEKDEKKHRSQPKDLAAK
RRPEEKEPEKVNPQISDEKDEDEKEEKRRKTTPKEPTEKKMARAKTVMNSKTHPPKCIQCGQYLDDPDLKYGQHPPDAVDEPQMLTNEKL
SIFDANESGFESYEALPQHKLTCFSVYCKHGHLCPIDTGLIEKNIELFFSGSAKPIYDDDPSLEGGVNGKNLGPINEWWITGFDGGEKAL
IGFSTSFAEYILMDPSPEYAPIFGLMQEKIYISKIVVEFLQSNSDSTYEDLINKIETTVPPSGLNLNRFTEDSLLRHAQFVVEQVESYDE
AGDSDEQPIFLTPCMRDLIKLAGVTLGQRRAQARRQTIRHSTREKDRGPTKATTTKLVYQIFDTFFAEQIEKDDREDKENAFKRRRCGVC
EVCQQPECGKCKACKDMVKFGGSGRSKQACQERRCPNMAMKEADDDEEVDDNIPEMPSPKKMHQGKKKKQNKNRISWVGEAVKTDGKKSY
YKKVCIDAETLEVGDCVSVIPDDSSKPLYLARVTALWEDSSNGQMFHAHWFCAGTDTVLGATSDPLELFLVDECEDMQLSYIHSKVKVIY
KAPSENWAMEGGMDPESLLEGDDGKTYFYQLWYDQDYARFESPPKTQPTEDNKFKFCVSCARLAEMRQKEIPRVLEQLEDLDSRVLYYSA
TKNGILYRVGDGVYLPPEAFTFNIKLSSPVKRPRKEPVDEDLYPEHYRKYSDYIKGSNLDAPEPYRIGRIKEIFCPKKSNGRPNETDIKI
RVNKFYRPENTHKSTPASYHADINLLYWSDEEAVVDFKAVQGRCTVEYGEDLPECVQVYSMGGPNRFYFLEAYNAKSKSFEDPPNHARSP
GNKGKGKGKGKGKPKSQACEPSEPEIEIKLPKLRTLDVFSGCGGLSEGFHQAGISDTLWAIEMWDPAAQAFRLNNPGSTVFTEDCNILLK
LVMAGETTNSRGQRLPQKGDVEMLCGGPPCQGFSGMNRFNSRTYSKFKNSLVVSFLSYCDYYRPRFFLLENVRNFVSFKRSMVLKLTLRC
LVRMGYQCTFGVLQAGQYGVAQTRRRAIILAAAPGEKLPLFPEPLHVFAPRACQLSVVVDDKKFVSNITRLSSGPFRTITVRDTMSDLPE
VRNGASALEISYNGEPQSWFQRQLRGAQYQPILRDHICKDMSALVAARMRHIPLAPGSDWRDLPNIEVRLSDGTMARKLRYTHHDRKNGR
SSSGALRGVCSCVEAGKACDPAARQFNTLIPWCLPHTGNRHNHWAGLYGRLEWDGFFSTTVTNPEPMGKQGRVLHPEQHRVVSVRECARS
QGFPDTYRLFGNILDKHRQVQNVSQSMEVLELRTYRDLQYVRGMETLMRSLDARLRAADGSLSAKSFQELKDRMTELLPLSSVLEQYKAD
TRTIVRLREEVRNLSGSLAAIQEEMGAYGYEDLQQRVMALEARLHACAQKLGCGKLTGVSNPITVRAMGSRFGSWMTDTMAPSADSRVWY
MDGYYKGRRVLEFRTLGDFIKGQNFIQHLLPQPWAGTGHVVYNGSLFYNKYQSNVVVKYHFRSRSVLVQRSLPGAGYNNTFPYSWGGFSD
MDFMVDESGLWAVYTTNQNAGNIVVSRLDPHTLEVMRSWDTGYPKRSAGEAFMICGVLYVTNSHLAGAKVYFAYFTNTSSYEYTDVPFHN

--------------------------------------------------------------
>23699_23699_2_DNMT1-OLFM2_DNMT1_chr19_10246412_ENST00000359526_OLFM2_chr19_9968537_ENST00000264833_length(transcript)=6573nt_BP=4953nt
GGCTCCGTTCCATCCTTCTGCACAGGGTATCGCCTCTCTCCGTTTGGTACATCCCCTCCTCCCCCACGCCCGGACTGGGGTGGTAGACGC
CGCCTCCGCTCATCGCCCCTCCCCATCGGTTTCCGCGCGAAAAGCCGGGGCGCCTGCGCTGCCGCCGCCGCGTCTGCTGAAGCCTCCGAG
ATGCCGGCGCGTACCGCCCCAGCCCGGGTGCCCACACTGGCCGTCCCGGCCATCTCGCTGCCCGACGATGTCCGCAGGCGGCTCAAAGAT
TTGGAAAGAGACAGCTTAACAGAAAAGGAATGTGTGAAGGAGAAATTGAATCTCTTGCACGAATTTCTGCAAACAGAAATAAAGAATCAG
TTATGTGACTTGGAAACCAAATTACGTAAAGAAGAATTATCCGAGGAGGGCTACCTGGCTAAAGTCAAATCCCTTTTAAATAAAGATTTG
TCCTTGGAGAACGGTGCTCATGCTTACAACCGGGAAGTGAATGGACGTCTAGAAAACGGGAACCAAGCAAGAAGTGAAGCCCGTAGAGTG
GGAATGGCAGATGCCAACAGCCCCCCCAAACCCCTTTCCAAACCTCGCACGCCCAGGAGGAGCAAGTCCGATGGAGAGGCTAAGCGTTCA
AGAGACCCTCCTGCCTCAGCCTCCCAAGTAACTGGGATTAGAGCTGAACCTTCACCTAGCCCCAGGATTACAAGGAAAAGCACCAGGCAA
ACCACCATCACATCTCATTTTGCAAAGGGCCCTGCCAAACGGAAACCTCAGGAAGAGTCTGAAAGAGCCAAATCGGATGAGTCCATCAAG
GAAGAAGACAAAGACCAGGATGAGAAGAGACGTAGAGTTACATCCAGAGAACGAGTTGCTAGACCGCTTCCTGCAGAAGAACCTGAAAGA
GCAAAATCAGGAACGCGCACTGAAAAGGAAGAAGAAAGAGATGAAAAAGAAGAAAAGAGACTCCGAAGTCAAACCAAAGAACCAACACCC
AAACAGAAACTGAAGGAGGAGCCGGACAGAGAAGCCAGGGCAGGCGTGCAGGCTGACGAGGACGAAGATGGAGACGAGAAAGATGAGAAG
AAGCACAGAAGTCAACCCAAAGATCTAGCTGCCAAACGGAGGCCCGAAGAAAAAGAACCTGAAAAAGTAAATCCACAGATTTCTGATGAA
AAAGACGAGGATGAAAAGGAGGAGAAGAGACGCAAAACGACCCCCAAAGAACCAACGGAGAAAAAAATGGCTCGCGCCAAAACAGTCATG
AACTCCAAGACCCACCCTCCCAAGTGCATTCAGTGCGGGCAGTACCTGGACGACCCTGACCTCAAATATGGGCAGCACCCACCAGACGCG
GTGGATGAGCCACAGATGCTGACAAATGAGAAGCTGTCCATCTTTGATGCCAACGAGTCTGGCTTTGAGAGTTATGAGGCGCTTCCCCAG
CACAAACTGACCTGCTTCAGTGTGTACTGTAAGCACGGTCACCTGTGTCCCATCGACACCGGCCTCATCGAGAAGAATATCGAACTCTTC
TTTTCTGGTTCAGCAAAACCAATCTATGATGATGACCCATCTCTTGAAGGTGGTGTTAATGGCAAAAATCTTGGCCCCATAAATGAATGG
TGGATCACTGGCTTTGATGGAGGTGAAAAGGCCCTCATCGGCTTCAGCACCTCATTTGCCGAATACATTCTGATGGATCCCAGTCCCGAG
TATGCGCCCATATTTGGGCTGATGCAGGAGAAGATCTACATCAGCAAGATTGTGGTGGAGTTCCTGCAGAGCAATTCCGACTCGACCTAT
GAGGACCTGATCAACAAGATCGAGACCACGGTTCCTCCTTCTGGCCTCAACTTGAACCGCTTCACAGAGGACTCCCTCCTGCGACACGCG
CAGTTTGTGGTGGAGCAGGTGGAGAGTTATGACGAGGCCGGGGACAGTGATGAGCAGCCCATCTTCCTGACACCCTGCATGCGGGACCTG
ATCAAGCTGGCTGGGGTCACGCTGGGACAGAGGCGAGCCCAGGCGAGGCGGCAGACCATCAGGCATTCTACCAGGGAGAAGGACAGGGGA
CCCACGAAAGCCACCACCACCAAGCTGGTCTACCAGATCTTCGATACTTTCTTCGCAGAGCAAATTGAAAAGGATGACAGAGAAGACAAG
GAGAACGCCTTTAAGCGCCGGCGATGTGGCGTCTGTGAGGTGTGTCAGCAGCCTGAGTGTGGGAAATGTAAAGCCTGCAAGGACATGGTT
AAATTTGGTGGCAGTGGACGGAGCAAGCAGGCTTGCCAAGAGCGGAGGTGTCCCAATATGGCCATGAAGGAGGCAGATGACGATGAGGAA
GTCGATGATAACATCCCAGAGATGCCGTCACCCAAAAAAATGCACCAGGGGAAGAAGAAGAAACAGAACAAGAATCGCATCTCTTGGGTC
GGAGAAGCCGTCAAGACTGATGGGAAGAAGAGTTACTATAAGAAGGTGTGCATTGATGCGGAAACCCTGGAAGTGGGGGACTGTGTCTCT
GTTATTCCAGATGATTCCTCAAAACCGCTGTATCTAGCAAGGGTCACGGCGCTGTGGGAGGACAGCAGCAACGGGCAGATGTTTCACGCC
CACTGGTTCTGCGCTGGGACAGACACAGTCCTCGGGGCCACGTCGGACCCTCTGGAGCTGTTCTTGGTGGATGAATGTGAGGACATGCAG
CTTTCATATATCCACAGCAAAGTGAAAGTCATCTACAAAGCCCCCTCCGAAAACTGGGCCATGGAGGGAGGCATGGATCCCGAGTCCCTG
CTGGAGGGGGACGACGGGAAGACCTACTTCTACCAGCTGTGGTATGATCAAGACTACGCGAGATTCGAGTCCCCTCCAAAAACCCAGCCA
ACAGAGGACAACAAGTTCAAATTCTGTGTGAGCTGTGCCCGTCTGGCTGAGATGAGGCAAAAAGAAATCCCCAGGGTCCTGGAGCAGCTC
GAGGACCTGGATAGCCGGGTCCTCTACTACTCAGCCACCAAGAACGGCATCCTGTACCGAGTTGGTGATGGTGTGTACCTGCCCCCTGAG
GCCTTCACGTTCAACATCAAGCTGTCCAGTCCCGTGAAACGCCCACGGAAGGAGCCCGTGGATGAGGACCTGTACCCAGAGCACTACCGG
AAATACTCCGACTACATCAAAGGCAGCAACCTGGATGCCCCTGAGCCCTACCGAATTGGCCGGATCAAAGAGATCTTCTGTCCCAAGAAG
AGCAACGGCAGGCCCAATGAGACTGACATCAAAATCCGGGTCAACAAGTTCTACAGGCCTGAGAACACCCACAAGTCCACTCCAGCGAGC
TACCACGCAGACATCAACCTGCTCTACTGGAGCGACGAGGAGGCCGTGGTGGACTTCAAGGCTGTGCAGGGCCGCTGCACCGTGGAGTAT
GGGGAGGACCTGCCCGAGTGCGTCCAGGTGTACTCCATGGGCGGCCCCAACCGCTTCTACTTCCTCGAGGCCTATAATGCAAAGAGCAAA
AGCTTTGAAGATCCTCCCAACCATGCCCGTAGCCCTGGAAACAAAGGGAAGGGCAAGGGAAAAGGGAAGGGCAAGCCCAAGTCCCAAGCC
TGTGAGCCGAGCGAGCCAGAGATAGAGATCAAGCTGCCCAAGCTGCGGACCCTGGATGTGTTTTCTGGCTGCGGGGGGTTGTCGGAGGGA
TTCCACCAAGCAGGCATCTCTGACACGCTGTGGGCCATCGAGATGTGGGACCCTGCGGCCCAGGCGTTCCGGCTGAACAACCCCGGCTCC
ACAGTGTTCACAGAGGACTGCAACATCCTGCTGAAGCTGGTCATGGCTGGGGAGACCACCAACTCCCGCGGCCAGCGGCTGCCCCAGAAG
GGAGACGTGGAGATGCTGTGCGGCGGGCCGCCCTGCCAGGGCTTCAGCGGCATGAACCGCTTCAATTCGCGCACCTACTCCAAGTTCAAA
AACTCTCTGGTGGTTTCCTTCCTCAGCTACTGCGACTACTACCGGCCCCGGTTCTTCCTCCTGGAGAATGTCAGGAACTTTGTCTCCTTC
AAGCGCTCCATGGTCCTGAAGCTCACCCTCCGCTGCCTGGTCCGCATGGGCTATCAGTGCACCTTCGGCGTGCTGCAGGCCGGTCAGTAC
GGCGTGGCCCAGACTAGGAGGCGGGCCATCATCCTGGCCGCGGCCCCTGGAGAGAAGCTCCCTCTGTTCCCGGAGCCACTGCACGTGTTT
GCTCCCCGGGCCTGCCAGCTGAGCGTGGTGGTGGATGACAAGAAGTTTGTGAGCAACATAACCAGGTTGAGCTCGGGTCCTTTCCGGACC
ATCACGGTGCGAGACACGATGTCCGACCTGCCGGAGGTGCGGAATGGAGCCTCGGCACTGGAGATCTCCTACAACGGGGAGCCTCAGTCC
TGGTTCCAGAGGCAGCTCCGGGGCGCACAGTACCAGCCCATCCTCAGGGACCACATCTGTAAGGACATGAGTGCATTGGTGGCTGCCCGC
ATGCGGCACATCCCCTTGGCCCCAGGGTCAGACTGGCGCGATCTGCCCAACATCGAGGTGCGGCTCTCAGACGGCACCATGGCCAGGAAG
CTGCGGTATACCCACCATGACAGGAAGAACGGCCGCAGCAGCTCTGGGGCCCTCCGTGGGGTCTGCTCCTGCGTGGAAGCCGGCAAAGCC
TGCGACCCCGCAGCCAGGCAGTTCAACACCCTCATCCCCTGGTGCCTGCCCCACACCGGGAACCGGCACAACCACTGGGCTGGCCTCTAT
GGAAGGCTCGAGTGGGACGGCTTCTTCAGCACAACCGTCACCAACCCCGAGCCCATGGGCAAGCAGGGCCGCGTGCTCCACCCAGAGCAG
CACCGTGTGGTGAGCGTGCGGGAGTGTGCCCGCTCCCAGGGCTTCCCTGACACCTACCGGCTCTTCGGCAACATCCTGGACAAGCACCGG
CAGGTCCAGAACGTCTCCCAGTCCATGGAGGTCCTTGAGTTGCGGACGTATCGCGACCTCCAGTATGTACGCGGCATGGAGACCCTCATG
CGGAGCCTGGATGCGCGGCTCCGGGCAGCTGATGGGTCCCTCTCGGCCAAGAGCTTCCAGGAGCTGAAGGACAGGATGACGGAACTGTTG
CCCCTGAGCTCGGTCCTGGAGCAGTACAAGGCAGACACGCGGACCATTGTACGCTTGCGGGAGGAGGTGAGGAATCTCTCCGGCAGTCTG
GCGGCCATTCAGGAGGAGATGGGTGCCTACGGGTATGAGGACCTGCAGCAACGGGTGATGGCCCTGGAGGCCCGGCTCCACGCCTGCGCC
CAGAAGCTGGGCTGTGGGAAGCTGACCGGGGTCAGTAACCCCATCACCGTTCGGGCCATGGGGTCCCGCTTCGGCTCCTGGATGACTGAC
ACGATGGCCCCCAGTGCGGATAGCCGGGTCTGGTACATGGATGGCTATTACAAAGGCCGCCGGGTCCTGGAGTTCCGTACCCTGGGAGAC
TTCATCAAAGGCCAGAACTTTATCCAGCACCTGCTGCCCCAGCCGTGGGCGGGCACGGGCCACGTGGTGTACAACGGCTCCCTGTTCTAT
AACAAGTACCAGAGCAACGTGGTGGTCAAATACCACTTCCGCTCGCGCTCTGTGCTGGTGCAGAGGAGCCTCCCGGGCGCCGGTTACAAC
AACACCTTCCCCTACTCCTGGGGCGGCTTCTCCGACATGGACTTCATGGTGGACGAGAGCGGGCTCTGGGCTGTGTACACCACCAACCAG
AACGCGGGCAACATCGTGGTCAGCCGGCTGGACCCGCACACCCTCGAGGTCATGCGGTCCTGGGACACCGGCTACCCCAAGCGCAGCGCT
GGCGAGGCCTTCATGATCTGCGGTGTGCTCTACGTGACCAACTCCCACCTGGCTGGGGCCAAGGTCTACTTCGCCTATTTTACCAACACG
TCCAGTTACGAGTACACGGACGTGCCCTTCCACAACCAGTATTCCCACATCTCGATGCTGGATTACAACCCCCGGGAGCGCGCCCTCTAT
ACCTGGAACAACGGCCACCAGGTGCTCTACAATGTCACCCTGTTTCACGTCATCAGCACCTCTGGGGACCCCTGAGCCAATGCTGTGGCT
CGGGCTGCTGCCTGGGGGGCCTCTGGGGGCTGGGGGCCCTTTTCATTCTGCCTGTGTCCCTCAAGGGTGATCTCTCTGTCTCTGTCACGC
CCTTTCTCCCCGCCTTTTTGCTGGGCTTTTGTTCTCTGCCTATGTATTTCTGTCTATTTTTTCAATTTCCCCTCTTCTCCTTTATTGATC
TCTGCTTTTAATACACCACTTCTTTCTTTCTGCCTTTTTATGGATGTCTTTTTCTTTTTATGGCTCTGGTTCTCCAGTTCTTTCCGTCTC
TGCCTCTCTCTGTCTCTCTCTCTCTGTCCTTCCACCCCTCCCTCCTTGCTTCCCACCCATTCCTCATCCCTCACTCCCACCCCCACCCCC
ACCCCCAGGAGTTGAGTGCATGGATCTGTTTCTTTTTTTATTTACACTTTTTCTTTCCGGTTTGCCGGAATAAACAGGACCTTTGACATT

>23699_23699_2_DNMT1-OLFM2_DNMT1_chr19_10246412_ENST00000359526_OLFM2_chr19_9968537_ENST00000264833_length(amino acids)=1974AA_BP=0
MPARTAPARVPTLAVPAISLPDDVRRRLKDLERDSLTEKECVKEKLNLLHEFLQTEIKNQLCDLETKLRKEELSEEGYLAKVKSLLNKDL
SLENGAHAYNREVNGRLENGNQARSEARRVGMADANSPPKPLSKPRTPRRSKSDGEAKRSRDPPASASQVTGIRAEPSPSPRITRKSTRQ
TTITSHFAKGPAKRKPQEESERAKSDESIKEEDKDQDEKRRRVTSRERVARPLPAEEPERAKSGTRTEKEEERDEKEEKRLRSQTKEPTP
KQKLKEEPDREARAGVQADEDEDGDEKDEKKHRSQPKDLAAKRRPEEKEPEKVNPQISDEKDEDEKEEKRRKTTPKEPTEKKMARAKTVM
NSKTHPPKCIQCGQYLDDPDLKYGQHPPDAVDEPQMLTNEKLSIFDANESGFESYEALPQHKLTCFSVYCKHGHLCPIDTGLIEKNIELF
FSGSAKPIYDDDPSLEGGVNGKNLGPINEWWITGFDGGEKALIGFSTSFAEYILMDPSPEYAPIFGLMQEKIYISKIVVEFLQSNSDSTY
EDLINKIETTVPPSGLNLNRFTEDSLLRHAQFVVEQVESYDEAGDSDEQPIFLTPCMRDLIKLAGVTLGQRRAQARRQTIRHSTREKDRG
PTKATTTKLVYQIFDTFFAEQIEKDDREDKENAFKRRRCGVCEVCQQPECGKCKACKDMVKFGGSGRSKQACQERRCPNMAMKEADDDEE
VDDNIPEMPSPKKMHQGKKKKQNKNRISWVGEAVKTDGKKSYYKKVCIDAETLEVGDCVSVIPDDSSKPLYLARVTALWEDSSNGQMFHA
HWFCAGTDTVLGATSDPLELFLVDECEDMQLSYIHSKVKVIYKAPSENWAMEGGMDPESLLEGDDGKTYFYQLWYDQDYARFESPPKTQP
TEDNKFKFCVSCARLAEMRQKEIPRVLEQLEDLDSRVLYYSATKNGILYRVGDGVYLPPEAFTFNIKLSSPVKRPRKEPVDEDLYPEHYR
KYSDYIKGSNLDAPEPYRIGRIKEIFCPKKSNGRPNETDIKIRVNKFYRPENTHKSTPASYHADINLLYWSDEEAVVDFKAVQGRCTVEY
GEDLPECVQVYSMGGPNRFYFLEAYNAKSKSFEDPPNHARSPGNKGKGKGKGKGKPKSQACEPSEPEIEIKLPKLRTLDVFSGCGGLSEG
FHQAGISDTLWAIEMWDPAAQAFRLNNPGSTVFTEDCNILLKLVMAGETTNSRGQRLPQKGDVEMLCGGPPCQGFSGMNRFNSRTYSKFK
NSLVVSFLSYCDYYRPRFFLLENVRNFVSFKRSMVLKLTLRCLVRMGYQCTFGVLQAGQYGVAQTRRRAIILAAAPGEKLPLFPEPLHVF
APRACQLSVVVDDKKFVSNITRLSSGPFRTITVRDTMSDLPEVRNGASALEISYNGEPQSWFQRQLRGAQYQPILRDHICKDMSALVAAR
MRHIPLAPGSDWRDLPNIEVRLSDGTMARKLRYTHHDRKNGRSSSGALRGVCSCVEAGKACDPAARQFNTLIPWCLPHTGNRHNHWAGLY
GRLEWDGFFSTTVTNPEPMGKQGRVLHPEQHRVVSVRECARSQGFPDTYRLFGNILDKHRQVQNVSQSMEVLELRTYRDLQYVRGMETLM
RSLDARLRAADGSLSAKSFQELKDRMTELLPLSSVLEQYKADTRTIVRLREEVRNLSGSLAAIQEEMGAYGYEDLQQRVMALEARLHACA
QKLGCGKLTGVSNPITVRAMGSRFGSWMTDTMAPSADSRVWYMDGYYKGRRVLEFRTLGDFIKGQNFIQHLLPQPWAGTGHVVYNGSLFY
NKYQSNVVVKYHFRSRSVLVQRSLPGAGYNNTFPYSWGGFSDMDFMVDESGLWAVYTTNQNAGNIVVSRLDPHTLEVMRSWDTGYPKRSA

--------------------------------------------------------------
>23699_23699_3_DNMT1-OLFM2_DNMT1_chr19_10246412_ENST00000540357_OLFM2_chr19_9968537_ENST00000264833_length(transcript)=6534nt_BP=4914nt
GGCTCCGTTCCATCCTTCTGCACAGGGTATCGCCTCTCTCCGTTTGGTACATCCCCTCCTCCCCCACGCCCGGACTGGGGTGGTAGACGC
CGCCTCCGCTCATCGCCCCTCCCCATCGGTTTCCGCGCGAAAAGCCGGGGCGCCTGCGCTGCCGCCGCCGCGTCTGCTGAAGCCTCCGAG
ATGCCGGCGCGTACCGCCCCAGCCCGGGTGCCCACACTGGCCGTCCCGGCCATCTCGCTGCCCGACGATGTCCGCAGGCGGCTCAAAGAT
TTGGAAAGAGACAGCTTAACAGAAAAGGAATGTGTGAAGGAGAAATTGAATCTCTTGCACGAATTTCTGCAAACAGAAATAAAGAATCAG
TTATGTGACTTGGAAACCAAATTACGTAAAGAAGAATTATCCGAGGAGGGCTACCTGGCTAAAGTCAAATCCCTTTTAAATAAAGATTTG
TCCTTGGAGAACGGTGCTCATGCTTACAACCGGGAAGTGAATGGACGTCTAGAAAACGGGAACCAAGCAAGAAGTGAAGCCCGTAGAGTG
GGAATGGCAGATGCCAACAGCCCCCCCAAACCCCTTTCCAAACCTCGCACGCCCAGGAGGAGCAAGTCCGATGGAGAGGCTAAGCCTGAA
CCTTCACCTAGCCCCAGGATTACAAGGAAAAGCACCAGGCAAACCACCATCACATCTCATTTTGCAAAGGGCCCTGCCAAACGGAAACCT
CAGGAAGAGTCTGAAAGAGCCAAATCGGATGAGTCCATCAAGGAAGAAGACAAAGACCAGGATGAGAAGAGACGTAGAGTTACATCCAGA
GAACGAGTTGCTAGACCGCTTCCTGCAGAAGAACCTGAAAGAGCAAAATCAGGAACGCGCACTGAAAAGGAAGAAGAAAGAGATGAAAAA
GAAGAAAAGAGACTCCGAAGTCAAACCAAAGAACCAACACCCAAACAGAAACTGAAGGAGGAGCCGGACAGAGAAGCCAGGGCAGGCGTG
CAGGCTGACGAGGACGAAGATGGAGACGAGAAAGATGAGAAGAAGCACAGAAGTCAACCCAAAGATCTAGCTGCCAAACGGAGGCCCGAA
GAAAAAGAACCTGAAAAAGTAAATCCACAGATTTCTGATGAAAAAGACGAGGATGAAAAGGAGGAGAAGAGACGCAAAACGACCCCCAAA
GAACCAACGGAGAAAAAAATGGCTCGCGCCAAAACAGTCATGAACTCCAAGACCCACCCTCCCAAGTGCATTCAGTGCGGGCAGTACCTG
GACGACCCTGACCTCAAATATGGGCAGCACCCACCAGACGCGGTGGATGAGCCACAGATGCTGACAAATGAGAAGCTGTCCATCTTTGAT
GCCAACGAGTCTGGCTTTGAGAGTTATGAGGCGCTTCCCCAGCACAAACTGACCTGCTTCAGTGTGTACTGTAAGCACGGTCACCTGTGT
CCCATCGACACCGGCCTCATCGAGAAGAATATCGAACTCTTCTTTTCTGGTTCAGCAAAACCAATCTATGATGATGACCCATCTCTTGAA
GGTGGTGTTAATGGCAAAAATCTTGGCCCCATAAATGAATGGTGGATCACTGGCTTTGATGGAGGTGAAAAGGCCCTCATCGGCTTCAGC
ACCTCATTTGCCGAATACATTCTGATGGATCCCAGTCCCGAGTATGCGCCCATATTTGGGCTGATGCAGGAGAAGATCTACATCAGCAAG
ATTGTGGTGGAGTTCCTGCAGAGCAATTCCGACTCGACCTATGAGGACCTGATCAACAAGATCGAGACCACGGTTCCTCCTTCTGGCCTC
AACTTGAACCGCTTCACAGAGGACTCCCTCCTGCGACACGCGCAGTTTGTGGTGGAGCAGGTGGAGAGTTATGACGAGGCCGGGGACAGT
GATGAGCAGCCCATCTTCCTGACACCCTGCATGCGGGACCTGATCAAGCTGGCTGGGGTCACGCTGGGACAGAGGCGAGCCCAGGCGAGG
CGGCAGACCATCAGGCATTCTACCAGGGAGAAGGACAGGGGACCCACGAAAGCCACCACCACCAAGCTGGTCTACCAGATCTTCGATACT
TTCTTCGCAGAGCAAATTGAAAAGGATGACAGAGAAGACAAGGAGAACGCCTTTAAGCGCCGGCGATGTGGCGTCTGTGAGGTGTGTCAG
CAGCCTGAGTGTGGGAAATGTAAAGCCTGCAAGGACATGGTTAAATTTGGTGGCAGTGGACGGAGCAAGCAGGCTTGCCAAGAGCGGAGG
TGTCCCAATATGGCCATGAAGGAGGCAGATGACGATGAGGAAGTCGATGATAACATCCCAGAGATGCCGTCACCCAAAAAAATGCACCAG
GGGAAGAAGAAGAAACAGAACAAGAATCGCATCTCTTGGGTCGGAGAAGCCGTCAAGACTGATGGGAAGAAGAGTTACTATAAGAAGGTG
TGCATTGATGCGGAAACCCTGGAAGTGGGGGACTGTGTCTCTGTTATTCCAGATGATTCCTCAAAACCGCTGTATCTAGCAAGGGTCACG
GCGCTGTGGGAGGACAGCAGCAACGGGCAGATGTTTCACGCCCACTGGTTCTGCGCTGGGACAGACACAGTCCTCGGGGCCACGTCGGAC
CCTCTGGAGCTGTTCTTGGTGGATGAATGTGAGGACATGCAGCTTTCATATATCCACAGCAAAGTGAAAGTCATCTACAAAGCCCCCTCC
GAAAACTGGGCCATGGAGGGAGGCATGGATCCCGAGTCCCTGCTGGAGGGGGACGACGGGAAGACCTACTTCTACCAGCTGTGGTATGAT
CAAGACTACGCGAGATTCGAGTCCCCTCCAAAAACCCAGCCAACAGAGGACAACAAGTTCAAATTCTGTGTGAGCTGTGCCCGTCTGGCT
GAGATGAGGCAAAAAGAAATCCCCAGGGTCCTGGAGCAGCTCGAGGACCTGGATAGCCGGGTCCTCTACTACTCAGCCACCAAGAACGGC
ATCCTGTACCGAGTTGGTGATGGTGTGTACCTGCCCCCTGAGGCCTTCACGTTCAACATCAAGCTGTCCAGTCCCGTGAAACGCCCACGG
AAGGAGCCCGTGGATGAGGACCTGTACCCAGAGCACTACCGGAAATACTCCGACTACATCAAAGGCAGCAACCTGGATGCCCCTGAGCCC
TACCGAATTGGCCGGATCAAAGAGATCTTCTGTCCCAAGAAGAGCAACGGCAGGCCCAATGAGACTGACATCAAAATCCGGGTCAACAAG
TTCTACAGGCCTGAGAACACCCACAAGTCCACTCCAGCGAGCTACCACGCAGACATCAACCTGCTCTACTGGAGCGACGAGGAGGCCGTG
GTGGACTTCAAGGCTGTGCAGGGCCGCTGCACCGTGGAGTATGGGGAGGACCTGCCCGAGTGCGTCCAGGTGTACTCCATGGGCGGCCCC
AACCGCTTCTACTTCCTCGAGGCCTATAATGCAAAGAGCAAAAGCTTTGAAGATCCTCCCAACCATGCCCGTAGCCCTGGAAACAAAGGG
AAGGGCAAGGGAAAAGGGAAGGGCAAGCCCAAGTCCCAAGCCTGTGAGCCGAGCGAGCCAGAGATAGAGATCAAGCTGCCCAAGCTGCGG
ACCCTGGATGTGTTTTCTGGCTGCGGGGGGTTGTCGGAGGGATTCCACCAAGCAGGCATCTCTGACACGCTGTGGGCCATCGAGATGTGG
GACCCTGCGGCCCAGGCGTTCCGGCTGAACAACCCCGGCTCCACAGTGTTCACAGAGGACTGCAACATCCTGCTGAAGCTGGTCATGGCT
GGGGAGACCACCAACTCCCGCGGCCAGCGGCTGCCCCAGAAGGGAGACGTGGAGATGCTGTGCGGCGGGCCGCCCTGCCAGGGCTTCAGC
GGCATGAACCGCTTCAATTCGCGCACCTACTCCAAGTTCAAAAACTCTCTGGTGGTTTCCTTCCTCAGCTACTGCGACTACTACCGGCCC
CGGTTCTTCCTCCTGGAGAATGTCAGGAACTTTGTCTCCTTCAAGCGCTCCATGGTCCTGAAGCTCACCCTCCGCTGCCTGGTCCGCATG
GGCTATCAGTGCACCTTCGGCGTGCTGCAGGCCGGTCAGTACGGCGTGGCCCAGACTAGGAGGCGGGCCATCATCCTGGCCGCGGCCCCT
GGAGAGAAGCTCCCTCTGTTCCCGGAGCCACTGCACGTGTTTGCTCCCCGGGCCTGCCAGCTGAGCGTGGTGGTGGATGACAAGAAGTTT
GTGAGCAACATAACCAGGTTGAGCTCGGGTCCTTTCCGGACCATCACGGTGCGAGACACGATGTCCGACCTGCCGGAGGTGCGGAATGGA
GCCTCGGCACTGGAGATCTCCTACAACGGGGAGCCTCAGTCCTGGTTCCAGAGGCAGCTCCGGGGCGCACAGTACCAGCCCATCCTCAGG
GACCACATCTGTAAGGACATGAGTGCATTGGTGGCTGCCCGCATGCGGCACATCCCCTTGGCCCCAGGGTCAGACTGGCGCGATCTGCCC
AACATCGAGGTGCGGCTCTCAGACGGCACCATGGCCAGGAAGCTGCGGTATACCCACCATGACAGGAAGAACGGCCGCAGCAGCTCTGGG
GCCCTCCGTGGGGTCTGCTCCTGCGTGGAAGGTGGGTCCTCCGGCAAAGCCTGCGACCCCGCAGCCAGGCAGTTCAACACCCTCATCCCC
TGGTGCCTGCCCCACACCGGGAACCGGCACAACCACTGGGCTGGCCTCTATGGAAGGCTCGAGTGGGACGGCTTCTTCAGCACAACCGTC
ACCAACCCCGAGCCCATGGGCAAGCAGGGCCGCGTGCTCCACCCAGAGCAGCACCGTGTGGTGAGCGTGCGGGAGTGTGCCCGCTCCCAG
GGCTTCCCTGACACCTACCGGCTCTTCGGCAACATCCTGGACAAGCACCGGCAGGTCCAGAACGTCTCCCAGTCCATGGAGGTCCTTGAG
TTGCGGACGTATCGCGACCTCCAGTATGTACGCGGCATGGAGACCCTCATGCGGAGCCTGGATGCGCGGCTCCGGGCAGCTGATGGGTCC
CTCTCGGCCAAGAGCTTCCAGGAGCTGAAGGACAGGATGACGGAACTGTTGCCCCTGAGCTCGGTCCTGGAGCAGTACAAGGCAGACACG
CGGACCATTGTACGCTTGCGGGAGGAGGTGAGGAATCTCTCCGGCAGTCTGGCGGCCATTCAGGAGGAGATGGGTGCCTACGGGTATGAG
GACCTGCAGCAACGGGTGATGGCCCTGGAGGCCCGGCTCCACGCCTGCGCCCAGAAGCTGGGCTGTGGGAAGCTGACCGGGGTCAGTAAC
CCCATCACCGTTCGGGCCATGGGGTCCCGCTTCGGCTCCTGGATGACTGACACGATGGCCCCCAGTGCGGATAGCCGGGTCTGGTACATG
GATGGCTATTACAAAGGCCGCCGGGTCCTGGAGTTCCGTACCCTGGGAGACTTCATCAAAGGCCAGAACTTTATCCAGCACCTGCTGCCC
CAGCCGTGGGCGGGCACGGGCCACGTGGTGTACAACGGCTCCCTGTTCTATAACAAGTACCAGAGCAACGTGGTGGTCAAATACCACTTC
CGCTCGCGCTCTGTGCTGGTGCAGAGGAGCCTCCCGGGCGCCGGTTACAACAACACCTTCCCCTACTCCTGGGGCGGCTTCTCCGACATG
GACTTCATGGTGGACGAGAGCGGGCTCTGGGCTGTGTACACCACCAACCAGAACGCGGGCAACATCGTGGTCAGCCGGCTGGACCCGCAC
ACCCTCGAGGTCATGCGGTCCTGGGACACCGGCTACCCCAAGCGCAGCGCTGGCGAGGCCTTCATGATCTGCGGTGTGCTCTACGTGACC
AACTCCCACCTGGCTGGGGCCAAGGTCTACTTCGCCTATTTTACCAACACGTCCAGTTACGAGTACACGGACGTGCCCTTCCACAACCAG
TATTCCCACATCTCGATGCTGGATTACAACCCCCGGGAGCGCGCCCTCTATACCTGGAACAACGGCCACCAGGTGCTCTACAATGTCACC
CTGTTTCACGTCATCAGCACCTCTGGGGACCCCTGAGCCAATGCTGTGGCTCGGGCTGCTGCCTGGGGGGCCTCTGGGGGCTGGGGGCCC
TTTTCATTCTGCCTGTGTCCCTCAAGGGTGATCTCTCTGTCTCTGTCACGCCCTTTCTCCCCGCCTTTTTGCTGGGCTTTTGTTCTCTGC
CTATGTATTTCTGTCTATTTTTTCAATTTCCCCTCTTCTCCTTTATTGATCTCTGCTTTTAATACACCACTTCTTTCTTTCTGCCTTTTT
ATGGATGTCTTTTTCTTTTTATGGCTCTGGTTCTCCAGTTCTTTCCGTCTCTGCCTCTCTCTGTCTCTCTCTCTCTGTCCTTCCACCCCT
CCCTCCTTGCTTCCCACCCATTCCTCATCCCTCACTCCCACCCCCACCCCCACCCCCAGGAGTTGAGTGCATGGATCTGTTTCTTTTTTT

>23699_23699_3_DNMT1-OLFM2_DNMT1_chr19_10246412_ENST00000540357_OLFM2_chr19_9968537_ENST00000264833_length(amino acids)=1961AA_BP=0
MPARTAPARVPTLAVPAISLPDDVRRRLKDLERDSLTEKECVKEKLNLLHEFLQTEIKNQLCDLETKLRKEELSEEGYLAKVKSLLNKDL
SLENGAHAYNREVNGRLENGNQARSEARRVGMADANSPPKPLSKPRTPRRSKSDGEAKPEPSPSPRITRKSTRQTTITSHFAKGPAKRKP
QEESERAKSDESIKEEDKDQDEKRRRVTSRERVARPLPAEEPERAKSGTRTEKEEERDEKEEKRLRSQTKEPTPKQKLKEEPDREARAGV
QADEDEDGDEKDEKKHRSQPKDLAAKRRPEEKEPEKVNPQISDEKDEDEKEEKRRKTTPKEPTEKKMARAKTVMNSKTHPPKCIQCGQYL
DDPDLKYGQHPPDAVDEPQMLTNEKLSIFDANESGFESYEALPQHKLTCFSVYCKHGHLCPIDTGLIEKNIELFFSGSAKPIYDDDPSLE
GGVNGKNLGPINEWWITGFDGGEKALIGFSTSFAEYILMDPSPEYAPIFGLMQEKIYISKIVVEFLQSNSDSTYEDLINKIETTVPPSGL
NLNRFTEDSLLRHAQFVVEQVESYDEAGDSDEQPIFLTPCMRDLIKLAGVTLGQRRAQARRQTIRHSTREKDRGPTKATTTKLVYQIFDT
FFAEQIEKDDREDKENAFKRRRCGVCEVCQQPECGKCKACKDMVKFGGSGRSKQACQERRCPNMAMKEADDDEEVDDNIPEMPSPKKMHQ
GKKKKQNKNRISWVGEAVKTDGKKSYYKKVCIDAETLEVGDCVSVIPDDSSKPLYLARVTALWEDSSNGQMFHAHWFCAGTDTVLGATSD
PLELFLVDECEDMQLSYIHSKVKVIYKAPSENWAMEGGMDPESLLEGDDGKTYFYQLWYDQDYARFESPPKTQPTEDNKFKFCVSCARLA
EMRQKEIPRVLEQLEDLDSRVLYYSATKNGILYRVGDGVYLPPEAFTFNIKLSSPVKRPRKEPVDEDLYPEHYRKYSDYIKGSNLDAPEP
YRIGRIKEIFCPKKSNGRPNETDIKIRVNKFYRPENTHKSTPASYHADINLLYWSDEEAVVDFKAVQGRCTVEYGEDLPECVQVYSMGGP
NRFYFLEAYNAKSKSFEDPPNHARSPGNKGKGKGKGKGKPKSQACEPSEPEIEIKLPKLRTLDVFSGCGGLSEGFHQAGISDTLWAIEMW
DPAAQAFRLNNPGSTVFTEDCNILLKLVMAGETTNSRGQRLPQKGDVEMLCGGPPCQGFSGMNRFNSRTYSKFKNSLVVSFLSYCDYYRP
RFFLLENVRNFVSFKRSMVLKLTLRCLVRMGYQCTFGVLQAGQYGVAQTRRRAIILAAAPGEKLPLFPEPLHVFAPRACQLSVVVDDKKF
VSNITRLSSGPFRTITVRDTMSDLPEVRNGASALEISYNGEPQSWFQRQLRGAQYQPILRDHICKDMSALVAARMRHIPLAPGSDWRDLP
NIEVRLSDGTMARKLRYTHHDRKNGRSSSGALRGVCSCVEGGSSGKACDPAARQFNTLIPWCLPHTGNRHNHWAGLYGRLEWDGFFSTTV
TNPEPMGKQGRVLHPEQHRVVSVRECARSQGFPDTYRLFGNILDKHRQVQNVSQSMEVLELRTYRDLQYVRGMETLMRSLDARLRAADGS
LSAKSFQELKDRMTELLPLSSVLEQYKADTRTIVRLREEVRNLSGSLAAIQEEMGAYGYEDLQQRVMALEARLHACAQKLGCGKLTGVSN
PITVRAMGSRFGSWMTDTMAPSADSRVWYMDGYYKGRRVLEFRTLGDFIKGQNFIQHLLPQPWAGTGHVVYNGSLFYNKYQSNVVVKYHF
RSRSVLVQRSLPGAGYNNTFPYSWGGFSDMDFMVDESGLWAVYTTNQNAGNIVVSRLDPHTLEVMRSWDTGYPKRSAGEAFMICGVLYVT

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for DNMT1-OLFM2


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-38401_1201575.01617.0DMAP1
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-39411_1201591.01633.0DMAP1
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-38401_1481575.01617.0DNMT3A
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-39411_1481591.01633.0DNMT3A
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-3840149_2171575.01617.0DNMT3B
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-3941149_2171591.01633.0DNMT3B
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-3840163_1741575.01617.0PCNA
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-3941163_1741591.01633.0PCNA
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-3840308_6061575.01617.0the PRC2/EED-EZH2 complex
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-3941308_6061591.01633.0the PRC2/EED-EZH2 complex


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with
HgeneDNMT1chr19:10246412chr19:9968537ENST00000340748-38401121_16161575.01617.0the PRC2/EED-EZH2 complex
HgeneDNMT1chr19:10246412chr19:9968537ENST00000359526-39411121_16161591.01633.0the PRC2/EED-EZH2 complex


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for DNMT1-OLFM2


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for DNMT1-OLFM2


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource