FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:RUNX1-USP42 (FusionGDB2 ID:78697)

Fusion Gene Summary for RUNX1-USP42

check button Fusion gene summary
Fusion gene informationFusion gene name: RUNX1-USP42
Fusion gene ID: 78697
HgeneTgene
Gene symbol

RUNX1

USP42

Gene ID

861

84132

Gene nameRUNX family transcription factor 1ubiquitin specific peptidase 42
SynonymsAML1|AML1-EVI-1|AMLCR1|CBF2alpha|CBFA2|EVI-1|PEBP2aB|PEBP2alpha-
Cytomap

21q22.12

7p22.1

Type of geneprotein-codingprotein-coding
Descriptionrunt-related transcription factor 1AML1-EVI-1 fusion proteinPEA2-alpha BPEBP2-alpha BSL3-3 enhancer factor 1 alpha B subunitSL3/AKV core-binding factor alpha B subunitacute myeloid leukemia 1 proteincore-binding factor, runt domain, alpha subunit 2ubiquitin carboxyl-terminal hydrolase 42deubiquitinating enzyme 42ubiquitin specific protease 42ubiquitin thioesterase 42ubiquitin thiolesterase 42ubiquitin-specific-processing protease 42
Modification date2020032220200320
UniProtAcc.

Q9H9J4

Ensembl transtripts involved in fusion geneENST00000437180, ENST00000300305, 
ENST00000325074, ENST00000344691, 
ENST00000399240, ENST00000358356, 
ENST00000486278, ENST00000494829, 
ENST00000306177, 
Fusion gene scores* DoF score46 X 68 X 13=406646 X 8 X 7=336
# samples 898
** MAII scorelog2(89/40664*10)=-5.51380298959468
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(8/336*10)=-2.0703893278914
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: RUNX1 [Title/Abstract] AND USP42 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointRUNX1(36167758)-USP42(6150914), # samples:1
RUNX1(36167489)-USP42(6150914), # samples:1
USP42(6151080)-RUNX1(36167584), # samples:1
Anticipated loss of major functional domain due to fusion event.RUNX1-USP42 seems lost the major protein functional domain in Hgene partner, which is a CGC due to the frame-shifted ORF.
RUNX1-USP42 seems lost the major protein functional domain in Hgene partner, which is a transcription factor due to the frame-shifted ORF.
RUNX1-USP42 seems lost the major protein functional domain in Hgene partner, which is a tumor suppressor due to the frame-shifted ORF.
RUNX1-USP42 seems lost the major protein functional domain in Tgene partner, which is a essential gene due to the frame-shifted ORF.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
HgeneRUNX1

GO:0030097

hemopoiesis

21873977

HgeneRUNX1

GO:0045893

positive regulation of transcription, DNA-templated

10207087|14970218

HgeneRUNX1

GO:0045944

positive regulation of transcription by RNA polymerase II

9199349|10207087|14970218|21873977

TgeneUSP42

GO:0016579

protein deubiquitination

14715245


check buttonFusion gene breakpoints across RUNX1 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.

check buttonFusion gene breakpoints across USP42 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.

check button Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0)
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4acute myeloid leukemiaFR727326RUNX1chr21

36167758

USP42chr7

6150914

ChimerKB3..RUNX1chr21

36171597

-USP42chr7

6154953

+
ChiTaRS5.0N/AFR727326RUNX1chr21

36167489

-USP42chr7

6150914

+


Top

Fusion Gene ORF analysis for RUNX1-USP42

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
Frame-shiftENST00000437180ENST00000306177RUNX1chr21

36171597

-USP42chr7

6154953

+
In-frameENST00000300305ENST00000306177RUNX1chr21

36171597

-USP42chr7

6154953

+
In-frameENST00000325074ENST00000306177RUNX1chr21

36171597

-USP42chr7

6154953

+
In-frameENST00000344691ENST00000306177RUNX1chr21

36171597

-USP42chr7

6154953

+
In-frameENST00000399240ENST00000306177RUNX1chr21

36171597

-USP42chr7

6154953

+
intron-3CDSENST00000300305ENST00000306177RUNX1chr21

36167758

USP42chr7

6150914

intron-3CDSENST00000300305ENST00000306177RUNX1chr21

36167489

-USP42chr7

6150914

+
intron-3CDSENST00000325074ENST00000306177RUNX1chr21

36167758

USP42chr7

6150914

intron-3CDSENST00000325074ENST00000306177RUNX1chr21

36167489

-USP42chr7

6150914

+
intron-3CDSENST00000344691ENST00000306177RUNX1chr21

36167758

USP42chr7

6150914

intron-3CDSENST00000344691ENST00000306177RUNX1chr21

36167489

-USP42chr7

6150914

+
intron-3CDSENST00000358356ENST00000306177RUNX1chr21

36167758

USP42chr7

6150914

intron-3CDSENST00000358356ENST00000306177RUNX1chr21

36171597

-USP42chr7

6154953

+
intron-3CDSENST00000358356ENST00000306177RUNX1chr21

36167489

-USP42chr7

6150914

+
intron-3CDSENST00000399240ENST00000306177RUNX1chr21

36167758

USP42chr7

6150914

intron-3CDSENST00000399240ENST00000306177RUNX1chr21

36167489

-USP42chr7

6150914

+
intron-3CDSENST00000437180ENST00000306177RUNX1chr21

36167758

USP42chr7

6150914

intron-3CDSENST00000437180ENST00000306177RUNX1chr21

36167489

-USP42chr7

6150914

+
intron-3CDSENST00000486278ENST00000306177RUNX1chr21

36167758

USP42chr7

6150914

intron-3CDSENST00000486278ENST00000306177RUNX1chr21

36171597

-USP42chr7

6154953

+
intron-3CDSENST00000486278ENST00000306177RUNX1chr21

36167489

-USP42chr7

6150914

+
intron-3CDSENST00000494829ENST00000306177RUNX1chr21

36167758

USP42chr7

6150914

intron-3CDSENST00000494829ENST00000306177RUNX1chr21

36171597

-USP42chr7

6154953

+
intron-3CDSENST00000494829ENST00000306177RUNX1chr21

36167489

-USP42chr7

6150914

+

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000344691RUNX1chr2136171597-ENST00000306177USP42chr76154953+72202464157861731531
ENST00000325074RUNX1chr2136171597-ENST00000306177USP42chr76154953+5687931046401546
ENST00000300305RUNX1chr2136171597-ENST00000306177USP42chr76154953+6168141244551211558
ENST00000399240RUNX1chr2136171597-ENST00000306177USP42chr76154953+55217657144741467

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score

Top

Fusion Genomic Features for RUNX1-USP42


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page.

Top

Fusion Protein Features for RUNX1-USP42


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr21:36167758/chr7:6150914)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
.USP42

Q9H9J4

FUNCTION: Transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons. Recruits CREB-binding protein (CREBBP) to nuclear bodies. Component of the CREST-BRG1 complex, a multiprotein complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. In resting neurons, transcription of the c-FOS promoter is inhibited by BRG1-dependent recruitment of a phospho-RB1-HDAC1 repressor complex. Upon calcium influx, RB1 is dephosphorylated by calcineurin, which leads to release of the repressor complex. At the same time, there is increased recruitment of CREBBP to the promoter by a CREST-dependent mechanism, which leads to transcriptional activation. The CREST-BRG1 complex also binds to the NR2B promoter, and activity-dependent induction of NR2B expression involves a release of HDAC1 and recruitment of CREBBP (By similarity). {ECO:0000250}.FUNCTION: Deubiquitinating enzyme which may play an important role during spermatogenesis. {ECO:0000250}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note


Top

Fusion Gene Sequence for RUNX1-USP42


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>78697_78697_1_RUNX1-USP42_RUNX1_chr21_36171597_ENST00000300305_USP42_chr7_6154953_ENST00000306177_length(transcript)=6168nt_BP=1412nt
GGAAAGGGGAACAGTTAAATTTGTAATTTGGGTTGTGTGAAAACTTCTTTGGGCCTCATAAACAACCACAGAACCACAAGTTGGGTAGCC
TGGCAGTGTCAGAAGTCTGAACCCAGCATAGTGGTCAGCAGGCAGGACGAATCACACTGAATGCAAACCACAGGGTTTCGCAGCGTGGTG
AGCATCACCAACCCACAGCCAAGGCGGCGCTGGCTTTTTTTTTTTTTTTAATCTTTAACAATTTGAATATTTGTTTTTACAAAGGTGCAT
TTTTTAATAGGGCTTGGGGAGTCCCAGAGGTATCCAGCAGAGGGGAGAAGAAAGAGAGATGTAGGGCTAGAGGGGTGAGGCTGAAACAGT
GACCTGTCTTGGTTTTCGCTCCGAAGGTAAAAGAAATCATTGAGTCCCCCGCCTTCAGAAGAGGGTGCATTTTCAGGAGGAAGCGATGGC
TTCAGACAGCATATTTGAGTCATTTCCTTCGTACCCACAGTGCTTCATGAGAGAATGCATACTTGGAATGAATCCTTCTAGAGACGTCCA
CGATGCCAGCACGAGCCGCCGCTTCACGCCGCCTTCCACCGCGCTGAGCCCAGGCAAGATGAGCGAGGCGTTGCCGCTGGGCGCCCCGGA
CGCCGGCGCTGCCCTGGCCGGCAAGCTGAGGAGCGGCGACCGCAGCATGGTGGAGGTGCTGGCCGACCACCCGGGCGAGCTGGTGCGCAC
CGACAGCCCCAACTTCCTCTGCTCCGTGCTGCCTACGCACTGGCGCTGCAACAAGACCCTGCCCATCGCTTTCAAGGTGGTGGCCCTAGG
GGATGTTCCAGATGGCACTCTGGTCACTGTGATGGCTGGCAATGATGAAAACTACTCGGCTGAGCTGAGAAATGCTACCGCAGCCATGAA
GAACCAGGTTGCAAGATTTAATGACCTCAGGTTTGTCGGTCGAAGTGGAAGAGGGAAAAGCTTCACTCTGACCATCACTGTCTTCACAAA
CCCACCGCAAGTCGCCACCTACCACAGAGCCATCAAAATCACAGTGGATGGGCCCCGAGAACCTCGAAGACATCGGCAGAAACTAGATGA
TCAGACCAAGCCCGGGAGCTTGTCCTTTTCCGAGCGGCTCAGTGAACTGGAGCAGCTGCGGCGCACAGCCATGAGGGTCAGCCCACACCA
CCCAGCCCCCACGCCCAACCCTCGTGCCTCCCTGAACCACTCCACTGCCTTTAACCCTCAGCCTCAGAGTCAGATGCAGGATACAAGGCA
GATCCAACCATCCCCACCGTGGTCCTACGATCAGTCCTACCAATACCTGGGATCCATTGCCTCTCCTTCTGTGCACCCAGCAACGCCCAT
TTCACCTGGACGTGCCAGCGGCATGACAACCCTCTCTGCAGAACTTTCCAGTCGACTCTCAACCCTAGGTGATGGCATCGCTCCTCCACA
GAAAGTTCTTTTCCCATCTGAGAAGATTTGTCTTAAGTGGCAACAAACTCATAGAGTTGGAGCTGGGCTCCAGAATTTGGGCAATACCTG
TTTTGCCAATGCAGCACTGCAGTGTTTAACCTACACACCACCTCTTGCCAATTACATGCTATCACATGAACACTCCAAAACATGTCATGC
AGAAGGCTTTTGTATGATGTGTACAATGCAAGCACATATTACCCAGGCACTCAGTAATCCTGGGGACGTTATTAAACCAATGTTTGTCAT
CAATGAGATGCGGCGTATAGCTAGGCACTTCCGTTTTGGAAACCAAGAAGATGCCCATGAATTCCTTCAATACACTGTTGATGCTATGCA
GAAAGCATGCTTGAATGGCAGCAATAAATTAGACAGACACACCCAGGCCACCACTCTTGTTTGTCAGATATTTGGAGGATACCTAAGATC
TAGAGTCAAATGTTTAAATTGCAAGGGCGTTTCAGATACTTTTGATCCATATCTTGATATAACATTGGAGATAAAGGCTGCTCAGAGTGT
CAACAAGGCATTGGAGCAGTTTGTGAAGCCGGAACAGCTTGATGGAGAAAACTCGTACAAGTGCAGCAAGTGTAAAAAGATGGTTCCAGC
TTCAAAGAGGTTCACTATCCATAGATCCTCTAATGTTCTTACACTTTCTCTGAAACGTTTTGCAAATTTTACCGGTGGAAAAATTGCTAA
GGATGTGAAATACCCTGAGTATCTTGATATTCGGCCATATATGTCTCAACCCAACGGAGAGCCAATTGTCTACGTCTTGTATGCAGTGCT
GGTCCACACTGGTTTTAATTGCCATGCTGGCCATTACTTCTGCTACATAAAAGCTAGCAATGGCCTCTGGTATCAAATGAATGACTCCAT
TGTATCTACCAGTGATATTAGATCGGTACTCAGCCAACAAGCCTATGTGCTCTTTTATATCAGGTCCCATGATGTGAAAAATGGAGGTGA
ACTTACTCATCCCACCCATAGCCCCGGCCAGTCCTCTCCCCGCCCCGTCATCAGTCAGCGGGTTGTCACCAACAAACAGGCTGCGCCAGG
CTTTATCGGACCACAGCTTCCCTCTCACATGATAAAGAATCCACCTCACTTAAATGGGACTGGACCATTGAAAGACACGCCAAGCAGTTC
CATGTCGAGTCCTAACGGGAATTCCAGTGTCAACAGGGCTAGTCCTGTTAATGCTTCAGCTTCTGTCCAAAACTGGTCAGTTAATAGGTC
CTCAGTGATCCCAGAACATCCTAAGAAACAAAAAATTACAATCAGTATTCACAACAAGTTGCCTGTTCGCCAGTGTCAGTCTCAACCTAA
CCTTCATAGTAATTCTTTGGAGAACCCTACCAAGCCCGTTCCCTCTTCTACCATTACCAATTCTGCAGTACAGTCTACCTCGAACGCATC
TACGATGTCAGTTTCTAGTAAAGTAACAAAACCGATCCCCCGCAGTGAATCCTGCTCCCAGCCCGTGATGAATGGCAAATCCAAGCTGAA
CTCCAGCGTGCTGGTGCCCTATGGCGCCGAGTCCTCTGAGGACTCTGACGAGGAGTCAAAGGGGCTGGGCAAGGAGAATGGGATTGGTAC
GATTGTGAGCTCCCACTCTCCCGGCCAAGATGCCGAAGATGAGGAGGCCACTCCGCACGAGCTTCAAGAACCCATGACCCTAAACGGTGC
TAATAGTGCAGACAGCGACAGTGACCCGAAAGAAAACGGCCTAGCGCCTGATGGTGCCAGCTGCCAAGGCCAGCCTGCCCTGCACTCAGA
AAATCCCTTTGCTAAGGCAAACGGTCTTCCTGGAAAGTTGATGCCTGCTCCTTTGCTGTCTCTCCCAGAAGACAAAATCTTAGAGACCTT
CAGGCTTAGCAACAAACTGAAAGGCTCGACGGATGAAATGAGTGCACCTGGAGCAGAGAGGGGCCCTCCCGAGGACCGCGACGCCGAGCC
TCAGCCTGGCAGCCCCGCCGCCGAATCCCTGGAGGAGCCAGATGCGGCCGCCGGCCTCAGCAGCACCAAGAAGGCTCCGCCGCCCCGCGA
TCCCGGCACCCCCGCTACCAAAGAAGGCGCCTGGGAGGCCATGGCCGTCGCCCCCGAGGAGCCTCCGCCCAGCGCCGGCGAGGACATCGT
GGGGGACACAGCACCCCCTGACCTGTGTGATCCCGGGAGCTTAACAGGCGATGCGAGCCCGTTGTCCCAGGACGCAAAGGGGATGATCGC
GGAGGGCCCGCGGGACTCGGCGTTGGCGGAAGCCCCGGAAGGGTTGAGTCCGGCTCCGCCTGCGCGGTCGGAGGAGCCCTGCGAGCAGCC
ACTCCTTGTTCACCCCAGCGGGGACCACGCCCGGGACGCTCAGGACCCATCCCAGAGCTTGGGCGCACCCGAGGCCGCAGAGCGGCCGCC
AGCTCCTGTGCTGGACATGGCCCCGGCCGGTCACCCGGAAGGGGACGCTGAGCCTAGCCCCGGCGAGAGGGTCGAGGACGCCGCGGCGCC
GAAAGCCCCAGGCCCTTCCCCAGCGAAGGAGAAAATCGGCAGCCTCAGAAAGGTGGACCGAGGCCACTACCGCAGCCGGAGAGAGCGCTC
GTCCAGCGGGGAGCCCGCCAGAGAGAGCAGGAGCAAGACTGAGGGCCACCGTCACCGGCGGCGCCGCACCTGCCCCCGGGAGCGCGACCG
CCAGGACCGCCACGCCCCGGAGCACCACCCCGGCCACGGCGACAGGCTCAGCCCTGGCGAGCGCCGCTCTCTGGGCAGGTGCAGTCACCA
CCACTCCCGACACCGGAGCGGGGTGGAGCTGGACTGGGTCAGACACCACTACACCGAGGGCGAGCGTGGCTGGGGCCGGGAGAAGTTCTA
CCCCGACAGGCCGCGCTGGGACAGGTGCCGGTACTACCATGACAGGTACGCCCTGTACGCTGCCCGGGACTGGAAGCCCTTCCACGGCGG
CCGCGAGCACGAGCGGGCCGGGCTGCACGAGCGGCCGCACAAGGACCACAACCGGGGCCGTAGGGGCTGCGAGCCGGCCCGGGAGAGGGA
GCGGCACCGCCCCAGCAGCCCCCGCGCAGGCGCGCCCCACGCCCTCGCCCCGCACCCCGACCGCTTCTCCCACGACAGAACTGCACTTGT
AGCCGGAGACAACTGTAACCTCTCTGATCGGTTTCACGAACACGAAAATGGAAAGTCCCGGAAACGGAGACACGACAGTGTGGAGAACAG
TGACAGTCATGTTGAAAAGAAAGCCCGGAGGAGCGAACAGAAGGATCCTCTAGAAGAGCCTAAAGCAAAGAAGCACAAAAAATCAAAGAA
GAAAAAGAAATCCAAAGACAAACACCGAGACCGCGACTCCAGGCATCAGCAGGACTCAGACCTCTCAGCAGCGTGCTCTGACGCTGACCT
CCACAGACACAAAAAAAAGAAGAAGAAAAAGAAGAGACATTCAAGAAAATCAGAGGACTTTGTTAAAGATTCAGAACTGCACTTACCCAG
GGTCACCAGCTTGGAGACTGTCGCCCAGTTCCGGAGAGCCCAGGGTGGCTTTCCTCTCTCTGGTGGCCCGCCTCTGGAAGGCGTCGGACC
TTTCCGTGAGAAAACGAAACACTTACGGATGGAAAGCAGGGATGACAGGTGTCGTCTCTTTGAGTATGGCCAGGGTGATTGAAAACTCAG
CCTCAAAACAAAAAATTCACTAGTTATGATTCAACGCGTTCAACAGAAGCCATCCCCAGCCCAGCTTAAATTATAAAGATAGACAATAAC
TCTGTTCCAATCTGCGTGGTGCTTCTTTAGTAAATACTGTACAGATTTTACCATGGAGAACTTTTTTTTTAGTTTTTACCTTTTCTTAAT
TACCCTTATTCCGAATGGACGAACACTTTCTACCACTGCTGACCATTGTAAAATACCGTGTATATAAATCCCATTGAAATAATGCCCTGG
AATAGAACATCTCAAATGCTGCTTAATTACAGACTCAGGTCGATTACTTGTATTTCATGTAATGTTCCTCCAAGTTAGACATCTGGTGCA
AGACCAACCGGGAGACCATGGAATTGTCAAAAGTACAAACTGACAGTGTGTATATTTAATTTAAAGACTTATTTAAAAACTCACAAGCTC
TCACCTAGACTTTGGAGAGCAGTCTGTTTTCTGTAATGTCTGATACTAGAAACTAATTTGCTTATTTTAGTTGTATTCAAGATTTGAAGA
TGTATTTTATAGACAAGTTCTGTTTTTGAACTTTGTGGAACTGTTCCAATCAATCAATTTCCCAGTTATGATGAGTATTTACATTATGAA
TGTATAACCCAGACATGATTTGTAAAGCCGACAGTATGTTTCTATTACACAACACTTTTTGATACAGCGTCTCTTGTCTTCACTGATACT
GGAGTCTCCGTTGTCTGCTTGGTCCCTTCGAGTTTCTAGTTACAGACACAATCATACTGTGATTTTATTTTTAATATGGATATGCTATCA
AACTGTGATACACTTATAATTCACTGGTCCTGCATCAGGAGATGGAGTGGGGAAAACTGTATTTAATACAGTTTGTATCTGAATAATCTG
TATGGTTTATACAGTTTGTGTTGTTCAGAGATGTTTAAAGTTTGATCTTTGTTTTTCTAAAGATTAAAAAAGCACTTGCCCCACTGTAAA

>78697_78697_1_RUNX1-USP42_RUNX1_chr21_36171597_ENST00000300305_USP42_chr7_6154953_ENST00000306177_length(amino acids)=1558AA_BP=316
MASDSIFESFPSYPQCFMRECILGMNPSRDVHDASTSRRFTPPSTALSPGKMSEALPLGAPDAGAALAGKLRSGDRSMVEVLADHPGELV
RTDSPNFLCSVLPTHWRCNKTLPIAFKVVALGDVPDGTLVTVMAGNDENYSAELRNATAAMKNQVARFNDLRFVGRSGRGKSFTLTITVF
TNPPQVATYHRAIKITVDGPREPRRHRQKLDDQTKPGSLSFSERLSELEQLRRTAMRVSPHHPAPTPNPRASLNHSTAFNPQPQSQMQDT
RQIQPSPPWSYDQSYQYLGSIASPSVHPATPISPGRASGMTTLSAELSSRLSTLGDGIAPPQKVLFPSEKICLKWQQTHRVGAGLQNLGN
TCFANAALQCLTYTPPLANYMLSHEHSKTCHAEGFCMMCTMQAHITQALSNPGDVIKPMFVINEMRRIARHFRFGNQEDAHEFLQYTVDA
MQKACLNGSNKLDRHTQATTLVCQIFGGYLRSRVKCLNCKGVSDTFDPYLDITLEIKAAQSVNKALEQFVKPEQLDGENSYKCSKCKKMV
PASKRFTIHRSSNVLTLSLKRFANFTGGKIAKDVKYPEYLDIRPYMSQPNGEPIVYVLYAVLVHTGFNCHAGHYFCYIKASNGLWYQMND
SIVSTSDIRSVLSQQAYVLFYIRSHDVKNGGELTHPTHSPGQSSPRPVISQRVVTNKQAAPGFIGPQLPSHMIKNPPHLNGTGPLKDTPS
SSMSSPNGNSSVNRASPVNASASVQNWSVNRSSVIPEHPKKQKITISIHNKLPVRQCQSQPNLHSNSLENPTKPVPSSTITNSAVQSTSN
ASTMSVSSKVTKPIPRSESCSQPVMNGKSKLNSSVLVPYGAESSEDSDEESKGLGKENGIGTIVSSHSPGQDAEDEEATPHELQEPMTLN
GANSADSDSDPKENGLAPDGASCQGQPALHSENPFAKANGLPGKLMPAPLLSLPEDKILETFRLSNKLKGSTDEMSAPGAERGPPEDRDA
EPQPGSPAAESLEEPDAAAGLSSTKKAPPPRDPGTPATKEGAWEAMAVAPEEPPPSAGEDIVGDTAPPDLCDPGSLTGDASPLSQDAKGM
IAEGPRDSALAEAPEGLSPAPPARSEEPCEQPLLVHPSGDHARDAQDPSQSLGAPEAAERPPAPVLDMAPAGHPEGDAEPSPGERVEDAA
APKAPGPSPAKEKIGSLRKVDRGHYRSRRERSSSGEPARESRSKTEGHRHRRRRTCPRERDRQDRHAPEHHPGHGDRLSPGERRSLGRCS
HHHSRHRSGVELDWVRHHYTEGERGWGREKFYPDRPRWDRCRYYHDRYALYAARDWKPFHGGREHERAGLHERPHKDHNRGRRGCEPARE
RERHRPSSPRAGAPHALAPHPDRFSHDRTALVAGDNCNLSDRFHEHENGKSRKRRHDSVENSDSHVEKKARRSEQKDPLEEPKAKKHKKS
KKKKKSKDKHRDRDSRHQQDSDLSAACSDADLHRHKKKKKKKKRHSRKSEDFVKDSELHLPRVTSLETVAQFRRAQGGFPLSGGPPLEGV

--------------------------------------------------------------
>78697_78697_2_RUNX1-USP42_RUNX1_chr21_36171597_ENST00000325074_USP42_chr7_6154953_ENST00000306177_length(transcript)=5687nt_BP=931nt
ATGCCCGCAGCACCGCGGGGACCGGCGCAGGGGGAAGCGGCCGCCCGCACGCGAAGCCGGGATGCCAGCACGAGCCGCCGCTTCACGCCG
CCTTCCACCGCGCTGAGCCCAGGCAAGATGAGCGAGGCGTTGCCGCTGGGCGCCCCGGACGCCGGCGCTGCCCTGGCCGGCAAGCTGAGG
AGCGGCGACCGCAGCATGGTGGAGGTGCTGGCCGACCACCCGGGCGAGCTGGTGCGCACCGACAGCCCCAACTTCCTCTGCTCCGTGCTG
CCTACGCACTGGCGCTGCAACAAGACCCTGCCCATCGCTTTCAAGGTGGTGGCCCTAGGGGATGTTCCAGATGGCACTCTGGTCACTGTG
ATGGCTGGCAATGATGAAAACTACTCGGCTGAGCTGAGAAATGCTACCGCAGCCATGAAGAACCAGGTTGCAAGATTTAATGACCTCAGG
TTTGTCGGTCGAAGTGGAAGAGGGAAAAGCTTCACTCTGACCATCACTGTCTTCACAAACCCACCGCAAGTCGCCACCTACCACAGAGCC
ATCAAAATCACAGTGGATGGGCCCCGAGAACCTCGAAGACATCGGCAGAAACTAGATGATCAGACCAAGCCCGGGAGCTTGTCCTTTTCC
GAGCGGCTCAGTGAACTGGAGCAGCTGCGGCGCACAGCCATGAGGGTCAGCCCACACCACCCAGCCCCCACGCCCAACCCTCGTGCCTCC
CTGAACCACTCCACTGCCTTTAACCCTCAGCCTCAGAGTCAGATGCAGGATACAAGGCAGATCCAACCATCCCCACCGTGGTCCTACGAT
CAGTCCTACCAATACCTGGGATCCATTGCCTCTCCTTCTGTGCACCCAGCAACGCCCATTTCACCTGGACGTGCCAGCGGCATGACAACC
CTCTCTGCAGAACTTTCCAGTCGACTCTCAACCCTAGGTGATGGCATCGCTCCTCCACAGAAAGTTCTTTTCCCATCTGAGAAGATTTGT
CTTAAGTGGCAACAAACTCATAGAGTTGGAGCTGGGCTCCAGAATTTGGGCAATACCTGTTTTGCCAATGCAGCACTGCAGTGTTTAACC
TACACACCACCTCTTGCCAATTACATGCTATCACATGAACACTCCAAAACATGTCATGCAGAAGGCTTTTGTATGATGTGTACAATGCAA
GCACATATTACCCAGGCACTCAGTAATCCTGGGGACGTTATTAAACCAATGTTTGTCATCAATGAGATGCGGCGTATAGCTAGGCACTTC
CGTTTTGGAAACCAAGAAGATGCCCATGAATTCCTTCAATACACTGTTGATGCTATGCAGAAAGCATGCTTGAATGGCAGCAATAAATTA
GACAGACACACCCAGGCCACCACTCTTGTTTGTCAGATATTTGGAGGATACCTAAGATCTAGAGTCAAATGTTTAAATTGCAAGGGCGTT
TCAGATACTTTTGATCCATATCTTGATATAACATTGGAGATAAAGGCTGCTCAGAGTGTCAACAAGGCATTGGAGCAGTTTGTGAAGCCG
GAACAGCTTGATGGAGAAAACTCGTACAAGTGCAGCAAGTGTAAAAAGATGGTTCCAGCTTCAAAGAGGTTCACTATCCATAGATCCTCT
AATGTTCTTACACTTTCTCTGAAACGTTTTGCAAATTTTACCGGTGGAAAAATTGCTAAGGATGTGAAATACCCTGAGTATCTTGATATT
CGGCCATATATGTCTCAACCCAACGGAGAGCCAATTGTCTACGTCTTGTATGCAGTGCTGGTCCACACTGGTTTTAATTGCCATGCTGGC
CATTACTTCTGCTACATAAAAGCTAGCAATGGCCTCTGGTATCAAATGAATGACTCCATTGTATCTACCAGTGATATTAGATCGGTACTC
AGCCAACAAGCCTATGTGCTCTTTTATATCAGGTCCCATGATGTGAAAAATGGAGGTGAACTTACTCATCCCACCCATAGCCCCGGCCAG
TCCTCTCCCCGCCCCGTCATCAGTCAGCGGGTTGTCACCAACAAACAGGCTGCGCCAGGCTTTATCGGACCACAGCTTCCCTCTCACATG
ATAAAGAATCCACCTCACTTAAATGGGACTGGACCATTGAAAGACACGCCAAGCAGTTCCATGTCGAGTCCTAACGGGAATTCCAGTGTC
AACAGGGCTAGTCCTGTTAATGCTTCAGCTTCTGTCCAAAACTGGTCAGTTAATAGGTCCTCAGTGATCCCAGAACATCCTAAGAAACAA
AAAATTACAATCAGTATTCACAACAAGTTGCCTGTTCGCCAGTGTCAGTCTCAACCTAACCTTCATAGTAATTCTTTGGAGAACCCTACC
AAGCCCGTTCCCTCTTCTACCATTACCAATTCTGCAGTACAGTCTACCTCGAACGCATCTACGATGTCAGTTTCTAGTAAAGTAACAAAA
CCGATCCCCCGCAGTGAATCCTGCTCCCAGCCCGTGATGAATGGCAAATCCAAGCTGAACTCCAGCGTGCTGGTGCCCTATGGCGCCGAG
TCCTCTGAGGACTCTGACGAGGAGTCAAAGGGGCTGGGCAAGGAGAATGGGATTGGTACGATTGTGAGCTCCCACTCTCCCGGCCAAGAT
GCCGAAGATGAGGAGGCCACTCCGCACGAGCTTCAAGAACCCATGACCCTAAACGGTGCTAATAGTGCAGACAGCGACAGTGACCCGAAA
GAAAACGGCCTAGCGCCTGATGGTGCCAGCTGCCAAGGCCAGCCTGCCCTGCACTCAGAAAATCCCTTTGCTAAGGCAAACGGTCTTCCT
GGAAAGTTGATGCCTGCTCCTTTGCTGTCTCTCCCAGAAGACAAAATCTTAGAGACCTTCAGGCTTAGCAACAAACTGAAAGGCTCGACG
GATGAAATGAGTGCACCTGGAGCAGAGAGGGGCCCTCCCGAGGACCGCGACGCCGAGCCTCAGCCTGGCAGCCCCGCCGCCGAATCCCTG
GAGGAGCCAGATGCGGCCGCCGGCCTCAGCAGCACCAAGAAGGCTCCGCCGCCCCGCGATCCCGGCACCCCCGCTACCAAAGAAGGCGCC
TGGGAGGCCATGGCCGTCGCCCCCGAGGAGCCTCCGCCCAGCGCCGGCGAGGACATCGTGGGGGACACAGCACCCCCTGACCTGTGTGAT
CCCGGGAGCTTAACAGGCGATGCGAGCCCGTTGTCCCAGGACGCAAAGGGGATGATCGCGGAGGGCCCGCGGGACTCGGCGTTGGCGGAA
GCCCCGGAAGGGTTGAGTCCGGCTCCGCCTGCGCGGTCGGAGGAGCCCTGCGAGCAGCCACTCCTTGTTCACCCCAGCGGGGACCACGCC
CGGGACGCTCAGGACCCATCCCAGAGCTTGGGCGCACCCGAGGCCGCAGAGCGGCCGCCAGCTCCTGTGCTGGACATGGCCCCGGCCGGT
CACCCGGAAGGGGACGCTGAGCCTAGCCCCGGCGAGAGGGTCGAGGACGCCGCGGCGCCGAAAGCCCCAGGCCCTTCCCCAGCGAAGGAG
AAAATCGGCAGCCTCAGAAAGGTGGACCGAGGCCACTACCGCAGCCGGAGAGAGCGCTCGTCCAGCGGGGAGCCCGCCAGAGAGAGCAGG
AGCAAGACTGAGGGCCACCGTCACCGGCGGCGCCGCACCTGCCCCCGGGAGCGCGACCGCCAGGACCGCCACGCCCCGGAGCACCACCCC
GGCCACGGCGACAGGCTCAGCCCTGGCGAGCGCCGCTCTCTGGGCAGGTGCAGTCACCACCACTCCCGACACCGGAGCGGGGTGGAGCTG
GACTGGGTCAGACACCACTACACCGAGGGCGAGCGTGGCTGGGGCCGGGAGAAGTTCTACCCCGACAGGCCGCGCTGGGACAGGTGCCGG
TACTACCATGACAGGTACGCCCTGTACGCTGCCCGGGACTGGAAGCCCTTCCACGGCGGCCGCGAGCACGAGCGGGCCGGGCTGCACGAG
CGGCCGCACAAGGACCACAACCGGGGCCGTAGGGGCTGCGAGCCGGCCCGGGAGAGGGAGCGGCACCGCCCCAGCAGCCCCCGCGCAGGC
GCGCCCCACGCCCTCGCCCCGCACCCCGACCGCTTCTCCCACGACAGAACTGCACTTGTAGCCGGAGACAACTGTAACCTCTCTGATCGG
TTTCACGAACACGAAAATGGAAAGTCCCGGAAACGGAGACACGACAGTGTGGAGAACAGTGACAGTCATGTTGAAAAGAAAGCCCGGAGG
AGCGAACAGAAGGATCCTCTAGAAGAGCCTAAAGCAAAGAAGCACAAAAAATCAAAGAAGAAAAAGAAATCCAAAGACAAACACCGAGAC
CGCGACTCCAGGCATCAGCAGGACTCAGACCTCTCAGCAGCGTGCTCTGACGCTGACCTCCACAGACACAAAAAAAAGAAGAAGAAAAAG
AAGAGACATTCAAGAAAATCAGAGGACTTTGTTAAAGATTCAGAACTGCACTTACCCAGGGTCACCAGCTTGGAGACTGTCGCCCAGTTC
CGGAGAGCCCAGGGTGGCTTTCCTCTCTCTGGTGGCCCGCCTCTGGAAGGCGTCGGACCTTTCCGTGAGAAAACGAAACACTTACGGATG
GAAAGCAGGGATGACAGGTGTCGTCTCTTTGAGTATGGCCAGGGTGATTGAAAACTCAGCCTCAAAACAAAAAATTCACTAGTTATGATT
CAACGCGTTCAACAGAAGCCATCCCCAGCCCAGCTTAAATTATAAAGATAGACAATAACTCTGTTCCAATCTGCGTGGTGCTTCTTTAGT
AAATACTGTACAGATTTTACCATGGAGAACTTTTTTTTTAGTTTTTACCTTTTCTTAATTACCCTTATTCCGAATGGACGAACACTTTCT
ACCACTGCTGACCATTGTAAAATACCGTGTATATAAATCCCATTGAAATAATGCCCTGGAATAGAACATCTCAAATGCTGCTTAATTACA
GACTCAGGTCGATTACTTGTATTTCATGTAATGTTCCTCCAAGTTAGACATCTGGTGCAAGACCAACCGGGAGACCATGGAATTGTCAAA
AGTACAAACTGACAGTGTGTATATTTAATTTAAAGACTTATTTAAAAACTCACAAGCTCTCACCTAGACTTTGGAGAGCAGTCTGTTTTC
TGTAATGTCTGATACTAGAAACTAATTTGCTTATTTTAGTTGTATTCAAGATTTGAAGATGTATTTTATAGACAAGTTCTGTTTTTGAAC
TTTGTGGAACTGTTCCAATCAATCAATTTCCCAGTTATGATGAGTATTTACATTATGAATGTATAACCCAGACATGATTTGTAAAGCCGA
CAGTATGTTTCTATTACACAACACTTTTTGATACAGCGTCTCTTGTCTTCACTGATACTGGAGTCTCCGTTGTCTGCTTGGTCCCTTCGA
GTTTCTAGTTACAGACACAATCATACTGTGATTTTATTTTTAATATGGATATGCTATCAAACTGTGATACACTTATAATTCACTGGTCCT
GCATCAGGAGATGGAGTGGGGAAAACTGTATTTAATACAGTTTGTATCTGAATAATCTGTATGGTTTATACAGTTTGTGTTGTTCAGAGA
TGTTTAAAGTTTGATCTTTGTTTTTCTAAAGATTAAAAAAGCACTTGCCCCACTGTAAATATACAGCATGTAAAATTTCTATAGTATATA

>78697_78697_2_RUNX1-USP42_RUNX1_chr21_36171597_ENST00000325074_USP42_chr7_6154953_ENST00000306177_length(amino acids)=1546AA_BP=304
MPAAPRGPAQGEAAARTRSRDASTSRRFTPPSTALSPGKMSEALPLGAPDAGAALAGKLRSGDRSMVEVLADHPGELVRTDSPNFLCSVL
PTHWRCNKTLPIAFKVVALGDVPDGTLVTVMAGNDENYSAELRNATAAMKNQVARFNDLRFVGRSGRGKSFTLTITVFTNPPQVATYHRA
IKITVDGPREPRRHRQKLDDQTKPGSLSFSERLSELEQLRRTAMRVSPHHPAPTPNPRASLNHSTAFNPQPQSQMQDTRQIQPSPPWSYD
QSYQYLGSIASPSVHPATPISPGRASGMTTLSAELSSRLSTLGDGIAPPQKVLFPSEKICLKWQQTHRVGAGLQNLGNTCFANAALQCLT
YTPPLANYMLSHEHSKTCHAEGFCMMCTMQAHITQALSNPGDVIKPMFVINEMRRIARHFRFGNQEDAHEFLQYTVDAMQKACLNGSNKL
DRHTQATTLVCQIFGGYLRSRVKCLNCKGVSDTFDPYLDITLEIKAAQSVNKALEQFVKPEQLDGENSYKCSKCKKMVPASKRFTIHRSS
NVLTLSLKRFANFTGGKIAKDVKYPEYLDIRPYMSQPNGEPIVYVLYAVLVHTGFNCHAGHYFCYIKASNGLWYQMNDSIVSTSDIRSVL
SQQAYVLFYIRSHDVKNGGELTHPTHSPGQSSPRPVISQRVVTNKQAAPGFIGPQLPSHMIKNPPHLNGTGPLKDTPSSSMSSPNGNSSV
NRASPVNASASVQNWSVNRSSVIPEHPKKQKITISIHNKLPVRQCQSQPNLHSNSLENPTKPVPSSTITNSAVQSTSNASTMSVSSKVTK
PIPRSESCSQPVMNGKSKLNSSVLVPYGAESSEDSDEESKGLGKENGIGTIVSSHSPGQDAEDEEATPHELQEPMTLNGANSADSDSDPK
ENGLAPDGASCQGQPALHSENPFAKANGLPGKLMPAPLLSLPEDKILETFRLSNKLKGSTDEMSAPGAERGPPEDRDAEPQPGSPAAESL
EEPDAAAGLSSTKKAPPPRDPGTPATKEGAWEAMAVAPEEPPPSAGEDIVGDTAPPDLCDPGSLTGDASPLSQDAKGMIAEGPRDSALAE
APEGLSPAPPARSEEPCEQPLLVHPSGDHARDAQDPSQSLGAPEAAERPPAPVLDMAPAGHPEGDAEPSPGERVEDAAAPKAPGPSPAKE
KIGSLRKVDRGHYRSRRERSSSGEPARESRSKTEGHRHRRRRTCPRERDRQDRHAPEHHPGHGDRLSPGERRSLGRCSHHHSRHRSGVEL
DWVRHHYTEGERGWGREKFYPDRPRWDRCRYYHDRYALYAARDWKPFHGGREHERAGLHERPHKDHNRGRRGCEPARERERHRPSSPRAG
APHALAPHPDRFSHDRTALVAGDNCNLSDRFHEHENGKSRKRRHDSVENSDSHVEKKARRSEQKDPLEEPKAKKHKKSKKKKKSKDKHRD
RDSRHQQDSDLSAACSDADLHRHKKKKKKKKRHSRKSEDFVKDSELHLPRVTSLETVAQFRRAQGGFPLSGGPPLEGVGPFREKTKHLRM

--------------------------------------------------------------
>78697_78697_3_RUNX1-USP42_RUNX1_chr21_36171597_ENST00000344691_USP42_chr7_6154953_ENST00000306177_length(transcript)=7220nt_BP=2464nt
CATAGAGCCAGCGGGCGCGGGCGGGACGGGCGCCCCGCGGCCGGACCCAGCCAGGGCACCACGCTGCCCGGCCCTGCGCCGCCAGGCACT
TCTTTCCGGGGCTCCTAGGGACGCCAGAAGGAAGTCAACCTCTGCTGCTTCTCCTTGGCCTGCGTTGGACCTTCCTTTTTTTGTTGTTTT
TTTTTGTTTTTCCCCTTTCTTCCTTTTGAATTAACTGGCTTCTTGGCTGGATGTTTTCAACTTCTTTCCTGGCTGCGAACTTTTCCCCAA
TTGTTTTCCTTTTACAACAGGGGGAGAAAGTGCTCTGTGGTCCGAGGCGAGCCGTGAAGTTGCGTGTGCGTGGCAGTGTGCGTGGCAGGA
TGTGCGTGCGTGTGTAACCCGAGCCGCCCGATCTGTTTCGATCTGCGCCGCGGAGCCCTCCCTCAAGGCCCGCTCCACCTGCTGCGGTTA
CGCGGCGCTCGTGGGTGTTCGTGCCTCGGAGCAGCTAACCGGCGGGTGCTGGGCGACGGTGGAGGAGTATCGTCTCGCTGCTGCCCGAGT
CAGGGCTGAGTCACCCAGCTGATGTAGACAGTGGCTGCCTTCCGAAGAGTGCGTGTTTGCATGTGTGTGACTCTGCGGCTGCTCAACTCC
CAACAAACCAGAGGACCAGCCACAAACTTAACCAACATCCCCAAACCCGAGTTCACAGATGTGGGAGAGCTGTAGAACCCTGAGTGTCAT
CGACTGGGCCTTCTTATGATTGTTGTTTTAAGATTAGCTGAAGATCTCTGAAACGCTGAATTTTCTGCACTGAGCGTTTTGACAGAATTC
ATTGAGAGAACAGAGAACATGACAAGTACTTCTAGCTCAGCACTGCTCCAACTACTGAAGCTGATTTTCAAGGCTACTTAAAAAAATCTG
CAGCGTACATTAATGGATTTCTGTTGTGTTTAAATTCTCCACAGATTGTATTGTAAATATTTTATGAAGTAGAGCATATGTATATATTTA
TATATACGTGCACATACATTAGTAGCACTACCTTTGGAAGTCTCAGCTCTTGCTTTTCGGGACTGAAGCCAGTTTTGCATGATAAAAGTG
GCCTTGTTACGGGAGATAATTGTGTTCTGTTGGGACTTTAGACAAAACTCACCTGCAAAAAACTGACAGGCATTAACTACTGGAACTTCC
AAATAATGTGTTTGCTGATCGTTTTACTCTTCGCATAAATATTTTAGGAAGTGTATGAGAATTTTGCCTTCAGGAACTTTTCTAACAGCC
AAAGACAGAACTTAACCTCTGCAAGCAAGATTCGTGGAAGATAGTCTCCACTTTTTAATGCACTAAGCAATCGGTTGCTAGGAGCCCATC
CTGGGTCAGAGGCCGATCCGCAGAACCAGAACGTTTTCCCCTCCTGGACTGTTAGTAACTTAGTCTCCCTCCTCCCCTAACCACCCCCGC
CCCCCCCCACCCCCCGCAGTAATAAAGGCCCCTGAACGTGTATGTTGGTCTCCCGGGAGCTGCTTGCTGAAGATCCGCGCCCCTGTCGCC
GTCTGGTAGGAGCTGTTTGCAGGGTCCTAACTCAATCGGCTTGTTGTGATGCGTATCCCCGTAGATGCCAGCACGAGCCGCCGCTTCACG
CCGCCTTCCACCGCGCTGAGCCCAGGCAAGATGAGCGAGGCGTTGCCGCTGGGCGCCCCGGACGCCGGCGCTGCCCTGGCCGGCAAGCTG
AGGAGCGGCGACCGCAGCATGGTGGAGGTGCTGGCCGACCACCCGGGCGAGCTGGTGCGCACCGACAGCCCCAACTTCCTCTGCTCCGTG
CTGCCTACGCACTGGCGCTGCAACAAGACCCTGCCCATCGCTTTCAAGGTGGTGGCCCTAGGGGATGTTCCAGATGGCACTCTGGTCACT
GTGATGGCTGGCAATGATGAAAACTACTCGGCTGAGCTGAGAAATGCTACCGCAGCCATGAAGAACCAGGTTGCAAGATTTAATGACCTC
AGGTTTGTCGGTCGAAGTGGAAGAGGGAAAAGCTTCACTCTGACCATCACTGTCTTCACAAACCCACCGCAAGTCGCCACCTACCACAGA
GCCATCAAAATCACAGTGGATGGGCCCCGAGAACCTCGAAGACATCGGCAGAAACTAGATGATCAGACCAAGCCCGGGAGCTTGTCCTTT
TCCGAGCGGCTCAGTGAACTGGAGCAGCTGCGGCGCACAGCCATGAGGGTCAGCCCACACCACCCAGCCCCCACGCCCAACCCTCGTGCC
TCCCTGAACCACTCCACTGCCTTTAACCCTCAGCCTCAGAGTCAGATGCAGGATACAAGGCAGATCCAACCATCCCCACCGTGGTCCTAC
GATCAGTCCTACCAATACCTGGGATCCATTGCCTCTCCTTCTGTGCACCCAGCAACGCCCATTTCACCTGGACGTGCCAGCGGCATGACA
ACCCTCTCTGCAGAACTTTCCAGTCGACTCTCAACCCTAGGTGATGGCATCGCTCCTCCACAGAAAGTTCTTTTCCCATCTGAGAAGATT
TGTCTTAAGTGGCAACAAACTCATAGAGTTGGAGCTGGGCTCCAGAATTTGGGCAATACCTGTTTTGCCAATGCAGCACTGCAGTGTTTA
ACCTACACACCACCTCTTGCCAATTACATGCTATCACATGAACACTCCAAAACATGTCATGCAGAAGGCTTTTGTATGATGTGTACAATG
CAAGCACATATTACCCAGGCACTCAGTAATCCTGGGGACGTTATTAAACCAATGTTTGTCATCAATGAGATGCGGCGTATAGCTAGGCAC
TTCCGTTTTGGAAACCAAGAAGATGCCCATGAATTCCTTCAATACACTGTTGATGCTATGCAGAAAGCATGCTTGAATGGCAGCAATAAA
TTAGACAGACACACCCAGGCCACCACTCTTGTTTGTCAGATATTTGGAGGATACCTAAGATCTAGAGTCAAATGTTTAAATTGCAAGGGC
GTTTCAGATACTTTTGATCCATATCTTGATATAACATTGGAGATAAAGGCTGCTCAGAGTGTCAACAAGGCATTGGAGCAGTTTGTGAAG
CCGGAACAGCTTGATGGAGAAAACTCGTACAAGTGCAGCAAGTGTAAAAAGATGGTTCCAGCTTCAAAGAGGTTCACTATCCATAGATCC
TCTAATGTTCTTACACTTTCTCTGAAACGTTTTGCAAATTTTACCGGTGGAAAAATTGCTAAGGATGTGAAATACCCTGAGTATCTTGAT
ATTCGGCCATATATGTCTCAACCCAACGGAGAGCCAATTGTCTACGTCTTGTATGCAGTGCTGGTCCACACTGGTTTTAATTGCCATGCT
GGCCATTACTTCTGCTACATAAAAGCTAGCAATGGCCTCTGGTATCAAATGAATGACTCCATTGTATCTACCAGTGATATTAGATCGGTA
CTCAGCCAACAAGCCTATGTGCTCTTTTATATCAGGTCCCATGATGTGAAAAATGGAGGTGAACTTACTCATCCCACCCATAGCCCCGGC
CAGTCCTCTCCCCGCCCCGTCATCAGTCAGCGGGTTGTCACCAACAAACAGGCTGCGCCAGGCTTTATCGGACCACAGCTTCCCTCTCAC
ATGATAAAGAATCCACCTCACTTAAATGGGACTGGACCATTGAAAGACACGCCAAGCAGTTCCATGTCGAGTCCTAACGGGAATTCCAGT
GTCAACAGGGCTAGTCCTGTTAATGCTTCAGCTTCTGTCCAAAACTGGTCAGTTAATAGGTCCTCAGTGATCCCAGAACATCCTAAGAAA
CAAAAAATTACAATCAGTATTCACAACAAGTTGCCTGTTCGCCAGTGTCAGTCTCAACCTAACCTTCATAGTAATTCTTTGGAGAACCCT
ACCAAGCCCGTTCCCTCTTCTACCATTACCAATTCTGCAGTACAGTCTACCTCGAACGCATCTACGATGTCAGTTTCTAGTAAAGTAACA
AAACCGATCCCCCGCAGTGAATCCTGCTCCCAGCCCGTGATGAATGGCAAATCCAAGCTGAACTCCAGCGTGCTGGTGCCCTATGGCGCC
GAGTCCTCTGAGGACTCTGACGAGGAGTCAAAGGGGCTGGGCAAGGAGAATGGGATTGGTACGATTGTGAGCTCCCACTCTCCCGGCCAA
GATGCCGAAGATGAGGAGGCCACTCCGCACGAGCTTCAAGAACCCATGACCCTAAACGGTGCTAATAGTGCAGACAGCGACAGTGACCCG
AAAGAAAACGGCCTAGCGCCTGATGGTGCCAGCTGCCAAGGCCAGCCTGCCCTGCACTCAGAAAATCCCTTTGCTAAGGCAAACGGTCTT
CCTGGAAAGTTGATGCCTGCTCCTTTGCTGTCTCTCCCAGAAGACAAAATCTTAGAGACCTTCAGGCTTAGCAACAAACTGAAAGGCTCG
ACGGATGAAATGAGTGCACCTGGAGCAGAGAGGGGCCCTCCCGAGGACCGCGACGCCGAGCCTCAGCCTGGCAGCCCCGCCGCCGAATCC
CTGGAGGAGCCAGATGCGGCCGCCGGCCTCAGCAGCACCAAGAAGGCTCCGCCGCCCCGCGATCCCGGCACCCCCGCTACCAAAGAAGGC
GCCTGGGAGGCCATGGCCGTCGCCCCCGAGGAGCCTCCGCCCAGCGCCGGCGAGGACATCGTGGGGGACACAGCACCCCCTGACCTGTGT
GATCCCGGGAGCTTAACAGGCGATGCGAGCCCGTTGTCCCAGGACGCAAAGGGGATGATCGCGGAGGGCCCGCGGGACTCGGCGTTGGCG
GAAGCCCCGGAAGGGTTGAGTCCGGCTCCGCCTGCGCGGTCGGAGGAGCCCTGCGAGCAGCCACTCCTTGTTCACCCCAGCGGGGACCAC
GCCCGGGACGCTCAGGACCCATCCCAGAGCTTGGGCGCACCCGAGGCCGCAGAGCGGCCGCCAGCTCCTGTGCTGGACATGGCCCCGGCC
GGTCACCCGGAAGGGGACGCTGAGCCTAGCCCCGGCGAGAGGGTCGAGGACGCCGCGGCGCCGAAAGCCCCAGGCCCTTCCCCAGCGAAG
GAGAAAATCGGCAGCCTCAGAAAGGTGGACCGAGGCCACTACCGCAGCCGGAGAGAGCGCTCGTCCAGCGGGGAGCCCGCCAGAGAGAGC
AGGAGCAAGACTGAGGGCCACCGTCACCGGCGGCGCCGCACCTGCCCCCGGGAGCGCGACCGCCAGGACCGCCACGCCCCGGAGCACCAC
CCCGGCCACGGCGACAGGCTCAGCCCTGGCGAGCGCCGCTCTCTGGGCAGGTGCAGTCACCACCACTCCCGACACCGGAGCGGGGTGGAG
CTGGACTGGGTCAGACACCACTACACCGAGGGCGAGCGTGGCTGGGGCCGGGAGAAGTTCTACCCCGACAGGCCGCGCTGGGACAGGTGC
CGGTACTACCATGACAGGTACGCCCTGTACGCTGCCCGGGACTGGAAGCCCTTCCACGGCGGCCGCGAGCACGAGCGGGCCGGGCTGCAC
GAGCGGCCGCACAAGGACCACAACCGGGGCCGTAGGGGCTGCGAGCCGGCCCGGGAGAGGGAGCGGCACCGCCCCAGCAGCCCCCGCGCA
GGCGCGCCCCACGCCCTCGCCCCGCACCCCGACCGCTTCTCCCACGACAGAACTGCACTTGTAGCCGGAGACAACTGTAACCTCTCTGAT
CGGTTTCACGAACACGAAAATGGAAAGTCCCGGAAACGGAGACACGACAGTGTGGAGAACAGTGACAGTCATGTTGAAAAGAAAGCCCGG
AGGAGCGAACAGAAGGATCCTCTAGAAGAGCCTAAAGCAAAGAAGCACAAAAAATCAAAGAAGAAAAAGAAATCCAAAGACAAACACCGA
GACCGCGACTCCAGGCATCAGCAGGACTCAGACCTCTCAGCAGCGTGCTCTGACGCTGACCTCCACAGACACAAAAAAAAGAAGAAGAAA
AAGAAGAGACATTCAAGAAAATCAGAGGACTTTGTTAAAGATTCAGAACTGCACTTACCCAGGGTCACCAGCTTGGAGACTGTCGCCCAG
TTCCGGAGAGCCCAGGGTGGCTTTCCTCTCTCTGGTGGCCCGCCTCTGGAAGGCGTCGGACCTTTCCGTGAGAAAACGAAACACTTACGG
ATGGAAAGCAGGGATGACAGGTGTCGTCTCTTTGAGTATGGCCAGGGTGATTGAAAACTCAGCCTCAAAACAAAAAATTCACTAGTTATG
ATTCAACGCGTTCAACAGAAGCCATCCCCAGCCCAGCTTAAATTATAAAGATAGACAATAACTCTGTTCCAATCTGCGTGGTGCTTCTTT
AGTAAATACTGTACAGATTTTACCATGGAGAACTTTTTTTTTAGTTTTTACCTTTTCTTAATTACCCTTATTCCGAATGGACGAACACTT
TCTACCACTGCTGACCATTGTAAAATACCGTGTATATAAATCCCATTGAAATAATGCCCTGGAATAGAACATCTCAAATGCTGCTTAATT
ACAGACTCAGGTCGATTACTTGTATTTCATGTAATGTTCCTCCAAGTTAGACATCTGGTGCAAGACCAACCGGGAGACCATGGAATTGTC
AAAAGTACAAACTGACAGTGTGTATATTTAATTTAAAGACTTATTTAAAAACTCACAAGCTCTCACCTAGACTTTGGAGAGCAGTCTGTT
TTCTGTAATGTCTGATACTAGAAACTAATTTGCTTATTTTAGTTGTATTCAAGATTTGAAGATGTATTTTATAGACAAGTTCTGTTTTTG
AACTTTGTGGAACTGTTCCAATCAATCAATTTCCCAGTTATGATGAGTATTTACATTATGAATGTATAACCCAGACATGATTTGTAAAGC
CGACAGTATGTTTCTATTACACAACACTTTTTGATACAGCGTCTCTTGTCTTCACTGATACTGGAGTCTCCGTTGTCTGCTTGGTCCCTT
CGAGTTTCTAGTTACAGACACAATCATACTGTGATTTTATTTTTAATATGGATATGCTATCAAACTGTGATACACTTATAATTCACTGGT
CCTGCATCAGGAGATGGAGTGGGGAAAACTGTATTTAATACAGTTTGTATCTGAATAATCTGTATGGTTTATACAGTTTGTGTTGTTCAG
AGATGTTTAAAGTTTGATCTTTGTTTTTCTAAAGATTAAAAAAGCACTTGCCCCACTGTAAATATACAGCATGTAAAATTTCTATAGTAT

>78697_78697_3_RUNX1-USP42_RUNX1_chr21_36171597_ENST00000344691_USP42_chr7_6154953_ENST00000306177_length(amino acids)=1531AA_BP=289
MRIPVDASTSRRFTPPSTALSPGKMSEALPLGAPDAGAALAGKLRSGDRSMVEVLADHPGELVRTDSPNFLCSVLPTHWRCNKTLPIAFK
VVALGDVPDGTLVTVMAGNDENYSAELRNATAAMKNQVARFNDLRFVGRSGRGKSFTLTITVFTNPPQVATYHRAIKITVDGPREPRRHR
QKLDDQTKPGSLSFSERLSELEQLRRTAMRVSPHHPAPTPNPRASLNHSTAFNPQPQSQMQDTRQIQPSPPWSYDQSYQYLGSIASPSVH
PATPISPGRASGMTTLSAELSSRLSTLGDGIAPPQKVLFPSEKICLKWQQTHRVGAGLQNLGNTCFANAALQCLTYTPPLANYMLSHEHS
KTCHAEGFCMMCTMQAHITQALSNPGDVIKPMFVINEMRRIARHFRFGNQEDAHEFLQYTVDAMQKACLNGSNKLDRHTQATTLVCQIFG
GYLRSRVKCLNCKGVSDTFDPYLDITLEIKAAQSVNKALEQFVKPEQLDGENSYKCSKCKKMVPASKRFTIHRSSNVLTLSLKRFANFTG
GKIAKDVKYPEYLDIRPYMSQPNGEPIVYVLYAVLVHTGFNCHAGHYFCYIKASNGLWYQMNDSIVSTSDIRSVLSQQAYVLFYIRSHDV
KNGGELTHPTHSPGQSSPRPVISQRVVTNKQAAPGFIGPQLPSHMIKNPPHLNGTGPLKDTPSSSMSSPNGNSSVNRASPVNASASVQNW
SVNRSSVIPEHPKKQKITISIHNKLPVRQCQSQPNLHSNSLENPTKPVPSSTITNSAVQSTSNASTMSVSSKVTKPIPRSESCSQPVMNG
KSKLNSSVLVPYGAESSEDSDEESKGLGKENGIGTIVSSHSPGQDAEDEEATPHELQEPMTLNGANSADSDSDPKENGLAPDGASCQGQP
ALHSENPFAKANGLPGKLMPAPLLSLPEDKILETFRLSNKLKGSTDEMSAPGAERGPPEDRDAEPQPGSPAAESLEEPDAAAGLSSTKKA
PPPRDPGTPATKEGAWEAMAVAPEEPPPSAGEDIVGDTAPPDLCDPGSLTGDASPLSQDAKGMIAEGPRDSALAEAPEGLSPAPPARSEE
PCEQPLLVHPSGDHARDAQDPSQSLGAPEAAERPPAPVLDMAPAGHPEGDAEPSPGERVEDAAAPKAPGPSPAKEKIGSLRKVDRGHYRS
RRERSSSGEPARESRSKTEGHRHRRRRTCPRERDRQDRHAPEHHPGHGDRLSPGERRSLGRCSHHHSRHRSGVELDWVRHHYTEGERGWG
REKFYPDRPRWDRCRYYHDRYALYAARDWKPFHGGREHERAGLHERPHKDHNRGRRGCEPARERERHRPSSPRAGAPHALAPHPDRFSHD
RTALVAGDNCNLSDRFHEHENGKSRKRRHDSVENSDSHVEKKARRSEQKDPLEEPKAKKHKKSKKKKKSKDKHRDRDSRHQQDSDLSAAC
SDADLHRHKKKKKKKKRHSRKSEDFVKDSELHLPRVTSLETVAQFRRAQGGFPLSGGPPLEGVGPFREKTKHLRMESRDDRCRLFEYGQG

--------------------------------------------------------------
>78697_78697_4_RUNX1-USP42_RUNX1_chr21_36171597_ENST00000399240_USP42_chr7_6154953_ENST00000306177_length(transcript)=5521nt_BP=765nt
TGAAGATCCGCGCCCCTGTCGCCGTCTGGTAGGAGCTGTTTGCAGGGTCCTAACTCAATCGGCTTGTTGTGATGCGTATCCCCGTAGATG
CCAGCACGAGCCGCCGCTTCACGCCGCCTTCCACCGCGCTGAGCCCAGGCAAGATGAGCGAGGCGTTGCCGCTGGGCGCCCCGGACGCCG
GCGCTGCCCTGGCCGGCAAGCTGAGGAGCGGCGACCGCAGCATGGTGGAGGTGCTGGCCGACCACCCGGGCGAGCTGGTGCGCACCGACA
GCCCCAACTTCCTCTGCTCCGTGCTGCCTACGCACTGGCGCTGCAACAAGACCCTGCCCATCGCTTTCAAGGTGGTGGCCCTAGGGGATG
TTCCAGATGGCACTCTGGTCACTGTGATGGCTGGCAATGATGAAAACTACTCGGCTGAGCTGAGAAATGCTACCGCAGCCATGAAGAACC
AGGTTGCAAGATTTAATGACCTCAGGTTTGTCGGTCGAAGTGGAAGAGGGAAAAGCTTCACTCTGACCATCACTGTCTTCACAAACCCAC
CGCAAGTCGCCACCTACCACAGAGCCATCAAAATCACAGTGGATGGGCCCCGAGAACCTCGAAATACAAGGCAGATCCAACCATCCCCAC
CGTGGTCCTACGATCAGTCCTACCAATACCTGGGATCCATTGCCTCTCCTTCTGTGCACCCAGCAACGCCCATTTCACCTGGACGTGCCA
GCGGCATGACAACCCTCTCTGCAGAACTTTCCAGTCGACTCTCAACCCTAGGTGATGGCATCGCTCCTCCACAGAAAGTTCTTTTCCCAT
CTGAGAAGATTTGTCTTAAGTGGCAACAAACTCATAGAGTTGGAGCTGGGCTCCAGAATTTGGGCAATACCTGTTTTGCCAATGCAGCAC
TGCAGTGTTTAACCTACACACCACCTCTTGCCAATTACATGCTATCACATGAACACTCCAAAACATGTCATGCAGAAGGCTTTTGTATGA
TGTGTACAATGCAAGCACATATTACCCAGGCACTCAGTAATCCTGGGGACGTTATTAAACCAATGTTTGTCATCAATGAGATGCGGCGTA
TAGCTAGGCACTTCCGTTTTGGAAACCAAGAAGATGCCCATGAATTCCTTCAATACACTGTTGATGCTATGCAGAAAGCATGCTTGAATG
GCAGCAATAAATTAGACAGACACACCCAGGCCACCACTCTTGTTTGTCAGATATTTGGAGGATACCTAAGATCTAGAGTCAAATGTTTAA
ATTGCAAGGGCGTTTCAGATACTTTTGATCCATATCTTGATATAACATTGGAGATAAAGGCTGCTCAGAGTGTCAACAAGGCATTGGAGC
AGTTTGTGAAGCCGGAACAGCTTGATGGAGAAAACTCGTACAAGTGCAGCAAGTGTAAAAAGATGGTTCCAGCTTCAAAGAGGTTCACTA
TCCATAGATCCTCTAATGTTCTTACACTTTCTCTGAAACGTTTTGCAAATTTTACCGGTGGAAAAATTGCTAAGGATGTGAAATACCCTG
AGTATCTTGATATTCGGCCATATATGTCTCAACCCAACGGAGAGCCAATTGTCTACGTCTTGTATGCAGTGCTGGTCCACACTGGTTTTA
ATTGCCATGCTGGCCATTACTTCTGCTACATAAAAGCTAGCAATGGCCTCTGGTATCAAATGAATGACTCCATTGTATCTACCAGTGATA
TTAGATCGGTACTCAGCCAACAAGCCTATGTGCTCTTTTATATCAGGTCCCATGATGTGAAAAATGGAGGTGAACTTACTCATCCCACCC
ATAGCCCCGGCCAGTCCTCTCCCCGCCCCGTCATCAGTCAGCGGGTTGTCACCAACAAACAGGCTGCGCCAGGCTTTATCGGACCACAGC
TTCCCTCTCACATGATAAAGAATCCACCTCACTTAAATGGGACTGGACCATTGAAAGACACGCCAAGCAGTTCCATGTCGAGTCCTAACG
GGAATTCCAGTGTCAACAGGGCTAGTCCTGTTAATGCTTCAGCTTCTGTCCAAAACTGGTCAGTTAATAGGTCCTCAGTGATCCCAGAAC
ATCCTAAGAAACAAAAAATTACAATCAGTATTCACAACAAGTTGCCTGTTCGCCAGTGTCAGTCTCAACCTAACCTTCATAGTAATTCTT
TGGAGAACCCTACCAAGCCCGTTCCCTCTTCTACCATTACCAATTCTGCAGTACAGTCTACCTCGAACGCATCTACGATGTCAGTTTCTA
GTAAAGTAACAAAACCGATCCCCCGCAGTGAATCCTGCTCCCAGCCCGTGATGAATGGCAAATCCAAGCTGAACTCCAGCGTGCTGGTGC
CCTATGGCGCCGAGTCCTCTGAGGACTCTGACGAGGAGTCAAAGGGGCTGGGCAAGGAGAATGGGATTGGTACGATTGTGAGCTCCCACT
CTCCCGGCCAAGATGCCGAAGATGAGGAGGCCACTCCGCACGAGCTTCAAGAACCCATGACCCTAAACGGTGCTAATAGTGCAGACAGCG
ACAGTGACCCGAAAGAAAACGGCCTAGCGCCTGATGGTGCCAGCTGCCAAGGCCAGCCTGCCCTGCACTCAGAAAATCCCTTTGCTAAGG
CAAACGGTCTTCCTGGAAAGTTGATGCCTGCTCCTTTGCTGTCTCTCCCAGAAGACAAAATCTTAGAGACCTTCAGGCTTAGCAACAAAC
TGAAAGGCTCGACGGATGAAATGAGTGCACCTGGAGCAGAGAGGGGCCCTCCCGAGGACCGCGACGCCGAGCCTCAGCCTGGCAGCCCCG
CCGCCGAATCCCTGGAGGAGCCAGATGCGGCCGCCGGCCTCAGCAGCACCAAGAAGGCTCCGCCGCCCCGCGATCCCGGCACCCCCGCTA
CCAAAGAAGGCGCCTGGGAGGCCATGGCCGTCGCCCCCGAGGAGCCTCCGCCCAGCGCCGGCGAGGACATCGTGGGGGACACAGCACCCC
CTGACCTGTGTGATCCCGGGAGCTTAACAGGCGATGCGAGCCCGTTGTCCCAGGACGCAAAGGGGATGATCGCGGAGGGCCCGCGGGACT
CGGCGTTGGCGGAAGCCCCGGAAGGGTTGAGTCCGGCTCCGCCTGCGCGGTCGGAGGAGCCCTGCGAGCAGCCACTCCTTGTTCACCCCA
GCGGGGACCACGCCCGGGACGCTCAGGACCCATCCCAGAGCTTGGGCGCACCCGAGGCCGCAGAGCGGCCGCCAGCTCCTGTGCTGGACA
TGGCCCCGGCCGGTCACCCGGAAGGGGACGCTGAGCCTAGCCCCGGCGAGAGGGTCGAGGACGCCGCGGCGCCGAAAGCCCCAGGCCCTT
CCCCAGCGAAGGAGAAAATCGGCAGCCTCAGAAAGGTGGACCGAGGCCACTACCGCAGCCGGAGAGAGCGCTCGTCCAGCGGGGAGCCCG
CCAGAGAGAGCAGGAGCAAGACTGAGGGCCACCGTCACCGGCGGCGCCGCACCTGCCCCCGGGAGCGCGACCGCCAGGACCGCCACGCCC
CGGAGCACCACCCCGGCCACGGCGACAGGCTCAGCCCTGGCGAGCGCCGCTCTCTGGGCAGGTGCAGTCACCACCACTCCCGACACCGGA
GCGGGGTGGAGCTGGACTGGGTCAGACACCACTACACCGAGGGCGAGCGTGGCTGGGGCCGGGAGAAGTTCTACCCCGACAGGCCGCGCT
GGGACAGGTGCCGGTACTACCATGACAGGTACGCCCTGTACGCTGCCCGGGACTGGAAGCCCTTCCACGGCGGCCGCGAGCACGAGCGGG
CCGGGCTGCACGAGCGGCCGCACAAGGACCACAACCGGGGCCGTAGGGGCTGCGAGCCGGCCCGGGAGAGGGAGCGGCACCGCCCCAGCA
GCCCCCGCGCAGGCGCGCCCCACGCCCTCGCCCCGCACCCCGACCGCTTCTCCCACGACAGAACTGCACTTGTAGCCGGAGACAACTGTA
ACCTCTCTGATCGGTTTCACGAACACGAAAATGGAAAGTCCCGGAAACGGAGACACGACAGTGTGGAGAACAGTGACAGTCATGTTGAAA
AGAAAGCCCGGAGGAGCGAACAGAAGGATCCTCTAGAAGAGCCTAAAGCAAAGAAGCACAAAAAATCAAAGAAGAAAAAGAAATCCAAAG
ACAAACACCGAGACCGCGACTCCAGGCATCAGCAGGACTCAGACCTCTCAGCAGCGTGCTCTGACGCTGACCTCCACAGACACAAAAAAA
AGAAGAAGAAAAAGAAGAGACATTCAAGAAAATCAGAGGACTTTGTTAAAGATTCAGAACTGCACTTACCCAGGGTCACCAGCTTGGAGA
CTGTCGCCCAGTTCCGGAGAGCCCAGGGTGGCTTTCCTCTCTCTGGTGGCCCGCCTCTGGAAGGCGTCGGACCTTTCCGTGAGAAAACGA
AACACTTACGGATGGAAAGCAGGGATGACAGGTGTCGTCTCTTTGAGTATGGCCAGGGTGATTGAAAACTCAGCCTCAAAACAAAAAATT
CACTAGTTATGATTCAACGCGTTCAACAGAAGCCATCCCCAGCCCAGCTTAAATTATAAAGATAGACAATAACTCTGTTCCAATCTGCGT
GGTGCTTCTTTAGTAAATACTGTACAGATTTTACCATGGAGAACTTTTTTTTTAGTTTTTACCTTTTCTTAATTACCCTTATTCCGAATG
GACGAACACTTTCTACCACTGCTGACCATTGTAAAATACCGTGTATATAAATCCCATTGAAATAATGCCCTGGAATAGAACATCTCAAAT
GCTGCTTAATTACAGACTCAGGTCGATTACTTGTATTTCATGTAATGTTCCTCCAAGTTAGACATCTGGTGCAAGACCAACCGGGAGACC
ATGGAATTGTCAAAAGTACAAACTGACAGTGTGTATATTTAATTTAAAGACTTATTTAAAAACTCACAAGCTCTCACCTAGACTTTGGAG
AGCAGTCTGTTTTCTGTAATGTCTGATACTAGAAACTAATTTGCTTATTTTAGTTGTATTCAAGATTTGAAGATGTATTTTATAGACAAG
TTCTGTTTTTGAACTTTGTGGAACTGTTCCAATCAATCAATTTCCCAGTTATGATGAGTATTTACATTATGAATGTATAACCCAGACATG
ATTTGTAAAGCCGACAGTATGTTTCTATTACACAACACTTTTTGATACAGCGTCTCTTGTCTTCACTGATACTGGAGTCTCCGTTGTCTG
CTTGGTCCCTTCGAGTTTCTAGTTACAGACACAATCATACTGTGATTTTATTTTTAATATGGATATGCTATCAAACTGTGATACACTTAT
AATTCACTGGTCCTGCATCAGGAGATGGAGTGGGGAAAACTGTATTTAATACAGTTTGTATCTGAATAATCTGTATGGTTTATACAGTTT
GTGTTGTTCAGAGATGTTTAAAGTTTGATCTTTGTTTTTCTAAAGATTAAAAAAGCACTTGCCCCACTGTAAATATACAGCATGTAAAAT

>78697_78697_4_RUNX1-USP42_RUNX1_chr21_36171597_ENST00000399240_USP42_chr7_6154953_ENST00000306177_length(amino acids)=1467AA_BP=225
MRIPVDASTSRRFTPPSTALSPGKMSEALPLGAPDAGAALAGKLRSGDRSMVEVLADHPGELVRTDSPNFLCSVLPTHWRCNKTLPIAFK
VVALGDVPDGTLVTVMAGNDENYSAELRNATAAMKNQVARFNDLRFVGRSGRGKSFTLTITVFTNPPQVATYHRAIKITVDGPREPRNTR
QIQPSPPWSYDQSYQYLGSIASPSVHPATPISPGRASGMTTLSAELSSRLSTLGDGIAPPQKVLFPSEKICLKWQQTHRVGAGLQNLGNT
CFANAALQCLTYTPPLANYMLSHEHSKTCHAEGFCMMCTMQAHITQALSNPGDVIKPMFVINEMRRIARHFRFGNQEDAHEFLQYTVDAM
QKACLNGSNKLDRHTQATTLVCQIFGGYLRSRVKCLNCKGVSDTFDPYLDITLEIKAAQSVNKALEQFVKPEQLDGENSYKCSKCKKMVP
ASKRFTIHRSSNVLTLSLKRFANFTGGKIAKDVKYPEYLDIRPYMSQPNGEPIVYVLYAVLVHTGFNCHAGHYFCYIKASNGLWYQMNDS
IVSTSDIRSVLSQQAYVLFYIRSHDVKNGGELTHPTHSPGQSSPRPVISQRVVTNKQAAPGFIGPQLPSHMIKNPPHLNGTGPLKDTPSS
SMSSPNGNSSVNRASPVNASASVQNWSVNRSSVIPEHPKKQKITISIHNKLPVRQCQSQPNLHSNSLENPTKPVPSSTITNSAVQSTSNA
STMSVSSKVTKPIPRSESCSQPVMNGKSKLNSSVLVPYGAESSEDSDEESKGLGKENGIGTIVSSHSPGQDAEDEEATPHELQEPMTLNG
ANSADSDSDPKENGLAPDGASCQGQPALHSENPFAKANGLPGKLMPAPLLSLPEDKILETFRLSNKLKGSTDEMSAPGAERGPPEDRDAE
PQPGSPAAESLEEPDAAAGLSSTKKAPPPRDPGTPATKEGAWEAMAVAPEEPPPSAGEDIVGDTAPPDLCDPGSLTGDASPLSQDAKGMI
AEGPRDSALAEAPEGLSPAPPARSEEPCEQPLLVHPSGDHARDAQDPSQSLGAPEAAERPPAPVLDMAPAGHPEGDAEPSPGERVEDAAA
PKAPGPSPAKEKIGSLRKVDRGHYRSRRERSSSGEPARESRSKTEGHRHRRRRTCPRERDRQDRHAPEHHPGHGDRLSPGERRSLGRCSH
HHSRHRSGVELDWVRHHYTEGERGWGREKFYPDRPRWDRCRYYHDRYALYAARDWKPFHGGREHERAGLHERPHKDHNRGRRGCEPARER
ERHRPSSPRAGAPHALAPHPDRFSHDRTALVAGDNCNLSDRFHEHENGKSRKRRHDSVENSDSHVEKKARRSEQKDPLEEPKAKKHKKSK
KKKKSKDKHRDRDSRHQQDSDLSAACSDADLHRHKKKKKKKKRHSRKSEDFVKDSELHLPRVTSLETVAQFRRAQGGFPLSGGPPLEGVG

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for RUNX1-USP42


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for RUNX1-USP42


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for RUNX1-USP42


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource