Fusion Gene Studies
in Kim Lab

FusionBase FusionGDB FusionGDB2 FusionPDB FusionNeoAntigen FusionAI FusionNW FGviewer Publication Contact
FusionGDB Logo

Home

Download

Statistics

Examples

Help

Contact

Center for Computational Systems Medicine
leaf

Fusion Gene Summary

leaf

Fusion Gene ORF analysis

leaf

Fusion Genomic Features

leaf

Fusion Protein Features

leaf

Fusion Gene Sequence

leaf

Fusion Gene PPI analysis

leaf

Related Drugs

leaf

Related Diseases

Fusion gene:CHD2-MYH9 (FusionGDB2 ID:HG1106TG4627)

Fusion Gene Summary for CHD2-MYH9

check button Fusion gene summary
Fusion gene informationFusion gene name: CHD2-MYH9
Fusion gene ID: hg1106tg4627
HgeneTgene
Gene symbol

CHD2

MYH9

Gene ID

1106

4627

Gene namechromodomain helicase DNA binding protein 2myosin heavy chain 9
SynonymsEEOCBDPLT6|DFNA17|EPSTS|FTNS|MATINS|MHA|NMHC-II-A|NMMHC-IIA|NMMHCA
Cytomap('CHD2')('MYH9')

15q26.1

22q12.3

Type of geneprotein-codingprotein-coding
Descriptionchromodomain-helicase-DNA-binding protein 2ATP-dependent helicase CHD2CHD-2myosin-9cellular myosin heavy chain, type Amyosin, heavy chain 9, non-musclenon-muscle myosin heavy chain 9non-muscle myosin heavy chain Anon-muscle myosin heavy chain IIanon-muscle myosin heavy polypeptide 9nonmuscle myosin heavy chain II-A
Modification date2020031320200315
UniProtAcc

O14647

P35579

Ensembl transtripts involved in fusion geneENST00000554122, ENST00000394196, 
ENST00000420239, ENST00000557381, 
ENST00000536619, 
Fusion gene scores* DoF score22 X 13 X 11=314644 X 46 X 15=30360
# samples 2356
** MAII scorelog2(23/3146*10)=-3.77381290447131
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
log2(56/30360*10)=-5.76060115335786
possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs).
DoF>8 and MAII<0
Context

PubMed: CHD2 [Title/Abstract] AND MYH9 [Title/Abstract] AND fusion [Title/Abstract]

Most frequent breakpointCHD2(93444529)-MYH9(36701148), # samples:1
Anticipated loss of major functional domain due to fusion event.CHD2-MYH9 seems lost the major protein functional domain in Hgene partner, which is a CGC by not retaining the major functional domain in the partially deleted in-frame ORF.
CHD2-MYH9 seems lost the major protein functional domain in Hgene partner, which is a CGC by not retaining the major functional domain in the partially deleted in-frame ORF.
CHD2-MYH9 seems lost the major protein functional domain in Hgene partner, which is a essential gene by not retaining the major functional domain in the partially deleted in-frame ORF.
CHD2-MYH9 seems lost the major protein functional domain in Hgene partner, which is a essential gene by not retaining the major functional domain in the partially deleted in-frame ORF.
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types
** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10)

check button Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez
PartnerGeneGO IDGO termPubMed ID
TgeneMYH9

GO:0001525

angiogenesis

16403913

TgeneMYH9

GO:0001778

plasma membrane repair

27325790

TgeneMYH9

GO:0006509

membrane protein ectodomain proteolysis

16186248

TgeneMYH9

GO:0030048

actin filament-based movement

12237319|15845534

TgeneMYH9

GO:0031032

actomyosin structure organization

24072716


check buttonFusion gene breakpoints across CHD2 (5'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure
check buttonFusion gene breakpoints across MYH9 (3'-gene)
* Click on the image to open the UCSC genome browser with custom track showing this image in a new window.
all structure

check button Fusion gene information
* All genome coordinats were lifted-over on hg19.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
SourceDiseaseSampleHgeneHchrHbpHstrandTgeneTchrTbpTstrand
ChimerDB4OVTCGA-29-1688CHD2chr15

93444529

+MYH9chr22

36701148

-


Top

Fusion Gene ORF analysis for CHD2-MYH9

check button Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure.
* Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser.
ORFHenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrand
3UTR-3CDSENST00000554122ENST00000216181CHD2chr15

93444529

+MYH9chr22

36701148

-
3UTR-intronENST00000554122ENST00000401701CHD2chr15

93444529

+MYH9chr22

36701148

-
3UTR-intronENST00000554122ENST00000475726CHD2chr15

93444529

+MYH9chr22

36701148

-
5CDS-intronENST00000394196ENST00000401701CHD2chr15

93444529

+MYH9chr22

36701148

-
5CDS-intronENST00000394196ENST00000475726CHD2chr15

93444529

+MYH9chr22

36701148

-
5CDS-intronENST00000420239ENST00000401701CHD2chr15

93444529

+MYH9chr22

36701148

-
5CDS-intronENST00000420239ENST00000475726CHD2chr15

93444529

+MYH9chr22

36701148

-
5CDS-intronENST00000557381ENST00000401701CHD2chr15

93444529

+MYH9chr22

36701148

-
5CDS-intronENST00000557381ENST00000475726CHD2chr15

93444529

+MYH9chr22

36701148

-
In-frameENST00000394196ENST00000216181CHD2chr15

93444529

+MYH9chr22

36701148

-
In-frameENST00000420239ENST00000216181CHD2chr15

93444529

+MYH9chr22

36701148

-
In-frameENST00000557381ENST00000216181CHD2chr15

93444529

+MYH9chr22

36701148

-
intron-3CDSENST00000536619ENST00000216181CHD2chr15

93444529

+MYH9chr22

36701148

-
intron-intronENST00000536619ENST00000401701CHD2chr15

93444529

+MYH9chr22

36701148

-
intron-intronENST00000536619ENST00000475726CHD2chr15

93444529

+MYH9chr22

36701148

-

check buttonORFfinder result based on the fusion transcript sequence of in-frame fusion genes.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandSeq length
(transcript)
BP loci
(transcript)
Predicted start
(transcript)
Predicted stop
(transcript)
Seq length
(amino acids)
ENST00000394196CHD2chr1593444529+ENST00000216181MYH9chr2236701148-62411130106848531261
ENST00000557381CHD2chr1593444529+ENST00000216181MYH9chr2236701148-588076970744921261
ENST00000420239CHD2chr1593444529+ENST00000216181MYH9chr2236701148-573462356143461261

check buttonDeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated.
HenstTenstHgeneHchrHbpHstrandTgeneTchrTbpTstrandNo-coding scoreCoding score
ENST00000394196ENST00000216181CHD2chr1593444529+MYH9chr2236701148-0.0193800160.98062
ENST00000557381ENST00000216181CHD2chr1593444529+MYH9chr2236701148-0.0188191350.9811809
ENST00000420239ENST00000216181CHD2chr1593444529+MYH9chr2236701148-0.0194213930.98057866

Top

Fusion Genomic Features for CHD2-MYH9


check buttonFusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints.
HgeneHchrHbpHstrandTgeneTchrTbpTstrand1-pp (fusion gene breakpoint)

check buttonDistribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page.
genomic feature

Top

Fusion Protein Features for CHD2-MYH9


check button Four levels of functional features of fusion genes
Go to FGviewer search page for the most frequent breakpoint (https://ccsmweb.uth.edu/FGviewer/chr15:93444529/chr22:36701148)
- FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels.
- How to search
1. Put your fusion gene symbol.
2. Press the tab key until there will be shown the breakpoint information filled.
4. Go down and press 'Search' tab twice.
4. Go down to have the hyperlink of the search result.
5. Click the hyperlink.
6. See the FGviewer result for your fusion gene.
FGviewer

check buttonMain function of each fusion partner protein. (from UniProt)
HgeneTgene
CHD2

O14647

MYH9

P35579

FUNCTION: DNA-binding helicase that specifically binds to the promoter of target genes, leading to chromatin remodeling, possibly by promoting deposition of histone H3.3. Involved in myogenesis via interaction with MYOD1: binds to myogenic gene regulatory sequences and mediates incorporation of histone H3.3 prior to the onset of myogenic gene expression, promoting their expression (By similarity). {ECO:0000250}.FUNCTION: Cellular myosin that appears to play a role in cytokinesis, cell shape, and specialized functions such as secretion and capping. Promotes also cell motility together with S100A4 (PubMed:16707441). During cell spreading, plays an important role in cytoskeleton reorganization, focal contacts formation (in the margins but not the central part of spreading cells), and lamellipodial retraction; this function is mechanically antagonized by MYH10 (PubMed:20052411). {ECO:0000250|UniProtKB:Q8VDD5, ECO:0000269|PubMed:16707441, ECO:0000269|PubMed:20052411}.

check buttonRetention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at

download page


* Minus value of BPloci means that the break pointn is located before the CDS.
- In-frame and retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
TgeneMYH9chr15:93444529chr22:36701148ENST000002161811641837_19267191961.0Coiled coilOntology_term=ECO:0000255
TgeneMYH9chr15:93444529chr22:36701148ENST000002161811641779_8087191961.0DomainIQ

- In-frame and not-retained protein feature among the 13 regional features.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenProtein featureProtein feature note
HgeneCHD2chr15:93444529chr22:36701148ENST00000394196+239121_139201829.0Compositional biasNote=Ser-rich
HgeneCHD2chr15:93444529chr22:36701148ENST00000394196+23913_74201829.0Compositional biasNote=Ser-rich
HgeneCHD2chr15:93444529chr22:36701148ENST00000394196+239966_1066201829.0Compositional biasNote=Glu-rich
HgeneCHD2chr15:93444529chr22:36701148ENST00000420239+213121_13920502.0Compositional biasNote=Ser-rich
HgeneCHD2chr15:93444529chr22:36701148ENST00000420239+21313_7420502.0Compositional biasNote=Ser-rich
HgeneCHD2chr15:93444529chr22:36701148ENST00000420239+213966_106620502.0Compositional biasNote=Glu-rich
HgeneCHD2chr15:93444529chr22:36701148ENST00000557381+238121_139201740.0Compositional biasNote=Ser-rich
HgeneCHD2chr15:93444529chr22:36701148ENST00000557381+23813_74201740.0Compositional biasNote=Ser-rich
HgeneCHD2chr15:93444529chr22:36701148ENST00000557381+238966_1066201740.0Compositional biasNote=Glu-rich
HgeneCHD2chr15:93444529chr22:36701148ENST00000394196+239261_353201829.0DomainChromo 1
HgeneCHD2chr15:93444529chr22:36701148ENST00000394196+239378_456201829.0DomainChromo 2
HgeneCHD2chr15:93444529chr22:36701148ENST00000394196+239496_666201829.0DomainHelicase ATP-binding
HgeneCHD2chr15:93444529chr22:36701148ENST00000394196+239795_946201829.0DomainHelicase C-terminal
HgeneCHD2chr15:93444529chr22:36701148ENST00000420239+213261_35320502.0DomainChromo 1
HgeneCHD2chr15:93444529chr22:36701148ENST00000420239+213378_45620502.0DomainChromo 2
HgeneCHD2chr15:93444529chr22:36701148ENST00000420239+213496_66620502.0DomainHelicase ATP-binding
HgeneCHD2chr15:93444529chr22:36701148ENST00000420239+213795_94620502.0DomainHelicase C-terminal
HgeneCHD2chr15:93444529chr22:36701148ENST00000557381+238261_353201740.0DomainChromo 1
HgeneCHD2chr15:93444529chr22:36701148ENST00000557381+238378_456201740.0DomainChromo 2
HgeneCHD2chr15:93444529chr22:36701148ENST00000557381+238496_666201740.0DomainHelicase ATP-binding
HgeneCHD2chr15:93444529chr22:36701148ENST00000557381+238795_946201740.0DomainHelicase C-terminal
HgeneCHD2chr15:93444529chr22:36701148ENST00000394196+239617_620201829.0MotifNote=DEAH box
HgeneCHD2chr15:93444529chr22:36701148ENST00000420239+213617_62020502.0MotifNote=DEAH box
HgeneCHD2chr15:93444529chr22:36701148ENST00000557381+238617_620201740.0MotifNote=DEAH box
HgeneCHD2chr15:93444529chr22:36701148ENST00000394196+239509_516201829.0Nucleotide bindingATP
HgeneCHD2chr15:93444529chr22:36701148ENST00000420239+213509_51620502.0Nucleotide bindingATP
HgeneCHD2chr15:93444529chr22:36701148ENST00000557381+238509_516201740.0Nucleotide bindingATP
TgeneMYH9chr15:93444529chr22:36701148ENST00000216181164127_777191961.0DomainMyosin N-terminal SH3-like
TgeneMYH9chr15:93444529chr22:36701148ENST00000216181164181_7767191961.0DomainMyosin motor
TgeneMYH9chr15:93444529chr22:36701148ENST000002161811641174_1817191961.0Nucleotide bindingATP
TgeneMYH9chr15:93444529chr22:36701148ENST000002161811641654_6767191961.0RegionNote=Actin-binding


Top

Fusion Gene Sequence for CHD2-MYH9


check button For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones.
>16303_16303_1_CHD2-MYH9_CHD2_chr15_93444529_ENST00000394196_MYH9_chr22_36701148_ENST00000216181_length(transcript)=6241nt_BP=1130nt
TAAAGAAGGAAAAACTTGAATTTATATTATGAAATTAAGGTTTTACTTTTATTTAGCAATCATTTCACCCATTGGGGGATTAATTTTTAA
AAAGATTAGTACTTAGCTGAGCTTTAGATTCTCTAAGAATTATCCCTTTTTCAAAATGGCTTGATAAAACACCACTTCCTGTCCAGGGAG
GAAGTTATTTTAGCTCAAGGACTGAAATATTTAGCAGATCTTTAGACAAAAGGAACAAATTACATTTCTCTCTGGCTTAAAAAGGGTGCT
TGAGCCAGAGGGGGTTAAGAAGCCCTCCCCCCTCCCCCTGTTGGGGTGGTCTGGAGGGGCAGGGAGGTGTGGGGGTGGGGGTCCTTAGGC
TCCCTGGGATGGAGGATCTGTCTCTCTCTCTCTCTCTCCTTTTTTTTTTTTTGGTGGAGATGAAGGGGTGGGTCTATGGTACATCACCTG
AGTTGTGGGGTAAATGTAGAGAGTGTCAATCAAAGGCAGAGCTCTCAGAGCTGGGAAGGAGGCTCTAGATGGCGGCTGTGCCTTAGAGAG
AGCGCGCTCTGCTCCCTGCCTTTGCCTCACTTTACGCAACTTTCCCTAACTTTCGGGCAGCCTCAGGGGGCCCCCGTAGCCCCCTGCCTT
TCCTAGGGACTTACTGGGGTCGATTCGAACCTTTTTTTGGGAGAAAAGCAGCTTTTAGGAGCTTTCTTTTCGTGCCTTGTTGGAAAGAAG
CAGCCGTACTGAGAGCCCAGGTCGTTGTTTTTTCCAGCTTAGAAGCCATGGCGCACCTCCATTTTTGTGCGCTCTCCTAATGAGGTTTTT
TTTCTTTCGGACCTGTTTTAGTATTAATTATTGCTTTATTTTTTTGACCAGTTAACATATTTGAGGGTTATTTTATTTATTTTTCGTTTT
TTAACGGAGGATTTTGCCTTTATTTTTAATTATTTGGGATCTGATATTTTTCTACTAGTAGATAGGACTCTTGGTTTGGACATACTACAT
GGATCAGTAAATACCTGGGCACAGGACTTCAAAGCAAACACAGATTCCCCCTCCCCCTTAATATTTAAGAATTAAAAGATGATGAGAAAT
AAGGACAAAAGCCAAGAGGAGGACAGTTCGCTACACAGCAATGCATCGAGATATGAGATCCTGACTCCAAACTCCATTCCCAAGGGTTTC
ATGGACGGGAAGCAGGCGTGCGTGCTCATGATAAAAGCCCTGGAGCTCGACAGCAATCTGTACCGCATTGGCCAGAGCAAAGTCTTCTTC
CGTGCCGGTGTGCTGGCCCACCTGGAGGAGGAGCGAGACCTGAAGATCACCGACGTCATCATAGGGTTCCAGGCCTGCTGCAGGGGCTAC
CTGGCCAGGAAAGCATTTGCCAAGCGGCAGCAGCAGCTTACCGCCATGAAGGTCCTCCAGCGGAACTGCGCTGCCTACCTGAAGCTGCGG
AACTGGCAGTGGTGGCGGCTCTTCACCAAGGTCAAGCCGCTGCTGCAGGTGAGCCGGCAGGAGGAGGAGATGATGGCCAAGGAGGAGGAG
CTGGTGAAGGTCAGAGAGAAGCAGCTGGCTGCGGAGAACAGGCTCACGGAGATGGAGACGCTGCAGTCTCAGCTCATGGCAGAGAAATTG
CAGCTGCAGGAGCAGCTCCAGGCAGAAACCGAGCTGTGTGCCGAGGCTGAGGAGCTCCGGGCCCGCCTGACCGCCAAGAAGCAGGAATTA
GAAGAGATCTGCCATGACCTAGAGGCCAGGGTGGAGGAGGAGGAGGAGCGCTGCCAGCACCTGCAGGCGGAGAAGAAGAAGATGCAGCAG
AACATCCAGGAGCTTGAGGAGCAGCTGGAGGAGGAGGAGAGCGCCCGGCAGAAGCTGCAGCTGGAGAAGGTGACCACCGAGGCGAAGCTG
AAAAAGCTGGAGGAGGAGCAGATCATCCTGGAGGACCAGAACTGCAAGCTGGCCAAGGAAAAGAAACTGCTGGAAGACAGAATAGCTGAG
TTCACCACCAACCTCACAGAAGAGGAGGAGAAATCTAAGAGCCTCGCCAAGCTCAAGAACAAGCATGAGGCAATGATCACTGACTTGGAA
GAGCGCCTCCGCAGGGAGGAGAAGCAGCGACAGGAGCTGGAGAAGACCCGCCGGAAGCTGGAGGGAGACTCCACAGACCTCAGCGACCAG
ATCGCCGAGCTCCAGGCCCAGATCGCGGAGCTCAAGATGCAGCTGGCCAAGAAAGAGGAGGAGCTCCAGGCCGCCCTGGCCAGAGTGGAA
GAGGAAGCTGCCCAGAAGAACATGGCCCTCAAGAAGATCCGGGAGCTGGAATCTCAGATCTCTGAACTCCAGGAAGACCTGGAGTCTGAG
CGTGCTTCCAGGAATAAAGCTGAGAAGCAGAAACGGGACCTTGGGGAAGAGCTAGAGGCTCTGAAAACAGAGTTGGAGGACACGCTGGAT
TCCACAGCTGCCCAGCAGGAGCTCAGGTCAAAACGTGAGCAGGAGGTGAACATCCTGAAGAAGACCCTGGAGGAGGAGGCCAAGACCCAC
GAGGCCCAGATCCAGGAGATGAGGCAGAAGCACTCACAGGCCGTGGAGGAGCTGGCGGAGCAGCTGGAGCAGACGAAGCGGGTGAAAGCA
AACCTCGAGAAGGCAAAGCAGACTCTGGAGAACGAGCGGGGGGAGCTGGCCAACGAGGTGAAGGTGCTGCTGCAGGGCAAAGGGGACTCG
GAGCACAAGCGCAAGAAAGTGGAGGCGCAGCTGCAGGAGCTGCAGGTCAAGTTCAACGAGGGAGAGCGCGTGCGCACAGAGCTGGCCGAC
AAGGTCACCAAGCTGCAGGTGGAGCTGGACAACGTGACCGGGCTTCTCAGCCAGTCCGACAGCAAGTCCAGCAAGCTCACCAAGGACTTC
TCCGCGCTGGAGTCCCAGCTGCAGGACACTCAGGAGCTGCTGCAGGAGGAGAACCGGCAGAAGCTGAGCCTGAGCACCAAGCTCAAGCAG
GTGGAGGACGAGAAGAATTCCTTCCGGGAGCAGCTGGAGGAGGAGGAGGAGGCCAAGCACAACCTGGAGAAGCAGATCGCCACCCTCCAT
GCCCAGGTGGCCGACATGAAAAAGAAGATGGAGGACAGTGTGGGGTGCCTGGAAACTGCTGAGGAGGTGAAGAGGAAGCTCCAGAAGGAC
CTGGAGGGCCTGAGCCAGCGGCACGAGGAGAAGGTGGCCGCCTACGACAAGCTGGAGAAGACCAAGACGCGGCTGCAGCAGGAGCTGGAC
GACCTGCTGGTGGACCTGGACCACCAGCGCCAGAGCGCGTGCAACCTGGAGAAGAAGCAGAAGAAGTTTGACCAGCTCCTGGCGGAGGAG
AAGACCATCTCTGCCAAGTATGCAGAGGAGCGCGACCGGGCTGAGGCGGAGGCCCGAGAGAAGGAGACCAAGGCTCTGTCGCTGGCCCGG
GCCCTGGAGGAAGCCATGGAGCAGAAGGCGGAGCTGGAGCGGCTCAACAAGCAGTTCCGCACGGAGATGGAGGACCTTATGAGCTCCAAG
GATGATGTGGGCAAGAGTGTCCACGAGCTGGAGAAGTCCAAGCGGGCCCTAGAGCAGCAGGTGGAGGAGATGAAGACGCAGCTGGAAGAG
CTGGAGGACGAGCTGCAGGCCACCGAAGATGCCAAGCTGCGGTTGGAGGTCAACCTGCAGGCCATGAAGGCCCAGTTCGAGCGGGACCTG
CAGGGCCGGGACGAGCAGAGCGAGGAGAAGAAGAAGCAGCTGGTCAGACAGGTGCGGGAGATGGAGGCAGAGCTGGAGGACGAGAGGAAG
CAGCGCTCGATGGCAGTGGCCGCCCGGAAGAAGCTGGAGATGGACCTGAAGGACCTGGAGGCGCACATCGACTCGGCCAACAAGAACCGG
GACGAAGCCATCAAACAGCTGCGGAAGCTGCAGGCCCAGATGAAGGACTGCATGCGCGAGCTGGATGACACCCGCGCCTCTCGTGAGGAG
ATCCTGGCCCAGGCCAAAGAGAACGAGAAGAAGCTGAAGAGCATGGAGGCCGAGATGATCCAGTTGCAGGAGGAACTGGCAGCCGCGGAG
CGTGCCAAGCGCCAGGCCCAGCAGGAGCGGGATGAGCTGGCTGACGAGATCGCCAACAGCAGCGGCAAAGGAGCCCTGGCGTTAGAGGAG
AAGCGGCGTCTGGAGGCCCGCATCGCCCAGCTGGAGGAGGAGCTGGAGGAGGAGCAGGGCAACACGGAGCTGATCAACGACCGGCTGAAG
AAGGCCAACCTGCAGATCGACCAGATCAACACCGACCTGAACCTGGAGCGCAGCCACGCCCAGAAGAACGAGAATGCTCGGCAGCAGCTG
GAACGCCAGAACAAGGAGCTTAAGGTCAAGCTGCAGGAGATGGAGGGCACTGTCAAGTCCAAGTACAAGGCCTCCATCACCGCCCTCGAG
GCCAAGATTGCACAGCTGGAGGAGCAGCTGGACAACGAGACCAAGGAGCGCCAGGCAGCCTGCAAACAGGTGCGTCGGACCGAGAAGAAG
CTGAAGGATGTGCTGCTGCAGGTGGATGACGAGCGGAGGAACGCCGAGCAGTACAAGGACCAGGCCGACAAGGCATCTACCCGCCTGAAG
CAGCTCAAGCGGCAGCTGGAGGAGGCCGAAGAGGAGGCCCAGCGGGCCAACGCCTCCCGCCGGAAACTGCAGCGCGAGCTGGAGGACGCC
ACTGAGACGGCCGATGCCATGAACCGCGAAGTCAGCTCCCTAAAGAACAAGCTCAGGCGCGGGGACCTGCCGTTTGTCGTGCCCCGCCGA
ATGGCCCGGAAAGGCGCCGGGGATGGCTCCGACGAAGAGGTAGATGGCAAAGCGGATGGGGCTGAGGCCAAACCTGCCGAATAAGCCTCT
TCTCCTGCAGCCTGAGATGGATGGACAGACAGACACCACAGCCTCCCCTTCCCAGACCCCGCAGCACGCCTCTCCCCACCTTCTTGGGAC
TGCTGTGAACATGCCTCCTCCTGCCCTCCGCCCCGTCCCCCCATCCCGTTTCCCTCCAGGTGTTGTTGAGGGCATTTGGCTTCCTCTGCT
GCATCCCCTTCCAGCTCCCTCCCCTGCTCAGAATCTGATACCAAAGAGACAGGGCCCGGGCCCAGGCAGAGAGCGACCAGCAGGCTCCTC
AGCCCTCTCTTGCCAAAAAGCACAAGATGTTGAGGCGAGCAGGGCAGGCCCCCGGGGAGGGGCCAGAGTTTTCTATGAATCTATTTTTCT
TCAGACTGAGGCCTTTTGGTAGTCGGAGCCCCCGCAGTCGTCAGCCTCCCTGACGTCTGCCACCAGCGCCCCCACTCCTCCTCCTTTCTT
TGCTGTTTGCAATCACACGTGGTGACCTCACACACCTCTGCCCCTTGGGCCTCCCACTCCCATGGCTCTGGGCGGTCCAGAAGGAGCAGG
CCCTGGGCCTCCACCTCTGTGCAGGGCACAGAAGGCTGGGGTGGGGGGAGGAGTGGATTCCTCCCCACCCTGTCCCAGGCAGCGCCACTG
TCCGCTGTCTCCCTCCTGATTCTAAAATGTCTCAAGTGCAATGCCCCCTCCCCTCCTTTACCGAGGACAGCCTGCCTCTGCCACAGCAAG
GCTGTCGGGGTCAAGCTGGAAAGGCCAGCAGCCTTCCAGTGGCTTCTCCCAACACTCTTGGGGACCAAATATATTTAATGGTTAAGGGAC
TTGTCCCAAGTCTGACAGCCAGAGCGTTAGAGGGGCCAGCGGCCCTCCCAGGCGATCTTGTGTCTACTCTAGGACTGGGCCCGAGGGTGG
TTTACCTGCACCGTTGACTCAGTATAGTTTAAAAATCTGCCACCTGCACAGGTATTTTTGAAAGCAAAATAAGGTTTTCTTTTTTCCCCT
TTCTTGTAATAAATGATAAAATTCCGAGTCTTTCTCACTGCCTTTGTTTAGAAGAGAGTAGCTCGTCCTCACTGGTCTACACTGGTTGCC
GAATTTACTTGTATTCCTAACTGTTTTGTATATGCTGCATTGAGACTTACGGCAAGAAGGCATTTTTTTTTTTTAAAGGAAACAAACTCT
CAAATCATGAAGTGATATAAAAGCTGCATATGCCTACAAAGCTCTGAATTCAGGTCCCAGTTGCTGTCACAAAGGAGTGAGTGAAACTCC
CACCCTACCCCCTTTTTTATATAATAAAAGTGCCTTAGCATGTGTTGCAGCTGTCACCACTACAGTAAGCTGGTTTACAGATGTTTTCCA

>16303_16303_1_CHD2-MYH9_CHD2_chr15_93444529_ENST00000394196_MYH9_chr22_36701148_ENST00000216181_length(amino acids)=1261AA_BP=20
MMRNKDKSQEEDSSLHSNASRYEILTPNSIPKGFMDGKQACVLMIKALELDSNLYRIGQSKVFFRAGVLAHLEEERDLKITDVIIGFQAC
CRGYLARKAFAKRQQQLTAMKVLQRNCAAYLKLRNWQWWRLFTKVKPLLQVSRQEEEMMAKEEELVKVREKQLAAENRLTEMETLQSQLM
AEKLQLQEQLQAETELCAEAEELRARLTAKKQELEEICHDLEARVEEEEERCQHLQAEKKKMQQNIQELEEQLEEEESARQKLQLEKVTT
EAKLKKLEEEQIILEDQNCKLAKEKKLLEDRIAEFTTNLTEEEEKSKSLAKLKNKHEAMITDLEERLRREEKQRQELEKTRRKLEGDSTD
LSDQIAELQAQIAELKMQLAKKEEELQAALARVEEEAAQKNMALKKIRELESQISELQEDLESERASRNKAEKQKRDLGEELEALKTELE
DTLDSTAAQQELRSKREQEVNILKKTLEEEAKTHEAQIQEMRQKHSQAVEELAEQLEQTKRVKANLEKAKQTLENERGELANEVKVLLQG
KGDSEHKRKKVEAQLQELQVKFNEGERVRTELADKVTKLQVELDNVTGLLSQSDSKSSKLTKDFSALESQLQDTQELLQEENRQKLSLST
KLKQVEDEKNSFREQLEEEEEAKHNLEKQIATLHAQVADMKKKMEDSVGCLETAEEVKRKLQKDLEGLSQRHEEKVAAYDKLEKTKTRLQ
QELDDLLVDLDHQRQSACNLEKKQKKFDQLLAEEKTISAKYAEERDRAEAEAREKETKALSLARALEEAMEQKAELERLNKQFRTEMEDL
MSSKDDVGKSVHELEKSKRALEQQVEEMKTQLEELEDELQATEDAKLRLEVNLQAMKAQFERDLQGRDEQSEEKKKQLVRQVREMEAELE
DERKQRSMAVAARKKLEMDLKDLEAHIDSANKNRDEAIKQLRKLQAQMKDCMRELDDTRASREEILAQAKENEKKLKSMEAEMIQLQEEL
AAAERAKRQAQQERDELADEIANSSGKGALALEEKRRLEARIAQLEEELEEEQGNTELINDRLKKANLQIDQINTDLNLERSHAQKNENA
RQQLERQNKELKVKLQEMEGTVKSKYKASITALEAKIAQLEEQLDNETKERQAACKQVRRTEKKLKDVLLQVDDERRNAEQYKDQADKAS
TRLKQLKRQLEEAEEEAQRANASRRKLQRELEDATETADAMNREVSSLKNKLRRGDLPFVVPRRMARKGAGDGSDEEVDGKADGAEAKPA

--------------------------------------------------------------
>16303_16303_2_CHD2-MYH9_CHD2_chr15_93444529_ENST00000420239_MYH9_chr22_36701148_ENST00000216181_length(transcript)=5734nt_BP=623nt
GGAGGCTCTAGATGGCGGCTGTGCCTTAGAGAGAGCGCGCTCTGCTCCCTGCCTTTGCCTCACTTTACGCAACTTTCCCTAACTTTCGGG
CAGCCTCAGGGGGCCCCCGTAGCCCCCTGCCTTTCCTAGGGACTTACTGGGGTCGATTCGAACCTTTTTTTGGGAGAAAAGCAGCTTTTA
GGAGCTTTCTTTTCGTGCCTTGTTGGAAAGAAGCAGCCGTACTGAGAGCCCAGGTCGTTGTTTTTTCCAGCTTAGAAGCCATGGCGCACC
TCCATTTTTGTGCGCTCTCCTAATGAGGTTTTTTTTCTTTCGGACCTGTTTTAGTATTAATTATTGCTTTATTTTTTTGACCAGTTAACA
TATTTGAGGGTTATTTTATTTATTTTTCGTTTTTTAACGGAGGATTTTGCCTTTATTTTTAATTATTTGGGATCTGATATTTTTCTACTA
GTAGATAGGACTCTTGGTTTGGACATACTACATGGATCAGTAAATACCTGGGCACAGGACTTCAAAGCAAACACAGATTCCCCCTCCCCC
TTAATATTTAAGAATTAAAAGATGATGAGAAATAAGGACAAAAGCCAAGAGGAGGACAGTTCGCTACACAGCAATGCATCGAGATATGAG
ATCCTGACTCCAAACTCCATTCCCAAGGGTTTCATGGACGGGAAGCAGGCGTGCGTGCTCATGATAAAAGCCCTGGAGCTCGACAGCAAT
CTGTACCGCATTGGCCAGAGCAAAGTCTTCTTCCGTGCCGGTGTGCTGGCCCACCTGGAGGAGGAGCGAGACCTGAAGATCACCGACGTC
ATCATAGGGTTCCAGGCCTGCTGCAGGGGCTACCTGGCCAGGAAAGCATTTGCCAAGCGGCAGCAGCAGCTTACCGCCATGAAGGTCCTC
CAGCGGAACTGCGCTGCCTACCTGAAGCTGCGGAACTGGCAGTGGTGGCGGCTCTTCACCAAGGTCAAGCCGCTGCTGCAGGTGAGCCGG
CAGGAGGAGGAGATGATGGCCAAGGAGGAGGAGCTGGTGAAGGTCAGAGAGAAGCAGCTGGCTGCGGAGAACAGGCTCACGGAGATGGAG
ACGCTGCAGTCTCAGCTCATGGCAGAGAAATTGCAGCTGCAGGAGCAGCTCCAGGCAGAAACCGAGCTGTGTGCCGAGGCTGAGGAGCTC
CGGGCCCGCCTGACCGCCAAGAAGCAGGAATTAGAAGAGATCTGCCATGACCTAGAGGCCAGGGTGGAGGAGGAGGAGGAGCGCTGCCAG
CACCTGCAGGCGGAGAAGAAGAAGATGCAGCAGAACATCCAGGAGCTTGAGGAGCAGCTGGAGGAGGAGGAGAGCGCCCGGCAGAAGCTG
CAGCTGGAGAAGGTGACCACCGAGGCGAAGCTGAAAAAGCTGGAGGAGGAGCAGATCATCCTGGAGGACCAGAACTGCAAGCTGGCCAAG
GAAAAGAAACTGCTGGAAGACAGAATAGCTGAGTTCACCACCAACCTCACAGAAGAGGAGGAGAAATCTAAGAGCCTCGCCAAGCTCAAG
AACAAGCATGAGGCAATGATCACTGACTTGGAAGAGCGCCTCCGCAGGGAGGAGAAGCAGCGACAGGAGCTGGAGAAGACCCGCCGGAAG
CTGGAGGGAGACTCCACAGACCTCAGCGACCAGATCGCCGAGCTCCAGGCCCAGATCGCGGAGCTCAAGATGCAGCTGGCCAAGAAAGAG
GAGGAGCTCCAGGCCGCCCTGGCCAGAGTGGAAGAGGAAGCTGCCCAGAAGAACATGGCCCTCAAGAAGATCCGGGAGCTGGAATCTCAG
ATCTCTGAACTCCAGGAAGACCTGGAGTCTGAGCGTGCTTCCAGGAATAAAGCTGAGAAGCAGAAACGGGACCTTGGGGAAGAGCTAGAG
GCTCTGAAAACAGAGTTGGAGGACACGCTGGATTCCACAGCTGCCCAGCAGGAGCTCAGGTCAAAACGTGAGCAGGAGGTGAACATCCTG
AAGAAGACCCTGGAGGAGGAGGCCAAGACCCACGAGGCCCAGATCCAGGAGATGAGGCAGAAGCACTCACAGGCCGTGGAGGAGCTGGCG
GAGCAGCTGGAGCAGACGAAGCGGGTGAAAGCAAACCTCGAGAAGGCAAAGCAGACTCTGGAGAACGAGCGGGGGGAGCTGGCCAACGAG
GTGAAGGTGCTGCTGCAGGGCAAAGGGGACTCGGAGCACAAGCGCAAGAAAGTGGAGGCGCAGCTGCAGGAGCTGCAGGTCAAGTTCAAC
GAGGGAGAGCGCGTGCGCACAGAGCTGGCCGACAAGGTCACCAAGCTGCAGGTGGAGCTGGACAACGTGACCGGGCTTCTCAGCCAGTCC
GACAGCAAGTCCAGCAAGCTCACCAAGGACTTCTCCGCGCTGGAGTCCCAGCTGCAGGACACTCAGGAGCTGCTGCAGGAGGAGAACCGG
CAGAAGCTGAGCCTGAGCACCAAGCTCAAGCAGGTGGAGGACGAGAAGAATTCCTTCCGGGAGCAGCTGGAGGAGGAGGAGGAGGCCAAG
CACAACCTGGAGAAGCAGATCGCCACCCTCCATGCCCAGGTGGCCGACATGAAAAAGAAGATGGAGGACAGTGTGGGGTGCCTGGAAACT
GCTGAGGAGGTGAAGAGGAAGCTCCAGAAGGACCTGGAGGGCCTGAGCCAGCGGCACGAGGAGAAGGTGGCCGCCTACGACAAGCTGGAG
AAGACCAAGACGCGGCTGCAGCAGGAGCTGGACGACCTGCTGGTGGACCTGGACCACCAGCGCCAGAGCGCGTGCAACCTGGAGAAGAAG
CAGAAGAAGTTTGACCAGCTCCTGGCGGAGGAGAAGACCATCTCTGCCAAGTATGCAGAGGAGCGCGACCGGGCTGAGGCGGAGGCCCGA
GAGAAGGAGACCAAGGCTCTGTCGCTGGCCCGGGCCCTGGAGGAAGCCATGGAGCAGAAGGCGGAGCTGGAGCGGCTCAACAAGCAGTTC
CGCACGGAGATGGAGGACCTTATGAGCTCCAAGGATGATGTGGGCAAGAGTGTCCACGAGCTGGAGAAGTCCAAGCGGGCCCTAGAGCAG
CAGGTGGAGGAGATGAAGACGCAGCTGGAAGAGCTGGAGGACGAGCTGCAGGCCACCGAAGATGCCAAGCTGCGGTTGGAGGTCAACCTG
CAGGCCATGAAGGCCCAGTTCGAGCGGGACCTGCAGGGCCGGGACGAGCAGAGCGAGGAGAAGAAGAAGCAGCTGGTCAGACAGGTGCGG
GAGATGGAGGCAGAGCTGGAGGACGAGAGGAAGCAGCGCTCGATGGCAGTGGCCGCCCGGAAGAAGCTGGAGATGGACCTGAAGGACCTG
GAGGCGCACATCGACTCGGCCAACAAGAACCGGGACGAAGCCATCAAACAGCTGCGGAAGCTGCAGGCCCAGATGAAGGACTGCATGCGC
GAGCTGGATGACACCCGCGCCTCTCGTGAGGAGATCCTGGCCCAGGCCAAAGAGAACGAGAAGAAGCTGAAGAGCATGGAGGCCGAGATG
ATCCAGTTGCAGGAGGAACTGGCAGCCGCGGAGCGTGCCAAGCGCCAGGCCCAGCAGGAGCGGGATGAGCTGGCTGACGAGATCGCCAAC
AGCAGCGGCAAAGGAGCCCTGGCGTTAGAGGAGAAGCGGCGTCTGGAGGCCCGCATCGCCCAGCTGGAGGAGGAGCTGGAGGAGGAGCAG
GGCAACACGGAGCTGATCAACGACCGGCTGAAGAAGGCCAACCTGCAGATCGACCAGATCAACACCGACCTGAACCTGGAGCGCAGCCAC
GCCCAGAAGAACGAGAATGCTCGGCAGCAGCTGGAACGCCAGAACAAGGAGCTTAAGGTCAAGCTGCAGGAGATGGAGGGCACTGTCAAG
TCCAAGTACAAGGCCTCCATCACCGCCCTCGAGGCCAAGATTGCACAGCTGGAGGAGCAGCTGGACAACGAGACCAAGGAGCGCCAGGCA
GCCTGCAAACAGGTGCGTCGGACCGAGAAGAAGCTGAAGGATGTGCTGCTGCAGGTGGATGACGAGCGGAGGAACGCCGAGCAGTACAAG
GACCAGGCCGACAAGGCATCTACCCGCCTGAAGCAGCTCAAGCGGCAGCTGGAGGAGGCCGAAGAGGAGGCCCAGCGGGCCAACGCCTCC
CGCCGGAAACTGCAGCGCGAGCTGGAGGACGCCACTGAGACGGCCGATGCCATGAACCGCGAAGTCAGCTCCCTAAAGAACAAGCTCAGG
CGCGGGGACCTGCCGTTTGTCGTGCCCCGCCGAATGGCCCGGAAAGGCGCCGGGGATGGCTCCGACGAAGAGGTAGATGGCAAAGCGGAT
GGGGCTGAGGCCAAACCTGCCGAATAAGCCTCTTCTCCTGCAGCCTGAGATGGATGGACAGACAGACACCACAGCCTCCCCTTCCCAGAC
CCCGCAGCACGCCTCTCCCCACCTTCTTGGGACTGCTGTGAACATGCCTCCTCCTGCCCTCCGCCCCGTCCCCCCATCCCGTTTCCCTCC
AGGTGTTGTTGAGGGCATTTGGCTTCCTCTGCTGCATCCCCTTCCAGCTCCCTCCCCTGCTCAGAATCTGATACCAAAGAGACAGGGCCC
GGGCCCAGGCAGAGAGCGACCAGCAGGCTCCTCAGCCCTCTCTTGCCAAAAAGCACAAGATGTTGAGGCGAGCAGGGCAGGCCCCCGGGG
AGGGGCCAGAGTTTTCTATGAATCTATTTTTCTTCAGACTGAGGCCTTTTGGTAGTCGGAGCCCCCGCAGTCGTCAGCCTCCCTGACGTC
TGCCACCAGCGCCCCCACTCCTCCTCCTTTCTTTGCTGTTTGCAATCACACGTGGTGACCTCACACACCTCTGCCCCTTGGGCCTCCCAC
TCCCATGGCTCTGGGCGGTCCAGAAGGAGCAGGCCCTGGGCCTCCACCTCTGTGCAGGGCACAGAAGGCTGGGGTGGGGGGAGGAGTGGA
TTCCTCCCCACCCTGTCCCAGGCAGCGCCACTGTCCGCTGTCTCCCTCCTGATTCTAAAATGTCTCAAGTGCAATGCCCCCTCCCCTCCT
TTACCGAGGACAGCCTGCCTCTGCCACAGCAAGGCTGTCGGGGTCAAGCTGGAAAGGCCAGCAGCCTTCCAGTGGCTTCTCCCAACACTC
TTGGGGACCAAATATATTTAATGGTTAAGGGACTTGTCCCAAGTCTGACAGCCAGAGCGTTAGAGGGGCCAGCGGCCCTCCCAGGCGATC
TTGTGTCTACTCTAGGACTGGGCCCGAGGGTGGTTTACCTGCACCGTTGACTCAGTATAGTTTAAAAATCTGCCACCTGCACAGGTATTT
TTGAAAGCAAAATAAGGTTTTCTTTTTTCCCCTTTCTTGTAATAAATGATAAAATTCCGAGTCTTTCTCACTGCCTTTGTTTAGAAGAGA
GTAGCTCGTCCTCACTGGTCTACACTGGTTGCCGAATTTACTTGTATTCCTAACTGTTTTGTATATGCTGCATTGAGACTTACGGCAAGA
AGGCATTTTTTTTTTTTAAAGGAAACAAACTCTCAAATCATGAAGTGATATAAAAGCTGCATATGCCTACAAAGCTCTGAATTCAGGTCC
CAGTTGCTGTCACAAAGGAGTGAGTGAAACTCCCACCCTACCCCCTTTTTTATATAATAAAAGTGCCTTAGCATGTGTTGCAGCTGTCAC

>16303_16303_2_CHD2-MYH9_CHD2_chr15_93444529_ENST00000420239_MYH9_chr22_36701148_ENST00000216181_length(amino acids)=1261AA_BP=20
MMRNKDKSQEEDSSLHSNASRYEILTPNSIPKGFMDGKQACVLMIKALELDSNLYRIGQSKVFFRAGVLAHLEEERDLKITDVIIGFQAC
CRGYLARKAFAKRQQQLTAMKVLQRNCAAYLKLRNWQWWRLFTKVKPLLQVSRQEEEMMAKEEELVKVREKQLAAENRLTEMETLQSQLM
AEKLQLQEQLQAETELCAEAEELRARLTAKKQELEEICHDLEARVEEEEERCQHLQAEKKKMQQNIQELEEQLEEEESARQKLQLEKVTT
EAKLKKLEEEQIILEDQNCKLAKEKKLLEDRIAEFTTNLTEEEEKSKSLAKLKNKHEAMITDLEERLRREEKQRQELEKTRRKLEGDSTD
LSDQIAELQAQIAELKMQLAKKEEELQAALARVEEEAAQKNMALKKIRELESQISELQEDLESERASRNKAEKQKRDLGEELEALKTELE
DTLDSTAAQQELRSKREQEVNILKKTLEEEAKTHEAQIQEMRQKHSQAVEELAEQLEQTKRVKANLEKAKQTLENERGELANEVKVLLQG
KGDSEHKRKKVEAQLQELQVKFNEGERVRTELADKVTKLQVELDNVTGLLSQSDSKSSKLTKDFSALESQLQDTQELLQEENRQKLSLST
KLKQVEDEKNSFREQLEEEEEAKHNLEKQIATLHAQVADMKKKMEDSVGCLETAEEVKRKLQKDLEGLSQRHEEKVAAYDKLEKTKTRLQ
QELDDLLVDLDHQRQSACNLEKKQKKFDQLLAEEKTISAKYAEERDRAEAEAREKETKALSLARALEEAMEQKAELERLNKQFRTEMEDL
MSSKDDVGKSVHELEKSKRALEQQVEEMKTQLEELEDELQATEDAKLRLEVNLQAMKAQFERDLQGRDEQSEEKKKQLVRQVREMEAELE
DERKQRSMAVAARKKLEMDLKDLEAHIDSANKNRDEAIKQLRKLQAQMKDCMRELDDTRASREEILAQAKENEKKLKSMEAEMIQLQEEL
AAAERAKRQAQQERDELADEIANSSGKGALALEEKRRLEARIAQLEEELEEEQGNTELINDRLKKANLQIDQINTDLNLERSHAQKNENA
RQQLERQNKELKVKLQEMEGTVKSKYKASITALEAKIAQLEEQLDNETKERQAACKQVRRTEKKLKDVLLQVDDERRNAEQYKDQADKAS
TRLKQLKRQLEEAEEEAQRANASRRKLQRELEDATETADAMNREVSSLKNKLRRGDLPFVVPRRMARKGAGDGSDEEVDGKADGAEAKPA

--------------------------------------------------------------
>16303_16303_3_CHD2-MYH9_CHD2_chr15_93444529_ENST00000557381_MYH9_chr22_36701148_ENST00000216181_length(transcript)=5880nt_BP=769nt
CCCTGGGATGGAGGATCTGTCTCTCTCTCTCTCTCTCCTTTTTTTTTTTTTGGTGGAGATGAAGGGGTGGGTCTATGGTACATCACCTGA
GTTGTGGGGTAAATGTAGAGAGTGTCAATCAAAGGCAGAGCTCTCAGAGCTGGGAAGGAGGCTCTAGATGGCGGCTGTGCCTTAGAGAGA
GCGCGCTCTGCTCCCTGCCTTTGCCTCACTTTACGCAACTTTCCCTAACTTTCGGGCAGCCTCAGGGGGCCCCCGTAGCCCCCTGCCTTT
CCTAGGGACTTACTGGGGTCGATTCGAACCTTTTTTTGGGAGAAAAGCAGCTTTTAGGAGCTTTCTTTTCGTGCCTTGTTGGAAAGAAGC
AGCCGTACTGAGAGCCCAGGTCGTTGTTTTTTCCAGCTTAGAAGCCATGGCGCACCTCCATTTTTGTGCGCTCTCCTAATGAGGTTTTTT
TTCTTTCGGACCTGTTTTAGTATTAATTATTGCTTTATTTTTTTGACCAGTTAACATATTTGAGGGTTATTTTATTTATTTTTCGTTTTT
TAACGGAGGATTTTGCCTTTATTTTTAATTATTTGGGATCTGATATTTTTCTACTAGTAGATAGGACTCTTGGTTTGGACATACTACATG
GATCAGTAAATACCTGGGCACAGGACTTCAAAGCAAACACAGATTCCCCCTCCCCCTTAATATTTAAGAATTAAAAGATGATGAGAAATA
AGGACAAAAGCCAAGAGGAGGACAGTTCGCTACACAGCAATGCATCGAGATATGAGATCCTGACTCCAAACTCCATTCCCAAGGGTTTCA
TGGACGGGAAGCAGGCGTGCGTGCTCATGATAAAAGCCCTGGAGCTCGACAGCAATCTGTACCGCATTGGCCAGAGCAAAGTCTTCTTCC
GTGCCGGTGTGCTGGCCCACCTGGAGGAGGAGCGAGACCTGAAGATCACCGACGTCATCATAGGGTTCCAGGCCTGCTGCAGGGGCTACC
TGGCCAGGAAAGCATTTGCCAAGCGGCAGCAGCAGCTTACCGCCATGAAGGTCCTCCAGCGGAACTGCGCTGCCTACCTGAAGCTGCGGA
ACTGGCAGTGGTGGCGGCTCTTCACCAAGGTCAAGCCGCTGCTGCAGGTGAGCCGGCAGGAGGAGGAGATGATGGCCAAGGAGGAGGAGC
TGGTGAAGGTCAGAGAGAAGCAGCTGGCTGCGGAGAACAGGCTCACGGAGATGGAGACGCTGCAGTCTCAGCTCATGGCAGAGAAATTGC
AGCTGCAGGAGCAGCTCCAGGCAGAAACCGAGCTGTGTGCCGAGGCTGAGGAGCTCCGGGCCCGCCTGACCGCCAAGAAGCAGGAATTAG
AAGAGATCTGCCATGACCTAGAGGCCAGGGTGGAGGAGGAGGAGGAGCGCTGCCAGCACCTGCAGGCGGAGAAGAAGAAGATGCAGCAGA
ACATCCAGGAGCTTGAGGAGCAGCTGGAGGAGGAGGAGAGCGCCCGGCAGAAGCTGCAGCTGGAGAAGGTGACCACCGAGGCGAAGCTGA
AAAAGCTGGAGGAGGAGCAGATCATCCTGGAGGACCAGAACTGCAAGCTGGCCAAGGAAAAGAAACTGCTGGAAGACAGAATAGCTGAGT
TCACCACCAACCTCACAGAAGAGGAGGAGAAATCTAAGAGCCTCGCCAAGCTCAAGAACAAGCATGAGGCAATGATCACTGACTTGGAAG
AGCGCCTCCGCAGGGAGGAGAAGCAGCGACAGGAGCTGGAGAAGACCCGCCGGAAGCTGGAGGGAGACTCCACAGACCTCAGCGACCAGA
TCGCCGAGCTCCAGGCCCAGATCGCGGAGCTCAAGATGCAGCTGGCCAAGAAAGAGGAGGAGCTCCAGGCCGCCCTGGCCAGAGTGGAAG
AGGAAGCTGCCCAGAAGAACATGGCCCTCAAGAAGATCCGGGAGCTGGAATCTCAGATCTCTGAACTCCAGGAAGACCTGGAGTCTGAGC
GTGCTTCCAGGAATAAAGCTGAGAAGCAGAAACGGGACCTTGGGGAAGAGCTAGAGGCTCTGAAAACAGAGTTGGAGGACACGCTGGATT
CCACAGCTGCCCAGCAGGAGCTCAGGTCAAAACGTGAGCAGGAGGTGAACATCCTGAAGAAGACCCTGGAGGAGGAGGCCAAGACCCACG
AGGCCCAGATCCAGGAGATGAGGCAGAAGCACTCACAGGCCGTGGAGGAGCTGGCGGAGCAGCTGGAGCAGACGAAGCGGGTGAAAGCAA
ACCTCGAGAAGGCAAAGCAGACTCTGGAGAACGAGCGGGGGGAGCTGGCCAACGAGGTGAAGGTGCTGCTGCAGGGCAAAGGGGACTCGG
AGCACAAGCGCAAGAAAGTGGAGGCGCAGCTGCAGGAGCTGCAGGTCAAGTTCAACGAGGGAGAGCGCGTGCGCACAGAGCTGGCCGACA
AGGTCACCAAGCTGCAGGTGGAGCTGGACAACGTGACCGGGCTTCTCAGCCAGTCCGACAGCAAGTCCAGCAAGCTCACCAAGGACTTCT
CCGCGCTGGAGTCCCAGCTGCAGGACACTCAGGAGCTGCTGCAGGAGGAGAACCGGCAGAAGCTGAGCCTGAGCACCAAGCTCAAGCAGG
TGGAGGACGAGAAGAATTCCTTCCGGGAGCAGCTGGAGGAGGAGGAGGAGGCCAAGCACAACCTGGAGAAGCAGATCGCCACCCTCCATG
CCCAGGTGGCCGACATGAAAAAGAAGATGGAGGACAGTGTGGGGTGCCTGGAAACTGCTGAGGAGGTGAAGAGGAAGCTCCAGAAGGACC
TGGAGGGCCTGAGCCAGCGGCACGAGGAGAAGGTGGCCGCCTACGACAAGCTGGAGAAGACCAAGACGCGGCTGCAGCAGGAGCTGGACG
ACCTGCTGGTGGACCTGGACCACCAGCGCCAGAGCGCGTGCAACCTGGAGAAGAAGCAGAAGAAGTTTGACCAGCTCCTGGCGGAGGAGA
AGACCATCTCTGCCAAGTATGCAGAGGAGCGCGACCGGGCTGAGGCGGAGGCCCGAGAGAAGGAGACCAAGGCTCTGTCGCTGGCCCGGG
CCCTGGAGGAAGCCATGGAGCAGAAGGCGGAGCTGGAGCGGCTCAACAAGCAGTTCCGCACGGAGATGGAGGACCTTATGAGCTCCAAGG
ATGATGTGGGCAAGAGTGTCCACGAGCTGGAGAAGTCCAAGCGGGCCCTAGAGCAGCAGGTGGAGGAGATGAAGACGCAGCTGGAAGAGC
TGGAGGACGAGCTGCAGGCCACCGAAGATGCCAAGCTGCGGTTGGAGGTCAACCTGCAGGCCATGAAGGCCCAGTTCGAGCGGGACCTGC
AGGGCCGGGACGAGCAGAGCGAGGAGAAGAAGAAGCAGCTGGTCAGACAGGTGCGGGAGATGGAGGCAGAGCTGGAGGACGAGAGGAAGC
AGCGCTCGATGGCAGTGGCCGCCCGGAAGAAGCTGGAGATGGACCTGAAGGACCTGGAGGCGCACATCGACTCGGCCAACAAGAACCGGG
ACGAAGCCATCAAACAGCTGCGGAAGCTGCAGGCCCAGATGAAGGACTGCATGCGCGAGCTGGATGACACCCGCGCCTCTCGTGAGGAGA
TCCTGGCCCAGGCCAAAGAGAACGAGAAGAAGCTGAAGAGCATGGAGGCCGAGATGATCCAGTTGCAGGAGGAACTGGCAGCCGCGGAGC
GTGCCAAGCGCCAGGCCCAGCAGGAGCGGGATGAGCTGGCTGACGAGATCGCCAACAGCAGCGGCAAAGGAGCCCTGGCGTTAGAGGAGA
AGCGGCGTCTGGAGGCCCGCATCGCCCAGCTGGAGGAGGAGCTGGAGGAGGAGCAGGGCAACACGGAGCTGATCAACGACCGGCTGAAGA
AGGCCAACCTGCAGATCGACCAGATCAACACCGACCTGAACCTGGAGCGCAGCCACGCCCAGAAGAACGAGAATGCTCGGCAGCAGCTGG
AACGCCAGAACAAGGAGCTTAAGGTCAAGCTGCAGGAGATGGAGGGCACTGTCAAGTCCAAGTACAAGGCCTCCATCACCGCCCTCGAGG
CCAAGATTGCACAGCTGGAGGAGCAGCTGGACAACGAGACCAAGGAGCGCCAGGCAGCCTGCAAACAGGTGCGTCGGACCGAGAAGAAGC
TGAAGGATGTGCTGCTGCAGGTGGATGACGAGCGGAGGAACGCCGAGCAGTACAAGGACCAGGCCGACAAGGCATCTACCCGCCTGAAGC
AGCTCAAGCGGCAGCTGGAGGAGGCCGAAGAGGAGGCCCAGCGGGCCAACGCCTCCCGCCGGAAACTGCAGCGCGAGCTGGAGGACGCCA
CTGAGACGGCCGATGCCATGAACCGCGAAGTCAGCTCCCTAAAGAACAAGCTCAGGCGCGGGGACCTGCCGTTTGTCGTGCCCCGCCGAA
TGGCCCGGAAAGGCGCCGGGGATGGCTCCGACGAAGAGGTAGATGGCAAAGCGGATGGGGCTGAGGCCAAACCTGCCGAATAAGCCTCTT
CTCCTGCAGCCTGAGATGGATGGACAGACAGACACCACAGCCTCCCCTTCCCAGACCCCGCAGCACGCCTCTCCCCACCTTCTTGGGACT
GCTGTGAACATGCCTCCTCCTGCCCTCCGCCCCGTCCCCCCATCCCGTTTCCCTCCAGGTGTTGTTGAGGGCATTTGGCTTCCTCTGCTG
CATCCCCTTCCAGCTCCCTCCCCTGCTCAGAATCTGATACCAAAGAGACAGGGCCCGGGCCCAGGCAGAGAGCGACCAGCAGGCTCCTCA
GCCCTCTCTTGCCAAAAAGCACAAGATGTTGAGGCGAGCAGGGCAGGCCCCCGGGGAGGGGCCAGAGTTTTCTATGAATCTATTTTTCTT
CAGACTGAGGCCTTTTGGTAGTCGGAGCCCCCGCAGTCGTCAGCCTCCCTGACGTCTGCCACCAGCGCCCCCACTCCTCCTCCTTTCTTT
GCTGTTTGCAATCACACGTGGTGACCTCACACACCTCTGCCCCTTGGGCCTCCCACTCCCATGGCTCTGGGCGGTCCAGAAGGAGCAGGC
CCTGGGCCTCCACCTCTGTGCAGGGCACAGAAGGCTGGGGTGGGGGGAGGAGTGGATTCCTCCCCACCCTGTCCCAGGCAGCGCCACTGT
CCGCTGTCTCCCTCCTGATTCTAAAATGTCTCAAGTGCAATGCCCCCTCCCCTCCTTTACCGAGGACAGCCTGCCTCTGCCACAGCAAGG
CTGTCGGGGTCAAGCTGGAAAGGCCAGCAGCCTTCCAGTGGCTTCTCCCAACACTCTTGGGGACCAAATATATTTAATGGTTAAGGGACT
TGTCCCAAGTCTGACAGCCAGAGCGTTAGAGGGGCCAGCGGCCCTCCCAGGCGATCTTGTGTCTACTCTAGGACTGGGCCCGAGGGTGGT
TTACCTGCACCGTTGACTCAGTATAGTTTAAAAATCTGCCACCTGCACAGGTATTTTTGAAAGCAAAATAAGGTTTTCTTTTTTCCCCTT
TCTTGTAATAAATGATAAAATTCCGAGTCTTTCTCACTGCCTTTGTTTAGAAGAGAGTAGCTCGTCCTCACTGGTCTACACTGGTTGCCG
AATTTACTTGTATTCCTAACTGTTTTGTATATGCTGCATTGAGACTTACGGCAAGAAGGCATTTTTTTTTTTTAAAGGAAACAAACTCTC
AAATCATGAAGTGATATAAAAGCTGCATATGCCTACAAAGCTCTGAATTCAGGTCCCAGTTGCTGTCACAAAGGAGTGAGTGAAACTCCC
ACCCTACCCCCTTTTTTATATAATAAAAGTGCCTTAGCATGTGTTGCAGCTGTCACCACTACAGTAAGCTGGTTTACAGATGTTTTCCAC

>16303_16303_3_CHD2-MYH9_CHD2_chr15_93444529_ENST00000557381_MYH9_chr22_36701148_ENST00000216181_length(amino acids)=1261AA_BP=20
MMRNKDKSQEEDSSLHSNASRYEILTPNSIPKGFMDGKQACVLMIKALELDSNLYRIGQSKVFFRAGVLAHLEEERDLKITDVIIGFQAC
CRGYLARKAFAKRQQQLTAMKVLQRNCAAYLKLRNWQWWRLFTKVKPLLQVSRQEEEMMAKEEELVKVREKQLAAENRLTEMETLQSQLM
AEKLQLQEQLQAETELCAEAEELRARLTAKKQELEEICHDLEARVEEEEERCQHLQAEKKKMQQNIQELEEQLEEEESARQKLQLEKVTT
EAKLKKLEEEQIILEDQNCKLAKEKKLLEDRIAEFTTNLTEEEEKSKSLAKLKNKHEAMITDLEERLRREEKQRQELEKTRRKLEGDSTD
LSDQIAELQAQIAELKMQLAKKEEELQAALARVEEEAAQKNMALKKIRELESQISELQEDLESERASRNKAEKQKRDLGEELEALKTELE
DTLDSTAAQQELRSKREQEVNILKKTLEEEAKTHEAQIQEMRQKHSQAVEELAEQLEQTKRVKANLEKAKQTLENERGELANEVKVLLQG
KGDSEHKRKKVEAQLQELQVKFNEGERVRTELADKVTKLQVELDNVTGLLSQSDSKSSKLTKDFSALESQLQDTQELLQEENRQKLSLST
KLKQVEDEKNSFREQLEEEEEAKHNLEKQIATLHAQVADMKKKMEDSVGCLETAEEVKRKLQKDLEGLSQRHEEKVAAYDKLEKTKTRLQ
QELDDLLVDLDHQRQSACNLEKKQKKFDQLLAEEKTISAKYAEERDRAEAEAREKETKALSLARALEEAMEQKAELERLNKQFRTEMEDL
MSSKDDVGKSVHELEKSKRALEQQVEEMKTQLEELEDELQATEDAKLRLEVNLQAMKAQFERDLQGRDEQSEEKKKQLVRQVREMEAELE
DERKQRSMAVAARKKLEMDLKDLEAHIDSANKNRDEAIKQLRKLQAQMKDCMRELDDTRASREEILAQAKENEKKLKSMEAEMIQLQEEL
AAAERAKRQAQQERDELADEIANSSGKGALALEEKRRLEARIAQLEEELEEEQGNTELINDRLKKANLQIDQINTDLNLERSHAQKNENA
RQQLERQNKELKVKLQEMEGTVKSKYKASITALEAKIAQLEEQLDNETKERQAACKQVRRTEKKLKDVLLQVDDERRNAEQYKDQADKAS
TRLKQLKRQLEEAEEEAQRANASRRKLQRELEDATETADAMNREVSSLKNKLRRGDLPFVVPRRMARKGAGDGSDEEVDGKADGAEAKPA

--------------------------------------------------------------

Top

Fusion Gene PPI Analysis for CHD2-MYH9


check button Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in

ChiPPI page.


check button Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160)
HgeneHgene's interactorsTgeneTgene's interactors


check button - Retained PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenStill interaction with


check button - Lost PPIs in in-frame fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


check button - Retained PPIs, but lost function due to frame-shift fusion.
PartnerGeneHbpTbpENSTStrandBPexonTotalExonProtein feature loci*BPlociTotalLenInteraction lost with


Top

Related Drugs for CHD2-MYH9


check button Drugs targeting genes involved in this fusion gene.
(DrugBank Version 5.1.8 2021-05-08)
PartnerGeneUniProtAccDrugBank IDDrug nameDrug activityDrug typeDrug status

Top

Related Diseases for CHD2-MYH9


check button Diseases associated with fusion partners.
(DisGeNet 4.0)
PartnerGeneDisease IDDisease name# pubmedsSource
HgeneCHD2C0014544Epilepsy2CTD_human
HgeneCHD2C0086237Epilepsy, Cryptogenic2CTD_human
HgeneCHD2C0236018Aura2CTD_human
HgeneCHD2C0751111Awakening Epilepsy2CTD_human
HgeneCHD2C3809278EPILEPTIC ENCEPHALOPATHY, CHILDHOOD-ONSET2GENOMICS_ENGLAND;UNIPROT
HgeneCHD2C0040517Gilles de la Tourette syndrome1GENOMICS_ENGLAND
HgeneCHD2C0238111Lennox-Gastaut syndrome1ORPHANET
HgeneCHD2C0393702Myoclonic Astatic Epilepsy1ORPHANET
HgeneCHD2C0543888Epileptic encephalopathy1GENOMICS_ENGLAND
HgeneCHD2C1510586Autism Spectrum Disorders1CTD_human
HgeneCHD2C1535926Neurodevelopmental Disorders1CTD_human
TgeneC0340978May-Hegglin anomaly25CLINGEN;GENOMICS_ENGLAND;UNIPROT
TgeneC1854520SEBASTIAN SYNDROME14CLINGEN;CTD_human;GENOMICS_ENGLAND;ORPHANET
TgeneC0398641Epstein syndrome (disorder)11CLINGEN
TgeneC0403445Fechtner syndrome (disorder)11CLINGEN
TgeneC0477317Other primary thrombocytopenia11CLINGEN
TgeneC1842035Giant Platelet Syndrome with Thrombocytopenia11CLINGEN
TgeneC1863659DEAFNESS, AUTOSOMAL DOMINANT 176CTD_human;GENOMICS_ENGLAND;UNIPROT
TgeneC0022661Kidney Failure, Chronic2CTD_human
TgeneC0006142Malignant neoplasm of breast1CTD_human;UNIPROT
TgeneC0017668Focal glomerulosclerosis1CTD_human
TgeneC0018784Sensorineural Hearing Loss (disorder)1GENOMICS_ENGLAND
TgeneC0018965Hematuria1GENOMICS_ENGLAND
TgeneC0020544Renal hypertension1CTD_human
TgeneC0027626Neoplasm Invasiveness1CTD_human
TgeneC0027706Hereditary nephritis1CTD_human
TgeneC0033687Proteinuria1GENOMICS_ENGLAND
TgeneC0035078Kidney Failure1GENOMICS_ENGLAND
TgeneC0086432Hyalinosis, Segmental Glomerular1CTD_human
TgeneC0086543Cataract1GENOMICS_ENGLAND
TgeneC0206692Carcinoma, Lobular1CTD_human
TgeneC0410005Nodular fasciitis1ORPHANET
TgeneC0678222Breast Carcinoma1CTD_human
TgeneC1257931Mammary Neoplasms, Human1CTD_human
TgeneC1458155Mammary Neoplasms1CTD_human
TgeneC1567741Alport Syndrome1CTD_human
TgeneC1567742Alport Syndrome, X-Linked1CTD_human
TgeneC1567743Alport Syndrome, Autosomal Dominant1CTD_human
TgeneC1567744Alport Syndrome, Autosomal Recessive1CTD_human
TgeneC1834478MACROTHROMBOCYTOPENIA AND PROGRESSIVE SENSORINEURAL DEAFNESS1CTD_human
TgeneC2931861Hemorrhagic hereditary nephritis1CTD_human
TgeneC4280711Leukocyte inclusion bodies1GENOMICS_ENGLAND
TgeneC4704874Mammary Carcinoma, Human1CTD_human