|
Fusion Gene Summary | |
Fusion Gene ORF analysis | |
Fusion Genomic Features | |
Fusion Protein Features | |
Fusion Gene Sequence | |
Fusion Gene PPI analysis | |
Related Drugs | |
Related Diseases |
Fusion gene:MYH9-USF2 (FusionGDB2 ID:56079) |
Fusion Gene Summary for MYH9-USF2 |
Fusion gene summary |
Fusion gene information | Fusion gene name: MYH9-USF2 | Fusion gene ID: 56079 | Hgene | Tgene | Gene symbol | MYH9 | USF2 | Gene ID | 4627 | 7392 |
Gene name | myosin heavy chain 9 | upstream transcription factor 2, c-fos interacting | |
Synonyms | BDPLT6|DFNA17|EPSTS|FTNS|MATINS|MHA|NMHC-II-A|NMMHC-IIA|NMMHCA | FIP|bHLHb12 | |
Cytomap | 22q12.3 | 19q13.12 | |
Type of gene | protein-coding | protein-coding | |
Description | myosin-9cellular myosin heavy chain, type Amyosin, heavy chain 9, non-musclenon-muscle myosin heavy chain 9non-muscle myosin heavy chain Anon-muscle myosin heavy chain IIanon-muscle myosin heavy polypeptide 9nonmuscle myosin heavy chain II-A | upstream stimulatory factor 2c-fos interacting proteinclass B basic helix-loop-helix protein 12major late transcription factor 2 | |
Modification date | 20200315 | 20200313 | |
UniProtAcc | . | . | |
Ensembl transtripts involved in fusion gene | ENST00000216181, ENST00000475726, ENST00000401701, | ENST00000343550, ENST00000222305, ENST00000595068, ENST00000379134, ENST00000594064, ENST00000600341, | |
Fusion gene scores | * DoF score | 55 X 51 X 22=61710 | 4 X 4 X 3=48 |
# samples | 77 | 4 | |
** MAII score | log2(77/61710*10)=-6.32450203855119 possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs). DoF>8 and MAII<0 | log2(4/48*10)=-0.263034405833794 possibly effective Gene in Pan-Cancer Fusion Genes (peGinPCFGs). DoF>8 and MAII<0 | |
Context | PubMed: MYH9 [Title/Abstract] AND USF2 [Title/Abstract] AND fusion [Title/Abstract] | ||
Most frequent breakpoint | MYH9(36680448)-USF2(35760348), # samples:1 | ||
Anticipated loss of major functional domain due to fusion event. |
* DoF score (Degree of Frequency) = # partners X # break points X # cancer types ** MAII score (Major Active Isofusion Index) = log2(# samples/DoF score*10) |
Gene ontology of each fusion partner gene with evidence of Inferred from Direct Assay (IDA) from Entrez |
Partner | Gene | GO ID | GO term | PubMed ID |
Hgene | MYH9 | GO:0001525 | angiogenesis | 16403913 |
Hgene | MYH9 | GO:0001778 | plasma membrane repair | 27325790 |
Hgene | MYH9 | GO:0006509 | membrane protein ectodomain proteolysis | 16186248 |
Hgene | MYH9 | GO:0030048 | actin filament-based movement | 12237319|15845534 |
Hgene | MYH9 | GO:0031032 | actomyosin structure organization | 24072716 |
Fusion gene breakpoints across MYH9 (5'-gene) * Click on the image to open the UCSC genome browser with custom track showing this image in a new window. |
Fusion gene breakpoints across USF2 (3'-gene) * Click on the image to open the UCSC genome browser with custom track showing this image in a new window. |
Fusion gene information from two resources (ChiTars 5.0 and ChimerDB 4.0) * All genome coordinats were lifted-over on hg19. * Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser. |
Source | Disease | Sample | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand |
ChimerDB4 | OV | TCGA-61-2003 | MYH9 | chr22 | 36680448 | - | USF2 | chr19 | 35760348 | + |
Top |
Fusion Gene ORF analysis for MYH9-USF2 |
Open reading frame (ORF) analsis of fusion genes based on Ensembl gene isoform structure. * Click on the break point to see the gene structure around the break point region using the UCSC Genome Browser. |
ORF | Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand |
In-frame | ENST00000216181 | ENST00000343550 | MYH9 | chr22 | 36680448 | - | USF2 | chr19 | 35760348 | + |
In-frame | ENST00000216181 | ENST00000222305 | MYH9 | chr22 | 36680448 | - | USF2 | chr19 | 35760348 | + |
In-frame | ENST00000216181 | ENST00000595068 | MYH9 | chr22 | 36680448 | - | USF2 | chr19 | 35760348 | + |
In-frame | ENST00000216181 | ENST00000379134 | MYH9 | chr22 | 36680448 | - | USF2 | chr19 | 35760348 | + |
In-frame | ENST00000216181 | ENST00000594064 | MYH9 | chr22 | 36680448 | - | USF2 | chr19 | 35760348 | + |
5CDS-intron | ENST00000216181 | ENST00000600341 | MYH9 | chr22 | 36680448 | - | USF2 | chr19 | 35760348 | + |
5UTR-3CDS | ENST00000475726 | ENST00000343550 | MYH9 | chr22 | 36680448 | - | USF2 | chr19 | 35760348 | + |
5UTR-3CDS | ENST00000475726 | ENST00000222305 | MYH9 | chr22 | 36680448 | - | USF2 | chr19 | 35760348 | + |
5UTR-3CDS | ENST00000475726 | ENST00000595068 | MYH9 | chr22 | 36680448 | - | USF2 | chr19 | 35760348 | + |
5UTR-3CDS | ENST00000475726 | ENST00000379134 | MYH9 | chr22 | 36680448 | - | USF2 | chr19 | 35760348 | + |
5UTR-3CDS | ENST00000475726 | ENST00000594064 | MYH9 | chr22 | 36680448 | - | USF2 | chr19 | 35760348 | + |
5UTR-intron | ENST00000475726 | ENST00000600341 | MYH9 | chr22 | 36680448 | - | USF2 | chr19 | 35760348 | + |
intron-3CDS | ENST00000401701 | ENST00000343550 | MYH9 | chr22 | 36680448 | - | USF2 | chr19 | 35760348 | + |
intron-3CDS | ENST00000401701 | ENST00000222305 | MYH9 | chr22 | 36680448 | - | USF2 | chr19 | 35760348 | + |
intron-3CDS | ENST00000401701 | ENST00000595068 | MYH9 | chr22 | 36680448 | - | USF2 | chr19 | 35760348 | + |
intron-3CDS | ENST00000401701 | ENST00000379134 | MYH9 | chr22 | 36680448 | - | USF2 | chr19 | 35760348 | + |
intron-3CDS | ENST00000401701 | ENST00000594064 | MYH9 | chr22 | 36680448 | - | USF2 | chr19 | 35760348 | + |
intron-intron | ENST00000401701 | ENST00000600341 | MYH9 | chr22 | 36680448 | - | USF2 | chr19 | 35760348 | + |
ORFfinder result based on the fusion transcript sequence of in-frame fusion genes. |
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | Seq length (transcript) | BP loci (transcript) | Predicted start (transcript) | Predicted stop (transcript) | Seq length (amino acids) |
ENST00000216181 | MYH9 | chr22 | 36680448 | - | ENST00000343550 | USF2 | chr19 | 35760348 | + | 7160 | 5823 | 231 | 6236 | 2001 |
ENST00000216181 | MYH9 | chr22 | 36680448 | - | ENST00000222305 | USF2 | chr19 | 35760348 | + | 7358 | 5823 | 231 | 6017 | 1928 |
ENST00000216181 | MYH9 | chr22 | 36680448 | - | ENST00000595068 | USF2 | chr19 | 35760348 | + | 7336 | 5823 | 231 | 6017 | 1928 |
ENST00000216181 | MYH9 | chr22 | 36680448 | - | ENST00000379134 | USF2 | chr19 | 35760348 | + | 6409 | 5823 | 231 | 6044 | 1937 |
ENST00000216181 | MYH9 | chr22 | 36680448 | - | ENST00000594064 | USF2 | chr19 | 35760348 | + | 6946 | 5823 | 231 | 6017 | 1928 |
DeepORF prediction of the coding potential based on the fusion transcript sequence of in-frame fusion genes. DeepORF is a coding potential classifier based on convolutional neural network by comparing the real Ribo-seq data. If the no-coding score < 0.5 and coding score > 0.5, then the in-frame fusion transcript is predicted as being likely translated. |
Henst | Tenst | Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | No-coding score | Coding score |
ENST00000216181 | ENST00000343550 | MYH9 | chr22 | 36680448 | - | USF2 | chr19 | 35760348 | + | 0.01933294 | 0.980667 |
ENST00000216181 | ENST00000222305 | MYH9 | chr22 | 36680448 | - | USF2 | chr19 | 35760348 | + | 0.019886332 | 0.9801137 |
ENST00000216181 | ENST00000595068 | MYH9 | chr22 | 36680448 | - | USF2 | chr19 | 35760348 | + | 0.019896464 | 0.9801035 |
ENST00000216181 | ENST00000379134 | MYH9 | chr22 | 36680448 | - | USF2 | chr19 | 35760348 | + | 0.017502636 | 0.9824974 |
ENST00000216181 | ENST00000594064 | MYH9 | chr22 | 36680448 | - | USF2 | chr19 | 35760348 | + | 0.019613214 | 0.9803868 |
Top |
Fusion Genomic Features for MYH9-USF2 |
FusionAI prediction of the potential fusion gene breakpoint based on the pre-mature RNA sequence context (+/- 5kb of individual partner genes, total 20kb length sequence). FusionAI is a fusion gene breakpoint classifier based on convolutional neural network by comparing the fusion positive and negative sequence context of ~ 20K fusion gene data. From here, we can have the relative potentency of the 20K genomic sequence how individual sequnce will be likely used as the gene fusion breakpoints. |
Hgene | Hchr | Hbp | Hstrand | Tgene | Tchr | Tbp | Tstrand | 1-p | p (fusion gene breakpoint) |
Distribution of 44 human genomic features loci across 20kb length fusion breakpoint regions. We integrated a total of 44 different types of human genomic feature loci information across five big categories including virus integration sites, repeats, structural variants, chromatin states, and gene expression regulation. More details are in help page. |
Distribution of 44 human genomic features loci across 20kb length fusion breakpoint regions that are ovelapped with the top 1% feature importance score regions. More details are in help page. |
Top |
Fusion Protein Features for MYH9-USF2 |
Go to FGviewer for the breakpoints of chr22:36680448-chr19:35760348 - FGviewer provides the online visualization of the retention search of the protein functional features across DNA, RNA, protein, and pathological levels. |
Main function of each fusion partner protein. (from UniProt) |
Hgene | Tgene |
. | . |
FUNCTION: Might normally function as a transcriptional repressor. EWS-fusion-proteins (EFPS) may play a role in the tumorigenic process. They may disturb gene expression by mimicking, or interfering with the normal function of CTD-POLII within the transcription initiation complex. They may also contribute to an aberrant activation of the fusion protein target genes. | FUNCTION: Might normally function as a transcriptional repressor. EWS-fusion-proteins (EFPS) may play a role in the tumorigenic process. They may disturb gene expression by mimicking, or interfering with the normal function of CTD-POLII within the transcription initiation complex. They may also contribute to an aberrant activation of the fusion protein target genes. |
Retention analysis result of each fusion partner protein across 39 protein features of UniProt such as six molecule processing features, 13 region features, four site features, six amino acid modification features, two natural variation features, five experimental info features, and 3 secondary structure features. Here, because of limited space for viewing, we only show the protein feature retention information belong to the 13 regional features. All retention annotation result can be downloaded at * Minus value of BPloci means that the break pointn is located before the CDS. |
- In-frame and retained protein feature among the 13 regional features. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note |
Hgene | MYH9 | chr22:36680448 | chr19:35760348 | ENST00000216181 | - | 39 | 41 | 27_77 | 1864.0 | 1961.0 | Domain | Myosin N-terminal SH3-like |
Hgene | MYH9 | chr22:36680448 | chr19:35760348 | ENST00000216181 | - | 39 | 41 | 779_808 | 1864.0 | 1961.0 | Domain | IQ |
Hgene | MYH9 | chr22:36680448 | chr19:35760348 | ENST00000216181 | - | 39 | 41 | 81_776 | 1864.0 | 1961.0 | Domain | Myosin motor |
Hgene | MYH9 | chr22:36680448 | chr19:35760348 | ENST00000216181 | - | 39 | 41 | 174_181 | 1864.0 | 1961.0 | Nucleotide binding | ATP |
Hgene | MYH9 | chr22:36680448 | chr19:35760348 | ENST00000216181 | - | 39 | 41 | 654_676 | 1864.0 | 1961.0 | Region | Note=Actin-binding |
Tgene | USF2 | chr22:36680448 | chr19:35760348 | ENST00000222305 | 0 | 10 | 245_248 | 20.666666666666668 | 347.0 | Compositional bias | Note=Poly-Arg | |
Tgene | USF2 | chr22:36680448 | chr19:35760348 | ENST00000343550 | 0 | 9 | 245_248 | 20.666666666666668 | 280.0 | Compositional bias | Note=Poly-Arg | |
Tgene | USF2 | chr22:36680448 | chr19:35760348 | ENST00000379134 | 0 | 8 | 245_248 | 20.666666666666668 | 216.0 | Compositional bias | Note=Poly-Arg | |
Tgene | USF2 | chr22:36680448 | chr19:35760348 | ENST00000595068 | 0 | 10 | 245_248 | 20.666666666666668 | 339.0 | Compositional bias | Note=Poly-Arg | |
Tgene | USF2 | chr22:36680448 | chr19:35760348 | ENST00000222305 | 0 | 10 | 235_290 | 20.666666666666668 | 347.0 | Domain | bHLH | |
Tgene | USF2 | chr22:36680448 | chr19:35760348 | ENST00000343550 | 0 | 9 | 235_290 | 20.666666666666668 | 280.0 | Domain | bHLH | |
Tgene | USF2 | chr22:36680448 | chr19:35760348 | ENST00000379134 | 0 | 8 | 235_290 | 20.666666666666668 | 216.0 | Domain | bHLH | |
Tgene | USF2 | chr22:36680448 | chr19:35760348 | ENST00000595068 | 0 | 10 | 235_290 | 20.666666666666668 | 339.0 | Domain | bHLH | |
Tgene | USF2 | chr22:36680448 | chr19:35760348 | ENST00000222305 | 0 | 10 | 307_328 | 20.666666666666668 | 347.0 | Region | Note=Leucine-zipper | |
Tgene | USF2 | chr22:36680448 | chr19:35760348 | ENST00000343550 | 0 | 9 | 307_328 | 20.666666666666668 | 280.0 | Region | Note=Leucine-zipper | |
Tgene | USF2 | chr22:36680448 | chr19:35760348 | ENST00000379134 | 0 | 8 | 307_328 | 20.666666666666668 | 216.0 | Region | Note=Leucine-zipper | |
Tgene | USF2 | chr22:36680448 | chr19:35760348 | ENST00000595068 | 0 | 10 | 307_328 | 20.666666666666668 | 339.0 | Region | Note=Leucine-zipper |
- In-frame and not-retained protein feature among the 13 regional features. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Protein feature | Protein feature note |
Hgene | MYH9 | chr22:36680448 | chr19:35760348 | ENST00000216181 | - | 39 | 41 | 837_1926 | 1864.0 | 1961.0 | Coiled coil | Ontology_term=ECO:0000255 |
Tgene | USF2 | chr22:36680448 | chr19:35760348 | ENST00000222305 | 0 | 10 | 11_20 | 20.666666666666668 | 347.0 | Compositional bias | Note=Poly-Ala | |
Tgene | USF2 | chr22:36680448 | chr19:35760348 | ENST00000343550 | 0 | 9 | 11_20 | 20.666666666666668 | 280.0 | Compositional bias | Note=Poly-Ala | |
Tgene | USF2 | chr22:36680448 | chr19:35760348 | ENST00000379134 | 0 | 8 | 11_20 | 20.666666666666668 | 216.0 | Compositional bias | Note=Poly-Ala | |
Tgene | USF2 | chr22:36680448 | chr19:35760348 | ENST00000595068 | 0 | 10 | 11_20 | 20.666666666666668 | 339.0 | Compositional bias | Note=Poly-Ala |
Top |
Fusion Gene Sequence for MYH9-USF2 |
For in-frame fusion transcripts, we provide the fusion transcript sequences and fusion amino acid sequences. To have fusion amino acid sequence, we ran ORFfinder and chose the longest ORF among the all predicted ones. |
>In-frame_ENST00000216181_ENST00000343550_TCGA-61-2003_MYH9_chr22_36680448_-_USF2_chr19_35760348_length(transcript)=7160nt_BP=5823nt GAGGGCGGGGCGGGAAGGCGGCGAGGAGCCGAGCTGGGTGCGGTGAGGCGCGCAGATCACCGCGGTTCCTGGGCAGGGCACGGAAGGCTA AGCAAGGCTGACCTGCTGCAGCTCCCGCCTCGTGCGCTCGCCCCACCCGGCCGCCGCCCGAGCGCTCGAGAAAGTCCTCTCGGGAGAAGC AGCGCCTGTTCCCGGGGCAGATCCAGGTTCAGGTCCTGGCTATAAGTCACCATGGCACAGCAAGCTGCCGATAAGTATCTCTATGTGGAT AAAAACTTCATCAACAATCCGCTGGCCCAGGCCGACTGGGCTGCCAAGAAGCTGGTATGGGTGCCTTCCGACAAGAGTGGCTTTGAGCCA GCCAGCCTCAAGGAGGAGGTGGGCGAAGAGGCCATCGTGGAGCTGGTGGAGAATGGGAAGAAGGTGAAGGTGAACAAGGATGACATCCAG AAGATGAACCCGCCCAAGTTCTCCAAGGTGGAGGACATGGCAGAGCTCACGTGCCTCAACGAAGCCTCGGTGCTGCACAACCTCAAGGAG CGTTACTACTCAGGGCTCATCTACACCTATTCAGGCCTGTTCTGTGTGGTCATCAATCCTTACAAGAACCTGCCCATCTACTCTGAAGAG ATTGTGGAAATGTACAAGGGCAAGAAGAGGCACGAGATGCCCCCTCACATCTATGCCATCACAGACACCGCCTACAGGAGTATGATGCAA GACCGAGAAGATCAATCCATCTTGTGCACTGGTGAATCTGGAGCTGGCAAGACGGAGAACACCAAGAAGGTCATCCAGTATCTGGCGTAC GTGGCGTCCTCGCACAAGAGCAAGAAGGACCAGGGCGAGCTGGAGCGGCAGCTGCTGCAGGCCAACCCCATCCTGGAGGCCTTCGGGAAC GCCAAGACCGTGAAGAATGACAACTCCTCCCGCTTCGGCAAATTCATTCGCATCAACTTTGATGTCAATGGCTACATTGTTGGAGCCAAC ATTGAGACTTATCTTTTGGAGAAATCTCGTGCTATCCGCCAAGCCAAGGAAGAACGGACCTTCCACATCTTCTATTATCTCCTGTCTGGG GCTGGAGAGCACCTGAAGACCGATCTCCTGTTGGAGCCGTACAACAAATACCGCTTCCTGTCCAATGGACACGTCACCATCCCCGGGCAG CAGGACAAGGACATGTTCCAGGAGACCATGGAGGCCATGAGGATTATGGGCATCCCAGAAGAGGAGCAAATGGGCCTGCTGCGGGTCATC TCAGGGGTTCTTCAGCTCGGCAACATCGTCTTCAAGAAGGAGCGGAACACTGACCAGGCGTCCATGCCCGACAACACAGCTGCCCAAAAG GTGTCCCATCTCTTGGGTATCAATGTGACCGATTTCACCAGAGGAATCCTCACCCCGCGCATCAAGGTGGGACGGGATTACGTCCAGAAG GCGCAGACTAAAGAGCAGGCTGACTTTGCCATCGAGGCCTTGGCCAAGGCGACCTATGAGCGGATGTTCCGCTGGCTGGTGCTGCGCATC AACAAGGCTCTGGACAAGACCAAGAGGCAGGGCGCCTCCTTCATCGGGATCCTGGACATTGCCGGCTTCGAGATCTTTGATCTGAACTCG TTTGAGCAGCTGTGCATCAATTACACCAATGAGAAGCTGCAGCAGCTCTTCAACCACACCATGTTCATCCTGGAGCAGGAGGAGTACCAG CGCGAGGGCATCGAGTGGAACTTCATCGACTTTGGCCTCGACCTGCAGCCCTGCATCGACCTCATTGAGAAGCCAGCAGGCCCCCCGGGC ATTCTGGCCCTGCTGGACGAGGAGTGCTGGTTCCCCAAAGCCACCGACAAGAGCTTCGTGGAGAAGGTGATGCAGGAGCAGGGCACCCAC CCCAAGTTCCAGAAGCCCAAGCAGCTGAAGGACAAAGCTGATTTCTGCATTATCCACTATGCCGGCAAGGTGGATTACAAAGCTGACGAG TGGCTGATGAAGAACATGGATCCCCTGAATGACAACATCGCCACACTGCTCCACCAGTCCTCTGACAAGTTTGTCTCGGAGCTGTGGAAG GATGTGGACCGCATCATCGGCCTGGACCAGGTGGCCGGCATGTCGGAGACCGCACTGCCCGGGGCCTTCAAGACGCGGAAGGGCATGTTC CGCACTGTGGGGCAGCTTTACAAGGAGCAGCTGGCCAAGCTGATGGCTACGCTGAGGAACACGAACCCCAACTTTGTCCGCTGCATCATC CCCAACCACGAGAAGAAGGCCGGCAAGCTGGACCCGCATCTCGTGCTGGACCAGCTGCGCTGCAACGGTGTTCTCGAGGGCATCCGTATC TGCCGCCAGGGCTTCCCCAACAGGGTGGTCTTCCAGGAGTTTCGGCAGAGATATGAGATCCTGACTCCAAACTCCATTCCCAAGGGTTTC ATGGACGGGAAGCAGGCGTGCGTGCTCATGATAAAAGCCCTGGAGCTCGACAGCAATCTGTACCGCATTGGCCAGAGCAAAGTCTTCTTC CGTGCCGGTGTGCTGGCCCACCTGGAGGAGGAGCGAGACCTGAAGATCACCGACGTCATCATAGGGTTCCAGGCCTGCTGCAGGGGCTAC CTGGCCAGGAAAGCATTTGCCAAGCGGCAGCAGCAGCTTACCGCCATGAAGGTCCTCCAGCGGAACTGCGCTGCCTACCTGAAGCTGCGG AACTGGCAGTGGTGGCGGCTCTTCACCAAGGTCAAGCCGCTGCTGCAGGTGAGCCGGCAGGAGGAGGAGATGATGGCCAAGGAGGAGGAG CTGGTGAAGGTCAGAGAGAAGCAGCTGGCTGCGGAGAACAGGCTCACGGAGATGGAGACGCTGCAGTCTCAGCTCATGGCAGAGAAATTG CAGCTGCAGGAGCAGCTCCAGGCAGAAACCGAGCTGTGTGCCGAGGCTGAGGAGCTCCGGGCCCGCCTGACCGCCAAGAAGCAGGAATTA GAAGAGATCTGCCATGACCTAGAGGCCAGGGTGGAGGAGGAGGAGGAGCGCTGCCAGCACCTGCAGGCGGAGAAGAAGAAGATGCAGCAG AACATCCAGGAGCTTGAGGAGCAGCTGGAGGAGGAGGAGAGCGCCCGGCAGAAGCTGCAGCTGGAGAAGGTGACCACCGAGGCGAAGCTG AAAAAGCTGGAGGAGGAGCAGATCATCCTGGAGGACCAGAACTGCAAGCTGGCCAAGGAAAAGAAACTGCTGGAAGACAGAATAGCTGAG TTCACCACCAACCTCACAGAAGAGGAGGAGAAATCTAAGAGCCTCGCCAAGCTCAAGAACAAGCATGAGGCAATGATCACTGACTTGGAA GAGCGCCTCCGCAGGGAGGAGAAGCAGCGACAGGAGCTGGAGAAGACCCGCCGGAAGCTGGAGGGAGACTCCACAGACCTCAGCGACCAG ATCGCCGAGCTCCAGGCCCAGATCGCGGAGCTCAAGATGCAGCTGGCCAAGAAAGAGGAGGAGCTCCAGGCCGCCCTGGCCAGAGTGGAA GAGGAAGCTGCCCAGAAGAACATGGCCCTCAAGAAGATCCGGGAGCTGGAATCTCAGATCTCTGAACTCCAGGAAGACCTGGAGTCTGAG CGTGCTTCCAGGAATAAAGCTGAGAAGCAGAAACGGGACCTTGGGGAAGAGCTAGAGGCTCTGAAAACAGAGTTGGAGGACACGCTGGAT TCCACAGCTGCCCAGCAGGAGCTCAGGTCAAAACGTGAGCAGGAGGTGAACATCCTGAAGAAGACCCTGGAGGAGGAGGCCAAGACCCAC GAGGCCCAGATCCAGGAGATGAGGCAGAAGCACTCACAGGCCGTGGAGGAGCTGGCGGAGCAGCTGGAGCAGACGAAGCGGGTGAAAGCA AACCTCGAGAAGGCAAAGCAGACTCTGGAGAACGAGCGGGGGGAGCTGGCCAACGAGGTGAAGGTGCTGCTGCAGGGCAAAGGGGACTCG GAGCACAAGCGCAAGAAAGTGGAGGCGCAGCTGCAGGAGCTGCAGGTCAAGTTCAACGAGGGAGAGCGCGTGCGCACAGAGCTGGCCGAC AAGGTCACCAAGCTGCAGGTGGAGCTGGACAACGTGACCGGGCTTCTCAGCCAGTCCGACAGCAAGTCCAGCAAGCTCACCAAGGACTTC TCCGCGCTGGAGTCCCAGCTGCAGGACACTCAGGAGCTGCTGCAGGAGGAGAACCGGCAGAAGCTGAGCCTGAGCACCAAGCTCAAGCAG GTGGAGGACGAGAAGAATTCCTTCCGGGAGCAGCTGGAGGAGGAGGAGGAGGCCAAGCACAACCTGGAGAAGCAGATCGCCACCCTCCAT GCCCAGGTGGCCGACATGAAAAAGAAGATGGAGGACAGTGTGGGGTGCCTGGAAACTGCTGAGGAGGTGAAGAGGAAGCTCCAGAAGGAC CTGGAGGGCCTGAGCCAGCGGCACGAGGAGAAGGTGGCCGCCTACGACAAGCTGGAGAAGACCAAGACGCGGCTGCAGCAGGAGCTGGAC GACCTGCTGGTGGACCTGGACCACCAGCGCCAGAGCGCGTGCAACCTGGAGAAGAAGCAGAAGAAGTTTGACCAGCTCCTGGCGGAGGAG AAGACCATCTCTGCCAAGTATGCAGAGGAGCGCGACCGGGCTGAGGCGGAGGCCCGAGAGAAGGAGACCAAGGCTCTGTCGCTGGCCCGG GCCCTGGAGGAAGCCATGGAGCAGAAGGCGGAGCTGGAGCGGCTCAACAAGCAGTTCCGCACGGAGATGGAGGACCTTATGAGCTCCAAG GATGATGTGGGCAAGAGTGTCCACGAGCTGGAGAAGTCCAAGCGGGCCCTAGAGCAGCAGGTGGAGGAGATGAAGACGCAGCTGGAAGAG CTGGAGGACGAGCTGCAGGCCACCGAAGATGCCAAGCTGCGGTTGGAGGTCAACCTGCAGGCCATGAAGGCCCAGTTCGAGCGGGACCTG CAGGGCCGGGACGAGCAGAGCGAGGAGAAGAAGAAGCAGCTGGTCAGACAGGTGCGGGAGATGGAGGCAGAGCTGGAGGACGAGAGGAAG CAGCGCTCGATGGCAGTGGCCGCCCGGAAGAAGCTGGAGATGGACCTGAAGGACCTGGAGGCGCACATCGACTCGGCCAACAAGAACCGG GACGAAGCCATCAAACAGCTGCGGAAGCTGCAGGCCCAGATGAAGGACTGCATGCGCGAGCTGGATGACACCCGCGCCTCTCGTGAGGAG ATCCTGGCCCAGGCCAAAGAGAACGAGAAGAAGCTGAAGAGCATGGAGGCCGAGATGATCCAGTTGCAGGAGGAACTGGCAGCCGCGGAG CGTGCCAAGCGCCAGGCCCAGCAGGAGCGGGATGAGCTGGCTGACGAGATCGCCAACAGCAGCGGCAAAGGAGCCCTGGCGTTAGAGGAG AAGCGGCGTCTGGAGGCCCGCATCGCCCAGCTGGAGGAGGAGCTGGAGGAGGAGCAGGGCAACACGGAGCTGATCAACGACCGGCTGAAG AAGGCCAACCTGCAGATCGACCAGATCAACACCGACCTGAACCTGGAGCGCAGCCACGCCCAGAAGAACGAGAATGCTCGGCAGCAGCTG GAACGCCAGAACAAGGAGCTTAAGGTCAAGCTGCAGGAGATGGAGGGCACTGTCAAGTCCAAGTACAAGGCCTCCATCACCGCCCTCGAG GCCAAGATTGCACAGCTGGAGGAGCAGCTGGACAACGAGACCAAGGAGCGCCAGGCAGCCTGCAAACAGGTGCGTCGGACCGAGAAGAAG CTGAAGGATGTGCTGCTGCAGGTGGATGACGAGCGGAGGAACGCCGAGCAGTACAAGGACCAGCCACGACAAGGGACCCGAGGCGGAGGA GGGCGTCGAGCTGCAGGAAGGCGGGGACGGCCCAGGAGCGGAGGAGCAGACAGCGGTGGCCATCACCAGCGTCCAGCAGGCGGCGTTCGG CGACCACAACATCCAGTACCAGTTCCGCACAGAGACAAATGGAGGACAGGCTGTGATCCAAAATCCCTTCAGCAATGGTGGCAGTCCGGC GGCCGAGGCTGTCAGCGGGGAGGCACGATTTGCCTATTTCCCAGCGTCCAGTGTGGGAGATACTACGGCTGTGTCCGTACAGACCACAGA CCAGAGCTTGCAGGCTGGAGGCCAGTTCTACGTCATGATGACGCCCCAGGATGTGCTTCAGACAGGAACACAGAGGACGATCGCCCCCCG GACACACCCTTACTCTCCAAAAATTGATGGAACCAGAACACCCCGAGATGAGAGGAGAAGAGCCCAGCACAACGAAGTGGAGCGGAGGCG GAGGGACAAGATCAACAACTGGATCGTCCAGCTTTCGAAAATCATTCCAGACTGTAACGCAGACAACAGCAAGACGGGAGCGAGTAAAGG AGGGATCCTGTCCAAGGCCTGCGATTACATCCGGGAGTTGCGCCAGACCAACCAGCGCATGCAGGAGACCTTCAAAGAGGCCGAGCGGCT GCAGATGGACAACGAGCTCCTGAGGCAGCAGATCGAGGAGCTGAAGAATGAGAACGCCCTGCTTCGAGCCCAGCTGCAGCAGCACAACCT GGAGATGGTGGGCGAGGGCACCCGGCAGTGACGCCCGCCACCACCACGCAGCCGCCGCCGCCCACGCCGGCCTCTGCTGCCCCCTTCCCC AGCCCTTAGCACAGAGAGGGACACATGCCCCTCCCCCAGCTGCGTTTTTTTATAGTAGATTTTTAACAAAAAACGGGGAGAAATAATGCA TTTCTGTGGATACAGTGCCCACCGCCCTCCTCCACTTGGAAACGGTATCCTCCCTGCCCATCCGTCTGTCTGTCGCCCTTCTCCCGGCCC TCACTAAGCCCCGGCACTTCTAGTGGTCTCACCTGGAGGCAAGAGGGAGGGGACAGAGGCCCTGCCACGTCCCGCTGCCTCCTGCTCTCT GGAGGTACTGAGACAGGGTGCTGATGGGAAGGAGGGGAGCCTTTGGGGGGCCACCCGGGGCCTGGACCTATGCAGGGAGGCCACGTCCCA CCCCACCTCTTGTTTCTGGGTCCCTGCTCCCCTTTGGGGGTGTGTGTGTGTGTTTTAATTTTCTTTATGGAAAAATTGACAAAAAAAAAA >In-frame_ENST00000216181_ENST00000343550_TCGA-61-2003_MYH9_chr22_36680448_-_USF2_chr19_35760348_length(amino acids)=2001AA_start in transcript=231_stop in transcript=6236 MAQQAADKYLYVDKNFINNPLAQADWAAKKLVWVPSDKSGFEPASLKEEVGEEAIVELVENGKKVKVNKDDIQKMNPPKFSKVEDMAELT CLNEASVLHNLKERYYSGLIYTYSGLFCVVINPYKNLPIYSEEIVEMYKGKKRHEMPPHIYAITDTAYRSMMQDREDQSILCTGESGAGK TENTKKVIQYLAYVASSHKSKKDQGELERQLLQANPILEAFGNAKTVKNDNSSRFGKFIRINFDVNGYIVGANIETYLLEKSRAIRQAKE ERTFHIFYYLLSGAGEHLKTDLLLEPYNKYRFLSNGHVTIPGQQDKDMFQETMEAMRIMGIPEEEQMGLLRVISGVLQLGNIVFKKERNT DQASMPDNTAAQKVSHLLGINVTDFTRGILTPRIKVGRDYVQKAQTKEQADFAIEALAKATYERMFRWLVLRINKALDKTKRQGASFIGI LDIAGFEIFDLNSFEQLCINYTNEKLQQLFNHTMFILEQEEYQREGIEWNFIDFGLDLQPCIDLIEKPAGPPGILALLDEECWFPKATDK SFVEKVMQEQGTHPKFQKPKQLKDKADFCIIHYAGKVDYKADEWLMKNMDPLNDNIATLLHQSSDKFVSELWKDVDRIIGLDQVAGMSET ALPGAFKTRKGMFRTVGQLYKEQLAKLMATLRNTNPNFVRCIIPNHEKKAGKLDPHLVLDQLRCNGVLEGIRICRQGFPNRVVFQEFRQR YEILTPNSIPKGFMDGKQACVLMIKALELDSNLYRIGQSKVFFRAGVLAHLEEERDLKITDVIIGFQACCRGYLARKAFAKRQQQLTAMK VLQRNCAAYLKLRNWQWWRLFTKVKPLLQVSRQEEEMMAKEEELVKVREKQLAAENRLTEMETLQSQLMAEKLQLQEQLQAETELCAEAE ELRARLTAKKQELEEICHDLEARVEEEEERCQHLQAEKKKMQQNIQELEEQLEEEESARQKLQLEKVTTEAKLKKLEEEQIILEDQNCKL AKEKKLLEDRIAEFTTNLTEEEEKSKSLAKLKNKHEAMITDLEERLRREEKQRQELEKTRRKLEGDSTDLSDQIAELQAQIAELKMQLAK KEEELQAALARVEEEAAQKNMALKKIRELESQISELQEDLESERASRNKAEKQKRDLGEELEALKTELEDTLDSTAAQQELRSKREQEVN ILKKTLEEEAKTHEAQIQEMRQKHSQAVEELAEQLEQTKRVKANLEKAKQTLENERGELANEVKVLLQGKGDSEHKRKKVEAQLQELQVK FNEGERVRTELADKVTKLQVELDNVTGLLSQSDSKSSKLTKDFSALESQLQDTQELLQEENRQKLSLSTKLKQVEDEKNSFREQLEEEEE AKHNLEKQIATLHAQVADMKKKMEDSVGCLETAEEVKRKLQKDLEGLSQRHEEKVAAYDKLEKTKTRLQQELDDLLVDLDHQRQSACNLE KKQKKFDQLLAEEKTISAKYAEERDRAEAEAREKETKALSLARALEEAMEQKAELERLNKQFRTEMEDLMSSKDDVGKSVHELEKSKRAL EQQVEEMKTQLEELEDELQATEDAKLRLEVNLQAMKAQFERDLQGRDEQSEEKKKQLVRQVREMEAELEDERKQRSMAVAARKKLEMDLK DLEAHIDSANKNRDEAIKQLRKLQAQMKDCMRELDDTRASREEILAQAKENEKKLKSMEAEMIQLQEELAAAERAKRQAQQERDELADEI ANSSGKGALALEEKRRLEARIAQLEEELEEEQGNTELINDRLKKANLQIDQINTDLNLERSHAQKNENARQQLERQNKELKVKLQEMEGT VKSKYKASITALEAKIAQLEEQLDNETKERQAACKQVRRTEKKLKDVLLQVDDERRNAEQYKDQPRQGTRGGGGRRAAGRRGRPRSGGAD SGGHHQRPAGGVRRPQHPVPVPHRDKWRTGCDPKSLQQWWQSGGRGCQRGGTICLFPSVQCGRYYGCVRTDHRPELAGWRPVLRHDDAPG -------------------------------------------------------------- >In-frame_ENST00000216181_ENST00000222305_TCGA-61-2003_MYH9_chr22_36680448_-_USF2_chr19_35760348_length(transcript)=7358nt_BP=5823nt GAGGGCGGGGCGGGAAGGCGGCGAGGAGCCGAGCTGGGTGCGGTGAGGCGCGCAGATCACCGCGGTTCCTGGGCAGGGCACGGAAGGCTA AGCAAGGCTGACCTGCTGCAGCTCCCGCCTCGTGCGCTCGCCCCACCCGGCCGCCGCCCGAGCGCTCGAGAAAGTCCTCTCGGGAGAAGC AGCGCCTGTTCCCGGGGCAGATCCAGGTTCAGGTCCTGGCTATAAGTCACCATGGCACAGCAAGCTGCCGATAAGTATCTCTATGTGGAT AAAAACTTCATCAACAATCCGCTGGCCCAGGCCGACTGGGCTGCCAAGAAGCTGGTATGGGTGCCTTCCGACAAGAGTGGCTTTGAGCCA GCCAGCCTCAAGGAGGAGGTGGGCGAAGAGGCCATCGTGGAGCTGGTGGAGAATGGGAAGAAGGTGAAGGTGAACAAGGATGACATCCAG AAGATGAACCCGCCCAAGTTCTCCAAGGTGGAGGACATGGCAGAGCTCACGTGCCTCAACGAAGCCTCGGTGCTGCACAACCTCAAGGAG CGTTACTACTCAGGGCTCATCTACACCTATTCAGGCCTGTTCTGTGTGGTCATCAATCCTTACAAGAACCTGCCCATCTACTCTGAAGAG ATTGTGGAAATGTACAAGGGCAAGAAGAGGCACGAGATGCCCCCTCACATCTATGCCATCACAGACACCGCCTACAGGAGTATGATGCAA GACCGAGAAGATCAATCCATCTTGTGCACTGGTGAATCTGGAGCTGGCAAGACGGAGAACACCAAGAAGGTCATCCAGTATCTGGCGTAC GTGGCGTCCTCGCACAAGAGCAAGAAGGACCAGGGCGAGCTGGAGCGGCAGCTGCTGCAGGCCAACCCCATCCTGGAGGCCTTCGGGAAC GCCAAGACCGTGAAGAATGACAACTCCTCCCGCTTCGGCAAATTCATTCGCATCAACTTTGATGTCAATGGCTACATTGTTGGAGCCAAC ATTGAGACTTATCTTTTGGAGAAATCTCGTGCTATCCGCCAAGCCAAGGAAGAACGGACCTTCCACATCTTCTATTATCTCCTGTCTGGG GCTGGAGAGCACCTGAAGACCGATCTCCTGTTGGAGCCGTACAACAAATACCGCTTCCTGTCCAATGGACACGTCACCATCCCCGGGCAG CAGGACAAGGACATGTTCCAGGAGACCATGGAGGCCATGAGGATTATGGGCATCCCAGAAGAGGAGCAAATGGGCCTGCTGCGGGTCATC TCAGGGGTTCTTCAGCTCGGCAACATCGTCTTCAAGAAGGAGCGGAACACTGACCAGGCGTCCATGCCCGACAACACAGCTGCCCAAAAG GTGTCCCATCTCTTGGGTATCAATGTGACCGATTTCACCAGAGGAATCCTCACCCCGCGCATCAAGGTGGGACGGGATTACGTCCAGAAG GCGCAGACTAAAGAGCAGGCTGACTTTGCCATCGAGGCCTTGGCCAAGGCGACCTATGAGCGGATGTTCCGCTGGCTGGTGCTGCGCATC AACAAGGCTCTGGACAAGACCAAGAGGCAGGGCGCCTCCTTCATCGGGATCCTGGACATTGCCGGCTTCGAGATCTTTGATCTGAACTCG TTTGAGCAGCTGTGCATCAATTACACCAATGAGAAGCTGCAGCAGCTCTTCAACCACACCATGTTCATCCTGGAGCAGGAGGAGTACCAG CGCGAGGGCATCGAGTGGAACTTCATCGACTTTGGCCTCGACCTGCAGCCCTGCATCGACCTCATTGAGAAGCCAGCAGGCCCCCCGGGC ATTCTGGCCCTGCTGGACGAGGAGTGCTGGTTCCCCAAAGCCACCGACAAGAGCTTCGTGGAGAAGGTGATGCAGGAGCAGGGCACCCAC CCCAAGTTCCAGAAGCCCAAGCAGCTGAAGGACAAAGCTGATTTCTGCATTATCCACTATGCCGGCAAGGTGGATTACAAAGCTGACGAG TGGCTGATGAAGAACATGGATCCCCTGAATGACAACATCGCCACACTGCTCCACCAGTCCTCTGACAAGTTTGTCTCGGAGCTGTGGAAG GATGTGGACCGCATCATCGGCCTGGACCAGGTGGCCGGCATGTCGGAGACCGCACTGCCCGGGGCCTTCAAGACGCGGAAGGGCATGTTC CGCACTGTGGGGCAGCTTTACAAGGAGCAGCTGGCCAAGCTGATGGCTACGCTGAGGAACACGAACCCCAACTTTGTCCGCTGCATCATC CCCAACCACGAGAAGAAGGCCGGCAAGCTGGACCCGCATCTCGTGCTGGACCAGCTGCGCTGCAACGGTGTTCTCGAGGGCATCCGTATC TGCCGCCAGGGCTTCCCCAACAGGGTGGTCTTCCAGGAGTTTCGGCAGAGATATGAGATCCTGACTCCAAACTCCATTCCCAAGGGTTTC ATGGACGGGAAGCAGGCGTGCGTGCTCATGATAAAAGCCCTGGAGCTCGACAGCAATCTGTACCGCATTGGCCAGAGCAAAGTCTTCTTC CGTGCCGGTGTGCTGGCCCACCTGGAGGAGGAGCGAGACCTGAAGATCACCGACGTCATCATAGGGTTCCAGGCCTGCTGCAGGGGCTAC CTGGCCAGGAAAGCATTTGCCAAGCGGCAGCAGCAGCTTACCGCCATGAAGGTCCTCCAGCGGAACTGCGCTGCCTACCTGAAGCTGCGG AACTGGCAGTGGTGGCGGCTCTTCACCAAGGTCAAGCCGCTGCTGCAGGTGAGCCGGCAGGAGGAGGAGATGATGGCCAAGGAGGAGGAG CTGGTGAAGGTCAGAGAGAAGCAGCTGGCTGCGGAGAACAGGCTCACGGAGATGGAGACGCTGCAGTCTCAGCTCATGGCAGAGAAATTG CAGCTGCAGGAGCAGCTCCAGGCAGAAACCGAGCTGTGTGCCGAGGCTGAGGAGCTCCGGGCCCGCCTGACCGCCAAGAAGCAGGAATTA GAAGAGATCTGCCATGACCTAGAGGCCAGGGTGGAGGAGGAGGAGGAGCGCTGCCAGCACCTGCAGGCGGAGAAGAAGAAGATGCAGCAG AACATCCAGGAGCTTGAGGAGCAGCTGGAGGAGGAGGAGAGCGCCCGGCAGAAGCTGCAGCTGGAGAAGGTGACCACCGAGGCGAAGCTG AAAAAGCTGGAGGAGGAGCAGATCATCCTGGAGGACCAGAACTGCAAGCTGGCCAAGGAAAAGAAACTGCTGGAAGACAGAATAGCTGAG TTCACCACCAACCTCACAGAAGAGGAGGAGAAATCTAAGAGCCTCGCCAAGCTCAAGAACAAGCATGAGGCAATGATCACTGACTTGGAA GAGCGCCTCCGCAGGGAGGAGAAGCAGCGACAGGAGCTGGAGAAGACCCGCCGGAAGCTGGAGGGAGACTCCACAGACCTCAGCGACCAG ATCGCCGAGCTCCAGGCCCAGATCGCGGAGCTCAAGATGCAGCTGGCCAAGAAAGAGGAGGAGCTCCAGGCCGCCCTGGCCAGAGTGGAA GAGGAAGCTGCCCAGAAGAACATGGCCCTCAAGAAGATCCGGGAGCTGGAATCTCAGATCTCTGAACTCCAGGAAGACCTGGAGTCTGAG CGTGCTTCCAGGAATAAAGCTGAGAAGCAGAAACGGGACCTTGGGGAAGAGCTAGAGGCTCTGAAAACAGAGTTGGAGGACACGCTGGAT TCCACAGCTGCCCAGCAGGAGCTCAGGTCAAAACGTGAGCAGGAGGTGAACATCCTGAAGAAGACCCTGGAGGAGGAGGCCAAGACCCAC GAGGCCCAGATCCAGGAGATGAGGCAGAAGCACTCACAGGCCGTGGAGGAGCTGGCGGAGCAGCTGGAGCAGACGAAGCGGGTGAAAGCA AACCTCGAGAAGGCAAAGCAGACTCTGGAGAACGAGCGGGGGGAGCTGGCCAACGAGGTGAAGGTGCTGCTGCAGGGCAAAGGGGACTCG GAGCACAAGCGCAAGAAAGTGGAGGCGCAGCTGCAGGAGCTGCAGGTCAAGTTCAACGAGGGAGAGCGCGTGCGCACAGAGCTGGCCGAC AAGGTCACCAAGCTGCAGGTGGAGCTGGACAACGTGACCGGGCTTCTCAGCCAGTCCGACAGCAAGTCCAGCAAGCTCACCAAGGACTTC TCCGCGCTGGAGTCCCAGCTGCAGGACACTCAGGAGCTGCTGCAGGAGGAGAACCGGCAGAAGCTGAGCCTGAGCACCAAGCTCAAGCAG GTGGAGGACGAGAAGAATTCCTTCCGGGAGCAGCTGGAGGAGGAGGAGGAGGCCAAGCACAACCTGGAGAAGCAGATCGCCACCCTCCAT GCCCAGGTGGCCGACATGAAAAAGAAGATGGAGGACAGTGTGGGGTGCCTGGAAACTGCTGAGGAGGTGAAGAGGAAGCTCCAGAAGGAC CTGGAGGGCCTGAGCCAGCGGCACGAGGAGAAGGTGGCCGCCTACGACAAGCTGGAGAAGACCAAGACGCGGCTGCAGCAGGAGCTGGAC GACCTGCTGGTGGACCTGGACCACCAGCGCCAGAGCGCGTGCAACCTGGAGAAGAAGCAGAAGAAGTTTGACCAGCTCCTGGCGGAGGAG AAGACCATCTCTGCCAAGTATGCAGAGGAGCGCGACCGGGCTGAGGCGGAGGCCCGAGAGAAGGAGACCAAGGCTCTGTCGCTGGCCCGG GCCCTGGAGGAAGCCATGGAGCAGAAGGCGGAGCTGGAGCGGCTCAACAAGCAGTTCCGCACGGAGATGGAGGACCTTATGAGCTCCAAG GATGATGTGGGCAAGAGTGTCCACGAGCTGGAGAAGTCCAAGCGGGCCCTAGAGCAGCAGGTGGAGGAGATGAAGACGCAGCTGGAAGAG CTGGAGGACGAGCTGCAGGCCACCGAAGATGCCAAGCTGCGGTTGGAGGTCAACCTGCAGGCCATGAAGGCCCAGTTCGAGCGGGACCTG CAGGGCCGGGACGAGCAGAGCGAGGAGAAGAAGAAGCAGCTGGTCAGACAGGTGCGGGAGATGGAGGCAGAGCTGGAGGACGAGAGGAAG CAGCGCTCGATGGCAGTGGCCGCCCGGAAGAAGCTGGAGATGGACCTGAAGGACCTGGAGGCGCACATCGACTCGGCCAACAAGAACCGG GACGAAGCCATCAAACAGCTGCGGAAGCTGCAGGCCCAGATGAAGGACTGCATGCGCGAGCTGGATGACACCCGCGCCTCTCGTGAGGAG ATCCTGGCCCAGGCCAAAGAGAACGAGAAGAAGCTGAAGAGCATGGAGGCCGAGATGATCCAGTTGCAGGAGGAACTGGCAGCCGCGGAG CGTGCCAAGCGCCAGGCCCAGCAGGAGCGGGATGAGCTGGCTGACGAGATCGCCAACAGCAGCGGCAAAGGAGCCCTGGCGTTAGAGGAG AAGCGGCGTCTGGAGGCCCGCATCGCCCAGCTGGAGGAGGAGCTGGAGGAGGAGCAGGGCAACACGGAGCTGATCAACGACCGGCTGAAG AAGGCCAACCTGCAGATCGACCAGATCAACACCGACCTGAACCTGGAGCGCAGCCACGCCCAGAAGAACGAGAATGCTCGGCAGCAGCTG GAACGCCAGAACAAGGAGCTTAAGGTCAAGCTGCAGGAGATGGAGGGCACTGTCAAGTCCAAGTACAAGGCCTCCATCACCGCCCTCGAG GCCAAGATTGCACAGCTGGAGGAGCAGCTGGACAACGAGACCAAGGAGCGCCAGGCAGCCTGCAAACAGGTGCGTCGGACCGAGAAGAAG CTGAAGGATGTGCTGCTGCAGGTGGATGACGAGCGGAGGAACGCCGAGCAGTACAAGGACCAGCCACGACAAGGGACCCGAGGCGGAGGA GGGCGTCGAGCTGCAGGAAGGCGGGGACGGCCCAGGAGCGGAGGAGCAGACAGCGGTGGCCATCACCAGCGTCCAGCAGGCGGCGTTCGG CGACCACAACATCCAGTACCAGTTCCGCACAGAGACAAATGGAGGACAGGTGACATACCGCGTAGTCCAGGTGACTGATGGTCAGCTGGA CGGCCAGGGCGACACAGCTGGCGCCGTCAGCGTCGTGTCCACCGCTGCCTTCGCGGGGGGGCAGCAGGCTGTGACCCAGGTGGGTGTGGA CGGGGCAGCCCAGCGCCCGGGCCCCGCCGCTGCCTCTGTGCCCCCAGGTCCTGCAGCGCCCTTCCCGCTGGCTGTGATCCAAAATCCCTT CAGCAATGGTGGCAGTCCGGCGGCCGAGGCTGTCAGCGGGGAGGCACGATTTGCCTATTTCCCAGCGTCCAGTGTGGGAGATACTACGGC TGTGTCCGTACAGACCACAGACCAGAGCTTGCAGGCTGGAGGCCAGTTCTACGTCATGATGACGCCCCAGGATGTGCTTCAGACAGGAAC ACAGAGGACGATCGCCCCCCGGACACACCCTTACTCTCCAAAAATTGATGGAACCAGAACACCCCGAGATGAGAGGAGAAGAGCCCAGCA CAACGAAGTGGAGCGGAGGCGGAGGGACAAGATCAACAACTGGATCGTCCAGCTTTCGAAAATCATTCCAGACTGTAACGCAGACAACAG CAAGACGGGAGCGAGTAAAGGAGGGATCCTGTCCAAGGCCTGCGATTACATCCGGGAGTTGCGCCAGACCAACCAGCGCATGCAGGAGAC CTTCAAAGAGGCCGAGCGGCTGCAGATGGACAACGAGCTCCTGAGGCAGCAGATCGAGGAGCTGAAGAATGAGAACGCCCTGCTTCGAGC CCAGCTGCAGCAGCACAACCTGGAGATGGTGGGCGAGGGCACCCGGCAGTGACGCCCGCCACCACCACGCAGCCGCCGCCGCCCACGCCG GCCTCTGCTGCCCCCTTCCCCAGCCCTTAGCACAGAGAGGGACACATGCCCCTCCCCCAGCTGCGTTTTTTTATAGTAGATTTTTAACAA AAAACGGGGAGAAATAATGCATTTCTGTGGATACAGTGCCCACCGCCCTCCTCCACTTGGAAACGGTATCCTCCCTGCCCATCCGTCTGT CTGTCGCCCTTCTCCCGGCCCTCACTAAGCCCCGGCACTTCTAGTGGTCTCACCTGGAGGCAAGAGGGAGGGGACAGAGGCCCTGCCACG TCCCGCTGCCTCCTGCTCTCTGGAGGTACTGAGACAGGGTGCTGATGGGAAGGAGGGGAGCCTTTGGGGGGCCACCCGGGGCCTGGACCT ATGCAGGGAGGCCACGTCCCACCCCACCTCTTGTTTCTGGGTCCCTGCTCCCCTTTGGGGGTGTGTGTGTGTGTTTTAATTTTCTTTATG >In-frame_ENST00000216181_ENST00000222305_TCGA-61-2003_MYH9_chr22_36680448_-_USF2_chr19_35760348_length(amino acids)=1928AA_start in transcript=231_stop in transcript=6017 MAQQAADKYLYVDKNFINNPLAQADWAAKKLVWVPSDKSGFEPASLKEEVGEEAIVELVENGKKVKVNKDDIQKMNPPKFSKVEDMAELT CLNEASVLHNLKERYYSGLIYTYSGLFCVVINPYKNLPIYSEEIVEMYKGKKRHEMPPHIYAITDTAYRSMMQDREDQSILCTGESGAGK TENTKKVIQYLAYVASSHKSKKDQGELERQLLQANPILEAFGNAKTVKNDNSSRFGKFIRINFDVNGYIVGANIETYLLEKSRAIRQAKE ERTFHIFYYLLSGAGEHLKTDLLLEPYNKYRFLSNGHVTIPGQQDKDMFQETMEAMRIMGIPEEEQMGLLRVISGVLQLGNIVFKKERNT DQASMPDNTAAQKVSHLLGINVTDFTRGILTPRIKVGRDYVQKAQTKEQADFAIEALAKATYERMFRWLVLRINKALDKTKRQGASFIGI LDIAGFEIFDLNSFEQLCINYTNEKLQQLFNHTMFILEQEEYQREGIEWNFIDFGLDLQPCIDLIEKPAGPPGILALLDEECWFPKATDK SFVEKVMQEQGTHPKFQKPKQLKDKADFCIIHYAGKVDYKADEWLMKNMDPLNDNIATLLHQSSDKFVSELWKDVDRIIGLDQVAGMSET ALPGAFKTRKGMFRTVGQLYKEQLAKLMATLRNTNPNFVRCIIPNHEKKAGKLDPHLVLDQLRCNGVLEGIRICRQGFPNRVVFQEFRQR YEILTPNSIPKGFMDGKQACVLMIKALELDSNLYRIGQSKVFFRAGVLAHLEEERDLKITDVIIGFQACCRGYLARKAFAKRQQQLTAMK VLQRNCAAYLKLRNWQWWRLFTKVKPLLQVSRQEEEMMAKEEELVKVREKQLAAENRLTEMETLQSQLMAEKLQLQEQLQAETELCAEAE ELRARLTAKKQELEEICHDLEARVEEEEERCQHLQAEKKKMQQNIQELEEQLEEEESARQKLQLEKVTTEAKLKKLEEEQIILEDQNCKL AKEKKLLEDRIAEFTTNLTEEEEKSKSLAKLKNKHEAMITDLEERLRREEKQRQELEKTRRKLEGDSTDLSDQIAELQAQIAELKMQLAK KEEELQAALARVEEEAAQKNMALKKIRELESQISELQEDLESERASRNKAEKQKRDLGEELEALKTELEDTLDSTAAQQELRSKREQEVN ILKKTLEEEAKTHEAQIQEMRQKHSQAVEELAEQLEQTKRVKANLEKAKQTLENERGELANEVKVLLQGKGDSEHKRKKVEAQLQELQVK FNEGERVRTELADKVTKLQVELDNVTGLLSQSDSKSSKLTKDFSALESQLQDTQELLQEENRQKLSLSTKLKQVEDEKNSFREQLEEEEE AKHNLEKQIATLHAQVADMKKKMEDSVGCLETAEEVKRKLQKDLEGLSQRHEEKVAAYDKLEKTKTRLQQELDDLLVDLDHQRQSACNLE KKQKKFDQLLAEEKTISAKYAEERDRAEAEAREKETKALSLARALEEAMEQKAELERLNKQFRTEMEDLMSSKDDVGKSVHELEKSKRAL EQQVEEMKTQLEELEDELQATEDAKLRLEVNLQAMKAQFERDLQGRDEQSEEKKKQLVRQVREMEAELEDERKQRSMAVAARKKLEMDLK DLEAHIDSANKNRDEAIKQLRKLQAQMKDCMRELDDTRASREEILAQAKENEKKLKSMEAEMIQLQEELAAAERAKRQAQQERDELADEI ANSSGKGALALEEKRRLEARIAQLEEELEEEQGNTELINDRLKKANLQIDQINTDLNLERSHAQKNENARQQLERQNKELKVKLQEMEGT VKSKYKASITALEAKIAQLEEQLDNETKERQAACKQVRRTEKKLKDVLLQVDDERRNAEQYKDQPRQGTRGGGGRRAAGRRGRPRSGGAD -------------------------------------------------------------- >In-frame_ENST00000216181_ENST00000595068_TCGA-61-2003_MYH9_chr22_36680448_-_USF2_chr19_35760348_length(transcript)=7336nt_BP=5823nt GAGGGCGGGGCGGGAAGGCGGCGAGGAGCCGAGCTGGGTGCGGTGAGGCGCGCAGATCACCGCGGTTCCTGGGCAGGGCACGGAAGGCTA AGCAAGGCTGACCTGCTGCAGCTCCCGCCTCGTGCGCTCGCCCCACCCGGCCGCCGCCCGAGCGCTCGAGAAAGTCCTCTCGGGAGAAGC AGCGCCTGTTCCCGGGGCAGATCCAGGTTCAGGTCCTGGCTATAAGTCACCATGGCACAGCAAGCTGCCGATAAGTATCTCTATGTGGAT AAAAACTTCATCAACAATCCGCTGGCCCAGGCCGACTGGGCTGCCAAGAAGCTGGTATGGGTGCCTTCCGACAAGAGTGGCTTTGAGCCA GCCAGCCTCAAGGAGGAGGTGGGCGAAGAGGCCATCGTGGAGCTGGTGGAGAATGGGAAGAAGGTGAAGGTGAACAAGGATGACATCCAG AAGATGAACCCGCCCAAGTTCTCCAAGGTGGAGGACATGGCAGAGCTCACGTGCCTCAACGAAGCCTCGGTGCTGCACAACCTCAAGGAG CGTTACTACTCAGGGCTCATCTACACCTATTCAGGCCTGTTCTGTGTGGTCATCAATCCTTACAAGAACCTGCCCATCTACTCTGAAGAG ATTGTGGAAATGTACAAGGGCAAGAAGAGGCACGAGATGCCCCCTCACATCTATGCCATCACAGACACCGCCTACAGGAGTATGATGCAA GACCGAGAAGATCAATCCATCTTGTGCACTGGTGAATCTGGAGCTGGCAAGACGGAGAACACCAAGAAGGTCATCCAGTATCTGGCGTAC GTGGCGTCCTCGCACAAGAGCAAGAAGGACCAGGGCGAGCTGGAGCGGCAGCTGCTGCAGGCCAACCCCATCCTGGAGGCCTTCGGGAAC GCCAAGACCGTGAAGAATGACAACTCCTCCCGCTTCGGCAAATTCATTCGCATCAACTTTGATGTCAATGGCTACATTGTTGGAGCCAAC ATTGAGACTTATCTTTTGGAGAAATCTCGTGCTATCCGCCAAGCCAAGGAAGAACGGACCTTCCACATCTTCTATTATCTCCTGTCTGGG GCTGGAGAGCACCTGAAGACCGATCTCCTGTTGGAGCCGTACAACAAATACCGCTTCCTGTCCAATGGACACGTCACCATCCCCGGGCAG CAGGACAAGGACATGTTCCAGGAGACCATGGAGGCCATGAGGATTATGGGCATCCCAGAAGAGGAGCAAATGGGCCTGCTGCGGGTCATC TCAGGGGTTCTTCAGCTCGGCAACATCGTCTTCAAGAAGGAGCGGAACACTGACCAGGCGTCCATGCCCGACAACACAGCTGCCCAAAAG GTGTCCCATCTCTTGGGTATCAATGTGACCGATTTCACCAGAGGAATCCTCACCCCGCGCATCAAGGTGGGACGGGATTACGTCCAGAAG GCGCAGACTAAAGAGCAGGCTGACTTTGCCATCGAGGCCTTGGCCAAGGCGACCTATGAGCGGATGTTCCGCTGGCTGGTGCTGCGCATC AACAAGGCTCTGGACAAGACCAAGAGGCAGGGCGCCTCCTTCATCGGGATCCTGGACATTGCCGGCTTCGAGATCTTTGATCTGAACTCG TTTGAGCAGCTGTGCATCAATTACACCAATGAGAAGCTGCAGCAGCTCTTCAACCACACCATGTTCATCCTGGAGCAGGAGGAGTACCAG CGCGAGGGCATCGAGTGGAACTTCATCGACTTTGGCCTCGACCTGCAGCCCTGCATCGACCTCATTGAGAAGCCAGCAGGCCCCCCGGGC ATTCTGGCCCTGCTGGACGAGGAGTGCTGGTTCCCCAAAGCCACCGACAAGAGCTTCGTGGAGAAGGTGATGCAGGAGCAGGGCACCCAC CCCAAGTTCCAGAAGCCCAAGCAGCTGAAGGACAAAGCTGATTTCTGCATTATCCACTATGCCGGCAAGGTGGATTACAAAGCTGACGAG TGGCTGATGAAGAACATGGATCCCCTGAATGACAACATCGCCACACTGCTCCACCAGTCCTCTGACAAGTTTGTCTCGGAGCTGTGGAAG GATGTGGACCGCATCATCGGCCTGGACCAGGTGGCCGGCATGTCGGAGACCGCACTGCCCGGGGCCTTCAAGACGCGGAAGGGCATGTTC CGCACTGTGGGGCAGCTTTACAAGGAGCAGCTGGCCAAGCTGATGGCTACGCTGAGGAACACGAACCCCAACTTTGTCCGCTGCATCATC CCCAACCACGAGAAGAAGGCCGGCAAGCTGGACCCGCATCTCGTGCTGGACCAGCTGCGCTGCAACGGTGTTCTCGAGGGCATCCGTATC TGCCGCCAGGGCTTCCCCAACAGGGTGGTCTTCCAGGAGTTTCGGCAGAGATATGAGATCCTGACTCCAAACTCCATTCCCAAGGGTTTC ATGGACGGGAAGCAGGCGTGCGTGCTCATGATAAAAGCCCTGGAGCTCGACAGCAATCTGTACCGCATTGGCCAGAGCAAAGTCTTCTTC CGTGCCGGTGTGCTGGCCCACCTGGAGGAGGAGCGAGACCTGAAGATCACCGACGTCATCATAGGGTTCCAGGCCTGCTGCAGGGGCTAC CTGGCCAGGAAAGCATTTGCCAAGCGGCAGCAGCAGCTTACCGCCATGAAGGTCCTCCAGCGGAACTGCGCTGCCTACCTGAAGCTGCGG AACTGGCAGTGGTGGCGGCTCTTCACCAAGGTCAAGCCGCTGCTGCAGGTGAGCCGGCAGGAGGAGGAGATGATGGCCAAGGAGGAGGAG CTGGTGAAGGTCAGAGAGAAGCAGCTGGCTGCGGAGAACAGGCTCACGGAGATGGAGACGCTGCAGTCTCAGCTCATGGCAGAGAAATTG CAGCTGCAGGAGCAGCTCCAGGCAGAAACCGAGCTGTGTGCCGAGGCTGAGGAGCTCCGGGCCCGCCTGACCGCCAAGAAGCAGGAATTA GAAGAGATCTGCCATGACCTAGAGGCCAGGGTGGAGGAGGAGGAGGAGCGCTGCCAGCACCTGCAGGCGGAGAAGAAGAAGATGCAGCAG AACATCCAGGAGCTTGAGGAGCAGCTGGAGGAGGAGGAGAGCGCCCGGCAGAAGCTGCAGCTGGAGAAGGTGACCACCGAGGCGAAGCTG AAAAAGCTGGAGGAGGAGCAGATCATCCTGGAGGACCAGAACTGCAAGCTGGCCAAGGAAAAGAAACTGCTGGAAGACAGAATAGCTGAG TTCACCACCAACCTCACAGAAGAGGAGGAGAAATCTAAGAGCCTCGCCAAGCTCAAGAACAAGCATGAGGCAATGATCACTGACTTGGAA GAGCGCCTCCGCAGGGAGGAGAAGCAGCGACAGGAGCTGGAGAAGACCCGCCGGAAGCTGGAGGGAGACTCCACAGACCTCAGCGACCAG ATCGCCGAGCTCCAGGCCCAGATCGCGGAGCTCAAGATGCAGCTGGCCAAGAAAGAGGAGGAGCTCCAGGCCGCCCTGGCCAGAGTGGAA GAGGAAGCTGCCCAGAAGAACATGGCCCTCAAGAAGATCCGGGAGCTGGAATCTCAGATCTCTGAACTCCAGGAAGACCTGGAGTCTGAG CGTGCTTCCAGGAATAAAGCTGAGAAGCAGAAACGGGACCTTGGGGAAGAGCTAGAGGCTCTGAAAACAGAGTTGGAGGACACGCTGGAT TCCACAGCTGCCCAGCAGGAGCTCAGGTCAAAACGTGAGCAGGAGGTGAACATCCTGAAGAAGACCCTGGAGGAGGAGGCCAAGACCCAC GAGGCCCAGATCCAGGAGATGAGGCAGAAGCACTCACAGGCCGTGGAGGAGCTGGCGGAGCAGCTGGAGCAGACGAAGCGGGTGAAAGCA AACCTCGAGAAGGCAAAGCAGACTCTGGAGAACGAGCGGGGGGAGCTGGCCAACGAGGTGAAGGTGCTGCTGCAGGGCAAAGGGGACTCG GAGCACAAGCGCAAGAAAGTGGAGGCGCAGCTGCAGGAGCTGCAGGTCAAGTTCAACGAGGGAGAGCGCGTGCGCACAGAGCTGGCCGAC AAGGTCACCAAGCTGCAGGTGGAGCTGGACAACGTGACCGGGCTTCTCAGCCAGTCCGACAGCAAGTCCAGCAAGCTCACCAAGGACTTC TCCGCGCTGGAGTCCCAGCTGCAGGACACTCAGGAGCTGCTGCAGGAGGAGAACCGGCAGAAGCTGAGCCTGAGCACCAAGCTCAAGCAG GTGGAGGACGAGAAGAATTCCTTCCGGGAGCAGCTGGAGGAGGAGGAGGAGGCCAAGCACAACCTGGAGAAGCAGATCGCCACCCTCCAT GCCCAGGTGGCCGACATGAAAAAGAAGATGGAGGACAGTGTGGGGTGCCTGGAAACTGCTGAGGAGGTGAAGAGGAAGCTCCAGAAGGAC CTGGAGGGCCTGAGCCAGCGGCACGAGGAGAAGGTGGCCGCCTACGACAAGCTGGAGAAGACCAAGACGCGGCTGCAGCAGGAGCTGGAC GACCTGCTGGTGGACCTGGACCACCAGCGCCAGAGCGCGTGCAACCTGGAGAAGAAGCAGAAGAAGTTTGACCAGCTCCTGGCGGAGGAG AAGACCATCTCTGCCAAGTATGCAGAGGAGCGCGACCGGGCTGAGGCGGAGGCCCGAGAGAAGGAGACCAAGGCTCTGTCGCTGGCCCGG GCCCTGGAGGAAGCCATGGAGCAGAAGGCGGAGCTGGAGCGGCTCAACAAGCAGTTCCGCACGGAGATGGAGGACCTTATGAGCTCCAAG GATGATGTGGGCAAGAGTGTCCACGAGCTGGAGAAGTCCAAGCGGGCCCTAGAGCAGCAGGTGGAGGAGATGAAGACGCAGCTGGAAGAG CTGGAGGACGAGCTGCAGGCCACCGAAGATGCCAAGCTGCGGTTGGAGGTCAACCTGCAGGCCATGAAGGCCCAGTTCGAGCGGGACCTG CAGGGCCGGGACGAGCAGAGCGAGGAGAAGAAGAAGCAGCTGGTCAGACAGGTGCGGGAGATGGAGGCAGAGCTGGAGGACGAGAGGAAG CAGCGCTCGATGGCAGTGGCCGCCCGGAAGAAGCTGGAGATGGACCTGAAGGACCTGGAGGCGCACATCGACTCGGCCAACAAGAACCGG GACGAAGCCATCAAACAGCTGCGGAAGCTGCAGGCCCAGATGAAGGACTGCATGCGCGAGCTGGATGACACCCGCGCCTCTCGTGAGGAG ATCCTGGCCCAGGCCAAAGAGAACGAGAAGAAGCTGAAGAGCATGGAGGCCGAGATGATCCAGTTGCAGGAGGAACTGGCAGCCGCGGAG CGTGCCAAGCGCCAGGCCCAGCAGGAGCGGGATGAGCTGGCTGACGAGATCGCCAACAGCAGCGGCAAAGGAGCCCTGGCGTTAGAGGAG AAGCGGCGTCTGGAGGCCCGCATCGCCCAGCTGGAGGAGGAGCTGGAGGAGGAGCAGGGCAACACGGAGCTGATCAACGACCGGCTGAAG AAGGCCAACCTGCAGATCGACCAGATCAACACCGACCTGAACCTGGAGCGCAGCCACGCCCAGAAGAACGAGAATGCTCGGCAGCAGCTG GAACGCCAGAACAAGGAGCTTAAGGTCAAGCTGCAGGAGATGGAGGGCACTGTCAAGTCCAAGTACAAGGCCTCCATCACCGCCCTCGAG GCCAAGATTGCACAGCTGGAGGAGCAGCTGGACAACGAGACCAAGGAGCGCCAGGCAGCCTGCAAACAGGTGCGTCGGACCGAGAAGAAG CTGAAGGATGTGCTGCTGCAGGTGGATGACGAGCGGAGGAACGCCGAGCAGTACAAGGACCAGCCACGACAAGGGACCCGAGGCGGAGGA GGGCGTCGAGCTGCAGGAAGGCGGGGACGGCCCAGGAGCGGAGGAGCAGACAGCGGTGGCCATCACCAGCGTCCAGCAGGCGGCGTTCGG CGACCACAACATCCAGTACCAGTTCCGCACAGAGACAAATGGAGGACAGGTGACATACCGCGTAGTCCAGGTGACTGATGGTCAGCTGGA CGGCCAGGGCGACACAGCTGGCGCCGTCAGCGTCGTGTCCACCGCTGCCTTCGCGGGGGGGCAGCAGGCTGTGACCCAGGTGGGTGTGGA CGGGGCAGCCCAGCGCCCGGGCCCCGCCGCTGCCTCTGTGCCCCCAGGTCCTGCAGCGCCCTTCCCGCTGGCTGTGATCCAAAATCCCTT CAGCAATGGTGGCAGTCCGGCGGCCGAGGCTGTCAGCGGGGAGGCACGATTTGCCTATTTCCCAGCGTCCAGTGTGGGAGATACTACGGC TGTGTCCGTACAGACCACAGACCAGAGCTTGCAGGCTGGAGGCCAGTTCTACGTCATGATGACGCCCCAGGATGTGCTTCAGACAGGAAC ACAGAGGACGATCGCCCCCCGGACACACCCTTACTCTCCAAAAATTGATGGAACCAGAACACCCCGAGATGAGAGGAGAAGAGCCCAGCA CAACGAAGTGGAGCGGAGGCGGAGGGACAAGATCAACAACTGGATCGTCCAGCTTTCGAAAATCATTCCAGACTGTAACGCAGACAACAG CAAGACGGGAGCGGCCTGCGATTACATCCGGGAGTTGCGCCAGACCAACCAGCGCATGCAGGAGACCTTCAAAGAGGCCGAGCGGCTGCA GATGGACAACGAGCTCCTGAGGCAGCAGATCGAGGAGCTGAAGAATGAGAACGCCCTGCTTCGAGCCCAGCTGCAGCAGCACAACCTGGA GATGGTGGGCGAGGGCACCCGGCAGTGACGCCCGCCACCACCACGCAGCCGCCGCCGCCCACGCCGGCCTCTGCTGCCCCCTTCCCCAGC CCTTAGCACAGAGAGGGACACATGCCCCTCCCCCAGCTGCGTTTTTTTATAGTAGATTTTTAACAAAAAACGGGGAGAAATAATGCATTT CTGTGGATACAGTGCCCACCGCCCTCCTCCACTTGGAAACGGTATCCTCCCTGCCCATCCGTCTGTCTGTCGCCCTTCTCCCGGCCCTCA CTAAGCCCCGGCACTTCTAGTGGTCTCACCTGGAGGCAAGAGGGAGGGGACAGAGGCCCTGCCACGTCCCGCTGCCTCCTGCTCTCTGGA GGTACTGAGACAGGGTGCTGATGGGAAGGAGGGGAGCCTTTGGGGGGCCACCCGGGGCCTGGACCTATGCAGGGAGGCCACGTCCCACCC CACCTCTTGTTTCTGGGTCCCTGCTCCCCTTTGGGGGTGTGTGTGTGTGTTTTAATTTTCTTTATGGAAAAATTGACAAAAAAAAAATAG >In-frame_ENST00000216181_ENST00000595068_TCGA-61-2003_MYH9_chr22_36680448_-_USF2_chr19_35760348_length(amino acids)=1928AA_start in transcript=231_stop in transcript=6017 MAQQAADKYLYVDKNFINNPLAQADWAAKKLVWVPSDKSGFEPASLKEEVGEEAIVELVENGKKVKVNKDDIQKMNPPKFSKVEDMAELT CLNEASVLHNLKERYYSGLIYTYSGLFCVVINPYKNLPIYSEEIVEMYKGKKRHEMPPHIYAITDTAYRSMMQDREDQSILCTGESGAGK TENTKKVIQYLAYVASSHKSKKDQGELERQLLQANPILEAFGNAKTVKNDNSSRFGKFIRINFDVNGYIVGANIETYLLEKSRAIRQAKE ERTFHIFYYLLSGAGEHLKTDLLLEPYNKYRFLSNGHVTIPGQQDKDMFQETMEAMRIMGIPEEEQMGLLRVISGVLQLGNIVFKKERNT DQASMPDNTAAQKVSHLLGINVTDFTRGILTPRIKVGRDYVQKAQTKEQADFAIEALAKATYERMFRWLVLRINKALDKTKRQGASFIGI LDIAGFEIFDLNSFEQLCINYTNEKLQQLFNHTMFILEQEEYQREGIEWNFIDFGLDLQPCIDLIEKPAGPPGILALLDEECWFPKATDK SFVEKVMQEQGTHPKFQKPKQLKDKADFCIIHYAGKVDYKADEWLMKNMDPLNDNIATLLHQSSDKFVSELWKDVDRIIGLDQVAGMSET ALPGAFKTRKGMFRTVGQLYKEQLAKLMATLRNTNPNFVRCIIPNHEKKAGKLDPHLVLDQLRCNGVLEGIRICRQGFPNRVVFQEFRQR YEILTPNSIPKGFMDGKQACVLMIKALELDSNLYRIGQSKVFFRAGVLAHLEEERDLKITDVIIGFQACCRGYLARKAFAKRQQQLTAMK VLQRNCAAYLKLRNWQWWRLFTKVKPLLQVSRQEEEMMAKEEELVKVREKQLAAENRLTEMETLQSQLMAEKLQLQEQLQAETELCAEAE ELRARLTAKKQELEEICHDLEARVEEEEERCQHLQAEKKKMQQNIQELEEQLEEEESARQKLQLEKVTTEAKLKKLEEEQIILEDQNCKL AKEKKLLEDRIAEFTTNLTEEEEKSKSLAKLKNKHEAMITDLEERLRREEKQRQELEKTRRKLEGDSTDLSDQIAELQAQIAELKMQLAK KEEELQAALARVEEEAAQKNMALKKIRELESQISELQEDLESERASRNKAEKQKRDLGEELEALKTELEDTLDSTAAQQELRSKREQEVN ILKKTLEEEAKTHEAQIQEMRQKHSQAVEELAEQLEQTKRVKANLEKAKQTLENERGELANEVKVLLQGKGDSEHKRKKVEAQLQELQVK FNEGERVRTELADKVTKLQVELDNVTGLLSQSDSKSSKLTKDFSALESQLQDTQELLQEENRQKLSLSTKLKQVEDEKNSFREQLEEEEE AKHNLEKQIATLHAQVADMKKKMEDSVGCLETAEEVKRKLQKDLEGLSQRHEEKVAAYDKLEKTKTRLQQELDDLLVDLDHQRQSACNLE KKQKKFDQLLAEEKTISAKYAEERDRAEAEAREKETKALSLARALEEAMEQKAELERLNKQFRTEMEDLMSSKDDVGKSVHELEKSKRAL EQQVEEMKTQLEELEDELQATEDAKLRLEVNLQAMKAQFERDLQGRDEQSEEKKKQLVRQVREMEAELEDERKQRSMAVAARKKLEMDLK DLEAHIDSANKNRDEAIKQLRKLQAQMKDCMRELDDTRASREEILAQAKENEKKLKSMEAEMIQLQEELAAAERAKRQAQQERDELADEI ANSSGKGALALEEKRRLEARIAQLEEELEEEQGNTELINDRLKKANLQIDQINTDLNLERSHAQKNENARQQLERQNKELKVKLQEMEGT VKSKYKASITALEAKIAQLEEQLDNETKERQAACKQVRRTEKKLKDVLLQVDDERRNAEQYKDQPRQGTRGGGGRRAAGRRGRPRSGGAD -------------------------------------------------------------- >In-frame_ENST00000216181_ENST00000379134_TCGA-61-2003_MYH9_chr22_36680448_-_USF2_chr19_35760348_length(transcript)=6409nt_BP=5823nt GAGGGCGGGGCGGGAAGGCGGCGAGGAGCCGAGCTGGGTGCGGTGAGGCGCGCAGATCACCGCGGTTCCTGGGCAGGGCACGGAAGGCTA AGCAAGGCTGACCTGCTGCAGCTCCCGCCTCGTGCGCTCGCCCCACCCGGCCGCCGCCCGAGCGCTCGAGAAAGTCCTCTCGGGAGAAGC AGCGCCTGTTCCCGGGGCAGATCCAGGTTCAGGTCCTGGCTATAAGTCACCATGGCACAGCAAGCTGCCGATAAGTATCTCTATGTGGAT AAAAACTTCATCAACAATCCGCTGGCCCAGGCCGACTGGGCTGCCAAGAAGCTGGTATGGGTGCCTTCCGACAAGAGTGGCTTTGAGCCA GCCAGCCTCAAGGAGGAGGTGGGCGAAGAGGCCATCGTGGAGCTGGTGGAGAATGGGAAGAAGGTGAAGGTGAACAAGGATGACATCCAG AAGATGAACCCGCCCAAGTTCTCCAAGGTGGAGGACATGGCAGAGCTCACGTGCCTCAACGAAGCCTCGGTGCTGCACAACCTCAAGGAG CGTTACTACTCAGGGCTCATCTACACCTATTCAGGCCTGTTCTGTGTGGTCATCAATCCTTACAAGAACCTGCCCATCTACTCTGAAGAG ATTGTGGAAATGTACAAGGGCAAGAAGAGGCACGAGATGCCCCCTCACATCTATGCCATCACAGACACCGCCTACAGGAGTATGATGCAA GACCGAGAAGATCAATCCATCTTGTGCACTGGTGAATCTGGAGCTGGCAAGACGGAGAACACCAAGAAGGTCATCCAGTATCTGGCGTAC GTGGCGTCCTCGCACAAGAGCAAGAAGGACCAGGGCGAGCTGGAGCGGCAGCTGCTGCAGGCCAACCCCATCCTGGAGGCCTTCGGGAAC GCCAAGACCGTGAAGAATGACAACTCCTCCCGCTTCGGCAAATTCATTCGCATCAACTTTGATGTCAATGGCTACATTGTTGGAGCCAAC ATTGAGACTTATCTTTTGGAGAAATCTCGTGCTATCCGCCAAGCCAAGGAAGAACGGACCTTCCACATCTTCTATTATCTCCTGTCTGGG GCTGGAGAGCACCTGAAGACCGATCTCCTGTTGGAGCCGTACAACAAATACCGCTTCCTGTCCAATGGACACGTCACCATCCCCGGGCAG CAGGACAAGGACATGTTCCAGGAGACCATGGAGGCCATGAGGATTATGGGCATCCCAGAAGAGGAGCAAATGGGCCTGCTGCGGGTCATC TCAGGGGTTCTTCAGCTCGGCAACATCGTCTTCAAGAAGGAGCGGAACACTGACCAGGCGTCCATGCCCGACAACACAGCTGCCCAAAAG GTGTCCCATCTCTTGGGTATCAATGTGACCGATTTCACCAGAGGAATCCTCACCCCGCGCATCAAGGTGGGACGGGATTACGTCCAGAAG GCGCAGACTAAAGAGCAGGCTGACTTTGCCATCGAGGCCTTGGCCAAGGCGACCTATGAGCGGATGTTCCGCTGGCTGGTGCTGCGCATC AACAAGGCTCTGGACAAGACCAAGAGGCAGGGCGCCTCCTTCATCGGGATCCTGGACATTGCCGGCTTCGAGATCTTTGATCTGAACTCG TTTGAGCAGCTGTGCATCAATTACACCAATGAGAAGCTGCAGCAGCTCTTCAACCACACCATGTTCATCCTGGAGCAGGAGGAGTACCAG CGCGAGGGCATCGAGTGGAACTTCATCGACTTTGGCCTCGACCTGCAGCCCTGCATCGACCTCATTGAGAAGCCAGCAGGCCCCCCGGGC ATTCTGGCCCTGCTGGACGAGGAGTGCTGGTTCCCCAAAGCCACCGACAAGAGCTTCGTGGAGAAGGTGATGCAGGAGCAGGGCACCCAC CCCAAGTTCCAGAAGCCCAAGCAGCTGAAGGACAAAGCTGATTTCTGCATTATCCACTATGCCGGCAAGGTGGATTACAAAGCTGACGAG TGGCTGATGAAGAACATGGATCCCCTGAATGACAACATCGCCACACTGCTCCACCAGTCCTCTGACAAGTTTGTCTCGGAGCTGTGGAAG GATGTGGACCGCATCATCGGCCTGGACCAGGTGGCCGGCATGTCGGAGACCGCACTGCCCGGGGCCTTCAAGACGCGGAAGGGCATGTTC CGCACTGTGGGGCAGCTTTACAAGGAGCAGCTGGCCAAGCTGATGGCTACGCTGAGGAACACGAACCCCAACTTTGTCCGCTGCATCATC CCCAACCACGAGAAGAAGGCCGGCAAGCTGGACCCGCATCTCGTGCTGGACCAGCTGCGCTGCAACGGTGTTCTCGAGGGCATCCGTATC TGCCGCCAGGGCTTCCCCAACAGGGTGGTCTTCCAGGAGTTTCGGCAGAGATATGAGATCCTGACTCCAAACTCCATTCCCAAGGGTTTC ATGGACGGGAAGCAGGCGTGCGTGCTCATGATAAAAGCCCTGGAGCTCGACAGCAATCTGTACCGCATTGGCCAGAGCAAAGTCTTCTTC CGTGCCGGTGTGCTGGCCCACCTGGAGGAGGAGCGAGACCTGAAGATCACCGACGTCATCATAGGGTTCCAGGCCTGCTGCAGGGGCTAC CTGGCCAGGAAAGCATTTGCCAAGCGGCAGCAGCAGCTTACCGCCATGAAGGTCCTCCAGCGGAACTGCGCTGCCTACCTGAAGCTGCGG AACTGGCAGTGGTGGCGGCTCTTCACCAAGGTCAAGCCGCTGCTGCAGGTGAGCCGGCAGGAGGAGGAGATGATGGCCAAGGAGGAGGAG CTGGTGAAGGTCAGAGAGAAGCAGCTGGCTGCGGAGAACAGGCTCACGGAGATGGAGACGCTGCAGTCTCAGCTCATGGCAGAGAAATTG CAGCTGCAGGAGCAGCTCCAGGCAGAAACCGAGCTGTGTGCCGAGGCTGAGGAGCTCCGGGCCCGCCTGACCGCCAAGAAGCAGGAATTA GAAGAGATCTGCCATGACCTAGAGGCCAGGGTGGAGGAGGAGGAGGAGCGCTGCCAGCACCTGCAGGCGGAGAAGAAGAAGATGCAGCAG AACATCCAGGAGCTTGAGGAGCAGCTGGAGGAGGAGGAGAGCGCCCGGCAGAAGCTGCAGCTGGAGAAGGTGACCACCGAGGCGAAGCTG AAAAAGCTGGAGGAGGAGCAGATCATCCTGGAGGACCAGAACTGCAAGCTGGCCAAGGAAAAGAAACTGCTGGAAGACAGAATAGCTGAG TTCACCACCAACCTCACAGAAGAGGAGGAGAAATCTAAGAGCCTCGCCAAGCTCAAGAACAAGCATGAGGCAATGATCACTGACTTGGAA GAGCGCCTCCGCAGGGAGGAGAAGCAGCGACAGGAGCTGGAGAAGACCCGCCGGAAGCTGGAGGGAGACTCCACAGACCTCAGCGACCAG ATCGCCGAGCTCCAGGCCCAGATCGCGGAGCTCAAGATGCAGCTGGCCAAGAAAGAGGAGGAGCTCCAGGCCGCCCTGGCCAGAGTGGAA GAGGAAGCTGCCCAGAAGAACATGGCCCTCAAGAAGATCCGGGAGCTGGAATCTCAGATCTCTGAACTCCAGGAAGACCTGGAGTCTGAG CGTGCTTCCAGGAATAAAGCTGAGAAGCAGAAACGGGACCTTGGGGAAGAGCTAGAGGCTCTGAAAACAGAGTTGGAGGACACGCTGGAT TCCACAGCTGCCCAGCAGGAGCTCAGGTCAAAACGTGAGCAGGAGGTGAACATCCTGAAGAAGACCCTGGAGGAGGAGGCCAAGACCCAC GAGGCCCAGATCCAGGAGATGAGGCAGAAGCACTCACAGGCCGTGGAGGAGCTGGCGGAGCAGCTGGAGCAGACGAAGCGGGTGAAAGCA AACCTCGAGAAGGCAAAGCAGACTCTGGAGAACGAGCGGGGGGAGCTGGCCAACGAGGTGAAGGTGCTGCTGCAGGGCAAAGGGGACTCG GAGCACAAGCGCAAGAAAGTGGAGGCGCAGCTGCAGGAGCTGCAGGTCAAGTTCAACGAGGGAGAGCGCGTGCGCACAGAGCTGGCCGAC AAGGTCACCAAGCTGCAGGTGGAGCTGGACAACGTGACCGGGCTTCTCAGCCAGTCCGACAGCAAGTCCAGCAAGCTCACCAAGGACTTC TCCGCGCTGGAGTCCCAGCTGCAGGACACTCAGGAGCTGCTGCAGGAGGAGAACCGGCAGAAGCTGAGCCTGAGCACCAAGCTCAAGCAG GTGGAGGACGAGAAGAATTCCTTCCGGGAGCAGCTGGAGGAGGAGGAGGAGGCCAAGCACAACCTGGAGAAGCAGATCGCCACCCTCCAT GCCCAGGTGGCCGACATGAAAAAGAAGATGGAGGACAGTGTGGGGTGCCTGGAAACTGCTGAGGAGGTGAAGAGGAAGCTCCAGAAGGAC CTGGAGGGCCTGAGCCAGCGGCACGAGGAGAAGGTGGCCGCCTACGACAAGCTGGAGAAGACCAAGACGCGGCTGCAGCAGGAGCTGGAC GACCTGCTGGTGGACCTGGACCACCAGCGCCAGAGCGCGTGCAACCTGGAGAAGAAGCAGAAGAAGTTTGACCAGCTCCTGGCGGAGGAG AAGACCATCTCTGCCAAGTATGCAGAGGAGCGCGACCGGGCTGAGGCGGAGGCCCGAGAGAAGGAGACCAAGGCTCTGTCGCTGGCCCGG GCCCTGGAGGAAGCCATGGAGCAGAAGGCGGAGCTGGAGCGGCTCAACAAGCAGTTCCGCACGGAGATGGAGGACCTTATGAGCTCCAAG GATGATGTGGGCAAGAGTGTCCACGAGCTGGAGAAGTCCAAGCGGGCCCTAGAGCAGCAGGTGGAGGAGATGAAGACGCAGCTGGAAGAG CTGGAGGACGAGCTGCAGGCCACCGAAGATGCCAAGCTGCGGTTGGAGGTCAACCTGCAGGCCATGAAGGCCCAGTTCGAGCGGGACCTG CAGGGCCGGGACGAGCAGAGCGAGGAGAAGAAGAAGCAGCTGGTCAGACAGGTGCGGGAGATGGAGGCAGAGCTGGAGGACGAGAGGAAG CAGCGCTCGATGGCAGTGGCCGCCCGGAAGAAGCTGGAGATGGACCTGAAGGACCTGGAGGCGCACATCGACTCGGCCAACAAGAACCGG GACGAAGCCATCAAACAGCTGCGGAAGCTGCAGGCCCAGATGAAGGACTGCATGCGCGAGCTGGATGACACCCGCGCCTCTCGTGAGGAG ATCCTGGCCCAGGCCAAAGAGAACGAGAAGAAGCTGAAGAGCATGGAGGCCGAGATGATCCAGTTGCAGGAGGAACTGGCAGCCGCGGAG CGTGCCAAGCGCCAGGCCCAGCAGGAGCGGGATGAGCTGGCTGACGAGATCGCCAACAGCAGCGGCAAAGGAGCCCTGGCGTTAGAGGAG AAGCGGCGTCTGGAGGCCCGCATCGCCCAGCTGGAGGAGGAGCTGGAGGAGGAGCAGGGCAACACGGAGCTGATCAACGACCGGCTGAAG AAGGCCAACCTGCAGATCGACCAGATCAACACCGACCTGAACCTGGAGCGCAGCCACGCCCAGAAGAACGAGAATGCTCGGCAGCAGCTG GAACGCCAGAACAAGGAGCTTAAGGTCAAGCTGCAGGAGATGGAGGGCACTGTCAAGTCCAAGTACAAGGCCTCCATCACCGCCCTCGAG GCCAAGATTGCACAGCTGGAGGAGCAGCTGGACAACGAGACCAAGGAGCGCCAGGCAGCCTGCAAACAGGTGCGTCGGACCGAGAAGAAG CTGAAGGATGTGCTGCTGCAGGTGGATGACGAGCGGAGGAACGCCGAGCAGTACAAGGACCAGCCACGACAAGGGACCCGAGGCGGAGGA GGGCGTCGAGCTGCAGGAAGGCGGGGACGGCCCAGGAGCGGAGGAGCAGACAGCGGTGGCCATCACCAGCGTCCAGCAGGCGGCGTTCGG CGACCACAACATCCAGTACCAGTTCCGCACAGAGACAAATGGAGGACAGACAGGAACACAGAGGACGATCGCCCCCCGGACACACCCTTA CTCTCCAAAAATTGATGGAACCAGAACACCCCGAGATGAGAGGAGAAGAGCCCAGCACAACGAAGTGGAGCGGAGGCGGAGGGACAAGAT CAACAACTGGATCGTCCAGCTTTCGAAAATCATTCCAGACTGTAACGCAGACAACAGCAAGACGGGAGCGAGTAAAGGAGGGATCCTGTC CAAGGCCTGCGATTACATCCGGGAGTTGCGCCAGACCAACCAGCGCATGCAGGAGACCTTCAAAGAGGCCGAGCGGCTGCAGATGGACAA CGAGCTCCTGAGGCAGCAGATCGAGGAGCTGAAGAATGAGAACGCCCTGCTTCGAGCCCAGCTGCAGCAGCACAACCTGGAGATGGTGGG >In-frame_ENST00000216181_ENST00000379134_TCGA-61-2003_MYH9_chr22_36680448_-_USF2_chr19_35760348_length(amino acids)=1937AA_start in transcript=231_stop in transcript=6044 MAQQAADKYLYVDKNFINNPLAQADWAAKKLVWVPSDKSGFEPASLKEEVGEEAIVELVENGKKVKVNKDDIQKMNPPKFSKVEDMAELT CLNEASVLHNLKERYYSGLIYTYSGLFCVVINPYKNLPIYSEEIVEMYKGKKRHEMPPHIYAITDTAYRSMMQDREDQSILCTGESGAGK TENTKKVIQYLAYVASSHKSKKDQGELERQLLQANPILEAFGNAKTVKNDNSSRFGKFIRINFDVNGYIVGANIETYLLEKSRAIRQAKE ERTFHIFYYLLSGAGEHLKTDLLLEPYNKYRFLSNGHVTIPGQQDKDMFQETMEAMRIMGIPEEEQMGLLRVISGVLQLGNIVFKKERNT DQASMPDNTAAQKVSHLLGINVTDFTRGILTPRIKVGRDYVQKAQTKEQADFAIEALAKATYERMFRWLVLRINKALDKTKRQGASFIGI LDIAGFEIFDLNSFEQLCINYTNEKLQQLFNHTMFILEQEEYQREGIEWNFIDFGLDLQPCIDLIEKPAGPPGILALLDEECWFPKATDK SFVEKVMQEQGTHPKFQKPKQLKDKADFCIIHYAGKVDYKADEWLMKNMDPLNDNIATLLHQSSDKFVSELWKDVDRIIGLDQVAGMSET ALPGAFKTRKGMFRTVGQLYKEQLAKLMATLRNTNPNFVRCIIPNHEKKAGKLDPHLVLDQLRCNGVLEGIRICRQGFPNRVVFQEFRQR YEILTPNSIPKGFMDGKQACVLMIKALELDSNLYRIGQSKVFFRAGVLAHLEEERDLKITDVIIGFQACCRGYLARKAFAKRQQQLTAMK VLQRNCAAYLKLRNWQWWRLFTKVKPLLQVSRQEEEMMAKEEELVKVREKQLAAENRLTEMETLQSQLMAEKLQLQEQLQAETELCAEAE ELRARLTAKKQELEEICHDLEARVEEEEERCQHLQAEKKKMQQNIQELEEQLEEEESARQKLQLEKVTTEAKLKKLEEEQIILEDQNCKL AKEKKLLEDRIAEFTTNLTEEEEKSKSLAKLKNKHEAMITDLEERLRREEKQRQELEKTRRKLEGDSTDLSDQIAELQAQIAELKMQLAK KEEELQAALARVEEEAAQKNMALKKIRELESQISELQEDLESERASRNKAEKQKRDLGEELEALKTELEDTLDSTAAQQELRSKREQEVN ILKKTLEEEAKTHEAQIQEMRQKHSQAVEELAEQLEQTKRVKANLEKAKQTLENERGELANEVKVLLQGKGDSEHKRKKVEAQLQELQVK FNEGERVRTELADKVTKLQVELDNVTGLLSQSDSKSSKLTKDFSALESQLQDTQELLQEENRQKLSLSTKLKQVEDEKNSFREQLEEEEE AKHNLEKQIATLHAQVADMKKKMEDSVGCLETAEEVKRKLQKDLEGLSQRHEEKVAAYDKLEKTKTRLQQELDDLLVDLDHQRQSACNLE KKQKKFDQLLAEEKTISAKYAEERDRAEAEAREKETKALSLARALEEAMEQKAELERLNKQFRTEMEDLMSSKDDVGKSVHELEKSKRAL EQQVEEMKTQLEELEDELQATEDAKLRLEVNLQAMKAQFERDLQGRDEQSEEKKKQLVRQVREMEAELEDERKQRSMAVAARKKLEMDLK DLEAHIDSANKNRDEAIKQLRKLQAQMKDCMRELDDTRASREEILAQAKENEKKLKSMEAEMIQLQEELAAAERAKRQAQQERDELADEI ANSSGKGALALEEKRRLEARIAQLEEELEEEQGNTELINDRLKKANLQIDQINTDLNLERSHAQKNENARQQLERQNKELKVKLQEMEGT VKSKYKASITALEAKIAQLEEQLDNETKERQAACKQVRRTEKKLKDVLLQVDDERRNAEQYKDQPRQGTRGGGGRRAAGRRGRPRSGGAD -------------------------------------------------------------- >In-frame_ENST00000216181_ENST00000594064_TCGA-61-2003_MYH9_chr22_36680448_-_USF2_chr19_35760348_length(transcript)=6946nt_BP=5823nt GAGGGCGGGGCGGGAAGGCGGCGAGGAGCCGAGCTGGGTGCGGTGAGGCGCGCAGATCACCGCGGTTCCTGGGCAGGGCACGGAAGGCTA AGCAAGGCTGACCTGCTGCAGCTCCCGCCTCGTGCGCTCGCCCCACCCGGCCGCCGCCCGAGCGCTCGAGAAAGTCCTCTCGGGAGAAGC AGCGCCTGTTCCCGGGGCAGATCCAGGTTCAGGTCCTGGCTATAAGTCACCATGGCACAGCAAGCTGCCGATAAGTATCTCTATGTGGAT AAAAACTTCATCAACAATCCGCTGGCCCAGGCCGACTGGGCTGCCAAGAAGCTGGTATGGGTGCCTTCCGACAAGAGTGGCTTTGAGCCA GCCAGCCTCAAGGAGGAGGTGGGCGAAGAGGCCATCGTGGAGCTGGTGGAGAATGGGAAGAAGGTGAAGGTGAACAAGGATGACATCCAG AAGATGAACCCGCCCAAGTTCTCCAAGGTGGAGGACATGGCAGAGCTCACGTGCCTCAACGAAGCCTCGGTGCTGCACAACCTCAAGGAG CGTTACTACTCAGGGCTCATCTACACCTATTCAGGCCTGTTCTGTGTGGTCATCAATCCTTACAAGAACCTGCCCATCTACTCTGAAGAG ATTGTGGAAATGTACAAGGGCAAGAAGAGGCACGAGATGCCCCCTCACATCTATGCCATCACAGACACCGCCTACAGGAGTATGATGCAA GACCGAGAAGATCAATCCATCTTGTGCACTGGTGAATCTGGAGCTGGCAAGACGGAGAACACCAAGAAGGTCATCCAGTATCTGGCGTAC GTGGCGTCCTCGCACAAGAGCAAGAAGGACCAGGGCGAGCTGGAGCGGCAGCTGCTGCAGGCCAACCCCATCCTGGAGGCCTTCGGGAAC GCCAAGACCGTGAAGAATGACAACTCCTCCCGCTTCGGCAAATTCATTCGCATCAACTTTGATGTCAATGGCTACATTGTTGGAGCCAAC ATTGAGACTTATCTTTTGGAGAAATCTCGTGCTATCCGCCAAGCCAAGGAAGAACGGACCTTCCACATCTTCTATTATCTCCTGTCTGGG GCTGGAGAGCACCTGAAGACCGATCTCCTGTTGGAGCCGTACAACAAATACCGCTTCCTGTCCAATGGACACGTCACCATCCCCGGGCAG CAGGACAAGGACATGTTCCAGGAGACCATGGAGGCCATGAGGATTATGGGCATCCCAGAAGAGGAGCAAATGGGCCTGCTGCGGGTCATC TCAGGGGTTCTTCAGCTCGGCAACATCGTCTTCAAGAAGGAGCGGAACACTGACCAGGCGTCCATGCCCGACAACACAGCTGCCCAAAAG GTGTCCCATCTCTTGGGTATCAATGTGACCGATTTCACCAGAGGAATCCTCACCCCGCGCATCAAGGTGGGACGGGATTACGTCCAGAAG GCGCAGACTAAAGAGCAGGCTGACTTTGCCATCGAGGCCTTGGCCAAGGCGACCTATGAGCGGATGTTCCGCTGGCTGGTGCTGCGCATC AACAAGGCTCTGGACAAGACCAAGAGGCAGGGCGCCTCCTTCATCGGGATCCTGGACATTGCCGGCTTCGAGATCTTTGATCTGAACTCG TTTGAGCAGCTGTGCATCAATTACACCAATGAGAAGCTGCAGCAGCTCTTCAACCACACCATGTTCATCCTGGAGCAGGAGGAGTACCAG CGCGAGGGCATCGAGTGGAACTTCATCGACTTTGGCCTCGACCTGCAGCCCTGCATCGACCTCATTGAGAAGCCAGCAGGCCCCCCGGGC ATTCTGGCCCTGCTGGACGAGGAGTGCTGGTTCCCCAAAGCCACCGACAAGAGCTTCGTGGAGAAGGTGATGCAGGAGCAGGGCACCCAC CCCAAGTTCCAGAAGCCCAAGCAGCTGAAGGACAAAGCTGATTTCTGCATTATCCACTATGCCGGCAAGGTGGATTACAAAGCTGACGAG TGGCTGATGAAGAACATGGATCCCCTGAATGACAACATCGCCACACTGCTCCACCAGTCCTCTGACAAGTTTGTCTCGGAGCTGTGGAAG GATGTGGACCGCATCATCGGCCTGGACCAGGTGGCCGGCATGTCGGAGACCGCACTGCCCGGGGCCTTCAAGACGCGGAAGGGCATGTTC CGCACTGTGGGGCAGCTTTACAAGGAGCAGCTGGCCAAGCTGATGGCTACGCTGAGGAACACGAACCCCAACTTTGTCCGCTGCATCATC CCCAACCACGAGAAGAAGGCCGGCAAGCTGGACCCGCATCTCGTGCTGGACCAGCTGCGCTGCAACGGTGTTCTCGAGGGCATCCGTATC TGCCGCCAGGGCTTCCCCAACAGGGTGGTCTTCCAGGAGTTTCGGCAGAGATATGAGATCCTGACTCCAAACTCCATTCCCAAGGGTTTC ATGGACGGGAAGCAGGCGTGCGTGCTCATGATAAAAGCCCTGGAGCTCGACAGCAATCTGTACCGCATTGGCCAGAGCAAAGTCTTCTTC CGTGCCGGTGTGCTGGCCCACCTGGAGGAGGAGCGAGACCTGAAGATCACCGACGTCATCATAGGGTTCCAGGCCTGCTGCAGGGGCTAC CTGGCCAGGAAAGCATTTGCCAAGCGGCAGCAGCAGCTTACCGCCATGAAGGTCCTCCAGCGGAACTGCGCTGCCTACCTGAAGCTGCGG AACTGGCAGTGGTGGCGGCTCTTCACCAAGGTCAAGCCGCTGCTGCAGGTGAGCCGGCAGGAGGAGGAGATGATGGCCAAGGAGGAGGAG CTGGTGAAGGTCAGAGAGAAGCAGCTGGCTGCGGAGAACAGGCTCACGGAGATGGAGACGCTGCAGTCTCAGCTCATGGCAGAGAAATTG CAGCTGCAGGAGCAGCTCCAGGCAGAAACCGAGCTGTGTGCCGAGGCTGAGGAGCTCCGGGCCCGCCTGACCGCCAAGAAGCAGGAATTA GAAGAGATCTGCCATGACCTAGAGGCCAGGGTGGAGGAGGAGGAGGAGCGCTGCCAGCACCTGCAGGCGGAGAAGAAGAAGATGCAGCAG AACATCCAGGAGCTTGAGGAGCAGCTGGAGGAGGAGGAGAGCGCCCGGCAGAAGCTGCAGCTGGAGAAGGTGACCACCGAGGCGAAGCTG AAAAAGCTGGAGGAGGAGCAGATCATCCTGGAGGACCAGAACTGCAAGCTGGCCAAGGAAAAGAAACTGCTGGAAGACAGAATAGCTGAG TTCACCACCAACCTCACAGAAGAGGAGGAGAAATCTAAGAGCCTCGCCAAGCTCAAGAACAAGCATGAGGCAATGATCACTGACTTGGAA GAGCGCCTCCGCAGGGAGGAGAAGCAGCGACAGGAGCTGGAGAAGACCCGCCGGAAGCTGGAGGGAGACTCCACAGACCTCAGCGACCAG ATCGCCGAGCTCCAGGCCCAGATCGCGGAGCTCAAGATGCAGCTGGCCAAGAAAGAGGAGGAGCTCCAGGCCGCCCTGGCCAGAGTGGAA GAGGAAGCTGCCCAGAAGAACATGGCCCTCAAGAAGATCCGGGAGCTGGAATCTCAGATCTCTGAACTCCAGGAAGACCTGGAGTCTGAG CGTGCTTCCAGGAATAAAGCTGAGAAGCAGAAACGGGACCTTGGGGAAGAGCTAGAGGCTCTGAAAACAGAGTTGGAGGACACGCTGGAT TCCACAGCTGCCCAGCAGGAGCTCAGGTCAAAACGTGAGCAGGAGGTGAACATCCTGAAGAAGACCCTGGAGGAGGAGGCCAAGACCCAC GAGGCCCAGATCCAGGAGATGAGGCAGAAGCACTCACAGGCCGTGGAGGAGCTGGCGGAGCAGCTGGAGCAGACGAAGCGGGTGAAAGCA AACCTCGAGAAGGCAAAGCAGACTCTGGAGAACGAGCGGGGGGAGCTGGCCAACGAGGTGAAGGTGCTGCTGCAGGGCAAAGGGGACTCG GAGCACAAGCGCAAGAAAGTGGAGGCGCAGCTGCAGGAGCTGCAGGTCAAGTTCAACGAGGGAGAGCGCGTGCGCACAGAGCTGGCCGAC AAGGTCACCAAGCTGCAGGTGGAGCTGGACAACGTGACCGGGCTTCTCAGCCAGTCCGACAGCAAGTCCAGCAAGCTCACCAAGGACTTC TCCGCGCTGGAGTCCCAGCTGCAGGACACTCAGGAGCTGCTGCAGGAGGAGAACCGGCAGAAGCTGAGCCTGAGCACCAAGCTCAAGCAG GTGGAGGACGAGAAGAATTCCTTCCGGGAGCAGCTGGAGGAGGAGGAGGAGGCCAAGCACAACCTGGAGAAGCAGATCGCCACCCTCCAT GCCCAGGTGGCCGACATGAAAAAGAAGATGGAGGACAGTGTGGGGTGCCTGGAAACTGCTGAGGAGGTGAAGAGGAAGCTCCAGAAGGAC CTGGAGGGCCTGAGCCAGCGGCACGAGGAGAAGGTGGCCGCCTACGACAAGCTGGAGAAGACCAAGACGCGGCTGCAGCAGGAGCTGGAC GACCTGCTGGTGGACCTGGACCACCAGCGCCAGAGCGCGTGCAACCTGGAGAAGAAGCAGAAGAAGTTTGACCAGCTCCTGGCGGAGGAG AAGACCATCTCTGCCAAGTATGCAGAGGAGCGCGACCGGGCTGAGGCGGAGGCCCGAGAGAAGGAGACCAAGGCTCTGTCGCTGGCCCGG GCCCTGGAGGAAGCCATGGAGCAGAAGGCGGAGCTGGAGCGGCTCAACAAGCAGTTCCGCACGGAGATGGAGGACCTTATGAGCTCCAAG GATGATGTGGGCAAGAGTGTCCACGAGCTGGAGAAGTCCAAGCGGGCCCTAGAGCAGCAGGTGGAGGAGATGAAGACGCAGCTGGAAGAG CTGGAGGACGAGCTGCAGGCCACCGAAGATGCCAAGCTGCGGTTGGAGGTCAACCTGCAGGCCATGAAGGCCCAGTTCGAGCGGGACCTG CAGGGCCGGGACGAGCAGAGCGAGGAGAAGAAGAAGCAGCTGGTCAGACAGGTGCGGGAGATGGAGGCAGAGCTGGAGGACGAGAGGAAG CAGCGCTCGATGGCAGTGGCCGCCCGGAAGAAGCTGGAGATGGACCTGAAGGACCTGGAGGCGCACATCGACTCGGCCAACAAGAACCGG GACGAAGCCATCAAACAGCTGCGGAAGCTGCAGGCCCAGATGAAGGACTGCATGCGCGAGCTGGATGACACCCGCGCCTCTCGTGAGGAG ATCCTGGCCCAGGCCAAAGAGAACGAGAAGAAGCTGAAGAGCATGGAGGCCGAGATGATCCAGTTGCAGGAGGAACTGGCAGCCGCGGAG CGTGCCAAGCGCCAGGCCCAGCAGGAGCGGGATGAGCTGGCTGACGAGATCGCCAACAGCAGCGGCAAAGGAGCCCTGGCGTTAGAGGAG AAGCGGCGTCTGGAGGCCCGCATCGCCCAGCTGGAGGAGGAGCTGGAGGAGGAGCAGGGCAACACGGAGCTGATCAACGACCGGCTGAAG AAGGCCAACCTGCAGATCGACCAGATCAACACCGACCTGAACCTGGAGCGCAGCCACGCCCAGAAGAACGAGAATGCTCGGCAGCAGCTG GAACGCCAGAACAAGGAGCTTAAGGTCAAGCTGCAGGAGATGGAGGGCACTGTCAAGTCCAAGTACAAGGCCTCCATCACCGCCCTCGAG GCCAAGATTGCACAGCTGGAGGAGCAGCTGGACAACGAGACCAAGGAGCGCCAGGCAGCCTGCAAACAGGTGCGTCGGACCGAGAAGAAG CTGAAGGATGTGCTGCTGCAGGTGGATGACGAGCGGAGGAACGCCGAGCAGTACAAGGACCAGCCACGACAAGGGACCCGAGGCGGAGGA GGGCGTCGAGCTGCAGGAAGGCGGGGACGGCCCAGGAGCGGAGGAGCAGACAGCGGTGGCCATCACCAGCGTCCAGCAGGCGGCGTTCGG CGACCACAACATCCAGTACCAGTTCCGCACAGAGACAAATGGAGGACAGGTGACATACCGCGTAGTCCAGGTGACTGATGGTCAGCTGGA CGGCCAGGGCGACACAGCTGGCGCCGTCAGCGTCGTGTCCACCGCTGCCTTCGCGGGGGGGCAGCAGGCTGTGACCCAGGTGGGTGTGGA CGGGGCAGCCCAGCGCCCGGGCCCCGCCGCTGCCTCTGTGCCCCCAGGTCCTGCAGCGCCCTTCCCGCTGGCTGTGATCCAAAATCCCTT CAGCAATGGTGGCAGTCCGGCGGCCGAGGCTGTCAGCGGGGAGGCACGATTTGCCTATTTCCCAGCGTCCAGTGTGGGAGATACTACGGC TGTGTCCGTACAGACCACAGACCAGAGCTTGCAGGCTGGAGGCCAGTTCTACGTCATGATGACGCCCCAGGATGTGCTTCAGACAGGAAC ACAGAGGACGATCGCCCCCCGGACACACCCTTACTCTCCAAAAATTGATGGAACCAGAACACCCCGAGATGAGAGGAGAAGAGCCCAGCA CAACGAAGTGGAGCGGAGGCGGAGGGACAAGATCAACAACTGGATCGTCCAGCTTTCGAAAATCATTCCAGACTGTAACGCAGACAACAG CAAGACGGGAGCGAGTAAAGGAGGGATCCTGTCCAAGGCCTGCGATTACATCCGGGAGTTGCGCCAGACCAACCAGCGCATGCAGGAGAC CTTCAAAGAGGCCGAGCGGCTGCAGATGGACAACGAGCTCCTGAGGCAGCAGGTGGGTGCGGGGCCTGGAGCGGGTCAGGGCCCAGGAGC CCCAGATGCAAGGCGCTGGCCCTCAGCTCCCTTGACCTCCGTCGTGTCCGCCAGATCGAGGAGCTGAAGAATGAGAACGCCCTGCTTCGA GCCCAGCTGCAGCAGCACAACCTGGAGATGGTGGGCGAGGGCACCCGGCAGTGACGCCCGCCACCACCACGCAGCCGCCGCCGCCCACGC >In-frame_ENST00000216181_ENST00000594064_TCGA-61-2003_MYH9_chr22_36680448_-_USF2_chr19_35760348_length(amino acids)=1928AA_start in transcript=231_stop in transcript=6017 MAQQAADKYLYVDKNFINNPLAQADWAAKKLVWVPSDKSGFEPASLKEEVGEEAIVELVENGKKVKVNKDDIQKMNPPKFSKVEDMAELT CLNEASVLHNLKERYYSGLIYTYSGLFCVVINPYKNLPIYSEEIVEMYKGKKRHEMPPHIYAITDTAYRSMMQDREDQSILCTGESGAGK TENTKKVIQYLAYVASSHKSKKDQGELERQLLQANPILEAFGNAKTVKNDNSSRFGKFIRINFDVNGYIVGANIETYLLEKSRAIRQAKE ERTFHIFYYLLSGAGEHLKTDLLLEPYNKYRFLSNGHVTIPGQQDKDMFQETMEAMRIMGIPEEEQMGLLRVISGVLQLGNIVFKKERNT DQASMPDNTAAQKVSHLLGINVTDFTRGILTPRIKVGRDYVQKAQTKEQADFAIEALAKATYERMFRWLVLRINKALDKTKRQGASFIGI LDIAGFEIFDLNSFEQLCINYTNEKLQQLFNHTMFILEQEEYQREGIEWNFIDFGLDLQPCIDLIEKPAGPPGILALLDEECWFPKATDK SFVEKVMQEQGTHPKFQKPKQLKDKADFCIIHYAGKVDYKADEWLMKNMDPLNDNIATLLHQSSDKFVSELWKDVDRIIGLDQVAGMSET ALPGAFKTRKGMFRTVGQLYKEQLAKLMATLRNTNPNFVRCIIPNHEKKAGKLDPHLVLDQLRCNGVLEGIRICRQGFPNRVVFQEFRQR YEILTPNSIPKGFMDGKQACVLMIKALELDSNLYRIGQSKVFFRAGVLAHLEEERDLKITDVIIGFQACCRGYLARKAFAKRQQQLTAMK VLQRNCAAYLKLRNWQWWRLFTKVKPLLQVSRQEEEMMAKEEELVKVREKQLAAENRLTEMETLQSQLMAEKLQLQEQLQAETELCAEAE ELRARLTAKKQELEEICHDLEARVEEEEERCQHLQAEKKKMQQNIQELEEQLEEEESARQKLQLEKVTTEAKLKKLEEEQIILEDQNCKL AKEKKLLEDRIAEFTTNLTEEEEKSKSLAKLKNKHEAMITDLEERLRREEKQRQELEKTRRKLEGDSTDLSDQIAELQAQIAELKMQLAK KEEELQAALARVEEEAAQKNMALKKIRELESQISELQEDLESERASRNKAEKQKRDLGEELEALKTELEDTLDSTAAQQELRSKREQEVN ILKKTLEEEAKTHEAQIQEMRQKHSQAVEELAEQLEQTKRVKANLEKAKQTLENERGELANEVKVLLQGKGDSEHKRKKVEAQLQELQVK FNEGERVRTELADKVTKLQVELDNVTGLLSQSDSKSSKLTKDFSALESQLQDTQELLQEENRQKLSLSTKLKQVEDEKNSFREQLEEEEE AKHNLEKQIATLHAQVADMKKKMEDSVGCLETAEEVKRKLQKDLEGLSQRHEEKVAAYDKLEKTKTRLQQELDDLLVDLDHQRQSACNLE KKQKKFDQLLAEEKTISAKYAEERDRAEAEAREKETKALSLARALEEAMEQKAELERLNKQFRTEMEDLMSSKDDVGKSVHELEKSKRAL EQQVEEMKTQLEELEDELQATEDAKLRLEVNLQAMKAQFERDLQGRDEQSEEKKKQLVRQVREMEAELEDERKQRSMAVAARKKLEMDLK DLEAHIDSANKNRDEAIKQLRKLQAQMKDCMRELDDTRASREEILAQAKENEKKLKSMEAEMIQLQEELAAAERAKRQAQQERDELADEI ANSSGKGALALEEKRRLEARIAQLEEELEEEQGNTELINDRLKKANLQIDQINTDLNLERSHAQKNENARQQLERQNKELKVKLQEMEGT VKSKYKASITALEAKIAQLEEQLDNETKERQAACKQVRRTEKKLKDVLLQVDDERRNAEQYKDQPRQGTRGGGGRRAAGRRGRPRSGGAD -------------------------------------------------------------- |
Top |
Fusion Gene PPI Analysis for MYH9-USF2 |
Go to ChiPPI (Chimeric Protein-Protein interactions) to see the chimeric PPI interaction in |
Protein-protein interactors with each fusion partner protein in wild-type (BIOGRID-3.4.160) |
Hgene | Hgene's interactors | Tgene | Tgene's interactors |
- Retained PPIs in in-frame fusion. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Still interaction with |
- Lost PPIs in in-frame fusion. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with |
- Retained PPIs, but lost function due to frame-shift fusion. |
Partner | Gene | Hbp | Tbp | ENST | Strand | BPexon | TotalExon | Protein feature loci | *BPloci | TotalLen | Interaction lost with |
Top |
Related Drugs for MYH9-USF2 |
Drugs targeting genes involved in this fusion gene. (DrugBank Version 5.1.8 2021-05-08) |
Partner | Gene | UniProtAcc | DrugBank ID | Drug name | Drug activity | Drug type | Drug status |
Top |
Related Diseases for MYH9-USF2 |
Diseases associated with fusion partners. (DisGeNet 4.0) |
Partner | Gene | Disease ID | Disease name | # pubmeds | Source |
Hgene | MYH9 | C0340978 | May-Hegglin anomaly | 25 | CLINGEN;GENOMICS_ENGLAND;UNIPROT |
Hgene | MYH9 | C1854520 | SEBASTIAN SYNDROME | 14 | CLINGEN;CTD_human;GENOMICS_ENGLAND;ORPHANET |
Hgene | MYH9 | C0398641 | Epstein syndrome (disorder) | 11 | CLINGEN |
Hgene | MYH9 | C0403445 | Fechtner syndrome (disorder) | 11 | CLINGEN |
Hgene | MYH9 | C0477317 | Other primary thrombocytopenia | 11 | CLINGEN |
Hgene | MYH9 | C1842035 | Giant Platelet Syndrome with Thrombocytopenia | 11 | CLINGEN |
Hgene | MYH9 | C1863659 | DEAFNESS, AUTOSOMAL DOMINANT 17 | 6 | CTD_human;GENOMICS_ENGLAND;UNIPROT |
Hgene | MYH9 | C0022661 | Kidney Failure, Chronic | 2 | CTD_human |
Hgene | MYH9 | C0006142 | Malignant neoplasm of breast | 1 | CTD_human;UNIPROT |
Hgene | MYH9 | C0017668 | Focal glomerulosclerosis | 1 | CTD_human |
Hgene | MYH9 | C0018784 | Sensorineural Hearing Loss (disorder) | 1 | GENOMICS_ENGLAND |
Hgene | MYH9 | C0018965 | Hematuria | 1 | GENOMICS_ENGLAND |
Hgene | MYH9 | C0020544 | Renal hypertension | 1 | CTD_human |
Hgene | MYH9 | C0027626 | Neoplasm Invasiveness | 1 | CTD_human |
Hgene | MYH9 | C0027706 | Hereditary nephritis | 1 | CTD_human |
Hgene | MYH9 | C0033687 | Proteinuria | 1 | GENOMICS_ENGLAND |
Hgene | MYH9 | C0035078 | Kidney Failure | 1 | GENOMICS_ENGLAND |
Hgene | MYH9 | C0086432 | Hyalinosis, Segmental Glomerular | 1 | CTD_human |
Hgene | MYH9 | C0086543 | Cataract | 1 | GENOMICS_ENGLAND |
Hgene | MYH9 | C0206692 | Carcinoma, Lobular | 1 | CTD_human |
Hgene | MYH9 | C0410005 | Nodular fasciitis | 1 | ORPHANET |
Hgene | MYH9 | C0678222 | Breast Carcinoma | 1 | CTD_human |
Hgene | MYH9 | C1257931 | Mammary Neoplasms, Human | 1 | CTD_human |
Hgene | MYH9 | C1458155 | Mammary Neoplasms | 1 | CTD_human |
Hgene | MYH9 | C1567741 | Alport Syndrome | 1 | CTD_human |
Hgene | MYH9 | C1567742 | Alport Syndrome, X-Linked | 1 | CTD_human |
Hgene | MYH9 | C1567743 | Alport Syndrome, Autosomal Dominant | 1 | CTD_human |
Hgene | MYH9 | C1567744 | Alport Syndrome, Autosomal Recessive | 1 | CTD_human |
Hgene | MYH9 | C1834478 | MACROTHROMBOCYTOPENIA AND PROGRESSIVE SENSORINEURAL DEAFNESS | 1 | CTD_human |
Hgene | MYH9 | C2931861 | Hemorrhagic hereditary nephritis | 1 | CTD_human |
Hgene | MYH9 | C4280711 | Leukocyte inclusion bodies | 1 | GENOMICS_ENGLAND |
Hgene | MYH9 | C4704874 | Mammary Carcinoma, Human | 1 | CTD_human |